Summary
Biology development
Debian Med packages for development of micro-biology applications
This metapackage will install Debian packages which might be helpful
for development of applications for micro-biological research.
The list to the right includes various software projects which are of some interest to the Debian Med Project. Currently, only a few of them are available as Debian packages. It is our goal, however, to include all software in Debian Med which can sensibly add to a high quality Debian Pure Blend.
For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:
If you discover a project which looks like a good candidate for Debian Med
to you, or if you have prepared an unofficial Debian package, please do not hesitate to
send a description of that project to the Debian Med mailing list
Links to other tasks
|
Debian Med Biology development packages
Official Debian packages with high relevance
|
Bioperl
Perl tools for computational molecular biology
|
| Versions of package bioperl |
| Release | Version | Architectures |
| sid | 1.6.1-1 | all |
| squeeze | 1.6.1-1 | all |
| lenny | 1.5.2.102-3 | all |
| etch | 1.4-1 | all |
| Debtags of package bioperl: |
| devel | library, lang:perl |
| field | biology:bioinformatics, biology |
| role | shared-lib, devel-lib |
|
License: DFSG free
|
|
The Bioperl project is a coordinated effort to collect computational methods
routinely used in bioinformatics into a set of standard CPAN-style,
well-documented, and freely available Perl modules. It is well-accepted
throughout the community and used in many high-profile projects, e.g.,
Ensembl.
When publishing work prepared with this package, you should cite:
J.E. Stajich, D. Block, K. Boulez, S.E. Brenner, S.A. Chervitz,
C. Dagdigian, G. Fuellen, J.G. Gilbert, I. Korf, H. Lapp, H. Lehvaslaiho,
C. Matsalla, C.J. Mungall, B.I. Osborne, M.R. Pocock, P. Schattner,
M. Senger, L.D. Stein, E. Stupka, M.D. Wilkinson, E. Birney (2002)
"The Bioperl toolkit: Perl modules for the life sciences." Genome
Res. 12(10):1611-1618.
The package is enhanced by the following packages:
mcl
|
|
|
Bioperl-run
Wrapper modules for BioPerl
|
| Versions of package bioperl-run |
| Release | Version | Architectures |
| sid | 1.6.1-1 | all |
| squeeze | 1.6.1-1 | all |
|
License: DFSG free
|
|
Contains modules that provide a Perl interface to various
bioinformatics applications to allow them to be
used with common BioPerl objects.
Not all the wrappable applications are packaged in Debian. The ones
that are are "Suggested" by this package.
|
|
|
Biosquid
utilities for biological sequence analysis
|
| Versions of package biosquid |
| Release | Version | Architectures |
| lenny | 1.9g+cvs20050121-2 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| sid | 1.9g+cvs20050121-2 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 1.9g+cvs20050121-2 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 1.9g+cvs20050121-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package biosquid: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | editing, converting, comparing |
|
License: DFSG free
|
|
SQUID is a library of C code functions for sequence analysis. It also
includes a number of small utility programs to convert, show statistics,
manipulate and do other functions on sequence files.
The original name of the package is "squid", but since there is already
a squid on the archive (a proxy cache), it was renamed to "biosquid".
|
|
|
Libace-perl
Object-Oriented Access to ACEDB Databases
|
| Versions of package libace-perl |
| Release | Version | Architectures |
| sid | 1.92-2+b1 | ia64,armel |
| squeeze | 1.92-2+b1 | ia64,armel |
| sid | 1.92-2 | s390,powerpc,mips,sparc,mipsel,hppa,amd64,i386,alpha |
| squeeze | 1.92-2 | sparc,amd64,hppa,i386,mips,mipsel,powerpc,s390 |
|
License: DFSG free
|
|
AcePerl is an object-oriented Perl interface for the AceDB
database. It provides functionality for connecting to remote AceDB
databases, performing queries, fetching ACE objects, and updating
databases. The programmer's API is compatible with the JADE Java API,
and interoperable with the API used by BoulderIO.
AceDB is a genome database system developed since 1989 primarily by
Jean Thierry-Mieg (CNRS, Montpellier) and Richard Durbin (Sanger
Institute). It was originally developed for the C.elegans genome
project , from which its name was derived (A C. elegans DataBase).
|
|
|
Libajax5-dev
Development files for libajax
|
| Versions of package libajax5-dev |
| Release | Version | Architectures |
| lenny | 5.0.0-7 | alpha,amd64,arm,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390,sparc |
| Debtags of package libajax5-dev: |
| field | biology:bioinformatics, biology |
| role | devel-lib |
|
License: DFSG free
|
|
This package contains the header and development files needed to build
programs and packages using libajax.
|
|
|
Libbio-graphics-perl
Generate GD images of Bio::Seq objects
|
| Versions of package libbio-graphics-perl |
| Release | Version | Architectures |
| sid | 1.982-1 | all |
| squeeze | 1.982-1 | all |
|
License: DFSG free
|
|
The Bio::Graphics::Panel class provides drawing and formatting
services for any object that implements the Bio::SeqFeatureI
interface, including Ace::Sequence::Feature, Das::Segment::Feature and
Bio::DB::Graphics objects. It can be used to draw sequence
annotations, physical (contig) maps, protein domains, or any other
type of map in which a set of discrete ranges need to be laid out on
the number line.
|
|
|
Libbio-mage-perl
Container module for classes in the MAGE package: MAGE
|
| Versions of package libbio-mage-perl |
| Release | Version | Architectures |
| lenny | 20030502.3-1 | all |
| sid | 20030502.3-1 | all |
| squeeze | 20030502.3-1 | all |
| Debtags of package libbio-mage-perl: |
| field | biology:bioinformatics, biology |
| role | shared-lib, devel-lib |
|
License: DFSG free
|
|
MAGE-TAB (MicroArray Gene Expression Tabular) format is a standard from the
Microarray Gene Expression Data Society (MGED). This package contains Perl
modules in the Bio::MAGE hierarchy to manipulate MIAME-compliant (Minimum
Information About a Microarray Experiment) records of microarray ("DNA chips")
experiments.
The Bio::MAGE module contains the following Bio::MAGE classes:
* NameValueType
* Extendable
* Identifiable
* Describable
|
|
|
Libbio-primerdesigner-perl
Perl module to design PCR primers using primer3 and epcr
|
| Versions of package libbio-primerdesigner-perl |
| Release | Version | Architectures |
| sid | 0.07-1 | all |
| squeeze | 0.07-1 | all |
|
License: DFSG free
|
|
Bio::PrimerDesigner provides a low-level interface to the primer3 and epcr
binary executables and supplies methods to return the results. In addition to
accessing local installations of primer3 or e-PCR, it also offers the ability to
accessing the primer3 binary via a remote server.
|
|
|
Libbio-ruby
bioruby tools for computational molecular biology
|
| Versions of package libbio-ruby |
| Release | Version | Architectures |
| sid | 1.3.1-1 | all |
| squeeze | 1.3.1-1 | all |
| lenny | 1.1.0-1 | all |
| etch | 1.0.0-1 | all |
| Debtags of package libbio-ruby: |
| devel | library, lang:ruby |
| field | biology |
| role | metapackage |
|
License: DFSG free
|
|
BioRuby project aims to implement an integrated environment for
Bioinformatics with Ruby language. Design philosophy of the BioRuby library
is KISS (keep it simple and stupid) to maximize the usability and the
efficiency for biologists as a daily tool. The project was started in Japan
and supported by University of Tokyo (Human Genome Center), Kyoto University
(Bioinformatics Center) and the Open Bio Foundation.
This project is a dependency package, which depends on Debian's default Ruby
version (currently 1.8.x)
|
|
|
Libbiojava-java
Java API to biological data and applications
|
| Versions of package libbiojava-java |
| Release | Version | Architectures |
| sid | 1.7.svn.20090419-3 | all |
| squeeze | 1.7.svn.20090419-3 | all |
|
License: DFSG free
|
|
BioJava is an open-source project dedicated to providing a Java framework
for processing biological data. It includes objects for manipulating
sequences, file parsers, DAS client and server support, access to BioSQL
and Ensembl databases, and powerful analysis and statistical routines
including a dynamic programming toolkit.
BioJava is provided by a vibrant community which meets annually at
the Bioinformatics Open Source Conference (BOSC) that traditionally
accompanies the Intelligent Systems in Molecular Biology (ISMB)
meeting. Much like BioPerl, the employment of this library is valuable
for everybody active in the field because of the many tricks of the
trade one learns just by communicating on the mailing list.
If using BioJava, please cite: R.C.G. Holland, T. Down, M. Pocock,
A. Prlić, D. Huen, K. James, S. Foisy, A. Dräger, A. Yates,
M. Heuer; M.J. Schreiber (2008) "BioJava: an Open-Source Framework
for Bioinformatics" Bioinformatics 24(18):2096-2097.
This is a wrapper package which should enable smooth upgrades to new
versions.
Please cite:
R.C.G. Holland, T. Down, M. Pocock, A. Prlić, D. Huen, K. James, S. Foisy, A. Dräger, A. Yates, M. Heuer, M.J. Schreiber:
BioJava: an Open-Source Framework for Bioinformatics.
(DOI)
Bioinformatics 24(18):2096-2097
(2008)
|
|
|
Libgenome-1.3-0-dev
toolkit for developing bioinformatic related software
|
| Versions of package libgenome-1.3-0-dev |
| Release | Version | Architectures |
| sid | 1.3.1-2 | ia64,alpha,amd64,armel,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390,sparc |
| squeeze | 1.3.1-2 | mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390 |
| lenny | 1.3.1-1 | arm,i386,armel,mipsel,ia64,alpha,mips,amd64,s390,hppa,sparc,powerpc |
| Debtags of package libgenome-1.3-0-dev: |
| devel | lang:c++ |
| field | biology:bioinformatics, biology |
| role | devel-lib |
|
License: DFSG free
|
|
libGenome is a freely available toolkit for developing bioinformatic related
software in C++. It is intended to take the hassle out of performing common
tasks on genetic sequence and annotation data.
Among other things, libGenome can help you:
-
Read and write Multi-FastA format files
-
Read and write GenBank flat file database entries
-
Append, chop, truncate, reverse, complement, translate, and otherwise
mangle sequence data
-
Access annotation in GenBank flat files
This is the development package containing the statically linked
library and the header files.
|
|
|
Libgo-perl
perl modules for GO and other OBO ontologies
|
| Versions of package libgo-perl |
| Release | Version | Architectures |
| sid | 0.10-1 | all |
| squeeze | 0.10-1 | all |
| lenny | 0.09-1 | all |
| Debtags of package libgo-perl: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | shared-lib, program, devel-lib |
| scope | utility |
| use | converting, analysing |
| works-with-format | xml, plaintext |
|
License: DFSG free
|
|
This is a collection of perl code for dealing with Gene Ontologies (GO) and
Open Biomedical Ontologies (OBO) style ontologies. It is part of the ‘go-dev’
distribution, but this Debian package is made from the CPAN archive. This
package contains both scripts (which can be used with no knowledge of perl),
and libraries which will be of use to perl programmers using GO or OBO.
|
|
|
Libncbi6-dev
NCBI libraries for biology applications (development files)
|
| Versions of package libncbi6-dev |
| Release | Version | Architectures |
| sid | 6.1.20090809-1+b1 | ia64,armel |
| squeeze | 6.1.20090809-1+b1 | armel,ia64 |
| sid | 6.1.20090809-1 | s390,alpha,amd64,hppa,hurd-i386,i386,mips,mipsel,powerpc,sparc |
| squeeze | 6.1.20090809-1 | mips,mipsel,sparc,powerpc,i386,hppa,s390,amd64 |
| lenny | 6.1.20080302-4 | powerpc,ia64,mips,sparc,mipsel,arm,armel,hppa,s390,amd64,i386,alpha |
| etch | 6.1.20061015-1 | hppa,mipsel,amd64,arm,mips,i386,powerpc,sparc,s390,ia64,alpha |
| Debtags of package libncbi6-dev: |
| devel | library |
| field | biology |
| role | devel-lib |
| use | searching, converting |
|
License: DFSG free
|
|
This package is for developers or people who want to compile programs
only. In addition to development headers and libraries, it contains the
utilities asntool and errhdr.
|
|
|
Libnucleus5-dev
Development files for libajax
|
| Versions of package libnucleus5-dev |
| Release | Version | Architectures |
| lenny | 5.0.0-7 | alpha,amd64,arm,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390,sparc |
| Debtags of package libnucleus5-dev: |
| field | biology:bioinformatics, biology |
| role | devel-lib |
|
License: DFSG free
|
|
This package contains the header and development files needed to build
programs and packages using libnucleus.
|
|
|
Libqsearch-dev
nondeterministic quartet tree search library for unrooted trees
|
| Versions of package libqsearch-dev |
| Release | Version | Architectures |
| sid | 1.0.8-3 | ia64,alpha,amd64,armel,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390,sparc |
| squeeze | 1.0.8-3 | mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390 |
| lenny | 1.0.8-1 | arm,i386,armel,mipsel,ia64,alpha,mips,amd64,s390,hppa,sparc,powerpc |
| Debtags of package libqsearch-dev: |
| devel | library |
|
License: DFSG free
|
|
qsearch is a library for universal hierarchical clustering using an arbitrary
distance matrix as input. It searches through the space of all possible
unrooted trees of a given size and finds the closest match based on a
weighted quartet cost function determined by the distance matrix. When
used in combination with other feature extraction libraries such as
libcomplearn this system can be used for fast and accurate phylogenetic
reconstruction, linguistic analysis, or stemmatology.
|
|
|
Libsrf-dev
C++ implementation of the SRF format for DNA sequence data
|
| Versions of package libsrf-dev |
| Release | Version | Architectures |
| sid | 0.1+dfsg-1 | amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390,sparc |
| squeeze | 0.1+dfsg-1 | powerpc,amd64,mips,armel,sparc,hppa,mipsel,i386,s390,ia64 |
|
License: DFSG free
|
|
SRF (sort for Sequence Read Format) is a generic format capable of storing
data generated by any DNA sequencing technology. This library is an
implementation of SRF and provides basic input-output functions.
|
|
|
Libstaden-read-dev
development files for libstaden-read
|
| Versions of package libstaden-read-dev |
| Release | Version | Architectures |
| sid | 1.12.1-1 | amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390,sparc |
| squeeze | 1.12.1-1 | powerpc,amd64,mips,armel,sparc,hppa,mipsel,i386,s390,ia64 |
|
License: DFSG free
|
|
This package contains the header and development files needed to build
programs and packages using the Staden io_lib.
The io_lib from the Staden package is a library of file reading and writing
code to provide a general purpose trace file (and Experiment File) reading
interface. It has been compiled and tested on a variety of unix systems,
MacOS X and MS Windows.
|
|
|
Libtfbs-perl
scanning DNA sequence with a position weight matrix
|
| Versions of package libtfbs-perl |
| Release | Version | Architectures |
| sid | 0.5.svn.20080722-2 | mips,armel,mipsel,i386,powerpc,amd64,s390,ia64,sparc,alpha,hppa |
| squeeze | 0.5.svn.20080722-2 | sparc,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390 |
| Debtags of package libtfbs-perl: |
| field | biology:bioinformatics, biology |
| role | shared-lib, devel-lib |
|
License: DFSG free
|
|
The TFBS perl modules comprise a set of routines to interact with the
Transfac and Jaspar databases that describe a special family of proteins,
the transcription factors. These bind to genomic DNA to initiate (or
prevent) the readout of a gene. Once multiple binding sites are known
for a transcription factor, these are gathered in a single file and are
aligned in order to find position-specific characteristica that might
be used to predict such binding events in novel DNA sequences.
If you use TFBS in your work, please cite "Lenhard B., Wasserman W.W. (2002)
TFBS: Computational framework for transcription factor binding site analysis.
Bioinformatics 18:1135-1136".
|
|
|
Libvibrant6-dev
NCBI libraries for graphic biology applications (development files)
|
| Versions of package libvibrant6-dev |
| Release | Version | Architectures |
| sid | 6.1.20090809-1+b1 | ia64,armel |
| squeeze | 6.1.20090809-1+b1 | armel,ia64 |
| sid | 6.1.20090809-1 | s390,alpha,amd64,hppa,hurd-i386,i386,mips,mipsel,powerpc,sparc |
| squeeze | 6.1.20090809-1 | mips,mipsel,sparc,powerpc,i386,hppa,s390,amd64 |
| lenny | 6.1.20080302-4 | powerpc,ia64,mips,sparc,mipsel,arm,armel,hppa,s390,amd64,i386,alpha |
| etch | 6.1.20061015-1 | hppa,mipsel,amd64,arm,mips,i386,powerpc,sparc,s390,ia64,alpha |
| Debtags of package libvibrant6-dev: |
| devel | library, lang:c |
| field | biology |
| interface | x11 |
| role | devel-lib |
| uitoolkit | motif |
| use | viewing |
|
License: DFSG free
|
|
Vibrant allows you to develop portable (Motif, MS-Windows, Mac-OS) graphic
biological applications.
|
|
|
Mcl
the Markov Cluster algorithm
|
| Versions of package mcl |
| Release | Version | Architectures |
| sid | 09-261-1 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 09-261-1 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 06-058-2 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| etch | 06-021-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| upstream | 09-308 |
| Debtags of package mcl: |
| field | mathematics |
| role | program |
|
License: DFSG free
|
|
The MCL package is an implementation of the MCL algorithm, and offers
utilities for manipulating sparse matrices (the essential data
structure in the MCL algorithm) and conducting cluster experiments.
MCL is currently being used in sciences like biology (protein family
detection, genomics), computer science (node clustering in
Peer-to-Peer networks), and linguistics (text analysis).
The package is enhanced by the following packages:
zoem
|
|
|
Octave-bioinfo
bioinformatic functions for Octave
|
| Versions of package octave-bioinfo |
| Release | Version | Architectures |
| sid | 0.1.2-1 | all |
| squeeze | 0.1.2-1 | all |
| lenny | 0.1.0-1 | sparc,powerpc,s390,alpha,amd64,arm,armel,hppa,i386,ia64,mips,mipsel |
|
License: DFSG free
|
|
This package provides functions to do work in bioinformatics in
Octave, a numerical computation software. It contains functions to
convert amino acid characters into integers (aa2int), convert between
amino acid representations (aminolookup), cleave peptides (cleave),
convert integers into amino acid characters (int2aa) and reverse a
nucleotide sequence (seqreverse).
This Octave add-on package is part of the Octave-Forge project.
|
|
|
Python-biopython
Python library for bioinformatics
|
| Versions of package python-biopython |
| Release | Version | Architectures |
| sid | 1.52-1 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 1.52-1 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 1.45-3 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| etch | 1.42-2 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package python-biopython: |
| devel | library, lang:python |
| field | biology |
| role | shared-lib, devel-lib |
|
License: DFSG free
|
|
The Biopython Project is an international association
of developers of freely available Python tools for
computational molecular biology.
It is a distributed collaborative effort to develop Python
libraries and applications which address the needs of
current and future work in bioinformatics. The source code
is made available under the Biopython License, which is
extremely liberal and compatible with almost every license
in the world. We work along with the Open Bioinformatics
Foundation, who generously provide web and CVS space for
the project.
Please cite:
P.J. Cock, T. Antao, J.T. Chang, B.A. Chapman, C.J. Cox, A. Dalke, I. Friedberg, T. Hamelryck, F. Kauff, B. Wilczynski, M.J. de Hoon:
Biopython: freely available Python tools for computational molecular biology and bioinformatics.
Bioinformatics, 25(11):1422-1423
(2009)
|
|
|
R-cran-genetics
GNU R package for population genetics
|
| Versions of package r-cran-genetics |
| Release | Version | Architectures |
| sid | 1.3.4-1 | all |
| squeeze | 1.3.4-0.1 | all |
|
License: DFSG free
|
|
Classes and methods for handling genetic data. Includes
The package provides a library for the statistics environment R that
contains classes to represent genotypes and haplotypes at single markers up
to multiple markers on multiple chromosomes. Function include
allele frequencies, flagging homo/heterozygotes, flagging carriers
of certain alleles, estimating and testing for Hardy-Weinberg
disequilibrium, estimating and testing for linkage disequilibrium,
and more.
NOTE: THIS PACKAGE IS NOW OBSOLETE.
The R-Genetics project has developed an set of enhanced genetics
packages to replace 'genetics'. Please visit the project homepage
at http://rgenetics.org for information.
|
|
|
R-cran-haplo.stats
GNU R package for haplotype analysis
|
| Versions of package r-cran-haplo.stats |
| Release | Version | Architectures |
| sid | 1.4.4-1 | mips,armel,mipsel,hurd-i386,powerpc,i386,s390,amd64,sparc,alpha,ia64,hppa |
| squeeze | 1.4.4-1 | sparc,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390 |
|
License: DFSG free
|
|
The package provides routines for the GNU R statistics environment
for statistical Analysis of indirectly measured Haplotypes with Traits
and Covariates when Linkage Phase is Ambiguous. The statistical methods
assume that all subjects are unrelated and that haplotypes are ambiguous
(due to unknown linkage phase of the genetic markers). The main functions
are: haplo.em, haplo.glm, haplo.score, haplo.power, and seqhap.
|
|
|
Seqan-dev
A C++ template library for the analysis of sequences
|
| Versions of package seqan-dev |
| Release | Version | Architectures |
| sid | 1.2-1 | all |
| squeeze | 1.2-1 | all |
|
License: DFSG free
|
|
SeqAn is a C++ template library of efficient algorithms and data
structures for the analysis of sequences with the focus on
biological data. This library applies a unique generic design that
guarantees high performance, generality, extensibility, and
integration with other libraries. SeqAn is easy to use and
simplifies the development of new software tools with a minimal loss
of performance. This package contains the developer files, documentation
and examples.
|
|
Official Debian packages with lower relevance
|
R-cran-rocr
GNU R package to prepare and display ROC curves
|
| Versions of package r-cran-rocr |
| Release | Version | Architectures |
| sid | 1.0-3-2 | all |
| squeeze | 1.0-2-1.1 | all |
| Debtags of package r-cran-rocr: |
| field | statistics |
| role | shared-lib |
| use | viewing, analysing |
|
License: DFSG free
|
|
ROC graphs, sensitivity/specificity curves, lift charts,
and precision/recall plots are popular examples of trade-off
visualizations for specific pairs of performance measures. ROCR is a
flexible tool for creating cutoff-parametrized 2D performance curves
by freely combining two from over 25 performance measures (new
performance measures can be added using a standard interface).
Curves from different cross-validation or bootstrapping runs can be
averaged by different methods, and standard deviations, standard
errors or box plots can be used to visualize the variability across
the runs. The parametrization can be visualized by printing cutoff
values at the corresponding curve positions, or by coloring the
curve according to cutoff. All components of a performance plot can
be quickly adjusted using a flexible parameter dispatching
mechanism. Despite its flexibility, ROCR is easy to use, with only
three commands and reasonable default values for all optional
parameters.
ROCR features: ROC curves, precision/recall plots, lift charts, cost
curves, custom curves by freely selecting one performance measure for the
x axis and one for the y axis, handling of data from cross-validation
or bootstrapping, curve averaging (vertically, horizontally, or by
threshold), standard error bars, box plots, curves that are color-coded
by cutoff, printing threshold values on the curve, tight integration
with Rs plotting facilities (making it easy to adjust plots or to combine
multiple plots), fully customizable, easy to use (only 3 commands).
Performance measures that ROCR knows: Accuracy, error rate, true
positive rate, false positive rate, true negative rate, false negative
rate, sensitivity, specificity, recall, positive predictive value,
negative predictive value, precision, fallout, miss, phi correlation
coefficient, Matthews correlation coefficient, mutual information, chi
square statistic, odds ratio, lift value, precision/recall F measure,
ROC convex hull, area under the ROC curve, precision/recall break-even
point, calibration error, mean cross-entropy, root mean squared error,
SAR measure, expected cost, explicit cost.
|
|
Packaging has started and developers might try the packaging code in VCS
|
Libbio-das-perl
interface to Distributed Annotation System (DAS)
|
License: Perl Artistic License
Debian package not available
|
|
Bio::Das provides access to genome sequencing and annotation databases
that export their data in Distributed Annotation System (DAS) format
version 1.5. Both unencrypted (http:) and SSL-encrypted (https:)
DAS servers are supported.
|
|
Libsbml1
System Biology Markup Language Library
|
License: LGPL
|
|
LibSBML is a library designed to help you read, write, manipulate,
translate, and validate SBML files and data streams. It is not an
application itself (though it does come with many example programs),
but rather a library you can embed in your own applications.
LibSBML is written in ISO C and C++ but as a library it may be
used from all the programming languages listed to the left. In fact,
we strive to adhere to the natural idioms of each particular language
to make the libSBML programming experience seamless. For example, SBML
elements behave like lists and sequences in Python, but
vectors in Matlab. Also, the C and C++ interfaces are completely
distinct (it's possible to program in pure C), but in C++ the C APIs
may be called without sacrifcing type safety.
The LibSBML code is very portable and is supported on Linux, Windows
(native), and Mac OS X.
|
Unofficial packages built by somebody else
|
Bioconductor
analysis and comprehension of genomic data
|
License: GPL/LGPL
|
|
Bioconductor is an open source and open development software project
to provide tools for the analysis and comprehension of genomic data.
The Bioconductor core team is based primarily at the Biostatistics Unit of
the Dana Farber Cancer Institute at the Harvard Medical School/Harvard School
of Public Health. Other members come from various US and international
institutions.
The broad goals of the projects are to
-
provide access to a wide range of powerful statistical and graphical methods
for the analysis of genomic data;
-
facilitate the integration of biological metadata in the analysis of
experimental data: e.g. literature data from PubMed, annotation data from
LocusLink;
-
allow the rapid development of extensible, scalable, and interoperable
software;
-
promote high-quality documentation and reproducible research;
-
provide training in computational and statistical methods for the analysis
of genomic data.
A nice overview about all modules of BioDonductor is given at
http://www.bioconductor.org/packages/release/bioc/
|
Libbio-mage-utils-perl
- wnpp
Extra modules for classes in the MAGE package: MAGE
|
License: MIT/X
|
|
MAGE-TAB (MicroArray Gene Expression Tabular) format is a standard from the
Microarray Gene Expression Data Society (MGED). This package contains Perl
modules in the Bio::MAGE hierarchy to manipulate MIAME-compliant (Minimum
Information About a Microarray Experiment) records of microarray ("DNA chips")
experiments.
Bio-MAGE-Utils contains extra modules for handling MAGE XML and MGED ontology,
as well as SQL utilities.
|
|
Libmems-1.5-1
library to support DNA string matching and comparative genomics
|
License: GPL
|
|
libMems is a freely available software development library to support DNA
string matching and comparative genomics. Among other things, libMems
implements an algorithm to perform approximate multi-MUM and multi-MEM
identification. The algorithm uses spaced seed patterns in conjunction
with a seed-and-extend style hashing method to identify matches. The method
is efficient, requiring a maximum of only 16 bytes per base of the largest
input sequence, and this data can be stored externally (i.e. on disk) to
further reduce memory requirements.
|
No known packages available but some record of interest (WNPP bug)
Perl bindings for GFF Annotation Protocol
|
License: GPL-1+
Debian package not available
|
|
GFF (Gene Finding Feature) is a format for describing genes and other
features associated with DNA, RNA and Protein sequences.
This package provides a Perl module to use GFF objects.
|
library for linear models and differential gene expression analysis
|
License: LGPL
Debian package not available
|
|
Limma is an essential package of the bioconductor.org effort to
investigate gene expression data from microarrays with the
statistical package R.
|
No known packages available
|
Bioclipse
platform for chemo- and bioinformatics based on Eclipse
|
License: Eclipse Public License (EPL) + exception
Debian package not available
|
|
The Bioclipse project is aimed at creating a Java-based, open source,
visual platform for chemo- and bioinformatics based on the Eclipse
Rich Client Platform (RCP). Bioclipse, as any RCP application, is based
on a plugin architecture that inherits basic functionality and visual
interfaces from Eclipse, such as help system, software updates,
preferences, cross-platform deployment etc.
Bioclipse will provide functionality for chemo- and bioinformatics,
and extension points that easily can be extended by plugins to provide
added functionality. The first version of Bioclipse includes a
CDK-plugin (bc_cdk) to provide a chemoinformatic backend, a Jmol-plugin
(bc_jmol) for 3D-visualization and a general logging plugin. To stay
updated on upcoming features, releases, new plugins etc, please register
for the mailing list bioclipse-announce. The development is best
followed on the Bioclipse Wiki where we document the progress and
ideas of the development on a daily basis.
|
|