Debian Med Project
Help us to see Debian used by medical practicioners and researchers! Join us on the Alioth page.
Summary
Biology
Debian Med micro-biology packages

This metapackage will install Debian packages related to molecular biology, structural biology and bioinformatics for use in life sciences.

The list to the right includes various software projects which are of some interest to the Debian Med Project. Currently, only a few of them are available as Debian packages. It is our goal, however, to include all software in Debian Med which can sensibly add to a high quality Debian Pure Blend.

For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:

If you discover a project which looks like a good candidate for Debian Med to you, or if you have prepared an unofficial Debian package, please do not hesitate to send a description of that project to the Debian Med mailing list

Links to other tasks

Debian Med Biology packages

Official Debian packages with high relevance

Adun.app
Molecular Simulator for GNUstep
Versions of package adun.app
ReleaseVersionArchitectures
sid0.81-1sparc,powerpc,hurd-i386,hppa,armel,s390,amd64,alpha,ia64,i386
sid0.8.2-1+b2mips,mipsel
squeeze0.8.2-1+b2s390,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc
lenny0.8.2-1+b1armel,i386,amd64,hppa,powerpc,mips,arm,sparc,s390,alpha,mipsel,ia64
etch0.6-3powerpc,sparc,mipsel,arm,alpha,hppa,amd64,s390,i386,ia64
Debtags of package adun.app:
fieldbiology:structural, biology
interfacex11
roleprogram
scopeapplication
suitegnustep
uitoolkitgnustep
useviewing, organizing, analysing
works-withdb, 3dmodel
x11application
Popcon: 29 users (29 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Adun is a biomolecular simulator that also includes data management and analysis capabilities. It was developed at the Computational Biophysics and Biochemistry Laboratory, a part of the Research Unit on Biomedical Informatics of the UPF.

Please cite: M.A. Johnston, I.F. Galván, J. Villà-Freixa: Framework Based Design of a New All-Purpose Molecular Simulation Application: The Adun Simulator. (DOI) J. Comp. Chem (2005)
Screenshots of package adun.app
Alien-hunter
Interpolated Variable Order Motifs to identify horizontally acquired DNA
Versions of package alien-hunter
ReleaseVersionArchitectures
sid1.7-1all
squeeze1.7-1all
Popcon: 1 users (3 upd.)*
Versions and Archs
License: DFSG free
Svn

Alien_hunter is an application for the prediction of putative Horizontal Gene Transfer (HGT) events with the implementation of Interpolated Variable Order Motifs (IVOMs). An IVOM approach exploits compositional biases using variable order motif distributions and captures more reliably the local composition of a sequence compared to fixed-order methods. Optionally the predictions can be parsed into a 2-state 2nd order Hidden Markov Model (HMM), in a change-point detection framework, to optimize the localization of the boundaries of the predicted regions. The predictions (embl format) can be automatically loaded into Artemis genome viewer freely available at: http://www.sanger.ac.uk/Software/Artemis/.

The manuscript describing the alien_hunter algorithm is available from Bioinformatics: Interpolated variable order motifs for identification of horizontally acquired DNA: revisiting the Salmonella pathogenicity islands. Vernikos GS, Parkhill J Bioinformatics. 2006;. PMID: 16837528

Altree
program to perform phylogeny based analyses
Versions of package altree
ReleaseVersionArchitectures
lenny1.0.1-3arm,i386,armel,mipsel,ia64,alpha,mips,amd64,s390,hppa,sparc,powerpc
sid1.0.1-3ia64,alpha,amd64,armel,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390,sparc
squeeze1.0.1-3mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390
upstream1.1.0
Debtags of package altree:
fieldbiology:bioinformatics, biology
interfacecommandline
roleshared-lib, program
scopeutility
usecomparing, analysing
works-with-formatplaintext
Popcon: 11 users (7 upd.)*
Newer upstream!
Debtags
License: DFSG free
Svn

ALTree was designed to perform phylogeny based analysis: first, it allows the detection of an association between a candidate gene and a disease, and second, it enables to make hypothesis about the susceptibility loci.

Remark of Debian Med team: altree 1.1.0 should be not be packaged

According to Vincent Danjean vdanjean.ml@free.fr version 1.1.0 should not be packaged for two reasons:

 1. New dependencies (libtamuanova-perl, nanova and libnanova-perl) which
    need to be packaged.
 2. There are still bugs in the new method added in altree 1.1.0 and the doc
    is not updated.

See http://lists.debian.org/debian-med/2009/08/msg00104.html for further details.

Amap-align
Protein multiple alignment by sequence annealing
Versions of package amap-align
ReleaseVersionArchitectures
lenny2.2-1i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
sid2.2-1s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze2.2-1sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
etch2.0-1arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package amap-align:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usespecial:todo, comparing, analysing
works-withspecial:todo
works-with-formatplaintext
Popcon: 12 users (8 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

AMAP is a command line tool to perform multiple alignment of peptidic sequences. It utilizes posterior decoding, and a sequence-annealing alignment, instead of the traditional progressive alignment method. It is the only alignment program that allows to control the sensitivity / specificity tradeoff. It is based on the ProbCons source code, but uses alignment metric accuracy and eliminates the consistency transformation.

The java visualisation tool of AMAP 2.2 is not yet packaged in Debian.

Autodock
analysis of ligand binding to protein structure
Versions of package autodock
ReleaseVersionArchitectures
sid4.2.2.1-2hppa,s390,mipsel,armel,alpha,powerpc,amd64,i386,ia64,hurd-i386,sparc,mips
squeeze4.2.1+rc1-2ia64,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc
lenny4.0.1-3mips,i386,s390,alpha,ia64,arm,armel,amd64,powerpc,hppa,sparc,mipsel
Debtags of package autodock:
fieldbiology:structural, biology
interfacecommandline
roleprogram
scopeutility
useanalysing
works-with3dmodel
Popcon: 21 users (25 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

AutoDock is a prime representative of the programs addressing the simulation of the docking of fairly small chemical ligands to rather big protein receptors. Earlier versions had all flexibility in the ligands while the protein was kept rather ridgid. This latest version 4 also allows for a flexibility of selected sidechains of surface residues, i.e., takes the rotamers into account.

The AutoDock program performs the docking of the ligand to a set of grids describing the target protein. AutoGrid pre-calculates these grids.

The package is enhanced by the following packages: autogrid
Please register by following this link if you are using autodock.
Please cite: G.M. Morris, R. Huey, W. Lindstrom, M.F. Sanner, R.K. Belew, D.S. Goodsell, A.J. Olson: AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J. Comput. Chem. (2009)
Screenshots of package autodock
Ballview
A free molecular modeling and molecular graphics tool
Maintainer: Andreas Hildebrandt
Versions of package ballview
ReleaseVersionArchitectures
sid1.3.0-1i386,amd64,alpha,s390
squeeze1.3.0-1powerpc,amd64,s390,i386,ia64
squeeze-proposed-updates1.3.0-1sparc,ia64,powerpc
Popcon: 2 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

BALLView provides fast OpenGL-based visualization of molecular structures, molecular mechanics methods (minimization, MD simulation using the AMBER, CHARMM, and MMFF94 force fields), calculation and visualization of electrostatic properties (FDPB) and molecular editing features.

BALLView can be considered a graphical user interface on the basis of BALL (Biochemical Algorithms Library), having taken the most common demands of protein chemists and biophysicists in particular taken into account. which is currently being developed in the groups of Hans-Peter Lenhof (Saarland University, Saarbruecken, Germany) and Oliver Kohlbacher (University of Tuebingen, Germany). BALL is an application framework in C++ that has been specifically designed for rapid software development in Molecular Modeling and Computational Molecular Biology. It provides an extensive set of data structures as well as classes for Molecular Mechanics, advanced solvation methods, comparison and analysis of protein structures, file import/export, and visualization.

Please cite: A. Moll, A. Hildebrandt, H.P.Lenhof, O. Kohlbacher: BALLView: a tool for research and education in molecular modeling.. Bioinformatics, 22(3):365-6 (2006)
Blast2
Basic Local Alignment Search Tool
Maintainer: Aaron M. Ucko
Versions of package blast2
ReleaseVersionArchitectures
sid2.2.21.20090809-1+b1ia64,armel
squeeze2.2.21.20090809-1+b1armel,ia64
sid2.2.21.20090809-1s390,alpha,amd64,hppa,hurd-i386,i386,mips,mipsel,powerpc,sparc
squeeze2.2.21.20090809-1mips,mipsel,sparc,powerpc,i386,hppa,s390,amd64
lenny2.2.18.20080302-4powerpc,ia64,mips,sparc,mipsel,arm,armel,hppa,s390,amd64,i386,alpha
etch2.2.15.20061015-1hppa,mipsel,amd64,arm,mips,i386,powerpc,sparc,s390,ia64,alpha
Debtags of package blast2:
fieldbiology
interfacecommandline
roleprogram
scopeutility
usesearching
Popcon: 60 users (19 upd.)*
Versions and Archs
Debtags
License: DFSG free

The famous sequence alignment program. This is "official" NCBI version, #2. The blastall executable allows you to give a nucleotide or protein sequence to the program. It is compared against databases and a summary of matches is returned to the user.

Note that databases are not included in Debian; they must be retrieved manually.

The package is enhanced by the following packages: mcl
Please cite: S.F. Altschul, W. Gish, W. Miller, E.W. Myers, D.J. Lipman: Basic local alignment search tool. J Mol Biol. 215(3):403-410 (1990)
Boxshade
Pretty-printing of multiple sequence alignments
Versions of package boxshade
ReleaseVersionArchitectures
sid3.3.1-4s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze3.3.1-4sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
lenny3.3.1-3i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
etch3.1.1-1arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package boxshade:
biologypeptidic, nuceleic-acids, format:aln
develprettyprint
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usetypesetting
works-with-formattex, special:todo, postscript, plaintext, html
Popcon: 13 users (10 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Boxshade is a program for creating good looking printouts from multiple-aligned protein or DNA sequences. The program does not perform the alignment by itself and requires as input a file that was created by a multiple alignment program or manually edited with respective tools.

Boxshade reads multiple-aligned sequences from either PILEUP-MSF, CLUSTAL-ALN, MALIGNED-data and ESEE-save files (limited to a maximum of 150 sequences with up to 10000 elements each). Various kinds of shading can be applied to identical/similar residues. Output is written to screen or to a file in the following formats: ANSI/VT100, PS/EPS, RTF, HPGL, ReGIS, LJ250-printer, ASCII, xFIG, PICT, HTML

Bwa
Burrows-Wheeler Aligner
Versions of package bwa
ReleaseVersionArchitectures
sid0.5.4-1alpha,amd64,armel,hurd-i386,i386,ia64,mips,mipsel,powerpc,s390,sparc
squeeze0.5.4-1amd64,mips,armel,sparc,mipsel,i386,s390,ia64,powerpc
Popcon: 0 users (3 upd.)*
Versions and Archs
License: DFSG free
Svn

Burrows-Wheeler Aligner (BWA) is a program that aligns relatively short nucleotide sequences against a long reference sequence such as the human genome. It implements two algorithms, bwa-short and BWA-SW. The former works for query sequences shorter than 200 bp and the latter for longer sequences up to around 100 kbp. Both algorithms do gapped alignment. They are usually more accurate and faster on queries with low error rates.

Please cite: Li, Heng and Durbin, Richard: Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25(14):1754-1760 (2009)
Dialign
Segment-based multiple sequence alignment
Versions of package dialign
ReleaseVersionArchitectures
lenny2.2.1-3i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
sid2.2.1-3s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze2.2.1-3sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
etch2.2.1-1arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package dialign:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing
works-withspecial:todo
works-with-formatplaintext
Popcon: 20 users (13 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

DIALIGN2 is a command line tool to perform multiple alignment of protein or DNA sequences. It constructs alignments from gapfree pairs of similar segments of the sequences. This scoring scheme for alignments is the basic difference between DIALIGN and other global or local alignment methods. Note that DIALIGN does not employ any kind of gap penalty. It has been published by Morgenstern B. in Bioinformatics. 1999 Mar;15(3):211-8.

Please cite: Burkhard Morgenstern: DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment.. Bioinformatics 15(3):211-218 (1999)
Dialign-tx
Segment-based multiple sequence alignment
Versions of package dialign-tx
ReleaseVersionArchitectures
sid1.0.2-1ia64,alpha,amd64,armel,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390,sparc
squeeze1.0.2-1mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390
lenny1.0.1-2arm,i386,armel,mipsel,ia64,alpha,mips,amd64,s390,hppa,sparc,powerpc
Debtags of package dialign-tx:
fieldbiology:bioinformatics, biology
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 10 users (8 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

DIALIGN-TX is a command line tool to perform multiple alignment of protein or DNA sequences. It is a complete reimplementation of the segment-base approach including several new improvements and heuristics that significantly enhance the quality of the output alignments compared to DIALIGN 2.2 and DIALIGN-T. For pairwise alignment, DIALIGN-TX uses a fragment-chaining algorithm that favours chains of low-scoring local alignments over isolated high-scoring fragments. For multiple alignment, DIALIGN-TX uses an improved greedy procedure that is less sensitive to spurious local sequence similarities.

DIALIGN-TX has been published in Amarendran R. Subramanian, Michael Kaufmann, Burkhard Morgenstern: Improvement of the segment-based approach for multiple sequence alignment by combining greedy and progressive alignment strategies, Algorithms for Molecular Biology 3:6, 2008

Please cite: Amarendran R. Subramanian, Michael Kaufmann, Burkhard Morgenstern: Improvement of the segment-based approach for multiple sequence alignment by combining greedy and progressive alignment strategies. Algorithms for Molecular Biology 3:6 (2008)
Embassy-domainatrix
Extra EMBOSS commands to handle domain classification file
Versions of package embassy-domainatrix
ReleaseVersionArchitectures
lenny0.1.0-1i386,armel,s390,ia64,sparc,alpha,mips,powerpc,mipsel,hppa,arm,amd64
sid0.1.0+20090715-1+b1armel,ia64
squeeze0.1.0+20090715-1+b1armel,ia64
sid0.1.0+20090715-1alpha,mipsel,powerpc,sparc,hppa,i386,amd64,s390,mips
squeeze0.1.0+20090715-1sparc,amd64,hppa,i386,mips,mipsel,powerpc,s390
Debtags of package embassy-domainatrix:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usesearching, editing, converting, analysing
works-with-formatplaintext
Popcon: 5 users (3 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

The DOMAINATRIX programs were developed by Jon Ison and colleagues at MRC HGMP for their protein domain research. They are included as an EMBASSY package as a work in progress.

Applications in the current domainatrix release are cathparse (generates DCF file from raw CATH files), domainnr (removes redundant domains from a DCF file), domainreso (removes low resolution domains from a DCF file), domainseqs (adds sequence records to a DCF file), domainsse (adds secondary structure records to a DCF file), scopparse (generates DCF file from raw SCOP files) and ssematch (searches a DCF file for secondary structure matches).

Embassy-domalign
Extra EMBOSS commands for protein domain alignment
Versions of package embassy-domalign
ReleaseVersionArchitectures
lenny0.1.0-1mips,hppa,sparc,i386,amd64,ia64,s390,powerpc,mipsel,arm,armel,alpha
sid0.1.0+20090715-1+b1powerpc,ia64,armel,hppa
squeeze0.1.0+20090715-1+b1armel,powerpc,ia64,hppa
sid0.1.0+20090715-1amd64,sparc,mipsel,alpha,mips,i386,s390
squeeze0.1.0+20090715-1sparc,amd64,i386,mips,mipsel,s390
Debtags of package embassy-domalign:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
useediting, comparing, analysing
works-with-formatplaintext
Popcon: 4 users (2 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

The DOMALIGN programs were developed by Jon Ison and colleagues at MRC HGMP for their protein domain research. They are included as an EMBASSY package as a work in progress.

Applications in the current domalign release are allversusall (sequence similarity data from all-versus-all comparison), domainalign (generates alignments (DAF file) for nodes in a DCF file), domainrep (reorders DCF file to identify representative structures) and seqalign (extend alignments (DAF file) with sequences (DHF file)).

Embassy-domsearch
Extra EMBOSS commands to search for protein domains
Versions of package embassy-domsearch
ReleaseVersionArchitectures
sid0.1.0+20980715-1mips,armel,mipsel,i386,powerpc,amd64,s390,ia64,sparc,alpha,hppa
squeeze0.1.0+20980715-1sparc,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

The DOMSEARCH programs were developed by Jon Ison and colleagues at MRC HGMP for their protein domain research. They are included as an EMBASSY package as a work in progress.

Applications in this DOMSEARCH release are seqfraggle (removes fragment sequences from DHF files), seqnr (removes redundancy from DHF files), seqsearch (generates PSI-BLAST hits (DHF file) from a DAF file), seqsort (Remove ambiguous classified sequences from DHF files) and seqwords (Generates DHF files from keyword search of UniProt).

Emboss
the european molecular biology open software suite
Versions of package emboss
ReleaseVersionArchitectures
sid6.1.0-4ia64,alpha,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc
squeeze6.1.0-4mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390
lenny5.0.0-7i386,arm,mipsel,ia64,alpha,mips,armel,amd64,s390,hppa,sparc,powerpc
Debtags of package emboss:
fieldbiology:molecular, biology:bioinformatics, biology
interfacecommandline
roleprogram
scopesuite
useviewing, typesetting, text-formatting, searching, organizing, editing, converting, comparing, analysing
works-withspecial:todo, db
works-with-formatspecial:todo, plaintext
Popcon: 28 users (15 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

EMBOSS is a free Open Source software analysis package specially developed for the needs of the molecular biology (e.g. EMBnet) user community. The software automatically copes with data in a variety of formats and even allows transparent retrieval of sequence data from the web. Also, as extensive libraries are provided with the package, it is a platform to allow other scientists to develop and release software in true open source spirit. EMBOSS also integrates a range of currently available packages and tools for sequence analysis into a seamless whole. EMBOSS breaks the historical trend towards commercial software packages.

The package is enhanced by the following packages: clustalw primer3
Please cite: P. Rice, I. Longden, A. Bleasby: EMBOSS: the European Molecular Biology Open Software Suite.. Trends Genet., 16(6):276-277 (2000)
Screenshots of package emboss
Exonerate
generic tool for pairwise sequence comparison
Versions of package exonerate
ReleaseVersionArchitectures
sid2.2.0-2ia64,alpha,amd64,armel,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390,sparc
squeeze2.2.0-2mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390
lenny2.0.0-1+b1sparc
lenny2.0.0-1powerpc,i386,armel,mipsel,ia64,alpha,mips,amd64,s390,hppa,arm
Debtags of package exonerate:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usesearching
works-with-formatplaintext
Popcon: 12 users (9 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Exonerate allows you to align sequences using a many alignment models, using either exhaustive dynamic programming, or a variety of heuristics. Much of the functionality of the Wise dynamic programming suite was reimplemented in C for better efficiency. Exonerate is an intrinsic component of the building of the Ensembl genome databases, providing similarity scores between RNA and DNA sequences and thus determining splice variants and coding sequences in general.

An In-silico PCR Experiment Simulation System (see the ipcress man page) is packaged with exonerate.

This package also comes with a selection of utilities for performing simple manipulations quickly on fasta files beyond 2Gb

Please cite: G.C. Slater, E. Birney: Automated generation of heuristics for biological sequence comparison. (DOI) BMC Bioinformatics 6:31 (2005)
Fastdnaml
Tool for construction of phylogenetic trees of DNA sequences
Versions of package fastdnaml
ReleaseVersionArchitectures
sid1.2.2-9s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze1.2.2-9sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
lenny1.2.2-8i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
etch1.2.2-6arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package fastdnaml:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing, analysing
works-with-formatplaintext
Popcon: 16 users (11 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

fastDNAml is a program derived from Joseph Felsenstein's version 3.3 DNAML (part of his PHYLIP package). Users should consult the documentation for DNAML before using this program.

fastDNAml is an attempt to solve the same problem as DNAML, but to do so faster and using less memory, so that larger trees and/or more bootstrap replicates become tractable. Much of fastDNAml is merely a recoding of the PHYLIP 3.3 DNAML program from PASCAL to C.

Note that the homepage of this program is not available any more and so this program will probably not see any further updates.

Fastlink
A faster version of pedigree programs of Linkage
Versions of package fastlink
ReleaseVersionArchitectures
sid4.1P-fix95-1s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze4.1P-fix95-1sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
lenny4.1P-fix94b-2i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
etch4.1P-fix92-1arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package fastlink:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing, analysing
Popcon: 15 users (8 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Genetic linkage analysis is a statistical technique used to map genes and find the approximate location of disease genes. There was a standard software package for genetic linkage called LINKAGE. FASTLINK is a significantly modified and improved version of the main programs of LINKAGE that runs much faster sequentially, can run in parallel, allows the user to recover gracefully from a computer crash, and provides abundant new documentation. FASTLINK has been used in over 1000 published genetic linkage studies.

This package contains the following programs:

 ilink:    GEMINI optimization procedure to find a locally
           optimal value of the theta vector of recombination
           fractions
 linkmap:  calculates location scores of one locus against a
           fixed map of other loci
 lodscore: compares likelihoods at locally optimal theta
 mlink:    calculates lod scores and risk with two of more loci
 unknown:  identify possible genotypes for unknowns
Gamgi
construct, view and analyse atomic structures
Versions of package gamgi
ReleaseVersionArchitectures
sid0.14.2-1sparc,amd64,armel,hurd-i386,i386,ia64,mips,mipsel,s390,alpha
sid0.14.1-1hppa,powerpc
squeeze0.14-1hppa,ia64,mips,mipsel,armel,powerpc,s390,i386,sparc,amd64
Popcon: 7 users (32 upd.)*
Versions and Archs
License: DFSG free
Svn

GAMGI provides a graphic interface to build and analyse atomic structures. The program is aimed at the scientific community, who needs a graphic interface to study atomic structures and to prepare images for presentations, and for teaching the atomic structure of matter.

The package is enhanced by the following packages: gamgi-data gamgi-doc
Garlic
A visualization program for biomolecules
Versions of package garlic
ReleaseVersionArchitectures
lenny1.6-1i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
sid1.6-1s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze1.6-1sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
etch1.5-2arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package garlic:
fieldchemistry, biology
interfacex11
roleprogram
scopeutility
useviewing
x11application
Popcon: 40 users (22 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Garlic is written for the investigation of membrane proteins. It may be used to visualize other proteins, as well as some geometric objects. This version of garlic recognizes PDB format version 2.1. Garlic may also be used to analyze protein sequences.

It only depends on the X libraries, no other libraries are needed.

Features include:

  • The slab position and thickness are visible in a small window.
  • Atomic bonds as well as atoms are treated as independent drawable objects.
  • The atomic and bond colors depend on position. Five mapping modes are available (as for slab).
  • Capable to display stereo image.
  • Capable to display other geometric objects, like membrane.
  • Atomic information is available for atom covered by the mouse pointer. No click required, just move the mouse pointer over the structure!
  • Capable to load more than one structure.
  • Capable to draw Ramachandran plot, helical wheel, Venn diagram, averaged hydrophobicity and hydrophobic moment plot.
  • The command prompt is available at the bottom of the main window. It is able to display one error message and one command string.
Gdpc
visualiser of molecular dynamic simulations
Versions of package gdpc
ReleaseVersionArchitectures
sid2.2.5-1s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze2.2.5-1sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
lenny2.2.4-3i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
etch2.2.4-1arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package gdpc:
fieldphysics, chemistry, biology:structural, biology
interfacex11
roleprogram
scopeapplication
uitoolkitgtk
useviewing
works-withvideo, image, 3dmodel
works-with-formatpng, jpg
x11application
Popcon: 25 users (35 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

gpdc is a graphical program for visualising output data from molecular dynamics simulations. It reads input in the standard xyz format, as well as other custom formats, and can output pictures of each frame in JPG or PNG format.

Gff2aplot
pair-wise alignment-plots for genomic sequences in PostScript
Versions of package gff2aplot
ReleaseVersionArchitectures
lenny2.0-5i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
sid2.0-5s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze2.0-5sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
etch2.0-3arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package gff2aplot:
fieldbiology:bioinformatics, biology
interfaceshell, commandline
roleprogram
scopeutility
useviewing, converting
works-withimage:vector
works-with-formatpostscript, plaintext
Popcon: 12 users (7 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

A program to visualize the alignment of two genomic sequences together with their annotations. From GFF-format input files it produces PostScript figures for that alignment. The following menu lists many features of gff2aplot:

  • Comprehensive alignment plots for any GFF-feature. Attributes are defined separately so you can modify only whatsoever attributes for a given file or share same customization across different data-sets.
  • All parameters are set by default within the program, but it can be also fully configured via gff2ps-like flexible customization files. Program can handle several of such files, summarizing all the settings before producing the corresponding figure. Moreover, all customization parameters can be set via command-line switches, which allows users to play with those parameters before adding any to a customization file.
  • Source order is taken from input files, if you swap file order you can visualize alignment and its annotation with the new input arrangement.
  • All alignment scores can be visualized in a PiP box below gff2aplot area, using grey-color scale, user-defined color scale or score-dependent gradients.
  • Scalable fonts, which can also be chosen among the basic PostScript default fonts. Feature and group labels can be rotated to improve readability in both annotation axes.
  • The program is still defined as a Unix filter so it can handle data from files, redirections and pipes, writing output to standard-output and warnings to standard error.
  • gff2aplot is able to manage many physical page formats (from A0 to A10, and more -see available page sizes in its manual-), including user-defined ones. This allows, for instance, the generation of poster size genomic maps, or the use of a continuous-paper supporting plotting device, either in portrait or landscape.
  • You can draw different alignments on same alignment plot and distinguish them by using different colors for each.
  • Shape dictionary has been expanded, so that further feature shapes are now available (see manual).
  • Annotation projections through alignment plots (so called ribbons) emulate transparencies via complementary color fill patterns. This feature allows to show color pseudo-blending when horizontal and vertical ribbons overlap.
Gff2ps
produces PostScript graphical output from GFF-files
Versions of package gff2ps
ReleaseVersionArchitectures
lenny0.98d-3all
sid0.98d-3all
squeeze0.98d-3all
etch0.98d-2all
Debtags of package gff2ps:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
useviewing, converting
works-withimage:vector
works-with-formatpostscript
Popcon: 17 users (7 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

gff2ps is a script program developed with the aim of converting gff-formatted records into high quality one-dimensional plots in PostScript. Such plots maybe useful for comparing genomic structures and to visualizing outputs from genome annotation programs. It can be used in a very simple way, because it assumes that the GFF file itself carries enough formatting information, but it also allows through a number of options and/or a configuration file, for a great degree of customization.

Ghemical
A GNOME molecular modelling environment
Versions of package ghemical
ReleaseVersionArchitectures
sid2.99.2-1hppa,sparc,powerpc,armel,amd64,s390,mips,ia64,alpha,i386,hurd-i386
squeeze2.95-3+b2amd64
lenny2.95-3+b2amd64
squeeze2.95-3+b1s390,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc
lenny2.95-3+b1i386,hppa,mipsel,sparc,mips,powerpc,ia64,s390,armel,alpha,arm
sid2.95-3+b1mipsel
etch2.10-1+b1hppa
etch2.10-1sparc,mips,alpha,mipsel,arm,s390,powerpc,i386,amd64,ia64
Debtags of package ghemical:
fieldchemistry
interfacex11, 3d
roleprogram
suitegnome
uitoolkitgtk
useviewing, editing
works-with3dmodel
x11application
Popcon: 83 users (68 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Ghemical is a computational chemistry software package written in C++. It has a graphical user interface and it supports both quantum- mechanics (semi-empirical) models and molecular mechanics models. Geometry optimization, molecular dynamics and a large set of visualization tools using OpenGL are currently available.

Ghemical relies on external code to provide the quantum-mechanical calculations. Semi-empirical methods MNDO, MINDO/3, AM1 and PM3 come from the MOPAC7 package (Public Domain), and are included in the package. The MPQC package is used to provide ab initio methods: the methods based on Hartree-Fock theory are currently supported with basis sets ranging from STO-3G to 6-31G.

Screenshots of package ghemical
Glam2
gapped protein motifs from unaligned sequences
Versions of package glam2
ReleaseVersionArchitectures
lenny1064-1arm,i386,armel,mipsel,ia64,alpha,mips,amd64,s390,hppa,sparc,powerpc
sid1064-1ia64,alpha,amd64,armel,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390,sparc
squeeze1064-1mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390
Debtags of package glam2:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usesearching, comparing, analysing
works-with-formatplaintext
Popcon: 11 users (7 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

GLAM2 is a software package for finding motifs in sequences, typically amino-acid or nucleotide sequences. A motif is a re-occurring sequence pattern: typical examples are the TATA box and the CAAX prenylation motif. The main innovation of GLAM2 is that it allows insertions and deletions in motifs.

The package includes these programs:

 glam2:       discovering motifs shared by a set of sequences;
 glam2scan:   finding matches, in a sequence database, to a motif discovered
              by glam2;
 glam2format: converting glam2 motifs to  standard alignment formats;
 glam2mask:   masking glam2 motifs out of sequences, so that weaker motifs
              can be found;
 glam2-purge: removing highly similar members of a set of sequences.

In this package, the fast Fourier algorithm (FFT) was enabled for glam2.

If you use GLAM2, please cite: MC Frith, NFW Saunders, B Kobe, TL Bailey (2008) Discovering sequence motifs with arbitrary insertions and deletions, PLoS Computational Biology (in press).

Please cite: MC Frith, NFW Saunders, B Kobe, TL Bailey: Discovering sequence motifs with arbitrary insertions and deletions. PLoS Computational Biology (2008)
Gromacs
Molecular dynamics simulator, with building and analysis tools
Versions of package gromacs
ReleaseVersionArchitectures
sid4.0.5-4s390,alpha,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc
squeeze4.0.5-4sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
lenny3.3.3-2alpha,s390,hppa,mips,amd64,powerpc,sparc,mipsel,arm,ia64,armel,i386
etch3.3.1-4s390,mips,arm,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package gromacs:
fieldchemistry, biology:structural, biology
interfacex11, commandline
roleprogram
uitoolkitxlib
usespecial:not-applicable
x11application
Popcon: 50 users (25 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

GROMACS is a versatile package to perform molecular dynamics, i.e. simulate the Newtonian equations of motion for systems with hundreds to millions of particles.

It is primarily designed for biochemical molecules like proteins and lipids that have a lot of complicated bonded interactions, but since GROMACS is extremely fast at calculating the nonbonded interactions (that usually dominate simulations) many groups are also using it for research on non- biological systems, e.g. polymers.

GROMACS offers entirely too many features for a brief description to do it justice. A more complete listing is available at http://www.gromacs.org/content/view/12/176/.

Hmmer
profile hidden Markov models for protein sequence analysis
Versions of package hmmer
ReleaseVersionArchitectures
lenny2.3.2-5s390,hppa,amd64,sparc,powerpc,mips,mipsel,arm,ia64,armel,i386,alpha
sid2.3.2-5s390,alpha,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc
squeeze2.3.2-5sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
etch2.3.2-2mips,arm,s390,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
sid2.1.4hurd-i386
Debtags of package hmmer:
biologypeptidic, format:fasta, format:aln
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usesearching
works-withdb
works-with-formatplaintext
Popcon: 23 users (14 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

HMMER is an implementation of profile hidden Markov model methods for sensitive searches of biological sequence databases using multiple sequence alignments as queries.

Given a multiple sequence alignment as input, HMMER builds a statistical model called a "hidden Markov model" which can then be used as a query into a sequence database to find (and/or align) additional homologues of the sequence family.

Please cite: S. R. Eddy.: Multiple Alignment Using Hidden Markov Models.. Proc. Third Int. Conf. Intelligent Systems for Molecular Biology, 114-120. (1995)
Infernal
inference of RNA secondary structural alignments
Versions of package infernal
ReleaseVersionArchitectures
sid1.0.2-1sparc,amd64,armel,hurd-i386,i386,ia64,mips,mipsel,s390,alpha
sid1.0-2powerpc
squeeze1.0-2i386,mipsel,amd64,powerpc,s390,ia64,sparc,mips,armel
Popcon: 1 users (3 upd.)*
Versions and Archs
License: DFSG free
Svn

Infernal ("INFERence of RNA ALignment") is for searching DNA sequence databases for RNA structure and sequence similarities. It is an implementation of a special case of profile stochastic context-free grammars called covariance models (CMs). A CM is like a sequence profile, but it scores a combination of sequence consensus and RNA secondary structure consensus, so in many cases, it is more capable of identifying RNA homologs that conserve their secondary structure more than their primary sequence.

The tool is an integral component of the Rfam database.

Please cite: Nawrocki, Eric P. and Kolbe, Diana L. and Eddy, Sean R.: Infernal 1.0: inference of RNA alignments. Bioinformatics 15;25(10):1335-7 (2009)
Kalign
Global and progressive multiple sequence alignment
Versions of package kalign
ReleaseVersionArchitectures
sid2.04-2s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze2.04-2sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
lenny2.03-2i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
etch1.04-1arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package kalign:
biologypeptidic, nuceleic-acids, format:fasta, format:aln
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 17 users (22 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Kalign is a command line tool to perform multiple alignment of biological sequences. It employs the Muth-Manber string-matching algorithm, to improve both the accuracy and speed of the alignment. It uses global, progressive alignment approach, enriched by employing an approximate string-matching algorithm to calculate sequence distances and by incorporating local matches into the otherwise global alignment.

Please cite: Lassmann T, Sonnhammer EL.: Kalign--an accurate and fast multiple sequence alignment algorithm. BMC Bioinformatics, 6:298 (2005)
Last-align
genome-scale comparison of biological sequences
Versions of package last-align
ReleaseVersionArchitectures
sid87-1sparc,amd64,armel,hurd-i386,i386,ia64,mips,mipsel,powerpc,s390,alpha
sid70-1hppa
squeeze70-1mips,hppa,mipsel,armel,powerpc,i386,s390,amd64,sparc,ia64
Popcon: 4 users (13 upd.)*
Versions and Archs
License: DFSG free
Svn

LAST is software for comparing and aligning sequences, typically DNA or protein sequences. LAST is similar to BLAST, but it copes better with very large amounts of sequence data. Here are two things LAST is good at:

  • Comparing large (e.g. mammalian) genomes.
  • Mapping lots of sequence tags onto a genome.

The main technical innovation is that LAST finds initial matches based on their multiplicity, instead of using a fixed size (e.g. BLAST uses 11-mers). This allows to map tags to genomes without repeat-masking, without becoming overwhelmed by repetitive hits. To find these variable-sized matches, it uses a suffix array (inspired by Vmatch). To achieve high sensitivity, it uses a discontiguous suffix array, analogous to spaced seeds.

Loki
MCMC linkage analysis on general pedigrees
Versions of package loki
ReleaseVersionArchitectures
lenny2.4.7.4-4i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
sid2.4.7.4-4s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze2.4.7.4-4sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
etch2.4.7.4-1arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package loki:
fieldbiology
interfacecommandline
roleprogram
scopeutility
useanalysing
Popcon: 16 users (8 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Performs Markov chain Monte Carlo multipoint linkage analysis on large, complex pedigrees. The current package supports analyses on quantitative traits only, although this restriction will be lifted in later versions. Joint estimation of QTL number, position and effects uses Reversible Jump MCMC. It is also possible to perform affected only IBD sharing analyses.

The homepage of this project used to be at http://loki.homeunix.net but the project is dead now and the homepage vanished. The Homepage field above points to the web archive.

The package is enhanced by the following packages: loki-doc
Mafft
Multiple alignment program for amino acid or nucleotide sequences
Versions of package mafft
ReleaseVersionArchitectures
sid6.716-1s390,mipsel,armel,alpha,powerpc,amd64,ia64,i386,sparc,mips,hurd-i386
sid6.713-1hppa
squeeze6.713-1ia64,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc
lenny6.240-2mipsel,s390,mips,ia64,alpha,arm,armel,amd64,powerpc,hppa,i386,sparc
Debtags of package mafft:
biologypeptidic, nuceleic-acids, format:fasta, format:aln
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 8 users (15 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

MAFFT is a multiple sequence alignment program which offers three accuracy-oriented methods:

  • L-INS-i (probably most accurate; recommended for <200 sequences; iterative refinement method incorporating local pairwise alignment information),

  • G-INS-i (suitable for sequences of similar lengths; recommended for <200 sequences; iterative refinement method incorporating global pairwise alignment information),

  • E-INS-i (suitable for sequences containing large unalignable regions; recommended for <200 sequences), and five speed-oriented methods:

  • FFT-NS-i (iterative refinement method; two cycles only),

  • FFT-NS-i (iterative refinement method; max. 1000 iterations),

  • FFT-NS-2 (fast; progressive method),

  • FFT-NS-1 (very fast; recommended for >2000 sequences; progressive method with a rough guide tree),

  • NW-NS-PartTree-1 (recommended for ∼50,000 sequences; progressive method with the PartTree algorithm).

Please cite: K. Katoh, G. Asimenos, H. Toh: Multiple alignment of DNA sequences with MAFFT. Methods Mol Biol. 537:39-64 (2009)
Maq
maps short fixed-length polymorphic DNA sequence reads to reference sequences
Versions of package maq
ReleaseVersionArchitectures
sid0.7.1-3+b1armel,powerpc,hppa,ia64
squeeze0.7.1-3+b1hppa,armel,ia64,powerpc
sid0.7.1-3i386,amd64,mips,alpha,sparc,s390,mipsel,hurd-i386
squeeze0.7.1-3sparc,amd64,i386,mips,mipsel,s390
Debtags of package maq:
biologynuceleic-acids
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usesearching, comparing, analysing
works-with-formatplaintext
Popcon: 3 users (2 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Maq (short for Mapping and Assembly with Quality) builds mapping assemblies from short reads generated by the next-generation sequencing machines. It is particularly designed for Illumina-Solexa 1G Genetic Analyzer, and has a preliminary functionality to handle ABI SOLiD data. Maq is previously known as mapass2.

With Maq you can:

  • Fast align Illumina/SOLiD reads to the reference genome. With the default options, one million pairs of reads can be mapped to the human genome in about 10 CPU hours with less than 1G memory.
  • Accurately measure the error probability of the alignment of each individual read.
  • Call the consensus genotypes, including homozygous and heterozygous polymorphisms, with a Phred probabilistic quality assigned to each base.
  • Find short indels with paired end reads.
  • Accurately find large scale genomic deletions and translocations with paired end reads.
  • Discover potential CNVs by checking read depth.
  • Evaluate the accuracy of raw base qualities from sequencers and help to check the systematic errors.

However, Maq can NOT:

  • Do de novo assembly. (Maq can only call the consensus by mapping reads to a known reference.)
  • Map shorts reads against themselves. (Maq can only find complete overlap between reads.)
  • Align capillary reads or 454 reads to the reference. (Maq cannot align reads longer than 63bp.)

This package is likely to be useful for users working with genetics or genomic studies in biology who need to assembly DNA sequences from fixed-length sequencers.

Massxpert
linear polymer mass spectrometry software
Versions of package massxpert
ReleaseVersionArchitectures
sid2.0.7-1mips,armel,mipsel,hurd-i386,powerpc,i386,s390,amd64,sparc,alpha,ia64,hppa
squeeze2.0.7-1sparc,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390
Popcon: 7 users (19 upd.)*
Versions and Archs
License: DFSG free
Git

massXpert is a program to simulate and analyse mass spectrometric data obtained on linear (bio-)polymers. It is the successor of GNU polyxmass.

Four modules allow:

  • making brand new polymer chemistry definitions;
  • using the definitions to perform easy calculations in a desktop calculator-like manner;
  • performing sophisticated polymer sequence editing and simulations;
  • perform m/z list comparisons;

Chemical simulations encompass cleavage (either chemical or enzymatic), gas-phase fragmentations, chemical modification of any monomer in the polymer sequence, cross-linking of monomers in the sequence, arbitrary mass searches, calculation of the isotopic pattern...

Melting
compute the melting temperature of nucleic acid duplex
Versions of package melting
ReleaseVersionArchitectures
sid4.3b-1hppa,s390,mipsel,armel,alpha,powerpc,amd64,i386,ia64,hurd-i386,sparc,mips
lenny4.2h-3mips,i386,s390,alpha,ia64,arm,armel,amd64,powerpc,hppa,sparc,mipsel
squeeze4.2h-3ia64,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc
upstream4.3c
Debtags of package melting:
biologynuceleic-acids
fieldbiology:molecular, biology
interfacecommandline
roleprogram
scopeutility
useanalysing
works-with-formatplaintext
Popcon: 17 users (11 upd.)*
Newer upstream!
Debtags
License: DFSG free
Svn

This program computes, for a nucleic acid duplex, the enthalpy, the entropy and the melting temperature of the helix-coil transitions. Three types of hybridisation are possible: DNA/DNA, DNA/RNA, and RNA/RNA. The program first computes the hybridisation enthalpy and entropy from the elementary parameters of each Crick's pair by the nearest-neighbor method. Then the melting temperature is computed. The set of thermodynamic parameters can be easily changed, for instance following an experimental breakthrough.

Please cite: Nicolas Le Novère: MELTING, computing the melting temperature of nucleic acid duplex Bioinformatics, 17: S.1226-1227 (2001)
Mipe
Tools to store PCR-derived data
Versions of package mipe
ReleaseVersionArchitectures
lenny1.1-3all
sid1.1-3all
squeeze1.1-3all
etch1.1-1all
Debtags of package mipe:
fieldbiology:molecular, biology:bioinformatics, biology
interfacecommandline
roleprogram, documentation
scopeutility
useorganizing
works-with-formatxml
Popcon: 12 users (7 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

MIPE provides a standard format to exchange and/or storage of all information associated with PCR experiments using a flat text file. This will:

  • allow for exchange of PCR data between researchers/laboratories
  • enable traceability of the data
  • prevent problems when submitting data to dbSTS or dbSNP
  • enable the writing of standard scripts to extract data (e.g. a list of PCR primers, SNP positions or haplotypes for different animals)

Although this tool can be used for data storage, it's primary focus should be data exchange. For larger repositories, relational databases are more appropriate for storage of these data. The MIPE format could then be used as a standard format to import into and/or export from these databases.

MIPE was published in: Aerts J & Veenendaal T. MIPE - a XML-format to facilitate the storage and exchange of PCR-related data. Online Journal of Bioinformatics 6(2): 114-120 (2005).

Please cite: Aerts J & Veenendaal T.: MIPE - a XML-format to facilitate the storage and exchange of PCR-related data Online Journal of Bioinformatics 6(2): 114-120 (2005)
Mummer
Efficient sequence alignment of full genomes
Versions of package mummer
ReleaseVersionArchitectures
sid3.22~dfsg-1mipsel,hppa,alpha,powerpc,armel,ia64,amd64,mips,i386,sparc
lenny3.20-3i386,mipsel,ia64,alpha,mips,arm,armel,amd64,s390,hppa,sparc,powerpc
sid3.20-3s390
squeeze3.20-3ia64,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc
Debtags of package mummer:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 11 users (23 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

MUMmer is a system for rapidly aligning entire genomes, whether in complete or draft form. For example, MUMmer 3.0 can find all 20-basepair or longer exact matches between a pair of 5-megabase genomes in 13.7 seconds, using 78 MB of memory, on a 2.4 GHz Linux desktop computer. MUMmer can also align incomplete genomes; it handles the 100s or 1000s of contigs from a shotgun sequencing project with ease, and will align them to another set of contigs or a genome using the NUCmer program included with the system. If the species are too divergent for DNA sequence alignment to detect similarity, then the PROmer program can generate alignments based upon the six-frame translations of both input sequences.

Please cite: Stefan Kurtz, Adam Phillippy, Arthur L. Delcher, Michael Smoot, Martin Shumway, Corina Antonescu, Steven L. Salzberg: Versatile and open software for comparing large genomes. (DOI) Genome Biol. 5(2):R12 (2004)
Muscle
Multiple alignment program of protein sequences
Versions of package muscle
ReleaseVersionArchitectures
lenny3.70+fix1-2i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
sid3.70+fix1-2s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze3.70+fix1-2sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
etch3.60-1arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package muscle:
biologypeptidic, nuceleic-acids, format:fasta, format:aln
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 21 users (9 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

MUSCLE is a multiple alignment program for protein sequences. MUSCLE stands for multiple sequence comparison by log-expectation. In the authors tests, MUSCLE achieved the highest scores of all tested programs on several alignment accuracy benchmarks, and is also one of the fastest programs out there.

Mustang
multiple structural alignment of proteins
Versions of package mustang
ReleaseVersionArchitectures
sid3.2.1-1sparc,amd64,armel,hppa,hurd-i386,ia64,mips,mipsel,powerpc,s390,alpha
sid3.2.0-1i386
squeeze3.2.0-1mips,armel,mipsel,amd64,powerpc,s390,i386,sparc,hppa,ia64
Debtags of package mustang:
biologypeptidic
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
usecomparing, analysing
works-with-formatplaintext
Popcon: 3 users (12 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Mustang is an algorithm to align multiple protein structures. Given a set of PDB files, the program uses the spatial information in the Calpha atoms of the set to produce a sequence alignment. Based on a progressive pairwise heuristic the algorithm then proceeds through a number of refinement passes. Mustang reports the multiple sequence alignment and the corresponding superposition of structures.

When using this program please cite A.S. Konagurthu, J.C. Whisstock, P.J. Stuckey, A.M. Lesk, "MUSTANG: A multiple structural alignment algorithm." Proteins: Structure, Function, and Bioinformatics. 64(3):559-574.

The package is enhanced by the following packages: mustang-testdata
Please cite: A. S. Konagurthu, J. C. Whisstock, P. J. Stuckey, A. M. Lesk: MUSTANG: A multiple structural alignment algorithm Proteins: Structure, Function, and Bioinformatics. 64(3):559-574 (2006)
Ncbi-epcr
Tool to test a DNA sequence for the presence of sequence tagged sites
Versions of package ncbi-epcr
ReleaseVersionArchitectures
sid2.3.12-1s390,alpha,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc
squeeze2.3.12-1sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
lenny2.3.10-1s390,hppa,amd64,sparc,mips,ia64,mipsel,arm,i386,armel,powerpc,alpha
etch1.2.0-3+b1arm,mipsel,s390,mips,powerpc,sparc,hppa,alpha,i386,ia64
etch1.2.0-3amd64
sid1.2.0-3hurd-i386
Debtags of package ncbi-epcr:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usesearching, checking
works-with-formatplaintext
Popcon: 16 users (11 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Electronic PCR (e-PCR) is computational procedure that is used to identify sequence tagged sites(STSs), within DNA sequences. e-PCR looks for potential STSs in DNA sequences by searching for subsequences that closely match the PCR primers and have the correct order, orientation, and spacing that could represent the PCR primers used to generate known STSs.

The new version of e-PCR implements a fuzzy matching strategy. To reduce likelihood that a true STS will be missed due to mismatches, multiple discontigous words may be used instead of a single exact word. Each of this word has groups of significant positions separated by 'wildcard' positions that are not required to match. In addition, it is also possible to allow gaps in the primer alignments.

The main motivation for implementing reverse searching (called Reverse e-PCR) was to make it feasible to search the human genome sequence and other large genomes. The new version of e-PCR provides a search mode using a query sequence against a sequence database.

Ncbi-tools-bin
NCBI libraries for biology applications (text-based utilities)
Maintainer: Aaron M. Ucko
Versions of package ncbi-tools-bin
ReleaseVersionArchitectures
sid6.1.20090809-1+b1ia64,armel
squeeze6.1.20090809-1+b1armel,ia64
sid6.1.20090809-1s390,alpha,amd64,hppa,hurd-i386,i386,mips,mipsel,powerpc,sparc
squeeze6.1.20090809-1mips,mipsel,sparc,powerpc,i386,hppa,s390,amd64
lenny6.1.20080302-4powerpc,ia64,mips,sparc,mipsel,arm,armel,hppa,s390,amd64,i386,alpha
etch6.1.20061015-1hppa,mipsel,amd64,arm,mips,i386,powerpc,sparc,s390,ia64,alpha
Debtags of package ncbi-tools-bin:
fieldbiology
interfacecommandline
roleprogram
scopeutility
usesearching, converting
Popcon: 24 users (16 upd.)*
Versions and Archs
Debtags
License: DFSG free

This package includes various utilities distributed with the NCBI C SDK. None of the programs in this package require X; you can find the X-based utilities in the ncbi-tools-x11 package. BLAST and related tools are in a separate package (blast2).

The package is enhanced by the following packages: mcl
Ncbi-tools-x11
NCBI libraries for biology applications (X-based utilities)
Maintainer: Aaron M. Ucko
Versions of package ncbi-tools-x11
ReleaseVersionArchitectures
sid6.1.20090809-1+b1ia64,armel
squeeze6.1.20090809-1+b1armel,ia64
sid6.1.20090809-1s390,alpha,amd64,hppa,hurd-i386,i386,mips,mipsel,powerpc,sparc
squeeze6.1.20090809-1mips,mipsel,sparc,powerpc,i386,hppa,s390,amd64
lenny6.1.20080302-4powerpc,ia64,mips,sparc,mipsel,arm,armel,hppa,s390,amd64,i386,alpha
etch6.1.20061015-1hppa,mipsel,amd64,arm,mips,i386,powerpc,sparc,s390,ia64,alpha
Debtags of package ncbi-tools-x11:
fieldbiology
interfacex11, 3d
roleprogram
scopeutility
uitoolkitmotif
usesearching
x11application
Popcon: 75 users (7 upd.)*
Versions and Archs
Debtags
License: DFSG free

This package includes some X-based utilities distributed with the NCBI C SDK: Cn3D, Network Entrez, Sequin, ddv, and udv. These programs are not part of ncbi-tools-bin because they depend on several additional library packages.

Njplot
A phylogenetic tree drawing program
Versions of package njplot
ReleaseVersionArchitectures
sid2.3-2s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze2.3-2sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
lenny2.2-1i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
etch0.20060606-2arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package njplot:
fieldbiology:bioinformatics, biology
interfacex11
roleprogram
scopeutility
uitoolkitmotif
useviewing, printing, organizing, editing, analysing
works-with-formatplaintext
x11application
Popcon: 67 users (4 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

NJplot is able to draw any dendrogram expressed in the Newick standard phylogenetic tree format (e.g., the format used by the Phylip package). NJplot is especially convenient for rooting the unrooted trees obtained from parsimony, distance or maximum likelihood tree-building methods.

Perlprimer
Graphical design of primers for PCR
Versions of package perlprimer
ReleaseVersionArchitectures
sid1.1.18-1all
squeeze1.1.18-1all
lenny1.1.14-3all
etch1.1.14-1all
Debtags of package perlprimer:
biologypeptidic, nuceleic-acids, format:fasta, format:aln
fieldbiology:molecular, biology
interfacex11
networkclient
roleprogram
scopeutility
uitoolkittk
useanalysing
works-with-formatplaintext
x11application
Popcon: 63 users (17 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

PerlPrimer is a free, open-source GUI application written in Perl that designs primers for standard Polymerase Chain Reaction (PCR), bisulphite PCR, real-time PCR (QPCR) and sequencing. It aims to automate and simplify the process of primer design.

If operated online, the tool nicely communicates with the Ensembl project for further insights into the gene structure, i.e., allowing for taking the location of exons and introns into account for the design of the primers. The sequences themselves can be retrieved, too.

Screenshots of package perlprimer
Phyml
Phylogenetic estimation using Maximum Likelihood
Versions of package phyml
ReleaseVersionArchitectures
sid200900706-2+b1alpha,armel,ia64,powerpc
squeeze200900706-2+b1powerpc,ia64,armel
sid200900706-2mipsel,amd64,s390,hppa,hurd-i386,i386,sparc,mips
squeeze200900706-2mipsel,s390,i386,sparc,amd64,hppa,mips
Popcon: 5 users (2 upd.)*
Versions and Archs
License: DFSG free
Svn

PhyML is a software that estimates maximum likelihood phylogenies from alignments of nucleotide or amino acid sequences. It provides a wide range of options that were designed to facilitate standard phylogenetic analyses. The main strengths of PhyML lies in the large number of substitution models coupled to various options to search the space of phylogenetic tree topologies, going from very fast and efficient methods to slower but generally more accurate approaches. It also implements two methods to evaluate branch supports in a sound statistical framework (the non-parametric bootstrap and the approximate likelihood ratio test).

PhyML was designed to process moderate to large data sets. In theory, alignments with up to 4,000 sequences 2,000,000 character-long can be analyzed. In practice however, the amount of memory required to process a data set is proportional of the product of the number of sequences by their length. Hence, a large number of sequences can only be processed provided that they are short. Also, PhyML can handle long sequences provided that they are not numerous. With most standard personal computers, the “comfort zone” for PhyML generally lies around 3 to 500 sequences less than 2,000 character long.

Plasmidomics
draw plasmids and vector maps with PostScript graphics export
Versions of package plasmidomics
ReleaseVersionArchitectures
lenny0.2.0-2all
sid0.2.0-2all
squeeze0.2.0-2all
Debtags of package plasmidomics:
fieldbiology:molecular, biology
interfacex11
roleprogram
scopeutility
uitoolkittk
works-withimage:vector
works-with-formatpostscript
x11application
Popcon: 12 users (8 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Plasmidomics is written for easy drawing of plasmids and vector maps to use them in theses, presentations or other forms of publications. It natively supports PostScript as output format.

Plink
whole-genome association analysis toolset
Versions of package plink
ReleaseVersionArchitectures
sid1.07-1armel,hppa,hurd-i386,i386,ia64,powerpc,s390,alpha,sparc,amd64
sid1.03p1-1mipsel,mips
Popcon: 0 users (4 upd.)*
Versions and Archs
License: DFSG free
Svn

plink expects as input the data from SNP (single nucleotide polymorphism) chips of many individuals and their phenotypical description of a disease. It finds associations of single or pairs of DNA variations with a phenotype and can retrieve SNP annotation from an online source.

SNPs can evaluated individually or as pairs for their association with the disease phenotypes. The joint investigation of copy number variations is supported. A variety of statistical tests have been implemented.

Please note: The executable was renamed to p-link because of a name clash. Please read more about this in /usr/share/doc/README.Debian.

Poa
Partial Order Alignment for multiple sequence alignment
Versions of package poa
ReleaseVersionArchitectures
lenny2.0+20060928-2i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
sid2.0+20060928-2s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze2.0+20060928-2sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
etch2.0+20060928-1arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package poa:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
works-withspecial:todo
works-with-formatplaintext
Popcon: 13 users (7 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

POA is Partial Order Alignment, a fast program for multiple sequence alignment (MSA) in bioinformatics. Its advantages are speed, scalability, sensitivity, and the superior ability to handle branching / indels in the alignment. Partial order alignment is an approach to MSA, which can be combined with existing methods such as progressive alignment. POA optimally aligns a pair of MSAs and which therefore can be applied directly to progressive alignment methods such as CLUSTAL. For large alignments, Progressive POA is 10-30 times faster than CLUSTALW. POA is published in Bioinformatics. 2004 Jul 10;20(10):1546-56.

Please cite: C. Grasso, C. Lee: Combining partial order alignment and progressive multiple sequence alignment increases alignment speed and scalability to very large alignment problems Bioinformatics 20(10):1546-1556. (2004)
Primer3
Tool to design flanking oligo nucleotides for DNA amplification
Versions of package primer3
ReleaseVersionArchitectures
lenny1.1.4-1i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
sid1.1.4-1s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze1.1.4-1sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
etch1.0b-1arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package primer3:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
works-withspecial:todo
works-with-formatplaintext
Popcon: 25 users (13 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Primer3 picks primers for Polymerase Chain Reactions (PCRs), considering as criteria oligonucleotide melting temperature, size, GC content and primer-dimer possibilities, PCR product size, positional constraints within the source sequence, and miscellaneous other constraints. All of these criteria are user-specifiable as constraints, and some are specifiable as terms in an objective function that characterizes an optimal primer pair.

It has been published in Rozen S and Skaletsky H, "Primer3 on the WWW for general users and for biologist programmers.", Methods Mol Biol. 2000;132:365-86.

The Whitehead Institute for Biomedical Research provides a web-based front end to Primer3.

Please cite: S. Rozen, H. Skaletsky: Primer3 on the WWW for general users and for biologist programmers Methods Mol Biol. 132:365-86 (2000)
Probcons
PROBabilistic CONSistency-based multiple sequence alignment
Versions of package probcons
ReleaseVersionArchitectures
lenny1.12-4i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
sid1.12-4s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze1.12-4sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
etch1.11-1arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package probcons:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing
works-withspecial:todo
works-with-formatplaintext
Popcon: 15 users (9 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Tool for generating multiple alignments of protein sequences. Using a combination of probabilistic modeling and consistency-based alignment techniques, PROBCONS has achieved the highest accuracies of all alignment methods to date. On the BAliBASE benchmark alignment database, alignments produced by PROBCONS show statistically significant improvement over current programs, containing an average of 7% more correctly aligned columns than those of T-Coffee, 11% more correctly aligned columns than those of CLUSTAL W, and 14% more correctly aligned columns than those of DIALIGN. Probcons is published in Do, C.B., Mahabhashyam, M.S.P., Brudno, M., and Batzoglou, S. 2005. Genome Research 15: 330-340.

Please cite: C. B. Do, M. S. P. Mahabhashyam, M. Brudno, S. Batzoglou: Genome Research 15: 330-340 (2005)
Proda
multiple alignment of protein sequences
Versions of package proda
ReleaseVersionArchitectures
lenny1.0-7i386,arm,mipsel,ia64,alpha,mips,armel,amd64,s390,hppa,sparc,powerpc
sid1.0-7ia64,alpha,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc
squeeze1.0-7mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390
Debtags of package proda:
biologypeptidic, nuceleic-acids, format:fasta
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing, analysing
works-withspecial:todo
works-with-formatplaintext
Popcon: 11 users (8 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

ProDA is a system for automated detection and alignment of homologous regions in collections of proteins with arbitrary domain architectures. Given an input set of unaligned sequences, ProDA identifies all homologous regions appearing in one or more sequences, and returns a collection of local multiple alignments for these regions.

ProDA is published in: Phuong T.M., Do C.B., Edgar R.C., and Batzoglou S. Multiple alignment of protein sequences with repeats and rearrangements. Nucleic Acids Research 2006 34(20), 5932-5942.

Please cite: T. M. Phuong, C. B. Do, R. C. Edgar, S. Batzoglou: Multiple alignment of protein sequences with repeats and rearrangements Nucleic Acids Research 34(20), 5932-5942 (2006)
Pymol
Molecular Graphics System
Versions of package pymol
ReleaseVersionArchitectures
sid1.2r1-3i386,powerpc,armel,alpha,amd64,ia64
sid1.2r1-2s390,hppa,hurd-i386,mips,mipsel,sparc
squeeze1.2r1-2ia64,hppa,powerpc,armel,s390,amd64,mips,sparc,mipsel,i386
lenny1.1-1i386,ia64,mips,hppa,alpha,amd64,powerpc,sparc,mipsel,arm,armel,s390
etch0.98+0.99rc6-2mipsel,mips,amd64,sparc,alpha,arm,s390,hppa,i386,powerpc,ia64
Debtags of package pymol:
fieldchemistry, biology:structural
interfacex11
roleprogram
scopeutility
uitoolkittk
useviewing
works-withimage
x11application
Popcon: 297 users (68 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

PyMOL is a molecular graphics system targetted at medium to large biomolecules like proteins. It can generate high-quality publication-ready molecular graphics images and animations.

Features include:

  • Visualization of molecules, molecular trajectories and surfaces of crystallography data or orbitals
  • Molecular builder and sculptor
  • Internal raytracer and movie generator
  • Fully extensible and scriptable via a Python interface

File formats PyMOL can read include PDB, XYZ, CIF, MDL Molfile, ChemDraw, CCP4 maps, XPLOR maps and Gaussian cube maps.

R-cran-qtl
GNU R package for genetic marker linkage analysis
Versions of package r-cran-qtl
ReleaseVersionArchitectures
sid1.14-2-1i386,sparc,powerpc,hurd-i386,hppa,armel,s390,amd64,alpha,mipsel,ia64
sid1.13-7-1mips
squeeze1.13-7-1s390,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc
lenny1.08-56-1sparc,amd64,hppa,powerpc,mips,i386,s390,alpha,mipsel,ia64,arm,armel
etch1.05-2-1powerpc,mips,sparc,mipsel,arm,alpha,hppa,amd64,s390,i386,ia64
Debtags of package r-cran-qtl:
devellibrary, lang:r
fieldstatistics, biology
roleapp-data
suitegnu
Popcon: 37 users (92 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

R/qtl is an extensible, interactive environment for mapping quantitative trait loci (QTLs) in experimental crosses. It is implemented as an add-on-package for the freely available and widely used statistical language/software R (see http://www.r-project.org).

The development of this software as an add-on to R allows to take advantage of the basic mathematical and statistical functions, and powerful graphics capabilities, that are provided with R. Further, the user will benefit by the seamless integration of the QTL mapping software into a general statistical analysis program. The goal is to make complex QTL mapping methods widely accessible and allow users to focus on modeling rather than computing.

A key component of computational methods for QTL mapping is the hidden Markov model (HMM) technology for dealing with missing genotype data. We have implemented the main HMM algorithms, with allowance for the presence of genotyping errors, for backcrosses, intercrosses, and phase-known four-way crosses.

The current version of R/qtl includes facilities for estimating genetic maps, identifying genotyping errors, and performing single-QTL genome scans and two-QTL, two-dimensional genome scans, by interval mapping (with the EM algorithm), Haley-Knott regression, and multiple imputation. All of this may be done in the presence of covariates (such as sex, age or treatment). One may also fit higher-order QTL models by multiple imputation.

R-other-bio3d
GNU R package for biological structure analysis
Versions of package r-other-bio3d
ReleaseVersionArchitectures
sid1.0-6-1all
squeeze1.0-6-1all
Debtags of package r-other-bio3d:
fieldbiology:structural, biology:bioinformatics, biology
interfacex11, commandline
roleshared-lib
scopeapplication
useviewing, comparing, analysing
works-with3dmodel
works-with-formatplaintext
Popcon: 10 users (16 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

The bio3d package contains utilities to process, organize and explore protein structure, sequence and dynamics data. Features include the ability to read and write structure, sequence and dynamic trajectory data, perform atom summaries, atom selection, re-orientation, superposition, rigid core identification, clustering, torsion analysis, distance matrix analysis, structure and sequence conservation analysis, and principal component analysis (PCA). In addition, various utility functions are provided to enable the statistical and graphical power of the R environment to work with biological sequence and structural data.

R-other-mott-happy
GNU R package for fine-mapping complex diseases
Versions of package r-other-mott-happy
ReleaseVersionArchitectures
sid2.1-4+b1s390,i386,sparc,alpha,armel,ia64,amd64,mips,mipsel,powerpc
squeeze2.1-4+b1sparc,amd64,armel,i386,ia64,mips,mipsel,powerpc,s390
sid2.1-4hurd-i386,hppa
squeeze2.1-4hppa
Popcon: 2 users (5 upd.)*
Versions and Archs
License: DFSG free
Svn

Most phenotypes of medical importance can be measured quantitatively, and in many cases the genetic contribution is substantial, accounting for 40% or more of the phenotypic variance. Considerable efforts have been made to isolate the genes responsible for quantitative genetic variation in human populations, but with little success, mostly because genetic loci contributing to quantitative traits (quantitative trait loci, QTL) have only a small effect on the phenotype. Association studies have been proposed as the most appropriate method for finding the genes that influence complex traits. However, family-based studies may not provide the resolution needed for positional cloning, unless they are very large, while environmental or genetic differences between cases and controls may confound population-based association studies.

These difficulties have led to the study of animal models of human traits. Studies using experimental crosses between inbred animal strains have been successful in mapping QTLs with effects on a number of different phenotypes, including behaviour, but attempts to fine-map QTLs in animals have often foundered on the discovery that a single QTL of large effect was in fact due to multiple loci of small effect positioned within the same chromosomal region. A further potential difficulty with detecting QTLs between inbred crosses is the significant reduction in genetic heterogeneity compared to the total genetic variation present in animal populations: a QTL segregating in the wild need not be present in the experimental cross.

The idea behind this package is that when multiple strains of animals that differ in their susceptibility to multiple diseases are bread over multiple generations, then one can analyse the contribution that a particular genetic locus has to each of those diseases. While in the past this approach has been performed for one disease at a time, this tool extends the statistics for allowing multiple crosses and thus save animal lifes. A larger stock of animals with more generations to keep them will further help producing larger numbers of observable cross-over events and thus help increasing the resolution of the mapping.

happy is an R interface into the HAPPY C package for fine-mapping Quantitative Trait Loci (QTL) in Heterogenous Stocks (HS). An HS is an advanced intercross between (usually eight) founder inbred strains of mice. HS are suitable for fine-mapping QTL. It uses a multipoint analysis which offers significant improvements in statistical power to detect QTLs over that achieved by single-marker association.

The happy package is an extension of the original C program happy; it uses the C code to compute the probability of descent from each of the founders, at each locus position, but the happy packager allows a much richer range of models to be fit to the data.

Further details can be found in

 Proc. Natl. Acad. Sci. USA, 10.1073/pnas.230304397.
Please cite: Richard Mott, Christopher J. Talbot, Maria G. Turri, Allan C. Collins, Jonathan Flint: A method for fine mapping quantitative trait loci in outbred animal stocks. Proc. Natl. Acad. Sci. USA (2000)
Rasmol
Visualize biological macromolecules
Versions of package rasmol
ReleaseVersionArchitectures
sid2.7.4.2-5+b2ia64
squeeze2.7.4.2-5+b2ia64
sid2.7.4.2-5s390,alpha,amd64,armel,hppa,i386,mips,mipsel,powerpc,sparc
squeeze2.7.4.2-5mips,sparc,powerpc,amd64,mipsel,i386,hppa,s390,armel
lenny2.7.4.2-3+b1s390,hppa,mips,sparc
lenny2.7.4.2-3i386,mipsel,ia64,arm,armel,alpha,amd64,powerpc
sid2.7.4.2-3hurd-i386
etch2.7.2.1.1-5s390,mips,powerpc,mipsel,arm,amd64,sparc,hppa,alpha,i386,ia64
upstream2.7.5
Debtags of package rasmol:
fieldchemistry
interfacex11
roleprogram
scopeutility
uitoolkitgtk
useviewing, learning
x11application
Popcon: 263 users (35 upd.)*
Newer upstream!
Debtags
License: DFSG free
Git

RasMol is a molecular graphics program intended for the visualisation of proteins, nucleic acids and small molecules. The program is aimed at display, teaching and generation of publication quality images.

The program reads in a molecule coordinate file and interactively displays the molecule on the screen in a variety of colour schemes and molecule representations. Currently available representations include depth-cued wireframes, 'Dreiding' sticks, spacefilling (CPK) spheres, ball and stick, solid and strand biomolecular ribbons, atom labels and dot surfaces.

Supported input file formats include Protein Data Bank (PDB), Tripos Associates' Alchemy and Sybyl Mol2 formats, Molecular Design Limited's (MDL) Mol file format, Minnesota Supercomputer Center's (MSC) XYZ (XMol) format, CHARMm format, CIF format and mmCIF format files.

This package installs two versions of RasMol, rasmol-gtk has a modern GTK-based user interface and rasmol-classic is the version with the old Xlib GUI.

The package is enhanced by the following packages: rasmol-doc
Remark of Debian Med team: New version needs additional dependencies which need to be packaged

If you want to help getting the new version of rasmol into Debian you can work on packaging the following projects first:

 neartree: function library efficiently solving the Nearest Neighbor Problem (#550849)
 cqrlib: utility library for quaternion arithmetic and rotation math (#550850)
 cvector: ANSI C implementation of dynamic arrays approximating the C++ vector class (#550852)

Any help to get these libraries packaged is welcome.

Readseq
[Biology] Conversion between sequence formats
Versions of package readseq
ReleaseVersionArchitectures
lenny1-7i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
sid1-7s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze1-7sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
etch1-6arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package readseq:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
useconverting
works-with-formatplaintext
Popcon: 25 users (9 upd.)*
Versions and Archs
Debtags
License: DFSG free

Reads and writes nucleic/protein sequences in various formats. Data files may have multiple sequences. Readseq is particularly useful as it automatically detects many sequence formats, and converts between them.

Rnahybrid
Fast and effective prediction of microRNA/target duplexes
Versions of package rnahybrid
ReleaseVersionArchitectures
sid2.1-2ia64,alpha,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc
squeeze2.1-2mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390
lenny2.1-1i386,arm,mipsel,ia64,alpha,mips,armel,amd64,s390,hppa,sparc,powerpc
Debtags of package rnahybrid:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
useanalysing
Popcon: 4 users (4 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

RNAhybrid is a tool for finding the minimum free energy hybridisation of a long and a short RNA. The hybridisation is performed in a kind of domain mode, ie. The short sequence is hybridised to the best fitting part of the long one. The tool is primarily meant as a means for microRNA target prediction.

Public research assisted by RNAhybrid should cite: Marc Rehmsmeier, Peter Steffen, Matthias Höchsmann, Robert Giegerich Fast and effective prediction of microRNA/target duplexes RNA, 10:1507-1517, 2004.

Please cite: Marc Rehmsmeier, Peter Steffen, Matthias Höchsmann, Robert Giegerich: Fast and effective prediction of microRNA/target duplexes RNA 10:1507-1517 (2004)
Samtools
processing sequence alignments in SAM and BAM formats
Versions of package samtools
ReleaseVersionArchitectures
sid0.1.6~dfsg-1alpha,amd64,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390
Popcon: 0 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

Samtools is a set of utilities that manipulate nucleotide sequence alignments in the binary BAM format. It imports from and exports to the ascii SAM (Sequence Alignment/Map) format, does sorting, merging and indexing, and allows to retrieve reads in any regions swiftly. It is designed to work on a stream, and is able to open a BAM (not SAM) file on a remote FTP or HTTP server.

The package is enhanced by the following packages: libbio-samtools-perl
Seaview
Multiplatform interface for sequence alignment and phylogeny
Versions of package seaview
ReleaseVersionArchitectures
sid3.2-1ia64,armel,hurd-i386,mips,mipsel
squeeze3.2-1mipsel,mips,ia64,powerpc,i386,hppa,s390,armel,amd64,sparc
etch20060918-1mipsel,amd64,i386,arm,s390,ia64,sparc,alpha,hppa,mips,powerpc
lenny2.4-1amd64,mips,mipsel,sparc,arm,armel,alpha,hppa,s390,i386,ia64,powerpc
Debtags of package seaview:
fieldbiology:bioinformatics, biology
interfacex11
roleprogram
scopeutility
uitoolkitfltk
usespecial:todo, printing, editing
works-withspecial:todo
works-with-formatplaintext
x11application
Popcon: 65 users (10 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

SeaView reads and writes various file formats (NEXUS, MSF, CLUSTAL, FASTA, PHYLIP, MASE, Newick) of DNA and protein sequences and of phylogenetic trees. Alignments can be manually edited. It drives the programs Muscle or Clustal W for multiple sequence alignment, and also allows to use any external alignment algorithm able to read and write FASTA-formatted files. It computes phylogenetic trees by parsimony using PHYLIP's dnapars/protpars algorithm, by distance with NJ or BioNJ algorithms on a variety of evolutionary distances, or by maximum likelihood using the program PhyML 3.0. SeaView draws phylogenetic trees on screen or PostScript files, and allows to download sequences from EMBL/GenBank/UniProt using the Internet.

The package is enhanced by the following packages: muscle
Please cite: N. Galtier, M. Gouy, C. Gautier: SeaView and Phylo_win, two graphic tools for sequence alignment and molecular phylogeny Comput. Applic. Biosci. 12:543-548 (1996)
Sibsim4
align expressed RNA sequences on a DNA template
Versions of package sibsim4
ReleaseVersionArchitectures
lenny0.17-1arm,i386,armel,mipsel,ia64,alpha,mips,amd64,s390,hppa,sparc,powerpc
sid0.17-1ia64,alpha,amd64,armel,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390,sparc
squeeze0.17-1mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390
Debtags of package sibsim4:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usespecial:todo, searching, comparing
works-withspecial:todo
works-with-formatplaintext
Popcon: 12 users (7 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

The SIBsim4 project is based on sim4, which is a program designed to align an expressed DNA sequence with a genomic sequence, allowing for introns. SIBsim4 is a fairly extensive rewrite of the original code with the following goals:

  • speed improvement;
  • allow large, chromosome scale, DNA sequences to be used;
  • provide more detailed output about splice types;
  • provide more detailed output about polyA sites;
  • misc code cleanups and fixes.
Sigma-align
Simple greedy multiple alignment of non-coding DNA sequences
Versions of package sigma-align
ReleaseVersionArchitectures
sid1.1.3-1s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze1.1.3-1sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
lenny1.1.1-3i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
etch1.0-1arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package sigma-align:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing, analysing
works-withspecial:todo
works-with-formatplaintext
Popcon: 11 users (8 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Sigma (“Simple greedy multiple alignment”) is an alignment program with a new algorithm and scoring scheme designed specifically for non-coding DNA sequence. It uses a strategy of seeking the best possible gapless local alignments, at each step making the best possible alignment consistent with existing alignments, and scores the significance of the alignment based on the lengths of the aligned fragments and a background model which may be supplied or estimated from an auxiliary file of intergenic DNA.

Please cite: BMC Bioinformatics 16;7:143 (2006)
Sim4
tool for aligning cDNA and genomic DNA
Versions of package sim4
ReleaseVersionArchitectures
lenny0.0.20030921-3i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
sid0.0.20030921-3s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze0.0.20030921-3sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
etch0.0.20030921-1arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package sim4:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usesearching, comparing
works-withspecial:todo
works-with-formatplaintext
Popcon: 14 users (7 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

sim4 is a similarity-based tool for aligning an expressed DNA sequence (EST, cDNA, mRNA) with a genomic sequence for the gene. It also detects end matches when the two input sequences overlap at one end (i.e., the start of one sequence overlaps the end of the other).

sim4 employs a blast-based technique to first determine the basic matching blocks representing the "exon cores". In this first stage, it detects all possible exact matches of W-mers (i.e., DNA words of size W) between the two sequences and extends them to maximal scoring gap-free segments. In the second stage, the exon cores are extended into the adjacent as-yet-unmatched fragments using greedy alignment algorithms, and heuristics are used to favor configurations that conform to the splice-site recognition signals (GT-AG, CT-AC). If necessary, the process is repeated with less stringent parameters on the unmatched fragments.

Ssake
genomics application for assembling millions of very short DNA sequences
Versions of package ssake
ReleaseVersionArchitectures
sid3.4-1all
squeeze3.4-1all
Popcon: 3 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

The Short Sequence Assembly by K-mer search and 3′ read Extension (SSAKE) is a genomics application for aggressively assembling millions of short nucleotide sequences by progressively searching for perfect 3′-most k-mers using a DNA prefix tree. SSAKE is designed to help leverage the information from short sequences reads by stringently clustering them into contigs that can be used to characterize novel sequencing targets.

Staden-io-lib-utils
programs for maniuplating DNA sequencing files
Versions of package staden-io-lib-utils
ReleaseVersionArchitectures
sid1.12.1-1amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390,sparc
squeeze1.12.1-1powerpc,amd64,mips,armel,sparc,hppa,mipsel,i386,s390,ia64
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

The io_lib from the Staden package is a library of file reading and writing code to provide a general purpose trace file (and Experiment File) reading interface. It has been compiled and tested on a variety of unix systems, MacOS X and MS Windows.

This package contains the programs that are distributed with the Staden io_lib for manipulating and converting sequencing data files, and in particular files to maniuplate short reads generated by second and third generation sequencers and stored in SRF format.

T-coffee
Multiple Sequence Alignment
Versions of package t-coffee
ReleaseVersionArchitectures
sid8.14-1ia64,sparc,powerpc,i386,hurd-i386,armel,s390,amd64,alpha,mipsel,mips
sid8.13-1hppa
squeeze8.13-1s390,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc
lenny5.72-1alpha,mips,mipsel,arm,armel,sparc,hppa,s390,i386,amd64,powerpc,ia64
etch2.50-1alpha,hppa,mips,ia64,amd64,arm,i386,sparc,powerpc,s390,mipsel
Debtags of package t-coffee:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing, analysing
works-with-formatplaintext
Popcon: 15 users (24 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

T-Coffee is a multiple sequence alignment package. Given a set of sequences (Proteins or DNA), T-Coffee generates a multiple sequence alignment. Version 2.00 and higher can mix sequences and structures.

T-Coffee allows the combination of a collection of multiple/pairwise, global or local alignments into a single model. It also allows to estimate the level of consistency of each position within the new alignment with the rest of the alignments. See the pre-print for more information

T-Coffee has a special called M-Coffee that makes it possible to combine the output of many multiple sequence alignment packages. In its published version, it uses MUSCLE, PROBCONS, POA, DiAlign-TS, MAFFT, Clustal W, PCMA and T-Coffee. A special version has been made for Debian, DM-Coffee, that uses only free software by replacing Clustal W by Kalign. Using the 8 Methods of M-Coffee can sometimes be a bit heavy. You can use a subset of your favorite methods if you prefer.

The package is enhanced by the following packages: clustalw kalign poa mafft dialign-tx
Please cite: C. Notredame, D. Higgins, J. Heringa: T-Coffee: A novel method for multiple sequence alignments. Journal of Molecular Biology 302(1):205-217 (2000)
Theseus
superimpose macromolecules using maximum likelihood
Versions of package theseus
ReleaseVersionArchitectures
sid1.4.3-2mips,armel,mipsel,hurd-i386,powerpc,i386,s390,amd64,sparc,alpha,ia64,hppa
squeeze1.4.3-2sparc,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390
Debtags of package theseus:
biologypeptidic
fieldbiology:structural, biology:bioinformatics, biology
interfacecommandline
roleprogram
usecomparing, analysing
works-with-formatplaintext
Popcon: 4 users (1 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Theseus is a program that simultaneously superimposes multiple macromolecular structures. Theseus finds the optimal solution to the superposition problem using the method of maximum likelihood. By down-weighting variable regions of the superposition and by correcting for correlations among atoms, the ML superposition method produces very accurate structural alignments.

When macromolecules with different residue sequences are superimposed, other programs and algorithms discard residues that are aligned with gaps. Theseus, however, uses a novel superimposition algorithm that includes all of the data.

Tigr-glimmer
Gene detection in archea and bacteria
Versions of package tigr-glimmer
ReleaseVersionArchitectures
lenny3.02-1i386,sparc,hppa,amd64,s390,arm,powerpc,alpha,mipsel,armel,ia64,mips
sid3.02-1s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze3.02-1sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
etch2.13-1+b1s390,mips,arm,mipsel,powerpc,sparc,hppa,alpha,i386,ia64
etch2.13-1amd64
Debtags of package tigr-glimmer:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usesearching
works-with-formatplaintext
Popcon: 13 users (8 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Developed by the TIGR institute this software detects coding sequences in bacteria and archea.

Glimmer is a system for finding genes in microbial DNA, especially the genomes of bacteria and archaea. Glimmer (Gene Locator and Interpolated Markov Modeler) uses interpolated Markov models (IMMs) to identify the coding regions and distinguish them from noncoding DNA.

Tree-ppuzzle
Parallelized reconstruction of phylogenetic trees by maximum likelihood
Versions of package tree-ppuzzle
ReleaseVersionArchitectures
sid5.2-5s390,alpha,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc
squeeze5.2-5sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
lenny5.2-4alpha,s390,hppa,mips,amd64,powerpc,sparc,mipsel,arm,ia64,armel,i386
etch5.2-2s390,mips,arm,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package tree-ppuzzle:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing, analysing
works-with-formatplaintext
Popcon: 6 users (2 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

TREE-PUZZLE (the new name for PUZZLE) is an interactive console program that implements a fast tree search algorithm, quartet puzzling, that allows analysis of large data sets and automatically assigns estimations of support to each internal branch. TREE-PUZZLE also computes pairwise maximum likelihood distances as well as branch lengths for user specified trees. Branch lengths can also be calculated under the clock-assumption. In addition, TREE-PUZZLE offers a novel method, likelihood mapping, to investigate the support of a hypothesized internal branch without computing an overall tree and to visualize the phylogenetic content of a sequence alignment.

This is the parallelized version of tree-puzzle.

Tree-puzzle
Reconstruction of phylogenetic trees by maximum likelihood
Versions of package tree-puzzle
ReleaseVersionArchitectures
sid5.2-5s390,alpha,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc
squeeze5.2-5sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
lenny5.2-4s390,hppa,amd64,sparc,powerpc,mips,mipsel,arm,ia64,armel,i386,alpha
etch5.2-2mips,arm,s390,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
sid5.2-1hurd-i386
Debtags of package tree-puzzle:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing, analysing
works-with-formatplaintext
Popcon: 17 users (12 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

TREE-PUZZLE (the new name for PUZZLE) is an interactive console program that implements a fast tree search algorithm, quartet puzzling, that allows analysis of large data sets and automatically assigns estimations of support to each internal branch. TREE-PUZZLE also computes pairwise maximum likelihood distances as well as branch lengths for user specified trees. Branch lengths can also be calculated under the clock-assumption. In addition, TREE-PUZZLE offers a novel method, likelihood mapping, to investigate the support of a hypothesized internal branch without computing an overall tree and to visualize the phylogenetic content of a sequence alignment.

Treeviewx
Displays and prints phylogenetic trees
Versions of package treeviewx
ReleaseVersionArchitectures
sid0.5.1-6s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc
squeeze0.5.1-6sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips
lenny0.5.1-5i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64
etch0.5.1-3arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64
Debtags of package treeviewx:
fieldbiology:bioinformatics, biology
interfacex11
roleprogram
scopeutility
uitoolkitwxwindows
useviewing
works-withspecial:todo
works-with-formatsvg, postscript, plaintext, pdf
x11application
Popcon: 62 users (5 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

TreeView X is an open source and multi-platform program to display phylogenetic trees. It can read and display NEXUS and Newick format tree files (such as those output by PAUP*, ClustalX, TREE-PUZZLE, and other programs). It allows to order the branches of the trees, and to export the trees in SVG format.

The program was written by Rod Page r.page@bio.gla.ac.uk using the wxWidgets C++ library. It was published in Computer Applications in the Biosciences. 1996 12: 357-358.

Please cite: Computer Applications in the Bioscience 12:357-358 (1996)
Screenshots of package treeviewx
Velvet
Nucleic acid sequence assembler for very short reads
Versions of package velvet
ReleaseVersionArchitectures
sid0.7.55-1mips,armel,mipsel,hurd-i386,powerpc,i386,s390,amd64,sparc,alpha,ia64,hppa
squeeze0.7.55-1sparc,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390
Popcon: 3 users (5 upd.)*
Versions and Archs
License: DFSG free
Svn

Velvet is a de novo genomic assembler specially designed for short read sequencing technologies, such as Solexa or 454, developed by Daniel Zerbino and Ewan Birney at the European Bioinformatics Institute (EMBL-EBI), near Cambridge, in the United Kingdom.

Velvet currently takes in short read sequences, removes errors then produces high quality unique contigs. It then uses paired read information, if available, to retrieve the repeated areas between contigs.

Wise
comparison of biopolymers, commonly DNA and protein sequences
Versions of package wise
ReleaseVersionArchitectures
sid2.4.1-8sparc,i386,hurd-i386,armel,s390,amd64,alpha,mipsel,ia64
sid2.4.1-7mips,hppa,powerpc
squeeze2.4.1-7s390,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc
lenny2.4.1-5hppa,mipsel,powerpc,arm,armel,sparc,mips,s390,i386,amd64,ia64,alpha
etch2.2.0-3alpha,hppa,sparc,ia64,amd64,powerpc,i386,mipsel,mips,s390,arm
Debtags of package wise:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usespecial:todo, comparing
Popcon: 17 users (13 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Wise2 is a package focused on comparisons of biopolymers, commonly DNA and protein sequences. There are many other packages which do this, probably the best known being BLAST package (from NCBI) and the Fasta package (from Bill Pearson). There are other packages, such as the HMMER package (Sean Eddy) or SAM package (UC Santa Cruz) focused on hidden Markov models (HMMs) of biopolymers.

Wise2's particular forte is the comparison of DNA sequence at the level of its protein translation. This comparison allows the simultaneous prediction of say gene structure with homology based alignment.

Wise2 also contains other algorithms, such as the venerable Smith-Waterman algorithm, or more modern ones such as Stephen Altschul's generalised gap penalties, or even experimental ones developed in house, such as dba. The development of these algorithms is due to the ease of developing such algorithms in the environment used by Wise2.

Wise2 has also been written with an eye for reuse and maintainability. Although it is a pure C package you can access its functionality directly in Perl. Parts of the package (or the entire package) can be used by other C or C++ programs without namespace clashes as all externally linked variables have the unique identifier Wise2 prepended.

Official Debian packages with lower relevance

Emboss-explorer
web-based GUI to EMBOSS
Versions of package emboss-explorer
ReleaseVersionArchitectures
lenny2.2.0-7all
sid2.2.0-7all
squeeze2.2.0-7all
Debtags of package emboss-explorer:
biologyemboss
fieldbiology:bioinformatics, biology
interfaceweb
roleplugin
webcgi, application
Popcon: 8 users (3 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

EMBOSS explorer is a web-based graphical user interface to the EMBOSS suite of bioinformatics tools. It is written in Perl.

If you use the Apache HTTP server, you will at most have to restart it before using EMBOSS explorer. For other web servers, you will have to do the configuration by yourself.

Melting-gui
graphical interface to compute the melting temperature of nucleic acid duplex
Versions of package melting-gui
ReleaseVersionArchitectures
sid4.3b-1all
lenny4.2h-3all
squeeze4.2h-3all
upstream4.3c
Debtags of package melting-gui:
fieldbiology:molecular, biology
interfacex11
roleprogram
scopeutility
uitoolkittk
useanalysing
x11application
Popcon: 8 users (2 upd.)*
Newer upstream!
Debtags
License: DFSG free
Svn

This package provides a graphical user interface for the “melting” program, which computes for a nucleic acid duplex the enthalpy, the entropy and the melting temperature of the helix-coil transitions.

Mozilla-biofox
extension of bioinformatics tools to Iceape and Iceweasel browsers
Versions of package mozilla-biofox
ReleaseVersionArchitectures
lenny1.1.5-1~lennyall
sid1.1.5-1all
squeeze1.1.5-1all
etch1.1.4-2all
Debtags of package mozilla-biofox:
fieldgenealogy, biology:bioinformatics, biology
interfacex11, web
roleplugin
scopeutility
suitemozilla
usesearching, editing, converting, comparing, browsing
works-withdb
Popcon: 99 users (5 upd.)*
Versions and Archs
Debtags
License: DFSG free
Svn

Code bioFOX aims at implementing various bioinformatics tools as an extension on the Iceape and Iceweasel browsers. Analysis of your favorite gene(s) usually require(s) retrieving it from a database like NCBI or Swiss-Prot and then performing one or more tasks including but not limited to:

  • Translation of a nucleotide sequence;
  • Blast search (eg. blastn, blastp etc.) of the desired nucleotide/protein sequence;
  • Calculation of properties (like PI, charge, molecular weight, AT/GC content etc.) of a protein/nucleotide sequence;
  • Conversion between formats (Genbank, Fasta, Swiss-Prot etc.);
  • Prediction of sequence for sub-cellular localization (PREDOTAR, TargetP, pSORT etc).
Seqan-apps
A C++ template library for the analysis of sequences
Versions of package seqan-apps
ReleaseVersionArchitectures
sid1.2-1mips,armel,mipsel,hurd-i386,powerpc,i386,s390,amd64,sparc,alpha,ia64,hppa
squeeze1.2-1sparc,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

SeqAn is a C++ template library of efficient algorithms and data structures for the analysis of sequences with the focus on biological data. This library applies a unique generic design that guarantees high performance, generality, extensibility, and integration with other libraries. SeqAn is easy to use and simplifies the development of new software tools with a minimal loss of performance. This package contains the applications dfi, pair_align, micro_razers, seqan_tcoffee, seqcons, razers and tree_recon.

Debian packages in contrib or non-free

Arb
Integrated package for sequence database handling and analysis
Versions of package arb
ReleaseVersionArchitectures
sid0.0.20071207.1-7powerpc,i386,sparc
sid0.0.20071207.1-6mipsel
squeeze0.0.20071207.1-6mipsel,i386,sparc
Debtags of package arb:
fieldbiology:bioinformatics, biology
interfacex11
roleprogram
scopeutility, application
uitoolkitmotif
usesearching, editing, comparing, analysing
x11application
Popcon: 0 users (0 upd.)*
Versions and Archs
Debtags
License: non-free
Svn

The ARB software is a graphically oriented package comprising various tools for sequence database handling and data analysis. A central database of processed (aligned) sequences and any type of additional data linked to the respective sequence entries is structured according to phylogeny or other user defined criteria.

The ARB project (latin, "arbor"=tree) is a joint initiative of the Lehrstuhl fuer Mikrobiologie http://www.mikro.biologie.tu-muenchen.de/ and the Lehrstuhl fuer Rechnertechnik und Rechnerorganisation http://wwwbode.informatik.tu-muenchen.de/ of the Technical University of Munich.

Autodocktools
GUI to help set up, launch and analyze AutoDock dockings
Versions of package autodocktools
ReleaseVersionArchitectures
sid1.5.4.cvs.20090603-1all
squeeze1.5.4.cvs.20090603-1all
Popcon: 1 users (1 upd.)*
Versions and Archs
License: non-free
Svn

This package is part of the mgltools set of Python libraries which provide an infrastructure for the analysis of protein structures and their docking of chemical compounds.

AutoDock is a well established package for the automated screening of libraries of formal representation of chemical compounds that putatively bind to a particular protein at hand. This package provides a graphical user interface that is helping with the preparation of the protein for such analyses.

Please cite: G.M. Morris, R. Huey, W. Lindstrom, M.F. Sanner, R.K. Belew, D.S. Goodsell, A.J. Olson: AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J. Comput. Chem. (2009)
Screenshots of package autodocktools
Clustalw
global multiple nucleotide or peptide sequence alignment
Versions of package clustalw
ReleaseVersionArchitectures
lenny2.0.9-1mips,mipsel,powerpc,arm,sparc,hppa,s390,i386,amd64,ia64,alpha
sid2.0.12-1i386,hppa,armel,s390,amd64,alpha,ia64,sparc
sid2.0.10-1powerpc,mipsel,mips
squeeze2.0.10-1s390,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc
etch1.83-1.2hppa,arm,amd64,powerpc,i386,mipsel,sparc,s390,ia64,mips,alpha
Debtags of package clustalw:
biologypeptidic, nuceleic-acids, format:fasta, format:aln
fieldbiology:bioinformatics, biology
interfacetext-mode, commandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 22 users (25 upd.)*
Versions and Archs
Debtags
License: non-free
Svn

This program performs an alignment of multiple nucleotide or amino acid sequences. It recognizes the format of input sequences and whether the sequences are nucleic acid (DNA/RNA) or amino acid (proteins). The output format may be selected from in various formats for multiple alignments such as Phylip or FASTA. Clustal W is very well accepted.

The output of Clustal W can be edited manually but preferably with an alignment editor like SeaView or within its companion Clustal X. When building a model from your alignment, this can be applied for improved database searches. The Debian package hmmer creates such in form of an HMM.

The package is enhanced by the following packages: clustalw-mpi
Please cite: M. Larkin, et al.: Clustal W and Clustal X version 2.0 Bioinformatics 23(21):2947-2948 (2007)
Clustalw-mpi
MPI-distributed global sequence alignment with ClustalW
Versions of package clustalw-mpi
ReleaseVersionArchitectures
lenny0.15-1arm,mipsel,ia64,alpha,mips,i386,amd64,s390,hppa,sparc,powerpc,armel
sid0.15-1ia64,alpha,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc
squeeze0.15-1mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390
etch0.13-3i386
Debtags of package clustalw-mpi:
fieldbiology
interfacetext-mode, commandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 4 users (2 upd.)*
Versions and Archs
Debtags
License: non-free
Svn

ClustalW is a popular tool for multiple sequence alignment. The alignment is achieved via three steps: pairwise alignment, guide-tree generation and progressive alignment. ClustalW-MPI is an MPI implementation of ClustalW. Based on version 1.82 of the original ClustalW, both the pairwise and progressive alignments are parallelized with MPI, a popular message passing programming standard. The pairwise alignments can be easily parallelized since the many alignments are time independent on each other. However the progressive alignments are essentially not parallelizable because of the time dependencies between each alignment.

Here the recursive parallelism paradigm is applied to the linear space profile-profile alignment algorithm. This approach is more time efficient on computers with distributed memory architecture. Traditional approach that relies on precomputing the profile-profile score matrix has also been implemented. Results shown the latter is indeed more appropriate for shared memory multiprocessor computer.

ClustalX is suggested for its support for local realignments, seaview is a versatile editor of alignments.

The original ClustalW/ClustalX can be found at URL: http://www.clustal.org/download/pre-2/

Clustalx
Multiple alignment of nucleic acid and protein sequences (graphical interface)
Versions of package clustalx
ReleaseVersionArchitectures
sid2.0.12-1hppa,i386,sparc,s390,alpha,amd64
lenny1.83-4mips,alpha,sparc,amd64,s390,arm,armel,hppa,powerpc,i386,mipsel,ia64
sid1.83-4armel,ia64,mips,mipsel,powerpc
squeeze1.83-4mips,i386,sparc,mipsel,hppa,powerpc,armel,s390,amd64,ia64
etch1.83-1.2hppa,amd64,arm,powerpc,i386,mips,sparc,mipsel,ia64,alpha,s390
Debtags of package clustalx:
biologypeptidic, nuceleic-acids, format:fasta, format:aln
fieldbiology:bioinformatics, biology
interfacex11
roleprogram
scopeutility
uitoolkitmotif
useviewing, comparing, analysing
works-with-formatplaintext
x11application
Popcon: 50 users (3 upd.)*
Versions and Archs
Debtags
License: non-free
Svn

This package offers a GUI interface for the Clustal multiple sequence alignment program. It provides an integrated environment for performing multiple sequence- and profile-alignments to analyse the results. The sequence alignment is displayed in a window on the screen. A versatile coloring scheme has been incorporated to highlight conserved features in the alignment. For professional presentations, one should use the texshade LaTeX package or boxshade.

The pull-down menus at the top of the window allow you to select all the options required for traditional multiple sequence and profile alignment. You can cut-and-paste sequences to change the order of the alignment; you can select a subset of sequences to be aligned; you can select a sub-range of the alignment to be realigned and inserted back into the original alignment.

An alignment quality analysis can be performed and low-scoring segments or exceptional residues can be highlighted.

Embassy-phylip
EMBOSS conversions of the programs in the phylip package
Versions of package embassy-phylip
ReleaseVersionArchitectures
sid3.68-1amd64
squeeze3.68-1amd64
Popcon: 0 users (0 upd.)*
Versions and Archs
License: non-free
Svn

This package is the adaptation of the PHYLIP package in which its programs can operate with the biological sequence formats and databases of the European Molecular Biology Open Software Suite (EMBOSS). The software packages adapted for EMBOSS are called EMBASSY.

PHYLIP (the PHYLogeny Inference Package) is a package of programs for inferring phylogenies (evolutionary trees). Methods that are available in the package include parsimony, distance matrix, and likelihood methods, including bootstrapping and consensus trees. Data types that can be handled include molecular sequences, gene frequencies, restriction sites and fragments, distance matrices, and discrete characters.

The EMBASSY PHYLIP programs all have the prefix "f" to distinguish them from the original programs and avoid namespace conflict.

Molphy
Program Package for MOLecular PHYlogenetics
Versions of package molphy
ReleaseVersionArchitectures
lenny2.3b3-5amd64,i386,sparc,hppa,powerpc,alpha,mips,arm,s390,ia64,mipsel,armel
sid2.3b3-5s390,alpha,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc
squeeze2.3b3-5ia64,sparc,powerpc,i386,hppa,s390,armel,amd64,mipsel,mips
etch2.3b3-2arm,mips,powerpc,mipsel,sparc,hppa,alpha,i386,ia64
Debtags of package molphy:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing, analysing
Popcon: 4 users (2 upd.)*
Versions and Archs
Debtags
License: non-free
Svn

ProtML is a main program in MOLPHY for inferring evolutionary trees from PROTein (amino acid) sequences by using the Maximum Likelihood method. Other programs (C language)

 NucML:  Maximum Likelihood Inference of Nucleic Acid Phylogeny
 ProtST: Basic Statistics of Protein Sequences
 NucST:  Basic Statistics of Nucleic Acid Sequences
 NJdist: Neighbor Joining Phylogeny from Distance Matrix
Utilities (Perl)
 mollist:  get identifiers list        molrev:   reverse DNA sequences
 molcat:   concatenate sequences       molcut:   get partial sequences
 molmerge: merge sequences             nuc2ptn:  DNA -> Amino acid
 rminsdel: remove INS/DEL sites        molcodon: get specified codon sites
 molinfo:  get varied sites            mol2mol:  MOLPHY format beautifer
 inl2mol:  Interleaved -> MOLPHY       mol2inl:  MOLPHY -> Interleaved
 mol2phy:  MOLPHY -> Sequential        phy2mol:  Sequential -> MOLPHY
 must2mol: MUST -> MOLPHY              etc.
Phylip
package of programs for inferring phylogenies
Versions of package phylip
ReleaseVersionArchitectures
sid3.69-1hppa,s390,amd64,alpha,sparc,i386
sid3.68-2ia64,mipsel,powerpc,armel,mips
squeeze3.68-2s390,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc
lenny3.67-3ia64,hppa,amd64,mipsel,arm,s390,powerpc,sparc,mips,i386,alpha
etch3.6.1-2arm,powerpc,s390,hppa,alpha,i386,ia64,sparc,mips,mipsel
Debtags of package phylip:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing, analysing
works-with-formatplaintext
Popcon: 9 users (4 upd.)*
Versions and Archs
Debtags
License: non-free
Svn

The PHYLogeny Inference Package is a package of programs for inferring phylogenies (evolutionary trees) from sequences. Methods that are available in the package include parsimony, distance matrix, and likelihood methods, including bootstrapping and consensus trees. Data types that can be handled include molecular sequences, gene frequencies, restriction sites, distance matrices, and 0/1 discrete characters.

Raster3d
tools for generating images of proteins or other molecules
Versions of package raster3d
ReleaseVersionArchitectures
squeeze2.7s-1+b1mips,mipsel
lenny2.7s-1+b1mips,mipsel
sid2.7s-1+b1mipsel,mips
squeeze2.7s-1sparc,amd64,armel,hppa,i386,ia64,powerpc,s390
lenny2.7s-1sparc,s390,powerpc,arm,armel,hppa,amd64,i386,ia64,alpha
sid2.7s-1amd64,alpha,sparc,ia64,s390,armel,hppa,powerpc,i386
etch2.7d-2mipsel,mips,ia64,hppa,alpha,i386,arm,sparc
Debtags of package raster3d:
fieldbiology:structural, biology
interfacecommandline
roleprogram
scopeapplication
useviewing, converting
works-withimage:raster, image, 3dmodel
works-with-formatpng, jpg
Popcon: 24 users (3 upd.)*
Versions and Archs
Debtags
License: non-free
Svn

Raster3D is a set of tools for generating high quality raster images of proteins or other molecules. The core program renders spheres, triangles, cylinders, and quadric surfaces with specular highlighting, Phong shading, and shadowing. It uses an efficient software Z-buffer algorithm which is independent of any graphics hardware. Ancillary programs process atomic coordinates from PDB files into rendering descriptions for pictures composed of ribbons, space-filling atoms, bonds, ball+stick, etc. Raster3D can also be used to render pictures composed in other programs such as Molscript in glorious 3D with highlights, shadowing, etc. Output is to pixel image files with 24 bits of color information per pixel.

Packaging has started and developers might try the packaging code in VCS

Acedb-other - wnpp
tools designed to manipulate genomic data
Responsible: Steffen Moeller
License: GPL / LGPL
Debian package not available
SVN
Language: C, C++

AceDB is a genome database system developed since 1989 primarily by Jean Thierry-Mieg (CNRS, Montpellier) and Richard Durbin (Sanger Institute). It provides a custom database kernel, with a non-standard data model designed specifically for handling scientific data flexibly, and a graphical user interface with many specific displays and tools for genomic data. AceDB is used both for managing data within genome projects, and for making genomic data available to the wider scientific community.

AceDB was originally developed for the C.elegans genome project , from which its name was derived (A C. elegans DataBase). However, the tools in it have been generalized to be much more flexible and the same software is now used for many different genomic databases from bacteria to fungi to plants to man. It is also increasingly used for databases with non-biological content.

Agdbnet - wnpp
antigen sequence database software for web-based bacterial typing
Responsible: Andreas Tille
License: GPL
Debian package not available
SVN

AgdbNet is antigen sequence database software for web-based bacterial typing. The software facilitates simultaneous BLAST querying of multiple loci using either nucleotide or peptide sequences. It's written in Perl and runs on Linux/UNIX systems.

Databases are described by XML files and can have any number of loci, which may be defined by nucleotide and/or peptide sequences. The databases can optionally have integral isolate tables so that information about representative isolates can be retrieved or they may be configured to query external isolate databases, such as those hosted on PubMLST.org.

The software is used on a number of public bacterial typing databases:

  • Neisseria PorA variable regions | PorB | FetA
  • Campylobacter flaA
  • Streptococcus equi seM
Axparafit - wnpp
optimized statistical analysis of host-parasite coevolution
Responsible: David Paleino
License: GPL
Debian package not available
SVN

AxParafit is a highly optimized version of Pierre Legendre's Parafit program for statistical analysis of host-parasite coevolution. AxParafit has been parallelized with MPI (Message Passing Interface) for compute clusters and was used to carry out the largest co-evolutionary analysis to date for the paper describing the software.

Beast-mcmc - wnpp
Bayesian MCMC analysis of molecular sequences
Responsible: Felix Feyertag
License: LGPL
Debian package not available
SVN

BEAST is a cross-platform program for Bayesian MCMC analysis of molecular sequences. It is entirely orientated towards rooted, time-measured phylogenies inferred using strict or relaxed molecular clock models. It can be used as a method of reconstructing phylogenies but is also a framework for testing evolutionary hypotheses without conditioning on a single tree topology. BEAST uses MCMC to average over tree space, so that each tree is weighted proportional to its posterior probability. We include a simple to use user-interface program for setting up standard analyses and a suit of programs for analysing the results.

The source is avialable at http://code.google.com/p/beast-mcmc/ .

Please cite: A. J. Drummond, A. Rambaut: BEAST: Bayesian evolutionary analysis by sampling trees. (DOI) BMC Evolutionary Biology (2007)
Remark of Debian Med team: Name space pollution

There is a Debian package beast which is completely unrelated to this project.

Cluster3 - wnpp
find clustering solutions for genome data
Responsible: Steffen Moeller
License: non-free
Debian package not available
SVN

Cluster 3.0 is an enhanced version of Cluster, which was originally developed by Michael Eisen while at Stanford University. The main improvement consists of the k-means algorithm, which now includes multiple trials to find the best clustering solution. This is crucial for the k-means algorithm to be reliable. The routine for self-organizing maps was extended to include 2D rectangular geometries. The Euclidean distance and the city-block distance were added to the available measures of similarity.

Dazzle
Java-based DAS server
Responsible: Steffen Moeller
License: LGPL
Debian package not available
SVN

Dazzle is a general purpose server for the Distributed Annotation System (DAS) protocol. It is implemented as a Java servlet, using the BioJava APIs. Dazzle is a modular system which uses small "datasource" plugins to provide access to a range of databases. Several general-purpose plugins are included in the package, and it it straightforward to develop new plugins to connect to your own databases.

Information on DAS is available from http://www.biodas.org/

Ecell - wnpp
Concept and environment for constructing virtual cells on computers
Responsible: Steffen Moeller
License: GPL
Debian package not available
SVN

The E-Cell Project is an international research project aiming at developing necessary theoretical supports, technologies and software platforms to allow precise whole cell simulation.

The E-Cell System is an object-oriented software suite for modeling, simulation, and analysis of large scale complex systems such as biological cells, architected by Kouichi Takahashi and written by a team of developers.

The core part of the system, E-Cell Simulation Environment version 3, allows many components driven by multiple algorithms with different timescales to coexist.

E-Cell System consists of the following three major parts:

  • E-Cell Simulation Environment (or E-Cell SE)
  • E-Cell Modeling Environment (or E-Cell ME)
  • E-Cell Analysis Toolkit.
Gbrowse - wnpp
The Generic Genome Browser from GMOD
Responsible: Charles Plessy
License: Perl Artistic License, plus additional clauses
Debian package not available
SVN

The Generic Genome Browser is a combination of database and interactive Web page for manipulating and displaying annotations on genomes. Some of its features:

  • Simultaneous bird's eye and detailed views of the genome.
  • Scroll, zoom, center.
  • Attach arbitrary URLs to any annotation.
  • Order and appearance of tracks are customizable by administrator and end-user.
  • Search by annotation ID, name, or comment.
  • Supports third party annotation using GFF formats.
  • Settings persist across sessions.
  • DNA and GFF dumps.
  • Connectivity to different databases, including BioSQL and Chado.
  • Multi-language support.
  • Third-party feature loading.
  • Customizable plug-in architecture (e.g. run BLAST, dump & import many formats, find oligonucleotides, design primers, create restriction maps, edit features)
Haploview - wnpp
Analysis and visualization of LD and haplotype maps
Responsible: Steffen Moeller
License: DFSG free
Debian package not available
SVN

This tools assists in the analysis of the nucleotide variation in a population. Such investigations are performed to determine genes and genetic pathways that are associated with diseases. This is an early stage in the quest for new drugs.

Jmol - wnpp
viewer for chemical structures in 3D
Responsible: Vincent Fourmond
License: LGPL
Debian package not available
SVN

Jmol is a Java molecular viewer for three-dimensional chemical structures. Features include reading a variety of file types and output from quantum chemistry programs, and animation of multi-frame files and computed normal modes from quantum programs. It includes with features for chemicals, crystals, materials and biomolecules.

  • The JmolApplet is a web browser applet that can be integrated into web pages.
  • The Jmol application is a standalone Java application that runs on the desktop.
  • The JmolViewer is a development tool kit that can be integrated into other Java applications.
Jtreeview - wnpp
Java re-implementation of Michael Eisen's TreeView
Responsible: Steffen Moeller
License: GPL
Debian package not available
SVN

TreeView creates a matrix-like display of expression data, known as Eisen clustering. The original implementation was a Windows program named TreeView by Michael Eisen. This TreeView package, sometimes also referred to as jTreeView, was rewritten in Java under a free license, the original implementation also comes with the source code, but controls commercial distribution. And it did not run on Unix.

Java TreeView is an extensible viewer for microarray data in PCL or CDT format.

Kempbasu
Significance tests for comparing digital gene expression profiles
Responsible: Charles Plessy
License: GPL
Debian package not available
SVN

This package implements the significance tests for comparing digital gene profiles described in the article:

Varuzza et al. "Significance tests for comparing digital gene expression profiles"

They provide two programs: kemp for the frequentist test and basu for the Bayesian test, and some auxiliary scripts.

Mage2tab - wnpp
MAGE-MLv1 converter and visualiser
Responsible: Charles Plessy
License: CBIL Software and Data License (Apache-like)
Debian package not available
SVN

This tool-kit is part of MR_T, a framework for import or export various of MAGE (MicroArray Gene Expression) documents (MAGE-MLv1, MAGE-TAB, SOFT, MINiML) from or into databases like GUS (the Genomics Unified Schema, www.gusdb.org).

Martj
distributed data integration system for biological data
Responsible: Steffen Moeller
License: GPL
Debian package not available
SVN

BioMart is a simple, distributed data integration system with powerful query capabilities. The BioMart data model has been applied to the following data sources: UniProt Proteomes, Macromolecular Structure Database (MSD), Ensembl, Vega, and dbSNP.

Mauvealigner
multiple genome alignment algorithms
Responsible: Andreas Tille
License: GPL
SVN

The mauveAligner and progressiveMauve alignment algorithms have been implemented as command-line programs included with the downloadable Mauve software. When run from the command-line, these programs provide options not yet available in the graphical interface.

Mauve is a system for efficiently constructing multiple genome alignments in the presence of large-scale evolutionary events such as rearrangement and inversion. Multiple genome alignment provides a basis for research into comparative genomics and the study of evolutionary dynamics. Aligning whole genomes is a fundamentally different problem than aligning short sequences.

Mauve has been developed with the idea that a multiple genome aligner should require only modest computational resources. It employs algorithmic techniques that scale well in the amount of sequence being aligned. For example, a pair of Y. pestis genomes can be aligned in under a minute, while a group of 9 divergent Enterobacterial genomes can be aligned in a few hours.

Mauve computes and interactively visualizes genome sequence comparisons. Using FastA or GenBank sequence data, Mauve constructs multiple genome alignments that identify large-scale rearrangement, gene gain, gene loss, indels, and nucleotide substutition.

Mauve is developed at the University of Wisconsin.

Meme
motif discovery and search
Responsible: Steffen Moeller
License: non-free for commercial purpose (http://meme.nbcr.net/meme/COPYRIGHT.html)
Debian package not available
SVN

MEME is a tool for discovering motifs in a group of related DNA or protein sequences. A motif is a sequence pattern that occurs repeatedly in a group of related protein or DNA sequences. MEME represents motifs as position-dependent letter-probability matrices which describe the probability of each possible letter at each position in the pattern. Individual MEME motifs do not contain gaps. Patterns with variable-length gaps are split by MEME into two or more separate motifs.

MEME takes as input a group of DNA or protein sequences (the training set) and outputs as many motifs as requested. MEME uses statistical modeling techniques to automatically choose the best width, number of occurrences, and description for each motif.

Mira - wnpp
Whole Genome Shotgun and EST Sequence Assembler
Responsible: Charles Plessy
License: GPL
Debian package not available
SVN

The mira genome fragment assembler is a specialised assembler for sequencing projects classified as 'hard' due to high number of similar repeats. For expressed sequence tags (ESTs) transcripts, miraEST is specialised on reconstructing pristine mRNA transcripts while detecting and classifying single nucleotide polymorphisms (SNP) occuring in different variations thereof.

The assembler is routinely used for such various tasks as mutation detection in different cell types, similarity analysis of transcripts between organisms, and pristine assembly of sequences from various sources for oligo design in clinical microarray experiments.

Please cite: Chevreux B, Pfisterer T, Drescher B, Driesel AJ, Müller WE, Wetter T, Suhai S.: Using the miraEST Assembler for Reliable and Automated mRNA Transcript Assembly and SNP Detection in Sequenced ESTs. (DOI) Genome Res. Jun;14(6):1147-59. (2004)
Murasaki
homology detection tool across multiple large genomes
License: GPL
Debian package not available
SVN

Murasaki is a scalable and fast, language theory-based homology detection tool across multiple large genomes. It enable whole-genome scale multiple genome global alignments. Supports unlimited length gapped-seed patterns and unique TF-IDF based filtering.

Murasaki is an anchor alignment software, which is

  • exteremely fast (17 CPU hours for whole Human x Mouse genome (with 40 nodes: 52 wall minutes))
  • scalable (Arbitrarily parallelizable across multiple nodes using MPI. Even a single node with 16GB of ram can handle over 1Gbp of sequence.)
  • unlimited pattern length
  • repeat tolerant
  • intelligent noise reduction
Ncoils - wnpp
coiled coil secondary structure prediction
Responsible: Steffen Moeller
License: GPL
Debian package not available
SVN

The program predicts the coiled coil secondary structure predictions from protein sequences. The algorithm was published in Lupas, van Dyke & Stock, Predicting coiled coils from protein sequences Science, 252, 1162-1164, 1991.

Phylographer - wnpp
Graph Visualization Tool
Responsible: Charles Plessy
License: GPL
Debian package not available
SVN
Language: Tcl/Tk

PhyloGrapher is a program designed to visualize and study evolutionary relationships within families of homologous genes or proteins (elements). PhyloGrapher is a drawing tool that generates custom graphs for a given set of elements. In general, it is possible to use PhyloGrapher to visualize any type of relations between elements. Used in conjunction with tcl_blast_parser, PhyloGrapher can represent the results of a BLAST search as a graph.

PhyloGrapher and tcl_blast_parser are useful tools to analyse BLAST biological sequence alignment reports (BLAST is provided by Debian's blast2 package).

R-other-genabel - wnpp
genome-wide SNP association analysis
Responsible: Steffen Moeller
License: GPL 2+
Debian package not available
SVN

A package for genome-wide association analysis between quantitative or binary traits and single-nucleiotide polymorphisms (SNPs).

Smile - wnpp
infer motifs in a set of sequences
Responsible: Steffen Moeller
License: GPL
Debian package not available
SVN

SMILE is a tool that infers motifs in a set of sequences, according to some criteria. It was first made to infer exceptional sites as binding sites in DNA sequences. Since the 1.4 version, it allows to infer motifs written on any alphabet (even degenerate) in any kind of sequences.

The specificity of SMILE is to allow to deal with what we call structured motifs, which are motifs associated by some distance constraints.

Tacg - wnpp
command line program for finding patterns in nucleic acids
Responsible: Charles Plessy
License: GPL and others
Debian package not available
SVN

tacg is a character-based, command line tool for unix-like operating systems for pattern-matching in nucleic acids and performing some of the basic protein manipulations. It was originally designed for restriction enzyme analysis of DNA, but has been extended to other types of matching. It now handles degenerate sequence input in a variety of matching approaches, as well as patterns with errors, regular expressions and TRANSFAC-formatted matrices.

It was designed to be a grep for DNA and like the original grep, its capabilities have grown so that now the author has to keep calling up the help page to figure out which flags (now ~50) mean what. tacg is NOT a GUI application in any sense. However, it's existance as a strictly command-line tool lends itself well to Webification and wrapping by various GUI tools and it is now distributed with a web interface form and a Perl CGI handler. Additionally, it can easily be integrated into editors that support shell commands such as nedit.

The use of tacg may be cited as: Mangalam, HJ. (2002) tacg, a grep for DNA. BMC Bioinformatics. 3:8
http://www.biomedcentral.com/1471-2105/3/8

Unofficial packages built by somebody else

Act
DNA Sequence Comparison Viewer
License: GPL

ACT (Artemis Comparison Tool) is a DNA sequence comparison viewer based on Artemis. In common with Artemis, ACT is written in Java and runs on UNIX, GNU/Linux, Macintosh and MS Windows systems. It can read complete EMBL and GENBANK entries or sequence in FASTA or raw format. Extra sequence features can be in EMBL, GENBANK or GFF format.

The sequence comparison displayed by ACT is usually the result of running a blastn or tblastx search. See the user manual for more information.

To see ACT in action go to the examples page http://www.sanger.ac.uk/Software/ACT/Examples/

Remark of Debian Med team: This package is included into BioLinux distribution
Artemis
genome viewer and annotation tool
License: GPL 2+

Artemis is a free genome viewer and annotation tool that allows visualization of sequence features and the results of analyses within the context of the sequence, and its six-frame translation. Artemis is written in Java, and is available for UNIX, Macintosh and Windows systems. It can read EMBL and GENBANK database entries or sequence in FASTA or raw format. Extra sequence features can be in EMBL, GENBANK or GFF format.

Remark of Debian Med team: This package is included into BioLinux distribution
Big-blast
Helper tool to run blast on large sequences
License: not specified

This script will chop up a large sequence, run blast on each bit and then write out an EMBL feature table and a MSPcrunch -d file containing the hits.

Remark of Debian Med team: This package is included into BioLinux distribution
Blixem
interactive browser of pairwise Blast matches
Responsible: BioLinux - Dan Swan
License: not specified

Blixem (BLast matches In an X-windows Embedded Multiple alignment), is an interactive browser of pairwise Blast matches that have been stacked up in a ma ster-slave multiple alignment

Remark of Debian Med team: The link to the source archive on the web pages is not valid any more - it might be a problem to obtain the source.

This package is included into BioLinux distribution http://envgen.nox.ac.uk/biolinux.html

Caftools
Tools to maintain DNA sequence assemblies
License: Free for non-commercial purposes

This is V2 of the CAF (Common Assembly Format) C-tools. It comprises a set of libraries and programs for manipulating DNA sequence assemblies using CAF files.

The CAF specification can be found at: http://www.sanger.ac.uk/Software/formats/CAF/

Remark of Debian Med team: The BioLinux distribution http://envgen.nox.ac.uk/biolinux.html

maintains a package called bio-linux-assembly-conversion-tools which contains caftools and roche2gap in one package with the following description:

Conversion tools for handling 454 assemblies.

This package contains code from different authors that allow sequence assemblies to be converted into formats such as CAF (Common Assembly Format) or GAP4. This package includes tools to convert assemblies from Newbler's ace format for loading into a gap4 assembly.

Cap3
DNA Sequence Assembly Program
License: free for governmental agency or a non-profit educational institution

CAP3 contains the following improvements to the CAP sequence assembly program.

 1. Use of forward-reverse constraints to correct assembly errors and
    link contigs.
 2. Use of base quality values in alignment of sequence reads.
 3. Automatic clipping of 5' and 3' poor regions of reads.
 4. Generation of assembly results in ace file format for Consed.
 5. CAP3 can be used in GAP4 of the Staden package.
These improvements allow CAP3 to take longer sequences of higher

errors and produce more accurate consensus sequences.

Remark of Debian Med team: Obtaining the source requires to fill in a registration form

Official distribution in Debian is probably impossible. The package included in the BioLinux distribution http://envgen.nox.ac.uk/biolinux.html containins only the binaries cap3 and formcon, dated Aug 29, 2002. This package exists purely for convenience to Bio-Linux users so that the files are placed in locations consistent with the Bio-Linux setup.

Cd-hit
suite of programs designed to quickly group sequences.
License: to be clarified

CD-HIT stands for Cluster Database at High Identity with Tolerance. The program (cd-hit) takes a fasta format sequence database as input and produces a set of 'non-redundant' (nr) representative sequences as output. In addition cd-hit outputs a cluster file, documenting the sequence 'groupies' for each nr sequence representative. The idea is to reduce the overall size of the database without removing any sequence information by only removing 'redundant' (or highly similar) sequences. This is why the resulting database is called non-redundant (nr). Essentially, cd-hit produces a set of closely related protein families from a given fasta sequence database.

CD-HIT uses a 'longest sequence first' list removal algorithm to remove sequences above a certain identity threshold. Additionally the algorithm implements a very fast heuristic to find high identity segments between sequences, and so can avoid many costly full alignments.

With recent developments, cd-hit package offers new programs for DNA sequence clustering and comparing two databases. It also has lots of new options for clustering control.

Remark of Debian Med team: This package is included into BioLinux distribution
Coalesce
outdated program to estimate population-genetic parameters
License: not specified

COALESCE fits the model which has a single population of constant size, and estimates 4Nu, where N is the effective population size and u is the neutral mutation rate per site. You may also want the Postscript or the TeX file of the preprint version of the Kuhner, Yamato, and Felsenstein 1995 paper.

Remark of Debian Med team: This software is probably outdated

The homepage contains the explicite link: "We are no longer supporting COALESCE as its functions can be done just as well by LAMARC and it's easier for us to support just one program. You may still want the paper, however." So this is actually no target for the Debian Med distribution but just a hint for users about the existence of this program and the even better alternative even if BioLinux distribution http://envgen.nox.ac.uk/biolinux.html contains a package.

Compclust
explore and quantify relationships between clustering results
License: MLX (http://woldlab.caltech.edu/compclust/LICENSE.txt)
Language: Python

CompClust is a python package written using the pyMLX and IPlot APIs. It provides software tools to explore and quantify relationships between clustering results. Its development has been largely built around needs of microarray data analysis but could be easily used in other domains.

Briefly pyMLX provides for efficient and convenient execution of many clustering algorithms using a extendable library of algorithms. It also provides many-to-many linkages between data features and annotations (such as cluster labels, gene names, gene ontology information, etc.) These linkages persist through varied data manipulations. IPlot provides an abstraction of the plotting process in which any arbitrary feature or derived feature of the data can be projected onto any feature of the plot, including the X,Y coordinates of points, marker symbol, marker size, maker/line color, etc. These plots are intrinsically linked to the dataset, the View and the Labeling classes found within pyMLX.

Dotter
detailed comparison of two sequences
License: to be clarified

Dotter is a graphical dotplot program for detailed comparison of two sequences. Here, every residue in one sequence is compared to every residue in the other sequence. The first sequence runs along the x-axis and the second sequence along the y-axis. In regions where the two sequences are similar to each other, a row of high scores will run diagonally across the dot matrix. If you're comparing a sequence against itself to find internal repeats, you'll notice that the main diagonal scores maximally, since it's the 100% perfect self-match.

To make the score matrix more intelligible, the pairwise scores are averaged over a sliding window which runs diagonally. The averaged score matrix forms a three-dimensional landscape, with the two sequences in two dimensions and the height of the peaks in the third. This landscape is projected onto two dimensions by aid of greyscales - the darker grey of a peak, the higher it is.

Dotter provides a tool to explore the visual appearance of this landscape, as well as a tool to examine the sequence alignment it represents.

Remark of Debian Med team: This package is included into BioLinux distribution

http://envgen.nox.ac.uk/biolinux.html

Homepage is currently unavailable but the source might be obtainable from freebsd.org.

Dotur
Defining Operational Taxonomic Units and estimating species Richness
License: GPL

Dotur (Distance Based OTU and Richness determination) is a computer program that takes a distance matrix describing the genetic distance between DNA sequence data and assigns sequences to operational taxonomic units (OTUs) using either the furthest, average, or nearest neighbor algorithms for all possible distances that can be described using the distance matrix. Using the OTU composition data, dotur constructs collector's and rarefaction curves for sampling intensity, richness estimators, and diversity indices.

Remark of Debian Med team: This package is included into BioLinux distribution
Estferret
processes, clusters and annotates EST data
License: to be clarified

ESTFerret processes, clusters and annotates EST data. It is user-configurable. Results are currently stored in a series of text tables. Annotation consists of searches against use r-defined blast databases, prosite, GO and allocation of EC numbers where possible.

EST-ferret is a user-configurable, automated pipeline for the convenient analysis of EST sequence data that includes all of the necessary steps for cleanup and trimming, submission to external sequence repositories, clustering, identification by BLAST homology searches and by searches of protein domain databases, annotation with computer-addressable terms and production of outputs for direct entry into microarray analysis packages. It is composed of several widely used, open-source algorithms, including PHRED, CAP3, BLAST, and a range of sequence and annotation databases, including Gene Ontology and Conserved Domain Database to deliver a putative identity and a detailed annotation of each clone. It can be run either step-by-step to track the outputs, or as a single batch process. Users can easily edit the configuration file to define parameter settings.

This package has five major components: (1) ESTs coding system; (2) sequence processing; (3) sequence clustering; (4) sequence annotating and (5) storage and reporting of results. DNA trace files are renamed and converted into FASTA format, cleaned and submitted to dbEST(Boguski, et al, 1993). Sequence assembly uses two rounds of CAP3 to assemble the ESTs into groups corresponding to separate gene families and unique genes. Sequence identification and annotation is provided by a series of BLAST homology searches (Parallel_BLAST and Priority_BLAST) against user-defined sequence databases implemented with the NCBI BLASTALL algorithm. The BLAST results are parsed and annotation terms that reflect functional attributes are captured from Gene Ontology (The Gene Ontology Consortium, 2000), KEGG and Enzyme Commission (EC) databases and applied to each of the clones. CDD (and InterPro) searches are performed for seeking protein domains in the sequences. Other options are provided to run PatSearch, RepeatMasker and BLAT to find UTRs, repeats and EST candidates in genomes. Finally, the package generates analysis reports in a variety of flat file formats, sources of which can be serve as inputs for some gene annotation and gene expression profiling tools, and also as a MySQL database or web-browsable search tool.

Remark of Debian Med team: This package is included into BioLinux distribution
Estscan
detect coding regions in DNA sequences, even if of low quality
License: free

ESTScan is a program that can detect coding regions in DNA sequences, even if they are of low quality. It will also detect and correct sequencing errors that lead to fr ameshifts.

ESTScan is not a gene prediction program, nor is it an open reading frame detector. In fact, its strength lies in the fact that it does not require an open reading frame to detect a coding region. As a result, the program may miss a few translated amino acids at either the N or the C terminus, but will detect coding regions with high selectivity and sensitivity.

Similarly to GENSCAN, ESTScan uses a Markov model to represent the bias in hexanucleotide usage found in coding regions relative to non-coding regions. Additionally, ESTScan allows insertions and deletions when these improve the coding region statistics. Further details can be found at: http://www.ch.embnet.org/software/ESTScan2_help.html

References:

  • Lottaz C, Iseli C, Jongeneel CV, Bucher P. (2003) Modeling sequencing errors by combining Hidden Markov models Bioinformatics 19, ii103-ii112.
  • Iseli C, Jongeneel CV, Bucher P. (1999) ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences. Proc Int Conf Intell Syst Mol Biol.138-48.
Remark of Debian Med team: This package is included into BioLinux distribution
Fasta
searching DNA and protein databases
License: no inclusion into commercial product

FASTA (pronounced FAST-AYE) stands for FAST-ALL, reflecting the fact that it can be used for a fast protein comparison or a fast nucleotide comparison. This program achieves a high level of sensitivity for similarity searching at high speed. This is achieved by performing optimised searches for local alignments using a substitution matrix. The high speed of this program is achieved by using the observed pattern of word hits to identify potential matches before attempting the more time consuming optimised search. The trade-off between speed and sensitivity is controlled by the ktup parameter, which specifies the size of the word. Increasing the ktup decreases the number of background hits. Not every word hit is investigated but instead initially looks for segment's containing several nearby hits.

Remark of Debian Med team: This package is included into BioLinux distribution
Fluctuate
outdated program to model a single population
License: not specified

FLUCTUATE fits the model which has a single population which has been growing (or shrinking) according to an exponential growth law. It estimates 4Nu and g, where N is the effective population size, u is the neutral mutation rate per site, and g is the growth rate of the population.

Remark of Debian Med team: This software is probably outdated

The homepage contains the explicite link: "We are no longer supporting FLUCTUATE as its functions can be done just as well by LAMARC and it's easier for us to support just one program. You may still want the paper, however." So this is actually no target for the Debian Med distribution but just a hint for users about the existence of this program and the even better alternative even if BioLinux distribution http://envgen.nox.ac.uk/biolinux.html contains a package.

Forester
visualization of annotated phylogenetic trees
License: LGPL

FORESTER is a Java/Perl based software package for phylogenomic analyses. Currently, it includes the phylogenetic tree visualization and manipulation tool ATV and implementations of the SDI algorithm and the RIO method (http://www.phylosoft.org/).

Remark of Debian Med team: This package is included into BioLinux distribution
Jalview - wnpp
multiple alignment editor
Responsible: Vincent Fourmond
License: GPL

Jalview is a multiple alignment editor written in Java. It is used widely in a variety of web pages (e.g. the EBI Clustalw server and the Pfam protein domain database) but is available as a general purpose alignment editor.

Lamarc
estimate population-genetic parameters
License: Apache V2.0

LAMARC is a program which estimates population-genetic parameters such as population size, population growth rate, recombination rate, and migration rates. It approximates a summation over all possible genealogies that could explain the observed sample, which may be sequence, SNP, microsatellite, or electrophoretic data. LAMARC and its sister program Migrate are successor programs to the older programs Coalesce, Fluctuate, and Recombine, which are no longer being supported. The programs are memory-intensive but can run effectively on workstations.

Remark of Debian Med team: This package is included into BioLinux distribution
Lucy
DNA sequence quality and vector trimming tool
Responsible: BioLinux - Dan Swan
License: GPL

Lucy is a utility that prepares raw DNA sequence fragments for sequence assembly, possibly using the TIGR Assembler. The cleanup process includes quality assessment, confidence reassurance, vector trimming and vector removal. The primary advantage of Lucy over other similar utilities is that it is a fully integrated, stand alone program.

Lucy was designed and written at The Institute for Genomic Research (TIGR, now the J. Craig Venter Institute), and it has been used here for several years to clean sequence data from automated DNA sequencers prior to sequence assembly and other downstream uses. The quality trimming portion of lucy makes use of phred quality scores, such as those produced by many automated sequencers based on the Sanger sequencing method. As such, lucy’s quality trimming may not be appropriate for sequence data produced by some of the new “next-generation” sequencers.

See also the SourceForge page at http://lucy.sourceforge.net/.

Remark of Debian Med team: This package is included into BioLinux distribution
Maxd
data warehouse and visualisation environment for genomic expression data
License: Artistic

Maxd is a data warehouse and visualisation environment for genomic expression data. It is being developed in the University of Manchester by the Microarray Bioinformatics Group.

Software components:

 maxdLoad2 - standards-compliant, highly customisable transcriptomics
             database
 maxdView  - modular and easily extensible data visualisation and
             analysis environment
 maxdSetup - installation management utility
Remark of Debian Med team: This package is included into BioLinux distribution
Mesquite
help biologists analyze comparative data about organisms
License: LGPL

Mesquite is software for evolutionary biology, designed to help biologists analyze comparative data about organisms. Its emphasis is on phylogenetic analysis, but some of its modules concern population genetics, while others do non-phylogenetic multivariate analysis. Because it is modular, the analyses available depend on the modules installed. Analyses include:

  • Reconstruction of ancestral states (parsimony, likelihood)
  • Tests of process of character evolution, including correlation
  • Analysis of speciation and extinction rates
  • Simulation of character evolution (categorical, DNA, continuous)
  • Parametric bootstrapping (integration with programs such as PAUP* and NONA)
  • Morphometrics (PCA, CVA, geometric morphometrics)
  • Coalescence (simulations, other calculations)
  • Tree comparisons and simulations (tree similarity, Markov speciation models) There is a brief outline of features, which includes screenshots. Mesquite is not primarily designed to infer phylogenetic trees, but rather for diverse analyses using already inferred trees.
Remark of Debian Med team: This package is included into BioLinux distribution
Migrate
estimation of population sizes and gene flow using the coalescent
License: to be clarified

Migrate estimates effective population sizes and past migration rates between n population assuming a migration matrix model with asymmetric migration rates and different subpopulation sizes. Migrate uses maximum likelihood or Bayesian inference to jointly estimate all parameters. It can use the followind data types: sequence data using Felsenstein's 84 model with or without site rate variation, single nucleotide polymorphism data, microsatellite data using a stepwise mutation model or a brownian motion mutation model, and electrophoretic data using an 'infinite' allele model. The output can contain: Estimates of all migration rates and all population sizes, assuming constant mutation rates among loci or a gamma distributed mutation rate among loci. Profile likelihood tables, Percentiles, Likelihood-ratio tests, and simple plots of the log-likelihood surfaces for all populations and all loci.

Remark of Debian Med team: This package is included into BioLinux distribution
Mrbayes
Bayesian estimation of phylogeny
License: GPL

MrBayes is a program for the Bayesian estimation of phylogeny. Bayesian inference of phylogeny is based upon a quantity called the posterior probability distribution of trees, which is the probability of a tree conditioned on the observations. The conditioning is accomplished using Bayes's theorem. The posterior probability distribution of trees is impossible to calculate analytically; instead, MrBayes uses a simulation technique called Markov chain Monte Carlo (or MCMC) to approximate the posterior probabilities of trees.

The program takes as input a character matrix in a NEXUS file format. The output is several files with the parameters that were sampled by the MCMC algorithm. MrBayes can summarize the information in these files for the user. The program features include:

Remark of Debian Med team: This package is included into BioLinux distribution
Msatfinder
identification and characterization of microsatellites in a comparative genomic context
License: GPL

Msatfinder is a Perl script designed to allow the identification and characterization of microsatellites in a comparative genomic context. There is also an online manual, a discussion forum and an online interface where users can do searches in any number of DNA or protein sequences (as long as the maximum size of all sequences does not exceed 10MB). Nucleotide and amino acid sequences in GenBank, FASTA, EMBL and Swissprot formats are supported.

Remark of Debian Med team: This package is included into BioLinux distribution
Mview
reformat results of a sequence database search or a multiple alignment
License: GPL

MView is a tool for converting the results of a sequence database search (BLAST, FASTA, etc.) into the form of a coloured multiple alignment of hits stacked against the query. Alternatively, an existing multiple alignment (MSF, PIR, CLUSTAL, etc.) can be pr ocessed. It reformats the results of a sequence database search or a multiple alignment adding optional HTML markup to control colouring and web page layout. MView is not a multiple alignment program, nor is it a general purpose alignment editor.

Remark of Debian Med team: This package is included into BioLinux distribution
Oligoarrayaux
Prediction of Melting Profiles for Nucleic Acids
License: non-free (fre academical use)

OligoArrayAux is a subset of the UNAFold package for use with OligoArray (http://berry.engin.umich.edu/oligoarray2_1/). OligoArray is a free software that computes gene specific oligonucleotides for genome-scale oligonucleotide microarray construction. (It is not really specified what they mean with "free software". You can download the source code after registration: "registration is the only way for me to keep trace of OligoArray users and be able to send you a bug fix or a new release".)

The original UNAFold server is available at http://dinamelt.bioinfo.rpi.edu/download.php and you should probably read http://dinamelt.bioinfo.rpi.edu/ if you want to know more about "Prediction of Melting Profiles for Nucleic Acids".

Remark of Debian Med team: This package is included into BioLinux distribution

Finally it is hard to find some documentation what OligoArrayAux is really doing because it