Debian Med Project
Help us to see Debian used by medical practitioners and biomedical researchers! Join us on the Alioth page.
Summary
Cloud
Debian Med bioinformatics applications usable in cloud computing

This metapackage will install Debian packages related to molecular biology, structural biology and bioinformatics for use in life sciences, that do not depend on graphical toolkits and therefore can fit on system images for use in cloud computing clusters, where space can be limited.

The list to the right includes various software projects which are of some interest to the Debian Med Project. Currently, only a few of them are available as Debian packages. It is our goal, however, to include all software in Debian Med which can sensibly add to a high quality Debian Pure Blend.

For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:

If you discover a project which looks like a good candidate for Debian Med to you, or if you have prepared an unofficial Debian package, please do not hesitate to send a description of that project to the Debian Med mailing list

Links to other tasks

Debian Med Cloud packages

Official Debian packages with high relevance

Alien-hunter
Temas de Ordem Variável Interpolados para identificar DNA adquirido horizontalmente
Versions of package alien-hunter
ReleaseVersionArchitectures
squeeze1.7-1all
wheezy1.7-1all
sid1.7-1all
Debtags of package alien-hunter:
fieldbiology:structural, biology
roleprogram
scopeutility
useanalysing
Popcon: 10 users (6 upd.)*
Versions and Archs
License: DFSG free
Svn

Alien_hunter é uma aplicação para a predição de eventos putativos de Transferência Horizontal de Genes (HGT -- Horizontal Gene Transfer) com a implementação de Temas de Ordem Variável Interpolados (IVOM -- Interpolated Variable Order Motifs). A abordagem IVOM explora tendências composicionais usando distribuições de temas de ordem variável e captura, mais confiavelmente quando comparado a métodos de ordem fixa, a composição local de uma sequencia. Opcionalmente as predições podem ser analisadas em um Modelo Markov Oculto (HMM - Hidden Markov Model) de 2-estados de 2a. ordem, usando uma infraestrutura de detecção de ponto de mudança, para otimizar a localização dos limites das regiões de predição. As predições (formato embl) podem ser automaticamente carregadas no visualizador de genoma Artemis que está livremente disponível em: http://www.sanger.ac.uk/Software/Artemis/.

O manuscrito descrevendo o algoritmo alien_hunter está disponível a partir de "Bioinformatics: Interpolated variable order motifs for identification of horizontally acquired DNA: revisiting the Salmonella pathogenicity islands". Vernikos GS, Parkhill J Bioinformatics. 2006;. PMID: 16837528.

Please cite: Georgios S. Vernikos and Julian Parkhill: Interpolated variable order motifs for identification of horizontally acquired DNA: revisiting the Salmonella pathogenicity islands. (PubMed,eprint) Bioinformatics 22(18):2196-2203 (2006)
Altree
program to perform phylogeny-based association and localization analysis
Versions of package altree
ReleaseVersionArchitectures
squeeze1.0.1-3amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
sid1.0.1-7hurd-i386
wheezy1.2.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.2.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package altree:
fieldbiology:bioinformatics, biology
interfacecommandline
roleshared-lib, program
scopeutility
usecomparing, analysing
works-with-formatplaintext
Popcon: 19 users (10 upd.)*
Versions and Archs
License: DFSG free
Svn

ALTree was designed to perform association detection and localization of susceptibility sites using haplotype phylogenetic trees: first, it allows the detection of an association between a candidate gene and a disease, and second, it enables to make hypothesis about the susceptibility loci.

Please cite: Claire Bardel, Vincent Danjean and Emmanuelle Genin: ALTree: association detection and localization of susceptibility sites using haplotype phylogenetic trees. (PubMed,eprint) Bioinformatics 22(11):1402-1403 (2006)
Autodock
análise das ligações do ligante à estrutura de proteína
Versions of package autodock
ReleaseVersionArchitectures
squeeze4.2.3-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy4.2.3-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid4.2.3-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
experimental4.2.5.1-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream4.2.5.1
Debtags of package autodock:
fieldbiology:structural, biology
interfacecommandline
roleprogram
scopeutility
useanalysing
works-with3dmodel
Popcon: 18 users (12 upd.)*
Newer upstream!
License: DFSG free
Svn

AutoDock é um representante único dos programas que tratam a simulação da atracagem indo de ligantes químicos bem pequenos até receptores grandes de proteínas. Versões anteriores tinham toda a flexibilidade nos ligantes enquanto a proteína era mantida rígida. Esta última versão 4 também permite flexibilidade de cadeias laterais selecionadas dos resíduos de superfície, i.e., leva os "rotamers" em consideração.

O programa AutoDock realiza a atracagem do ligante a um conjunto de grades descrevendo a proteína alvo. O AutoGrid pré-calcula estas grades.

The package is enhanced by the following packages: autogrid
Please cite: Garrett M. Morris, Ruth Huey, William Lindstrom, Michel F. Sanner, Richard K. Belew, David S. Goodsell and Arthur J. Olson: AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. (PubMed) Journal of Computational Chemistry 30(16):2785-2791 (2009)
Screenshots of package autodock
Bedtools
suite of utilities for comparing genomic features
Versions of package bedtools
ReleaseVersionArchitectures
wheezy2.16.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.17.0-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package bedtools:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopesuite
usefiltering, converting, comparing, analysing
works-withbiological-sequence
Popcon: 22 users (43 upd.)*
Versions and Archs
License: DFSG free
Git

The BEDTools utilities allow one to address common genomics tasks such as finding feature overlaps and computing coverage. The utilities are largely based on four widely-used file formats: BED, GFF/GTF, VCF, and SAM/BAM. Using BEDTools, one can develop sophisticated pipelines that answer complicated research questions by streaming several BEDTools together.

The groupBy utility is distribued in the filo package.

Please cite: Aaron R. Quinlan and Ira M. Hall: BEDTools: a flexible suite of utilities for comparing genomic features. (PubMed,eprint) Bioinformatics 26(6):841-842 (2010)
Bwa
Burrows-Wheeler Aligner
Versions of package bwa
ReleaseVersionArchitectures
squeeze0.5.8c-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy0.6.2-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.6.2-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream0.7.4
Debtags of package bwa:
biologypeptidic, nuceleic-acids
fieldbiology:bioinformatics, biology
interfacetext-mode, commandline
roleprogram
usecomparing, analysing
Popcon: 31 users (17 upd.)*
Newer upstream!
License: DFSG free
Git

Burrows-Wheeler Aligner (BWA, em português Alinhador Burrows-Wheeler) é um programa que alinha sequências relativamente curtas de nucleotídeos comparando-as com uma sequência longa de referência como o genoma humano. Ele implementa dois algoritmos, bwa-short e BWA-SW. O primeiro funciona para sequências de consulta menores que 200 bp e o último para sequências maiores, até aproximadamente 100 kbp. Ambos os algoritmos fazem alinhamento com brechas. Eles são comumente mais precisos e rápidos em consultas com baixas taxas de erro.

Please cite: Heng Li and Richard Durbin: Fast and accurate short read alignment with Burrows-Wheeler transform. (PubMed,eprint) Bioinformatics 25(14):1754-1760 (2009)
Clustalw
global multiple nucleotide or peptide sequence alignment
Versions of package clustalw
ReleaseVersionArchitectures
squeeze2.0.12-1 (non-free)amd64,armel,i386,ia64,mips,mipsel,powerpc,s390,sparc
wheezy2.1+lgpl-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.1+lgpl-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package clustalw:
biologypeptidic, nuceleic-acids, format:aln
fieldbiology:bioinformatics, biology
interfacetext-mode, commandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 45 users (34 upd.)*
Versions and Archs
License: DFSG free
Git

This program performs an alignment of multiple nucleotide or amino acid sequences. It recognizes the format of input sequences and whether the sequences are nucleic acid (DNA/RNA) or amino acid (proteins). The output format may be selected from in various formats for multiple alignments such as Phylip or FASTA. Clustal W is very well accepted.

The output of Clustal W can be edited manually but preferably with an alignment editor like SeaView or within its companion Clustal X. When building a model from your alignment, this can be applied for improved database searches. The Debian package hmmer creates such in form of an HMM.

The package is enhanced by the following packages: clustalw-mpi
Please cite: M. A. Larkin, G. Blackshields, N. P. Brown, R. Chenna, P. A. McGettigan, H. McWilliam, F. Valentin, I.M. Wallace, A. Wilm, R. Lopez, J. D. Thompson, T. J. Gibson and D. G. Higgins: Clustal W and Clustal X version 2.0. (PubMed,eprint) Bioinformatics 23(21):2947-2948 (2007)
Embassy-domainatrix
Extra EMBOSS commands to handle domain classification file
Versions of package embassy-domainatrix
ReleaseVersionArchitectures
wheezy0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream0.1.650
Debtags of package embassy-domainatrix:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usesearching, editing, converting, analysing
works-with-formatplaintext
Popcon: 5 users (4 upd.)*
Newer upstream!
License: DFSG free
Svn

The DOMAINATRIX programs were developed by Jon Ison and colleagues at MRC HGMP for their protein domain research. They are included as an EMBASSY package as a work in progress.

Applications in the current domainatrix release are cathparse (generates DCF file from raw CATH files), domainnr (removes redundant domains from a DCF file), domainreso (removes low resolution domains from a DCF file), domainseqs (adds sequence records to a DCF file), domainsse (adds secondary structure records to a DCF file), scopparse (generates DCF file from raw SCOP files) and ssematch (searches a DCF file for secondary structure matches).

Embassy-domalign
Extra EMBOSS commands for protein domain alignment
Versions of package embassy-domalign
ReleaseVersionArchitectures
wheezy0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream0.1.650
Debtags of package embassy-domalign:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
useediting, comparing, analysing
works-with-formatplaintext
Popcon: 6 users (4 upd.)*
Newer upstream!
License: DFSG free
Svn

The DOMALIGN programs were developed by Jon Ison and colleagues at MRC HGMP for their protein domain research. They are included as an EMBASSY package as a work in progress.

Applications in the current domalign release are allversusall (sequence similarity data from all-versus-all comparison), domainalign (generates alignments (DAF file) for nodes in a DCF file), domainrep (reorders DCF file to identify representative structures) and seqalign (extend alignments (DAF file) with sequences (DHF file)).

Embassy-domsearch
Extra EMBOSS commands to search for protein domains
Versions of package embassy-domsearch
ReleaseVersionArchitectures
wheezy0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream0.1.650
Debtags of package embassy-domsearch:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
useanalysing
Popcon: 5 users (4 upd.)*
Newer upstream!
License: DFSG free
Svn

The DOMSEARCH programs were developed by Jon Ison and colleagues at MRC HGMP for their protein domain research. They are included as an EMBASSY package as a work in progress.

Applications in this DOMSEARCH release are seqfraggle (removes fragment sequences from DHF files), seqnr (removes redundancy from DHF files), seqsearch (generates PSI-BLAST hits (DHF file) from a DAF file), seqsort (Remove ambiguous classified sequences from DHF files) and seqwords (Generates DHF files from keyword search of UniProt).

Emboss
european molecular biology open software suite
Versions of package emboss
ReleaseVersionArchitectures
squeeze6.1.0-5amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy6.4.0-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid6.4.0-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream6.5.7
Debtags of package emboss:
fieldbiology:molecular, biology:bioinformatics, biology
interfacecommandline
roleprogram
scopesuite
useviewing, typesetting, text-formatting, searching, organizing, editing, converting, comparing, analysing
works-withdb
works-with-formatplaintext
Popcon: 143 users (44 upd.)*
Newer upstream!
License: DFSG free
Git

EMBOSS is a free Open Source software analysis package specially developed for the needs of the molecular biology (e.g. EMBnet) user community. The software automatically copes with data in a variety of formats and even allows transparent retrieval of sequence data from the web. Also, as extensive libraries are provided with the package, it is a platform to allow other scientists to develop and release software in true open source spirit. EMBOSS also integrates a range of currently available packages and tools for sequence analysis into a seamless whole. EMBOSS breaks the historical trend towards commercial software packages.

The package is enhanced by the following packages: clustalw primer3
Please cite: Peter Rice, Ian Longden and Alan Bleasby: EMBOSS: The European Molecular Biology Open Software Suite. (PubMed) Trends in Genetics 16(6):276 - 277 (2000)
Screenshots of package emboss
Fastdnaml
Tool for construction of phylogenetic trees of DNA sequences
Versions of package fastdnaml
ReleaseVersionArchitectures
squeeze1.2.2-9amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.2.2-10amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.2.2-10amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package fastdnaml:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing, analysing
works-with-formatplaintext
Popcon: 11 users (8 upd.)*
Versions and Archs
License: DFSG free
Svn

fastDNAml is a program derived from Joseph Felsenstein's version 3.3 DNAML (part of his PHYLIP package). Users should consult the documentation for DNAML before using this program.

fastDNAml is an attempt to solve the same problem as DNAML, but to do so faster and using less memory, so that larger trees and/or more bootstrap replicates become tractable. Much of fastDNAml is merely a recoding of the PHYLIP 3.3 DNAML program from PASCAL to C.

Note that the homepage of this program is not available any more and so this program will probably not see any further updates.

Please cite: Gary J. Olsen, Hideo Matsuda, Ray Hagstrom and Ross Overbeek: fastDNAml: a tool for construction of phylogenetic trees of DNA sequences using maximum likelihood. (PubMed,eprint) Comput Appl Biosci 10(1):41-48 (1994)
Fastlink
faster version of pedigree programs of Linkage
Versions of package fastlink
ReleaseVersionArchitectures
squeeze4.1P-fix95-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy4.1P-fix95-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid4.1P-fix95-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package fastlink:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing, analysing
Popcon: 16 users (7 upd.)*
Versions and Archs
License: DFSG free
Svn

Genetic linkage analysis is a statistical technique used to map genes and find the approximate location of disease genes. There was a standard software package for genetic linkage called LINKAGE. FASTLINK is a significantly modified and improved version of the main programs of LINKAGE that runs much faster sequentially, can run in parallel, allows the user to recover gracefully from a computer crash, and provides abundant new documentation. FASTLINK has been used in over 1000 published genetic linkage studies.

This package contains the following programs:

 ilink:    GEMINI optimization procedure to find a locally
           optimal value of the theta vector of recombination
           fractions
 linkmap:  calculates location scores of one locus against a
           fixed map of other loci
 lodscore: compares likelihoods at locally optimal theta
 mlink:    calculates lod scores and risk with two of more loci
 unknown:  identify possible genotypes for unknowns
Please cite: R. W. Cottingham Jr., R. M. Idury and A. A. Schaffer: Faster Sequential Genetic Linkage Computations. (PubMed,eprint) American Journal of Human Genetics 53(1):252-263 (1993)
Filo
FILe and stream Operations
Versions of package filo
ReleaseVersionArchitectures
wheezy1.1+2011020401.2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.1+2011020401.2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 13 users (4 upd.)*
Versions and Archs
License: DFSG free
Git

The following tools are available as part of the filo package:

groupBy – mimics the “groupBy” clause in database systems.

shuffle – randomize the order of lines in a file.

stats – computes descriptive statistic on a given column of a tab-delimited file or stream.

Because their name is too generic, ‘shuffle’ and ‘stats’ are relocated in /usr/lib/filo.

Infernal
inference of RNA secondary structural alignments
Versions of package infernal
ReleaseVersionArchitectures
squeeze1.0.2-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.0.2-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.2-2armel,armhf,ia64,mips,mipsel,powerpc,s390,s390x,sparc
sid1.1~rc2-1amd64,hurd-i386,i386,kfreebsd-amd64,kfreebsd-i386
Debtags of package infernal:
biologynuceleic-acids
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
useanalysing
Popcon: 20 users (26 upd.)*
Versions and Archs
License: DFSG free
Svn

Infernal ("INFERence of RNA ALignment") searches DNA sequence databases for RNA structure and sequence similarities. It provides an implementation of a special variant of profile stochastic context-free grammars called covariance models (CMs). A CM is like a sequence profile, but it scores a combination of sequence consensus and RNA secondary structure consensus, so in many cases, it is more capable of identifying RNA homologs that conserve their secondary structure more than their primary sequence.

The tool is an integral component of the Rfam database.

Please cite: Eric P. Nawrocki, Diana L. Kolbe and Sean R. Eddy: Infernal 1.0: inference of RNA alignments. (PubMed,eprint) Bioinformatics 25(10):1335-1337 (2009)
Last-align
genome-scale comparison of biological sequences
Versions of package last-align
ReleaseVersionArchitectures
squeeze128-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy199-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid199-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream286
Debtags of package last-align:
fieldbiology:bioinformatics, biology
roleprogram
Popcon: 12 users (8 upd.)*
Newer upstream!
License: DFSG free
Svn

LAST is software for comparing and aligning sequences, typically DNA or protein sequences. LAST is similar to BLAST, but it copes better with very large amounts of sequence data. Here are two things LAST is good at:

  • Comparing large (e.g. mammalian) genomes.
  • Mapping lots of sequence tags onto a genome.

The main technical innovation is that LAST finds initial matches based on their multiplicity, instead of using a fixed size (e.g. BLAST uses 10-mers). This allows one to map tags to genomes without repeat-masking, without becoming overwhelmed by repetitive hits. To find these variable-sized matches, it uses a suffix array (inspired by Vmatch). To achieve high sensitivity, it uses a discontiguous suffix array, analogous to spaced seeds.

Please cite: Martin C. Frith, Raymond Wan and Paul Horton: Incorporating sequence quality data into alignment improves DNA read mapping. (PubMed,eprint) Nucl. Acids Res. 38(7):e100 (2010)
Loki
MCMC linkage analysis on general pedigrees
Versions of package loki
ReleaseVersionArchitectures
squeeze2.4.7.4-4amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.4.7.4-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.4.7.4-4amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package loki:
fieldbiology
interfacecommandline
roleprogram
scopeutility
useanalysing
Popcon: 14 users (6 upd.)*
Versions and Archs
License: DFSG free
Svn

Performs Markov chain Monte Carlo multipoint linkage analysis on large, complex pedigrees. The current package supports analyses on quantitative traits only, although this restriction will be lifted in later versions. Joint estimation of QTL number, position and effects uses Reversible Jump MCMC. It is also possible to perform affected only IBD sharing analyses.

The homepage of this project used to be at http://loki.homeunix.net but the project is dead now and the homepage vanished. The Homepage field above points to the web archive.

The package is enhanced by the following packages: loki-doc
Please cite: Simon C. Heath: Markov chain Monte Carlo segregation and linkage analysis for oligogenic models. (PubMed,eprint) American Journal of Human Genetics 61(3):748-60 (1997)
Maq
maps short fixed-length polymorphic DNA sequence reads to reference sequences
Versions of package maq
ReleaseVersionArchitectures
squeeze0.7.1-3amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy0.7.1-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.7.1-5amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package maq:
biologynuceleic-acids
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usesearching, comparing, analysing
works-with-formatplaintext
Popcon: 17 users (17 upd.)*
Versions and Archs
License: DFSG free
Svn

Maq (short for Mapping and Assembly with Quality) builds mapping assemblies from short reads generated by the next-generation sequencing machines. It was particularly designed for Illumina-Solexa 1G Genetic Analyzer, and has a preliminary functionality to handle ABI SOLiD data. Maq is previously known as mapass2.

Developmemt of Maq stopped in 2008. Its successors are BWA and SAMtools.

Please cite: Heng Li, Jue Ruan and Richard Durbin: Mapping short DNA sequencing reads and calling variants using mapping quality scores. (PubMed,eprint) Genome Research 18(11):1851-1858 (2008)
Phyml
Phylogenetic estimation using Maximum Likelihood
Versions of package phyml
ReleaseVersionArchitectures
squeeze20100123-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy20110919-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid20110919-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream20130513
Debtags of package phyml:
biologypeptidic
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
usecomparing, analysing
works-withbiological-sequence
Popcon: 28 users (20 upd.)*
Newer upstream!
License: DFSG free
Svn

PhyML is a software that estimates maximum likelihood phylogenies from alignments of nucleotide or amino acid sequences. It provides a wide range of options that were designed to facilitate standard phylogenetic analyses. The main strengths of PhyML lies in the large number of substitution models coupled to various options to search the space of phylogenetic tree topologies, going from very fast and efficient methods to slower but generally more accurate approaches. It also implements two methods to evaluate branch supports in a sound statistical framework (the non-parametric bootstrap and the approximate likelihood ratio test).

PhyML was designed to process moderate to large data sets. In theory, alignments with up to 4,000 sequences 2,000,000 character-long can be analyzed. In practice however, the amount of memory required to process a data set is proportional of the product of the number of sequences by their length. Hence, a large number of sequences can only be processed provided that they are short. Also, PhyML can handle long sequences provided that they are not numerous. With most standard personal computers, the “comfort zone” for PhyML generally lies around 3 to 500 sequences less than 2,000 character long.

This pakcage also includes PhyTime.

Please cite: Stephane Guindon and Olivier Gascuel: A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood. (PubMed,eprint) Syst Biol 52(5):696-704 (2003)
Picard-tools
Command line tools to manipulate SAM and BAM files
Versions of package picard-tools
ReleaseVersionArchitectures
squeeze1.27-1all
wheezy1.46-1all
sid1.82-2all
experimental1.90-1all
upstream1.91
Popcon: 25 users (53 upd.)*
Newer upstream!
License: DFSG free
Git

SAM (Sequence Alignment/Map) format is a generic format for storing large nucleotide sequence alignments. Picard Tools includes these utilities to manipulate SAM and BAM files: BamToBfq IlluminaBasecallsToSam BuildBamIndex MarkDuplicates CalculateHsMetrics MeanQualityByCycle CleanSam MergeBamAlignment CollectAlignmentSummaryMetrics MergeSamFiles CollectGcBiasMetrics NormalizeFasta CollectInsertSizeMetrics QualityScoreDistribution CollectRnaSeqMetrics ReplaceSamHeader CompareSAMs RevertSam CreateSequenceDictionary SamFormatConverter ExtractIlluminaBarcodes SamToFastq EstimateLibraryComplexity SortSam FastqToSam ValidateSamFile FixMateInformation ViewSam

Plink
whole-genome association analysis toolset
Versions of package plink
ReleaseVersionArchitectures
squeeze1.07-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mipsel,powerpc,s390,sparc
wheezy1.07-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.07-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package plink:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
Popcon: 14 users (9 upd.)*
Versions and Archs
License: DFSG free
Svn

plink expects as input the data from SNP (single nucleotide polymorphism) chips of many individuals and their phenotypical description of a disease. It finds associations of single or pairs of DNA variations with a phenotype and can retrieve SNP annotation from an online source.

SNPs can evaluated individually or as pairs for their association with the disease phenotypes. The joint investigation of copy number variations is supported. A variety of statistical tests have been implemented.

Please note: The executable was renamed to p-link because of a name clash. Please read more about this in /usr/share/doc/README.Debian.

Please cite: Shaun Purcell, Benjamin Neale, Kathe Todd-Brown, Lori Thomas, Manuel A. R. Ferreira, David Bender, Julian Maller, Pamela Sklar, Paul I. W. de Bakker, Mark J. Daly and Pak C. Sham: PLINK: a toolset for whole-genome association and population-based linkage analysis. (PubMed) American Journal of Human Genetics 81(3):559-75 (2007)
Python-cogent
framework for genomic biology
Versions of package python-cogent
ReleaseVersionArchitectures
squeeze1.4.1-1.2 (non-free)amd64,armel,i386,ia64,mips,mipsel,powerpc,s390,sparc
wheezy1.5.1-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.5.1-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream1.5.3
Debtags of package python-cogent:
biologypeptidic
devellang:python
fieldbiology
roledevel-lib
usecomparing, analysing
Popcon: 7 users (11 upd.)*
Newer upstream!
License: DFSG free
Svn

PyCogent is a software library for genomic biology. It is a fully integrated and thoroughly tested framework for:

  • controlling third-party applications,
  • devising workflows; querying databases,
  • conducting novel probabilistic analyses of biological sequence evolution, and
  • generating publication quality graphics. It is distinguished by many unique built-in capabilities (such as true codon alignment) and the frequent addition of entirely new methods for the analysis of genomic data.
Please cite: Rob Knight, Peter Maxwell, Amanda Birmingham, Jason Carnes, J Gregory Caporaso, Brett C Easton, Michael Eaton, Micah Hamady, Helen Lindsay, Zongzhi Liu, Catherine Lozupone, Daniel McDonald, Michael Robeson, Raymond Sammut, Sandra Smit, Matthew J Wakefield, Jeremy Widmann, Shandy Wikman, Stephanie Wilson, Hua Ying and Gavin A Huttley: PyCogent: a toolkit for making sense from sequence. (PubMed,eprint) Genome Biology 8(8):R171 (2007)
R-bioc-hilbertvis
GNU R package to visualise long vector data
Versions of package r-bioc-hilbertvis
ReleaseVersionArchitectures
squeeze1.5.0-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.14.0-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.18.0-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package r-bioc-hilbertvis:
biologynuceleic-acids
fieldbiology:bioinformatics, biology
useanalysing
Popcon: 20 users (22 upd.)*
Versions and Archs
License: DFSG free
Svn

This tool allows one to display very long data vectors in a space-efficient manner, by organising it along a 2D Hilbert curve. The user can then visually judge the large scale structure and distribution of features simultaenously with the rough shape and intensity of individual features.

In bioinformatics, a typical use case is ChIP-Chip and ChIP-Seq, or basically all the kinds of genomic data, that are conventionally displayed as quantitative track ("wiggle data") in genome browsers such as those provided by Ensembl or UCSC.

Please cite: Simon Anders: Visualization of genomic data with the Hilbert curve. (PubMed,eprint) Bioinformatics 25(10):1231-1235 (2009)
R-cran-qtl
GNU R package for genetic marker linkage analysis
Versions of package r-cran-qtl
ReleaseVersionArchitectures
squeeze1.16-6-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.23-16-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.27-10-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package r-cran-qtl:
devellibrary, lang:r
fieldstatistics, biology
roleapp-data
suitegnu
Popcon: 54 users (92 upd.)*
Versions and Archs
License: DFSG free
Svn

R/qtl is an extensible, interactive environment for mapping quantitative trait loci (QTLs) in experimental crosses. It is implemented as an add-on-package for the freely available and widely used statistical language/software R (see http://www.r-project.org).

The development of this software as an add-on to R allows one to take advantage of the basic mathematical and statistical functions, and powerful graphics capabilities, that are provided with R. Further, the user will benefit by the seamless integration of the QTL mapping software into a general statistical analysis program. The goal is to make complex QTL mapping methods widely accessible and allow users to focus on modeling rather than computing.

A key component of computational methods for QTL mapping is the hidden Markov model (HMM) technology for dealing with missing genotype data. The main HMM algorithms were implemented, with allowance for the presence of genotyping errors, for backcrosses, intercrosses, and phase-known four-way crosses.

The current version of R/qtl includes facilities for estimating genetic maps, identifying genotyping errors, and performing single-QTL genome scans and two-QTL, two-dimensional genome scans, by interval mapping (with the EM algorithm), Haley-Knott regression, and multiple imputation. All of this may be done in the presence of covariates (such as sex, age or treatment). One may also fit higher-order QTL models by multiple imputation.

Please cite: Karl W. Broman, Hao Wu, Saunak Sen and Gary A. Churchill: R/qtl: QTL mapping in experimental crosses. (PubMed,eprint) Bioinformatics 19:889-890 (2003)
R-other-mott-happy
GNU R package for fine-mapping complex diseases
Versions of package r-other-mott-happy
ReleaseVersionArchitectures
squeeze2.1-4amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.1-7amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.1-7amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream2.3
Debtags of package r-other-mott-happy:
fieldbiology:bioinformatics, biology
useanalysing
Popcon: 19 users (11 upd.)*
Newer upstream!
License: DFSG free
Svn

Happy is an R interface into the HAPPY C package for fine-mapping Quantitative Trait Loci (QTL) in Heterogenous Stocks (HS). An HS is an advanced intercross between (usually eight) founder inbred strains of mice. HS are suitable for fine-mapping QTL. It uses a multipoint analysis which offers significant improvements in statistical power to detect QTLs over that achieved by single-marker association.

The happy package is an extension of the original C program happy; it uses the C code to compute the probability of descent from each of the founders, at each locus position, but the happy packager allows a much richer range of models to be fit to the data.

Read /usr/share/doc/r-other-mott-happy/README.Debian for a more detailed explanation.

Please cite: Richard Mott, Christopher J. Talbot, Maria G. Turri, Allan C. Collins and Jonathan Flint: A method for fine mapping quantitative trait loci in outbred animal stocks. (PubMed,eprint) Proc. Natl. Acad. Sci. USA 97(23):12649-12654 (2000)
Raster3d
tools for generating images of proteins or other molecules
Versions of package raster3d
ReleaseVersionArchitectures
squeeze2.9-1-1 (non-free)amd64,armel,i386,ia64,mips,mipsel,powerpc,s390,sparc
wheezy3.0-2-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid3.0-2-4amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package raster3d:
fieldbiology:structural, biology
interfacecommandline
roleprogram
scopeapplication
useviewing, converting
works-withimage:raster, image, 3dmodel
works-with-formatpng, jpg
Popcon: 29 users (7 upd.)*
Versions and Archs
License: DFSG free
Svn

Raster3D is a set of tools for generating high quality raster images of proteins or other molecules. The core program renders spheres, triangles, cylinders, and quadric surfaces with specular highlighting, Phong shading, and shadowing. It uses an efficient software Z-buffer algorithm which is independent of any graphics hardware. Ancillary programs process atomic coordinates from PDB files into rendering descriptions for pictures composed of ribbons, space-filling atoms, bonds, ball+stick, etc. Raster3D can also be used to render pictures composed in other programs such as Molscript in glorious 3D with highlights, shadowing, etc. Output is to pixel image files with 24 bits of color information per pixel.

Please cite: E.A. Merritt and D.J. Bacon: Raster3D Photorealistic Molecular Graphics. (PubMed) Methods in Enzymology 277:505-524 (1997)
Rnahybrid
Fast and effective prediction of microRNA/target duplexes
Versions of package rnahybrid
ReleaseVersionArchitectures
squeeze2.1-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.1.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.1.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package rnahybrid:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
useanalysing
Popcon: 10 users (7 upd.)*
Versions and Archs
License: DFSG free
Svn

RNAhybrid is a tool for finding the minimum free energy hybridisation of a long and a short RNA. The hybridisation is performed in a kind of domain mode, ie. The short sequence is hybridised to the best fitting part of the long one. The tool is primarily meant as a means for microRNA target prediction.

Please cite: Marc Rehmsmeier, Peter Steffen, Matthias Höchsmann and Robert Giegerich: Fast and effective prediction of microRNA/target duplexes. (PubMed,eprint) RNA 10(10):1507-1517 (2004)
Samtools
processing sequence alignments in SAM and BAM formats
Versions of package samtools
ReleaseVersionArchitectures
squeeze0.1.8-1amd64,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390
wheezy0.1.18-1amd64,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390
sid0.1.19-1amd64,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x
Debtags of package samtools:
fieldbiology
interfacecommandline
networkclient
roleprogram
scopeutility
uitoolkitncurses
usefiltering, calculating, analysing
works-withbiological-sequence
Popcon: 48 users (72 upd.)*
Versions and Archs
License: DFSG free
Git

Samtools is a set of utilities that manipulate nucleotide sequence alignments in the binary BAM format. It imports from and exports to the ascii SAM (Sequence Alignment/Map) format, does sorting, merging and indexing, and allows to retrieve reads in any regions swiftly. It is designed to work on a stream, and is able to open a BAM (not SAM) file on a remote FTP or HTTP server.

The package is enhanced by the following packages: libbio-samtools-perl
Please cite: Heng Li, Bob Handsaker, Alec Wysoker, Tim Fennell, Jue Ruan, Nils Homer, Gabor Marth, Goncalo Abecasis, Richard Durbin and 1000 Genome Project Data Processing Subgroup: The Sequence Alignment/Map (SAM) Format and SAMtools. (PubMed,eprint) Bioinformatics 25(16):2078-2079 (2009)
Ssake
genomics application for assembling millions of very short DNA sequences
Versions of package ssake
ReleaseVersionArchitectures
squeeze3.5-1all
wheezy3.8-2all
sid3.8-2all
Debtags of package ssake:
biologynuceleic-acids
fieldbiology
interfaceshell
roleprogram
scopeutility
useanalysing
Popcon: 10 users (7 upd.)*
Versions and Archs
License: DFSG free
Svn

The Short Sequence Assembly by K-mer search and 3′ read Extension (SSAKE) is a genomics application for aggressively assembling millions of short nucleotide sequences by progressively searching for perfect 3′-most k-mers using a DNA prefix tree. SSAKE is designed to help leverage the information from short sequences reads by stringently clustering them into contigs that can be used to characterize novel sequencing targets.

Please cite: Rene L. Warren, Granger G. Sutton, Steven J. M. Jones and Robert A. Holt: Assembling millions of short DNA sequences using SSAKE. (PubMed,eprint) Bioinformatics 23(4):500-501 (2007)
Tophat
fast splice junction mapper for RNA-Seq reads
Versions of package tophat
ReleaseVersionArchitectures
sid2.0.8-1i386
sid2.0.8b-1amd64,armhf,hurd-i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x
Popcon: 1 users (3 upd.)*
Versions and Archs
License: DFSG free
Git

TopHat aligns RNA-Seq reads to mammalian-sized genomes using the ultra high-throughput short read aligner Bowtie, and then analyzes the mapping results to identify splice junctions between exons. TopHat is a collaborative effort between the University of Maryland Center for Bioinformatics and Computational Biology and the University of California, Berkeley Departments of Mathematics and Molecular and Cell Biology.

The package is enhanced by the following packages: cufflinks
Please cite: Cole Trapnell, Lior Pachter and Steven L. Salzberg: TopHat: discovering splice junctions with RNA-Seq. (PubMed,eprint) Bioinformatics 25(9):1105-1111 (2009)
Tree-ppuzzle
reconstrução paralelizada de árvores filogenéticas por máxima probabilidade
Versions of package tree-ppuzzle
ReleaseVersionArchitectures
squeeze5.2-5amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy5.2-7amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
sid5.2-7amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
Debtags of package tree-ppuzzle:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing, analysing
works-with-formatplaintext
Popcon: 3 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

TREE-PUZZLE (o novo nome para o PUZZLE) é um programa interativo para console que implementa um algoritmo de procura rápida em árvore, "quartet puzzling", que permite análise de grandes conjuntos de dados e automaticamente atribui estimativas de suporte para cada ramo interno. TREE-PUZZLE oferece um método novo, com mapeamento por estimativa, para investigar o suporte a um ramo interno hipotético sem computar uma árvore completa e visualizar o conteúdo filogenético de um alinhamento de seqüência.

Esta é a versão paralelizada do tree-puzzle.

Please cite: Heiko A. Schmidt, Korbinian Strimmer, Martin Vingron and Arndt von Haeseler: TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. (PubMed,eprint) Bioinformatics 18(3):502-504 (2002)
Tree-puzzle
reconstrução de árvores filogenéticas por máxima probabilidade
Versions of package tree-puzzle
ReleaseVersionArchitectures
squeeze5.2-5amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy5.2-7amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
sid5.2-7amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
Debtags of package tree-puzzle:
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
scopeutility
usecomparing, analysing
works-with-formatplaintext
Popcon: 12 users (10 upd.)*
Versions and Archs
License: DFSG free
Svn

TREE-PUZZLE (o novo nome para o PUZZLE) é um programa interativo para console que implementa um algoritmo de procura rápida em árvore, "quartet puzzling", que permite análise de grandes conjuntos de dados e automaticamente atribui estimativas de suporte para cada ramo interno. TREE-PUZZLE oferece um método novo, com mapeamento por estimativa, para investigar o suporte a um ramo interno hipotético sem computar uma árvore completa e visualizar o conteúdo filogenético de um alinhamento de seqüência.

Please cite: Heiko A. Schmidt, Korbinian Strimmer, Martin Vingron and Arndt von Haeseler: TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. (PubMed,eprint) Bioinformatics 18(3):502-504 (2002)
Velvet
Nucleic acid sequence assembler for very short reads
Versions of package velvet
ReleaseVersionArchitectures
squeeze1.0.02~nozlibcopy-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.2.03~nozlibcopy-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.2.03~nozlibcopy-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream1.2.09
Debtags of package velvet:
biologynuceleic-acids
fieldbiology:bioinformatics, biology
interfacecommandline
roleprogram
useanalysing
Popcon: 16 users (7 upd.)*
Newer upstream!
License: DFSG free
Svn

Velvet is a de novo genomic assembler specially designed for short read sequencing technologies, such as Solexa or 454, developed by Daniel Zerbino and Ewan Birney at the European Bioinformatics Institute (EMBL-EBI), near Cambridge, in the United Kingdom.

Velvet currently takes in short read sequences, removes errors then produces high quality unique contigs. It then uses paired read information, if available, to retrieve the repeated areas between contigs.

Please cite: Daniel R. Zerbino and Ewan Birney: Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. (PubMed,eprint) Genome Research 18(5):821-829 (2008)

Debian packages in contrib or non-free

Clustalw-mpi
MPI-distributed global sequence alignment with ClustalW
Versions of package clustalw-mpi
ReleaseVersionArchitectures
squeeze0.15-1 (non-free)amd64,armel,i386,ia64,mips,mipsel,powerpc,s390,sparc
wheezy0.15-2 (non-free)amd64
sid0.15-2 (non-free)amd64
Debtags of package clustalw-mpi:
fieldbiology
interfacetext-mode, commandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 3 users (1 upd.)*
Versions and Archs
License: non-free
Svn

ClustalW is a popular tool for multiple sequence alignment. The alignment is achieved via three steps: pairwise alignment, guide-tree generation and progressive alignment. ClustalW-MPI is an MPI implementation of ClustalW. Based on version 1.82 of the original ClustalW, both the pairwise and progressive alignments are parallelized with MPI, a popular message passing programming standard. The pairwise alignments can be easily parallelized since the many alignments are time independent on each other. However the progressive alignments are essentially not parallelizable because of the time dependencies between each alignment.

Here the recursive parallelism paradigm is applied to the linear space profile-profile alignment algorithm. This approach is more time efficient on computers with distributed memory architecture. Traditional approach that relies on precomputing the profile-profile score matrix has also been implemented. Results shown the latter is indeed more appropriate for shared memory multiprocessor computer.

ClustalX is suggested for its support for local realignments, seaview is a versatile editor of alignments.

The original ClustalW/ClustalX can be found at URL: http://www.clustal.org/download/pre-2/

Please cite: Kuo-Bin Li: ClustalW-MPI: ClustalW Analysis Using Distributed and Parallel Computing. (PubMed) Bioinformatics 19(12):1585-1586 (2003)
Embassy-phylip
EMBOSS conversions of the programs in the phylip package
Versions of package embassy-phylip
ReleaseVersionArchitectures
wheezy3.69+20110714-1 (non-free)amd64
sid3.69+20110714-1 (non-free)amd64
upstream3.69.650
Popcon: 4 users (0 upd.)*
Newer upstream!
License: non-free
Svn

This package is the adaptation of the PHYLIP package in which its programs can operate with the biological sequence formats and databases of the European Molecular Biology Open Software Suite (EMBOSS). The software packages adapted for EMBOSS are called EMBASSY.

PHYLIP (the PHYLogeny Inference Package) is a package of programs for inferring phylogenies (evolutionary trees). Methods that are available in the package include parsimony, distance matrix, and likelihood methods, including bootstrapping and consensus trees. Data types that can be handled include molecular sequences, gene frequencies, restriction sites and fragments, distance matrices, and discrete characters.

The EMBASSY PHYLIP programs all have the prefix "f" to distinguish them from the original programs and avoid namespace conflict.

Packaging has started and developers might try the packaging code in VCS

Bagpipe
genomewide LD mapping
License: GPL3+
Debian package not available
Svn
Version: 2012.02.15-1

Bagpipe is a program for performing genomewide linkage disequilibrium mapping of quantitative trait loci in populations whose genome structure can be accommodated in the HAPPY framework [Mott00]. This includes most diploid crosses where the founders of the individuals have known genotypes.

  • Bagpipe is a simplified and streamlined version of Bagphenotype that does not currently include resample model averaging (RMA) capabilities.
  • Bagpipe can help fit single locus regression models (with or without random effects) to marker intervals whose genetic ancestry is inferred using the HAPPY software.
  • Bagpipe cannot help you decide what is a sensible model to fit.
  • Bagpipe does not currently accommodate populations with significant population structure, except through the specification of simple random intercepts based on unpatterned covariance matrices.
  • Bagpipe is named after the Scottish wind instrument "the bagpipes" and after Bagphenotype, which in turn was a PIPEline for BAGging-based multiple QTL analysis of phenoTYPEs. Bagphenotype was in turn based on software written by Richard Mott and William Valdar to analyze heterogeneous stock mice in [Valdar06].
  • Bagpipe is experimental software, is provided free of charge subject to copyleft restrictions, and comes with no guarantees whatsoever.

[Mott00] Mott R, Talbot CJ, Turri MG, Collins AC, Flint, J (2000) A method for fine mapping quantitative trait loci in outbred animal stocks. Proceedings of the National Academy of Sciences of the United States of America, 97(23), 12649-54. [Valdar06] Valdar W, Solberg LC, Gaugier D, Burnett S, Klenerman P, Cookson WO, Taylor M, Rawlins JNP, Mott R, Flint J (2006) Genome-wide genetic association of complex traits in outbred mice. Nature Genetics 38(8):879-87. PMID:16832355

R-other-valdar-bagphenotype.library
GNU R extension of the functionality of happy
License: GPL-3+
Debian package not available
Svn
Version: 0.22-1

mapping QTLs in populations descended from known founders

*Popularitycontest results: number of people who use this package regularly (number of people who upgraded this package recently) out of 164770