Summary
Biology
Debian Med micro-biology packages
This metapackage will install Debian packages related to molecular biology,
structural biology and bioinformatics for use in life sciences.
The list to the right includes various software projects which are of some interest to the Debian Med Project. Currently, only a few of them are available as Debian packages. It is our goal, however, to include all software in Debian Med which can sensibly add to a high quality Debian Pure Blend.
For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:
If you discover a project which looks like a good candidate for Debian Med
to you, or if you have prepared an unofficial Debian package, please do not hesitate to
send a description of that project to the Debian Med mailing list
Links to other tasks
|
Debian Med Biology packages
Official Debian packages with high relevance
|
Adun.app
Molecular Simulator for GNUstep
|
| Versions of package adun.app |
| Release | Version | Architectures |
| sid | 0.81-1 | sparc,powerpc,hurd-i386,hppa,armel,s390,amd64,alpha,ia64,i386 |
| sid | 0.8.2-1+b2 | mips,mipsel |
| squeeze | 0.8.2-1+b2 | s390,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc |
| lenny | 0.8.2-1+b1 | armel,i386,amd64,hppa,powerpc,mips,arm,sparc,s390,alpha,mipsel,ia64 |
| etch | 0.6-3 | powerpc,sparc,mipsel,arm,alpha,hppa,amd64,s390,i386,ia64 |
| Debtags of package adun.app: |
| field | biology:structural, biology |
| interface | x11 |
| role | program |
| scope | application |
| suite | gnustep |
| uitoolkit | gnustep |
| use | viewing, organizing, analysing |
| works-with | db, 3dmodel |
| x11 | application |
|
License: DFSG free
|
|
Adun is a biomolecular simulator that also includes data management and
analysis capabilities. It was developed at the Computational Biophysics
and Biochemistry Laboratory, a part of the Research Unit on Biomedical
Informatics of the UPF.
|
|
|
Alien-hunter
Interpolated Variable Order Motifs to identify horizontally acquired DNA
|
| Versions of package alien-hunter |
| Release | Version | Architectures |
| sid | 1.7-1 | all |
| squeeze | 1.7-1 | all |
|
License: DFSG free
|
|
Alien_hunter is an application for the prediction of putative
Horizontal Gene Transfer (HGT) events with the implementation of
Interpolated Variable Order Motifs (IVOMs). An IVOM approach
exploits compositional biases using variable order motif distributions
and captures more reliably the local composition of a sequence compared
to fixed-order methods. Optionally the predictions can be parsed into a
2-state 2nd order Hidden Markov Model (HMM), in a change-point detection
framework, to optimize the localization of the boundaries of the
predicted regions. The predictions (embl format) can be automatically
loaded into Artemis genome viewer freely available at:
http://www.sanger.ac.uk/Software/Artemis/.
The manuscript describing the alien_hunter algorithm is available
from Bioinformatics: Interpolated variable order motifs for
identification of horizontally acquired DNA: revisiting the
Salmonella pathogenicity islands. Vernikos GS, Parkhill J
Bioinformatics. 2006;. PMID: 16837528
|
|
|
Altree
program to perform phylogeny based analyses
|
| Versions of package altree |
| Release | Version | Architectures |
| lenny | 1.0.1-3 | arm,i386,armel,mipsel,ia64,alpha,mips,amd64,s390,hppa,sparc,powerpc |
| sid | 1.0.1-3 | ia64,alpha,amd64,armel,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390,sparc |
| squeeze | 1.0.1-3 | mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390 |
| upstream | 1.1.0 |
| Debtags of package altree: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | shared-lib, program |
| scope | utility |
| use | comparing, analysing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
ALTree was designed to perform phylogeny based analysis: first,
it allows the detection of an association between a candidate gene and
a disease, and second, it enables to make hypothesis about the
susceptibility loci.
|
|
|
Amap-align
Protein multiple alignment by sequence annealing
|
| Versions of package amap-align |
| Release | Version | Architectures |
| lenny | 2.2-1 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| sid | 2.2-1 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 2.2-1 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 2.0-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package amap-align: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | special:todo, comparing, analysing |
| works-with | special:todo |
| works-with-format | plaintext |
|
License: DFSG free
|
|
AMAP is a command line tool to perform multiple alignment of peptidic
sequences. It utilizes posterior decoding, and a sequence-annealing
alignment, instead of the traditional progressive alignment method. It
is the only alignment program that allows to control the sensitivity /
specificity tradeoff. It is based on the ProbCons source code, but
uses alignment metric accuracy and eliminates the consistency
transformation.
The java visualisation tool of AMAP 2.2 is not yet packaged in Debian.
|
|
|
Autodock
analysis of ligand binding to protein structure
|
| Versions of package autodock |
| Release | Version | Architectures |
| sid | 4.2.2.1-2 | hppa,s390,mipsel,armel,alpha,powerpc,amd64,i386,ia64,hurd-i386,sparc,mips |
| squeeze | 4.2.1+rc1-2 | ia64,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc |
| lenny | 4.0.1-3 | mips,i386,s390,alpha,ia64,arm,armel,amd64,powerpc,hppa,sparc,mipsel |
| Debtags of package autodock: |
| field | biology:structural, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | analysing |
| works-with | 3dmodel |
|
License: DFSG free
|
|
AutoDock is a prime representative of the programs addressing the
simulation of the docking of fairly small chemical ligands to rather big
protein receptors. Earlier versions had all flexibility in the ligands
while the protein was kept rather ridgid. This latest version 4 also
allows for a flexibility of selected sidechains of surface residues,
i.e., takes the rotamers into account.
The AutoDock program performs the docking of the ligand to a set of
grids describing the target protein. AutoGrid pre-calculates these grids.
The package is enhanced by the following packages:
autogrid
Please register by following this link if you are using autodock.
|
|
|
Ballview
A free molecular modeling and molecular graphics tool
|
| Versions of package ballview |
| Release | Version | Architectures |
| sid | 1.3.0-1 | i386,amd64,alpha,s390 |
| squeeze | 1.3.0-1 | powerpc,amd64,s390,i386,ia64 |
| squeeze-proposed-updates | 1.3.0-1 | sparc,ia64,powerpc |
|
License: DFSG free
|
|
BALLView provides fast OpenGL-based visualization of molecular structures,
molecular mechanics methods (minimization, MD simulation using the
AMBER, CHARMM, and MMFF94 force fields), calculation and visualization
of electrostatic properties (FDPB) and molecular editing features.
BALLView can be considered a graphical user interface on the basis of
BALL (Biochemical Algorithms Library), having taken the most common
demands of protein chemists and biophysicists in particular taken into account.
which is currently being developed in the groups of Hans-Peter Lenhof
(Saarland University, Saarbruecken, Germany) and Oliver Kohlbacher
(University of Tuebingen, Germany). BALL is an application framework
in C++ that has been specifically designed for rapid software
development in Molecular Modeling and Computational Molecular Biology.
It provides an extensive set of data structures as well as classes
for Molecular Mechanics, advanced solvation methods, comparison and
analysis of protein structures, file import/export, and visualization.
|
|
|
Blast2
Basic Local Alignment Search Tool
|
| Versions of package blast2 |
| Release | Version | Architectures |
| sid | 2.2.21.20090809-1+b1 | ia64,armel |
| squeeze | 2.2.21.20090809-1+b1 | armel,ia64 |
| sid | 2.2.21.20090809-1 | s390,alpha,amd64,hppa,hurd-i386,i386,mips,mipsel,powerpc,sparc |
| squeeze | 2.2.21.20090809-1 | mips,mipsel,sparc,powerpc,i386,hppa,s390,amd64 |
| lenny | 2.2.18.20080302-4 | powerpc,ia64,mips,sparc,mipsel,arm,armel,hppa,s390,amd64,i386,alpha |
| etch | 2.2.15.20061015-1 | hppa,mipsel,amd64,arm,mips,i386,powerpc,sparc,s390,ia64,alpha |
| Debtags of package blast2: |
| field | biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | searching |
|
License: DFSG free
|
|
The famous sequence alignment program. This is "official" NCBI version,
#2. The blastall executable allows you to give a nucleotide or protein
sequence to the program. It is compared against databases and a summary of
matches is returned to the user.
Note that databases are not included in Debian; they must be retrieved
manually.
The package is enhanced by the following packages:
mcl
|
|
|
Boxshade
Pretty-printing of multiple sequence alignments
|
| Versions of package boxshade |
| Release | Version | Architectures |
| sid | 3.3.1-4 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 3.3.1-4 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 3.3.1-3 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| etch | 3.1.1-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package boxshade: |
| biology | peptidic, nuceleic-acids, format:aln |
| devel | prettyprint |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | typesetting |
| works-with-format | tex, special:todo, postscript, plaintext, html |
|
License: DFSG free
|
|
Boxshade is a program for creating good looking printouts from
multiple-aligned protein or DNA sequences. The program does not perform
the alignment by itself and requires as input a file that was created
by a multiple alignment program or manually edited with respective tools.
Boxshade reads multiple-aligned sequences from either PILEUP-MSF,
CLUSTAL-ALN, MALIGNED-data and ESEE-save files (limited to a maximum
of 150 sequences with up to 10000 elements each). Various kinds of
shading can be applied to identical/similar residues. Output is written
to screen or to a file in the following formats: ANSI/VT100, PS/EPS,
RTF, HPGL, ReGIS, LJ250-printer, ASCII, xFIG, PICT, HTML
|
|
|
Bwa
|
| Versions of package bwa |
| Release | Version | Architectures |
| sid | 0.5.4-1 | alpha,amd64,armel,hurd-i386,i386,ia64,mips,mipsel,powerpc,s390,sparc |
| squeeze | 0.5.4-1 | amd64,mips,armel,sparc,mipsel,i386,s390,ia64,powerpc |
|
License: DFSG free
|
|
Burrows-Wheeler Aligner (BWA) is a program that aligns relatively short
nucleotide sequences against a long reference sequence such as the human
genome. It implements two algorithms, bwa-short and BWA-SW. The former works
for query sequences shorter than 200 bp and the latter for longer sequences up
to around 100 kbp. Both algorithms do gapped alignment. They are usually more
accurate and faster on queries with low error rates.
|
|
|
Dialign
Segment-based multiple sequence alignment
|
| Versions of package dialign |
| Release | Version | Architectures |
| lenny | 2.2.1-3 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| sid | 2.2.1-3 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 2.2.1-3 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 2.2.1-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package dialign: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing |
| works-with | special:todo |
| works-with-format | plaintext |
|
License: DFSG free
|
|
DIALIGN2 is a command line tool to perform multiple alignment of
protein or DNA sequences. It constructs alignments from gapfree pairs
of similar segments of the sequences. This scoring scheme for
alignments is the basic difference between DIALIGN and other global or
local alignment methods. Note that DIALIGN does not employ any kind of
gap penalty. It has been published by Morgenstern B. in
Bioinformatics. 1999 Mar;15(3):211-8.
|
|
|
Dialign-tx
Segment-based multiple sequence alignment
|
| Versions of package dialign-tx |
| Release | Version | Architectures |
| sid | 1.0.2-1 | ia64,alpha,amd64,armel,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390,sparc |
| squeeze | 1.0.2-1 | mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390 |
| lenny | 1.0.1-2 | arm,i386,armel,mipsel,ia64,alpha,mips,amd64,s390,hppa,sparc,powerpc |
| Debtags of package dialign-tx: |
| field | biology:bioinformatics, biology |
| role | program |
| scope | utility |
| use | comparing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
DIALIGN-TX is a command line tool to perform multiple alignment of protein or
DNA sequences. It is a complete reimplementation of the segment-base approach
including several new improvements and heuristics that significantly enhance
the quality of the output alignments compared to DIALIGN 2.2 and DIALIGN-T.
For pairwise alignment, DIALIGN-TX uses a fragment-chaining algorithm that
favours chains of low-scoring local alignments over isolated high-scoring
fragments. For multiple alignment, DIALIGN-TX uses an improved greedy
procedure that is less sensitive to spurious local sequence similarities.
DIALIGN-TX has been published in Amarendran R. Subramanian, Michael Kaufmann,
Burkhard Morgenstern: Improvement of the segment-based approach for multiple
sequence alignment by combining greedy and progressive alignment strategies,
Algorithms for Molecular Biology 3:6, 2008
|
|
|
Embassy-domainatrix
Extra EMBOSS commands to handle domain classification file
|
| Versions of package embassy-domainatrix |
| Release | Version | Architectures |
| lenny | 0.1.0-1 | i386,armel,s390,ia64,sparc,alpha,mips,powerpc,mipsel,hppa,arm,amd64 |
| sid | 0.1.0+20090715-1+b1 | armel,ia64 |
| squeeze | 0.1.0+20090715-1+b1 | armel,ia64 |
| sid | 0.1.0+20090715-1 | alpha,mipsel,powerpc,sparc,hppa,i386,amd64,s390,mips |
| squeeze | 0.1.0+20090715-1 | sparc,amd64,hppa,i386,mips,mipsel,powerpc,s390 |
| Debtags of package embassy-domainatrix: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | searching, editing, converting, analysing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
The DOMAINATRIX programs were developed by Jon Ison and colleagues at MRC HGMP
for their protein domain research. They are included as an EMBASSY package as
a work in progress.
Applications in the current domainatrix release are cathparse (generates DCF
file from raw CATH files), domainnr (removes redundant domains from a DCF
file), domainreso (removes low resolution domains from a DCF file), domainseqs
(adds sequence records to a DCF file), domainsse (adds secondary structure
records to a DCF file), scopparse (generates DCF file from raw SCOP files) and
ssematch (searches a DCF file for secondary structure matches).
|
|
|
Embassy-domalign
Extra EMBOSS commands for protein domain alignment
|
| Versions of package embassy-domalign |
| Release | Version | Architectures |
| lenny | 0.1.0-1 | mips,hppa,sparc,i386,amd64,ia64,s390,powerpc,mipsel,arm,armel,alpha |
| sid | 0.1.0+20090715-1+b1 | powerpc,ia64,armel,hppa |
| squeeze | 0.1.0+20090715-1+b1 | armel,powerpc,ia64,hppa |
| sid | 0.1.0+20090715-1 | amd64,sparc,mipsel,alpha,mips,i386,s390 |
| squeeze | 0.1.0+20090715-1 | sparc,amd64,i386,mips,mipsel,s390 |
| Debtags of package embassy-domalign: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | editing, comparing, analysing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
The DOMALIGN programs were developed by Jon Ison and colleagues at MRC HGMP
for their protein domain research. They are included as an EMBASSY package as
a work in progress.
Applications in the current domalign release are allversusall (sequence
similarity data from all-versus-all comparison), domainalign (generates
alignments (DAF file) for nodes in a DCF file), domainrep (reorders DCF file
to identify representative structures) and seqalign (extend alignments (DAF
file) with sequences (DHF file)).
|
|
|
Embassy-domsearch
Extra EMBOSS commands to search for protein domains
|
| Versions of package embassy-domsearch |
| Release | Version | Architectures |
| sid | 0.1.0+20980715-1 | mips,armel,mipsel,i386,powerpc,amd64,s390,ia64,sparc,alpha,hppa |
| squeeze | 0.1.0+20980715-1 | sparc,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390 |
|
License: DFSG free
|
|
The DOMSEARCH programs were developed by Jon Ison and colleagues at MRC HGMP
for their protein domain research. They are included as an EMBASSY package as
a work in progress.
Applications in this DOMSEARCH release are seqfraggle (removes fragment
sequences from DHF files), seqnr (removes redundancy from DHF files), seqsearch
(generates PSI-BLAST hits (DHF file) from a DAF file), seqsort (Remove
ambiguous classified sequences from DHF files) and seqwords (Generates DHF
files from keyword search of UniProt).
|
|
|
Emboss
the european molecular biology open software suite
|
| Versions of package emboss |
| Release | Version | Architectures |
| sid | 6.1.0-4 | ia64,alpha,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc |
| squeeze | 6.1.0-4 | mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390 |
| lenny | 5.0.0-7 | i386,arm,mipsel,ia64,alpha,mips,armel,amd64,s390,hppa,sparc,powerpc |
| Debtags of package emboss: |
| field | biology:molecular, biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | suite |
| use | viewing, typesetting, text-formatting, searching, organizing, editing, converting, comparing, analysing |
| works-with | special:todo, db |
| works-with-format | special:todo, plaintext |
|
License: DFSG free
|
|
EMBOSS is a free Open Source software analysis package specially developed for
the needs of the molecular biology (e.g. EMBnet) user community. The software
automatically copes with data in a variety of formats and even allows
transparent retrieval of sequence data from the web. Also, as extensive
libraries are provided with the package, it is a platform to allow other
scientists to develop and release software in true open source spirit. EMBOSS
also integrates a range of currently available packages and tools for sequence
analysis into a seamless whole. EMBOSS breaks the historical trend towards
commercial software packages.
|
|
|
Exonerate
generic tool for pairwise sequence comparison
|
| Versions of package exonerate |
| Release | Version | Architectures |
| sid | 2.2.0-2 | ia64,alpha,amd64,armel,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390,sparc |
| squeeze | 2.2.0-2 | mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390 |
| lenny | 2.0.0-1+b1 | sparc |
| lenny | 2.0.0-1 | powerpc,i386,armel,mipsel,ia64,alpha,mips,amd64,s390,hppa,arm |
| Debtags of package exonerate: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | searching |
| works-with-format | plaintext |
|
License: DFSG free
|
|
Exonerate allows you to align sequences using a many alignment models, using
either exhaustive dynamic programming, or a variety of heuristics. Much of
the functionality of the Wise dynamic programming suite was reimplemented in C
for better efficiency. Exonerate is an intrinsic component of the building of
the Ensembl genome databases, providing similarity scores between RNA and DNA
sequences and thus determining splice variants and coding sequences in
general.
An In-silico PCR Experiment Simulation System (see the ipcress man page) is
packaged with exonerate.
This package also comes with a selection of utilities for performing
simple manipulations quickly on fasta files beyond 2Gb
|
|
|
Fastdnaml
Tool for construction of phylogenetic trees of DNA sequences
|
| Versions of package fastdnaml |
| Release | Version | Architectures |
| sid | 1.2.2-9 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 1.2.2-9 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 1.2.2-8 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| etch | 1.2.2-6 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package fastdnaml: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing, analysing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
fastDNAml is a program derived from Joseph Felsenstein's version 3.3 DNAML
(part of his PHYLIP package). Users should consult the documentation for
DNAML before using this program.
fastDNAml is an attempt to solve the same problem as DNAML, but to do so
faster and using less memory, so that larger trees and/or more bootstrap
replicates become tractable. Much of fastDNAml is merely a recoding of the
PHYLIP 3.3 DNAML program from PASCAL to C.
Note that the homepage of this program is not available any more and so
this program will probably not see any further updates.
|
|
|
Fastlink
A faster version of pedigree programs of Linkage
|
| Versions of package fastlink |
| Release | Version | Architectures |
| sid | 4.1P-fix95-1 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 4.1P-fix95-1 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 4.1P-fix94b-2 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| etch | 4.1P-fix92-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package fastlink: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing, analysing |
|
License: DFSG free
|
|
Genetic linkage analysis is a statistical technique used to map
genes and find the approximate location of disease genes. There
was a standard software package for genetic linkage called
LINKAGE. FASTLINK is a significantly modified and improved
version of the main programs of LINKAGE that runs much faster
sequentially, can run in parallel, allows the user to recover
gracefully from a computer crash, and provides abundant new
documentation. FASTLINK has been used in over 1000 published
genetic linkage studies.
This package contains the following programs:
ilink: GEMINI optimization procedure to find a locally
optimal value of the theta vector of recombination
fractions
linkmap: calculates location scores of one locus against a
fixed map of other loci
lodscore: compares likelihoods at locally optimal theta
mlink: calculates lod scores and risk with two of more loci
unknown: identify possible genotypes for unknowns
|
|
|
Gamgi
construct, view and analyse atomic structures
|
| Versions of package gamgi |
| Release | Version | Architectures |
| sid | 0.14.2-1 | sparc,amd64,armel,hurd-i386,i386,ia64,mips,mipsel,s390,alpha |
| sid | 0.14.1-1 | hppa,powerpc |
| squeeze | 0.14-1 | hppa,ia64,mips,mipsel,armel,powerpc,s390,i386,sparc,amd64 |
|
License: DFSG free
|
|
GAMGI provides a graphic interface to build and analyse atomic
structures. The program is aimed at the scientific community,
who needs a graphic interface to study atomic structures and
to prepare images for presentations, and for teaching the atomic
structure of matter.
|
|
|
Garlic
A visualization program for biomolecules
|
| Versions of package garlic |
| Release | Version | Architectures |
| lenny | 1.6-1 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| sid | 1.6-1 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 1.6-1 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 1.5-2 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package garlic: |
| field | chemistry, biology |
| interface | x11 |
| role | program |
| scope | utility |
| use | viewing |
| x11 | application |
|
License: DFSG free
|
|
Garlic is written for the investigation of membrane proteins. It may be
used to visualize other proteins, as well as some geometric objects.
This version of garlic recognizes PDB format version 2.1. Garlic may
also be used to analyze protein sequences.
It only depends on the X libraries, no other libraries are needed.
Features include:
-
The slab position and thickness are visible in a small window.
-
Atomic bonds as well as atoms are treated as independent drawable
objects.
-
The atomic and bond colors depend on position. Five mapping modes
are available (as for slab).
-
Capable to display stereo image.
-
Capable to display other geometric objects, like membrane.
-
Atomic information is available for atom covered by the mouse
pointer. No click required, just move the mouse pointer over the
structure!
-
Capable to load more than one structure.
-
Capable to draw Ramachandran plot, helical wheel, Venn diagram,
averaged hydrophobicity and hydrophobic moment plot.
-
The command prompt is available at the bottom of the main window.
It is able to display one error message and one command string.
|
|
|
Gdpc
visualiser of molecular dynamic simulations
|
| Versions of package gdpc |
| Release | Version | Architectures |
| sid | 2.2.5-1 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 2.2.5-1 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 2.2.4-3 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| etch | 2.2.4-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package gdpc: |
| field | physics, chemistry, biology:structural, biology |
| interface | x11 |
| role | program |
| scope | application |
| uitoolkit | gtk |
| use | viewing |
| works-with | video, image, 3dmodel |
| works-with-format | png, jpg |
| x11 | application |
|
License: DFSG free
|
|
gpdc is a graphical program for visualising output data from
molecular dynamics simulations. It reads input in the standard xyz
format, as well as other custom formats, and can output pictures of
each frame in JPG or PNG format.
|
|
|
Gff2aplot
pair-wise alignment-plots for genomic sequences in PostScript
|
| Versions of package gff2aplot |
| Release | Version | Architectures |
| lenny | 2.0-5 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| sid | 2.0-5 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 2.0-5 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 2.0-3 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package gff2aplot: |
| field | biology:bioinformatics, biology |
| interface | shell, commandline |
| role | program |
| scope | utility |
| use | viewing, converting |
| works-with | image:vector |
| works-with-format | postscript, plaintext |
|
License: DFSG free
|
|
A program to visualize the alignment of two genomic sequences together with
their annotations. From GFF-format input files it produces PostScript figures
for that alignment.
The following menu lists many features of gff2aplot:
-
Comprehensive alignment plots for any GFF-feature. Attributes are defined
separately so you can modify only whatsoever attributes for a given file or
share same customization across different data-sets.
-
All parameters are set by default within the program, but it can be also
fully configured via gff2ps-like flexible customization files. Program can
handle several of such files, summarizing all the settings before producing
the corresponding figure. Moreover, all customization parameters can be set
via command-line switches, which allows users to play with those parameters
before adding any to a customization file.
-
Source order is taken from input files, if you swap file order you can
visualize alignment and its annotation with the new input arrangement.
-
All alignment scores can be visualized in a PiP box below gff2aplot area,
using grey-color scale, user-defined color scale or score-dependent
gradients.
-
Scalable fonts, which can also be chosen among the basic PostScript default
fonts. Feature and group labels can be rotated to improve readability in
both annotation axes.
-
The program is still defined as a Unix filter so it can handle data from
files, redirections and pipes, writing output to standard-output and
warnings to standard error.
-
gff2aplot is able to manage many physical page formats (from A0 to A10, and
more -see available page sizes in its manual-), including user-defined ones.
This allows, for instance, the generation of poster size genomic maps, or
the use of a continuous-paper supporting plotting device, either in portrait
or landscape.
-
You can draw different alignments on same alignment plot and distinguish
them by using different colors for each.
-
Shape dictionary has been expanded, so that further feature shapes are now
available (see manual).
-
Annotation projections through alignment plots (so called ribbons) emulate
transparencies via complementary color fill patterns. This feature allows
to show color pseudo-blending when horizontal and vertical ribbons overlap.
|
|
|
Gff2ps
produces PostScript graphical output from GFF-files
|
| Versions of package gff2ps |
| Release | Version | Architectures |
| lenny | 0.98d-3 | all |
| sid | 0.98d-3 | all |
| squeeze | 0.98d-3 | all |
| etch | 0.98d-2 | all |
| Debtags of package gff2ps: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | viewing, converting |
| works-with | image:vector |
| works-with-format | postscript |
|
License: DFSG free
|
|
gff2ps is a script program developed with the aim of converting gff-formatted
records into high quality one-dimensional plots in PostScript. Such plots
maybe useful for comparing genomic structures and to visualizing outputs from
genome annotation programs.
It can be used in a very simple way, because it assumes that the GFF file
itself carries enough formatting information, but it also allows through a
number of options and/or a configuration file, for a great degree of
customization.
|
|
|
Ghemical
A GNOME molecular modelling environment
|
| Versions of package ghemical |
| Release | Version | Architectures |
| sid | 2.99.2-1 | hppa,sparc,powerpc,armel,amd64,s390,mips,ia64,alpha,i386,hurd-i386 |
| squeeze | 2.95-3+b2 | amd64 |
| lenny | 2.95-3+b2 | amd64 |
| squeeze | 2.95-3+b1 | s390,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc |
| lenny | 2.95-3+b1 | i386,hppa,mipsel,sparc,mips,powerpc,ia64,s390,armel,alpha,arm |
| sid | 2.95-3+b1 | mipsel |
| etch | 2.10-1+b1 | hppa |
| etch | 2.10-1 | sparc,mips,alpha,mipsel,arm,s390,powerpc,i386,amd64,ia64 |
| Debtags of package ghemical: |
| field | chemistry |
| interface | x11, 3d |
| role | program |
| suite | gnome |
| uitoolkit | gtk |
| use | viewing, editing |
| works-with | 3dmodel |
| x11 | application |
|
License: DFSG free
|
|
Ghemical is a computational chemistry software package written in C++.
It has a graphical user interface and it supports both quantum-
mechanics (semi-empirical) models and molecular mechanics models.
Geometry optimization, molecular dynamics and a large set of
visualization tools using OpenGL are currently available.
Ghemical relies on external code to provide the quantum-mechanical
calculations. Semi-empirical methods MNDO, MINDO/3, AM1 and PM3 come
from the MOPAC7 package (Public Domain), and are included in the
package. The MPQC package is used to provide ab initio methods: the
methods based on Hartree-Fock theory are currently supported with
basis sets ranging from STO-3G to 6-31G.
|
|
|
Glam2
gapped protein motifs from unaligned sequences
|
| Versions of package glam2 |
| Release | Version | Architectures |
| lenny | 1064-1 | arm,i386,armel,mipsel,ia64,alpha,mips,amd64,s390,hppa,sparc,powerpc |
| sid | 1064-1 | ia64,alpha,amd64,armel,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390,sparc |
| squeeze | 1064-1 | mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390 |
| Debtags of package glam2: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | searching, comparing, analysing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
GLAM2 is a software package for finding motifs in sequences, typically
amino-acid or nucleotide sequences. A motif is a re-occurring sequence
pattern: typical examples are the TATA box and the CAAX prenylation motif. The
main innovation of GLAM2 is that it allows insertions and deletions in motifs.
The package includes these programs:
glam2: discovering motifs shared by a set of sequences;
glam2scan: finding matches, in a sequence database, to a motif discovered
by glam2;
glam2format: converting glam2 motifs to standard alignment formats;
glam2mask: masking glam2 motifs out of sequences, so that weaker motifs
can be found;
glam2-purge: removing highly similar members of a set of sequences.
In this package, the fast Fourier algorithm (FFT) was enabled for glam2.
If you use GLAM2, please cite: MC Frith, NFW Saunders, B Kobe, TL Bailey
(2008) Discovering sequence motifs with arbitrary insertions and deletions,
PLoS Computational Biology (in press).
|
|
|
Gromacs
Molecular dynamics simulator, with building and analysis tools
|
| Versions of package gromacs |
| Release | Version | Architectures |
| sid | 4.0.5-4 | s390,alpha,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 4.0.5-4 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 3.3.3-2 | alpha,s390,hppa,mips,amd64,powerpc,sparc,mipsel,arm,ia64,armel,i386 |
| etch | 3.3.1-4 | s390,mips,arm,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package gromacs: |
| field | chemistry, biology:structural, biology |
| interface | x11, commandline |
| role | program |
| uitoolkit | xlib |
| use | special:not-applicable |
| x11 | application |
|
License: DFSG free
|
|
GROMACS is a versatile package to perform molecular dynamics, i.e. simulate
the Newtonian equations of motion for systems with hundreds to millions of
particles.
It is primarily designed for biochemical molecules like proteins and lipids
that have a lot of complicated bonded interactions, but since GROMACS is
extremely fast at calculating the nonbonded interactions (that usually
dominate simulations) many groups are also using it for research on non-
biological systems, e.g. polymers.
GROMACS offers entirely too many features for a brief description to do it
justice. A more complete listing is available at
http://www.gromacs.org/content/view/12/176/.
|
|
|
Hmmer
profile hidden Markov models for protein sequence analysis
|
| Versions of package hmmer |
| Release | Version | Architectures |
| lenny | 2.3.2-5 | s390,hppa,amd64,sparc,powerpc,mips,mipsel,arm,ia64,armel,i386,alpha |
| sid | 2.3.2-5 | s390,alpha,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 2.3.2-5 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 2.3.2-2 | mips,arm,s390,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| sid | 2.1.4 | hurd-i386 |
| Debtags of package hmmer: |
| biology | peptidic, format:fasta, format:aln |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | searching |
| works-with | db |
| works-with-format | plaintext |
|
License: DFSG free
|
|
HMMER is an implementation of profile hidden Markov model methods for
sensitive searches of biological sequence databases using multiple sequence
alignments as queries.
Given a multiple sequence alignment as input, HMMER builds a statistical
model called a "hidden Markov model" which can then be used as a query into
a sequence database to find (and/or align) additional homologues of the
sequence family.
|
|
|
Infernal
inference of RNA secondary structural alignments
|
| Versions of package infernal |
| Release | Version | Architectures |
| sid | 1.0.2-1 | sparc,amd64,armel,hurd-i386,i386,ia64,mips,mipsel,s390,alpha |
| sid | 1.0-2 | powerpc |
| squeeze | 1.0-2 | i386,mipsel,amd64,powerpc,s390,ia64,sparc,mips,armel |
|
License: DFSG free
|
|
Infernal ("INFERence of RNA ALignment") is for searching DNA sequence
databases for RNA structure and sequence similarities. It is an
implementation of a special case of profile stochastic context-free
grammars called covariance models (CMs). A CM is like a sequence
profile, but it scores a combination of sequence consensus and RNA
secondary structure consensus, so in many cases, it is more capable of
identifying RNA homologs that conserve their secondary structure more
than their primary sequence.
The tool is an integral component of the Rfam database.
|
|
|
Kalign
Global and progressive multiple sequence alignment
|
| Versions of package kalign |
| Release | Version | Architectures |
| sid | 2.04-2 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 2.04-2 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 2.03-2 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| etch | 1.04-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package kalign: |
| biology | peptidic, nuceleic-acids, format:fasta, format:aln |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
Kalign is a command line tool to perform multiple alignment of
biological sequences. It employs the Muth-Manber string-matching
algorithm, to improve both the accuracy and speed of the alignment.
It uses global, progressive alignment approach, enriched by employing
an approximate string-matching algorithm to calculate sequence
distances and by incorporating local matches into the otherwise global
alignment.
|
|
|
Last-align
genome-scale comparison of biological sequences
|
| Versions of package last-align |
| Release | Version | Architectures |
| sid | 87-1 | sparc,amd64,armel,hurd-i386,i386,ia64,mips,mipsel,powerpc,s390,alpha |
| sid | 70-1 | hppa |
| squeeze | 70-1 | mips,hppa,mipsel,armel,powerpc,i386,s390,amd64,sparc,ia64 |
|
License: DFSG free
|
|
LAST is software for comparing and aligning sequences, typically DNA or
protein sequences. LAST is similar to BLAST, but it copes better with very
large amounts of sequence data. Here are two things LAST is good at:
-
Comparing large (e.g. mammalian) genomes.
-
Mapping lots of sequence tags onto a genome.
The main technical innovation is that LAST finds initial matches based on
their multiplicity, instead of using a fixed size (e.g. BLAST uses 11-mers).
This allows to map tags to genomes without repeat-masking, without becoming
overwhelmed by repetitive hits. To find these variable-sized matches, it uses
a suffix array (inspired by Vmatch). To achieve high sensitivity, it uses a
discontiguous suffix array, analogous to spaced seeds.
|
|
|
Loki
MCMC linkage analysis on general pedigrees
|
| Versions of package loki |
| Release | Version | Architectures |
| lenny | 2.4.7.4-4 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| sid | 2.4.7.4-4 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 2.4.7.4-4 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 2.4.7.4-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package loki: |
| field | biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | analysing |
|
License: DFSG free
|
|
Performs Markov chain Monte Carlo multipoint linkage analysis on large,
complex pedigrees. The current package supports analyses on quantitative
traits only, although this restriction will be lifted in later versions.
Joint estimation of QTL number, position and effects uses Reversible Jump
MCMC. It is also possible to perform affected only IBD sharing analyses.
The homepage of this project used to be at http://loki.homeunix.net
but the project is dead now and the homepage vanished. The Homepage
field above points to the web archive.
The package is enhanced by the following packages:
loki-doc
|
|
|
Mafft
Multiple alignment program for amino acid or nucleotide sequences
|
| Versions of package mafft |
| Release | Version | Architectures |
| sid | 6.716-1 | s390,mipsel,armel,alpha,powerpc,amd64,ia64,i386,sparc,mips,hurd-i386 |
| sid | 6.713-1 | hppa |
| squeeze | 6.713-1 | ia64,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc |
| lenny | 6.240-2 | mipsel,s390,mips,ia64,alpha,arm,armel,amd64,powerpc,hppa,i386,sparc |
| Debtags of package mafft: |
| biology | peptidic, nuceleic-acids, format:fasta, format:aln |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
MAFFT is a multiple sequence alignment program which offers three
accuracy-oriented methods:
L-INS-i (probably most accurate; recommended for <200 sequences;
iterative refinement method incorporating local pairwise alignment
information),
G-INS-i (suitable for sequences of similar lengths; recommended for
<200 sequences; iterative refinement method incorporating global
pairwise alignment information),
E-INS-i (suitable for sequences containing large unalignable regions;
recommended for <200 sequences),
and five speed-oriented methods:
FFT-NS-i (iterative refinement method; two cycles only),
FFT-NS-i (iterative refinement method; max. 1000 iterations),
FFT-NS-2 (fast; progressive method),
FFT-NS-1 (very fast; recommended for >2000 sequences; progressive
method with a rough guide tree),
NW-NS-PartTree-1 (recommended for ∼50,000 sequences; progressive
method with the PartTree algorithm).
|
|
|
Maq
maps short fixed-length polymorphic DNA sequence reads to reference sequences
|
| Versions of package maq |
| Release | Version | Architectures |
| sid | 0.7.1-3+b1 | armel,powerpc,hppa,ia64 |
| squeeze | 0.7.1-3+b1 | hppa,armel,ia64,powerpc |
| sid | 0.7.1-3 | i386,amd64,mips,alpha,sparc,s390,mipsel,hurd-i386 |
| squeeze | 0.7.1-3 | sparc,amd64,i386,mips,mipsel,s390 |
| Debtags of package maq: |
| biology | nuceleic-acids |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | searching, comparing, analysing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
Maq (short for Mapping and Assembly with Quality) builds mapping assemblies
from short reads generated by the next-generation sequencing machines. It is
particularly designed for Illumina-Solexa 1G Genetic Analyzer, and has a
preliminary functionality to handle ABI SOLiD data. Maq is previously known as
mapass2.
With Maq you can:
-
Fast align Illumina/SOLiD reads to the reference genome. With the
default options, one million pairs of reads can be mapped to the
human genome in about 10 CPU hours with less than 1G memory.
-
Accurately measure the error probability of the alignment of each
individual read.
-
Call the consensus genotypes, including homozygous and heterozygous
polymorphisms, with a Phred probabilistic quality assigned to each base.
-
Find short indels with paired end reads.
-
Accurately find large scale genomic deletions and translocations with
paired end reads.
-
Discover potential CNVs by checking read depth.
-
Evaluate the accuracy of raw base qualities from sequencers and help
to check the systematic errors.
However, Maq can NOT:
-
Do de novo assembly. (Maq can only call the consensus by mapping reads
to a known reference.)
-
Map shorts reads against themselves. (Maq can only find complete overlap
between reads.)
-
Align capillary reads or 454 reads to the reference. (Maq cannot align
reads longer than 63bp.)
This package is likely to be useful for users working with genetics
or genomic studies in biology who need to assembly DNA sequences from
fixed-length sequencers.
|
|
|
Massxpert
linear polymer mass spectrometry software
|
| Versions of package massxpert |
| Release | Version | Architectures |
| sid | 2.0.7-1 | mips,armel,mipsel,hurd-i386,powerpc,i386,s390,amd64,sparc,alpha,ia64,hppa |
| squeeze | 2.0.7-1 | sparc,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390 |
|
License: DFSG free
|
|
massXpert is a program to simulate and analyse mass spectrometric
data obtained on linear (bio-)polymers. It is the successor of GNU
polyxmass.
Four modules allow:
-
making brand new polymer chemistry definitions;
-
using the definitions to perform easy calculations in a desktop
calculator-like manner;
-
performing sophisticated polymer sequence editing and simulations;
-
perform m/z list comparisons;
Chemical simulations encompass cleavage (either chemical or
enzymatic), gas-phase fragmentations, chemical modification of any
monomer in the polymer sequence, cross-linking of monomers in the
sequence, arbitrary mass searches, calculation of the isotopic
pattern...
|
|
|
Melting
compute the melting temperature of nucleic acid duplex
|
| Versions of package melting |
| Release | Version | Architectures |
| sid | 4.3b-1 | hppa,s390,mipsel,armel,alpha,powerpc,amd64,i386,ia64,hurd-i386,sparc,mips |
| lenny | 4.2h-3 | mips,i386,s390,alpha,ia64,arm,armel,amd64,powerpc,hppa,sparc,mipsel |
| squeeze | 4.2h-3 | ia64,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc |
| upstream | 4.3c |
| Debtags of package melting: |
| biology | nuceleic-acids |
| field | biology:molecular, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | analysing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
This program computes, for a nucleic acid duplex, the enthalpy, the
entropy and the melting temperature of the helix-coil
transitions. Three types of hybridisation are possible: DNA/DNA,
DNA/RNA, and RNA/RNA. The program first computes the hybridisation
enthalpy and entropy from the elementary parameters of each Crick's
pair by the nearest-neighbor method. Then the melting temperature is
computed. The set of thermodynamic parameters can be easily changed,
for instance following an experimental breakthrough.
Please cite:
Nicolas Le Novère:
MELTING, computing the melting temperature of nucleic acid duplex
Bioinformatics, 17: S.1226-1227
(2001)
|
|
|
Mipe
Tools to store PCR-derived data
|
| Versions of package mipe |
| Release | Version | Architectures |
| lenny | 1.1-3 | all |
| sid | 1.1-3 | all |
| squeeze | 1.1-3 | all |
| etch | 1.1-1 | all |
| Debtags of package mipe: |
| field | biology:molecular, biology:bioinformatics, biology |
| interface | commandline |
| role | program, documentation |
| scope | utility |
| use | organizing |
| works-with-format | xml |
|
License: DFSG free
|
|
MIPE provides a standard format to exchange and/or storage of all
information associated with PCR experiments using a flat text file. This will:
-
allow for exchange of PCR data between researchers/laboratories
-
enable traceability of the data
-
prevent problems when submitting data to dbSTS or dbSNP
-
enable the writing of standard scripts to extract data (e.g. a
list of PCR primers, SNP positions or haplotypes for different animals)
Although this tool can be used for data storage, it's primary focus
should be data exchange. For larger repositories, relational databases
are more appropriate for storage of these data. The MIPE format could
then be used as a standard format to import into and/or export from
these databases.
MIPE was published in: Aerts J & Veenendaal T. MIPE - a XML-format to
facilitate the storage and exchange of PCR-related data. Online Journal of
Bioinformatics 6(2): 114-120 (2005).
Please cite:
Aerts J & Veenendaal T.:
MIPE - a XML-format to facilitate the storage and exchange of PCR-related data
Online Journal of Bioinformatics 6(2): 114-120
(2005)
|
|
|
Mummer
Efficient sequence alignment of full genomes
|
| Versions of package mummer |
| Release | Version | Architectures |
| sid | 3.22~dfsg-1 | mipsel,hppa,alpha,powerpc,armel,ia64,amd64,mips,i386,sparc |
| lenny | 3.20-3 | i386,mipsel,ia64,alpha,mips,arm,armel,amd64,s390,hppa,sparc,powerpc |
| sid | 3.20-3 | s390 |
| squeeze | 3.20-3 | ia64,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc |
| Debtags of package mummer: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
MUMmer is a system for rapidly aligning entire genomes, whether
in complete or draft form. For example, MUMmer 3.0 can find all
20-basepair or longer exact matches between a pair of 5-megabase genomes
in 13.7 seconds, using 78 MB of memory, on a 2.4 GHz Linux desktop
computer. MUMmer can also align incomplete genomes; it handles the 100s
or 1000s of contigs from a shotgun sequencing project with ease, and
will align them to another set of contigs or a genome using the NUCmer
program included with the system. If the species are too divergent for
DNA sequence alignment to detect similarity, then the PROmer program
can generate alignments based upon the six-frame translations of both
input sequences.
|
|
|
Muscle
Multiple alignment program of protein sequences
|
| Versions of package muscle |
| Release | Version | Architectures |
| lenny | 3.70+fix1-2 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| sid | 3.70+fix1-2 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 3.70+fix1-2 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 3.60-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package muscle: |
| biology | peptidic, nuceleic-acids, format:fasta, format:aln |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
MUSCLE is a multiple alignment program for protein sequences. MUSCLE
stands for multiple sequence comparison by log-expectation. In the
authors tests, MUSCLE achieved the highest scores of all tested
programs on several alignment accuracy benchmarks, and is also one of
the fastest programs out there.
|
|
|
Mustang
multiple structural alignment of proteins
|
| Versions of package mustang |
| Release | Version | Architectures |
| sid | 3.2.1-1 | sparc,amd64,armel,hppa,hurd-i386,ia64,mips,mipsel,powerpc,s390,alpha |
| sid | 3.2.0-1 | i386 |
| squeeze | 3.2.0-1 | mips,armel,mipsel,amd64,powerpc,s390,i386,sparc,hppa,ia64 |
| Debtags of package mustang: |
| biology | peptidic |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| use | comparing, analysing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
Mustang is an algorithm to align multiple protein structures.
Given a set of PDB files, the program uses the spatial
information in the Calpha atoms of the set to produce a sequence alignment.
Based on a progressive pairwise heuristic the algorithm then proceeds
through a number of refinement passes. Mustang reports the multiple
sequence alignment and the corresponding superposition of structures.
When using this program please cite A.S. Konagurthu, J.C. Whisstock,
P.J. Stuckey, A.M. Lesk, "MUSTANG: A multiple structural alignment
algorithm." Proteins: Structure, Function, and Bioinformatics. 64(3):559-574.
Please cite:
A. S. Konagurthu, J. C. Whisstock, P. J. Stuckey, A. M. Lesk:
MUSTANG: A multiple structural alignment algorithm
Proteins: Structure, Function, and Bioinformatics. 64(3):559-574
(2006)
|
|
|
Ncbi-epcr
Tool to test a DNA sequence for the presence of sequence tagged sites
|
| Versions of package ncbi-epcr |
| Release | Version | Architectures |
| sid | 2.3.12-1 | s390,alpha,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 2.3.12-1 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 2.3.10-1 | s390,hppa,amd64,sparc,mips,ia64,mipsel,arm,i386,armel,powerpc,alpha |
| etch | 1.2.0-3+b1 | arm,mipsel,s390,mips,powerpc,sparc,hppa,alpha,i386,ia64 |
| etch | 1.2.0-3 | amd64 |
| sid | 1.2.0-3 | hurd-i386 |
| Debtags of package ncbi-epcr: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | searching, checking |
| works-with-format | plaintext |
|
License: DFSG free
|
|
Electronic PCR (e-PCR) is computational procedure that is used to identify
sequence tagged sites(STSs), within DNA sequences. e-PCR looks for potential
STSs in DNA sequences by searching for subsequences that closely match the
PCR primers and have the correct order, orientation, and spacing that could
represent the PCR primers used to generate known STSs.
The new version of e-PCR implements a fuzzy matching strategy. To reduce
likelihood that a true STS will be missed due to mismatches, multiple
discontigous words may be used instead of a single exact word. Each of this
word has groups of significant positions separated by 'wildcard' positions
that are not required to match. In addition, it is also possible to allow
gaps in the primer alignments.
The main motivation for implementing reverse searching (called Reverse e-PCR)
was to make it feasible to search the human genome sequence and other large
genomes. The new version of e-PCR provides a search mode using a query
sequence against a sequence database.
|
|
|
Ncbi-tools-bin
NCBI libraries for biology applications (text-based utilities)
|
| Versions of package ncbi-tools-bin |
| Release | Version | Architectures |
| sid | 6.1.20090809-1+b1 | ia64,armel |
| squeeze | 6.1.20090809-1+b1 | armel,ia64 |
| sid | 6.1.20090809-1 | s390,alpha,amd64,hppa,hurd-i386,i386,mips,mipsel,powerpc,sparc |
| squeeze | 6.1.20090809-1 | mips,mipsel,sparc,powerpc,i386,hppa,s390,amd64 |
| lenny | 6.1.20080302-4 | powerpc,ia64,mips,sparc,mipsel,arm,armel,hppa,s390,amd64,i386,alpha |
| etch | 6.1.20061015-1 | hppa,mipsel,amd64,arm,mips,i386,powerpc,sparc,s390,ia64,alpha |
| Debtags of package ncbi-tools-bin: |
| field | biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | searching, converting |
|
License: DFSG free
|
|
This package includes various utilities distributed with the NCBI C SDK.
None of the programs in this package require X; you can find the X-based
utilities in the ncbi-tools-x11 package. BLAST and related tools are
in a separate package (blast2).
The package is enhanced by the following packages:
mcl
|
|
|
Ncbi-tools-x11
NCBI libraries for biology applications (X-based utilities)
|
| Versions of package ncbi-tools-x11 |
| Release | Version | Architectures |
| sid | 6.1.20090809-1+b1 | ia64,armel |
| squeeze | 6.1.20090809-1+b1 | armel,ia64 |
| sid | 6.1.20090809-1 | s390,alpha,amd64,hppa,hurd-i386,i386,mips,mipsel,powerpc,sparc |
| squeeze | 6.1.20090809-1 | mips,mipsel,sparc,powerpc,i386,hppa,s390,amd64 |
| lenny | 6.1.20080302-4 | powerpc,ia64,mips,sparc,mipsel,arm,armel,hppa,s390,amd64,i386,alpha |
| etch | 6.1.20061015-1 | hppa,mipsel,amd64,arm,mips,i386,powerpc,sparc,s390,ia64,alpha |
| Debtags of package ncbi-tools-x11: |
| field | biology |
| interface | x11, 3d |
| role | program |
| scope | utility |
| uitoolkit | motif |
| use | searching |
| x11 | application |
|
License: DFSG free
|
|
This package includes some X-based utilities distributed with the
NCBI C SDK: Cn3D, Network Entrez, Sequin, ddv, and udv. These
programs are not part of ncbi-tools-bin because they depend on
several additional library packages.
|
|
|
Njplot
A phylogenetic tree drawing program
|
| Versions of package njplot |
| Release | Version | Architectures |
| sid | 2.3-2 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 2.3-2 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 2.2-1 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| etch | 0.20060606-2 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package njplot: |
| field | biology:bioinformatics, biology |
| interface | x11 |
| role | program |
| scope | utility |
| uitoolkit | motif |
| use | viewing, printing, organizing, editing, analysing |
| works-with-format | plaintext |
| x11 | application |
|
License: DFSG free
|
|
NJplot is able to draw any dendrogram expressed in the Newick standard
phylogenetic tree format (e.g., the format used by the Phylip package).
NJplot is especially convenient for rooting the unrooted trees
obtained from parsimony, distance or maximum likelihood tree-building
methods.
|
|
|
Perlprimer
Graphical design of primers for PCR
|
| Versions of package perlprimer |
| Release | Version | Architectures |
| sid | 1.1.18-1 | all |
| squeeze | 1.1.18-1 | all |
| lenny | 1.1.14-3 | all |
| etch | 1.1.14-1 | all |
| Debtags of package perlprimer: |
| biology | peptidic, nuceleic-acids, format:fasta, format:aln |
| field | biology:molecular, biology |
| interface | x11 |
| network | client |
| role | program |
| scope | utility |
| uitoolkit | tk |
| use | analysing |
| works-with-format | plaintext |
| x11 | application |
|
License: DFSG free
|
|
PerlPrimer is a free, open-source GUI application written in Perl that designs
primers for standard Polymerase Chain Reaction (PCR), bisulphite PCR,
real-time PCR (QPCR) and sequencing. It aims to automate and simplify the
process of primer design.
If operated online, the tool nicely communicates with the Ensembl
project for further insights into the gene structure, i.e., allowing
for taking the location of exons and introns into account for the design
of the primers. The sequences themselves can be retrieved, too.
|
|
|
Phyml
Phylogenetic estimation using Maximum Likelihood
|
| Versions of package phyml |
| Release | Version | Architectures |
| sid | 200900706-2+b1 | alpha,armel,ia64,powerpc |
| squeeze | 200900706-2+b1 | powerpc,ia64,armel |
| sid | 200900706-2 | mipsel,amd64,s390,hppa,hurd-i386,i386,sparc,mips |
| squeeze | 200900706-2 | mipsel,s390,i386,sparc,amd64,hppa,mips |
|
License: DFSG free
|
|
PhyML is a software that estimates maximum likelihood phylogenies from
alignments of nucleotide or amino acid sequences. It provides a wide
range of options that were designed to facilitate standard phylogenetic
analyses. The main strengths of PhyML lies in the large number of
substitution models coupled to various options to search the space of
phylogenetic tree topologies, going from very fast and efficient methods
to slower but generally more accurate approaches. It also implements
two methods to evaluate branch supports in a sound statistical framework
(the non-parametric bootstrap and the approximate likelihood ratio test).
PhyML was designed to process moderate to large data sets. In theory,
alignments with up to 4,000 sequences 2,000,000 character-long can
be analyzed. In practice however, the amount of memory required to process
a data set is proportional of the product of the number of sequences by their
length. Hence, a large number of sequences can only be processed provided
that they are short. Also, PhyML can handle long sequences provided that
they are not numerous. With most standard personal computers, the “comfort
zone” for PhyML generally lies around 3 to 500 sequences less than 2,000
character long.
|
|
|
Plasmidomics
draw plasmids and vector maps with PostScript graphics export
|
| Versions of package plasmidomics |
| Release | Version | Architectures |
| lenny | 0.2.0-2 | all |
| sid | 0.2.0-2 | all |
| squeeze | 0.2.0-2 | all |
| Debtags of package plasmidomics: |
| field | biology:molecular, biology |
| interface | x11 |
| role | program |
| scope | utility |
| uitoolkit | tk |
| works-with | image:vector |
| works-with-format | postscript |
| x11 | application |
|
License: DFSG free
|
|
Plasmidomics is written for easy drawing of plasmids and vector maps
to use them in theses, presentations or other forms of publications. It
natively supports PostScript as output format.
|
|
|
Plink
whole-genome association analysis toolset
|
| Versions of package plink |
| Release | Version | Architectures |
| sid | 1.07-1 | armel,hppa,hurd-i386,i386,ia64,powerpc,s390,alpha,sparc,amd64 |
| sid | 1.03p1-1 | mipsel,mips |
|
License: DFSG free
|
|
plink expects as input the data from SNP (single
nucleotide polymorphism) chips of many individuals
and their phenotypical description of a disease.
It finds associations of single or pairs of DNA
variations with a phenotype and can retrieve
SNP annotation from an online source.
SNPs can evaluated individually or as pairs for their
association with the disease phenotypes. The joint
investigation of copy number variations is supported.
A variety of statistical tests have been implemented.
Please note: The executable was renamed to p-link
because of a name clash. Please read more about this
in /usr/share/doc/README.Debian.
|
|
|
Poa
Partial Order Alignment for multiple sequence alignment
|
| Versions of package poa |
| Release | Version | Architectures |
| lenny | 2.0+20060928-2 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| sid | 2.0+20060928-2 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 2.0+20060928-2 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 2.0+20060928-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package poa: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| works-with | special:todo |
| works-with-format | plaintext |
|
License: DFSG free
|
|
POA is Partial Order Alignment, a fast program for multiple sequence
alignment (MSA) in bioinformatics. Its advantages are speed,
scalability, sensitivity, and the superior ability to handle branching
/ indels in the alignment. Partial order alignment is an approach to
MSA, which can be combined with existing methods such as progressive
alignment. POA optimally aligns a pair of MSAs and which therefore can
be applied directly to progressive alignment methods such as CLUSTAL.
For large alignments, Progressive POA is 10-30 times faster than
CLUSTALW. POA is published in Bioinformatics. 2004 Jul
10;20(10):1546-56.
Please cite:
C. Grasso, C. Lee:
Combining partial order alignment and progressive multiple sequence alignment increases alignment speed and scalability to very large alignment problems
Bioinformatics 20(10):1546-1556.
(2004)
|
|
|
Primer3
Tool to design flanking oligo nucleotides for DNA amplification
|
| Versions of package primer3 |
| Release | Version | Architectures |
| lenny | 1.1.4-1 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| sid | 1.1.4-1 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 1.1.4-1 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 1.0b-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package primer3: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| works-with | special:todo |
| works-with-format | plaintext |
|
License: DFSG free
|
|
Primer3 picks primers for Polymerase Chain Reactions (PCRs), considering as
criteria oligonucleotide melting temperature, size, GC content and
primer-dimer possibilities, PCR product size, positional constraints within
the source sequence, and miscellaneous other constraints. All of these
criteria are user-specifiable as constraints, and some are specifiable as
terms in an objective function that characterizes an optimal primer pair.
It has been published in Rozen S and Skaletsky H, "Primer3 on the WWW for
general users and for biologist programmers.", Methods Mol Biol.
2000;132:365-86.
The Whitehead Institute for Biomedical Research provides a web-based
front end to Primer3.
Please cite:
S. Rozen, H. Skaletsky:
Primer3 on the WWW for general users and for biologist programmers
Methods Mol Biol. 132:365-86
(2000)
|
|
|
Probcons
PROBabilistic CONSistency-based multiple sequence alignment
|
| Versions of package probcons |
| Release | Version | Architectures |
| lenny | 1.12-4 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| sid | 1.12-4 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 1.12-4 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 1.11-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package probcons: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing |
| works-with | special:todo |
| works-with-format | plaintext |
|
License: DFSG free
|
|
Tool for generating multiple alignments of protein sequences. Using a
combination of probabilistic modeling and consistency-based alignment
techniques, PROBCONS has achieved the highest accuracies of all alignment
methods to date. On the BAliBASE benchmark alignment database, alignments
produced by PROBCONS show statistically significant improvement over current
programs, containing an average of 7% more correctly aligned columns than
those of T-Coffee, 11% more correctly aligned columns than those of CLUSTAL W,
and 14% more correctly aligned columns than those of DIALIGN. Probcons is
published in Do, C.B., Mahabhashyam, M.S.P., Brudno, M., and Batzoglou, S.
2005. Genome Research 15: 330-340.
Please cite:
C. B. Do, M. S. P. Mahabhashyam, M. Brudno, S. Batzoglou:
Genome Research 15: 330-340
(2005)
|
|
|
Proda
multiple alignment of protein sequences
|
| Versions of package proda |
| Release | Version | Architectures |
| lenny | 1.0-7 | i386,arm,mipsel,ia64,alpha,mips,armel,amd64,s390,hppa,sparc,powerpc |
| sid | 1.0-7 | ia64,alpha,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc |
| squeeze | 1.0-7 | mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390 |
| Debtags of package proda: |
| biology | peptidic, nuceleic-acids, format:fasta |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing, analysing |
| works-with | special:todo |
| works-with-format | plaintext |
|
License: DFSG free
|
|
ProDA is a system for automated detection and alignment of homologous
regions in collections of proteins with arbitrary domain architectures.
Given an input set of unaligned sequences, ProDA identifies all
homologous regions appearing in one or more sequences, and returns a
collection of local multiple alignments for these regions.
ProDA is published in: Phuong T.M., Do C.B., Edgar R.C., and Batzoglou
S. Multiple alignment of protein sequences with repeats and
rearrangements. Nucleic Acids Research 2006 34(20), 5932-5942.
Please cite:
T. M. Phuong, C. B. Do, R. C. Edgar, S. Batzoglou:
Multiple alignment of protein sequences with repeats and rearrangements
Nucleic Acids Research 34(20), 5932-5942
(2006)
|
|
|
Pymol
Molecular Graphics System
|
| Versions of package pymol |
| Release | Version | Architectures |
| sid | 1.2r1-3 | i386,powerpc,armel,alpha,amd64,ia64 |
| sid | 1.2r1-2 | s390,hppa,hurd-i386,mips,mipsel,sparc |
| squeeze | 1.2r1-2 | ia64,hppa,powerpc,armel,s390,amd64,mips,sparc,mipsel,i386 |
| lenny | 1.1-1 | i386,ia64,mips,hppa,alpha,amd64,powerpc,sparc,mipsel,arm,armel,s390 |
| etch | 0.98+0.99rc6-2 | mipsel,mips,amd64,sparc,alpha,arm,s390,hppa,i386,powerpc,ia64 |
| Debtags of package pymol: |
| field | chemistry, biology:structural |
| interface | x11 |
| role | program |
| scope | utility |
| uitoolkit | tk |
| use | viewing |
| works-with | image |
| x11 | application |
|
License: DFSG free
|
|
PyMOL is a molecular graphics system targetted at medium to large
biomolecules like proteins. It can generate high-quality publication-ready
molecular graphics images and animations.
Features include:
-
Visualization of molecules, molecular trajectories and surfaces
of crystallography data or orbitals
-
Molecular builder and sculptor
-
Internal raytracer and movie generator
-
Fully extensible and scriptable via a Python interface
File formats PyMOL can read include PDB, XYZ, CIF, MDL Molfile, ChemDraw,
CCP4 maps, XPLOR maps and Gaussian cube maps.
|
|
|
R-cran-qtl
GNU R package for genetic marker linkage analysis
|
| Versions of package r-cran-qtl |
| Release | Version | Architectures |
| sid | 1.14-2-1 | i386,sparc,powerpc,hurd-i386,hppa,armel,s390,amd64,alpha,mipsel,ia64 |
| sid | 1.13-7-1 | mips |
| squeeze | 1.13-7-1 | s390,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc |
| lenny | 1.08-56-1 | sparc,amd64,hppa,powerpc,mips,i386,s390,alpha,mipsel,ia64,arm,armel |
| etch | 1.05-2-1 | powerpc,mips,sparc,mipsel,arm,alpha,hppa,amd64,s390,i386,ia64 |
| Debtags of package r-cran-qtl: |
| devel | library, lang:r |
| field | statistics, biology |
| role | app-data |
| suite | gnu |
|
License: DFSG free
|
|
R/qtl is an extensible, interactive environment for mapping quantitative
trait loci (QTLs) in experimental crosses. It is implemented as an
add-on-package for the freely available and widely used statistical
language/software R (see http://www.r-project.org).
The development of this software as an add-on to R allows to take
advantage of the basic mathematical and statistical functions, and
powerful graphics capabilities, that are provided with R. Further,
the user will benefit by the seamless integration of the QTL mapping
software into a general statistical analysis program. The goal is to
make complex QTL mapping methods widely accessible and allow users to
focus on modeling rather than computing.
A key component of computational methods for QTL mapping is the hidden
Markov model (HMM) technology for dealing with missing genotype data. We
have implemented the main HMM algorithms, with allowance for the presence
of genotyping errors, for backcrosses, intercrosses, and phase-known
four-way crosses.
The current version of R/qtl includes facilities for estimating
genetic maps, identifying genotyping errors, and performing single-QTL
genome scans and two-QTL, two-dimensional genome scans, by interval
mapping (with the EM algorithm), Haley-Knott regression, and multiple
imputation. All of this may be done in the presence of covariates (such
as sex, age or treatment). One may also fit higher-order QTL models by
multiple imputation.
|
|
|
R-other-bio3d
GNU R package for biological structure analysis
|
| Versions of package r-other-bio3d |
| Release | Version | Architectures |
| sid | 1.0-6-1 | all |
| squeeze | 1.0-6-1 | all |
| Debtags of package r-other-bio3d: |
| field | biology:structural, biology:bioinformatics, biology |
| interface | x11, commandline |
| role | shared-lib |
| scope | application |
| use | viewing, comparing, analysing |
| works-with | 3dmodel |
| works-with-format | plaintext |
|
License: DFSG free
|
|
The bio3d package contains utilities to process, organize and explore
protein structure, sequence and dynamics data. Features include the
ability to read and write structure, sequence and dynamic trajectory
data, perform atom summaries, atom selection, re-orientation,
superposition, rigid core identification, clustering, torsion analysis,
distance matrix analysis, structure and sequence conservation analysis,
and principal component analysis (PCA). In addition, various utility
functions are provided to enable the statistical and graphical power of
the R environment to work with biological sequence and structural data.
|
|
|
R-other-mott-happy
GNU R package for fine-mapping complex diseases
|
| Versions of package r-other-mott-happy |
| Release | Version | Architectures |
| sid | 2.1-4+b1 | s390,i386,sparc,alpha,armel,ia64,amd64,mips,mipsel,powerpc |
| squeeze | 2.1-4+b1 | sparc,amd64,armel,i386,ia64,mips,mipsel,powerpc,s390 |
| sid | 2.1-4 | hurd-i386,hppa |
| squeeze | 2.1-4 | hppa |
|
License: DFSG free
|
|
Most phenotypes of medical importance can be measured quantitatively,
and in many cases the genetic contribution is substantial, accounting
for 40% or more of the phenotypic variance. Considerable efforts have
been made to isolate the genes responsible for quantitative genetic
variation in human populations, but with little success, mostly
because genetic loci contributing to quantitative traits
(quantitative trait loci, QTL) have only a small effect on the
phenotype. Association studies have been proposed as the most
appropriate method for finding the genes that influence complex
traits. However, family-based studies may not provide the resolution
needed for positional cloning, unless they are very large, while
environmental or genetic differences between cases and controls may
confound population-based association studies.
These difficulties have led to the study of animal models of human
traits. Studies using experimental crosses between inbred animal
strains have been successful in mapping QTLs with effects on a number
of different phenotypes, including behaviour, but attempts to
fine-map QTLs in animals have often foundered on the discovery that a
single QTL of large effect was in fact due to multiple loci of small
effect positioned within the same chromosomal region. A further
potential difficulty with detecting QTLs between inbred crosses is
the significant reduction in genetic heterogeneity compared to the
total genetic variation present in animal populations: a QTL
segregating in the wild need not be present in the experimental
cross.
The idea behind this package is that when multiple strains of
animals that differ in their susceptibility to multiple diseases
are bread over multiple generations, then one can analyse the
contribution that a particular genetic locus has to each of those
diseases. While in the past this approach has been performed
for one disease at a time, this tool extends the statistics
for allowing multiple crosses and thus save animal lifes. A larger
stock of animals with more generations to keep them will further
help producing larger numbers of observable cross-over events
and thus help increasing the resolution of the mapping.
happy is an R interface into the HAPPY C package for fine-mapping
Quantitative Trait Loci (QTL) in Heterogenous Stocks (HS). An HS is
an advanced intercross between (usually eight) founder inbred strains
of mice. HS are suitable for fine-mapping QTL. It uses a multipoint
analysis which offers significant improvements in statistical power to
detect QTLs over that achieved by single-marker association.
The happy package is
an extension of the original C program happy; it uses the C code to
compute the probability of descent from each of the founders, at each
locus position, but the happy packager allows a much richer range of
models to be fit to the data.
Further details can be found in
Proc. Natl. Acad. Sci. USA, 10.1073/pnas.230304397.
|
|
|
Rasmol
Visualize biological macromolecules
|
| Versions of package rasmol |
| Release | Version | Architectures |
| sid | 2.7.4.2-5+b2 | ia64 |
| squeeze | 2.7.4.2-5+b2 | ia64 |
| sid | 2.7.4.2-5 | s390,alpha,amd64,armel,hppa,i386,mips,mipsel,powerpc,sparc |
| squeeze | 2.7.4.2-5 | mips,sparc,powerpc,amd64,mipsel,i386,hppa,s390,armel |
| lenny | 2.7.4.2-3+b1 | s390,hppa,mips,sparc |
| lenny | 2.7.4.2-3 | i386,mipsel,ia64,arm,armel,alpha,amd64,powerpc |
| sid | 2.7.4.2-3 | hurd-i386 |
| etch | 2.7.2.1.1-5 | s390,mips,powerpc,mipsel,arm,amd64,sparc,hppa,alpha,i386,ia64 |
| upstream | 2.7.5 |
| Debtags of package rasmol: |
| field | chemistry |
| interface | x11 |
| role | program |
| scope | utility |
| uitoolkit | gtk |
| use | viewing, learning |
| x11 | application |
|
License: DFSG free
|
|
RasMol is a molecular graphics program intended for the visualisation of
proteins, nucleic acids and small molecules. The program is aimed at
display, teaching and generation of publication quality images.
The program reads in a molecule coordinate file and interactively displays
the molecule on the screen in a variety of colour schemes and molecule
representations. Currently available representations include depth-cued
wireframes, 'Dreiding' sticks, spacefilling (CPK) spheres, ball and stick,
solid and strand biomolecular ribbons, atom labels and dot surfaces.
Supported input file formats include Protein Data Bank (PDB), Tripos
Associates' Alchemy and Sybyl Mol2 formats, Molecular Design Limited's
(MDL) Mol file format, Minnesota Supercomputer Center's (MSC) XYZ (XMol)
format, CHARMm format, CIF format and mmCIF format files.
This package installs two versions of RasMol, rasmol-gtk has a modern
GTK-based user interface and rasmol-classic is the version with the old
Xlib GUI.
The package is enhanced by the following packages:
rasmol-doc
|
|
|
Readseq
[Biology] Conversion between sequence formats
|
| Versions of package readseq |
| Release | Version | Architectures |
| lenny | 1-7 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| sid | 1-7 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 1-7 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 1-6 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package readseq: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | converting |
| works-with-format | plaintext |
|
License: DFSG free
|
|
Reads and writes nucleic/protein sequences in various
formats. Data files may have multiple sequences.
Readseq is particularly useful as it automatically detects many
sequence formats, and converts between them.
|
|
|
Rnahybrid
Fast and effective prediction of microRNA/target duplexes
|
| Versions of package rnahybrid |
| Release | Version | Architectures |
| sid | 2.1-2 | ia64,alpha,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc |
| squeeze | 2.1-2 | mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390 |
| lenny | 2.1-1 | i386,arm,mipsel,ia64,alpha,mips,armel,amd64,s390,hppa,sparc,powerpc |
| Debtags of package rnahybrid: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | analysing |
|
License: DFSG free
|
|
RNAhybrid is a tool for finding the minimum free energy hybridisation of a
long and a short RNA. The hybridisation is performed in a kind of domain mode,
ie. The short sequence is hybridised to the best fitting part of the long one.
The tool is primarily meant as a means for microRNA target prediction.
Public research assisted by RNAhybrid should cite:
Marc Rehmsmeier, Peter Steffen, Matthias Höchsmann, Robert Giegerich
Fast and effective prediction of microRNA/target duplexes
RNA, 10:1507-1517, 2004.
Please cite:
Marc Rehmsmeier, Peter Steffen, Matthias Höchsmann, Robert Giegerich:
Fast and effective prediction of microRNA/target duplexes RNA
10:1507-1517
(2004)
|
|
|
Samtools
processing sequence alignments in SAM and BAM formats
|
| Versions of package samtools |
| Release | Version | Architectures |
| sid | 0.1.6~dfsg-1 | alpha,amd64,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390 |
|
License: DFSG free
|
|
Samtools is a set of utilities that manipulate nucleotide sequence alignments
in the binary BAM format. It imports from and exports to the ascii SAM
(Sequence Alignment/Map) format, does sorting, merging and indexing, and allows
to retrieve reads in any regions swiftly. It is designed to work on a stream,
and is able to open a BAM (not SAM) file on a remote FTP or HTTP server.
|
|
|
Seaview
Multiplatform interface for sequence alignment and phylogeny
|
| Versions of package seaview |
| Release | Version | Architectures |
| sid | 3.2-1 | ia64,armel,hurd-i386,mips,mipsel |
| squeeze | 3.2-1 | mipsel,mips,ia64,powerpc,i386,hppa,s390,armel,amd64,sparc |
| etch | 20060918-1 | mipsel,amd64,i386,arm,s390,ia64,sparc,alpha,hppa,mips,powerpc |
| lenny | 2.4-1 | amd64,mips,mipsel,sparc,arm,armel,alpha,hppa,s390,i386,ia64,powerpc |
| Debtags of package seaview: |
| field | biology:bioinformatics, biology |
| interface | x11 |
| role | program |
| scope | utility |
| uitoolkit | fltk |
| use | special:todo, printing, editing |
| works-with | special:todo |
| works-with-format | plaintext |
| x11 | application |
|
License: DFSG free
|
|
SeaView reads and writes various file formats (NEXUS, MSF, CLUSTAL, FASTA,
PHYLIP, MASE, Newick) of DNA and protein sequences and of phylogenetic trees.
Alignments can be manually edited. It drives the programs Muscle or Clustal W
for multiple sequence alignment, and also allows to use any external alignment
algorithm able to read and write FASTA-formatted files. It computes
phylogenetic trees by parsimony using PHYLIP's dnapars/protpars algorithm, by
distance with NJ or BioNJ algorithms on a variety of evolutionary distances, or
by maximum likelihood using the program PhyML 3.0. SeaView draws phylogenetic
trees on screen or PostScript files, and allows to download sequences from
EMBL/GenBank/UniProt using the Internet.
The package is enhanced by the following packages:
muscle
Please cite:
N. Galtier, M. Gouy, C. Gautier:
SeaView and Phylo_win, two graphic tools for sequence alignment and molecular phylogeny
Comput. Applic. Biosci. 12:543-548
(1996)
|
|
|
Sibsim4
align expressed RNA sequences on a DNA template
|
| Versions of package sibsim4 |
| Release | Version | Architectures |
| lenny | 0.17-1 | arm,i386,armel,mipsel,ia64,alpha,mips,amd64,s390,hppa,sparc,powerpc |
| sid | 0.17-1 | ia64,alpha,amd64,armel,hppa,hurd-i386,i386,mips,mipsel,powerpc,s390,sparc |
| squeeze | 0.17-1 | mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390 |
| Debtags of package sibsim4: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | special:todo, searching, comparing |
| works-with | special:todo |
| works-with-format | plaintext |
|
License: DFSG free
|
|
The SIBsim4 project is based on sim4, which is a program designed to align
an expressed DNA sequence with a genomic sequence, allowing for introns.
SIBsim4 is a fairly extensive rewrite of the original code with the following
goals:
-
speed improvement;
-
allow large, chromosome scale, DNA sequences to be used;
-
provide more detailed output about splice types;
-
provide more detailed output about polyA sites;
-
misc code cleanups and fixes.
|
|
|
Sigma-align
Simple greedy multiple alignment of non-coding DNA sequences
|
| Versions of package sigma-align |
| Release | Version | Architectures |
| sid | 1.1.3-1 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 1.1.3-1 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 1.1.1-3 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| etch | 1.0-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package sigma-align: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing, analysing |
| works-with | special:todo |
| works-with-format | plaintext |
|
License: DFSG free
|
|
Sigma (“Simple greedy multiple alignment”) is an alignment program with a
new algorithm and scoring scheme designed specifically for non-coding DNA
sequence. It uses a strategy of seeking the best possible gapless local
alignments, at each step making the best possible alignment consistent with
existing alignments, and scores the significance of the alignment based on
the lengths of the aligned fragments and a background model which may be
supplied or estimated from an auxiliary file of intergenic DNA.
Please cite:
BMC Bioinformatics 16;7:143
(2006)
|
|
|
Sim4
tool for aligning cDNA and genomic DNA
|
| Versions of package sim4 |
| Release | Version | Architectures |
| lenny | 0.0.20030921-3 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| sid | 0.0.20030921-3 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 0.0.20030921-3 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 0.0.20030921-1 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package sim4: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | searching, comparing |
| works-with | special:todo |
| works-with-format | plaintext |
|
License: DFSG free
|
|
sim4 is a similarity-based tool for aligning an expressed DNA sequence
(EST, cDNA, mRNA) with a genomic sequence for the gene. It also detects end
matches when the two input sequences overlap at one end (i.e., the start of
one sequence overlaps the end of the other).
sim4 employs a blast-based technique to first determine the basic matching
blocks representing the "exon cores". In this first stage, it detects all
possible exact matches of W-mers (i.e., DNA words of size W) between the two
sequences and extends them to maximal scoring gap-free segments. In the
second stage, the exon cores are extended into the adjacent as-yet-unmatched
fragments using greedy alignment algorithms, and heuristics are used to favor
configurations that conform to the splice-site recognition signals (GT-AG,
CT-AC). If necessary, the process is repeated with less stringent parameters
on the unmatched fragments.
|
|
|
Ssake
genomics application for assembling millions of very short DNA sequences
|
| Versions of package ssake |
| Release | Version | Architectures |
| sid | 3.4-1 | all |
| squeeze | 3.4-1 | all |
|
License: DFSG free
|
|
The Short Sequence Assembly by K-mer search and 3′ read Extension
(SSAKE) is a genomics application for aggressively assembling
millions of short nucleotide sequences by progressively searching for
perfect 3′-most k-mers using a DNA prefix tree. SSAKE is designed to
help leverage the information from short sequences reads by
stringently clustering them into contigs that can be used to
characterize novel sequencing targets.
|
|
|
Staden-io-lib-utils
programs for maniuplating DNA sequencing files
|
| Versions of package staden-io-lib-utils |
| Release | Version | Architectures |
| sid | 1.12.1-1 | amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390,sparc |
| squeeze | 1.12.1-1 | powerpc,amd64,mips,armel,sparc,hppa,mipsel,i386,s390,ia64 |
|
License: DFSG free
|
|
The io_lib from the Staden package is a library of file reading and writing
code to provide a general purpose trace file (and Experiment File) reading
interface. It has been compiled and tested on a variety of unix systems,
MacOS X and MS Windows.
This package contains the programs that are distributed with the Staden io_lib
for manipulating and converting sequencing data files, and in particular files
to maniuplate short reads generated by second and third generation sequencers
and stored in SRF format.
|
|
|
T-coffee
Multiple Sequence Alignment
|
| Versions of package t-coffee |
| Release | Version | Architectures |
| sid | 8.14-1 | ia64,sparc,powerpc,i386,hurd-i386,armel,s390,amd64,alpha,mipsel,mips |
| sid | 8.13-1 | hppa |
| squeeze | 8.13-1 | s390,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc |
| lenny | 5.72-1 | alpha,mips,mipsel,arm,armel,sparc,hppa,s390,i386,amd64,powerpc,ia64 |
| etch | 2.50-1 | alpha,hppa,mips,ia64,amd64,arm,i386,sparc,powerpc,s390,mipsel |
| Debtags of package t-coffee: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing, analysing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
T-Coffee is a multiple sequence alignment package. Given a set of
sequences (Proteins or DNA), T-Coffee generates a multiple sequence
alignment. Version 2.00 and higher can mix sequences and structures.
T-Coffee allows the combination of a collection of multiple/pairwise,
global or local alignments into a single model. It also allows to
estimate the level of consistency of each position within the new
alignment with the rest of the alignments. See the pre-print for more
information
T-Coffee has a special called M-Coffee that makes it possible to combine the
output of many multiple sequence alignment packages. In its published version,
it uses MUSCLE, PROBCONS, POA, DiAlign-TS, MAFFT, Clustal W, PCMA and
T-Coffee. A special version has been made for Debian, DM-Coffee, that uses
only free software by replacing Clustal W by Kalign. Using the 8 Methods of
M-Coffee can sometimes be a bit heavy. You can use a subset of your favorite
methods if you prefer.
|
|
|
Theseus
superimpose macromolecules using maximum likelihood
|
| Versions of package theseus |
| Release | Version | Architectures |
| sid | 1.4.3-2 | mips,armel,mipsel,hurd-i386,powerpc,i386,s390,amd64,sparc,alpha,ia64,hppa |
| squeeze | 1.4.3-2 | sparc,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390 |
| Debtags of package theseus: |
| biology | peptidic |
| field | biology:structural, biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| use | comparing, analysing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
Theseus is a program that simultaneously superimposes multiple
macromolecular structures. Theseus finds the optimal solution to the
superposition problem using the method of maximum likelihood. By
down-weighting variable regions of the superposition and by correcting for
correlations among atoms, the ML superposition method produces very
accurate structural alignments.
When macromolecules with different residue sequences are superimposed,
other programs and algorithms discard residues that are aligned with
gaps. Theseus, however, uses a novel superimposition algorithm that
includes all of the data.
|
|
|
Tigr-glimmer
Gene detection in archea and bacteria
|
| Versions of package tigr-glimmer |
| Release | Version | Architectures |
| lenny | 3.02-1 | i386,sparc,hppa,amd64,s390,arm,powerpc,alpha,mipsel,armel,ia64,mips |
| sid | 3.02-1 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 3.02-1 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 2.13-1+b1 | s390,mips,arm,mipsel,powerpc,sparc,hppa,alpha,i386,ia64 |
| etch | 2.13-1 | amd64 |
| Debtags of package tigr-glimmer: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | searching |
| works-with-format | plaintext |
|
License: DFSG free
|
|
Developed by the TIGR institute this software detects coding sequences in
bacteria and archea.
Glimmer is a system for finding genes in microbial DNA, especially the
genomes of bacteria and archaea. Glimmer (Gene Locator and Interpolated
Markov Modeler) uses interpolated Markov models (IMMs) to identify the
coding regions and distinguish them from noncoding DNA.
|
|
|
Tree-ppuzzle
Parallelized reconstruction of phylogenetic trees by maximum likelihood
|
| Versions of package tree-ppuzzle |
| Release | Version | Architectures |
| sid | 5.2-5 | s390,alpha,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 5.2-5 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 5.2-4 | alpha,s390,hppa,mips,amd64,powerpc,sparc,mipsel,arm,ia64,armel,i386 |
| etch | 5.2-2 | s390,mips,arm,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package tree-ppuzzle: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing, analysing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
TREE-PUZZLE (the new name for PUZZLE) is an interactive console program that
implements a fast tree search algorithm, quartet puzzling, that allows
analysis of large data sets and automatically assigns estimations of support
to each internal branch. TREE-PUZZLE also computes pairwise maximum
likelihood distances as well as branch lengths for user specified trees.
Branch lengths can also be calculated under the clock-assumption. In
addition, TREE-PUZZLE offers a novel method, likelihood mapping, to
investigate the support of a hypothesized internal branch without
computing an overall tree and to visualize the phylogenetic content of
a sequence alignment.
This is the parallelized version of tree-puzzle.
|
|
|
Tree-puzzle
Reconstruction of phylogenetic trees by maximum likelihood
|
| Versions of package tree-puzzle |
| Release | Version | Architectures |
| sid | 5.2-5 | s390,alpha,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 5.2-5 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 5.2-4 | s390,hppa,amd64,sparc,powerpc,mips,mipsel,arm,ia64,armel,i386,alpha |
| etch | 5.2-2 | mips,arm,s390,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| sid | 5.2-1 | hurd-i386 |
| Debtags of package tree-puzzle: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing, analysing |
| works-with-format | plaintext |
|
License: DFSG free
|
|
TREE-PUZZLE (the new name for PUZZLE) is an interactive console program that
implements a fast tree search algorithm, quartet puzzling, that allows
analysis of large data sets and automatically assigns estimations of support
to each internal branch. TREE-PUZZLE also computes pairwise maximum
likelihood distances as well as branch lengths for user specified trees.
Branch lengths can also be calculated under the clock-assumption. In
addition, TREE-PUZZLE offers a novel method, likelihood mapping, to
investigate the support of a hypothesized internal branch without
computing an overall tree and to visualize the phylogenetic content of
a sequence alignment.
|
|
|
Treeviewx
Displays and prints phylogenetic trees
|
| Versions of package treeviewx |
| Release | Version | Architectures |
| sid | 0.5.1-6 | s390,alpha,amd64,armel,hppa,hurd-i386,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 0.5.1-6 | sparc,powerpc,ia64,i386,hppa,s390,armel,amd64,mipsel,mips |
| lenny | 0.5.1-5 | i386,sparc,hppa,amd64,mips,arm,powerpc,s390,mipsel,armel,alpha,ia64 |
| etch | 0.5.1-3 | arm,s390,mips,mipsel,powerpc,amd64,sparc,hppa,alpha,i386,ia64 |
| Debtags of package treeviewx: |
| field | biology:bioinformatics, biology |
| interface | x11 |
| role | program |
| scope | utility |
| uitoolkit | wxwindows |
| use | viewing |
| works-with | special:todo |
| works-with-format | svg, postscript, plaintext, pdf |
| x11 | application |
|
License: DFSG free
|
|
TreeView X is an open source and multi-platform program to display
phylogenetic trees. It can read and display NEXUS and Newick format tree files
(such as those output by PAUP*, ClustalX, TREE-PUZZLE, and other programs). It
allows to order the branches of the trees, and to export the trees in SVG
format.
The program was written by Rod Page r.page@bio.gla.ac.uk using the wxWidgets
C++ library. It was published in Computer Applications in the Biosciences. 1996
12: 357-358.
Please cite:
Computer Applications in the Bioscience 12:357-358
(1996)
|
|
|
Velvet
Nucleic acid sequence assembler for very short reads
|
| Versions of package velvet |
| Release | Version | Architectures |
| sid | 0.7.55-1 | mips,armel,mipsel,hurd-i386,powerpc,i386,s390,amd64,sparc,alpha,ia64,hppa |
| squeeze | 0.7.55-1 | sparc,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390 |
|
License: DFSG free
|
|
Velvet is a de novo genomic assembler specially designed for short read
sequencing technologies, such as Solexa or 454, developed by Daniel Zerbino and
Ewan Birney at the European Bioinformatics Institute (EMBL-EBI), near
Cambridge, in the United Kingdom.
Velvet currently takes in short read sequences, removes errors then produces
high quality unique contigs. It then uses paired read information, if
available, to retrieve the repeated areas between contigs.
|
|
|
Wise
comparison of biopolymers, commonly DNA and protein sequences
|
| Versions of package wise |
| Release | Version | Architectures |
| sid | 2.4.1-8 | sparc,i386,hurd-i386,armel,s390,amd64,alpha,mipsel,ia64 |
| sid | 2.4.1-7 | mips,hppa,powerpc |
| squeeze | 2.4.1-7 | s390,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc |
| lenny | 2.4.1-5 | hppa,mipsel,powerpc,arm,armel,sparc,mips,s390,i386,amd64,ia64,alpha |
| etch | 2.2.0-3 | alpha,hppa,sparc,ia64,amd64,powerpc,i386,mipsel,mips,s390,arm |
| Debtags of package wise: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | special:todo, comparing |
|
License: DFSG free
|
|
Wise2 is a package focused on comparisons of biopolymers, commonly DNA
and protein sequences. There are many other packages which do
this, probably the best known being BLAST package (from NCBI) and the
Fasta package (from Bill Pearson). There are other packages, such as
the HMMER package (Sean Eddy) or SAM package (UC Santa Cruz) focused
on hidden Markov models (HMMs) of biopolymers.
Wise2's particular forte is the comparison of DNA sequence at the level
of its protein translation. This comparison allows the simultaneous
prediction of say gene structure with homology based alignment.
Wise2 also contains other algorithms, such as the venerable Smith-Waterman
algorithm, or more modern ones such as Stephen Altschul's generalised
gap penalties, or even experimental ones developed in house, such as
dba. The development of these algorithms is due to the ease of developing
such algorithms in the environment used by Wise2.
Wise2 has also been written with an eye for reuse and maintainability.
Although it is a pure C package you can access its functionality
directly in Perl. Parts of the package (or the entire package) can
be used by other C or C++ programs without namespace clashes as all
externally linked variables have the unique identifier Wise2 prepended.
|
|
Official Debian packages with lower relevance
|
Emboss-explorer
|
| Versions of package emboss-explorer |
| Release | Version | Architectures |
| lenny | 2.2.0-7 | all |
| sid | 2.2.0-7 | all |
| squeeze | 2.2.0-7 | all |
| Debtags of package emboss-explorer: |
| biology | emboss |
| field | biology:bioinformatics, biology |
| interface | web |
| role | plugin |
| web | cgi, application |
|
License: DFSG free
|
|
EMBOSS explorer is a web-based graphical user interface
to the EMBOSS suite of bioinformatics tools. It is written
in Perl.
If you use the Apache HTTP server, you will at most have to restart it before
using EMBOSS explorer. For other web servers, you will have to do the
configuration by yourself.
|
|
|
Melting-gui
graphical interface to compute the melting temperature of nucleic acid duplex
|
| Versions of package melting-gui |
| Release | Version | Architectures |
| sid | 4.3b-1 | all |
| lenny | 4.2h-3 | all |
| squeeze | 4.2h-3 | all |
| upstream | 4.3c |
| Debtags of package melting-gui: |
| field | biology:molecular, biology |
| interface | x11 |
| role | program |
| scope | utility |
| uitoolkit | tk |
| use | analysing |
| x11 | application |
|
License: DFSG free
|
|
This package provides a graphical user interface for the “melting”
program, which computes for a nucleic acid duplex the enthalpy, the entropy
and the melting temperature of the helix-coil transitions.
|
|
|
Mozilla-biofox
extension of bioinformatics tools to Iceape and Iceweasel browsers
|
| Versions of package mozilla-biofox |
| Release | Version | Architectures |
| lenny | 1.1.5-1~lenny | all |
| sid | 1.1.5-1 | all |
| squeeze | 1.1.5-1 | all |
| etch | 1.1.4-2 | all |
| Debtags of package mozilla-biofox: |
| field | genealogy, biology:bioinformatics, biology |
| interface | x11, web |
| role | plugin |
| scope | utility |
| suite | mozilla |
| use | searching, editing, converting, comparing, browsing |
| works-with | db |
|
License: DFSG free
|
|
Code bioFOX aims at implementing various bioinformatics tools as an extension
on the Iceape and Iceweasel browsers. Analysis of your favorite gene(s)
usually require(s) retrieving it from a database like NCBI or Swiss-Prot and
then performing one or more tasks including but not limited to:
-
Translation of a nucleotide sequence;
-
Blast search (eg. blastn, blastp etc.) of the desired nucleotide/protein
sequence;
-
Calculation of properties (like PI, charge, molecular weight, AT/GC content
etc.) of a protein/nucleotide sequence;
-
Conversion between formats (Genbank, Fasta, Swiss-Prot etc.);
-
Prediction of sequence for sub-cellular localization (PREDOTAR, TargetP,
pSORT etc).
|
|
|
Seqan-apps
A C++ template library for the analysis of sequences
|
| Versions of package seqan-apps |
| Release | Version | Architectures |
| sid | 1.2-1 | mips,armel,mipsel,hurd-i386,powerpc,i386,s390,amd64,sparc,alpha,ia64,hppa |
| squeeze | 1.2-1 | sparc,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,s390 |
|
License: DFSG free
|
|
SeqAn is a C++ template library of efficient algorithms and data
structures for the analysis of sequences with the focus on
biological data. This library applies a unique generic design that
guarantees high performance, generality, extensibility, and
integration with other libraries. SeqAn is easy to use and
simplifies the development of new software tools with a minimal loss
of performance. This package contains the applications dfi, pair_align,
micro_razers, seqan_tcoffee, seqcons, razers and tree_recon.
|
|
Debian packages in contrib or non-free
|
Arb
Integrated package for sequence database handling and analysis
|
| Versions of package arb |
| Release | Version | Architectures |
| sid | 0.0.20071207.1-7 | powerpc,i386,sparc |
| sid | 0.0.20071207.1-6 | mipsel |
| squeeze | 0.0.20071207.1-6 | mipsel,i386,sparc |
| Debtags of package arb: |
| field | biology:bioinformatics, biology |
| interface | x11 |
| role | program |
| scope | utility, application |
| uitoolkit | motif |
| use | searching, editing, comparing, analysing |
| x11 | application |
|
License: non-free
|
|
The ARB software is a graphically oriented package comprising various tools
for sequence database handling and data analysis. A central database of
processed (aligned) sequences and any type of additional data linked to the
respective sequence entries is structured according to phylogeny or other
user defined criteria.
The ARB project (latin, "arbor"=tree) is a joint initiative of the Lehrstuhl
fuer Mikrobiologie http://www.mikro.biologie.tu-muenchen.de/ and the
Lehrstuhl fuer Rechnertechnik und Rechnerorganisation
http://wwwbode.informatik.tu-muenchen.de/ of the Technical University
of Munich.
|
|
|
Autodocktools
GUI to help set up, launch and analyze AutoDock dockings
|
| Versions of package autodocktools |
| Release | Version | Architectures |
| sid | 1.5.4.cvs.20090603-1 | all |
| squeeze | 1.5.4.cvs.20090603-1 | all |
|
License: non-free
|
|
This package is part of the mgltools set of Python libraries which
provide an infrastructure for the analysis of protein structures and
their docking of chemical compounds.
AutoDock is a well established package for the automated screening of
libraries of formal representation of chemical compounds that putatively
bind to a particular protein at hand. This package provides a graphical
user interface that is helping with the preparation of the protein
for such analyses.
|
|
|
Clustalw
global multiple nucleotide or peptide sequence alignment
|
| Versions of package clustalw |
| Release | Version | Architectures |
| lenny | 2.0.9-1 | mips,mipsel,powerpc,arm,sparc,hppa,s390,i386,amd64,ia64,alpha |
| sid | 2.0.12-1 | i386,hppa,armel,s390,amd64,alpha,ia64,sparc |
| sid | 2.0.10-1 | powerpc,mipsel,mips |
| squeeze | 2.0.10-1 | s390,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc |
| etch | 1.83-1.2 | hppa,arm,amd64,powerpc,i386,mipsel,sparc,s390,ia64,mips,alpha |
| Debtags of package clustalw: |
| biology | peptidic, nuceleic-acids, format:fasta, format:aln |
| field | biology:bioinformatics, biology |
| interface | text-mode, commandline |
| role | program |
| scope | utility |
| use | comparing |
| works-with-format | plaintext |
|
License: non-free
|
|
This program performs an alignment of multiple nucleotide or amino acid
sequences. It recognizes the format of input sequences and whether the
sequences are nucleic acid (DNA/RNA) or amino acid (proteins). The output
format may be selected from in various formats for multiple alignments such as
Phylip or FASTA. Clustal W is very well accepted.
The output of Clustal W can be edited manually but preferably with an
alignment editor like SeaView or within its companion Clustal X. When building
a model from your alignment, this can be applied for improved database
searches. The Debian package hmmer creates such in form of an HMM.
The package is enhanced by the following packages:
clustalw-mpi
Please cite:
M. Larkin, et al.:
Clustal W and Clustal X version 2.0
Bioinformatics 23(21):2947-2948
(2007)
|
|
|
Clustalw-mpi
MPI-distributed global sequence alignment with ClustalW
|
| Versions of package clustalw-mpi |
| Release | Version | Architectures |
| lenny | 0.15-1 | arm,mipsel,ia64,alpha,mips,i386,amd64,s390,hppa,sparc,powerpc,armel |
| sid | 0.15-1 | ia64,alpha,amd64,armel,hppa,i386,mips,mipsel,powerpc,s390,sparc |
| squeeze | 0.15-1 | mipsel,hppa,amd64,powerpc,armel,ia64,sparc,mips,i386,s390 |
| etch | 0.13-3 | i386 |
| Debtags of package clustalw-mpi: |
| field | biology |
| interface | text-mode, commandline |
| role | program |
| scope | utility |
| use | comparing |
| works-with-format | plaintext |
|
License: non-free
|
|
ClustalW is a popular tool for multiple sequence alignment. The
alignment is achieved via three steps: pairwise alignment,
guide-tree generation and progressive alignment. ClustalW-MPI is an
MPI implementation of ClustalW. Based on
version 1.82 of the original ClustalW, both the pairwise
and progressive alignments are parallelized with MPI, a
popular message passing programming standard. The
pairwise alignments can be easily parallelized since the many
alignments are time independent on each other. However
the progressive alignments are essentially not parallelizable
because of the time dependencies between each alignment.
Here the recursive parallelism paradigm is applied to the linear space
profile-profile alignment algorithm. This approach is more time
efficient on computers with distributed memory architecture.
Traditional approach that relies on precomputing the profile-profile
score matrix has also been implemented. Results shown the latter is indeed
more appropriate for shared memory multiprocessor computer.
ClustalX is suggested for its support for local realignments, seaview
is a versatile editor of alignments.
The original ClustalW/ClustalX can be found at
URL: http://www.clustal.org/download/pre-2/
|
|
|
Clustalx
Multiple alignment of nucleic acid and protein sequences (graphical interface)
|
| Versions of package clustalx |
| Release | Version | Architectures |
| sid | 2.0.12-1 | hppa,i386,sparc,s390,alpha,amd64 |
| lenny | 1.83-4 | mips,alpha,sparc,amd64,s390,arm,armel,hppa,powerpc,i386,mipsel,ia64 |
| sid | 1.83-4 | armel,ia64,mips,mipsel,powerpc |
| squeeze | 1.83-4 | mips,i386,sparc,mipsel,hppa,powerpc,armel,s390,amd64,ia64 |
| etch | 1.83-1.2 | hppa,amd64,arm,powerpc,i386,mips,sparc,mipsel,ia64,alpha,s390 |
| Debtags of package clustalx: |
| biology | peptidic, nuceleic-acids, format:fasta, format:aln |
| field | biology:bioinformatics, biology |
| interface | x11 |
| role | program |
| scope | utility |
| uitoolkit | motif |
| use | viewing, comparing, analysing |
| works-with-format | plaintext |
| x11 | application |
|
License: non-free
|
|
This package offers a GUI interface for the Clustal multiple sequence
alignment program. It provides an integrated environment for performing
multiple sequence- and profile-alignments to analyse the results.
The sequence alignment is displayed in a window on the screen.
A versatile coloring scheme has been incorporated to highlight conserved
features in the alignment. For professional presentations, one should
use the texshade LaTeX package or boxshade.
The pull-down menus at the top of the window allow you to select all the
options required for traditional multiple sequence and profile alignment.
You can cut-and-paste sequences to change the order of the alignment; you can
select a subset of sequences to be aligned; you can select a sub-range of the
alignment to be realigned and inserted back into the original alignment.
An alignment quality analysis can be performed and low-scoring segments or
exceptional residues can be highlighted.
|
|
|
Embassy-phylip
EMBOSS conversions of the programs in the phylip package
|
| Versions of package embassy-phylip |
| Release | Version | Architectures |
| sid | 3.68-1 | amd64 |
| squeeze | 3.68-1 | amd64 |
|
License: non-free
|
|
This package is the adaptation of the PHYLIP package in which its
programs can operate with the biological sequence formats and databases
of the European Molecular Biology Open Software Suite (EMBOSS). The
software packages adapted for EMBOSS are called EMBASSY.
PHYLIP (the PHYLogeny Inference Package) is a package of programs for
inferring phylogenies (evolutionary trees). Methods that are available
in the package include parsimony, distance matrix, and likelihood
methods, including bootstrapping and consensus trees. Data types that
can be handled include molecular sequences, gene frequencies,
restriction sites and fragments, distance matrices, and discrete
characters.
The EMBASSY PHYLIP programs all have the prefix "f" to distinguish them
from the original programs and avoid namespace conflict.
|
|
|
Molphy
Program Package for MOLecular PHYlogenetics
|
| Versions of package molphy |
| Release | Version | Architectures |
| lenny | 2.3b3-5 | amd64,i386,sparc,hppa,powerpc,alpha,mips,arm,s390,ia64,mipsel,armel |
| sid | 2.3b3-5 | s390,alpha,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc |
| squeeze | 2.3b3-5 | ia64,sparc,powerpc,i386,hppa,s390,armel,amd64,mipsel,mips |
| etch | 2.3b3-2 | arm,mips,powerpc,mipsel,sparc,hppa,alpha,i386,ia64 |
| Debtags of package molphy: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing, analysing |
|
License: non-free
|
|
ProtML is a main program in MOLPHY for inferring evolutionary trees from
PROTein (amino acid) sequences by using the Maximum Likelihood method.
Other programs (C language)
NucML: Maximum Likelihood Inference of Nucleic Acid Phylogeny
ProtST: Basic Statistics of Protein Sequences
NucST: Basic Statistics of Nucleic Acid Sequences
NJdist: Neighbor Joining Phylogeny from Distance Matrix
Utilities (Perl)
mollist: get identifiers list molrev: reverse DNA sequences
molcat: concatenate sequences molcut: get partial sequences
molmerge: merge sequences nuc2ptn: DNA -> Amino acid
rminsdel: remove INS/DEL sites molcodon: get specified codon sites
molinfo: get varied sites mol2mol: MOLPHY format beautifer
inl2mol: Interleaved -> MOLPHY mol2inl: MOLPHY -> Interleaved
mol2phy: MOLPHY -> Sequential phy2mol: Sequential -> MOLPHY
must2mol: MUST -> MOLPHY etc.
|
|
|
Phylip
package of programs for inferring phylogenies
|
| Versions of package phylip |
| Release | Version | Architectures |
| sid | 3.69-1 | hppa,s390,amd64,alpha,sparc,i386 |
| sid | 3.68-2 | ia64,mipsel,powerpc,armel,mips |
| squeeze | 3.68-2 | s390,amd64,armel,hppa,i386,ia64,mips,mipsel,powerpc,sparc |
| lenny | 3.67-3 | ia64,hppa,amd64,mipsel,arm,s390,powerpc,sparc,mips,i386,alpha |
| etch | 3.6.1-2 | arm,powerpc,s390,hppa,alpha,i386,ia64,sparc,mips,mipsel |
| Debtags of package phylip: |
| field | biology:bioinformatics, biology |
| interface | commandline |
| role | program |
| scope | utility |
| use | comparing, analysing |
| works-with-format | plaintext |
|
License: non-free
|
|
The PHYLogeny Inference Package is a package of programs for inferring
phylogenies (evolutionary trees) from sequences.
Methods that are available in the package include parsimony, distance
matrix, and likelihood methods, including bootstrapping and consensus
trees. Data types that can be handled include molecular sequences, gene
frequencies, restriction sites, distance matrices, and 0/1 discrete
characters.
|
|
|
Raster3d
tools for generating images of proteins or other molecules
|
| Versions of package raster3d |
| Release | Version | Architectures |
| squeeze | 2.7s-1+b1 | mips,mipsel |
| lenny | 2.7s-1+b1 | mips,mipsel |
| sid | 2.7s-1+b1 | mipsel,mips |
| squeeze | 2.7s-1 | sparc,amd64,armel,hppa,i386,ia64,powerpc,s390 |
| lenny | 2.7s-1 | sparc,s390,powerpc,arm,armel,hppa,amd64,i386,ia64,alpha |
| sid | 2.7s-1 | amd64,alpha,sparc,ia64,s390,armel,hppa,powerpc,i386 |
| etch | 2.7d-2 | mipsel,mips,ia64,hppa,alpha,i386,arm,sparc |
| Debtags of package raster3d: |
| field | biology:structural, biology |
| interface | commandline |
| role | program |
| scope | application |
| use | viewing, converting |
| works-with | image:raster, image, 3dmodel |
| works-with-format | png, jpg |
|
License: non-free
|
|
Raster3D is a set of tools for generating high quality raster images of
proteins or other molecules. The core program renders spheres, triangles,
cylinders, and quadric surfaces with specular highlighting, Phong shading,
and shadowing. It uses an efficient software Z-buffer algorithm which is
independent of any graphics hardware. Ancillary programs process atomic
coordinates from PDB files into rendering descriptions for pictures composed
of ribbons, space-filling atoms, bonds, ball+stick, etc. Raster3D can also be
used to render pictures composed in other programs such as Molscript in
glorious 3D with highlights, shadowing, etc. Output is to pixel image files
with 24 bits of color information per pixel.
|
|
Packaging has started and developers might try the packaging code in VCS
tools designed to manipulate genomic data
|
License: GPL / LGPL
Debian package not available
Language: C, C++
|
|
AceDB is a genome database system developed since 1989 primarily by
Jean Thierry-Mieg (CNRS, Montpellier) and Richard Durbin (Sanger
Institute). It provides a custom database kernel, with a non-standard
data model designed specifically for handling scientific data
flexibly, and a graphical user interface with many specific displays
and tools for genomic data. AceDB is used both for managing data
within genome projects, and for making genomic data available to the
wider scientific community.
AceDB was originally developed for the C.elegans genome project ,
from which its name was derived (A C. elegans DataBase). However, the
tools in it have been generalized to be much more flexible and the
same software is now used for many different genomic databases from
bacteria to fungi to plants to man. It is also increasingly used for
databases with non-biological content.
|
antigen sequence database software for web-based bacterial typing
|
License: GPL
Debian package not available
|
|
AgdbNet is antigen sequence database software for web-based bacterial
typing. The software facilitates simultaneous BLAST querying of multiple
loci using either nucleotide or peptide sequences. It's written in Perl
and runs on Linux/UNIX systems.
Databases are described by XML files and can have any number of loci, which
may be defined by nucleotide and/or peptide sequences. The databases can
optionally have integral isolate tables so that information about representative
isolates can be retrieved or they may be configured to query external isolate
databases, such as those hosted on PubMLST.org.
The software is used on a number of public bacterial typing databases:
-
Neisseria PorA variable regions | PorB | FetA
-
Campylobacter flaA
-
Streptococcus equi seM
|
optimized statistical analysis of host-parasite coevolution
|
License: GPL
Debian package not available
|
|
AxParafit is a highly optimized version of Pierre Legendre's Parafit
program for statistical analysis of host-parasite coevolution.
AxParafit has been parallelized with MPI (Message Passing Interface)
for compute clusters and was used to carry out the largest
co-evolutionary analysis to date for the paper describing the software.
|
Bayesian MCMC analysis of molecular sequences
|
License: LGPL
Debian package not available
|
|
BEAST is a cross-platform program for Bayesian MCMC analysis of
molecular sequences. It is entirely orientated towards rooted,
time-measured phylogenies inferred using strict or relaxed molecular
clock models. It can be used as a method of reconstructing
phylogenies but is also a framework for testing evolutionary
hypotheses without conditioning on a single tree topology. BEAST uses
MCMC to average over tree space, so that each tree is weighted
proportional to its posterior probability. We include a simple to use
user-interface program for setting up standard analyses and a suit of
programs for analysing the results.
The source is avialable at http://code.google.com/p/beast-mcmc/ .
|
find clustering solutions for genome data
|
License: non-free
Debian package not available
|
|
Cluster 3.0 is an enhanced version of Cluster, which was originally
developed by Michael Eisen while at Stanford University. The main
improvement consists of the k-means algorithm, which now includes
multiple trials to find the best clustering solution. This is crucial
for the k-means algorithm to be reliable. The routine for self-organizing
maps was extended to include 2D rectangular geometries. The Euclidean
distance and the city-block distance were added to the available
measures of similarity.
|
|
Dazzle
|
License: LGPL
Debian package not available
|
|
Dazzle is a general purpose server for the Distributed Annotation System
(DAS) protocol. It is implemented as a Java servlet, using the BioJava
APIs. Dazzle is a modular system which uses small "datasource" plugins to
provide access to a range of databases. Several general-purpose plugins
are included in the package, and it it straightforward to develop new
plugins to connect to your own databases.
Information on DAS is available from http://www.biodas.org/
|
Concept and environment for constructing virtual cells on computers
|
License: GPL
Debian package not available
|
|
The E-Cell Project is an international research project aiming at
developing necessary theoretical supports, technologies and software
platforms to allow precise whole cell simulation.
The E-Cell System is an object-oriented software suite for modeling,
simulation, and analysis of large scale complex systems such as
biological cells, architected by Kouichi Takahashi and written by
a team of developers.
The core part of the system, E-Cell Simulation Environment version 3,
allows many components driven by multiple algorithms with different
timescales to coexist.
E-Cell System consists of the following three major parts:
-
E-Cell Simulation Environment (or E-Cell SE)
-
E-Cell Modeling Environment (or E-Cell ME)
-
E-Cell Analysis Toolkit.
|
The Generic Genome Browser from GMOD
|
License: Perl Artistic License, plus additional clauses
Debian package not available
|
|
The Generic Genome Browser is a combination of database and interactive Web
page for manipulating and displaying annotations on genomes. Some of its
features:
-
Simultaneous bird's eye and detailed views of the genome.
-
Scroll, zoom, center.
-
Attach arbitrary URLs to any annotation.
-
Order and appearance of tracks are customizable by administrator and end-user.
-
Search by annotation ID, name, or comment.
-
Supports third party annotation using GFF formats.
-
Settings persist across sessions.
-
DNA and GFF dumps.
-
Connectivity to different databases, including BioSQL and Chado.
-
Multi-language support.
-
Third-party feature loading.
-
Customizable plug-in architecture (e.g. run BLAST, dump & import many formats,
find oligonucleotides, design primers, create restriction maps, edit features)
|
Analysis and visualization of LD and haplotype maps
|
License: DFSG free
Debian package not available
|
|
This tools assists in the analysis of the nucleotide
variation in a population. Such investigations are performed
to determine genes and genetic pathways that are associated
with diseases. This is an early stage in the quest for new drugs.
|
viewer for chemical structures in 3D
|
License: LGPL
Debian package not available
|
|
Jmol is a Java molecular viewer for three-dimensional chemical structures.
Features include reading a variety of file types and output from quantum
chemistry programs, and animation of multi-frame files and computed normal
modes from quantum programs. It includes with features for chemicals,
crystals, materials and biomolecules.
-
The JmolApplet is a web browser applet that can be integrated into
web pages.
-
The Jmol application is a standalone Java application that runs on
the desktop.
-
The JmolViewer is a development tool kit that can be integrated into
other Java applications.
|
Java re-implementation of Michael Eisen's TreeView
|
License: GPL
Debian package not available
|
|
TreeView creates a matrix-like display of expression data, known as
Eisen clustering. The original implementation was a Windows program
named TreeView by Michael Eisen. This TreeView package, sometimes also
referred to as jTreeView, was rewritten in Java under a free license,
the original implementation also comes with the source code, but controls
commercial distribution. And it did not run on Unix.
Java TreeView is an extensible viewer for microarray data in
PCL or CDT format.
|
|
Kempbasu
Significance tests for comparing digital gene expression profiles
|
License: GPL
Debian package not available
|
|
This package implements the significance tests for comparing digital
gene profiles described in the article:
Varuzza et al. "Significance tests for comparing digital gene
expression profiles"
They provide two programs: kemp for the frequentist test and basu for
the Bayesian test, and some auxiliary scripts.
|
MAGE-MLv1 converter and visualiser
|
License: CBIL Software and Data License (Apache-like)
Debian package not available
|
|
This tool-kit is part of MR_T, a framework for import or export various of
MAGE (MicroArray Gene Expression) documents (MAGE-MLv1, MAGE-TAB, SOFT,
MINiML) from or into databases like GUS (the Genomics Unified Schema,
www.gusdb.org).
|
|
Martj
distributed data integration system for biological data
|
License: GPL
Debian package not available
|
|
BioMart is a simple, distributed data integration system with
powerful query capabilities. The BioMart data model has been applied
to the following data sources: UniProt Proteomes, Macromolecular
Structure Database (MSD), Ensembl, Vega, and dbSNP.
|
|
Mauvealigner
multiple genome alignment algorithms
|
License: GPL
|
|
The mauveAligner and progressiveMauve alignment algorithms have been
implemented as command-line programs included with the downloadable Mauve
software. When run from the command-line, these programs provide options
not yet available in the graphical interface.
Mauve is a system for efficiently constructing multiple genome alignments
in the presence of large-scale evolutionary events such as rearrangement
and inversion. Multiple genome alignment provides a basis for research
into comparative genomics and the study of evolutionary dynamics. Aligning
whole genomes is a fundamentally different problem than aligning short
sequences.
Mauve has been developed with the idea that a multiple genome aligner
should require only modest computational resources. It employs algorithmic
techniques that scale well in the amount of sequence being aligned. For
example, a pair of Y. pestis genomes can be aligned in under a minute,
while a group of 9 divergent Enterobacterial genomes can be aligned in
a few hours.
Mauve computes and interactively visualizes genome sequence comparisons.
Using FastA or GenBank sequence data, Mauve constructs multiple genome
alignments that identify large-scale rearrangement, gene gain, gene loss,
indels, and nucleotide substutition.
Mauve is developed at the University of Wisconsin.
|
|
Meme
motif discovery and search
|
License: non-free for commercial purpose (http://meme.nbcr.net/meme/COPYRIGHT.html)
Debian package not available
|
|
MEME is a tool for discovering motifs in a group of related DNA or protein
sequences. A motif is a sequence pattern that occurs repeatedly in a group
of related protein or DNA sequences. MEME represents motifs as position-dependent
letter-probability matrices which describe the probability of each possible
letter at each position in the pattern. Individual MEME motifs do not contain
gaps. Patterns with variable-length gaps are split by MEME into two or more
separate motifs.
MEME takes as input a group of DNA or protein sequences (the training set)
and outputs as many motifs as requested. MEME uses statistical modeling
techniques to automatically choose the best width, number of occurrences,
and description for each motif.
|
Whole Genome Shotgun and EST Sequence Assembler
|
License: GPL
Debian package not available
|
|
The mira genome fragment assembler is a specialised assembler for
sequencing projects classified as 'hard' due to high number of similar
repeats. For expressed sequence tags (ESTs) transcripts, miraEST is
specialised on reconstructing pristine mRNA transcripts while
detecting and classifying single nucleotide polymorphisms (SNP)
occuring in different variations thereof.
The assembler is routinely used for such various tasks as mutation
detection in different cell types, similarity analysis of transcripts
between organisms, and pristine assembly of sequences from various
sources for oligo design in clinical microarray experiments.
|
|
Murasaki
homology detection tool across multiple large genomes
|
License: GPL
Debian package not available
|
|
Murasaki is a scalable and fast, language theory-based homology
detection tool across multiple large genomes. It enable whole-genome
scale multiple genome global alignments. Supports unlimited length
gapped-seed patterns and unique TF-IDF based filtering.
Murasaki is an anchor alignment software, which is
-
exteremely fast (17 CPU hours for whole Human x Mouse genome (with
40 nodes: 52 wall minutes))
-
scalable (Arbitrarily parallelizable across multiple nodes using MPI.
Even a single node with 16GB of ram can handle over 1Gbp of sequence.)
-
unlimited pattern length
-
repeat tolerant
-
intelligent noise reduction
|
coiled coil secondary structure prediction
|
License: GPL
Debian package not available
|
|
The program predicts the coiled coil secondary structure predictions
from protein sequences. The algorithm was published in
Lupas, van Dyke & Stock, Predicting coiled coils from
protein sequences Science, 252, 1162-1164, 1991.
|
|
|
License: GPL
Debian package not available
Language: Tcl/Tk
|
|
PhyloGrapher is a program designed to visualize and study evolutionary
relationships within families of homologous genes or proteins
(elements). PhyloGrapher is a drawing tool that generates custom graphs
for a given set of elements. In general, it is possible to use
PhyloGrapher to visualize any type of relations between elements.
Used in conjunction with tcl_blast_parser, PhyloGrapher can represent
the results of a BLAST search as a graph.
PhyloGrapher and tcl_blast_parser are useful tools to analyse BLAST
biological sequence alignment reports (BLAST is provided by Debian's
blast2 package).
|
genome-wide SNP association analysis
|
License: GPL 2+
Debian package not available
|
|
A package for genome-wide association analysis between quantitative
or binary traits and single-nucleiotide polymorphisms (SNPs).
|
infer motifs in a set of sequences
|
License: GPL
Debian package not available
|
|
SMILE is a tool that infers motifs in a set of sequences, according to some
criteria. It was first made to infer exceptional sites as binding sites in
DNA sequences. Since the 1.4 version, it allows to infer motifs written on
any alphabet (even degenerate) in any kind of sequences.
The specificity of SMILE is to allow to deal with what we call structured
motifs, which are motifs associated by some distance constraints.
|
command line program for finding patterns in nucleic acids
|
License: GPL and others
Debian package not available
|
|
tacg is a character-based, command line tool for unix-like operating systems
for pattern-matching in nucleic acids and performing some of the basic protein
manipulations. It was originally designed for restriction enzyme analysis of
DNA, but has been extended to other types of matching. It now handles
degenerate sequence input in a variety of matching approaches, as well as
patterns with errors, regular expressions and TRANSFAC-formatted matrices.
It was designed to be a grep for DNA and like the original grep, its
capabilities have grown so that now the author has to keep calling up the help
page to figure out which flags (now ~50) mean what. tacg is NOT a GUI
application in any sense. However, it's existance as a strictly command-line
tool lends itself well to Webification and wrapping by various GUI tools and
it is now distributed with a web interface form and a Perl CGI handler.
Additionally, it can easily be integrated into editors that support shell
commands such as nedit.
The use of tacg may be cited as: Mangalam, HJ. (2002) tacg, a grep for DNA.
BMC Bioinformatics. 3:8
http://www.biomedcentral.com/1471-2105/3/8
|
Unofficial packages built by somebody else
|
Act
DNA Sequence Comparison Viewer
|
License: GPL
|
|
ACT (Artemis Comparison Tool) is a DNA sequence comparison viewer
based on Artemis. In common with Artemis, ACT is written in Java and
runs on UNIX, GNU/Linux, Macintosh and MS Windows systems. It can
read complete EMBL and GENBANK entries or sequence in FASTA or raw
format. Extra sequence features can be in EMBL, GENBANK or GFF
format.
The sequence comparison displayed by ACT is usually the result of
running a blastn or tblastx search. See the user manual for more
information.
To see ACT in action go to the examples page
http://www.sanger.ac.uk/Software/ACT/Examples/
|
|
Artemis
genome viewer and annotation tool
|
License: GPL 2+
|
|
Artemis is a free genome viewer and annotation tool that allows visualization
of sequence features and the results of analyses within the context of the
sequence, and its six-frame translation. Artemis is written in Java, and is
available for UNIX, Macintosh and Windows systems. It can read EMBL and GENBANK
database entries or sequence in FASTA or raw format. Extra sequence features
can be in EMBL, GENBANK or GFF format.
|
|
Big-blast
Helper tool to run blast on large sequences
|
License: not specified
|
|
This script will chop up a large sequence, run blast on each bit and
then write out an EMBL feature table and a MSPcrunch -d file
containing the hits.
|
|
Blixem
interactive browser of pairwise Blast matches
|
License: not specified
|
|
Blixem (BLast matches In an X-windows Embedded Multiple alignment),
is an interactive browser of pairwise Blast matches that have been
stacked up in a ma ster-slave multiple alignment
|
|
Caftools
Tools to maintain DNA sequence assemblies
|
License: Free for non-commercial purposes
|
|
This is V2 of the CAF (Common Assembly Format) C-tools. It comprises
a set of libraries and programs for manipulating DNA sequence
assemblies using CAF files.
The CAF specification can be found at:
http://www.sanger.ac.uk/Software/formats/CAF/
|
|
Cap3
DNA Sequence Assembly Program
|
License: free for governmental agency or a non-profit educational institution
|
|
CAP3 contains the following improvements to the CAP sequence assembly
program.
1. Use of forward-reverse constraints to correct assembly errors and
link contigs.
2. Use of base quality values in alignment of sequence reads.
3. Automatic clipping of 5' and 3' poor regions of reads.
4. Generation of assembly results in ace file format for Consed.
5. CAP3 can be used in GAP4 of the Staden package.
These improvements allow CAP3 to take longer sequences of higher
errors and produce more accurate consensus sequences.
|
|
Cd-hit
suite of programs designed to quickly group sequences.
|
License: to be clarified
|
|
CD-HIT stands for Cluster Database at High Identity with
Tolerance. The program (cd-hit) takes a fasta format sequence
database as input and produces a set of 'non-redundant' (nr)
representative sequences as output. In addition cd-hit outputs a
cluster file, documenting the sequence 'groupies' for each nr
sequence representative. The idea is to reduce the overall size of
the database without removing any sequence information by only
removing 'redundant' (or highly similar) sequences. This is why the
resulting database is called non-redundant (nr). Essentially, cd-hit
produces a set of closely related protein families from a given fasta
sequence database.
CD-HIT uses a 'longest sequence first' list removal algorithm to
remove sequences above a certain identity threshold. Additionally the
algorithm implements a very fast heuristic to find high identity
segments between sequences, and so can avoid many costly full
alignments.
With recent developments, cd-hit package offers new programs for DNA
sequence clustering and comparing two databases. It also has lots of
new options for clustering control.
|
|
Coalesce
outdated program to estimate population-genetic parameters
|
License: not specified
|
|
COALESCE fits the model which has a single population of constant
size, and estimates 4Nu, where N is the effective population size and
u is the neutral mutation rate per site. You may also want the
Postscript or the TeX file of the preprint version of the Kuhner,
Yamato, and Felsenstein 1995 paper.
|
|
Compclust
explore and quantify relationships between clustering results
|
License: MLX (http://woldlab.caltech.edu/compclust/LICENSE.txt)
Language: Python
|
|
CompClust is a python package written using the pyMLX and IPlot APIs. It provides
software tools to explore and quantify relationships between clustering results. Its
development has been largely built around needs of microarray data analysis but could
be easily used in other domains.
Briefly pyMLX provides for efficient and convenient execution of many clustering
algorithms using a extendable library of algorithms. It also provides many-to-many
linkages between data features and annotations (such as cluster labels, gene names,
gene ontology information, etc.) These linkages persist through varied data
manipulations. IPlot provides an abstraction of the plotting process in which any
arbitrary feature or derived feature of the data can be projected onto any feature
of the plot, including the X,Y coordinates of points, marker symbol, marker size,
maker/line color, etc. These plots are intrinsically linked to the dataset, the
View and the Labeling classes found within pyMLX.
|
|
Dotter
detailed comparison of two sequences
|
License: to be clarified
|
|
Dotter is a graphical dotplot program for detailed comparison of two
sequences. Here, every residue in one sequence is compared to every
residue in the other sequence. The first sequence runs along the
x-axis and the second sequence along the y-axis. In regions where the
two sequences are similar to each other, a row of high scores will
run diagonally across the dot matrix. If you're comparing a sequence
against itself to find internal repeats, you'll notice that the main
diagonal scores maximally, since it's the 100% perfect self-match.
To make the score matrix more intelligible, the pairwise scores are
averaged over a sliding window which runs diagonally. The averaged
score matrix forms a three-dimensional landscape, with the two
sequences in two dimensions and the height of the peaks in the
third. This landscape is projected onto two dimensions by aid of
greyscales - the darker grey of a peak, the higher it is.
Dotter provides a tool to explore the visual appearance of this
landscape, as well as a tool to examine the sequence alignment it
represents.
|
|
Dotur
Defining Operational Taxonomic Units and estimating species Richness
|
License: GPL
|
|
Dotur (Distance Based OTU and Richness determination) is a computer
program that takes a distance matrix describing the genetic distance
between DNA sequence data and assigns sequences to operational
taxonomic units (OTUs) using either the furthest, average, or nearest
neighbor algorithms for all possible distances that can be described
using the distance matrix. Using the OTU composition data, dotur
constructs collector's and rarefaction curves for sampling intensity,
richness estimators, and diversity indices.
|
|
Estferret
processes, clusters and annotates EST data
|
License: to be clarified
|
|
ESTFerret processes, clusters and annotates EST data. It is
user-configurable. Results are currently stored in a series of text
tables. Annotation consists of searches against use r-defined blast
databases, prosite, GO and allocation of EC numbers where possible.
EST-ferret is a user-configurable, automated pipeline for the
convenient analysis of EST sequence data that includes all of the
necessary steps for cleanup and trimming, submission to external
sequence repositories, clustering, identification by BLAST homology
searches and by searches of protein domain databases, annotation with
computer-addressable terms and production of outputs for direct entry
into microarray analysis packages. It is composed of several widely
used, open-source algorithms, including PHRED, CAP3, BLAST, and a
range of sequence and annotation databases, including Gene Ontology
and Conserved Domain Database to deliver a putative identity and a
detailed annotation of each clone. It can be run either step-by-step
to track the outputs, or as a single batch process. Users can easily
edit the configuration file to define parameter settings.
This package has five major components: (1) ESTs coding system; (2)
sequence processing; (3) sequence clustering; (4) sequence annotating
and (5) storage and reporting of results. DNA trace files are renamed
and converted into FASTA format, cleaned and submitted to
dbEST(Boguski, et al, 1993). Sequence assembly uses two rounds of
CAP3 to assemble the ESTs into groups corresponding to separate gene
families and unique genes. Sequence identification and annotation is
provided by a series of BLAST homology searches (Parallel_BLAST and
Priority_BLAST) against user-defined sequence databases implemented
with the NCBI BLASTALL algorithm. The BLAST results are parsed and
annotation terms that reflect functional attributes are captured from
Gene Ontology (The Gene Ontology Consortium, 2000), KEGG and Enzyme
Commission (EC) databases and applied to each of the clones. CDD (and
InterPro) searches are performed for seeking protein domains in the
sequences. Other options are provided to run PatSearch, RepeatMasker
and BLAT to find UTRs, repeats and EST candidates in
genomes. Finally, the package generates analysis reports in a variety
of flat file formats, sources of which can be serve as inputs for
some gene annotation and gene expression profiling tools, and also as
a MySQL database or web-browsable search tool.
|
|
Estscan
detect coding regions in DNA sequences, even if of low quality
|
License: free
|
|
ESTScan is a program that can detect coding regions in DNA sequences,
even if they are of low quality. It will also detect and correct
sequencing errors that lead to fr ameshifts.
ESTScan is not a gene prediction program, nor is it an open reading
frame detector. In fact, its strength lies in the fact that it does
not require an open reading frame to detect a coding region. As a
result, the program may miss a few translated amino acids at either
the N or the C terminus, but will detect coding regions with high
selectivity and sensitivity.
Similarly to GENSCAN, ESTScan uses a Markov model to represent the
bias in hexanucleotide usage found in coding regions relative to
non-coding regions. Additionally, ESTScan allows insertions and
deletions when these improve the coding region statistics. Further
details can be found at:
http://www.ch.embnet.org/software/ESTScan2_help.html
References:
-
Lottaz C, Iseli C, Jongeneel CV, Bucher P. (2003) Modeling sequencing
errors by combining Hidden Markov models Bioinformatics 19,
ii103-ii112.
-
Iseli C, Jongeneel CV, Bucher P. (1999) ESTScan: a program for
detecting, evaluating, and reconstructing potential coding regions in
EST sequences. Proc Int Conf Intell Syst Mol Biol.138-48.
|
|
Fasta
searching DNA and protein databases
|
License: no inclusion into commercial product
|
|
FASTA (pronounced FAST-AYE) stands for FAST-ALL, reflecting the fact
that it can be used for a fast protein comparison or a fast
nucleotide comparison. This program achieves a high level of
sensitivity for similarity searching at high speed. This is achieved
by performing optimised searches for local alignments using a
substitution matrix. The high speed of this program is achieved by
using the observed pattern of word hits to identify potential matches
before attempting the more time consuming optimised search. The
trade-off between speed and sensitivity is controlled by the ktup
parameter, which specifies the size of the word. Increasing the ktup
decreases the number of background hits. Not every word hit is
investigated but instead initially looks for segment's containing
several nearby hits.
|
|
Fluctuate
outdated program to model a single population
|
License: not specified
|
|
FLUCTUATE fits the model which has a single population which has been
growing (or shrinking) according to an exponential growth law. It
estimates 4Nu and g, where N is the effective population size, u is
the neutral mutation rate per site, and g is the growth rate of the
population.
|
|
Forester
visualization of annotated phylogenetic trees
|
License: LGPL
|
|
FORESTER is a Java/Perl based software package for phylogenomic
analyses. Currently, it includes the phylogenetic tree visualization
and manipulation tool ATV and implementations of the SDI algorithm
and the RIO method (http://www.phylosoft.org/).
|
multiple alignment editor
|
License: GPL
|
|
Jalview is a multiple alignment editor written in Java. It is used
widely in a variety of web pages (e.g. the EBI Clustalw server and
the Pfam protein domain database) but is available as a general
purpose alignment editor.
|
|
Lamarc
estimate population-genetic parameters
|
License: Apache V2.0
|
|
LAMARC is a program which estimates population-genetic parameters
such as population size, population growth rate, recombination rate,
and migration rates. It approximates a summation over all possible
genealogies that could explain the observed sample, which may be
sequence, SNP, microsatellite, or electrophoretic data. LAMARC and
its sister program Migrate are successor programs to the older
programs Coalesce, Fluctuate, and Recombine, which are no longer
being supported. The programs are memory-intensive but can run
effectively on workstations.
|
|
Lucy
DNA sequence quality and vector trimming tool
|
License: GPL
|
|
Lucy is a utility that prepares raw DNA sequence fragments for
sequence assembly, possibly using the TIGR Assembler. The cleanup
process includes quality assessment, confidence reassurance, vector
trimming and vector removal. The primary advantage of Lucy over other
similar utilities is that it is a fully integrated, stand alone
program.
Lucy was designed and written at The Institute for Genomic Research
(TIGR, now the J. Craig Venter Institute), and it has been used here
for several years to clean sequence data from automated DNA
sequencers prior to sequence assembly and other downstream uses. The
quality trimming portion of lucy makes use of phred quality scores,
such as those produced by many automated sequencers based on the
Sanger sequencing method. As such, lucy’s quality trimming may not
be appropriate for sequence data produced by some of the new
“next-generation” sequencers.
See also the SourceForge page at http://lucy.sourceforge.net/.
|
|
Maxd
data warehouse and visualisation environment for genomic expression data
|
License: Artistic
|
|
Maxd is a data warehouse and visualisation environment for genomic
expression data. It is being developed in the University of
Manchester by the Microarray Bioinformatics Group.
Software components:
maxdLoad2 - standards-compliant, highly customisable transcriptomics
database
maxdView - modular and easily extensible data visualisation and
analysis environment
maxdSetup - installation management utility
|
|
Mesquite
help biologists analyze comparative data about organisms
|
License: LGPL
|
|
Mesquite is software for evolutionary biology, designed to help
biologists analyze comparative data about organisms. Its emphasis is
on phylogenetic analysis, but some of its modules concern population
genetics, while others do non-phylogenetic multivariate
analysis. Because it is modular, the analyses available depend on the
modules installed. Analyses include:
-
Reconstruction of ancestral states (parsimony, likelihood)
-
Tests of process of character evolution, including correlation
-
Analysis of speciation and extinction rates
-
Simulation of character evolution (categorical, DNA, continuous)
-
Parametric bootstrapping (integration with programs such as PAUP*
and NONA)
-
Morphometrics (PCA, CVA, geometric morphometrics)
-
Coalescence (simulations, other calculations)
-
Tree comparisons and simulations (tree similarity, Markov
speciation models)
There is a brief outline of features, which includes
screenshots. Mesquite is not primarily designed to infer phylogenetic
trees, but rather for diverse analyses using already inferred trees.
|
|
Migrate
estimation of population sizes and gene flow using the coalescent
|
License: to be clarified
|
|
Migrate estimates effective population sizes and past migration rates
between n population assuming a migration matrix model with
asymmetric migration rates and different subpopulation sizes. Migrate
uses maximum likelihood or Bayesian inference to jointly estimate all
parameters. It can use the followind data types: sequence data using
Felsenstein's 84 model with or without site rate variation, single
nucleotide polymorphism data, microsatellite data using a stepwise
mutation model or a brownian motion mutation model, and
electrophoretic data using an 'infinite' allele model. The output can
contain: Estimates of all migration rates and all population sizes,
assuming constant mutation rates among loci or a gamma distributed
mutation rate among loci. Profile likelihood tables, Percentiles,
Likelihood-ratio tests, and simple plots of the log-likelihood
surfaces for all populations and all loci.
|
|
Mrbayes
Bayesian estimation of phylogeny
|
License: GPL
|
|
MrBayes is a program for the Bayesian estimation of
phylogeny. Bayesian inference of phylogeny is based upon a quantity
called the posterior probability distribution of trees, which is the
probability of a tree conditioned on the observations. The
conditioning is accomplished using Bayes's theorem. The posterior
probability distribution of trees is impossible to calculate
analytically; instead, MrBayes uses a simulation technique called
Markov chain Monte Carlo (or MCMC) to approximate the posterior
probabilities of trees.
The program takes as input a character matrix in a NEXUS file
format. The output is several files with the parameters that were
sampled by the MCMC algorithm. MrBayes can summarize the information
in these files for the user. The program features include:
|
|
Msatfinder
identification and characterization of microsatellites in a comparative genomic context
|
License: GPL
|
|
Msatfinder is a Perl script designed to allow the identification and
characterization of microsatellites in a comparative genomic
context. There is also an online manual, a discussion forum and an
online interface where users can do searches in any number of DNA or
protein sequences (as long as the maximum size of all sequences does
not exceed 10MB). Nucleotide and amino acid sequences in GenBank,
FASTA, EMBL and Swissprot formats are supported.
|
|
Mview
reformat results of a sequence database search or a multiple alignment
|
License: GPL
|
|
MView is a tool for converting the results of a sequence database
search (BLAST, FASTA, etc.) into the form of a coloured multiple
alignment of hits stacked against the query. Alternatively, an
existing multiple alignment (MSF, PIR, CLUSTAL, etc.) can be pr
ocessed. It reformats the results of a sequence database search or a
multiple alignment adding optional HTML markup to control colouring
and web page layout. MView is not a multiple alignment program, nor
is it a general purpose alignment editor.
|
|
Oligoarrayaux
Prediction of Melting Profiles for Nucleic Acids
|
License: non-free (fre academical use)
|
|
OligoArrayAux is a subset of the UNAFold package for use with
OligoArray (http://berry.engin.umich.edu/oligoarray2_1/). OligoArray
is a free software that computes gene specific oligonucleotides for
genome-scale oligonucleotide microarray construction. (It is not
really specified what they mean with "free software". You can
download the source code after registration: "registration is the
only way for me to keep trace of OligoArray users and be able to send
you a bug fix or a new release".)
The original UNAFold server is available at
http://dinamelt.bioinfo.rpi.edu/download.php and you should probably
read http://dinamelt.bioinfo.rpi.edu/ if you want to know more about
"Prediction of Melting Profiles for Nucleic Acids".
|
|