Debian Med Project
Help us to see Debian used by medical practitioners and biomedical researchers! Join us on the Alioth page.
Summary
Statistics
Debian Med statistics

This metapackage installs packages that are helpful for statistics, with a special focus on tasks in medical care.

The list to the right includes various software projects which are of some interest to the Debian Med Project. Currently, only a few of them are available as Debian packages. It is our goal, however, to include all software in Debian Med which can sensibly add to a high quality Debian Pure Blend.

For a better overview of each project's availability as a Debian package, each head row is color-coded by packaging status.

If you discover a project that looks like a good candidate for Debian Med, or if you have prepared an unofficial Debian package, please do not hesitate to send a description of that project to the Debian Med mailing list.


Debian Med Statistics packages

Official Debian packages with high relevance

R-bioc-edger
Empirical analysis of digital gene expression data in R
Versions of package r-bioc-edger
Release  Version       Architectures
wheezy   2.6.1~dfsg-1  all
sid      3.2.3~dfsg-1  amd64, armel, armhf, hurd-i386, i386, ia64, kfreebsd-amd64, kfreebsd-i386, mips, mipsel, powerpc, s390, s390x, sparc
Debtags of package r-bioc-edger:
field: biology
interface: commandline
role: shared-lib, program, plugin
scope: utility
use: comparing, calculating, analysing
Popcon: 11 users (11 upd.)*
License: DFSG free
Git

Bioconductor package for differential expression analysis of whole transcriptome sequencing (RNA-seq) and digital gene expression profiles with biological replication. It uses empirical Bayes estimation and exact tests based on the negative binomial distribution. It is also useful for differential signal analysis with other types of genome-scale count data.

Please cite: Mark D. Robinson, Davis J. McCarthy and Gordon K. Smyth: edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. (PubMed,eprint) Bioinformatics 26(1):139-140 (2010)
R-bioc-limma
linear models for microarray data
Versions of package r-bioc-limma
Release  Version        Architectures
wheezy   3.12.0~dfsg-1  amd64, armel, armhf, i386, ia64, kfreebsd-amd64, kfreebsd-i386, mips, mipsel, powerpc, s390, s390x, sparc
sid      3.16.3~dfsg-1  amd64, armel, armhf, hurd-i386, i386, ia64, kfreebsd-amd64, kfreebsd-i386, mips, mipsel, powerpc, s390, s390x, sparc
Popcon: 11 users (13 upd.)*
License: DFSG free
Git

A Bioconductor package for the analysis of gene expression microarray data, especially the use of linear models for analysing designed experiments and the assessment of differential expression. The package includes pre-processing capabilities for two-colour spotted arrays. The differential expression methods apply to all array platforms and treat Affymetrix, single channel and two channel experiments in a unified way.

Please cite: Gordon K. Smyth: Limma: linear models for microarray data. (eprint) In: Bioinformatics and Computational Biology Solutions Using R and Bioconductor, pp. 397-420 (2005)
R-bioc-qvalue
GNU R package for Q-value estimation for FDR control
Versions of package r-bioc-qvalue
Release  Version        Architectures
wheezy   1.30.0-1       all
sid      1.34.0+dfsg-1  all
Popcon: 7 users (23 upd.)*
License: DFSG free
Svn

This package takes a list of p-values resulting from the simultaneous testing of many hypotheses and estimates their q-values. The q-value of a test measures the proportion of false positives incurred (called the false discovery rate) when that particular test is called significant. Various plots are automatically generated, allowing one to make sensible significance cut-offs. Several mathematical results have recently been shown on the conservative accuracy of the estimated q-values from this software. The software can be applied to problems in genomics, brain imaging, astrophysics, and data mining.

Please cite: John D Storey and Robert Tibshirani: Statistical significance for genomewide studies. (PubMed,eprint) Proceedings of the National Academy of Sciences of the United States of America 100(16):9440-9445 (2003)
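
The step from p-values to q-values described above can be illustrated with a minimal sketch. It is written in Python rather than R and is not the API of r-bioc-qvalue: the function name `estimate_qvalues` and the parameter `lam` are hypothetical, and the proportion of true null hypotheses is estimated with a single fixed threshold, which is a simplification of what the package does.

```python
def estimate_qvalues(pvalues, lam=0.5):
    """Simplified Storey-style q-values using one fixed threshold lam (assumption)."""
    m = len(pvalues)
    if m == 0:
        return []
    # pi0: estimated proportion of true null hypotheses, capped at 1.
    # P-values above lam are assumed to come mostly from true nulls.
    pi0 = min(1.0, sum(p > lam for p in pvalues) / (m * (1.0 - lam)))
    # Visit p-values from largest to smallest so that a running minimum
    # keeps the q-values monotone in the p-values.
    order = sorted(range(m), key=lambda i: pvalues[i], reverse=True)
    qvalues = [0.0] * m
    running_min = 1.0
    for position, i in enumerate(order):
        rank = m - position  # count of p-values at or below this one
        running_min = min(running_min, pi0 * m * pvalues[i] / rank)
        qvalues[i] = running_min
    return qvalues


if __name__ == "__main__":
    example = [0.001, 0.01, 0.02, 0.03, 0.04, 0.2, 0.6, 0.9]
    for p, q in zip(example, estimate_qvalues(example)):
        print(f"p = {p:<6} q = {q:.4f}")
```

Calling a test significant at a q-value cut-off of, say, 0.05 then means accepting an estimated false discovery rate of about 5% among all tests called significant at that level.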
R-cran-pvclust
Hierarchical Clustering with P-Values via Multiscale Bootstrap
Versions of package r-cran-pvclust
Release  Version  Architectures
wheezy   1.2-2-1  all
sid      1.2-2-2  all
Popcon: 19 users (26 upd.)*
License: DFSG free
Git

pvclust is a package for assessing the uncertainty in hierarchical cluster analysis. It provides AU (approximately unbiased) p-values as well as BP (bootstrap probability) values computed via multiscale bootstrap resampling.

Please cite: Ryota Suzuki and Hidetoshi Shimodaira: Pvclust: an R package for assessing the uncertainty in hierarchical clustering. (PubMed,eprint) Bioinformatics 22(12):1540-1542 (2006)
R-cran-randomforest
GNU R package implementing the random forest classifier
Versions of package r-cran-randomforest
Release  Version   Architectures
squeeze  4.5-34-1  amd64, armel, i386, ia64, kfreebsd-amd64, kfreebsd-i386, mips, mipsel, powerpc, s390, sparc
wheezy   4.6-6-1   amd64, armel, armhf, i386, ia64, kfreebsd-amd64, kfreebsd-i386, mips, mipsel, powerpc, s390, s390x, sparc
sid      4.6-7-1   amd64, armel, armhf, hurd-i386, i386, ia64, kfreebsd-amd64, kfreebsd-i386, mips, mipsel, powerpc, s390, s390x, sparc
Debtags of package r-cran-randomforest:
devel: library, lang:r
field: medicine, biology:bioinformatics, biology
interface: commandline
role: shared-lib, devel-lib
Popcon: 43 users (35 upd.)*
License: DFSG free
Svn

RandomForest implements Breiman’s random forest algorithm (based on Breiman and Cutler’s original Fortran code) for classification and regression. It can also be used in unsupervised mode for assessing proximities among data points.

The technique uses multiple decision trees and combines their individual votes.
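
The vote-combining step can be sketched as follows. This is an illustration in Python, not the code of r-cran-randomforest: it covers only how individual tree predictions are merged for classification, not how the trees are grown, and the names `majority_vote` and `forest_predict`, the hand-written rules standing in for trained trees, and the thresholds are all hypothetical.

```python
from collections import Counter


def majority_vote(tree_predictions):
    """Return the class label predicted by the largest number of trees."""
    if not tree_predictions:
        raise ValueError("need at least one tree prediction")
    label, _count = Counter(tree_predictions).most_common(1)[0]
    return label


def forest_predict(trees, sample):
    """Ask every tree for a label and combine the answers by majority vote."""
    return majority_vote([tree(sample) for tree in trees])


# Hypothetical stand-ins for trained decision trees: each is a plain
# function that maps a sample to a class label.
trees = [
    lambda s: "sick" if s["temperature"] > 38.0 else "healthy",
    lambda s: "sick" if s["heart_rate"] > 100 else "healthy",
    lambda s: "sick" if s["crp"] > 10 else "healthy",
]

if __name__ == "__main__":
    patient = {"temperature": 39.0, "heart_rate": 90, "crp": 25}
    print(forest_predict(trees, patient))
```

With an odd number of trees and two classes no ties can occur. In other settings a tie-breaking rule is needed, and this sketch simply returns the tied label that was seen first.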

Official Debian packages with lower relevance

Science-statistics
Debian Science Statistics packages
Maintainer: Debian Science Team
Versions of package science-statistics
Release  Version  Architectures
squeeze  0.12     all
wheezy   1.0      all
sid      1.0      all
Debtags of package science-statistics:
role: metapackage
suite: debian
Popcon: 29 users (62 upd.)*
License: DFSG free
Svn

This metapackage is part of the Debian Pure Blend "Debian Science" and installs packages related to statistics. It is a general task that may be useful for any scientific work. It depends on many R packages as well as on other tools that are useful for statistics. In addition, the Science Mathematics task is suggested, so that all mathematics-related software can optionally be installed.

Packaging has started and developers might try the packaging code in VCS

R-cran-beeswarm
bee swarm plot, an alternative to stripchart
License: Artistic-2.0
Debian package not available
Git
Version: 0.0.7-1

Beeswarm is an add-on package for the R statistical environment. The bee swarm plot is a one-dimensional scatter plot like "stripchart", but with closely-packed, non-overlapping points.

*Popularity contest results: number of people who use this package regularly (number of people who upgraded this package recently) out of 165117