DebiChem Project
Summary
Cheminformatics
DebiChem Cheminformatics

This metapackage will install cheminformatics packages useful for chemists.

The list to the right includes various software projects which are of some interest to the DebiChem Project. Currently, only a few of them are available as Debian packages. It is our goal, however, to include all software in DebiChem which can sensibly add to a high quality Debian Pure Blend.

For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:

If you discover a project which looks like a good candidate for DebiChem to you, or if you have prepared an unofficial Debian package, please do not hesitate to send a description of that project to the DebiChem mailing list

Links to other tasks

DebiChem Cheminformatics packages

Official Debian packages with high relevance

Libcdk-java
Chemistry Development Kit (CDK) Java libraries
Versions of package libcdk-java
ReleaseVersionArchitectures
squeeze1.0.2-5all
wheezy1.2.10-3all
sid1.2.10-3all
upstream1.4.18
Debtags of package libcdk-java:
devellibrary, lang:java
fieldchemistry
roledevel-lib
Popcon: 0 users (0 upd.)*
Newer upstream!
License: DFSG free
Svn

The CDK is a library of Java classes used in computational and information chemistry and in bioinformatics. It includes renderers, file IO, SMILES generation/parsing, maximal common substructure algorithms, fingerprinting and much, much more.

Please cite: C. Steinbeck, Y. Han, S. Kuhn, O. Horlacher, E. Luttmann and E. L. Willighagen: The Chemistry Development Kit (CDK): An Open-Source Java Library for Chemo- and Bioinformatics. (2003)
Libopsin-java
Chemical name to structure converter
Versions of package libopsin-java
ReleaseVersionArchitectures
wheezy1.2.0-1all
sid1.2.0-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

OPSIN (Open Parser for Systematic IUPAC Nomenclature) converts IUPAC compound names to semantic chemical information and outputs a structure either as as CML (Chemical Markup Language), SMILES, or InChI.

Openbabel
Chemical toolbox utilities (cli)
Maintainer: Debichem Team
Versions of package openbabel
ReleaseVersionArchitectures
squeeze2.2.3-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.3.1+dfsg-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.3.1+dfsg-4mipsel
sid2.3.2+dfsg-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,powerpc,s390,s390x,sparc
Debtags of package openbabel:
fieldchemistry
interfacecommandline
roleprogram
scopeutility
useconverting
Popcon: 133 users (237 upd.)*
Versions and Archs
License: DFSG free
Svn

Open Babel is a chemical toolbox designed to speak the many languages of chemical data. It allows one to search, convert, analyze, or store data from molecular modeling, chemistry, solid-state materials, biochemistry, or related areas. Features include:

  • Hydrogen addition and deletion
  • Support for Molecular Mechanics
  • Support for SMARTS molecular matching syntax
  • Automatic feature perception (rings, bonds, hybridization, aromaticity)
  • Flexible atom typer and perception of multiple bonds from atomic coordinates
  • Gasteiger-Marsili partial charge calculation

File formats Open Babel supports include PDB, XYZ, CIF, CML, SMILES, MDL Molfile, ChemDraw, Gaussian, GAMESS, MOPAC and MPQC.

This package includes the following utilities:

  • babel: Convert between various chemical file formats
  • obenergy: Calculate the energy for a molecule
  • obminimize: Optimize the geometry, minimize the energy for a molecule
  • obgrep: Molecular search program using SMARTS pattern
  • obgen: Generate 3D coordinates for a molecule
  • obprop: Print standard molecular properties
  • obfit: Superimpose two molecules based on a pattern
  • obrotamer: Generate conformer/rotamer coordinates
  • obconformer: Generate low-energy conformers
  • obchiral: Print molecular chirality information
  • obrotate: Rotate dihedral angle of molecules in batch mode
  • obprobe: Create electrostatic probe grid
Python-chemfp
Cheminformatics fingerprints file formats and tools
Versions of package python-chemfp
ReleaseVersionArchitectures
wheezy1.0-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream1.1p1
Popcon: 2 users (1 upd.)*
Newer upstream!
License: DFSG free
Svn

Chem-fingerprints is a set of formats and related tools for the storage, exchange, and search of cheminformatics fingerprint data sets.

It translates fingerprints from the OpenBabel and RDKIT and cheminformatics packages (as well as the proprietary OEChem package) into the binary FPS format.

Besides python modules, it provides the following tools:

  • sdf2fps - Extract fingerprint data from SD tags
  • ob2fps - Use OpenBabel to generate fingerprints from structures
  • rdkit2fps - Use RDKit to generate fingerprints from structures
  • oe2fps - Use OEChem/OEGraphSim to generate fingerprints from structures
  • simsearch - Do threshold or k-nearest neighbor Tanimoto similarity searches

    between two FPS files

Python-cinfony
Python abstraction layer to cheminformatics toolkits
Maintainer: Debichem Team
Versions of package python-cinfony
ReleaseVersionArchitectures
wheezy1.1-1all
sid1.2-1all
Popcon: 0 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

Cinfoy provides a common Python interface to various cheminformatics toolkits. This makes it possible to use the complementary features of cheminformatics toolktis using a common interface. If functionality is provided by more than one toolkit, the Cinfony user can choose the implementation they prefer.

Cinfony supports the following toolkits:

  • OpenBabel
  • RDKit
  • CDK
  • Indigo

Cinfony also includes the Opsin module for translating IUPAC names to chemical data and the Webel web services module.

Please cite: N. M. O'Boyle and G. R. Hutchison: Cinfony - combining Open Source cheminformatics toolkits behind a common interface. (eprint) Chem. Cent. J. 2(1):24444 (2008)
Python-fmcs
Find Maximum Common Substructure
Versions of package python-fmcs
ReleaseVersionArchitectures
wheezy1.0-1all
sid1.0-1all
Popcon: 2 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

Fcms finds the maximum common substructure (MCS) of a group (or cluster) of chemical structures and report the result as a SMARTS string.

More specifically, the MCS found is a common edge subgraph, and not a common induced subgraph. Only connected MCSes are found.

Python-indigo
Organic Chemistry Toolkit (python module)
Versions of package python-indigo
ReleaseVersionArchitectures
wheezy1.0.0-2all
sid1.0.0-2all
Popcon: 3 users (5 upd.)*
Versions and Archs
License: DFSG free
Svn

Indigo is a C++ based organic chemistry and cheminformatics software environment. Features Include:

  • Molecule and reaction rendering including SVG support
  • Automatic layout for SMILES-represented molecules and reactions
  • Canonical (isomeric) SMILES computation
  • Exact matching, substructure matching, SMARTS matching
  • Matching of tautomers and resonance structures
  • Molecule fingerprinting, molecule similarity computation
  • Fast enumeration of SSSR rings, subtrees, and edge sugraphs
  • Molecular weight, molecular formula computation
  • R-Group deconvolution and scaffold detection
  • Computation of the exact maximum common substructure for an arbitrary amount of input structures
  • Combinatorial chemistry
  • Plugin support in the API

File formats Indigo support include MDL Mol, SDF, RDF, CML, SMILES and SMARTS.

This package contains the python modules.

Python-openbabel
Chemical toolbox library (python bindings)
Maintainer: Debichem Team
Versions of package python-openbabel
ReleaseVersionArchitectures
squeeze2.2.3-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.3.1+dfsg-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.3.1+dfsg-4mipsel
sid2.3.2+dfsg-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,powerpc,s390,s390x,sparc
Debtags of package python-openbabel:
devellang:python
fieldchemistry
roleshared-lib
Popcon: 16 users (22 upd.)*
Versions and Archs
License: DFSG free
Svn

Open Babel is a chemical toolbox designed to speak the many languages of chemical data. It allows one to search, convert, analyze, or store data from molecular modeling, chemistry, solid-state materials, biochemistry, or related areas. Features include:

  • Hydrogen addition and deletion
  • Support for Molecular Mechanics
  • Support for SMARTS molecular matching syntax
  • Automatic feature perception (rings, bonds, hybridization, aromaticity)
  • Flexible atom typer and perception of multiple bonds from atomic coordinates
  • Gasteiger-Marsili partial charge calculation

File formats Open Babel supports include PDB, XYZ, CIF, CML, SMILES, MDL Molfile, ChemDraw, Gaussian, GAMESS, MOPAC and MPQC.

This package contains the Python binding.

Python-rdkit
Collection of cheminformatics and machine-learning software
Versions of package python-rdkit
ReleaseVersionArchitectures
wheezy201203-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid201212-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream201303
Popcon: 3 users (4 upd.)*
Newer upstream!
License: DFSG free
Svn

RDKit is a Python/C++ based cheminformatics and machine-learning software environment. Features Include:

  • Chemical reaction handling and transforms
  • Substructure searching with SMARTS
  • Canonical SMILES
  • Molecule-molecule alignment
  • Large number of descriptors
  • Fragmentation using RECAP rules
  • 2D coordinate generation and depiction
  • 3D coordinate generation using geometry embedding
  • UFF forcefield
  • Calculation of (R/S) stereochemistry codes
  • Pharmacophore searching
  • Calculation of shape similarity
  • Atom pairs and topological torsions fingerprints
  • Feature maps and feature-maps vectors
  • Machine-learning algorithms
  • Gasteiger-Marsili partial charge calculation

File formats RDKit supports include MDL Mol, SDF, TDT, SMILES and RDKit binary format.

*Popularitycontest results: number of people who use this package regularly (number of people who upgraded this package recently) out of 165117