Applications of the InChI in cheminformatics with the CDK and Bioclipse
نویسندگان
چکیده
BACKGROUND The InChI algorithms are written in C++ and not available as Java library. Integration into software written in Java therefore requires a bridge between C and Java libraries, provided by the Java Native Interface (JNI) technology. RESULTS We here describe how the InChI library is used in the Bioclipse workbench and the Chemistry Development Kit (CDK) cheminformatics library. To make this possible, a JNI bridge to the InChI library was developed, JNI-InChI, allowing Java software to access the InChI algorithms. By using this bridge, the CDK project packages the InChI binaries in a module and offers easy access from Java using the CDK API. The Bioclipse project packages and offers InChI as a dynamic OSGi bundle that can easily be used by any OSGi-compliant software, in addition to the regular Java Archive and Maven bundles. Bioclipse itself uses the InChI as a key component and calculates it on the fly when visualizing and editing chemical structures. We demonstrate the utility of InChI with various applications in CDK and Bioclipse, such as decision support for chemical liability assessment, tautomer generation, and for knowledge aggregation using a linked data approach. CONCLUSIONS These results show that the InChI library can be used in a variety of Java library dependency solutions, making the functionality easily accessible by Java software, such as in the CDK. The applications show various ways the InChI has been used in Bioclipse, to enrich its functionality.
منابع مشابه
CDK-Taverna 2.0: migration and enhancements of an open-source pipelining solution
Pipelining or workflow tools allow for the LegoTM-like, graphical assembly of I/O modules and algorithms into a complex workflow which can be easily deployed, modified and tested without the hassle of implementing it into a monolithic application. The CDK-Taverna project aims at building an opensource pipelining solution through combination of different open-source projects such as Taverna [1],...
متن کاملLinking Open Drug Data to Cheminformatics and Proteochemometrics
Semantic Web technologies have made great steps forward in data exchange in health care and life sciences in the past years. The work presented here focuses to a some extent on making drug discovery related data available as RDF, and even more so on the integration of RDF approaches with data analysis of molecular information in drug discovery fields like cheminformatics and proteochemometrics....
متن کاملInChI - the worldwide chemical structure identifier standard
Since its public introduction in 2005 the IUPAC InChI chemical structure identifier standard has become the international, worldwide standard for defined chemical structures. This article will describe the extensive use and dissemination of the InChI and InChIKey structure representations by and for the world-wide chemistry community, the chemical information community, and major publishers and...
متن کاملInChI - the worldwide chemical structure standard
Since its introduction in 2005 the IUAPC InChI chemical structure standard has become the international, worldwide standard for defined chemical structures. This presentation will describe the extensive use and dissemination of the InChI and InChIKey structure representations by and for the world-wide chemistry community, the chemical information community, and major publishers and disseminator...
متن کاملInChI: connecting and navigating chemistry
The International Chemical Identifier (InChI) has had a dramatic impact on providing a means by which to deduplicate, validate and link together chemical compounds and related information across databases. Its influence has been especially valuable as the internet has exploded in terms of the amount of chemistry related information available online. This thematic issue aggregates a number of co...
متن کامل