Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Original Paper

The Compressed Feature Matrix—a novel descriptor for adaptive similarity search

S. F. Badreddin AbolmaaliContact Information, Claude Ostermann2 and Andreas Zell1

(1)  Department of Computer Science, University of Tuebingen, Sand 1, 72076 Tübingen, Germany
(2)  ALTANA Pharma AG, Byk Gulden Straße 2, 78467 Konstanz, Germany

Received: 20 June 2002  Accepted: 12 November 2002  Published online: 5 February 2003

Abstract  The Compressed Feature Matrix (CFM) is a new molecular descriptor for adaptive similarity searching. Depending on the requirements, it is based on a distance or geometry matrix. Thus, the CFM permits topological and three-dimensional comparisons of molecules. In contrast to the common distance matrix, the CFM is based on features instead of atoms. Each kind of these features may be weighted separately, depending on its (estimated) contribution to the biological effect of the molecule. In this work, we show that the CFM allows us to adapt similarity evaluations to particular ligands as well as to classification requirements. The CFM method is analyzed regarding correctness, adaptivity and speed. Applying the basic setting of feature weights, the similarity evaluations using the CFM on the one hand and the Tanimoto coefficient together with MACCS Keys on the other yield similar results. However, in contrast to the latter method, the CFM even permits us to focus on small parts of molecules to serve as a basis for similarity. Accordingly, we have achieved striking results not only by readjusting the feature weights with regard to the scaffold but also to the side chain of the respective target. The results of the latter run turned out to be rather independent of the molecular scaffold. Hence, the CFM is suitable not only for common similarity evaluation, but also for techniques such as lead or scaffold hopping.
Figure Chemical structure, feature graph and topological CFM of serotonine
MediaObjects/s00894-002-0110-0flba.gif

Electronic supplementary material  for this article if you access the article at 10.1007/s00894-002-0110-0.

Keywords  Similarity - Descriptor - Computer chemistry features - Scaffold hopping


Contact Information S. F. Badreddin Abolmaali
Email: abolmaali@informatik.uni-tuebingen.de
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.

Supplemental Material


Export this article
Export this article as RIS | Text
 
Remote Address: 38.107.191.108 • Server: mpweb23
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)