Universiti Teknologi Malaysia Institutional Repository

On the magnitudes of coefficient values in the calculation of chemical similarity and dissimilarity

Holliday , John D. and Salim, Naomie and Willett, Peter (2005) On the magnitudes of coefficient values in the calculation of chemical similarity and dissimilarity. In: Chemometrics and Chemoinformatics. American Chemical Society, USA, pp. 77-95. ISBN 9780841238589

Full text not available from this repository.

Official URL: http://dx.doi.org/10.1021/bk-2005-0894

Abstract

Analysis of the distributions of inter-molecular similarity values has been carried out using the Tanimoto coefficient, the Cosine coefficient and the complement of Euclidean distance. In order to determine if they are an effective measure for dissimilarity-based methods, their characteristics at low values have been compared with distributions derived using bit-strings generated by random techniques. The effectiveness of similarity measures for property prediction across the full range of ranked search output was then examined. The results show that the distributions of inter-molecular similarity measures are not random in nature, but their effectiveness for property prediction is better than random only when very small or very large similarity values are considered.

Item Type:Book Section
Subjects:Q Science > QD Chemistry
Divisions:Computer Science and Information System (Formerly known)
ID Code:13309
Deposited By: Liza Porijo
Deposited On:03 Aug 2011 07:06
Last Modified:03 Aug 2011 07:06

Repository Staff Only: item control page