Holliday , John D. and Salim, Naomie and Willett, Peter (2005) On the magnitudes of coefficient values in the calculation of chemical similarity and dissimilarity. In: Chemometrics and Chemoinformatics. American Chemical Society, USA, pp. 77-95. ISBN 9780841238589
Full text not available from this repository.
Official URL: http://dx.doi.org/10.1021/bk-2005-0894
Analysis of the distributions of inter-molecular similarity values has been carried out using the Tanimoto coefficient, the Cosine coefficient and the complement of Euclidean distance. In order to determine if they are an effective measure for dissimilarity-based methods, their characteristics at low values have been compared with distributions derived using bit-strings generated by random techniques. The effectiveness of similarity measures for property prediction across the full range of ranked search output was then examined. The results show that the distributions of inter-molecular similarity measures are not random in nature, but their effectiveness for property prediction is better than random only when very small or very large similarity values are considered.
|Item Type:||Book Section|
|Subjects:||Q Science > QD Chemistry|
|Divisions:||Computer Science and Information System (Formerly known)|
|Deposited By:||Liza Porijo|
|Deposited On:||03 Aug 2011 07:06|
|Last Modified:||03 Aug 2011 07:06|
Repository Staff Only: item control page