Salim, Naomie (2005) Comparative study of probability models for compound similarity searching. In: International Conference on Information Technology in Asia, 12-15 December 2005, Hilton Hotel, Kuching, Sarawak.
- Published Version
The quality of a chemical retrieval system depends heavily on its molecular similarity function, which returns a similarity measure between the target compound and each molecule in the collection. Compounds are ranked by their similarity to the query, and the highest-ranked compounds are returned to the user. Most current chemical retrieval systems use the vector space model for similarity calculation. This paper explores the use of probability of relevance for compound retrieval. It reports on the effectiveness of probability models for compound similarity searching, using the Binary Independence Model and the Binary Dependence Model on two different databases. Results based on fusion of queries for both models are also discussed. The results show that in all cases the Binary Independence Model performed better than the Binary Dependence Model. Fusion was also found not to give better results than the un-fused queries.
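As an illustration of the kind of ranking the abstract describes, the following is a minimal sketch of Binary Independence Model scoring over binary fingerprints. It is not the paper's implementation: the representation of a fingerprint as a set of on-bit indices, the use of a small set of known actives to estimate the probabilities, the 0.5 smoothing constant, and all names (`bim_weights`, `bim_rank`, the toy compounds) are assumptions made for the example.

```python
import math

def bim_weights(fingerprints, relevant_ids, n_bits):
    """Estimate one log-odds weight per fingerprint bit.

    fingerprints: dict mapping compound id -> set of on-bit indices (assumed format).
    relevant_ids: ids of compounds assumed to be relevant (e.g. known actives).
    """
    N = len(fingerprints)   # collection size
    R = len(relevant_ids)   # number of relevant compounds
    weights = []
    for bit in range(n_bits):
        n = sum(1 for fp in fingerprints.values() if bit in fp)         # compounds with the bit set
        r = sum(1 for cid in relevant_ids if bit in fingerprints[cid])  # relevant compounds with the bit set
        # Smoothed estimates of P(bit | relevant) and P(bit | non-relevant).
        p = (r + 0.5) / (R + 1.0)
        q = (n - r + 0.5) / (N - R + 1.0)
        weights.append(math.log(p * (1 - q) / (q * (1 - p))))
    return weights

def bim_rank(fingerprints, weights):
    """Score each compound by the sum of weights of its on bits; best first."""
    scores = {cid: sum(weights[b] for b in fp) for cid, fp in fingerprints.items()}
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical toy collection: four compounds, 4-bit fingerprints.
toy = {"a": {0, 1}, "b": {0, 1, 2}, "c": {2, 3}, "d": {3}}
weights = bim_weights(toy, relevant_ids=["a"], n_bits=4)
ranking = bim_rank(toy, weights)
```

Because the model treats bits as independent, a compound's score is simply the sum of per-bit weights. A dependence model would additionally need terms for pairs of bits.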
|Item Type:||Conference or Workshop Item (Paper)|
|Uncontrolled Keywords:||molecular similarity searching, probability model, data fusion|
|Subjects:||Q Science > QA Mathematics > QA75 Electronic computers. Computer science|
|Divisions:||Computer Science and Information System (Formerly known)|
|Deposited By:||Ms Zalinda Shuratman|
|Deposited On:||06 Aug 2010 13:31|
|Last Modified:||06 Aug 2010 13:31|