Universiti Teknologi Malaysia Institutional Repository

Condorcet and borda count fusion method for ligand-based virtual screening

Ahmed, Ali and Saeed, Faisal and Salim, Naomie and Abdo, Ammar (2014) Condorcet and borda count fusion method for ligand-based virtual screening. Journal of Cheminformatics, 6 (1). ISSN 1758-2946

[img]
Preview
PDF
397kB

Official URL: http://dx.doi.org/10.1186/1758-2946-6-19

Abstract

Background: It is known that any individual similarity measure will not always give the best recall of active molecule structure for all types of activity classes. Recently, the effectiveness of ligand-based virtual screening approaches can be enhanced by using data fusion. Data fusion can be implemented using two different approaches: group fusion and similarity fusion. Similarity fusion involves searching using multiple similarity measures. The similarity scores, or ranking, for each similarity measure are combined to obtain the final ranking of the compounds in the database. Results: The Condorcet fusion method was examined. This approach combines the outputs of similarity searches from eleven association and distance similarity coefficients, and then the winner measure for each class of molecules, based on Condorcet fusion, was chosen to be the best method of searching. The recall of retrieved active molecules at top 5% and significant test are used to evaluate our proposed method. The MDL drug data report (MDDR), maximum unbiased validation (MUV) and Directory of Useful Decoys (DUD) data sets were used for experiments and were represented by 2D fingerprints. Conclusions: Simulated virtual screening experiments with the standard two data sets show that the use of Condorcet fusion provides a very simple way of improving the ligand-based virtual screening, especially when the active molecules being sought have a lowest degree of structural heterogeneity. However, the effectiveness of the Condorcet fusion was increased slightly when structural sets of high diversity activities were being sought

Item Type:Article
Uncontrolled Keywords:data fusion, similarity coefficients, similarity searching, virtual screening
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computing
ID Code:52216
Deposited By: Siti Nor Hashidah Zakaria
Deposited On:01 Feb 2016 03:53
Last Modified:17 Sep 2018 04:01

Repository Staff Only: item control page