Dollah, Rozilawati and Aono, Masaki (2011) Classifying biomedical text abstracts based on hierarchical 'concept' structure. Proceedings of World Academy of Science, Engineering and Technology, 74 . pp. 599-604. ISSN 2010-376X
Full text not available from this repository.
Abstract
Classifying biomedical literature is a difficult and challenging task, especially when a large number of biomedical articles should be organized into a hierarchical structure. In this paper, we present an approach for classifying a collection of biomedical text abstracts downloaded from Medline database with the help of ontology alignment. To accomplish our goal, we construct two types of hierarchies, the OHSUMED disease hierarchy and the Medline abstract disease hierarchies from the OHSUMED dataset and the Medline abstracts, respectively. Then, we enrich the OHSUMED disease hierarchy before adapting it to ontology alignment process for finding probable concepts or categories. Subsequently, we compute the cosine similarity between the vector in probable concepts (in the "enriched" OHSUMED disease hierarchy) and the vector in Medline abstract disease hierarchies. Finally, we assign category to the new Medline abstracts based on the similarity score. The results obtained from the experiments show the performance of our proposed approach for hierarchical classification is slightly better than the performance of the multi-class flat classification.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | biomedical literature, hierarchical text classification, ontology alignment, text mining |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Computer Science and Information System |
ID Code: | 28879 |
Deposited By: | Yanti Mohd Shah |
Deposited On: | 30 Nov 2012 00:55 |
Last Modified: | 05 Mar 2019 01:34 |
Repository Staff Only: item control page