Universiti Teknologi Malaysia Institutional Repository

Classifying biomedical text abstracts based on hierarchical 'concept' structure

Dollah, Rozilawati and Aono, Masaki (2011) Classifying biomedical text abstracts based on hierarchical 'concept' structure. Proceedings of World Academy of Science, Engineering and Technology, 74 . pp. 599-604. ISSN 2010-376X

Full text not available from this repository.

Abstract

Classifying biomedical literature is a difficult and challenging task, especially when a large number of biomedical articles should be organized into a hierarchical structure. In this paper, we present an approach for classifying a collection of biomedical text abstracts downloaded from Medline database with the help of ontology alignment. To accomplish our goal, we construct two types of hierarchies, the OHSUMED disease hierarchy and the Medline abstract disease hierarchies from the OHSUMED dataset and the Medline abstracts, respectively. Then, we enrich the OHSUMED disease hierarchy before adapting it to ontology alignment process for finding probable concepts or categories. Subsequently, we compute the cosine similarity between the vector in probable concepts (in the "enriched" OHSUMED disease hierarchy) and the vector in Medline abstract disease hierarchies. Finally, we assign category to the new Medline abstracts based on the similarity score. The results obtained from the experiments show the performance of our proposed approach for hierarchical classification is slightly better than the performance of the multi-class flat classification.

Item Type:Article
Uncontrolled Keywords:biomedical literature, hierarchical text classification, ontology alignment, text mining
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computer Science and Information System
ID Code:28879
Deposited By: Yanti Mohd Shah
Deposited On:30 Nov 2012 00:55
Last Modified:05 Mar 2019 01:34

Repository Staff Only: item control page