Binwahlan, Mohammed Salem and Salim, Naomie and Suanmali, Ladda (2009) MMI diversity based text summarization. International Journal of Computer Science and Security, 3 (1). pp. 23-33. ISSN 1985-1553
|
PDF
- Published Version
153kB |
Official URL: http://www.cscjournals.org/csc/manuscript/Journals...
Abstract
The search for interesting information in a huge data collection is a tough job frustrating the seekers for that information. The automatic text summarization has come to facilitate such searching process. The selection of distinct ideas “diversity” from the original document can produce an appropriate summary. Incorporating of multiple means can help to find the diversity in the text. In this paper, we propose approach for text summarization, in which three evidences are employed (clustering, binary tree and diversity based method) to help in finding the document distinct ideas. The emphasis of our approach is on controlling the redundancy in the summarized text. The role of clustering is very important, where some clustering algorithms perform better than others. Therefore we conducted an experiment for comparing two clustering algorithms (K-means and complete linkage clustering algorithms) based on the performance of our method, the results shown that k-means performs better than complete linkage. In general, the experimental results shown that our method performs well for text summarization comparing with the benchmark methods used in this study
Item Type: | Article |
---|---|
Additional Information: | This is an open access journal |
Uncontrolled Keywords: | binary tree, diversity, MMR, summarization, similarity threshol |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science T Technology > T Technology (General) |
Divisions: | Computer Science and Information System |
ID Code: | 11826 |
Deposited By: | Siti Anisa Abdul Hamid |
Deposited On: | 19 Jan 2011 12:29 |
Last Modified: | 19 Jan 2011 12:29 |
Repository Staff Only: item control page