Universiti Teknologi Malaysia Institutional Repository

Diversity based text summarization

Binwahlan, Mohammed Salem and Salim, Naomie and Suanmali, Ladda (2008) Diversity based text summarization. Jurnal Teknologi Maklumat, 20 (2). pp. 1-11. ISSN 0128-3790

Abstract

Diversity among the selected sentences is an important factor in automatic text summarization because it controls redundancy in the summary. In this paper, we propose a method called maximal marginal importance (MMI) for text summarization, based on the idea of the well-known diversity approach maximal marginal relevance (MMR). The emphasis is on a diversity-based binary tree, which is used to exploit the diversity among the document sentences: the whole document is clustered into a number of clusters, and each cluster is then represented as one or more binary trees. In our method, each sentence is evaluated based on its importance and its relevance. Our experimental results show that the proposed method outperforms the three benchmark methods used in this study.
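
For readers unfamiliar with the MMR idea that the abstract builds on, the following is a minimal sketch of classic greedy MMR sentence selection. It is not the authors' MMI method and does not implement their clustering or binary trees. The function names, the whitespace tokenizer, the bag-of-words cosine similarity, the use of the whole document as the relevance target, and the default weight `lam=0.7` are all assumptions made for illustration.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Lowercased token counts; a deliberately simple representation (assumption)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mmr_select(sentences, k, lam=0.7):
    """Greedy MMR: return the indices of up to k sentences in selection order.

    Each candidate is scored as
        lam * relevance - (1 - lam) * (max similarity to any selected sentence),
    so a sentence that repeats already-selected content is penalized.
    """
    vectors = [bag_of_words(s) for s in sentences]
    # Assumption: relevance is measured against the whole document.
    document = bag_of_words(" ".join(sentences))
    relevance = [cosine(v, document) for v in vectors]

    selected = []
    candidates = list(range(len(sentences)))
    while candidates and len(selected) < k:

        def score(i):
            redundancy = max(
                (cosine(vectors[i], vectors[j]) for j in selected), default=0.0
            )
            return lam * relevance[i] - (1 - lam) * redundancy

        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected
```

With `lam=1.0` the redundancy term vanishes and selection is driven by relevance alone; lowering `lam` trades relevance for diversity, which is the redundancy-control behaviour the abstract refers to.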

Item Type: Article
Uncontrolled Keywords: summarization, diversity, binary tree, similarity threshold
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Computer Science and Information System (Formerly known)
ID Code: 9422
Deposited By: Ms Zalinda Shuratman
Deposited On: 24 Nov 2009 01:56
Last Modified: 02 Jun 2010 01:59
