Universiti Teknologi Malaysia Institutional Repository

Clustered genetic semantic graph approach for multi-document abstractive summarization

Khan, A. and Salim, N. and Farman, H. (2016) Clustered genetic semantic graph approach for multi-document abstractive summarization. In: International Conference on Intelligent Systems Engineering, ICISE 2016, 15 January 2016 through 17 January 2016, Pakistan.

Full text not available from this repository.

Official URL: https://www.scopus.com/inward/record.uri?eid=2-s2....

Abstract

Multi-document summarization aims to produce a compressed version of numerous online text documents and preserves the salient information. A particular challenge for multi-document summarization is that there is an inevitable overlap in the information stored in different documents. Thus, effective summarization methods that merge similar information across the documents are desirable. This paper introduces a clustered genetic semantic graph approach for multi-document abstractive summarization. The semantic graph from the document set is constructed in such a way that the graph vertices represent the predicate argument structures (PASs), extracted automatically by employing semantic role labeling (SRL); and the edges of graph correspond to semantic similarity weight determined from PAS-to-PAS semantic similarity, and PAS-to-document relationship. The PAS-to-document relationship is expressed by different features, weighted and optimized by genetic algorithm. The salient graph nodes (PASs) are ranked based on modified weighted graph based ranking algorithm. The clustering algorithm is performed to eliminate redundancy in such a way that representative PAS with the highest salience score from each cluster is chosen, and fed to language generation to generate summary sentences. Experiment of this study is performed using DUC-2002, a standard corpus for text summarization. Experimental results indicate that the proposed approach outperforms other summarization systems.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:genetic algorithm, multi-document abstractive summarization, semantic graph, semantic role labeling, semantic similarity measure
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computing
ID Code:73272
Deposited By: Muhammad Atiff Mahussain
Deposited On:23 Nov 2017 04:17
Last Modified:23 Nov 2017 04:17

Repository Staff Only: item control page