Sulaiman, Nur Rafeeqkha and Md. Siraj, Maheyzah and Mat Din, Mazura (2020) Named entity recognition of South China Sea conflicts. In: Sustainable and Integrated Engineering International Conference, SIE 2019, 8 December 2019 - 9 December 2019, Putrajaya, Malaysia.
Official URL: http://dx.doi.org/10.1088/1757-899X/884/1/012057
Abstract
Online news articles not only provide useful and reliable information and reports; they also ease information extraction and gathering for research purposes, especially in Natural Language Processing (NLP) and machine learning (ML). Topics concerning the South China Sea have become popular lately because of rising conflicts between several countries over their claims to islands in the sea. Gathering data from the Internet and online sources is easy, but processing a huge amount of data and identifying only the useful information is no longer practical. Relevant information therefore needs to be extracted, and news articles need to be classified in relation to the conflicts. In this paper, a model is proposed that uses Named Entity Recognition (NER) to search for and classify important information regarding the conflicts. To do that, a combination of part-of-speech (POS) tagging and NER is needed to extract meaningful information from the news. This study also aims to classify conflict-related news using the Conditional Random Field (CRF) algorithm as the classification method, by training and testing on the data.
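The abstract does not list the features the authors fed to their CRF, so the following is only a minimal sketch of how POS tags are commonly combined with word-level cues for CRF-based NER. The function names, the feature keys, and the hand-tagged example sentence are all assumptions made for illustration, not details taken from the paper. Per-token feature dictionaries of this kind are the input format accepted by common CRF toolkits.

```python
from typing import Dict, List, Tuple

# A token is a (word, POS tag) pair; the POS tags are assumed to come
# from an upstream tagger and are hand-written in the example below.
Token = Tuple[str, str]


def token_features(sentence: List[Token], i: int) -> Dict[str, object]:
    """Build an illustrative feature dict for the token at position i."""
    word, pos = sentence[i]
    features: Dict[str, object] = {
        "bias": 1.0,
        "word.lower": word.lower(),
        "word.istitle": word.istitle(),  # capitalization often signals a name
        "word.isupper": word.isupper(),
        "pos": pos,
    }
    if i > 0:
        prev_word, prev_pos = sentence[i - 1]
        features["-1:word.lower"] = prev_word.lower()
        features["-1:pos"] = prev_pos
    else:
        features["BOS"] = True  # beginning of sentence
    if i < len(sentence) - 1:
        next_word, next_pos = sentence[i + 1]
        features["+1:word.lower"] = next_word.lower()
        features["+1:pos"] = next_pos
    else:
        features["EOS"] = True  # end of sentence
    return features


def sentence_features(sentence: List[Token]) -> List[Dict[str, object]]:
    """Feature dicts for every token in one sentence."""
    return [token_features(sentence, i) for i in range(len(sentence))]


# Hypothetical, hand-tagged example sentence.
example = [
    ("China", "NNP"),
    ("claims", "VBZ"),
    ("the", "DT"),
    ("Spratly", "NNP"),
    ("Islands", "NNPS"),
]
X = sentence_features(example)
```

A CRF trained on such sequences, paired with per-token entity labels, learns weights over these features together with transitions between adjacent labels, which is what lets neighboring POS tags inform the entity decision for each token.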
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | South China Sea, NLP, ML, CRF |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Computing |
ID Code: | 93748 |
Deposited By: | Yanti Mohd Shah |
Deposited On: | 31 Dec 2021 08:48 |
Last Modified: | 31 Dec 2021 08:48 |