Universiti Teknologi Malaysia Institutional Repository

Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure

Sharma, K. K. and Seal, A. and Yazidi, A. and Selamat, A. and Krejcar, O. (2021) Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure. IEEE Access, 9 . ISSN 2169-3536

[img]
Preview
PDF
3MB

Official URL: http://dx.doi.org/10.1109/ACCESS.2021.3083969

Abstract

In recent years, uncertain data clustering has become the subject of active research in many fields, for example, pattern recognition, and machine learning. Nowadays, researchers have committed themselves to substitute the traditional distance or similarity measures with new metrics in the existing centralized clustering algorithms in order to tackle uncertainty in data. However, in order to perform uncertain data clustering, representation plays an imperative role. In this paper, a Monte-Carlo integration is adopted and modified to express uncertain data in a probabilistic form. Then three similarity measures are used to determine the closeness between two probability distributions including one novel measure. These similarity measures are derived from the notion of Kullback-Leibler divergence and Jeffreys divergence. Finally, density-based spatial clustering of applications with noise and $k$ -medoids algorithms are modified and implemented on one synthetic database and three real-world uncertain databases. The obtained outcomes confirm that the proposed clustering technique defeats some of the existing algorithms.

Item Type:Article
Uncontrolled Keywords:bipartite matching, probability density estimation, uncertain data clustering
Subjects:T Technology > T Technology (General)
Divisions:Malaysia-Japan International Institute of Technology
ID Code:95349
Deposited By: Narimah Nawil
Deposited On:29 Apr 2022 22:32
Last Modified:29 Apr 2022 22:32

Repository Staff Only: item control page