Sharma, K. K. and Seal, A. and Yazidi, A. and Selamat, A. and Krejcar, O. (2021) Clustering uncertain data objects using jeffreys-divergence and maximum bipartite matching based similarity measure. IEEE Access, 9 . ISSN 2169-3536
|
PDF
3MB |
Official URL: http://dx.doi.org/10.1109/ACCESS.2021.3083969
Abstract
In recent years, uncertain data clustering has become the subject of active research in many fields, for example, pattern recognition, and machine learning. Nowadays, researchers have committed themselves to substitute the traditional distance or similarity measures with new metrics in the existing centralized clustering algorithms in order to tackle uncertainty in data. However, in order to perform uncertain data clustering, representation plays an imperative role. In this paper, a Monte-Carlo integration is adopted and modified to express uncertain data in a probabilistic form. Then three similarity measures are used to determine the closeness between two probability distributions including one novel measure. These similarity measures are derived from the notion of Kullback-Leibler divergence and Jeffreys divergence. Finally, density-based spatial clustering of applications with noise and $k$ -medoids algorithms are modified and implemented on one synthetic database and three real-world uncertain databases. The obtained outcomes confirm that the proposed clustering technique defeats some of the existing algorithms.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | bipartite matching, probability density estimation, uncertain data clustering |
Subjects: | T Technology > T Technology (General) |
Divisions: | Malaysia-Japan International Institute of Technology |
ID Code: | 95349 |
Deposited By: | Narimah Nawil |
Deposited On: | 29 Apr 2022 22:32 |
Last Modified: | 29 Apr 2022 22:32 |
Repository Staff Only: item control page