Robust clustering based on the most frequent value method

Ferenc Tolner; Sándor Fegyverneki; Balázs Barta; György Eigner

doi:10.35925/j.multi.2023.1.11

Authors

Ferenc Tolner Pannon Business Network Association, Óbuda University Doctoral School of Applied Informatics and Applied Mathematics https://orcid.org/0000-0001-9669-9298
Sándor Fegyverneki University of Miskolc https://orcid.org/0000-0002-4100-8100
Balázs Barta Pannon Business Network Association https://orcid.org/0000-0002-4857-1860
György Eigner University Research and Innovation Center, Physiological Controls Research Center, Óbuda University Biomatics and Applied Artificial Intelligence Institution, John von Neumann Faculty of Informatics, Óbuda University https://orcid.org/0000-0001-8038-2210

DOI:

https://doi.org/10.35925/j.multi.2023.1.11

Keywords:

Most Frequent Value, k-MFVs, outlier map, robust clustering, anomaly detection

Abstract

Assigning observations to highly separable although relatively homogeneous groups is still a challenging task despite the abundance of well-elaborated theories and effective, practical algorithms. Not just the aim of clustering then the underlying data itself influences the choice of method and the way of assessing the results. Outliers and non-normal data distribution can lead to surprising, unstable and many times undesirable clustering results especially in higher dimensions. This implies the importance of some human supervision in case of such unsupervised algorithms as well. In this paper a robust clustering alternative is presented based on the Most Frequent Value Method for crisp-type clustering in case of real-life data. The proposed approach is compared with the k-Medians algorithm. A favourable attribute of the applied procedure is its ease of application on multidimensional data sets where critical judgment of formed groups is particularly troublesome.

Robust clustering based on the most frequent value method

Authors

DOI:

Keywords:

Abstract

Downloads

Published

Issue

Section

Language