Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/26041
Title: Estimating the Optimal Number of Clusters from Subsets of Ensembles
Authors: Odebode, A
Tucker, A
Arzoky, M
Swift, S
Keywords: ensemble clustering;subset selection;cluster analysis;number of clusters
Issue Date: 11-Jul-2022
Citation: Odebode, A. et al. (2022) 'Estimating the Optimal Number of Clusters from Subsets of Ensembles', Proceedings of the 11th International Conference on Data Science, Technology and Applications, Lisbon, Portugal, 11-13 July, pp. 383 - 391. doi: 10.5220/0011275000003269.
Abstract: This research estimates the optimal number of clusters in a dataset using a novel ensemble technique - a preferred alternative to relying on the output of a single clustering. Combining clusterings from different algorithms can lead to a more stable and robust solution, often unattainable by any single clustering solution. Technically, we created subsets of ensembles as possible estimates; and evaluated them using a quality metric to obtain the best subset. We tested our method on publicly available datasets of varying types, sources and clustering difficulty to establish the accuracy and performance of our approach against eight standard methods. Our method outperforms all the techniques in the number of clusters estimated correctly. Due to the exhaustive nature of the initial algorithm, it is slow as the number of ensembles or the solution space increases; hence, we have provided an updated version based on the single-digit difference of Gray code that runs in linear time in terms of the subset size.
URI: https://bura.brunel.ac.uk/handle/2438/26041
DOI: https://doi.org/10.5220/0011275000003269
ISBN: 978-989-758-583-8
Other Identifiers: ORCID iDs: Mahir Arzoky https://orcid.org/0000-0002-2721-643X; Allan Tucker https://orcid.org/0000-0001-5105-3506.
Appears in Collections:Dept of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright (c) 2022 by SCITEPRESS – Science and Technology Publications, Lda. Under a Creative Commons (CC BY-NC-ND) Attribution Licence (https://creativecommons.org/licenses/by-nc-nd/4.0/).602.55 kBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons