147
Views
0
CrossRef citations to date
0
Altmetric
Short Technical Note

On Simulating Skewed and Cluster-Weighted Data for Studying Performance of Clustering Algorithms

, , , , &
Pages 303-309 | Received 16 Jun 2022, Accepted 10 Apr 2023, Published online: 20 Jul 2023
 

Abstract

In this article, extensions to the recently introduced concept of pairwise overlap between mixture components are proposed. The notion of overlap is useful for studying the systematic performance of clustering algorithms. Existing methods can be used for simulating elliptical data according to pre-specified overlap characteristics. First, an approach to simulating skewed clusters with a desired overlap is proposed. Next, an extension to measuring overlap in cluster-weighted models is considered. Thus, this article provides important extensions to the existing methods for simulating heterogeneous data for studying the systematic performance of clustering algorithms. Supplementary materials for this article are available online.

Supplementary Materials

Software implementing the functionality of the developed methodology is provided in the supplement.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.