Article (electronic), 18 March 2024

Data Imbalances in Coincidence Analysis: A Simulation Study

In: Sociological methods and research

Swiatczak, Martyna Daria; Baumgartner, Michael

Open Access

Abstract

In this paper, we investigate the conditions under which data imbalances, a common data characteristic that occurs when factor values are unevenly distributed, are problematic for the performance of Coincidence Analysis (CNA). We further examine how such imbalances relate to fragmentation and noise in data. We show that even extreme data imbalances, when not combined with fragmentation or noise, do not negatively affect CNA's performance. However, an extended series of simulation experiments on fuzzy-set data reveals that, when mixed with fragmentation or noise, data imbalances may substantially impair CNA's performance. Furthermore, we find that the performance impairment is higher when endogenous factors are imbalanced than when exogenous factors are concerned. Our results allow us to quantify these impacts and demarcate degrees at which data imbalances should be considered as problematic. Thus, applied researchers can use our demarcation guidelines to enhance the validity of their studies.

Language

English

Publisher

SAGE Publications

ISSN: 1552-8294

DOI

10.1177/00491241241227039
