A bespoke data linkage of an IVF clinical quality registry to population health datasets; methods and performance
In: International journal of population data science: (IJPDS), Band 6, Heft 1
ISSN: 2399-4908
IntroductionAssisted reproductive technologies (ART), such as in-vitro fertilisation (IVF), have revolutionised the treatment of infertility, with an estimated 8 million babies born worldwide. However, the long-term health outcomes for women and their offspring remain an area of concern. Linking IVF treatment data to long-term health data is the most efficient method for assessing such outcomes.
ObjectivesTo describe the creation and performance of a bespoke population-based data linkage of an ART clinical quality registry to state-based and national administrative datasets.
MethodsThe linked dataset was created by deterministically and probabilistically linking the Australia and New Zealand Assisted Reproduction Database (ANZARD) to New South Wales (NSW) and Australian Capital Territory (ACT) administrative datasets (performed by NSW Centre for Health Record Linkage (CHeReL)) and to national claims datasets (performed by Australian Institute of Health and Welfare (AIHW)). The CHeReL's Master Linkage Key (MLK) was used as a bridge between ANZARD's partially identifiable patient data (statistical linkage key) and NSW and ACT administrative datasets. CHeReL then provided personal identifiers to the AIHW to obtain national content data. The results of the linkage were reported, and concordance between births recorded in ANZARD and perinatal data collections (PDCs) was evaluated.
ResultsOf the 62,833 women who had ART treatment in NSW or ACT, 60,419 could be linked to the CHeReL MLK (linkage rate: 96.2%). A reconciliation of ANZARD-recorded births among NSW residents found that 94.2% (95% CI: 93.9--94.4%) of births were also recorded in state/territory-based PDCs. A high concordance was found in plurality status and birth outcome (≥99% agreement rate, Cohen's kappa ranged: 0.78--0.98) between ANZARD and PDCs.
ConclusionThe data linkage resource demonstrates that high linkage rates can be achieved with partially identifiable data and that a population spine, such as the CHeReL's MLK, can be successfully used as a bridge between clinical registries and administrative datasets.