TY - JOUR TI - Bringing practical statistical science to AI and predictive model fairness testing AU - Lo, Victor S. Y. AU - Datta, Sayan AU - Salami, Youssouf PY - 2024 PB - Springer Science and Business Media LLC LA - eng AB - AbstractArtificial Intelligence, Machine Learning, Statistical Modeling and Predictive Analytics have been widely used in various industries for a long time. More recently, AI Model Governance including AI Ethics has received significant attention from academia, industry, and regulatory agencies. To minimize potential unjustified treatment disfavoring individuals based on demographics, an increasingly critical task is to assess group fairness through some established metrics. Many commercial and open-source tools are now available to support the computations of these fairness metrics. However, this area is largely based on rules, e.g., metrics within a prespecified range would be considered satisfactory. These metrics are statistical estimates and are often based on limited sample data and therefore subject to sampling variability. For instance, if a fairness criterion is barely met or missed, it is often uncertain if it should be a "pass" or "failure," if the sample size is not large. This is where statistical science can help. Specifically, statistical hypothesis testing enables us to determine whether the sample data can support a particular hypothesis (e.g., falling within an acceptable range) or the observations may have happened by chance. Drawing upon the bioequivalence literature from medicine and advanced hypothesis testing in statistics, we propose a practical statistical significance testing method to enhance the current rule-based process for model fairness testing and its associated power calculation, followed by an illustration with a realistic example. UR - https://doi.org/10.1007/s43681-024-00518-2 DO - 10.1007/s43681-024-00518-2 T2 - AI and ethics SN - 2730-5961 SN - 2730-5953 UR - https://www.pollux-fid.de/r/cr-10.1007/s43681-024-00518-2 H1 - Pollux (Fachinformationsdienst Politikwissenschaft) ER -