We found a match
Your institution may have rights to this item. Sign in to continue.
- Title
Evaluating and mitigating unfairness in multimodal remote mental health assessments.
- Authors
Jiang, Zifan; Seyedi, Salman; Griner, Emily; Abbasi, Ahmed; Rad, Ali Bahrami; Kwon, Hyeokhyen; Cotes, Robert O.; Clifford, Gari D.
- Abstract
Research on automated mental health assessment tools has been growing in recent years, often aiming to address the subjectivity and bias that existed in the current clinical practice of the psychiatric evaluation process. Despite the substantial health and economic ramifications, the potential unfairness of those automated tools was understudied and required more attention. In this work, we systematically evaluated the fairness level in a multimodal remote mental health dataset and an assessment system, where we compared the fairness level in race, gender, education level, and age. Demographic parity ratio (DPR) and equalized odds ratio (EOR) of classifiers using different modalities were compared, along with the F1 scores in different demographic groups. Post-training classifier threshold optimization was employed to mitigate the unfairness. No statistically significant unfairness was found in the composition of the dataset. Varying degrees of unfairness were identified among modalities, with no single modality consistently demonstrating better fairness across all demographic variables. Post-training mitigation effectively improved both DPR and EOR metrics at the expense of a decrease in F1 scores. Addressing and mitigating unfairness in these automated tools are essential steps in fostering trust among clinicians, gaining deeper insights into their use cases, and facilitating their appropriate utilization. Author summary: In this work, we systematically explored and discussed the unfairness reporting and mitigation of automated mental health assessment tools. These tools are becoming increasingly important in mental health practice, especially with the rise of telehealth services and large language model applications. However, they often carry inherent biases. Without proper assessment and mitigation, they potentially lead to unfair treatment of certain demographic groups and significant harm. Proper unfairness reporting and mitigation of these tools is the first step to building trust among clinicians and patients and ensuring appropriate application. Using our previously developed multimodal mental health assessment system, we evaluated the unfairness level of using various types of features of the subjects for mental health assessment, including facial expressions, acoustic features of the voice, emotions expressed through language, general language representations generated by large language models, and cardiovascular patterns detected from the face. We analyzed the system's fairness across different demographics: race, gender, education level, and age. We found no single modality consistently fair across all demographics. While unfairness mitigation methods improved the fairness level, we found a trade-off between the performance and the fairness level, calling for broader moral discussion and investigation on the topic.
- Subjects
HEALTH services accessibility; MENTAL health services; CONCEPTUAL models; RESEARCH funding; MENTAL health; QUESTIONNAIRES; TELEPSYCHIATRY; DESCRIPTIVE statistics; MANN Whitney U Test; CHI-squared test; PSYCHOLOGY; ODDS ratio; HEALTH equity
- Publication
PLoS Digital Health, 2024, Vol 3, Issue 7, p1
- ISSN
2767-3170
- Publication type
Article
- DOI
10.1371/journal.pdig.0000413