- Title
Managing re-identification risks while providing access to the All of Us research program.
- Authors
Xia, Weiyi; Basford, Melissa; Carroll, Robert; Clayton, Ellen Wright; Harris, Paul; Kantarcioglu, Murat; Liu, Yongtai; Nyemba, Steve; Vorobeychik, Yevgeniy; Wan, Zhiyu; Malin, Bradley A
- Abstract
Objective: The All of Us Research Program makes individual-level data available to researchers while protecting the participants' privacy. This article describes the protections embedded in the multistep access process, with a particular focus on how the data was transformed to meet generally accepted re-identification risk levels. Methods: At the time of the study, the resource consisted of 329 084 participants. Systematic amendments were applied to the data to mitigate re-identification risk (eg, generalization of geographic regions, suppression of public events, and randomization of dates). We computed the re-identification risk for each participant using a state-of-the-art adversarial model, specifically assuming that it is known that someone is a participant in the program. We confirmed that the expected risk is no greater than 0.09, a threshold that is consistent with guidelines from various US state and federal agencies. We further investigated how risk varied as a function of participant demographics. Results: The results indicated that the 95th percentile of the re-identification risk across all participants is below current thresholds. At the same time, we observed that risk levels were higher for certain race, ethnic, and gender groups. Conclusions: While the re-identification risk was sufficiently low, this does not imply that the system is devoid of risk. Rather, All of Us uses a multipronged data protection strategy that includes strong authentication practices, active monitoring of data misuse, and penalization mechanisms for users who violate terms of service.
- Subjects
DATA protection; DATA privacy; RACE; ELECTRONIC health records; GOVERNMENT agencies; IDENTIFICATION; ETHNICITY
- Publication
Journal of the American Medical Informatics Association, 2023, Vol 30, Issue 5, p907
- ISSN
1067-5027
- Publication type
Article
- DOI
10.1093/jamia/ocad021