We found a match
Your institution may have rights to this item. Sign in to continue.
- Title
A data-driven approach to choosing privacy parameters for clinical trial data sharing under differential privacy.
- Authors
Chen, Henian; Pang, Jinyong; Zhao, Yayi; Giddens, Spencer; Ficek, Joseph; Valente, Matthew J; Cao, Biwei; Daley, Ellen
- Abstract
Objectives Clinical trial data sharing is crucial for promoting transparency and collaborative efforts in medical research. Differential privacy (DP) is a formal statistical technique for anonymizing shared data that balances privacy of individual records and accuracy of replicated results through a "privacy budget" parameter, ε. DP is considered the state of the art in privacy-protected data publication and is underutilized in clinical trial data sharing. This study is focused on identifying ε values for the sharing of clinical trial data. Materials and Methods We analyzed 2 clinical trial datasets with privacy budget ε ranging from 0.01 to 10. Smaller values of ε entail adding greater amounts of random noise, with better privacy as a result. Comparison of rates, odds ratios, means, and mean differences between the original clinical trial datasets and the empirical distribution of the DP estimator was performed. Results The DP rate closely approximated the original rate of 6.5% when ε > 1. The DP odds ratio closely aligned with the original odds ratio of 0.689 when ε ≥ 3. The DP mean closely approximated the original mean of 164.64 when ε ≥ 1. As ε increased to 5, both the minimum and maximum DP means converged toward the original mean. Discussion There is no consensus on how to choose the privacy budget ε. The definition of DP does not specify the required level of privacy, and there is no established formula for determining ε. Conclusion Our findings suggest that the application of DP holds promise in the context of sharing clinical trial data.
- Subjects
CLINICAL trials; INFORMATION sharing; PRIVACY; ODDS ratio; MEDICAL research
- Publication
Journal of the American Medical Informatics Association, 2024, Vol 31, Issue 5, p1135
- ISSN
1067-5027
- Publication type
Article
- DOI
10.1093/jamia/ocae038