We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Exploring the diagnostic performance of machine learning in prediction of metabolic phenotypes focusing on thyroid function.
- Authors
Ahn, Hyeong Jun; Ishikawa, Kyle; Kim, Min-Hee
- Abstract
In this study, we employed various machine learning models to predict metabolic phenotypes, focusing on thyroid function, using a dataset from the National Health and Nutrition Examination Survey (NHANES) from 2007 to 2012. Our analysis utilized laboratory parameters relevant to thyroid function or metabolic dysregulation in addition to demographic features, aiming to uncover potential associations between thyroid function and metabolic phenotypes by various machine learning methods. Multinomial Logistic Regression performed best to identify the relationship between thyroid function and metabolic phenotypes, achieving an area under receiver operating characteristic curve (AUROC) of 0.818, followed closely by Neural Network (AUROC: 0.814). Following the above, the performance of Random Forest, Boosted Trees, and K Nearest Neighbors was inferior to the first two methods (AUROC 0.811, 0.811, and 0.786, respectively). In Random Forest, homeostatic model assessment for insulin resistance, serum uric acid, serum albumin, gamma glutamyl transferase, and triiodothyronine/thyroxine ratio were positioned in the upper ranks of variable importance. These results highlight the potential of machine learning in understanding complex relationships in health data. However, it's important to note that model performance may vary depending on data characteristics and specific requirements. Furthermore, we emphasize the significance of accounting for sampling weights in complex survey data analysis and the potential benefits of incorporating additional variables to enhance model accuracy and insights. Future research can explore advanced methodologies combining machine learning, sample weights, and expanded variable sets to further advance survey data analysis.
- Subjects
MACHINE learning; MACHINE performance; NATIONAL Health & Nutrition Examination Survey; RECEIVER operating characteristic curves; THYROID gland; PHENOTYPES; THYROID gland function tests
- Publication
PLoS ONE, 2024, Vol 19, Issue 6, p1
- ISSN
1932-6203
- Publication type
Academic Journal
- DOI
10.1371/journal.pone.0304785