We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
A model-based clustering algorithm with covariates adjustment and its application to lung cancer stratification.
- Authors
Relvas, Carlos E. M.; Nakata, Asuka; Chen, Guoan; Beer, David G.; Gotoh, Noriko; Fujita, Andre
- Abstract
Usually, the clustering process is the first step in several data analyses. Clustering allows identify patterns we did not note before and helps raise new hypotheses. However, one challenge when analyzing empirical data is the presence of covariates, which may mask the obtained clustering structure. For example, suppose we are interested in clustering a set of individuals into controls and cancer patients. A clustering algorithm could group subjects into young and elderly in this case. It may happen because the age at diagnosis is associated with cancer. Thus, we developed CEM-Co, a model-based clustering algorithm that removes/minimizes undesirable covariates' effects during the clustering process. We applied CEM-Co on a gene expression dataset composed of 129 stage I non-small cell lung cancer patients. As a result, we identified a subgroup with a poorer prognosis, while standard clustering algorithms failed.
- Subjects
LUNG cancer; NON-small-cell lung carcinoma; ALGORITHMS
- Publication
Journal of Bioinformatics & Computational Biology, 2023, Vol 21, Issue 4, p1
- ISSN
0219-7200
- Publication type
Article
- DOI
10.1142/S0219720023500191