We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
A mixture model-based approach to the clustering of microarray expression data.
- Authors
McLachlan, G J; Bean, R W; Peel, D
- Abstract
This paper introduces the software EMMIX-GENE that has been developed for the specific purpose of a model-based approach to the clustering of microarray expression data, in particular, of tissue samples on a very large number of genes. The latter is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. A feasible approach is provided by first selecting a subset of the genes relevant for the clustering of the tissue samples by fitting mixtures of t distributions to rank the genes in order of increasing size of the likelihood ratio statistic for the test of one versus two components in the mixture model. The imposition of a threshold on the likelihood ratio statistic used in conjunction with a threshold on the size of a cluster allows the selection of a relevant set of genes. However, even this reduced set of genes will usually be too large for a normal mixture model to be fitted directly to the tissues, and so the use of mixtures of factor analyzers is exploited to reduce effectively the dimension of the feature space of genes.
- Publication
Bioinformatics (Oxford, England), 2002, Vol 18, Issue 3, p413
- ISSN
1367-4803
- Publication type
Journal Article
- DOI
10.1093/bioinformatics/18.3.413