This work deals with the classification problem in the case that groups are known and both labeled and unlabeled data are available. The classification rule is derived using Gaussian mixtures, with covariance matrices fixed according to a multiple testing procedure, which allows to choose among four alternatives: heteroscedasticity, homometroscedasticity, homotroposcedasticity, and homoscedasticity. The mixture models are then fitted using all available data (labeled and unlabeled) and adopting the EM and the CEM algorithms. Applications on real data are provided in order to show the classification performance of the proposed procedure.
Bagnato, L., Greselin, F., Model-Based Clustering and Classification via Patterned Covariance Analysis, Abstract de <<CLAssification and Data Analysis Group of the Italian Statistical Society, 8th Scientific Meeting University of Pavia>>, (Pavia, 07-09 September 2011 ), Cerchiello, P. Tarantola, C., Pavia 2011: 4-4 [http://hdl.handle.net/10807/42320]
Model-Based Clustering and Classification via Patterned Covariance Analysis
Bagnato, Luca;
2011
Abstract
This work deals with the classification problem in the case that groups are known and both labeled and unlabeled data are available. The classification rule is derived using Gaussian mixtures, with covariance matrices fixed according to a multiple testing procedure, which allows to choose among four alternatives: heteroscedasticity, homometroscedasticity, homotroposcedasticity, and homoscedasticity. The mixture models are then fitted using all available data (labeled and unlabeled) and adopting the EM and the CEM algorithms. Applications on real data are provided in order to show the classification performance of the proposed procedure.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.