IRIS UniCatt

Mixtures of matrix Gaussian distributions provide a probabilistic framework for clustering continuous matrix-variate data, which are increasingly common in various fields. Despite their widespread use and successful applications, these models suffer from over-parameterization, making them not suitable for even moderately sized matrix-variate data. To address this issue, we introduce a sparse model-based clustering approach for three-way data. Our approach assumes that the matrix mixture parameters are sparse and have different degrees of sparsity across clusters, enabling the induction of parsimony in a flexible manner. Estimation relies on the maximization of a penalized likelihood, with specifically tailored group and graphical lasso penalties. These penalties facilitate the selection of the most informative features for clustering three-way data where variables are recorded over multiple occasions, as well as allowing the identification of cluster-specific association structures. We conduct extensive testing of the proposed methodology on synthetic data and validate its effectiveness through an application to time-dependent crime patterns across multiple U.S. cities. Supplementary files for this article are available online.

Cappozzo, A., Casa, A., Fop, M., Sparse model-based clustering of three-way data via lasso-type penalties, <<JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS>>, 2024; (00): 1-21. [doi:10.1080/10618600.2024.2429705] [https://hdl.handle.net/10807/304037]

Sparse model-based clustering of three-way data via lasso-type penalties

Cappozzo, Andrea^Co-primo;Casa, Alessandro;Fop, Michael

2024

Abstract

Mixtures of matrix Gaussian distributions provide a probabilistic framework for clustering continuous matrix-variate data, which are increasingly common in various fields. Despite their widespread use and successful applications, these models suffer from over-parameterization, making them not suitable for even moderately sized matrix-variate data. To address this issue, we introduce a sparse model-based clustering approach for three-way data. Our approach assumes that the matrix mixture parameters are sparse and have different degrees of sparsity across clusters, enabling the induction of parsimony in a flexible manner. Estimation relies on the maximization of a penalized likelihood, with specifically tailored group and graphical lasso penalties. These penalties facilitate the selection of the most informative features for clustering three-way data where variables are recorded over multiple occasions, as well as allowing the identification of cluster-specific association structures. We conduct extensive testing of the proposed methodology on synthetic data and validate its effectiveness through an application to time-dependent crime patterns across multiple U.S. cities. Supplementary files for this article are available online.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno pubblicazione
	
				2024
			
	Lingua del contenuto
	
				Inglese
			
	Nome del periodico
	
				JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS
			
	DOI del contributo
	
				https://dx.doi.org/10.1080/10618600.2024.2429705
			
	Citazione
	
				Cappozzo, A., Casa, A., Fop, M., Sparse model-based clustering of three-way data via lasso-type penalties, <<JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS>>, 2024;  (00): 1-21. [doi:10.1080/10618600.2024.2429705] [https://hdl.handle.net/10807/304037]
			
	Appare nelle tipologie:
	
				Articolo in rivista, Nota a sentenza

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/304037

Citazioni

ND

1

1

social impact