IRIS PubliCatt

Two matrix-variate distributions, both elliptical heavy-tailed generalization of the matrix-variate normal distribution, are introduced. They belong to the normal scale mixture family, and are respectively obtained by choosing a convenient shifted exponential or uniform as mixing distribution. Moreover, they have a closed-form for the probability density function that is characterized by only one additional parameter, with respect to the nested matrix-variate normal, governing the tail-weight. Both distributions are then used for model-based clustering via finite mixture models. The resulting mixtures, being able to handle data with atypical observations in a better way than the matrix-variate normal mixture, can avoid the disruption of the true underlying group structure. Different EM-based algorithms are implemented for parameter estimation and tested in terms of computational times and parameter recovery. Furthermore, these mixture models are fitted to simulated and real datasets, and their fitting and clustering performances are analyzed and compared to those obtained by other well-established competitors.

Tomarchio, S. D., Punzo, A., Bagnato, L., Two new matrix-variate distributions with application in model-based clustering, <<COMPUTATIONAL STATISTICS & DATA ANALYSIS>>, 2020; 152 (107050): 107050-107071. [doi:10.1016/j.csda.2020.107050] [http://hdl.handle.net/10807/160319]

Two new matrix-variate distributions with application in model-based clustering

Tomarchio, Salvatore D.;Punzo, Antonio;Bagnato, Luca

2020

Abstract

Two matrix-variate distributions, both elliptical heavy-tailed generalization of the matrix-variate normal distribution, are introduced. They belong to the normal scale mixture family, and are respectively obtained by choosing a convenient shifted exponential or uniform as mixing distribution. Moreover, they have a closed-form for the probability density function that is characterized by only one additional parameter, with respect to the nested matrix-variate normal, governing the tail-weight. Both distributions are then used for model-based clustering via finite mixture models. The resulting mixtures, being able to handle data with atypical observations in a better way than the matrix-variate normal mixture, can avoid the disruption of the true underlying group structure. Different EM-based algorithms are implemented for parameter estimation and tested in terms of computational times and parameter recovery. Furthermore, these mixture models are fitted to simulated and real datasets, and their fitting and clustering performances are analyzed and compared to those obtained by other well-established competitors.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno pubblicazione
	
				2020
			
	Lingua del contenuto
	
				Inglese
			
	Nome del periodico
	
				COMPUTATIONAL STATISTICS & DATA ANALYSIS
			
	DOI del contributo
	
				https://dx.doi.org/10.1016/j.csda.2020.107050
			
	Citazione
	
				Tomarchio, S. D., Punzo, A., Bagnato, L., Two new matrix-variate distributions with application in model-based clustering, <<COMPUTATIONAL STATISTICS &amp; DATA ANALYSIS>>, 2020;  152 (107050): 107050-107071. [doi:10.1016/j.csda.2020.107050] [http://hdl.handle.net/10807/160319]
			
	Appare nelle tipologie:
	
				Articolo in rivista, Nota a sentenza

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/160319

Citazioni

ND

23

19

social impact