Abstract In this work we are interested in clustering data whose support is “curved”. For this purpose, we will follow a Bayesian nonparametric approach by considering a species sampling mixture model. Our first goal is to define a general/flexible class of distributions, such that they can model data from clusters with non standard shape. To this end, we extend the definition of principal curve given in [8] (Tibshirani 1992) into a Bayesian framework. We propose a new hierarchical model, where the data in each cluster are parametrically distributed around the Bayesian principal curve, and the prior cluster assignment is given on the latent variables at the second level of hierarchy according to a species sampling model. As an application we will consider the detection of seismic faults using data coming from Italian earthquake catalogues.

Argiento, R., Guglielmi, A., Bayesian principal curve clustering by species-sampling mixture models Clustering mediante modelli mistura a campionamento di specie di curve principali bayesiane, in Proceedings of 47th SIS Scientific Meeting of the Italian Statistica Society, (Cagliari, 11-13 June 2014), CUEC editrice, Cagliari 2014: 1-6 [http://hdl.handle.net/10807/145236]

Bayesian principal curve clustering by species-sampling mixture models Clustering mediante modelli mistura a campionamento di specie di curve principali bayesiane

Argiento, Raffaele;
2014

Abstract

Abstract In this work we are interested in clustering data whose support is “curved”. For this purpose, we will follow a Bayesian nonparametric approach by considering a species sampling mixture model. Our first goal is to define a general/flexible class of distributions, such that they can model data from clusters with non standard shape. To this end, we extend the definition of principal curve given in [8] (Tibshirani 1992) into a Bayesian framework. We propose a new hierarchical model, where the data in each cluster are parametrically distributed around the Bayesian principal curve, and the prior cluster assignment is given on the latent variables at the second level of hierarchy according to a species sampling model. As an application we will consider the detection of seismic faults using data coming from Italian earthquake catalogues.
2014
Inglese
Proceedings of 47th SIS Scientific Meeting of the Italian Statistica Society
47th SIS Scientific Meeting of the Italian Statistica Society
Cagliari
11-giu-2014
13-giu-2014
978-88-8467-874-4
CUEC editrice
Argiento, R., Guglielmi, A., Bayesian principal curve clustering by species-sampling mixture models Clustering mediante modelli mistura a campionamento di specie di curve principali bayesiane, in Proceedings of 47th SIS Scientific Meeting of the Italian Statistica Society, (Cagliari, 11-13 June 2014), CUEC editrice, Cagliari 2014: 1-6 [http://hdl.handle.net/10807/145236]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/145236
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact