The intrinsic dimension (id) of a dataset conveys essential information regarding the complexity of the underlying data-generating process. In particular, it describes the di- mensionality of the latent manifold on which the data-generating probability distribution has support. Complex datasets may be characterized by multiple manifolds having differ- ent ids. To properly estimate these heterogeneous ids, a recent modeling approach uses finite scale mixtures of Pareto distributions aided by a homogeneity-inducing term in the likelihood. In this contribution, we explore a different modeling perspective, estimating Pareto’s scale mixtures via spatial product partition models. We present the general idea and introduce Spider, our Bayesian nonparametric approach. Finally, we showcase some encouraging preliminary results.

Denti, F., Di Noia, A., Mira, A., Bayesian nonparametric estimation of heterogeneous intrinsic dimension via product partition models, in Book of the Short Papers SEAS IN 2023, (Ancona, 21-23 June 2023), Pearson, Ancona 2023: 316-321 [https://hdl.handle.net/10807/249474]

Bayesian nonparametric estimation of heterogeneous intrinsic dimension via product partition models

Denti, Francesco
;
2023

Abstract

The intrinsic dimension (id) of a dataset conveys essential information regarding the complexity of the underlying data-generating process. In particular, it describes the di- mensionality of the latent manifold on which the data-generating probability distribution has support. Complex datasets may be characterized by multiple manifolds having differ- ent ids. To properly estimate these heterogeneous ids, a recent modeling approach uses finite scale mixtures of Pareto distributions aided by a homogeneity-inducing term in the likelihood. In this contribution, we explore a different modeling perspective, estimating Pareto’s scale mixtures via spatial product partition models. We present the general idea and introduce Spider, our Bayesian nonparametric approach. Finally, we showcase some encouraging preliminary results.
2023
Inglese
Book of the Short Papers SEAS IN 2023
SIS 2023 - Statistical Learning, Sustainability and Impact Evaluation
Ancona
21-giu-2023
23-giu-2023
9788891935618AAVV
Pearson
Denti, F., Di Noia, A., Mira, A., Bayesian nonparametric estimation of heterogeneous intrinsic dimension via product partition models, in Book of the Short Papers SEAS IN 2023, (Ancona, 21-23 June 2023), Pearson, Ancona 2023: 316-321 [https://hdl.handle.net/10807/249474]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/249474
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact