Learning the structure of dependence relations between variables is a pervasive issue in the statistical literature. A directed acyclic graph (DAG) can represent a set of conditional independencies, but different DAGs may encode the same set of relations and are indistinguishable using observational data. Equivalent DAGs can be collected into classes, each represented by a partially directed graph known as essential graph (EG). Structure learning directly conducted on the EG space, rather than on the allied space of DAGs, leads to theoretical and computational benefits. Still, the majority of efforts has been dedicated to Gaussian data, with less attention to methods designed for multivariate categorical data. A Bayesian methodology for structure learning of categorical EGs is then proposed. Combining a constructive parameter prior elicitation with a graph-driven likelihood decomposition, a closed-form expression for the marginal likelihood of a categorical EG model is derived. Asymptotic properties are studied, and an MCMC sampler scheme developed for approximate posterior inference. The methodology is evaluated on both simulated scenarios and real data, with appreciable performance in comparison with state-of-the-art methods.

Castelletti, F., Peluso, S., Equivalence class selection of categorical graphical models, <<COMPUTATIONAL STATISTICS & DATA ANALYSIS>>, 2021; 164 (N/A): N/A-N/A. [doi:10.1016/j.csda.2021.107304] [https://hdl.handle.net/10807/182891]

Equivalence class selection of categorical graphical models

Castelletti, Federico
Primo
;
Peluso, Stefano
Secondo
2021

Abstract

Learning the structure of dependence relations between variables is a pervasive issue in the statistical literature. A directed acyclic graph (DAG) can represent a set of conditional independencies, but different DAGs may encode the same set of relations and are indistinguishable using observational data. Equivalent DAGs can be collected into classes, each represented by a partially directed graph known as essential graph (EG). Structure learning directly conducted on the EG space, rather than on the allied space of DAGs, leads to theoretical and computational benefits. Still, the majority of efforts has been dedicated to Gaussian data, with less attention to methods designed for multivariate categorical data. A Bayesian methodology for structure learning of categorical EGs is then proposed. Combining a constructive parameter prior elicitation with a graph-driven likelihood decomposition, a closed-form expression for the marginal likelihood of a categorical EG model is derived. Asymptotic properties are studied, and an MCMC sampler scheme developed for approximate posterior inference. The methodology is evaluated on both simulated scenarios and real data, with appreciable performance in comparison with state-of-the-art methods.
2021
Inglese
Castelletti, F., Peluso, S., Equivalence class selection of categorical graphical models, <<COMPUTATIONAL STATISTICS & DATA ANALYSIS>>, 2021; 164 (N/A): N/A-N/A. [doi:10.1016/j.csda.2021.107304] [https://hdl.handle.net/10807/182891]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/182891
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 1
social impact