The analysis of large-scale datasets, especially in biomedical contexts, frequently involves a principled screening of multiple hypotheses. The celebrated two-group model jointly models the distribution of the test statistics with mixtures of two competing densities, the null and the alternative distributions. We investigate the use of weighted densities and, in particular, non-local densities as working alternative distributions, to enforce separation from the null and thus refine the screening procedure. We show how these weighted alternatives improve various operating characteristics, such as the Bayesian false discovery rate, of the resulting tests for a fixed mixture proportion with respect to a local, unweighted likelihood approach. Parametric and nonparametric model specifications are proposed, along with efficient samplers for posterior inference. By means of a simulation study, we exhibit how our model compares with both well-established and state-of-the-art alternatives in terms of various operating characteristics. Finally, to illustrate the versatility of our method, we conduct three differential expression analyses with publicly-available datasets from genomic studies of heterogeneous nature.

Denti, F., Peluso, S., Guindani, M., Mira, A., Multiple hypothesis screening using mixtures of non-local distributions with applications to genomic studies, <<STATISTICS IN MEDICINE>>, 2023; (N/A): 1-15. [doi:10.1002/sim.9705] [https://hdl.handle.net/10807/228356]

Multiple hypothesis screening using mixtures of non-local distributions with applications to genomic studies

Denti, Francesco
Primo
;
Peluso, Stefano;
2023

Abstract

The analysis of large-scale datasets, especially in biomedical contexts, frequently involves a principled screening of multiple hypotheses. The celebrated two-group model jointly models the distribution of the test statistics with mixtures of two competing densities, the null and the alternative distributions. We investigate the use of weighted densities and, in particular, non-local densities as working alternative distributions, to enforce separation from the null and thus refine the screening procedure. We show how these weighted alternatives improve various operating characteristics, such as the Bayesian false discovery rate, of the resulting tests for a fixed mixture proportion with respect to a local, unweighted likelihood approach. Parametric and nonparametric model specifications are proposed, along with efficient samplers for posterior inference. By means of a simulation study, we exhibit how our model compares with both well-established and state-of-the-art alternatives in terms of various operating characteristics. Finally, to illustrate the versatility of our method, we conduct three differential expression analyses with publicly-available datasets from genomic studies of heterogeneous nature.
2023
Inglese
Denti, F., Peluso, S., Guindani, M., Mira, A., Multiple hypothesis screening using mixtures of non-local distributions with applications to genomic studies, <<STATISTICS IN MEDICINE>>, 2023; (N/A): 1-15. [doi:10.1002/sim.9705] [https://hdl.handle.net/10807/228356]
File in questo prodotto:
File Dimensione Formato  
09_SIM_Multiple hypothesis screening using mixtures of nonlocal distributions with.pdf

accesso aperto

Licenza: Creative commons
Dimensione 1.99 MB
Formato Adobe PDF
1.99 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/228356
Citazioni
  • ???jsp.display-item.citation.pmc??? 0
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact