IRIS PubliCatt

Data quality from poor and socially deprived regions have given rise to many statistical challenges. One of them is the underreporting of vital events leading to biased estimates for the associated risks. To deal with underreported count data, models based on compound Poisson distributions have been commonly assumed. To be identifiable, such models usually require extra and strong information about the probability of reporting the event in all areas of interest, which is not always available. We introduce a novel approach for the compound Poisson model assuming that the areas are clustered according to their data quality. We leverage these clusters to create a hierarchical structure in which the reporting probabilities decrease as we move from the best group to the worst ones.We obtain constraints for model identifiability and prove that only prior information about the reporting probability in areas experiencing the best data quality is required. Several approaches to model the uncertainty about the reporting probabilities are presented, including reference priors. Different features regarding the proposed methodology are studied through simulation. We apply our model to map the early neonatal mortality risks in Minas Gerais, a Brazilian state that presents heterogeneous characteristics and a relevant socio-economical inequality.

Lopes De Oliveira, G., Argiento, R., Helena Loschi, R., Martins Assuncao, R., Ruggeri, F., D’Elia Branco, M., Bias Correction in Clustered Underreported Data, <<BAYESIAN ANALYSIS>>, 2020; (NA): 1-32. [doi:10.1214/20-BA1244] [http://hdl.handle.net/10807/163433]

Bias Correction in Clustered Underreported Data

Guilherme Lopes de Oliveira;Argiento, Raffaele;Rosangela Helena Loschi;Renato Martins Assuncao;Fabrizio Ruggeri;Marcia D’Elia Branco

2020

Abstract

Data quality from poor and socially deprived regions have given rise to many statistical challenges. One of them is the underreporting of vital events leading to biased estimates for the associated risks. To deal with underreported count data, models based on compound Poisson distributions have been commonly assumed. To be identifiable, such models usually require extra and strong information about the probability of reporting the event in all areas of interest, which is not always available. We introduce a novel approach for the compound Poisson model assuming that the areas are clustered according to their data quality. We leverage these clusters to create a hierarchical structure in which the reporting probabilities decrease as we move from the best group to the worst ones.We obtain constraints for model identifiability and prove that only prior information about the reporting probability in areas experiencing the best data quality is required. Several approaches to model the uncertainty about the reporting probabilities are presented, including reference priors. Different features regarding the proposed methodology are studied through simulation. We apply our model to map the early neonatal mortality risks in Minas Gerais, a Brazilian state that presents heterogeneous characteristics and a relevant socio-economical inequality.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno pubblicazione
	
				2020
			
	Lingua del contenuto
	
				Inglese
			
	Nome del periodico
	
				BAYESIAN ANALYSIS
			
	DOI del contributo
	
				https://dx.doi.org/10.1214/20-BA1244
			
	URL alternativo
	
				https://projecteuclid.org/euclid.ba/1600999224
			
	Citazione
	
				Lopes De Oliveira, G., Argiento, R., Helena Loschi, R., Martins Assuncao, R., Ruggeri, F., D’Elia Branco, M., Bias Correction in Clustered Underreported Data, <<BAYESIAN ANALYSIS>>, 2020;  (NA): 1-32. [doi:10.1214/20-BA1244] [http://hdl.handle.net/10807/163433]
			
	Appare nelle tipologie:
	
				Articolo in rivista, Nota a sentenza

File in questo prodotto:

File	Dimensione	Formato
Paper_brazil_Ufficiale.pdf accesso aperto Tipologia file ?: Versione Editoriale (PDF) Licenza: Creative commons Dimensione 4.53 MB Formato Adobe PDF Visualizza/Apri	4.53 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/163433

Citazioni

ND

10

9

social impact