Hedonic models are widely used to predict selling prices of properties. Originally, they were proposed as simple spatial regressions, i.e. a spatially referenced response regressed on spatially referenced predictors. Subsequently, spatial random effects were introduced to serve as surrogates for unmeasured or unobservable predictors and were shown to provide better out-of-sample prediction. However, what has been ignored in the literature is the fact that the locations (and times) of the sales are random and, in fact, are an observation of a random point pattern. Here, we first consider whether there is stochastic dependence between the point pattern of locations and the set of responses. If so, a second question is whether incorporating a log-intensity for the point pattern of locations in the hedonic modelling enables improvement in the prediction of selling price. We connect this problem to what is referred to as preferential sampling. Through model comparison we illuminate the role of the point pattern data in the prediction of selling price. Using two different years of property sales from Zaragoza, Spain, we employ both the full database as well as an intentionally biased subset to elaborate this story.
Paci, L., Gelfand, A. E., Beamonte, M. A., Gargallo, P., Salvador, M., Spatial hedonic modelling adjusted for preferential sampling, <<JOURNAL OF THE ROYAL STATISTICAL SOCIETY. SERIES A. STATISTICS IN SOCIETY>>, 2020; 183 (1): 169-192. [doi:10.1111/rssa.12489] [http://hdl.handle.net/10807/146654]
Spatial hedonic modelling adjusted for preferential sampling
Paci, Lucia
Primo
;
2020
Abstract
Hedonic models are widely used to predict selling prices of properties. Originally, they were proposed as simple spatial regressions, i.e. a spatially referenced response regressed on spatially referenced predictors. Subsequently, spatial random effects were introduced to serve as surrogates for unmeasured or unobservable predictors and were shown to provide better out-of-sample prediction. However, what has been ignored in the literature is the fact that the locations (and times) of the sales are random and, in fact, are an observation of a random point pattern. Here, we first consider whether there is stochastic dependence between the point pattern of locations and the set of responses. If so, a second question is whether incorporating a log-intensity for the point pattern of locations in the hedonic modelling enables improvement in the prediction of selling price. We connect this problem to what is referred to as preferential sampling. Through model comparison we illuminate the role of the point pattern data in the prediction of selling price. Using two different years of property sales from Zaragoza, Spain, we employ both the full database as well as an intentionally biased subset to elaborate this story.File | Dimensione | Formato | |
---|---|---|---|
PGBGP_prefsamp_rev2.pdf
Open Access dal 01/02/2020
Tipologia file ?:
Postprint (versione finale dell’autore successiva alla peer-review)
Licenza:
Creative commons
Dimensione
634.38 kB
Formato
Adobe PDF
|
634.38 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.