TY - JOUR
T1 - Classification trees for poverty mapping
AU - Bilton, Penny
AU - Jones, Geoff
AU - Ganesh, Siva
AU - Haslett, Steve
N1 - Publisher Copyright:
© 2017 Elsevier B.V.
PY - 2017/11
Y1 - 2017/11
N2 - Poverty mapping uses small area estimation techniques to estimate levels of deprivation (poverty, undernutrition) across small geographic domains within a country. These estimates are then displayed on a poverty map, and used by aid organizations such as the United Nations World Food Programme for the efficient allocation of aid. Current methodology employs unit-level regression modelling of a target variable (household income, child weight-for-age). An alternative modelling technique is proposed, using tree-based methods, that has some practical advantages. Alternative ways of amalgamating the unit-level predictions from classification trees to small area level are explored, adapting the trees to account for the survey design, and resampling strategies are proposed for producing standard errors. The methodology is evaluated using both real data and simulations based on a poverty mapping study in Nepal. The simulations suggest that amalgamation of posterior probabilities from the tree gives approximately unbiased estimates, and standard errors can be calculated using a cluster bootstrap approach with cluster effects included in the predictions. Small area estimates of poverty incidence for a region in Nepal, generated using the proposed tree based method, are comparable to the published results obtained by the standard method.
AB - Poverty mapping uses small area estimation techniques to estimate levels of deprivation (poverty, undernutrition) across small geographic domains within a country. These estimates are then displayed on a poverty map, and used by aid organizations such as the United Nations World Food Programme for the efficient allocation of aid. Current methodology employs unit-level regression modelling of a target variable (household income, child weight-for-age). An alternative modelling technique is proposed, using tree-based methods, that has some practical advantages. Alternative ways of amalgamating the unit-level predictions from classification trees to small area level are explored, adapting the trees to account for the survey design, and resampling strategies are proposed for producing standard errors. The methodology is evaluated using both real data and simulations based on a poverty mapping study in Nepal. The simulations suggest that amalgamation of posterior probabilities from the tree gives approximately unbiased estimates, and standard errors can be calculated using a cluster bootstrap approach with cluster effects included in the predictions. Small area estimates of poverty incidence for a region in Nepal, generated using the proposed tree based method, are comparable to the published results obtained by the standard method.
KW - Clustered data
KW - Complex survey data
KW - Small area estimation
KW - Sustainable Development Goals
UR - http://www.scopus.com/inward/record.url?scp=85020919436&partnerID=8YFLogxK
U2 - 10.1016/j.csda.2017.05.009
DO - 10.1016/j.csda.2017.05.009
M3 - Article
SN - 0167-9473
VL - 115
SP - 53
EP - 66
JO - Computational Statistics and Data Analysis
JF - Computational Statistics and Data Analysis
ER -