Using big data to predict pertussis infections in Jinan city, China: a time series analysis

Yuzhou Zhang, Hilary Bambrick, Kerrie Mengersen, Shilu Tong, Lei Feng, Li Zhang, Guifang Liu, Aiqiang Xu, Wenbiao Hu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

21 Citations (Scopus)

Abstract

This study aims to use big data (climate data, internet query data and school calendar patterns (SCP)) to improve pertussis surveillance and prediction, and develop an early warning model for pertussis epidemics. We collected weekly pertussis notifications, SCP, climate and internet search query data (Baidu index (BI)) in Jinan, China between 2013 and 2017. Time series decomposition and temporal risk assessment were used for examining the epidemic features in pertussis infections. A seasonal autoregressive integrated moving average (SARIMA) model and regression tree model were developed to predict pertussis occurrence using identified predictors. Our study demonstrates clear seasonal patterns in pertussis epidemics, and pertussis activity was most significantly associated with BI at 2-week lag (rBI = 0.73, p < 0.05), temperature at 1-week lag (rtemp = 0.19, p < 0.05) and rainfall at 2-week lag (rrainfall = 0.27, p < 0.05). No obvious relationship between pertussis peaks and school attendance was found in the study. Pertussis cases were more likely to be temporally concentrated throughout the epidemics during the study period. SARIMA models with 2-week-lagged BI and 1-week-lagged temperature had better predictive performance (βsearch query = 0.06, p = 0.02; βtemp = 0.16, p = 0.03) with large correlation coefficients (r = 0.67, p < 0.01) and low root mean squared error (RMSE) value (r = 3.59). The regression tree model identified threshold values of potential predictors (search query, climate and SCP) for pertussis epidemics. Our results showed that internet query in conjunction with social and climatic data can predict pertussis epidemics, which is a foundation of using such data to develop early warning systems.

Original languageEnglish
Pages (from-to)95-104
Number of pages10
JournalInternational Journal of Biometeorology
Volume64
Issue number1
DOIs
Publication statusPublished - 1 Jan 2020
Externally publishedYes

Fingerprint

Dive into the research topics of 'Using big data to predict pertussis infections in Jinan city, China: a time series analysis'. Together they form a unique fingerprint.

Cite this