TY - JOUR
T1 - Block-entropy analysis of climate data
AU - Larson, J. Walter
AU - Briggs, Peter R.
AU - Tobis, Michael
PY - 2011
Y1 - 2011
N2 - We explore the use of block entropy as a dynamics classifier for meteorological timeseries data. The block entropy estimates define the entropy growth curve H(L) with respect to block length L. For a finitary process, the entropy growth curve tends to an asymptotic linear regime H(L) = E + h μL, with entropy rate hμ and excess entropy E. These quantities apportion the system's information content into'memory' (E) and'randomness' (hμ). We discuss the challenges inherent in analyzing weather data using symbolic techniques, identifying the pitfalls associated with alphabet size, finite sample timeseries length, and stationarity. We apply the block entropy-based techniques in the form of a wet/dry partition to Australian daily precipitation data from the Patched Point Dataset station record collection and version 3 of the Australian Water Availability Project analysis dataset. Preliminary results demonstrate h μ and E are viable climatological classifiers for precipitation, with station records from similar climatic regimes possessing similar values of hμ and E. The resultant clustering reflects expected characteristics of local climatic memory and randomness. The AWAP results show weaker clustering than their PPD counterparts, with different E- and h μ-values reflecting respectively the relative biases and truncation errors in the AWAP analysis system. The entropy rates of convergence analysis rules out finite order Markov processes for orders falling within the range of block sizes considered.
AB - We explore the use of block entropy as a dynamics classifier for meteorological timeseries data. The block entropy estimates define the entropy growth curve H(L) with respect to block length L. For a finitary process, the entropy growth curve tends to an asymptotic linear regime H(L) = E + h μL, with entropy rate hμ and excess entropy E. These quantities apportion the system's information content into'memory' (E) and'randomness' (hμ). We discuss the challenges inherent in analyzing weather data using symbolic techniques, identifying the pitfalls associated with alphabet size, finite sample timeseries length, and stationarity. We apply the block entropy-based techniques in the form of a wet/dry partition to Australian daily precipitation data from the Patched Point Dataset station record collection and version 3 of the Australian Water Availability Project analysis dataset. Preliminary results demonstrate h μ and E are viable climatological classifiers for precipitation, with station records from similar climatic regimes possessing similar values of hμ and E. The resultant clustering reflects expected characteristics of local climatic memory and randomness. The AWAP results show weaker clustering than their PPD counterparts, with different E- and h μ-values reflecting respectively the relative biases and truncation errors in the AWAP analysis system. The entropy rates of convergence analysis rules out finite order Markov processes for orders falling within the range of block sizes considered.
KW - Climate predictibility
KW - Model-data evaluation
KW - Symbolic dynamics
KW - Timeseries analysis
UR - http://www.scopus.com/inward/record.url?scp=79958274172&partnerID=8YFLogxK
U2 - 10.1016/j.procs.2011.04.172
DO - 10.1016/j.procs.2011.04.172
M3 - Conference article
AN - SCOPUS:79958274172
SN - 1877-0509
VL - 4
SP - 1592
EP - 1601
JO - Procedia Computer Science
JF - Procedia Computer Science
T2 - 11th International Conference on Computational Science, ICCS 2011
Y2 - 1 June 2011 through 3 June 2011
ER -