TY - GEN
T1 - Universal compression of piecewise i.i.d. sources
AU - Vellambi, Badri
AU - Cameron, Owen
AU - Hutter, Marcus
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/7/19
Y1 - 2018/7/19
N2 - We study the problem of compressing piecewise i.i.d. sources, which models the practical application of jointly compressing multiple disparate data files. We establish that universal compression of piecewise i.i.d data is possible by modeling the data as a Markov process whose memory grows suitably with the size of the data using the Krichevsky-Trofimov (KT) estimator. The memory order is chosen large enough so that successful learning of the distribution of the each piece of the data from the corresponding contexts is possible for almost any realization of any piecewise i.i.d. data process. This is, a priori, a surprising result given that we are employing a stationary model to asymptotically optimally (model and) compress non-stationary data.
AB - We study the problem of compressing piecewise i.i.d. sources, which models the practical application of jointly compressing multiple disparate data files. We establish that universal compression of piecewise i.i.d data is possible by modeling the data as a Markov process whose memory grows suitably with the size of the data using the Krichevsky-Trofimov (KT) estimator. The memory order is chosen large enough so that successful learning of the distribution of the each piece of the data from the corresponding contexts is possible for almost any realization of any piecewise i.i.d. data process. This is, a priori, a surprising result given that we are employing a stationary model to asymptotically optimally (model and) compress non-stationary data.
KW - KT estimator
KW - Markov sources
KW - Non stationary source
KW - Piecewise iid sources
KW - Universal compression
UR - http://www.scopus.com/inward/record.url?scp=85050994191&partnerID=8YFLogxK
U2 - 10.1109/DCC.2018.00035
DO - 10.1109/DCC.2018.00035
M3 - Conference contribution
T3 - Data Compression Conference Proceedings
SP - 267
EP - 276
BT - Proceedings - DCC 2018
A2 - Bilgin, Ali
A2 - Storer, James A.
A2 - Serra-Sagrista, Joan
A2 - Marcellin, Michael W.
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2018 Data Compression Conference, DCC 2018
Y2 - 27 March 2018 through 30 March 2018
ER -