TY - GEN
T1 - Parametric Learning of Texture Filters by Stacked Fisher Autoencoders
AU - Shahriari, Arash
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/12/22
Y1 - 2016/12/22
N2 - Deep learning has recently contributed significantly to large-scale recognition of several modalities like image, video and speech. Stacked autoencoders are a family of powerful convolutional neural nets to build scalable generative models for automatic feature learning. In this paper, we propose a network of novel overcomplete autoencoders called Fisher autoencoders. In contrast to convolutional autoencoders which learn some latent representations, we train a set of projections for the model variables using banks of filters. The Fisher autoencoders are independently computed in stacks of variable depth based on the complexity of patterns under study and the ability of each individual filter to extract deep features. We select texture understanding as one of the most difficult tasks in pattern recognition and conduct our experiments in a standard platform to assure fair comparisons with other methods. Our results show considerable improvements over the most recent benchmarks on several texture datasets for our Fisher autoencoders evaluated against improved Fisher vectors on Dense SIFT (DSIFT) and DeCAF-VGG deep local descriptors.
AB - Deep learning has recently contributed significantly to large-scale recognition of several modalities like image, video and speech. Stacked autoencoders are a family of powerful convolutional neural nets to build scalable generative models for automatic feature learning. In this paper, we propose a network of novel overcomplete autoencoders called Fisher autoencoders. In contrast to convolutional autoencoders which learn some latent representations, we train a set of projections for the model variables using banks of filters. The Fisher autoencoders are independently computed in stacks of variable depth based on the complexity of patterns under study and the ability of each individual filter to extract deep features. We select texture understanding as one of the most difficult tasks in pattern recognition and conduct our experiments in a standard platform to assure fair comparisons with other methods. Our results show considerable improvements over the most recent benchmarks on several texture datasets for our Fisher autoencoders evaluated against improved Fisher vectors on Dense SIFT (DSIFT) and DeCAF-VGG deep local descriptors.
UR - http://www.scopus.com/inward/record.url?scp=85011079601&partnerID=8YFLogxK
U2 - 10.1109/DICTA.2016.7797072
DO - 10.1109/DICTA.2016.7797072
M3 - Conference contribution
T3 - 2016 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2016
BT - 2016 International Conference on Digital Image Computing
A2 - Liew, Alan Wee-Chung
A2 - Zhou, Jun
A2 - Gao, Yongsheng
A2 - Wang, Zhiyong
A2 - Fookes, Clinton
A2 - Lovell, Brian
A2 - Blumenstein, Michael
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2016 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2016
Y2 - 30 November 2016 through 2 December 2016
ER -