TY - JOUR
T1 - Learning Saliency from Single Noisy Labelling
T2 - A Robust Model Fitting Perspective
AU - Zhang, Jing
AU - Dai, Yuchao
AU - Zhang, Tong
AU - Harandi, Mehrtash
AU - Barnes, Nick
AU - Hartley, Richard
N1 - Publisher Copyright:
© 1979-2012 IEEE.
PY - 2021/8/1
Y1 - 2021/8/1
N2 - The advances made in predicting visual saliency using deep neural networks come at the expense of collecting large-scale annotated data. However, pixel-wise annotation is labor-intensive and overwhelming. In this paper, we propose to learn saliency prediction from a single noisy labelling, which is easy to obtain (e.g., from imperfect human annotation or from unsupervised saliency prediction methods). With this goal, we address a natural question: Can we learn saliency prediction while identifying clean labels in a unified framework? To answer this question, we call on the theory of robust model fitting and formulate deep saliency prediction from a single noisy labelling as robust network learning and exploit model consistency across iterations to identify inliers and outliers (i.e., noisy labels). Extensive experiments on different benchmark datasets demonstrate the superiority of our proposed framework, which can learn comparable saliency prediction with state-of-the-art fully supervised saliency methods. Furthermore, we show that simply by treating ground truth annotations as noisy labelling, our framework achieves tangible improvements over state-of-the-art methods.
AB - The advances made in predicting visual saliency using deep neural networks come at the expense of collecting large-scale annotated data. However, pixel-wise annotation is labor-intensive and overwhelming. In this paper, we propose to learn saliency prediction from a single noisy labelling, which is easy to obtain (e.g., from imperfect human annotation or from unsupervised saliency prediction methods). With this goal, we address a natural question: Can we learn saliency prediction while identifying clean labels in a unified framework? To answer this question, we call on the theory of robust model fitting and formulate deep saliency prediction from a single noisy labelling as robust network learning and exploit model consistency across iterations to identify inliers and outliers (i.e., noisy labels). Extensive experiments on different benchmark datasets demonstrate the superiority of our proposed framework, which can learn comparable saliency prediction with state-of-the-art fully supervised saliency methods. Furthermore, we show that simply by treating ground truth annotations as noisy labelling, our framework achieves tangible improvements over state-of-the-art methods.
KW - Salinecy prediction
KW - robust model fitting
KW - single noisy labelling
UR - http://www.scopus.com/inward/record.url?scp=85098785216&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2020.3046486
DO - 10.1109/TPAMI.2020.3046486
M3 - Article
SN - 0162-8828
VL - 43
SP - 2866
EP - 2873
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 8
M1 - 9303417
ER -