TY - GEN
T1 - CNN-based small object detection and visualization with feature activation mapping
AU - Menikdiwela, Medhani
AU - Nguyen, Chuong
AU - Li, Hongdong
AU - Shaw, Marnie
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/7/2
Y1 - 2017/7/2
N2 - Object detection is a well-studied topic, however detection of small objects still lacks attention. Detecting small objects has been difficult due to small sizes, occlusion and complex backgrounds. Small objects detection is important in a number of applications including detection of small insects. One application is spider detection and removal. Spiders are frequently found on grapes and broccolis sold at supermarkets and this poses a significant safety issue and generates negative publicity for the industry. In this paper, we present a fine-tuned VGG16 network for detection of small objects such as spiders. Furthermore, we introduce a simple technique called 'feature activation mapping' for object visualization from VGG16 feature maps. The testing accuracy of our network on tiny spiders with various backgrounds is 84%, as compared to 72% using fined-tuned Faster R-CNN and 95.32% using CAM. Even though our feature activation mapping technique has a mid-range of test accuracy, it provides more detailed shape and size of spiders than using CAM which is important for the application area. A data set for spider detection is made available online.
AB - Object detection is a well-studied topic, however detection of small objects still lacks attention. Detecting small objects has been difficult due to small sizes, occlusion and complex backgrounds. Small objects detection is important in a number of applications including detection of small insects. One application is spider detection and removal. Spiders are frequently found on grapes and broccolis sold at supermarkets and this poses a significant safety issue and generates negative publicity for the industry. In this paper, we present a fine-tuned VGG16 network for detection of small objects such as spiders. Furthermore, we introduce a simple technique called 'feature activation mapping' for object visualization from VGG16 feature maps. The testing accuracy of our network on tiny spiders with various backgrounds is 84%, as compared to 72% using fined-tuned Faster R-CNN and 95.32% using CAM. Even though our feature activation mapping technique has a mid-range of test accuracy, it provides more detailed shape and size of spiders than using CAM which is important for the application area. A data set for spider detection is made available online.
KW - CNN
KW - R-CNN
KW - feature activation map
KW - heat map
KW - object detection
UR - http://www.scopus.com/inward/record.url?scp=85050017387&partnerID=8YFLogxK
U2 - 10.1109/IVCNZ.2017.8402455
DO - 10.1109/IVCNZ.2017.8402455
M3 - Conference contribution
T3 - International Conference Image and Vision Computing New Zealand
SP - 1
EP - 5
BT - 2017 International Conference on Image and Vision Computing New Zealand, IVCNZ 2017
PB - IEEE Computer Society
T2 - 2017 International Conference on Image and Vision Computing New Zealand, IVCNZ 2017
Y2 - 4 December 2017 through 6 December 2017
ER -