TY - JOUR
T1 - Deep learning AI for corrosion detection
AU - Nash, Will
AU - Drummond, Tom
AU - Birbilis, Nick
N1 - Publisher Copyright:
© 2019 by NACE International.
PY - 2019
Y1 - 2019
N2 - Visual inspection is a vital component of asset management that stands to benefit from automation. Using artificial intelligence to assist inspections can increase safety, reduce access costs, provide objective classification, and integrate with digital asset management systems. The work presented herein investigates the impact of dataset size on Deep Learning for automatic detection of corrosion on steel assets. Dataset creation is typically one of the first steps when applying Machine Learning methods to a new task; and the real-world performance of models hinges on the quality and quantity of data available. Producing an image dataset for semantic segmentation is resource intensive, particularly for specialist subjects where class segmentation is not able to be effectively farmed out. The benefit of producing a large, but poorly labelled, dataset versus a small, expertly segmented dataset for semantic segmentation is an open question. Here we show that a large, noisy dataset outperforms a small, expertly segmented dataset for training a Fully Convolutional Network model for semantic segmentation of corrosion in images. A large dataset of 250 images with segmentations labelled by undergraduates and a second dataset of just 10 images, with segmentations labelled by subject matter experts were produced. The mean Intersection over Union and micro F-score metrics were compared after training for 50,000 epochs. The relationship between dataset size and F-score was investigated to estimate the requirements to achieve human level accuracy. This work is illustrative for researchers setting out to develop deep learning models for detection and location of specialist features.
AB - Visual inspection is a vital component of asset management that stands to benefit from automation. Using artificial intelligence to assist inspections can increase safety, reduce access costs, provide objective classification, and integrate with digital asset management systems. The work presented herein investigates the impact of dataset size on Deep Learning for automatic detection of corrosion on steel assets. Dataset creation is typically one of the first steps when applying Machine Learning methods to a new task; and the real-world performance of models hinges on the quality and quantity of data available. Producing an image dataset for semantic segmentation is resource intensive, particularly for specialist subjects where class segmentation is not able to be effectively farmed out. The benefit of producing a large, but poorly labelled, dataset versus a small, expertly segmented dataset for semantic segmentation is an open question. Here we show that a large, noisy dataset outperforms a small, expertly segmented dataset for training a Fully Convolutional Network model for semantic segmentation of corrosion in images. A large dataset of 250 images with segmentations labelled by undergraduates and a second dataset of just 10 images, with segmentations labelled by subject matter experts were produced. The mean Intersection over Union and micro F-score metrics were compared after training for 50,000 epochs. The relationship between dataset size and F-score was investigated to estimate the requirements to achieve human level accuracy. This work is illustrative for researchers setting out to develop deep learning models for detection and location of specialist features.
KW - Corrosion
KW - Datasets
KW - Fully Convolutional Network
KW - Machine Learning
KW - Semantic Segmentation
UR - http://www.scopus.com/inward/record.url?scp=85070075966&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85070075966
SN - 0361-4409
VL - 2019-March
JO - NACE - International Corrosion Conference Series
JF - NACE - International Corrosion Conference Series
M1 - 13267
T2 - Corrosion Conference and Expo 2019
Y2 - 24 March 2019 through 28 March 2019
ER -