TY - JOUR
T1 - Use of noise to augment training data
T2 - A neural network method of mineral-potential mapping in regions of limited known deposit examples
AU - Brown, Warick M.
AU - Gedeon, Tamás D.
AU - Groves, David I.
PY - 2003
Y1 - 2003
N2 - One of the main factors that affects the performance of MLP neural networks trained using the backpropagation algorithm in mineral-potential mapping is the paucity of deposit relative to barren training patterns. To overcome this problem, random noise is added to the original training patterns in order to create additional synthetic deposit training data. Experiments on the effect of the number of deposits available for training in the Kalgoorlie Terrane orogenic gold province show that both the classification performance of a trained network and the quality of the resultant prospectivity map increase significantly with increased numbers of deposit patterns. Experiments are conducted to determine the optimum amount of noise using both uniform and normally distributed random noise. Through the addition of noise to the original deposit training data, the number of deposit training patterns is increased from approximately 50 to 1000. The percentage of correct classifications significantly improves for the independent test set as well as for deposit patterns in the test set. For example, using ≥40% uniform random noise, the test-set classification performance increases from 67.9% and 68.0% to 72.8% and 77.1% (for test-set overall and test-set deposit patterns, respectively). Indices for the quality of the resultant prospectivity map (i.e., D/A, D≥(D/A), where D is the percentage of deposits and A is the percentage of the total area for the highest prospectivity map-class, and area under an ROC curve) also increase from 8.2, 105, 0.79 to 17.9, 226, 0.87, respectively. Increasing the size of the training-stop data set results in a further increase in classification performance to 73.5%, 77.4%, 14.7, 296, 0.87 for test-set overall and test-set deposit patterns, D/A, D≥(D/A), and area under the ROC curve, respectively.
AB - One of the main factors that affects the performance of MLP neural networks trained using the backpropagation algorithm in mineral-potential mapping is the paucity of deposit relative to barren training patterns. To overcome this problem, random noise is added to the original training patterns in order to create additional synthetic deposit training data. Experiments on the effect of the number of deposits available for training in the Kalgoorlie Terrane orogenic gold province show that both the classification performance of a trained network and the quality of the resultant prospectivity map increase significantly with increased numbers of deposit patterns. Experiments are conducted to determine the optimum amount of noise using both uniform and normally distributed random noise. Through the addition of noise to the original deposit training data, the number of deposit training patterns is increased from approximately 50 to 1000. The percentage of correct classifications significantly improves for the independent test set as well as for deposit patterns in the test set. For example, using ≥40% uniform random noise, the test-set classification performance increases from 67.9% and 68.0% to 72.8% and 77.1% (for test-set overall and test-set deposit patterns, respectively). Indices for the quality of the resultant prospectivity map (i.e., D/A, D≥(D/A), where D is the percentage of deposits and A is the percentage of the total area for the highest prospectivity map-class, and area under an ROC curve) also increase from 8.2, 105, 0.79 to 17.9, 226, 0.87, respectively. Increasing the size of the training-stop data set results in a further increase in classification performance to 73.5%, 77.4%, 14.7, 296, 0.87 for test-set overall and test-set deposit patterns, D/A, D≥(D/A), and area under the ROC curve, respectively.
KW - Archean orogenic gold deposits
KW - Geographic information systems (GIS)
KW - Mineral prospectivity maps
KW - Multilayer perceptrons (MLP)
KW - Neural networks
KW - Random noise
UR - http://www.scopus.com/inward/record.url?scp=23944483793&partnerID=8YFLogxK
U2 - 10.1023/A:1024218913435
DO - 10.1023/A:1024218913435
M3 - Article
SN - 1520-7439
VL - 12
SP - 141
EP - 152
JO - Natural Resources Research
JF - Natural Resources Research
IS - 2
ER -