TY - JOUR
T1 - Species identification using high resolution melting (HRM) analysis with random forest classification
AU - Bowman, Sorelle
AU - McNevin, Dennis
AU - Venables, Samantha J.
AU - Roffey, Paul
AU - Richardson, Alice
AU - Gahan, Michelle E.
N1 - Publisher Copyright:
© 2017, © 2017 Australian Academy of Forensic Sciences.
PY - 2019/1/2
Y1 - 2019/1/2
N2 - Species identification is an important facet of forensic investigation. In this study, human and non-human species (cow, chicken, pig, sheep, cat, dog, rabbit, fox, kangaroo and wombat) were assayed on the ViiA 7 Real-Time PCR System (Thermo Fisher Scientific) to rapidly screen for their species of origin using the high resolution melt (HRM) analysis targeting the 16S rRNA gene. Classification of HRM difference profiles using the onboard ViiA 7 software resulted in a classification accuracy of <20%. Derivative profiles (temperature versus negative first derivative of fluorescence, –dF/dT) were classified using random forest algorithms supplemented by bagging and boosting, with either a randomly partitioned test set or a variety of folds of cross-classification, in addition to a range of trees and variables. Random forest classification with bagging conditions (constructed over 500 trees) was found to considerably outperform the ViiA 7 software for species differentiation with 100% classification accuracy for biological material from humans, domestic pets (cat and dog) and consumable meats (chicken and sheep) with an average classification accuracy of 70% across all species.
AB - Species identification is an important facet of forensic investigation. In this study, human and non-human species (cow, chicken, pig, sheep, cat, dog, rabbit, fox, kangaroo and wombat) were assayed on the ViiA 7 Real-Time PCR System (Thermo Fisher Scientific) to rapidly screen for their species of origin using the high resolution melt (HRM) analysis targeting the 16S rRNA gene. Classification of HRM difference profiles using the onboard ViiA 7 software resulted in a classification accuracy of <20%. Derivative profiles (temperature versus negative first derivative of fluorescence, –dF/dT) were classified using random forest algorithms supplemented by bagging and boosting, with either a randomly partitioned test set or a variety of folds of cross-classification, in addition to a range of trees and variables. Random forest classification with bagging conditions (constructed over 500 trees) was found to considerably outperform the ViiA 7 software for species differentiation with 100% classification accuracy for biological material from humans, domestic pets (cat and dog) and consumable meats (chicken and sheep) with an average classification accuracy of 70% across all species.
KW - Forensic science
KW - high resolution melt analysis
KW - predictive modelling
KW - random forest
KW - rapid screening
UR - http://www.scopus.com/inward/record.url?scp=85018669431&partnerID=8YFLogxK
U2 - 10.1080/00450618.2017.1315835
DO - 10.1080/00450618.2017.1315835
M3 - Article
SN - 0045-0618
VL - 51
SP - 57
EP - 72
JO - Australian Journal of Forensic Sciences
JF - Australian Journal of Forensic Sciences
IS - 1
ER -