TY - JOUR
T1 - Machine learning algorithms translate big data into predictive breeding accuracy
AU - Crossa, José
AU - Montesinos-Lopez, Osval A.
AU - Costa-Neto, Germano
AU - Vitale, Paolo
AU - Martini, Johannes W.R.
AU - Runcie, Daniel
AU - Fritsche-Neto, Roberto
AU - Montesinos-Lopez, Abelardo
AU - Pérez-Rodríguez, Paulino
AU - Gerard, Guillermo
AU - Dreisigacker, Susanna
AU - Crespo-Herrera, Leonardo
AU - Pierre, Carolina Saint
AU - Lillemo, Morten
AU - Cuevas, Jaime
AU - Bentley, Alison
AU - Ortiz, Rodomiro
N1 - Publisher Copyright:
© 2024 The Authors
PY - 2024
Y1 - 2024
N2 - Statistical machine learning (ML) extracts patterns from extensive genomic, phenotypic, and environmental data. ML algorithms automatically identify relevant features and use cross-validation to ensure robust models and improve prediction reliability in new lines. Furthermore, ML analyses of genotype-by-environment (G×E) interactions can offer insights into the genetic factors that affect performance in specific environments. By leveraging historical breeding data, ML streamlines strategies and automates analyses to reveal genomic patterns. In this review we examine the transformative impact of big data, including multi-trait genomics, phenomics, and environmental covariables, on genomic-enabled prediction in plant breeding. We discuss how big data and ML are revolutionizing the field by enhancing prediction accuracy, deepening our understanding of G×E interactions, and optimizing breeding strategies through the analysis of extensive and diverse datasets.
AB - Statistical machine learning (ML) extracts patterns from extensive genomic, phenotypic, and environmental data. ML algorithms automatically identify relevant features and use cross-validation to ensure robust models and improve prediction reliability in new lines. Furthermore, ML analyses of genotype-by-environment (G×E) interactions can offer insights into the genetic factors that affect performance in specific environments. By leveraging historical breeding data, ML streamlines strategies and automates analyses to reveal genomic patterns. In this review we examine the transformative impact of big data, including multi-trait genomics, phenomics, and environmental covariables, on genomic-enabled prediction in plant breeding. We discuss how big data and ML are revolutionizing the field by enhancing prediction accuracy, deepening our understanding of G×E interactions, and optimizing breeding strategies through the analysis of extensive and diverse datasets.
KW - big genomics
KW - climate change
KW - environmental data
KW - genomic prediction
KW - modern breeding programs
KW - phenomics
KW - statistical machine learning
UR - http://www.scopus.com/inward/record.url?scp=85207195710&partnerID=8YFLogxK
U2 - 10.1016/j.tplants.2024.09.011
DO - 10.1016/j.tplants.2024.09.011
M3 - Review article
AN - SCOPUS:85207195710
SN - 1360-1385
JO - Trends in Plant Science
JF - Trends in Plant Science
ER -