TY - GEN
T1 - Knowledge-based dynamic systems modeling
T2 - 37th IEEE International Conference on Data Engineering, ICDE 2021
AU - Park, Namyong
AU - Kim, Minhyeok
AU - Hoai, Nguyen Xuan
AU - Bob McKay, R. I.
AU - Kim, Dong Kyun
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/4
Y1 - 2021/4
N2 - Modeling real-world phenomena is a focus of many science and engineering efforts, from ecological modeling to financial forecasting. Building an accurate model for complex and dynamic systems improves understanding of underlying processes and leads to resource efficiency. Knowledge-driven modeling builds a model based on human expertise, yet is often suboptimal. At the opposite extreme, data-driven modeling learns a model directly from data, requiring extensive data and potentially generating overfitting. We focus on an intermediate approach, model revision, in which prior knowledge and data are combined to achieve the best of both worlds. We propose a genetic model revision framework based on tree-adjoining grammar (TAG) guided genetic programming (GP), using the TAG formalism and GP operators in an effective mechanism making data-driven revisions while incorporating prior knowledge. Our framework is designed to address the high computational cost of evolutionary modeling of complex systems. Via a case study on the challenging problem of river water quality modeling, we show that the framework efficiently learns an interpretable model, with higher modeling accuracy than existing methods.
AB - Modeling real-world phenomena is a focus of many science and engineering efforts, from ecological modeling to financial forecasting. Building an accurate model for complex and dynamic systems improves understanding of underlying processes and leads to resource efficiency. Knowledge-driven modeling builds a model based on human expertise, yet is often suboptimal. At the opposite extreme, data-driven modeling learns a model directly from data, requiring extensive data and potentially generating overfitting. We focus on an intermediate approach, model revision, in which prior knowledge and data are combined to achieve the best of both worlds. We propose a genetic model revision framework based on tree-adjoining grammar (TAG) guided genetic programming (GP), using the TAG formalism and GP operators in an effective mechanism making data-driven revisions while incorporating prior knowledge. Our framework is designed to address the high computational cost of evolutionary modeling of complex systems. Via a case study on the challenging problem of river water quality modeling, we show that the framework efficiently learns an interpretable model, with higher modeling accuracy than existing methods.
KW - Dynamic system modeling
KW - Evolutionary algorithm
KW - Prior knowledge in-corporation
KW - River water quality modeling
UR - http://www.scopus.com/inward/record.url?scp=85112864176&partnerID=8YFLogxK
U2 - 10.1109/ICDE51399.2021.00229
DO - 10.1109/ICDE51399.2021.00229
M3 - Conference contribution
AN - SCOPUS:85112864176
T3 - Proceedings - International Conference on Data Engineering
SP - 2231
EP - 2236
BT - Proceedings - 2021 IEEE 37th International Conference on Data Engineering, ICDE 2021
PB - IEEE Computer Society
Y2 - 19 April 2021 through 22 April 2021
ER -