Semantically-based crossover in genetic programming: Application to real-valued symbolic regression

Nguyen Quang Uy, Nguyen Xuan Hoai, Michael O'Neill, R. I. McKay, Edgar Galván-López

Research output: Contribution to journalArticlepeer-review

217 Citations (Scopus)

Abstract

We investigate the effects of semantically-based crossover operators in genetic programming, applied to real-valued symbolic regression problems. We propose two new relations derived from the semantic distance between subtrees, known as semantic equivalence and semantic similarity. These relations are used to guide variants of the crossover operator, resulting in two new crossover operators-semantics aware crossover (SAC) and semantic similarity-based crossover (SSC). SAC, was introduced and previously studied, is added here for the purpose of comparison and analysis. SSC extends SAC by more closely controlling the semantic distance between subtrees to which crossover may be applied. The new operators were tested on some real-valued symbolic regression problems and compared with standard crossover (SC), context aware crossover (CAC), Soft Brood Selection (SBS), and No Same Mate (NSM) selection. The experimental results show on the problems examined that, with computational effort measured by the number of function node evaluations, only SSC and SBS were significantly better than SC, and SSC was often better than SBS. Further experiments were also conducted to analyse the perfomance sensitivity to the parameter settings for SSC. This analysis leads to a conclusion that SSC is more constructive and has higher locality than SAC, NSM and SC; we believe these are the main reasons for the improved performance of SSC.

Original languageEnglish
Pages (from-to)91-119
Number of pages29
JournalGenetic Programming and Evolvable Machines
Volume12
Issue number2
DOIs
Publication statusPublished - Jun 2011
Externally publishedYes

Fingerprint

Dive into the research topics of 'Semantically-based crossover in genetic programming: Application to real-valued symbolic regression'. Together they form a unique fingerprint.

Cite this