Implicit bias and recursive grammar structures in estimation of distribution genetic programming

Kangil Kim*, Hoai Nguyen Xuan, Bob McKay

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Much recent research in Estimation of Distribution Algorithms (EDA) applied to Genetic Programming has adopted a Stochastic Context Free Grammar(SCFG)-based model formalism. However these methods generate biases which may be indistinguishable from selection bias, resulting in sub-optimal performance. The primary factor generating this bias is the combined effect of recursion in the grammars and depth limitation removing some sample trees from the distribution. Here, we demonstrate the bias and provide exact estimates of its scale (assuming infinite populations and simple recursions). We define a quantity h which determines both whether bias occurs (h > 1) and its scale. We apply this analysis to a number of simple illustrative grammars, and to a range of practically-used GP grammars, showing that this bias is both real and important.

    Original languageEnglish
    Title of host publication2012 IEEE Congress on Evolutionary Computation, CEC 2012
    DOIs
    Publication statusPublished - 2012
    Event2012 IEEE Congress on Evolutionary Computation, CEC 2012 - Brisbane, QLD, Australia
    Duration: 10 Jun 201215 Jun 2012

    Publication series

    Name2012 IEEE Congress on Evolutionary Computation, CEC 2012

    Conference

    Conference2012 IEEE Congress on Evolutionary Computation, CEC 2012
    Country/TerritoryAustralia
    CityBrisbane, QLD
    Period10/06/1215/06/12

    Fingerprint

    Dive into the research topics of 'Implicit bias and recursive grammar structures in estimation of distribution genetic programming'. Together they form a unique fingerprint.

    Cite this