Sampling bias in estimation of distribution algorithms for genetic programming using prototype trees

Kangil Kim*, Bob McKay, Dharani Punithan

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Citations (Scopus)

Abstract

Probabilistic models are widely used in evolutionary and related algorithms. In Genetic Programming (GP), the Probabilistic Prototype Tree (PPT) is often used as a model representation. Drift due to sampling bias is a widely recognised problem, and may be serious, particularly in dependent probability models. While this has been closely studied in independent probability models, and more recently in probabilistic dependency models, it has received little attention in systems with strict dependence between probabilistic variables such as arise in PPT representation. Here, we investigate this issue, and present results suggesting that the drift effect in such models may be particularly severe - so severe as to cast doubt on their scalability. We present a preliminary analysis through a factor representation of the joint probability distribution. We suggest future directions for research aiming to overcome this problem.

Original languageEnglish
Title of host publicationPRICAI 2010
Subtitle of host publicationTrends in Artificial Intelligence - 11th Pacific Rim International Conference on Artificial Intelligence, Proceedings
Pages100-111
Number of pages12
DOIs
Publication statusPublished - 2010
Externally publishedYes
Event11th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2010 - Daegu, Korea, Republic of
Duration: 30 Aug 20102 Sept 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6230 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2010
Country/TerritoryKorea, Republic of
CityDaegu
Period30/08/102/09/10

Fingerprint

Dive into the research topics of 'Sampling bias in estimation of distribution algorithms for genetic programming using prototype trees'. Together they form a unique fingerprint.

Cite this