Sensitivity analysis of a decision tree classification to input data errors using a general Monte Carlo error sensitivity model

Zhi Huang*, Shawn W. Laffan

*Corresponding author for this work

    Research output: Contribution to journalArticlepeer-review

    8 Citations (Scopus)

    Abstract

    We analysed the sensitivity of a decision tree derived forest type mapping to simulated data errors in input digital elevation model (DEM), geology and remotely sensed (Landsat Thematic Mapper) variables. We used a stochastic Monte Carlo simulation model coupled with a one-at-a-time approach. The DEM error was assumed to be spatially autocorrelated with its magnitude being a percentage of the elevation value. The error of categorical geology data was assumed to be positional and limited to boundary areas. The Landsat data error was assumed to be spatially random following a Gaussian distribution. Each layer was perturbed using its error model with increasing levels of error, and the effect on the forest type mapping was assessed. The results of the three sensitivity analyses were markedly different, with the classification being most sensitive to the DEM error, than to the Landsat data errors, but with only a limited sensitivity to the geology data error used. A linear increase in error resulted in non-linear increases in effect for the DEM and Landsat errors, while it was linear for geology. As an example, a DEM error of as small as ¡2% reduced the overall test accuracy by more than 2%. More importantly, the same uncertainty level has caused nearly 10% of the study area to change its initial class assignment at each perturbation, on average. A spatial assessment of the sensitivities indicates that most of the pixel changes occurred within those forest classes expected to be more sensitive to data error. In addition to characterising the effect of errors on forest type mapping using decision trees, this study has demonstrated the generality of employing Monte Carlo analysis for the sensitivity and uncertainty analysis of categorical outputs that have distinctive characteristics from that of numerical outputs.

    Original languageEnglish
    Pages (from-to)1433-1452
    Number of pages20
    JournalInternational Journal of Geographical Information Science
    Volume23
    Issue number11
    DOIs
    Publication statusPublished - Nov 2009

    Fingerprint

    Dive into the research topics of 'Sensitivity analysis of a decision tree classification to input data errors using a general Monte Carlo error sensitivity model'. Together they form a unique fingerprint.

    Cite this