Maximum likelihood estimation for outcome-dependent samples

Robert Graham Clark*

*Corresponding author for this work

    Research output: Contribution to journalArticlepeer-review

    2 Citations (Scopus)

    Abstract

    In outcome-dependent sampling, the continuous or binary outcome variable in a regression model is available in advance to guide selection of a sample on which explanatory variables are then measured. Selection probabilities may either be a smooth function of the outcome variable or be based on a stratification of the outcome. In many cases, only data from the final sample is accessible to the analyst. A maximum likelihood approach for this data configuration is developed here for the first time. The likelihood for fully general outcome-dependent designs is stated, then the special case of Poisson sampling is examined in more detail. The maximum likelihood estimator differs from the well-known maximum sample likelihood estimator, and an information bound result shows that the former is asymptotically more efficient. A simulation study suggests that the efficiency difference is generally small. Maximum sample likelihood estimation is therefore recommended in practice when only sample data is available. Some new smooth sample designs show considerable promise.

    Original languageEnglish
    Pages (from-to)49-70
    Number of pages22
    JournalAustralian and New Zealand Journal of Statistics
    Volume62
    Issue number1
    DOIs
    Publication statusPublished - 1 Mar 2020

    Fingerprint

    Dive into the research topics of 'Maximum likelihood estimation for outcome-dependent samples'. Together they form a unique fingerprint.

    Cite this