TY - JOUR
T1 - Sporadic, global linkage disequilibrium between unlinked segregating sites
AU - Skelly, Daniel A.
AU - Magwene, Paul M.
AU - Stone, Eric A.
N1 - Publisher Copyright:
© 2016 by the Genetics Society of America.
PY - 2016/2
Y1 - 2016/2
N2 - Demographic, genetic, or stochastic factors can lead to perfect linkage disequilibrium (LD) between alleles at two loci without respect to the extent of their physical distance, a phenomenon that Lawrence et al. (2005a) refer to as “genetic indistin-guishability.” This phenomenon can complicate genotype–phenotype association testing by hindering the ability to localize causal alleles, but has not been thoroughly explored from a theoretical perspective or using large, dense whole-genome polymorphism data sets. We derive a simple theoretical model of the prevalence of genetic indistinguishability between unlinked loci and verify its accuracy via simulation. We show that sample size and minor allele frequency are the major determinants of the prevalence of perfect LD between unlinked loci but that demographic factors, such as deviations from random mating, can produce significant effects as well. Finally, we quantify this phenomenon in three model organisms and find thousands of pairs of moderate-frequency (> 5%) genetically indistinguishable variants in relatively large data sets. These results clarify a previously underexplored population genetic phenomenon with important implications for association studies and define conditions under which it is likely to manifest.
AB - Demographic, genetic, or stochastic factors can lead to perfect linkage disequilibrium (LD) between alleles at two loci without respect to the extent of their physical distance, a phenomenon that Lawrence et al. (2005a) refer to as “genetic indistin-guishability.” This phenomenon can complicate genotype–phenotype association testing by hindering the ability to localize causal alleles, but has not been thoroughly explored from a theoretical perspective or using large, dense whole-genome polymorphism data sets. We derive a simple theoretical model of the prevalence of genetic indistinguishability between unlinked loci and verify its accuracy via simulation. We show that sample size and minor allele frequency are the major determinants of the prevalence of perfect LD between unlinked loci but that demographic factors, such as deviations from random mating, can produce significant effects as well. Finally, we quantify this phenomenon in three model organisms and find thousands of pairs of moderate-frequency (> 5%) genetically indistinguishable variants in relatively large data sets. These results clarify a previously underexplored population genetic phenomenon with important implications for association studies and define conditions under which it is likely to manifest.
KW - Genetically indistinguishable
KW - Genome-wide association study
KW - Linkage disequilibrium
UR - http://www.scopus.com/inward/record.url?scp=84979917607&partnerID=8YFLogxK
U2 - 10.1534/genetics.115.177816
DO - 10.1534/genetics.115.177816
M3 - Article
SN - 0016-6731
VL - 202
SP - 427
EP - 437
JO - Genetics
JF - Genetics
IS - 2
ER -