Correcting for measurement error in individual ancestry estimates in structured association tests

Jasmin Divers; Laura K Vaughan; Miguel A Padilla; José R Fernandez; David B Allison; David T Redden

doi:10.1534/genetics.107.075408

Genetics. 2007 Jul;176(3):1823-33. doi: 10.1534/genetics.107.075408. Epub 2007 May 16.

Authors

Jasmin Divers¹, Laura K Vaughan, Miguel A Padilla, José R Fernandez, David B Allison, David T Redden

Affiliation

¹ Center for Public Health Genomics, Department of Biostatistical Sciences, Division of Public Health Services, Wake Forest University Health Sciences, Winston-Salem, North Carolina 27101, USA.

Abstract

We present theoretical explanations and show through simulation that the individual admixture proportion estimates obtained by using ancestry informative markers should be seen as an error-contaminated measurement of the underlying individual ancestry proportion. These estimates can be used in structured association tests as a control variable to limit type I error inflation or reduce the loss of power due to population stratification observed in studies of admixed populations. However, the inclusion of such error-containing variables as covariates in regression models can bias parameter estimates and reduce the ability to control for the confounding effect of admixture in genetic association tests. Measurement error correction methods offer a way to overcome this problem but require an a priori estimate of the measurement error variance. We show how an upper bound of this variance can be obtained, present four measurement error correction methods that are applicable to this problem, and conduct a simulation study to compare their utility in the case where the admixed population results from the intermating of two ancestral populations. Our results show that the quadratic measurement error correction (QMEC) method performs better than the other methods and maintains the type I error rate at its nominal level.
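The confounding problem described above can be illustrated with a small simulation. The sketch below is not the QMEC method or any of the four methods compared in the paper; it uses a generic moment-based correction (subtracting the error variance from the covariate covariance matrix) and assumes classical additive error with a known variance, whereas the abstract only promises an upper bound. All names and numeric settings (`simulate`, `slopes`, the Beta(2, 2) ancestry distribution, the allele-frequency model) are hypothetical choices made for illustration.

```python
import numpy as np


def simulate(n=50_000, sigma_u=0.15, beta_anc=2.0, seed=0):
    """Simulate a two-way admixed sample with NO true genotype effect.

    Assumed (illustrative) model:
      a : true individual ancestry proportion, Beta(2, 2)
      w : error-contaminated ancestry estimate, w = a + u, u ~ N(0, sigma_u^2)
      g : genotype (0/1/2) whose allele frequency depends on ancestry
      y : phenotype that depends on ancestry only
    """
    rng = np.random.default_rng(seed)
    a = rng.beta(2.0, 2.0, size=n)
    w = a + rng.normal(0.0, sigma_u, size=n)
    allele_freq = 0.1 + 0.7 * a  # frequency differs between ancestral populations
    g = rng.binomial(2, allele_freq).astype(float)
    y = beta_anc * a + rng.normal(0.0, 1.0, size=n)
    return a, w, g, y


def slopes(covariates, y, error_var=None):
    """Least-squares slopes from sample moments.

    If error_var is given (one variance per covariate), it is subtracted
    from the diagonal of the covariate covariance matrix before solving,
    which is a simple method-of-moments measurement error correction.
    """
    x = np.column_stack(covariates)
    xc = x - x.mean(axis=0)
    yc = y - y.mean()
    n = len(y)
    s_xx = xc.T @ xc / (n - 1)
    s_xy = xc.T @ yc / (n - 1)
    if error_var is not None:
        s_xx = s_xx - np.diag(error_var)
    return np.linalg.solve(s_xx, s_xy)


a, w, g, y = simulate()

# Adjusting for the true ancestry: genotype slope should be near zero.
oracle = slopes([g, a], y)
# Adjusting for the noisy estimate: residual confounding leaks into the genotype slope.
naive = slopes([g, w], y)
# Same noisy estimate, but with the (assumed known) error variance removed.
corrected = slopes([g, w], y, error_var=[0.0, 0.15**2])

print("genotype slope, true ancestry covariate: ", oracle[0])
print("genotype slope, noisy ancestry covariate:", naive[0])
print("genotype slope, moment-corrected:        ", corrected[0])
```

In this setup the genotype has no causal effect, so a genotype slope that stays away from zero under the noisy covariate is the kind of spurious association that inflates type I error. Real ancestry estimates are bounded in [0, 1] and their error need not be additive or normal, so this should be read only as a qualitative demonstration.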

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Biological Evolution
Computer Simulation
Confounding Factors, Epidemiologic*
Genetics, Population*
Humans
Methods
Models, Genetic