Establishment of a standardized system to perform population structure analyses with limited sample size or with different sets of SNP genotypes.

Kumasaka N; Yamaguchi-Kabata Y; Takahashi A; Kubo M; Nakamura Y; Kamatani N

doi:10.1038/jhg.2010.63

Affiliations

1. Laboratory for Statistical Analysis, Research Group for Medical Informatics, Center for Genomic Medicine, RIKEN, Tokyo, Japan.
Authors
Kumasaka N¹

Journal of Human Genetics, 17 Jun 2010, 55(8):525-533
https://doi.org/10.1038/jhg.2010.63 PMID: 20555335

Abstract

Recent studies have demonstrated that principal component analysis (PCA) can detect the presence of population mixture and admixture in a sample and thus can be used to correct population stratification in genome-wide association studies (GWAS). We propose a complementary approach to PCA that compensates for potential weaknesses associated with PCA, so that one can perform population structure analyses using limited numbers of subjects and single-nucleotide polymorphisms (SNPs). Our method first requires a PCA of the largest reference sample from a population to standardize the system. Once the system is established, it can perform PCA for each individual with a much smaller number of SNPs drawn from the same population. This is because of the introduction of the probabilistic PCA, so that the prediction of the principal components (PCs) is performed under a rigorous probabilistic framework. The subsequent linear discriminant analysis also helps to understand from which ancestries or subpopulations a given individual is more likely to derive, in terms of posterior probabilities given the predicted PCs. A real-world prototype of the system for the Japanese population is developed based on 19 260 subjects, which illustrates the potential usefulness of the system as an aid in the detection of population structures in validation samples, or to help with the correction of population stratification in GWAS.
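The pipeline described in the abstract (reference PCA, probabilistic prediction of PCs from a smaller SNP set, then linear discriminant analysis) can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the closed-form maximum-likelihood probabilistic PCA of Tipping and Bishop, a shared-covariance Gaussian discriminant rule, and synthetic genotypes for two hypothetical subpopulations. All function and variable names (`fit_ppca`, `predict_pcs`, `lda_posterior`, and so on) are invented for illustration, and the sketch ignores the uncertainty of the predicted PCs when computing posteriors.

```python
import numpy as np

def fit_ppca(G, q):
    """Closed-form ML probabilistic PCA on a reference genotype matrix G (n x p)."""
    n, p = G.shape
    mu = G.mean(axis=0)
    _, s, Vt = np.linalg.svd(G - mu, full_matrices=False)
    eigvals = s**2 / n                      # nonzero eigenvalues of the sample covariance
    sigma2 = eigvals[q:].sum() / (p - q)    # noise variance: mean of the discarded eigenvalues
    W = Vt[:q].T * np.sqrt(eigvals[:q] - sigma2)  # p x q loading matrix
    return mu, W, sigma2

def predict_pcs(g, snp_idx, mu, W, sigma2):
    """Posterior mean of the latent PCs given genotypes at the SNPs in snp_idx only."""
    Wo = W[snp_idx]                         # loadings restricted to the observed SNPs
    M = Wo.T @ Wo + sigma2 * np.eye(W.shape[1])
    return np.linalg.solve(M, Wo.T @ (g - mu[snp_idx]))

def lda_posterior(z, class_means, pooled_cov, priors):
    """Posterior class probabilities under Gaussians with a shared covariance."""
    d = z - class_means
    maha = np.einsum("kq,qr,kr->k", d, np.linalg.inv(pooled_cov), d)
    logp = np.log(priors) - 0.5 * maha
    w = np.exp(logp - logp.max())           # subtract the max for numerical stability
    return w / w.sum()

# Synthetic reference sample (assumption): two subpopulations, 2000 SNPs.
rng = np.random.default_rng(0)
p, n_per_pop, q = 2000, 200, 2
f0 = rng.uniform(0.1, 0.9, p)
f1 = np.clip(f0 + rng.normal(0, 0.15, p), 0.05, 0.95)
freqs = np.stack([f0, f1])
labels = np.repeat([0, 1], n_per_pop)
G_ref = rng.binomial(2, freqs[labels]).astype(float)

# Step 1: standardize the system on the reference sample.
mu, W, sigma2 = fit_ppca(G_ref, q)
all_snps = np.arange(p)
Z_ref = np.array([predict_pcs(g, all_snps, mu, W, sigma2) for g in G_ref])

# Step 2: fit the discriminant rule on the reference PCs.
class_means = np.array([Z_ref[labels == k].mean(axis=0) for k in (0, 1)])
resid = Z_ref - class_means[labels]
pooled_cov = resid.T @ resid / (len(labels) - 2)
priors = np.array([0.5, 0.5])

# Step 3: a new individual from subpopulation 1, typed at only 500 of the SNPs.
snp_subset = rng.choice(p, size=500, replace=False)
g_new = rng.binomial(2, f1[snp_subset]).astype(float)
z_hat = predict_pcs(g_new, snp_subset, mu, W, sigma2)
posterior = lda_posterior(z_hat, class_means, pooled_cov, priors)
print("predicted PCs:", z_hat)
print("posterior over subpopulations:", posterior)
```

The point of the design is that the reference fit is done once; a later sample only needs the stored mean, loadings, and noise variance for whichever SNPs it shares with the reference panel.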

Full text links

Read article at publisher's site: https://doi.org/10.1038/jhg.2010.63

Read article for free, from open access legal sources, via Unpaywall: https://www.nature.com/articles/jhg201063.pdf


