Development and validation of polygenic risk scores for prediction of breast cancer and breast cancer subtypes in Chinese women

BMC Cancer. 2022 Apr 8;22(1):374. doi: 10.1186/s12885-022-09425-3.

Abstract

Background: Studies investigating breast cancer polygenic risk score (PRS) in Chinese women are scarce. The objectives of this study were to develop and validate PRSs that could be used to stratify risk for overall and subtype-specific breast cancer in Chinese women, and to evaluate the performance of a newly proposed Artificial Neural Network (ANN) based approach for PRS construction.

Methods: The PRSs were constructed using the dataset from a genome-wide association study (GWAS) and validated in an independent case-control study. Three approaches, including repeated logistic regression (RLR), logistic ridge regression (LRR) and ANN based approach, were used to build the PRSs for overall and subtype-specific breast cancer based on 24 selected single nucleotide polymorphisms (SNPs). Predictive performance and calibration of the PRSs were evaluated unadjusted and adjusted for Gail-2 model 5-year risk or classical breast cancer risk factors.

Results: The primary PRSANN and PRSLRR both showed modest predictive ability for overall breast cancer (odds ratio per interquartile range increase of the PRS in controls [IQ-OR] 1.76 vs 1.58; area under the receiver operator characteristic curve [AUC] 0.601 vs 0.598) and remained to be predictive after adjustment. Although estrogen receptor negative (ER-) breast cancer was poorly predicted by the primary PRSs, the ER- PRSs trained solely on ER- breast cancer cases saw a substantial improvement in predictions of ER- breast cancer.

Conclusions: The 24 SNPs based PRSs can provide additional risk information to help breast cancer risk stratification in the general population of China. The newly proposed ANN approach for PRS construction has potential to replace the traditional approaches, but more studies are needed to validate and investigate its performance.

Keywords: Artificial neural network; Breast cancer; Estrogen receptor-negative breast cancer; Polygenic risk score; Single nucleotide polymorphisms.

MeSH terms

  • Breast Neoplasms* / diagnosis
  • Breast Neoplasms* / epidemiology
  • Breast Neoplasms* / genetics
  • Case-Control Studies
  • Female
  • Genetic Predisposition to Disease
  • Genome-Wide Association Study
  • Humans
  • Polymorphism, Single Nucleotide
  • Risk Factors