Skip to main content
  • AACR Publications
    • Cancer Discovery
    • Cancer Epidemiology, Biomarkers & Prevention
    • Cancer Immunology Research
    • Cancer Prevention Research
    • Cancer Research
    • Clinical Cancer Research
    • Molecular Cancer Research
    • Molecular Cancer Therapeutics

  • Register
  • Log in
Advertisement

Main menu

  • Home
  • About
    • The Journal
    • AACR Journals
    • Subscriptions
    • Permissions and Reprints
    • Reviewing
  • Articles
    • OnlineFirst
    • Current Issue
    • Past Issues
    • Meeting Abstracts
    • Collections
      • Focus on Computer Resources
      • Highly Cited Collection
      • Editors' Picks
  • For Authors
    • Information for Authors
    • Author Services
    • Early Career Award
    • Best of: Author Profiles
    • Submit
  • Alerts
    • Table of Contents
    • OnlineFirst
    • Editors' Picks
    • Citations
    • Author/Keyword
  • News
    • Cancer Discovery News
  • AACR Publications
    • Cancer Discovery
    • Cancer Epidemiology, Biomarkers & Prevention
    • Cancer Immunology Research
    • Cancer Prevention Research
    • Cancer Research
    • Clinical Cancer Research
    • Molecular Cancer Research
    • Molecular Cancer Therapeutics

User menu

  • Register
  • Log in

Search

  • Advanced search
Cancer Research
Cancer Research

Advanced Search

  • Home
  • About
    • The Journal
    • AACR Journals
    • Subscriptions
    • Permissions and Reprints
    • Reviewing
  • Articles
    • OnlineFirst
    • Current Issue
    • Past Issues
    • Meeting Abstracts
    • Collections
      • Focus on Computer Resources
      • Highly Cited Collection
      • Editors' Picks
  • For Authors
    • Information for Authors
    • Author Services
    • Early Career Award
    • Best of: Author Profiles
    • Submit
  • Alerts
    • Table of Contents
    • OnlineFirst
    • Editors' Picks
    • Citations
    • Author/Keyword
  • News
    • Cancer Discovery News
Prevention and Epidemiology

Assessment of Breast Cancer Risk Factors Reveals Subtype Heterogeneity

Johanna Holm, Louise Eriksson, Alexander Ploner, Mikael Eriksson, Mattias Rantalainen, Jingmei Li, Per Hall and Kamila Czene
Johanna Holm
Department of Medical Epidemiology & Biostatistics, Karolinska Institutet, Solna, Sweden.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: Johanna.holm@ki.se
Louise Eriksson
Department of Medical Epidemiology & Biostatistics, Karolinska Institutet, Solna, Sweden.Department of Oncology and Pathology, Karolinska Institutet and University Hospital, Stockholm, Sweden.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexander Ploner
Department of Medical Epidemiology & Biostatistics, Karolinska Institutet, Solna, Sweden.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mikael Eriksson
Department of Medical Epidemiology & Biostatistics, Karolinska Institutet, Solna, Sweden.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mattias Rantalainen
Department of Medical Epidemiology & Biostatistics, Karolinska Institutet, Solna, Sweden.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jingmei Li
Department of Medical Epidemiology & Biostatistics, Karolinska Institutet, Solna, Sweden.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Per Hall
Department of Medical Epidemiology & Biostatistics, Karolinska Institutet, Solna, Sweden.Department of Oncology, Södersjukhuset, Stockholm, Sweden.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kamila Czene
Department of Medical Epidemiology & Biostatistics, Karolinska Institutet, Solna, Sweden.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
DOI: 10.1158/0008-5472.CAN-16-2574 Published July 2017
  • Article
  • Figures & Data
  • Info & Metrics
  • PDF
Loading

Abstract

Subtype heterogeneity for breast cancer risk factors has been suspected, potentially reflecting etiologic differences and implicating risk prediction. However, reports are conflicting regarding the presence of heterogeneity for many exposures. To examine subtype heterogeneity across known breast cancer risk factors, we conducted a case–control analysis of 2,632 breast cancers and 15,945 controls in Sweden. Molecular subtype was predicted from pathology record–derived IHC markers by a classifier trained on PAM50 subtyping. Multinomial logistic regression estimated separate ORs for each subtype by the exposures parity, age at first birth, breastfeeding, menarche, hormone replacement therapy use, somatotype at age 18, benign breast disease, mammographic density, polygenic risk score, family history of breast cancer, and BRCA mutations. We found clear subtype heterogeneity for genetic factors and breastfeeding. Polygenic risk score was associated with all subtypes except for the basal-like (Pheterogeneity < 0.0001). “Never breastfeeding” was associated with increased risk of basal-like subtype [OR 4.17; 95% confidence interval (CI) 1.89–9.21] compared with both nulliparity (reference) and breastfeeding. Breastfeeding was not associated with risk of HER2-overexpressing type, but protective for all other subtypes. The observed heterogeneity in risk of distinct breast cancer subtypes for germline variants supports heterogeneity in etiology and has implications for their use in risk prediction. The association between basal-like subtype and breastfeeding merits more research into potential causal mechanisms and confounders. Cancer Res; 77(13); 3708–17. ©2017 AACR.

Introduction

Breast cancer is a molecularly diverse disease. At least four subtypes have been robustly established following gene expression–based characterization in the early 2000s (1, 2). These subtypes (Luminal A, luminal B, HER2-overexpressing, and basal-like) behave differently in terms of age at onset and prognosis, but the question remains to what extent they also represent etiologically distinct cancers and differ in risk factors. Recently analysis of recurrent and metastasizing patient samples has shown that breast cancer can drift in molecular subtype throughout disease progression (3–6). However, it has repeatedly been shown that in situ cancers display the full spectrum of molecular subtypes before any signs of invasiveness (7–11), pointing toward early determination of the cancer's nature. Presence of subtype heterogeneity for breast cancer risk factors could indicate separate etiology for one or several subtypes of breast cancer, and would also implicate clinical efforts in risk prediction and prevention.

Studies addressing subtype heterogeneity of risk factors have so far employed various IHC marker combinations as proxy for the gene expression–based classifications, which limits precision of findings. Because of the majority of cases in study populations being luminal, one could expect known risk factors to be biased toward predicting luminal breast cancers. A few consistent observations have been made: compared with the most prevalent subtype Luminal A, basal-like breast cancer is associated with younger age at onset, presence of BRCA1 mutations, and recent African heritage (3). However, results are less consistent for lifestyle and reproductive risk factors such as hormone replacement therapy (HRT) use, parity, age at first birth, and age at menarche (4). Most consistent is a stronger protective effect of breastfeeding on the basal-like subtype among parous women (5–8). Intriguingly, although parity has been consistently associated with decreased risk of luminal subtypes, results vary from a decreased to an increased risk of the basal-like subtype (4). Very little is known about risk factors for the luminal B and HER2-overexpressing types, related to difficulties in finding consistent IHC proxies to represent them (4). Moreover, because the relative proportions of subtypes vary across ethnicities, genetic variation has been put forward as explaining variation in subtype incidence (9, 10), but could also be explained by environmental, lifestyle, and reproductive differences (11–14). Germline genetic risk factors beyond BRCA mutations should therefore also be included in studies of subtype heterogeneity.

Our goal was to determine whether differences in breast cancer risk factors are related to breast cancer molecular subtypes, addressing subtype heterogeneity of risk factors in a Swedish study population. To improve precision in classification of the outcome, we made use of a subset of our data where PAM50 subtyping was available and predicted subtype on the full dataset from these, as well as used a previously established IHC proxy to define subtypes to assess the robustness of any findings.

Patients and Methods

Setting

This is a case–control study based on two Swedish breast cancer cohorts. The study was conducted in accordance with the Declaration of Helsinki. Ethical approvals were granted from the regional ethical vetting board and all participants gave written informed consent.

Participants

Participants were recruited from the the KARolinska MAmmography Project for Risk Prediction of Breast Cancer (KARMA) and Libro-1study cohorts (15, 16). KARMA is a prospective cohort study of 70,877 women with or without breast cancer, recruited in 2011–2013, from four mammography units situated in Skåne county and Stockholm conducting both population-based mammography screening and clinical mammography. Libro1 is a case-only, population-based cohort consisting of 5,715 women diagnosed with breast cancer in Stockholm years 2001–2008. Women in both studies answered questionnaires, donated blood at enrollment, and consented to the retrieval of their mammograms and medical records. Questionnaires and study material were largely similar for both studies as Libro-1 was the pilot study of KARMA.

All primary invasive breast cancer cases from both studies diagnosed 2005 to 2015 were eligible for inclusion (n = 4,598). The cutoff at 2005 was chosen as IHC markers HER2 and Ki67 were not stained for, and thus not available in medical records, prior to this year. Exclusion criteria were missed information on either of the IHC markers ER, PR, HER2, and Ki67 (n = 1,265). Controls were randomly selected among breast cancer–free participants of the KARMA study, frequency-matched up to 1:5 to cases on age at enrollment (controls) to age at diagnosis (cases) in 5-year strata using a greedy nearest neighbor algorithm without replacement (17). The final study sample consisted of 2,632 breast cancers and 15,945 controls. For the analysis of the polygenic risk score, all KARMA women who did not have breast cancer and who had available genotyping (n = 5,425) were used as controls.

Data on exposures

Data on parity, age at first birth, breastfeeding, HRT use, somatotype at age 18, menarche, weight and height at enrollment, BRCA1/2 mutation carriership, country of birth, education and family history of breast cancer was collected from web- and paper study questionnaires answered at study enrollment, with all variables harmonized between the two studies prior to analysis. Recall of HRT use was aided by pictures of HRT brands dispensed in Sweden. Somatotypes were illustrated by 9 pictograms in the questionnaires and women were asked which pictogram most resembled them at age 18. As BRCA mutation status was self-reported and thus more likely to be known for cases, only data on BRCA status from cases were considered for this study.

History of benign breast disease (BBD) was obtained from pathology records. Only BBD diagnoses at least one year prior to breast cancer for cases, or before 2013 for the controls, were included in analysis. Mammograms were collected from radiology departments and available for 90% of the study population. Mammographic density (MD) was measured using an automated method previously described (15, 18). Image pairs from a reading within 4 years prior to diagnosis or study enrollment were selected for cases and controls, respectively, and absolute MD from the left and right mediolateral oblique view were averaged. Data on SNP markers was obtained from blood donated by participants at enrollment, which had been genotyped on a custom Illumina iSelect Array (iCOGS Array; ref. 19). Missing genotypes were imputed using 1000 Genomes [phase I integrated variant set release (v3) in National Center for Biotechnology Information build 37 (hg19) coordinates]. Polygenic risk scores (PRS) were constructed using 77 breast cancer risk SNPs discovered in large consortia studies. The score for each patient was calculated by summing the number of alleles for each SNP (0, 1 or 2), weighted by the per-allele ORs for the minor alleles reported by Mavvadat and colleagues (20). Two scores were generated, one general score using breast cancer risk ORs as weights and the other score using weights from associations to ER-negative breast cancer only. Full details on the construction of the PRS were described previously (21).

Data on tumor characteristics (cases only)

Data on molecular markers were retrieved in 2015–2016 from medical and pathology records at treating hospitals. Percent estrogen receptor (ER) and progesterone receptor (PR) staining was dichotomized into positive or negative status with a cutoff at ≥10% as positive during this period. HER2 status was dichotomized into positive or negative according to the Swedish Society of Pathology's guidelines (22): HER2 was considered negative if protein expression showed 0 or 1+, or was higher with no confirmed gene amplification by FISH, and positive if FISH showed gene amplification. Proliferation marker Ki67 was measured in hotspot regions according to contemporary guidelines (22) and reported as percent staining. Information on tumor invasiveness and prior breast cancer diagnoses was obtained through merges to the Swedish National Cancer Register (23) and the Regional Breast Cancer Quality Register (24) using the unique, Swedish personal identity numbers (25).

Outcome classification

A random forest algorithm was used to construct a subtype classifier using the caret R package (v. 6.0.58; ref. 26). As part of the Clinical Sequencing of Cancer in Sweden (Clinseq) project, 237 of the cases had had tumors RNA-sequenced and assigned into PAM-50 molecular subtypes as described previously (27). The algorithm was trained to predict subtype on the subset of the data with PAM50 subtype information available (“training data”, n = 237). Binary ER, PR, HER2, continuous Ki67 and age at diagnosis were entered as input and the algorithm run with five times repeated 10-fold cross validation to avoid overfitting. Accuracy and kappa values were used to select the best performing algorithm. The resulting best algorithm (hereafter denoted “classifier”) then assigned PAM50 subtype to all remaining cases based on their age, ER, PR, HER2, and Ki67 status.

As a sensitivity analysis, the St. Gallen method of using IHC markers as a proxy for gene expression–based subtyping (28) was used to assign subtype, with a modified cutoff for Ki67 of 25 % instead of 14% due to the lack of whole-slide percent for Ki67. This proxy defines luminal A as ER+/PR+/HER2− and KI67low, luminal B as either ER+/PR−/HER2−, or ER+/PR+/HER2−, KI67high, or ER+/HER2+/any PR, any KI67. HER2-overexpressing is defined as ER−/PR−/HER2+, and basal-like defined as ER−/PR−/HER2− (triple negative).

To assess the performance of our classifier and the St. Gallen proxy, we obtained accuracy and kappa statistics for both methods as compared against gene expression–based PAM50 subtyping, by resampling from the 237 observations with PAM50 data (function “resamples” from the caret package in R). Resampled values were necessary to avoid over-fitted, over-optimistic statistics for the classifier, which had been trained on the same data. Confusion Matrices of actual verses predicted subtype were tabulated for both methods, in the training dataset.

Statistical analysis

Multivariable regression.

In multivariable regression analysis, breast tumors of different subtypes were considered as separate outcomes and their respective risks were modeled relative to healthy controls via multinomial logistic regression. For simplicity, we refer to the resulting relative RRs as ORs. Heterogeneity in ORs was formally assessed with a global Wald test, testing the null hypothesis that the risk associated with the exposure was the same across all subtypes. In addition, multinomial logistic regression was performed in a case-only design using luminal A cases as the reference group. For the sake of comparison to subtype-specific ORs, ORs comparing all breast cancer cases to controls were obtained using unconditional logistic regression. All analyses were minimally adjusted for age (matching variable), education level (<10 years, 10–12 years, university, other) and country of birth (binary, Sweden/other), further covariates were included as potential confounders in the models based on subject matter knowledge. The sets of covariates included in each of the fully adjusted models are stated in the respective tables. Only case-only analysis was performed pertaining to exposure BRCA mutation status.

Exposure parameterizations.

First-degree family history of breast cancer was modeled as a binary variable, defined as having a mother or sister with breast cancer, yes/no. Continuous variables for polygenic risk scores were scaled prior to modeling and modeled per 1-standard deviation (SD) increase. Both PRSs were additionally modeled as categorical variables, cut into quartiles of the scores. Parity was modeled as a categorical (0, 1–2, >2 children) and continuous variable. Age at first birth was dichotomized into < 30 and ≥30 years of age, restricting analysis to parous women. Breastfeeding was categorized into 0, >0–1.5 years, and >1.5 years, after summarizing the total length of breastfeeding across all children. Breastfeeding was also assessed as a composite variable including parity, by using nulliparous women as reference group. HRT use was modeled as “Ever”/“Never” use. Women who reported use of locally administered HRT exclusively were coded as never users. Somatotype was modeled as a semicontinuous variable by assigning integers to each somatotype 1–9 from lowest to highest adiposity. Menarche was modeled per year's delay as a continuous variable. Mammographic density was scaled prior to modeling and assessed per-1 SD increase. Benign breast disease was separated into nonproliferative and proliferative, nonatypical lesions and modeled as binary variables of “ever”/“never” diagnosed. Atypical proliferative lesions were not included in analysis due to insufficient numbers.

All statistical tests were two-sided with a predetermined cutoff for statistical significance at α= 0.05. Software R (29) v.3.2.2 was used for all statistical analysis.

Results

Outcome classification

The average resampled accuracy and kappa values were 0.73 and 0.55 for the random forest classifier, and 0.64 and 0.46 for the St. Gallen IHC proxy. For all main tables, the random forest classifier was used for outcome classification. The mode of IHC marker combinations for cases classified as luminal A or B was ER+/PR+/HER2−, observed for 80% and 54%, respectively, for HER2-overexpressing ER−/PR−/HER2+, observed for 45%, and for basal-like triple-negative for all markers, observed for 85%. Average percentage Ki67 staining was 14% among luminal A tumors, 46% among luminal B tumors, 36% among HER2-overexpressing tumors, and 69% among basal-like tumors (Table 1). Confusion matrixes of true versus predicted subtypes revealed that both the classifier and the St. Gallen IHC proxy were good at capturing luminal A and basal-like status, but performed worse for luminal B and HER2-overexpressing tumors (Supplementary Table S1).

View this table:
  • View inline
  • View popup
Table 1.

Demographics and covariates by case status and breast cancer subtype

Descriptive cross-tabulations

Table 1 shows crude descriptive contingency tables of cases verses controls, and cases by subtypes for adjusting variables and selected exposures of interest. Cases tended to be more highly educated, more often born abroad, had a higher frequency of family history of breast cancer and had breastfed less than controls. Within subtypes, variations in age, BRCA mutations, and breastfeeding were observed. Although age ranges were similar across subtypes, luminal A cases were on average older than the other categories (59 years) and basal-like cases were youngest at diagnosis (52 years; Table 1).

Adjusted case–control ORs

Multivariable regression analysis of genetic background risk factors for each subtype is shown in Table 2. The general 77-SNP PRS was associated with all subtypes except for the basal-like subtype, and most strongly associated with the luminal A subtype. The Wald test showed significant heterogeneity across subtype odds ratios (Pheterogeneity <0.0001). In contrast to the general PRS, the PRS weighted on associations to ER-negative breast cancer yielded no evidence of heterogeneity of effect across subtypes (Pheterogeneity = 0.43, Table 2). There was no evidence of heterogeneity for first-degree family history across breast cancer subtypes (Table 2).

View this table:
  • View inline
  • View popup
Table 2.

Risk of breast cancer, overall and by subtype, by genetic background

Multivariable regression analysis of reproductive factors by subtypes is shown in Table 3. There was no statistical evidence of heterogeneity between subtypes for parity; however, judging from point estimates, parity had a protective effect on all subtypes except for basal-like subtype. Similarly, in analysis of parous women, no heterogeneity was observed for age at first birth (Table 3). The effects of breastfeeding were heterogeneous across subtypes (Pheterogeneity = 0.01). With nulliparous women as reference group, parous women who never breastfed had an increased risk of basal-like breast cancer (OR 4.17; 95 % CI, 1.89–9.21). Breastfeeding returned the risk of basal-like breast cancer to that of nulliparous women (OR breastfeeding <1.5 years 1.02; 95% CI, 0.59–1.76), OR breastfeeding ≥ 1.5 years 0.81; 95% CI, 0.43–1.60; Table 3). In contrast, the risk of developing luminal A breast cancer was no different from nulliparous women for parous women never breastfeeding (OR 1.01; 95% CI, 0.74–1.39), but a protective effect was seen for breastfeeding (breastfeeding <1.5 years, OR 0.69; 95% CI, 0.59–0.82), breastfeeding ≥ 1.5 years, OR 0.63; 95% CI, 0.52–0.76). Luminal B showed point estimates similar to those of luminal A, whereas HER2-overexpressing type was unaffected by breastfeeding (Table 3).

View this table:
  • View inline
  • View popup
Table 3.

Risk of breast cancer overall and by subtype, by reproductive risk factors

Multivariable analysis of lifestyle and nonreproductive hormonal factors by subtypes is shown in Table 4. Ever use of HRT showed heterogeneity (Pheterogeneity = 0.05), with increased risk for the luminal A (ever HRT use, OR 1.43; 95% CI, 1.28–1.61) but no effect among the other subtypes (Table 4). Point estimates for age at menarche, somatotype at age 18, and benign breast disease also suggested differences by subtype but no statistically significant heterogeneity was observed. Still, for age at menarche, a protective effect of increasing age was observed for the luminal A and B subtypes alone. Increasingly endomorph somatotype at age 18 was associated with a protective effect for the luminal and HER2 subtypes, whereas basal-like appeared null-associated. Proliferative nonatypical lesions were associated with luminal A cancers but showed similar estimates for other subtypes except luminal B, whereas nonproliferative lesions were null-associated with all subtypes except for a possible increase for the HER2-subtype. There was no evidence of subtype heterogeneity for mammographic density, with an increased risk for all subtypes of disease with increasing density (Table 4).

View this table:
  • View inline
  • View popup
Table 4.

Risk for breast cancer overall and by subtype: ever hormone replacement therapy (HRT use), age at menarche, somatotype at age 18, mammographic density, benign breast disease

Summary of findings for risk factors that displayed subtype heterogeneity are shown graphically as forest plots in Figure 1, separately by subtype. Luminal A and B showed very similar estimates, whereas the basal-like subtype displayed distinct features (Fig. 1).

Figure 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1.

Forest plots summarizing observed heterogeneity of results for exposures breastfeeding, PRS, and HRT use across the subtypes. BF, breastfeeding; PRS, polygenic risk score; Q1–4, quartiles 1 to 4 of the PRS.

All analyses were repeated within a case-only design with luminal A as reference, yielding the same conclusions regarding heterogeneity for PRS, breastfeeding, and HRT use as the case–control analysis (Supplementary Tables S2–S4). Case-only analysis of self-reported BRCA mutations among cases showed that basal-like tumors had higher prevalence of BRCA mutations than other subtypes, with an OR of 11.31 (95% CI, 5.37–23.07) relative to luminal A tumors (Supplementary Table S2).

Sensitivity analysis

All findings were replicated albeit attenuated when using the St. Gallen IHC proxy to define subtypes, except for the heterogeneity found for HRT use, which was not observed (Supplementary Tables S5–S7).

Discussion

Analysis of 2,632 breast cancer cases revealed evidence of subtype heterogeneity for three categories of risk factors: genetic susceptibility, HRT use, and breastfeeding. The 77-SNP PRS was exclusively associated with risk of non-basal–like subtypes, with the largest effect size for luminal A breast cancers. HRT use was associated with risk of the luminal A subtype only. Compared with nulliparous women, never breastfeeding was associated with an increased risk of basal-like breast cancer but not with risk of the other subtypes. Among parous women, breastfeeding was protective for the luminal A, B, and basal-like subtypes but null-associated with the HER2-overexpressing subtype. Luminal A and B breast cancers were very similar in associations with most risk factors.

Both germline BRCA mutations (Supplementary Table S2) and PRS (Table 2) were differentially associated with subtypes, suggesting that in addition to the previously observed heterogeneity for BRCA1 mutations (1, 2), inherited low-risk variants could also differentially increase risk of specific breast cancer subtypes. BRCA1 mutations have been shown experimentally to result in accumulation of undifferentiated luminal progenitor cells (30–33), the suspected cell-of-origin of basal-like breast cancer (34, 35), but it is not known how or if low-risk SNPs could represent an etiologic difference between subtypes. Individual SNPs have been found to be associated with either subtype or ER status (36–39), supporting our finding for the PRS. The ER-negative weighted PRS was associated with basal-like breast cancer in our study thus showing potential as a complement or replacement to the overall PRS score for identification of women at risk of this aggressive disease. Collectively, these observations also indicate a role of germline variants beyond BRCA in distinct etiology of molecular subtypes, which should be further investigated.

Our results confirm previous reports of the largest protective effect of breastfeeding on risk of basal-like breast cancer (5–8). We additionally show that the protective effect for basal-like breast cancer stems from never-breastfeeding parous women having higher risk of the basal-like subtype than both nulliparous and breastfeeding women. The reason behind this increased risk, should it be causal, is not known. As basal-like cancers are thought to originate from undifferentiated luminal progenitor cells (33–34), the association may be related to higher numbers of progenitor cells in the absence of breastfeeding. Fully differentiated type 4 lobules do not form until the end of pregnancy and during lactation under the influence of prolactin (40), and prolactin, released throughout lactation (41), has recently been identified as a central promotor of luminal progenitor cell maturation in vitro (42). This hypothesis would agree with observations of BRCA1 mutations resulting in accumulation of luminal progenitor cells (30–33) and the high proportion of basal-like breast cancer among BRCA1 mutation carriers. Epidemiologic studies have additionally shown that BRCA1 carriers who breastfeed are more protected from developing breast cancer (43, 44). However, future studies should definitely consider possible confounding by lifestyle factors, preferably including data on reasons for not breastfeeding. The lack of association with breastfeeding for the HER2-overexpressing subtype distinguishes it from the other subtypes and merits further investigation, preferably with gene expression–based subtype definitions.

Point estimates for somatotype, menarche, age at first birth, parity, and benign breast disease suggested heterogeneity but were not statistically different. We saw a protective effect of parity on all subtypes except for a null association of risk of basal-like breast cancer with parity, after adjusting for age at first birth and breastfeeding. This is in line with some of the published literature, whereas others have reported increased risks of the basal-like subtype with increasing parity. Differences may be partly due to underlying variations in prevalence of breastfeeding.

We found no evidence of subtype heterogeneity for mammographic density, or with ER-negative weighted PRS. These are important messages for prevention efforts using such exposures in risk prediction, as they would be expected to identify women at risk of breast cancer independent of subtype. Mammographic density is an especially promising tool to predict risk, as it is also among the stronger risk factors for the disease.

All results were robust albeit sometimes attenuated in sensitivity analysis using the St. Gallen IHC proxy to define subtypes (Supplementary Tables S5–S7). The only exception was seen for HRT use, which did not display subtype heterogeneity using the St. Gallen IHC proxy. Future studies should ideally address this question using true gene expression–based subtype data, but our results suggest heterogeneity in subtype risk for HRT use.

A limitation of our study is that we only had true PAM50 subtype information for 237 of the cases and predicted subtype for the rest. As the assigned subtypes were largely latent for true subtype status, model estimates should be interpreted with caution. Although the overall accuracy of our classifier was higher compared with the St. Gallen IHC proxy for defining subtypes, it still performed poorly in classifying luminal B and HER2 types and our results should be interpreted in light of this.

The retrospective nature of this study introduces some elements of caution in interpretation. Cases may recollect exposures differently than controls, but there is little reason to believe such differences would vary greatly by subtype. Controls were only available from one of the cohorts, which potentially could introduce bias. Reassuring is that the case–control ORs for generic breast cancer exhibited the expected strengths and directions of associations. Moreover, case-only analysis yielded the same conclusions regarding heterogeneity as case–control analysis. Strengths of the current work include the large cohort, the use of a subtype classifier with improved accuracy compared with a previously used IHC-based method, and the availability of genetic, lifestyle, and reproductive exposures comprehensively assessed in the same study.

In conclusion, both rare and common inherited gene variants displayed subtype heterogeneity primarily between basal-like and non-basal–like subtypes, suggesting separate etiology and implications for risk prediction. We additionally found subtype heterogeneity by breastfeeding status. Relative to nulliparous women, women who did not breastfeed postpartum were exclusively at an increased risk of basal-like breast cancer. Future research is needed to confirm or refute this finding, subsequently addressing reasons behind the observed increase in risk of the highly aggressive subtype basal-like breast cancer. Mammographic density did not display subtype heterogeneity, which is assuring for usage of mammographic density in risk prediction and prevention. Finally, although we did improve in overall accuracy over available subtype surrogacy classifiers, more work remains to improve on accuracy for defining luminal B and HER2 subtypes by IHC markers for their future use in research.

Disclosure of Potential Conflicts of Interest

No potential conflicts of interest were disclosed.

Authors' Contributions

Conception and design: J. Holm, P. Hall, K. Czene

Development of methodology: J. Holm

Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): J. Holm, M. Eriksson, P. Hall, K. Czene

Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): J. Holm, L. Eriksson, A. Ploner, M. Rantalainen, J. Li

Writing, review, and/or revision of the manuscript: J. Holm, L. Eriksson, A. Ploner, M. Eriksson, M. Rantalainen, J. Li, P. Hall, K. Czene

Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): J. Holm, K. Czene

Study supervision: K. Czene

Grant Support

The KARMA study was funded by the Märit and Hans Rausing's Initiative Against Breast Cancer. The LIBRO-1 study was funded by the Cancer Risk Prediction Center (CRisP), a Linneus Centre (contract ID 70867902) financed by the Swedish Research Council. K. Czene was supported by the Swedish Research Council (grant no. 2014-2271) and Swedish Cancer Society (grant nos. CAN 2013/469 and CAN 2016/684) and Cancer Society in Stockholm (grant no. 141092). L. Eriksson was suppored by Stockholm County Council (grant no. 20150642).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

Acknowledgments

The authors wish to thank all participants and staff in the Libro1 and Karma studies for their effort and time, and research nurses Agneta Lönn and Cecilia Arnesson with colleagues for assistance with data collection in Stockholm and Lund, respectively.

Footnotes

  • Note: Supplementary data for this article are available at Cancer Research Online (http://cancerres.aacrjournals.org/).

  • Received September 27, 2016.
  • Revision received February 5, 2017.
  • Accepted April 19, 2017.
  • ©2017 American Association for Cancer Research.

References

  1. 1.↵
    1. Perou CM,
    2. Sørlie T,
    3. Eisen MB,
    4. van de Rijn M,
    5. Jeffrey SS,
    6. Rees CA,
    7. et al.
    Molecular portraits of human breast tumours. Nature 2000;406:747–52.
    OpenUrlCrossRefPubMed
  2. 2.↵
    1. Sorlie T,
    2. Tibshirani R,
    3. Parker J,
    4. Hastie T,
    5. Marron JS,
    6. Nobel A,
    7. et al.
    Repeated observation of breast tumor subtypes in independent gene expression data sets. Proc Natl Acad Sci 2003;100:8418–23.
    OpenUrlAbstract/FREE Full Text
  3. 3.↵
    1. Toft DJ,
    2. Cryns VL
    . Minireview: basal-like breast cancer: from molecular profiles to targeted therapies. Mol Endocrinol 2011;25:199–211.
    OpenUrlCrossRefPubMed
  4. 4.↵
    1. Barnard ME,
    2. Boeke CE,
    3. Tamimi RM
    . Established breast cancer risk factors and risk of intrinsic tumor subtypes. Biochim Biophys Acta Rev Cancer 2015;1856:73–85.
    OpenUrl
  5. 5.↵
    1. Millikan RC,
    2. Newman B,
    3. Tse C-K,
    4. Moorman PG,
    5. Conway K,
    6. Dressler LG,
    7. et al.
    Epidemiology of basal-like breast cancer. Breast Cancer Res Treat 2008;109:123–39.
    OpenUrlCrossRefPubMed
  6. 6.↵
    1. Kwan ML,
    2. Kushi LH,
    3. Weltzien E,
    4. Maring B,
    5. Kutner SE,
    6. Fulton RS,
    7. et al.
    Epidemiology of breast cancer subtypes in two prospective cohort studies of breast cancer survivors. Breast Cancer Res 2009;11:R31.
    OpenUrlCrossRefPubMed
  7. 7.↵
    1. Redondo CM,
    2. Gago-Domínguez M,
    3. Ponte SM,
    4. Castelo ME,
    5. Jiang X,
    6. García AA,
    7. et al.
    Breast feeding, parity and breast cancer subtypes in a Spanish cohort. PLoS One 2012;7:e40543.
    OpenUrlCrossRefPubMed
  8. 8.↵
    1. Shinde SS,
    2. Forman MR,
    3. Kuerer HM,
    4. Yan K,
    5. Peintinger F,
    6. Hunt KK,
    7. et al.
    Higher parity and shorter breastfeeding duration: association with triple-negative phenotype of breast cancer. Cancer 2010;116:4933–43.
    OpenUrlCrossRefPubMed
  9. 9.↵
    1. Brewster AM,
    2. Chavez-MacGregor M,
    3. Brown P
    . Epidemiology, biology, and treatment of triple-negative breast cancer in women of African ancestry. Lancet Oncol 2014;15:e625–34.
    OpenUrlPubMed
  10. 10.↵
    1. Boyle P
    . Triple-negative breast cancer: epidemiological considerations and recommendations. Ann Oncol 2012;23:vi7–12.
    OpenUrlAbstract/FREE Full Text
  11. 11.↵
    1. Palmer JR,
    2. Viscidi E,
    3. Troester MA,
    4. Hong CC,
    5. Schedin P,
    6. Bethea TN,
    7. et al.
    Parity, lactation, and breast cancer subtypes in African American women: results from the AMBER Consortium. J Natl Cancer Inst 2014;106:pii: dju237.
    OpenUrl
  12. 12.↵
    1. Trivers KF,
    2. Lund MJ,
    3. Porter PL,
    4. Liff JM,
    5. Flagg EW,
    6. Coates RJ,
    7. et al.
    The epidemiology of triple-negative breast cancer, including race. Cancer Causes Control 2009;20:1071–82.
    OpenUrlCrossRefPubMed
  13. 13.↵
    1. Ambrosone CB,
    2. Zirpoli G,
    3. Ruszczyk M,
    4. Shankar J,
    5. Hong CC,
    6. McIlwain D,
    7. et al.
    Parity and breastfeeding among African-American women: differential effects on breast cancer risk by estrogen receptor status in the Women's Circle of Health Study. Cancer Causes Control 2014;25:259–65.
    OpenUrlCrossRefPubMed
  14. 14.↵
    1. Bandera EV,
    2. Chandran U,
    3. Hong C-C,
    4. Troester MA,
    5. Bethea TN,
    6. Adams-Campbell LL,
    7. et al.
    Obesity, body fat distribution, and risk of breast cancer subtypes in African American women participating in the AMBER Consortium. Breast Cancer Res Treat 2015;150:655–6.
    OpenUrlCrossRefPubMed
  15. 15.↵
    1. Holm J,
    2. Humphreys K,
    3. Li J,
    4. Ploner A,
    5. Cheddad A,
    6. Eriksson M,
    7. et al.
    Risk factors and tumor characteristics of interval cancers by mammographic density. J Clin Oncol 2015;33:1030–7.
    OpenUrlAbstract/FREE Full Text
  16. 16.↵
    1. Gabrielsson M,
    2. Eriksson M,
    3. Hammarström M,
    4. Borgquist S,
    5. Leifland K,
    6. Czene K,
    7. et al.
    Cohort profile: the Karolinska mammography project for risk prediction of breast cancer (KARMA). Int J Epidemiol. 2017 Feb 9. [Epub ahead of print].
  17. 17.↵
    1. Austin PC
    . A comparison of 12 algorithms for matching on the propensity score. Stat Med 2014;33:1057–69.
    OpenUrlCrossRefPubMed
  18. 18.↵
    1. Li J,
    2. Szekely L,
    3. Eriksson L,
    4. Heddson B,
    5. Sundbom A,
    6. Czene K,
    7. et al.
    High-throughput mammographic-density measurement: a tool for risk prediction of breast cancer. Breast Cancer Res 2012;14:R114.
    OpenUrlCrossRefPubMed
  19. 19.↵
    1. Michailidou K,
    2. Hall P,
    3. Gonzalez-Neira A,
    4. Ghoussaini M,
    5. Dennis J,
    6. Milne RL,
    7. et al.
    Large-scale genotyping identifies 41 new loci associated with breast cancer risk. Nat Genet 2013;45:353–61.
    OpenUrlCrossRefPubMed
  20. 20.↵
    1. Mavaddat N,
    2. Pharoah PDP,
    3. Michailidou K,
    4. Tyrer J,
    5. Brook MN,
    6. Bolla MK,
    7. et al.
    Prediction of breast cancer risk based on profiling with common genetic variants. J Natl Cancer Inst 2015;107:pii: djv036.
    OpenUrl
  21. 21.↵
    1. Li J,
    2. Holm J,
    3. Bergh J,
    4. Eriksson M,
    5. Darabi H,
    6. Lindström LS,
    7. et al.
    Breast cancer genetic risk profile is differentially associated with interval and screen-detected breast cancers. Ann Oncol 2015;26:517–22.
    OpenUrlAbstract/FREE Full Text
  22. 22.↵
    1. Grabau D
    . KVAST Dokument Brösttumörer; 2014. Available from: http://svfp.se/files/docs/kvast/brostpatologi/Brostcancerdokument_godkant_maj_2014.pdf.
  23. 23.↵
    1. Barlow L,
    2. Westergren K,
    3. Holmberg L,
    4. Talbäck M
    . The completeness of the Swedish Cancer Register: a sample survey for year 1998. Acta Oncol 2009;48:27–33.
    OpenUrlCrossRefPubMed
  24. 24.↵
    1. Emilsson L,
    2. Lindahl B,
    3. Köster M,
    4. Lambe M,
    5. Ludvigsson JF
    . Review of 103 Swedish healthcare quality registries. J Intern Med 2015;277:94–136.
    OpenUrlCrossRefPubMed
  25. 25.↵
    1. Ludvigsson JF,
    2. Otterblad-Olausson P,
    3. Pettersson BU,
    4. Ekbom A
    . The Swedish personal identity number: possibilities and pitfalls in healthcare and medical research. Eur J Epidemiol 2009;24:659–67.
    OpenUrlCrossRefPubMed
  26. 26.↵
    1. Kuhn M
    . Caret: classification and regression training. R Package version; 2015. Available from: http://cran.r-project.org/package=caret.
  27. 27.↵
    1. Wang M,
    2. Klevebring D,
    3. Lindberg J,
    4. Czene K,
    5. Grönberg H,
    6. Rantalainen M
    . Determining breast cancer histological grade from RNA-sequencing data. Breast Cancer Res 2016;18:48.
    OpenUrl
  28. 28.↵
    1. Goldhirsch A,
    2. Winer EP,
    3. Coates AS,
    4. Gelber RD,
    5. Piccart-Gebhart M,
    6. Thürlimann B,
    7. et al.
    Personalizing the treatment of women with early breast cancer: highlights of the St Gallen International Expert Consensus on the primary therapy of early breast cancer 2013. Ann Oncol 2013;24:2206–23.
    OpenUrlAbstract/FREE Full Text
  29. 29.↵
    R Core Team. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2015. Available from: https://www.r-project.org/.
  30. 30.↵
    1. Foulkes WD
    . BRCA1 functions as a breast stem cell regulator. J Med Genet 2004;41:1–5.
    OpenUrlAbstract/FREE Full Text
  31. 31.↵
    1. Burga LN,
    2. Tung NM,
    3. Troyan SL,
    4. Bostina M,
    5. Konstantinopoulos PA,
    6. Fountzilas H,
    7. et al.
    Altered proliferation and differentiation properties of primary mammary epithelial cells from BRCA1 mutation carriers. Cancer Res 2009;69:1273–8.
    OpenUrlAbstract/FREE Full Text
  32. 32.↵
    1. Furuta S,
    2. Jiang X,
    3. Gu B,
    4. Cheng E,
    5. Chen P-L,
    6. Lee W-H
    . Depletion of BRCA1 impairs differentiation but enhances proliferation of mammary epithelial cells. Proc Natl Acad Sci U S A 2005;102:9176–81.
    OpenUrlAbstract/FREE Full Text
  33. 33.↵
    1. Liu S,
    2. Ginestier C,
    3. Charafe-Jauffret E,
    4. Foco H,
    5. Kleer CG,
    6. Merajver SD,
    7. et al.
    BRCA1 regulates human mammary stem/progenitor cell fate. Proc Natl Acad Sci 2008;105:1680–5.
    OpenUrlAbstract/FREE Full Text
  34. 34.↵
    1. Molyneux G,
    2. Smalley MJ
    . The cell of origin of BRCA1 mutation-associated breast cancer: a cautionary tale of gene expression profiling. J Mammary Gland Biol Neoplasia 2011;16:51–5.
    OpenUrlCrossRefPubMed
  35. 35.↵
    1. Lim E,
    2. Vaillant F,
    3. Wu D,
    4. Forrest NC,
    5. Pal B,
    6. Hart AH,
    7. et al.
    Aberrant luminal progenitors as the candidate target population for basal tumor development in BRCA1 mutation carriers. Nat Med 2009;15:907–13.
    OpenUrlCrossRefPubMed
  36. 36.↵
    1. Figueroa JD,
    2. Garcia-Closas M,
    3. Humphreys M,
    4. Platte R,
    5. Hopper JL,
    6. Southey MC,
    7. et al.
    Associations of common variants at 1p11.2 and 14q24.1 (RAD51l1) with breast cancer risk and heterogeneity by tumor subtype: findings from the breast cancer association consortium. Hum Mol Genet 2011;20:4693–706.
    OpenUrlAbstract/FREE Full Text
  37. 37.↵
    1. Liang H,
    2. Yang X,
    3. Chen L,
    4. Li H,
    5. Zhu A,
    6. Sun M,
    7. et al.
    Heterogeneity of breast cancer associations with common genetic variants in FGFR2 according to the intrinsic subtypes in Southern han Chinese women. Biomed Res Int 2015;2015:626948.
    OpenUrl
  38. 38.↵
    1. O'Brien KM,
    2. Cole SR,
    3. Engel LS,
    4. Bensen JT,
    5. Poole C,
    6. Herring AH,
    7. et al.
    Breast cancer subtypes and previously established genetic risk factors: a Bayesian approach. Cancer Epidemiol Biomarkers Prev 2014;23:84–97.
    OpenUrlAbstract/FREE Full Text
  39. 39.↵
    1. Purrington KS,
    2. Slager S,
    3. Eccles D,
    4. Yannoukakos D,
    5. Fasching PA,
    6. Miron P,
    7. et al.
    Genome-wide association study identifies 25 known breast cancer susceptibility loci as risk factors for triple-negative breast cancer. Carcinogenesis 2014;35:1012–9.
    OpenUrlAbstract/FREE Full Text
  40. 40.↵
    1. Russo J,
    2. Moral R,
    3. Balogh GA,
    4. Mailo D,
    5. Russo IH
    . The protective role of pregnancy in breast cancer. Breast Cancer Res 2005;7:131–42.
    OpenUrlCrossRefPubMed
  41. 41.↵
    1. Crowley WR
    . Neuroendocrine regulation of lactation and milk production. In: Comprehensive physiology. Hoboken, NJ: John Wiley & Sons, Inc.; 2014. p.255–91.
  42. 42.↵
    1. Liu F,
    2. Pawliwec A,
    3. Feng Z,
    4. Yasruel Z,
    5. Lebrun J-J,
    6. Ali S
    . Prolactin/Jak2 directs apical/basal polarization and luminal linage maturation of mammary epithelial cells through regulation of the Erk1/2 pathway. Stem Cell Res 2015;15:376–83.
    OpenUrl
  43. 43.↵
    1. Jernström H,
    2. Lubinski J,
    3. Lynch HT,
    4. Ghadirian P,
    5. Neuhausen S,
    6. Isaacs C,
    7. et al.
    Breast-feeding and the risk of breast cancer in BRCA1 and BRCA2 mutation carriers. J Natl Cancer Inst 2004;96:1094–8.
    OpenUrlAbstract/FREE Full Text
  44. 44.↵
    1. Gronwald J,
    2. Byrski T,
    3. Huzarski T,
    4. Cybulski C,
    5. Sun P,
    6. Tulman A,
    7. et al.
    Influence of selected lifestyle factors on breast and ovarian cancer risk in BRCA1 mutation carriers from Poland. Breast Cancer Res Treat 2006;95:105–9.
    OpenUrlCrossRefPubMed
View Abstract
PreviousNext
Back to top
Cancer Research: 77 (13)
July 2017
Volume 77, Issue 13
  • Table of Contents
  • Table of Contents (PDF)
  • About the Cover
  • Index by Author
  • Editorial Board (PDF)

Sign up for alerts

View this article with LENS

Open full page PDF
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for sharing this Cancer Research article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Assessment of Breast Cancer Risk Factors Reveals Subtype Heterogeneity
(Your Name) has forwarded a page to you from Cancer Research
(Your Name) thought you would be interested in this article in Cancer Research.
Citation Tools
Assessment of Breast Cancer Risk Factors Reveals Subtype Heterogeneity
Johanna Holm, Louise Eriksson, Alexander Ploner, Mikael Eriksson, Mattias Rantalainen, Jingmei Li, Per Hall and Kamila Czene
Cancer Res July 1 2017 (77) (13) 3708-3717; DOI: 10.1158/0008-5472.CAN-16-2574

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Share
Assessment of Breast Cancer Risk Factors Reveals Subtype Heterogeneity
Johanna Holm, Louise Eriksson, Alexander Ploner, Mikael Eriksson, Mattias Rantalainen, Jingmei Li, Per Hall and Kamila Czene
Cancer Res July 1 2017 (77) (13) 3708-3717; DOI: 10.1158/0008-5472.CAN-16-2574
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • Introduction
    • Patients and Methods
    • Results
    • Discussion
    • Disclosure of Potential Conflicts of Interest
    • Authors' Contributions
    • Grant Support
    • Acknowledgments
    • Footnotes
    • References
  • Figures & Data
  • Info & Metrics
  • PDF
Advertisement

Related Articles

Cited By...

More in this TOC Section

  • AIAN Cancer Survival
  • Oral Microbiome and Esophageal Cancer
  • Diabetes and Breast Cancer in Black Women
Show more Prevention and Epidemiology
  • Home
  • Alerts
  • Feedback
  • Privacy Policy
Facebook  Twitter  LinkedIn  YouTube  RSS

Articles

  • Online First
  • Current Issue
  • Past Issues
  • Meeting Abstracts

Info for

  • Authors
  • Subscribers
  • Advertisers
  • Librarians
  • Reviewers

About Cancer Research

  • About the Journal
  • Editorial Board
  • Permissions
  • Submit a Manuscript
AACR logo

Copyright © 2019 by the American Association for Cancer Research.

Cancer Research Online ISSN: 1538-7445
Cancer Research Print ISSN: 0008-5472
Journal of Cancer Research ISSN: 0099-7013
American Journal of Cancer ISSN: 0099-7374

Advertisement