Abstract
Subtype heterogeneity for breast cancer risk factors has been suspected, potentially reflecting etiologic differences and implicating risk prediction. However, reports are conflicting regarding the presence of heterogeneity for many exposures. To examine subtype heterogeneity across known breast cancer risk factors, we conducted a case–control analysis of 2,632 breast cancers and 15,945 controls in Sweden. Molecular subtype was predicted from pathology record–derived IHC markers by a classifier trained on PAM50 subtyping. Multinomial logistic regression estimated separate ORs for each subtype by the exposures parity, age at first birth, breastfeeding, menarche, hormone replacement therapy use, somatotype at age 18, benign breast disease, mammographic density, polygenic risk score, family history of breast cancer, and BRCA mutations. We found clear subtype heterogeneity for genetic factors and breastfeeding. Polygenic risk score was associated with all subtypes except for the basal-like (Pheterogeneity < 0.0001). “Never breastfeeding” was associated with increased risk of basal-like subtype [OR 4.17; 95% confidence interval (CI) 1.89–9.21] compared with both nulliparity (reference) and breastfeeding. Breastfeeding was not associated with risk of HER2-overexpressing type, but protective for all other subtypes. The observed heterogeneity in risk of distinct breast cancer subtypes for germline variants supports heterogeneity in etiology and has implications for their use in risk prediction. The association between basal-like subtype and breastfeeding merits more research into potential causal mechanisms and confounders. Cancer Res; 77(13); 3708–17. ©2017 AACR.
Introduction
Breast cancer is a molecularly diverse disease. At least four subtypes have been robustly established following gene expression–based characterization in the early 2000s (1, 2). These subtypes (Luminal A, luminal B, HER2-overexpressing, and basal-like) behave differently in terms of age at onset and prognosis, but the question remains to what extent they also represent etiologically distinct cancers and differ in risk factors. Recently analysis of recurrent and metastasizing patient samples has shown that breast cancer can drift in molecular subtype throughout disease progression (3–6). However, it has repeatedly been shown that in situ cancers display the full spectrum of molecular subtypes before any signs of invasiveness (7–11), pointing toward early determination of the cancer's nature. Presence of subtype heterogeneity for breast cancer risk factors could indicate separate etiology for one or several subtypes of breast cancer, and would also implicate clinical efforts in risk prediction and prevention.
Studies addressing subtype heterogeneity of risk factors have so far employed various IHC marker combinations as proxy for the gene expression–based classifications, which limits precision of findings. Because of the majority of cases in study populations being luminal, one could expect known risk factors to be biased toward predicting luminal breast cancers. A few consistent observations have been made: compared with the most prevalent subtype Luminal A, basal-like breast cancer is associated with younger age at onset, presence of BRCA1 mutations, and recent African heritage (3). However, results are less consistent for lifestyle and reproductive risk factors such as hormone replacement therapy (HRT) use, parity, age at first birth, and age at menarche (4). Most consistent is a stronger protective effect of breastfeeding on the basal-like subtype among parous women (5–8). Intriguingly, although parity has been consistently associated with decreased risk of luminal subtypes, results vary from a decreased to an increased risk of the basal-like subtype (4). Very little is known about risk factors for the luminal B and HER2-overexpressing types, related to difficulties in finding consistent IHC proxies to represent them (4). Moreover, because the relative proportions of subtypes vary across ethnicities, genetic variation has been put forward as explaining variation in subtype incidence (9, 10), but could also be explained by environmental, lifestyle, and reproductive differences (11–14). Germline genetic risk factors beyond BRCA mutations should therefore also be included in studies of subtype heterogeneity.
Our goal was to determine whether differences in breast cancer risk factors are related to breast cancer molecular subtypes, addressing subtype heterogeneity of risk factors in a Swedish study population. To improve precision in classification of the outcome, we made use of a subset of our data where PAM50 subtyping was available and predicted subtype on the full dataset from these, as well as used a previously established IHC proxy to define subtypes to assess the robustness of any findings.
Patients and Methods
Setting
This is a case–control study based on two Swedish breast cancer cohorts. The study was conducted in accordance with the Declaration of Helsinki. Ethical approvals were granted from the regional ethical vetting board and all participants gave written informed consent.
Participants
Participants were recruited from the the KARolinska MAmmography Project for Risk Prediction of Breast Cancer (KARMA) and Libro-1study cohorts (15, 16). KARMA is a prospective cohort study of 70,877 women with or without breast cancer, recruited in 2011–2013, from four mammography units situated in Skåne county and Stockholm conducting both population-based mammography screening and clinical mammography. Libro1 is a case-only, population-based cohort consisting of 5,715 women diagnosed with breast cancer in Stockholm years 2001–2008. Women in both studies answered questionnaires, donated blood at enrollment, and consented to the retrieval of their mammograms and medical records. Questionnaires and study material were largely similar for both studies as Libro-1 was the pilot study of KARMA.
All primary invasive breast cancer cases from both studies diagnosed 2005 to 2015 were eligible for inclusion (n = 4,598). The cutoff at 2005 was chosen as IHC markers HER2 and Ki67 were not stained for, and thus not available in medical records, prior to this year. Exclusion criteria were missed information on either of the IHC markers ER, PR, HER2, and Ki67 (n = 1,265). Controls were randomly selected among breast cancer–free participants of the KARMA study, frequency-matched up to 1:5 to cases on age at enrollment (controls) to age at diagnosis (cases) in 5-year strata using a greedy nearest neighbor algorithm without replacement (17). The final study sample consisted of 2,632 breast cancers and 15,945 controls. For the analysis of the polygenic risk score, all KARMA women who did not have breast cancer and who had available genotyping (n = 5,425) were used as controls.
Data on exposures
Data on parity, age at first birth, breastfeeding, HRT use, somatotype at age 18, menarche, weight and height at enrollment, BRCA1/2 mutation carriership, country of birth, education and family history of breast cancer was collected from web- and paper study questionnaires answered at study enrollment, with all variables harmonized between the two studies prior to analysis. Recall of HRT use was aided by pictures of HRT brands dispensed in Sweden. Somatotypes were illustrated by 9 pictograms in the questionnaires and women were asked which pictogram most resembled them at age 18. As BRCA mutation status was self-reported and thus more likely to be known for cases, only data on BRCA status from cases were considered for this study.
History of benign breast disease (BBD) was obtained from pathology records. Only BBD diagnoses at least one year prior to breast cancer for cases, or before 2013 for the controls, were included in analysis. Mammograms were collected from radiology departments and available for 90% of the study population. Mammographic density (MD) was measured using an automated method previously described (15, 18). Image pairs from a reading within 4 years prior to diagnosis or study enrollment were selected for cases and controls, respectively, and absolute MD from the left and right mediolateral oblique view were averaged. Data on SNP markers was obtained from blood donated by participants at enrollment, which had been genotyped on a custom Illumina iSelect Array (iCOGS Array; ref. 19). Missing genotypes were imputed using 1000 Genomes [phase I integrated variant set release (v3) in National Center for Biotechnology Information build 37 (hg19) coordinates]. Polygenic risk scores (PRS) were constructed using 77 breast cancer risk SNPs discovered in large consortia studies. The score for each patient was calculated by summing the number of alleles for each SNP (0, 1 or 2), weighted by the per-allele ORs for the minor alleles reported by Mavvadat and colleagues (20). Two scores were generated, one general score using breast cancer risk ORs as weights and the other score using weights from associations to ER-negative breast cancer only. Full details on the construction of the PRS were described previously (21).
Data on tumor characteristics (cases only)
Data on molecular markers were retrieved in 2015–2016 from medical and pathology records at treating hospitals. Percent estrogen receptor (ER) and progesterone receptor (PR) staining was dichotomized into positive or negative status with a cutoff at ≥10% as positive during this period. HER2 status was dichotomized into positive or negative according to the Swedish Society of Pathology's guidelines (22): HER2 was considered negative if protein expression showed 0 or 1+, or was higher with no confirmed gene amplification by FISH, and positive if FISH showed gene amplification. Proliferation marker Ki67 was measured in hotspot regions according to contemporary guidelines (22) and reported as percent staining. Information on tumor invasiveness and prior breast cancer diagnoses was obtained through merges to the Swedish National Cancer Register (23) and the Regional Breast Cancer Quality Register (24) using the unique, Swedish personal identity numbers (25).
Outcome classification
A random forest algorithm was used to construct a subtype classifier using the caret R package (v. 6.0.58; ref. 26). As part of the Clinical Sequencing of Cancer in Sweden (Clinseq) project, 237 of the cases had had tumors RNA-sequenced and assigned into PAM-50 molecular subtypes as described previously (27). The algorithm was trained to predict subtype on the subset of the data with PAM50 subtype information available (“training data”, n = 237). Binary ER, PR, HER2, continuous Ki67 and age at diagnosis were entered as input and the algorithm run with five times repeated 10-fold cross validation to avoid overfitting. Accuracy and kappa values were used to select the best performing algorithm. The resulting best algorithm (hereafter denoted “classifier”) then assigned PAM50 subtype to all remaining cases based on their age, ER, PR, HER2, and Ki67 status.
As a sensitivity analysis, the St. Gallen method of using IHC markers as a proxy for gene expression–based subtyping (28) was used to assign subtype, with a modified cutoff for Ki67 of 25 % instead of 14% due to the lack of whole-slide percent for Ki67. This proxy defines luminal A as ER+/PR+/HER2− and KI67low, luminal B as either ER+/PR−/HER2−, or ER+/PR+/HER2−, KI67high, or ER+/HER2+/any PR, any KI67. HER2-overexpressing is defined as ER−/PR−/HER2+, and basal-like defined as ER−/PR−/HER2− (triple negative).
To assess the performance of our classifier and the St. Gallen proxy, we obtained accuracy and kappa statistics for both methods as compared against gene expression–based PAM50 subtyping, by resampling from the 237 observations with PAM50 data (function “resamples” from the caret package in R). Resampled values were necessary to avoid over-fitted, over-optimistic statistics for the classifier, which had been trained on the same data. Confusion Matrices of actual verses predicted subtype were tabulated for both methods, in the training dataset.
Statistical analysis
Multivariable regression.
In multivariable regression analysis, breast tumors of different subtypes were considered as separate outcomes and their respective risks were modeled relative to healthy controls via multinomial logistic regression. For simplicity, we refer to the resulting relative RRs as ORs. Heterogeneity in ORs was formally assessed with a global Wald test, testing the null hypothesis that the risk associated with the exposure was the same across all subtypes. In addition, multinomial logistic regression was performed in a case-only design using luminal A cases as the reference group. For the sake of comparison to subtype-specific ORs, ORs comparing all breast cancer cases to controls were obtained using unconditional logistic regression. All analyses were minimally adjusted for age (matching variable), education level (<10 years, 10–12 years, university, other) and country of birth (binary, Sweden/other), further covariates were included as potential confounders in the models based on subject matter knowledge. The sets of covariates included in each of the fully adjusted models are stated in the respective tables. Only case-only analysis was performed pertaining to exposure BRCA mutation status.
Exposure parameterizations.
First-degree family history of breast cancer was modeled as a binary variable, defined as having a mother or sister with breast cancer, yes/no. Continuous variables for polygenic risk scores were scaled prior to modeling and modeled per 1-standard deviation (SD) increase. Both PRSs were additionally modeled as categorical variables, cut into quartiles of the scores. Parity was modeled as a categorical (0, 1–2, >2 children) and continuous variable. Age at first birth was dichotomized into < 30 and ≥30 years of age, restricting analysis to parous women. Breastfeeding was categorized into 0, >0–1.5 years, and >1.5 years, after summarizing the total length of breastfeeding across all children. Breastfeeding was also assessed as a composite variable including parity, by using nulliparous women as reference group. HRT use was modeled as “Ever”/“Never” use. Women who reported use of locally administered HRT exclusively were coded as never users. Somatotype was modeled as a semicontinuous variable by assigning integers to each somatotype 1–9 from lowest to highest adiposity. Menarche was modeled per year's delay as a continuous variable. Mammographic density was scaled prior to modeling and assessed per-1 SD increase. Benign breast disease was separated into nonproliferative and proliferative, nonatypical lesions and modeled as binary variables of “ever”/“never” diagnosed. Atypical proliferative lesions were not included in analysis due to insufficient numbers.
All statistical tests were two-sided with a predetermined cutoff for statistical significance at α= 0.05. Software R (29) v.3.2.2 was used for all statistical analysis.
Results
Outcome classification
The average resampled accuracy and kappa values were 0.73 and 0.55 for the random forest classifier, and 0.64 and 0.46 for the St. Gallen IHC proxy. For all main tables, the random forest classifier was used for outcome classification. The mode of IHC marker combinations for cases classified as luminal A or B was ER+/PR+/HER2−, observed for 80% and 54%, respectively, for HER2-overexpressing ER−/PR−/HER2+, observed for 45%, and for basal-like triple-negative for all markers, observed for 85%. Average percentage Ki67 staining was 14% among luminal A tumors, 46% among luminal B tumors, 36% among HER2-overexpressing tumors, and 69% among basal-like tumors (Table 1). Confusion matrixes of true versus predicted subtypes revealed that both the classifier and the St. Gallen IHC proxy were good at capturing luminal A and basal-like status, but performed worse for luminal B and HER2-overexpressing tumors (Supplementary Table S1).
Demographics and covariates by case status and breast cancer subtype
Descriptive cross-tabulations
Table 1 shows crude descriptive contingency tables of cases verses controls, and cases by subtypes for adjusting variables and selected exposures of interest. Cases tended to be more highly educated, more often born abroad, had a higher frequency of family history of breast cancer and had breastfed less than controls. Within subtypes, variations in age, BRCA mutations, and breastfeeding were observed. Although age ranges were similar across subtypes, luminal A cases were on average older than the other categories (59 years) and basal-like cases were youngest at diagnosis (52 years; Table 1).
Adjusted case–control ORs
Multivariable regression analysis of genetic background risk factors for each subtype is shown in Table 2. The general 77-SNP PRS was associated with all subtypes except for the basal-like subtype, and most strongly associated with the luminal A subtype. The Wald test showed significant heterogeneity across subtype odds ratios (Pheterogeneity <0.0001). In contrast to the general PRS, the PRS weighted on associations to ER-negative breast cancer yielded no evidence of heterogeneity of effect across subtypes (Pheterogeneity = 0.43, Table 2). There was no evidence of heterogeneity for first-degree family history across breast cancer subtypes (Table 2).
Risk of breast cancer, overall and by subtype, by genetic background
Multivariable regression analysis of reproductive factors by subtypes is shown in Table 3. There was no statistical evidence of heterogeneity between subtypes for parity; however, judging from point estimates, parity had a protective effect on all subtypes except for basal-like subtype. Similarly, in analysis of parous women, no heterogeneity was observed for age at first birth (Table 3). The effects of breastfeeding were heterogeneous across subtypes (Pheterogeneity = 0.01). With nulliparous women as reference group, parous women who never breastfed had an increased risk of basal-like breast cancer (OR 4.17; 95 % CI, 1.89–9.21). Breastfeeding returned the risk of basal-like breast cancer to that of nulliparous women (OR breastfeeding <1.5 years 1.02; 95% CI, 0.59–1.76), OR breastfeeding ≥ 1.5 years 0.81; 95% CI, 0.43–1.60; Table 3). In contrast, the risk of developing luminal A breast cancer was no different from nulliparous women for parous women never breastfeeding (OR 1.01; 95% CI, 0.74–1.39), but a protective effect was seen for breastfeeding (breastfeeding <1.5 years, OR 0.69; 95% CI, 0.59–0.82), breastfeeding ≥ 1.5 years, OR 0.63; 95% CI, 0.52–0.76). Luminal B showed point estimates similar to those of luminal A, whereas HER2-overexpressing type was unaffected by breastfeeding (Table 3).
Risk of breast cancer overall and by subtype, by reproductive risk factors
Multivariable analysis of lifestyle and nonreproductive hormonal factors by subtypes is shown in Table 4. Ever use of HRT showed heterogeneity (Pheterogeneity = 0.05), with increased risk for the luminal A (ever HRT use, OR 1.43; 95% CI, 1.28–1.61) but no effect among the other subtypes (Table 4). Point estimates for age at menarche, somatotype at age 18, and benign breast disease also suggested differences by subtype but no statistically significant heterogeneity was observed. Still, for age at menarche, a protective effect of increasing age was observed for the luminal A and B subtypes alone. Increasingly endomorph somatotype at age 18 was associated with a protective effect for the luminal and HER2 subtypes, whereas basal-like appeared null-associated. Proliferative nonatypical lesions were associated with luminal A cancers but showed similar estimates for other subtypes except luminal B, whereas nonproliferative lesions were null-associated with all subtypes except for a possible increase for the HER2-subtype. There was no evidence of subtype heterogeneity for mammographic density, with an increased risk for all subtypes of disease with increasing density (Table 4).
Risk for breast cancer overall and by subtype: ever hormone replacement therapy (HRT use), age at menarche, somatotype at age 18, mammographic density, benign breast disease
Summary of findings for risk factors that displayed subtype heterogeneity are shown graphically as forest plots in Figure 1, separately by subtype. Luminal A and B showed very similar estimates, whereas the basal-like subtype displayed distinct features (Fig. 1).
Forest plots summarizing observed heterogeneity of results for exposures breastfeeding, PRS, and HRT use across the subtypes. BF, breastfeeding; PRS, polygenic risk score; Q1–4, quartiles 1 to 4 of the PRS.
All analyses were repeated within a case-only design with luminal A as reference, yielding the same conclusions regarding heterogeneity for PRS, breastfeeding, and HRT use as the case–control analysis (Supplementary Tables S2–S4). Case-only analysis of self-reported BRCA mutations among cases showed that basal-like tumors had higher prevalence of BRCA mutations than other subtypes, with an OR of 11.31 (95% CI, 5.37–23.07) relative to luminal A tumors (Supplementary Table S2).
Sensitivity analysis
All findings were replicated albeit attenuated when using the St. Gallen IHC proxy to define subtypes, except for the heterogeneity found for HRT use, which was not observed (Supplementary Tables S5–S7).
Discussion
Analysis of 2,632 breast cancer cases revealed evidence of subtype heterogeneity for three categories of risk factors: genetic susceptibility, HRT use, and breastfeeding. The 77-SNP PRS was exclusively associated with risk of non-basal–like subtypes, with the largest effect size for luminal A breast cancers. HRT use was associated with risk of the luminal A subtype only. Compared with nulliparous women, never breastfeeding was associated with an increased risk of basal-like breast cancer but not with risk of the other subtypes. Among parous women, breastfeeding was protective for the luminal A, B, and basal-like subtypes but null-associated with the HER2-overexpressing subtype. Luminal A and B breast cancers were very similar in associations with most risk factors.
Both germline BRCA mutations (Supplementary Table S2) and PRS (Table 2) were differentially associated with subtypes, suggesting that in addition to the previously observed heterogeneity for BRCA1 mutations (1, 2), inherited low-risk variants could also differentially increase risk of specific breast cancer subtypes. BRCA1 mutations have been shown experimentally to result in accumulation of undifferentiated luminal progenitor cells (30–33), the suspected cell-of-origin of basal-like breast cancer (34, 35), but it is not known how or if low-risk SNPs could represent an etiologic difference between subtypes. Individual SNPs have been found to be associated with either subtype or ER status (36–39), supporting our finding for the PRS. The ER-negative weighted PRS was associated with basal-like breast cancer in our study thus showing potential as a complement or replacement to the overall PRS score for identification of women at risk of this aggressive disease. Collectively, these observations also indicate a role of germline variants beyond BRCA in distinct etiology of molecular subtypes, which should be further investigated.
Our results confirm previous reports of the largest protective effect of breastfeeding on risk of basal-like breast cancer (5–8). We additionally show that the protective effect for basal-like breast cancer stems from never-breastfeeding parous women having higher risk of the basal-like subtype than both nulliparous and breastfeeding women. The reason behind this increased risk, should it be causal, is not known. As basal-like cancers are thought to originate from undifferentiated luminal progenitor cells (33–34), the association may be related to higher numbers of progenitor cells in the absence of breastfeeding. Fully differentiated type 4 lobules do not form until the end of pregnancy and during lactation under the influence of prolactin (40), and prolactin, released throughout lactation (41), has recently been identified as a central promotor of luminal progenitor cell maturation in vitro (42). This hypothesis would agree with observations of BRCA1 mutations resulting in accumulation of luminal progenitor cells (30–33) and the high proportion of basal-like breast cancer among BRCA1 mutation carriers. Epidemiologic studies have additionally shown that BRCA1 carriers who breastfeed are more protected from developing breast cancer (43, 44). However, future studies should definitely consider possible confounding by lifestyle factors, preferably including data on reasons for not breastfeeding. The lack of association with breastfeeding for the HER2-overexpressing subtype distinguishes it from the other subtypes and merits further investigation, preferably with gene expression–based subtype definitions.
Point estimates for somatotype, menarche, age at first birth, parity, and benign breast disease suggested heterogeneity but were not statistically different. We saw a protective effect of parity on all subtypes except for a null association of risk of basal-like breast cancer with parity, after adjusting for age at first birth and breastfeeding. This is in line with some of the published literature, whereas others have reported increased risks of the basal-like subtype with increasing parity. Differences may be partly due to underlying variations in prevalence of breastfeeding.
We found no evidence of subtype heterogeneity for mammographic density, or with ER-negative weighted PRS. These are important messages for prevention efforts using such exposures in risk prediction, as they would be expected to identify women at risk of breast cancer independent of subtype. Mammographic density is an especially promising tool to predict risk, as it is also among the stronger risk factors for the disease.
All results were robust albeit sometimes attenuated in sensitivity analysis using the St. Gallen IHC proxy to define subtypes (Supplementary Tables S5–S7). The only exception was seen for HRT use, which did not display subtype heterogeneity using the St. Gallen IHC proxy. Future studies should ideally address this question using true gene expression–based subtype data, but our results suggest heterogeneity in subtype risk for HRT use.
A limitation of our study is that we only had true PAM50 subtype information for 237 of the cases and predicted subtype for the rest. As the assigned subtypes were largely latent for true subtype status, model estimates should be interpreted with caution. Although the overall accuracy of our classifier was higher compared with the St. Gallen IHC proxy for defining subtypes, it still performed poorly in classifying luminal B and HER2 types and our results should be interpreted in light of this.
The retrospective nature of this study introduces some elements of caution in interpretation. Cases may recollect exposures differently than controls, but there is little reason to believe such differences would vary greatly by subtype. Controls were only available from one of the cohorts, which potentially could introduce bias. Reassuring is that the case–control ORs for generic breast cancer exhibited the expected strengths and directions of associations. Moreover, case-only analysis yielded the same conclusions regarding heterogeneity as case–control analysis. Strengths of the current work include the large cohort, the use of a subtype classifier with improved accuracy compared with a previously used IHC-based method, and the availability of genetic, lifestyle, and reproductive exposures comprehensively assessed in the same study.
In conclusion, both rare and common inherited gene variants displayed subtype heterogeneity primarily between basal-like and non-basal–like subtypes, suggesting separate etiology and implications for risk prediction. We additionally found subtype heterogeneity by breastfeeding status. Relative to nulliparous women, women who did not breastfeed postpartum were exclusively at an increased risk of basal-like breast cancer. Future research is needed to confirm or refute this finding, subsequently addressing reasons behind the observed increase in risk of the highly aggressive subtype basal-like breast cancer. Mammographic density did not display subtype heterogeneity, which is assuring for usage of mammographic density in risk prediction and prevention. Finally, although we did improve in overall accuracy over available subtype surrogacy classifiers, more work remains to improve on accuracy for defining luminal B and HER2 subtypes by IHC markers for their future use in research.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed.
Authors' Contributions
Conception and design: J. Holm, P. Hall, K. Czene
Development of methodology: J. Holm
Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): J. Holm, M. Eriksson, P. Hall, K. Czene
Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): J. Holm, L. Eriksson, A. Ploner, M. Rantalainen, J. Li
Writing, review, and/or revision of the manuscript: J. Holm, L. Eriksson, A. Ploner, M. Eriksson, M. Rantalainen, J. Li, P. Hall, K. Czene
Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): J. Holm, K. Czene
Study supervision: K. Czene
Grant Support
The KARMA study was funded by the Märit and Hans Rausing's Initiative Against Breast Cancer. The LIBRO-1 study was funded by the Cancer Risk Prediction Center (CRisP), a Linneus Centre (contract ID 70867902) financed by the Swedish Research Council. K. Czene was supported by the Swedish Research Council (grant no. 2014-2271) and Swedish Cancer Society (grant nos. CAN 2013/469 and CAN 2016/684) and Cancer Society in Stockholm (grant no. 141092). L. Eriksson was suppored by Stockholm County Council (grant no. 20150642).
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Acknowledgments
The authors wish to thank all participants and staff in the Libro1 and Karma studies for their effort and time, and research nurses Agneta Lönn and Cecilia Arnesson with colleagues for assistance with data collection in Stockholm and Lund, respectively.
Footnotes
Note: Supplementary data for this article are available at Cancer Research Online (http://cancerres.aacrjournals.org/).
- Received September 27, 2016.
- Revision received February 5, 2017.
- Accepted April 19, 2017.
- ©2017 American Association for Cancer Research.













