Abstract
Background/Aim: Epidermal growth factor receptor (EGFR), mothers against decapentaplegic homolog 7 (SMAD7) and transforming growth factor betta (TGFB) are crucial for colorectal cancer (CRC) tumorigenesis. This study investigated whether polymorphisms in EGFR, SMAD7, and TGFB are associated with CRC risk in patients with Lynch syndrome. Materials and Methods: Genotyping was performed using Sequenom iPLEX MassArray. Association between genetic polymorphisms and CRC was assessed using a weighted Cox proportional hazard model. Results: Patients carrying the AA genotype of EGFR rs2227983 had a significantly higher CRC risk than those carrying the G allele (HR=2.55, 95% CI=1.25-5.17). The dominant model of SMAD7 rs12953717 (CT + TT genotypes) significantly increased CRC risk (HR=2.17, 95% CI=1.12-4.16) when compared to the wild-type CC genotype. Similarly, the GG genotype of TGFBR2 rs6785358 significantly increased the risk of CRC (HR=21.1, 95% CI=5.06-88.1) compared to the AA genotype. Conclusion: EGFR, SMAD7, and TGFBR2 are associated with CRC risk in patients with Lynch syndrome.
Lynch syndrome is a cancer predisposition disorder caused by a germline mutation in one of the mismatch repair (MMR) genes (1). MMR genes encode proteins that prevent both mutation and cancer development (2). Loss of function in MMR proteins usually results in error-prone DNA replication and microsatellite instability (MSI) (3). Approximately 1 in 3,140 individuals harbors a germline mutation in MLH1 or MSH2 and are at a higher risk of colorectal cancer (CRC) and other cancers than the general population (4, 5). Moreover, these patients exhibit an earlier onset of CRC and other cancers compared to the general population (6, 7).
Lichtenstein et al. estimated that 35% of inherited genetic factors are associated with CRC susceptibility (8). One of these genetic factors is the transforming growth factor beta (TGFB), whose mutations have been identified in colorectal tumors with MSI (9), a feature of Lynch syndrome. The TGFB signaling pathway is mediated via the TGFB receptor (TGFBR) or mothers against decapentaplegic homolog 7 (SMAD7) (9). Epidermal growth factor receptor (EGFR) inhibits TGFB signaling pathway through SMAD7 (10). Studies have reported that single nucleotide polymorphisms (SNPs) in EGFR, SMAD7, and TGFB are associated with cancer predisposition (11-13). More specifically, EGFR is involved in angiogenesis, metastasis, tumor invasion, and survival; TGFB plays a crucial role in cell proliferation, differentiation, and apoptosis, whereas SMAD7 is an inhibitor of TGFB and other SMAD family members (14-16). Prior studies have indicated that TGFBR2 rs3087465, EGFR rs2227983, and SMAD7 rs12953717 SNPs are associated with CRC risk (17-20). By contrast, other studies have found non-significant associations between EGFR rs2227983, SMAD7 rs2337104, and SMAD7 rs12953717 SNPs and CRC (21-23).
CRC is a heterogeneous disease caused by numerous biological interactions of different signaling pathways. Therefore, as part of an initiative to identify genetic factors associated with CRC susceptibility in patients with Lynch syndrome, this study investigated whether EGFR rs2227983, SMAD7 (rs12953717 and rs2337104), TGFB1 (rs1800468 and rs1800489), TGFBR1 rs334354, and TGFBR2 (rs3087465 and rs6785358) SNPs are associated with CRC susceptibility in patients with Lynch syndrome in Taiwan.
Materials and Methods
Study population. Using the Amsterdam II criteria, patients suspected of harboring a germline mutation in MMR genes were recruited from seven hospitals located throughout Taiwan. These patients were enrolled into the Amsterdam criteria family registry as described by our previous studies (24-26). A total of 1,014 probands and their relatives were recruited from 135 hereditary nonpolyposis colorectal cancer (HNPCC) families (patients that meet the Amsterdam criteria, but are not genetically tested for germline mutations). The protocol of this study was approved by the Taipei Medical University Institutional Review Board and also the Taiwan National Health Research Institute.
Informed consent was obtained from all patients and genetic analyses were performed in all patients who met the Amsterdam criteria. Of the 1014 HNPCC patients who met the Amsterdam II criteria, 303 were identified as harboring MLH1 or MSH2 germline mutation; the details of these are available in our previous studies (25, 26). Forty-one of these 303 patients were excluded because their SNPs results were unavailable. We also excluded two patients who harbored mutations in both MLH1 and MSH2. Eventually, 260 patients with germline mutations were recruited from 62 Lynch syndrome families.
Data collection. Clinical data from all patients were collected by nurses using a structured questionnaire, which included sociodemographic factors, medical, and family histories of cancer, as described by our previous studies (25, 26). Patients with Lynch syndrome were biennially followed-up for their recent cancer diagnoses. CRC and other cancer diagnoses were confirmed by cancer registry reports, pathology reports, medical reports, and death certificates.
Genotyping analyses. Sequenom iPLEX MassArray (Sequenom, San Diego, CA, USA) was used to genotype EGFR, SMAD7, and TGFB as described by our prior studies (25, 26). DNA sample of 10 ng was added to a polymerase chain reaction (PCR) mix containing Qiagen HotStarTaq (Qiagen, Valencia, CA, USA). PCR was performed according to Sequenom protocols and primers used in this study were obtained from Integrated DNA Technologies (Coralville, IA, USA). Assays were designed by MassARRAY Assay Design, Version 3.1. For quality control, 10% of randomly selected samples were repeated yielding a reproducibility of 100%.
Statistical analysis. Frequency distribution of all genotypes was examined for conformance to the Hardy-Weinberg equilibrium (HWE). Chi-squared goodness-of-fit test with one degree freedom was used to compare the differences between the observed and expected genotype frequencies of all the SNPs. Time at risk was considered to begin at birth and end at cancer diagnoses, death, or loss to follow-up. Patients who did not receive a diagnosis of CRC were censored at the date of their last known contact or in February 2012.
Since germline mutation carriers were recruited using non-probability sampling methods, to minimize for this nonrandom ascertainment, probability sampling weights were calculated and applied to each germline mutation carrier as previously described (27). Hazard ratios (HRs) and 95% confidence intervals (CIs) for the association between EGFR, SMAD7, and TGFB SNPs and CRC development were calculated in three genetic models (codominant, dominant, and recessive) using a weighted Cox proportional hazard model. The multivariable hazard model was used to adjust for the potential confounding factors including sex, MMR genes, year of birth, and regular colonoscopy. Furthermore, we adjusted for family and cluster correlations of these patients by using a robust sandwich covariance model, as described by a previous study (28).
For haplotype analysis, PHASE version 2.1 was used to construct haplotypes from genotypes of SMAD7 and TGFBR2 SNPs as described by Stephens et al. (29). PHASE is a Bayesian method for haplotyping population genotype data. The most frequent haplotype was used for comparison with other haplotypes. All tests were 2-sided and statistical significance was set at <0.05. All data analyses were performed by using SAS Version 9.4 for Windows.
Results
Genotype frequency distributions of EGFR, SMAD7, and TGFB SNPs are presented in Table I. All the SNPs, with the exception of TGFB1 rs1800468, conformed to HWE genotypic proportions. Because TGFB1 rs1800468 violated HWE, it was excluded from further analysis. Approximately 46.2% of participants harbored the heterozygous GA genotype of EFGR rs2227983. Moreover, 51.5% and 94.2% of participants harbored the wild-type homozygous genotypes of SMAD7 rs12953717 and rs2337104, respectively. Similarly, 51.8% and 49.8% of participants harbored the heterozygous genotypes of TGFB1 rs1800469 and TGFBR1 rs334354, respectively. More than 70% of germline mutation carriers in this study harbored the wild-type alleles of TGFBR2 rs3087465 and rs6785358.
The associations of EGFR, SMAD7, and TGFB SNPs with CRC risk are presented in Table II. After adjustment for potential confounding factors, hazard ratio (HR) showed that the variant homozygous AA genotype of EGFR rs2227983 was significantly associated with an increased risk of CRC in the recessive model compared with the wild-type G allele (HR=2.55, 95% CI=1.25-5.17). The codominant (HR=2.30, 95% CI=1.15-4.58) and dominant (HR=2.17, 95% CI=1.12-4.16) models of the SMAD7 rs12953717 SNP were significantly associated with an increased risk of CRC. In addition, the variant homozygous CC genotype of SMAD7 rs2337104 was significantly associated with CRC risk (HR=5.21, 95% CI=2.31-11.7) compared with the wild-type T allele. Similarly, the variant homozygous GG genotype of TGFBR2 rs6785358 was significantly associated with an increased risk of CRC compared with the homozygous wild-type A allele (HR=21.1, 95% CI=5.06-88.1).
Frequency distributions of EGFR, SMAD7, and TGFB polymorphisms in patients with Lynch syndrome.
Furthermore, the association between the haplotypes of SMAD7 and TGFBR2 SNPs and CRC risk was assessed in germline mutation carriers (Table III). Linkage disequilibrium of rs3087465 and rs6785358 was D’=0.62 for TGFBR2. However, rs12953717 and rs2337104 were in linkage equilibrium for SMAD7. The TT haplotype of SMAD7 SNPs was significantly associated with CRC risk (HR=2.18, 95% CI=1.13-4.19) compared to the most frequent CT haplotype. However, haplotypes of TGFBR2 were not associated with CRC risk. Table IV presents the combined effect of risk genotypes of EGFR rs2227983, SMAD7 rs12953717, and TGFBR2 rs6785358 on CRC risk, in patients with Lynch syndrome. Patients who harbored at least one risk genotype had higher CRC risk (HR=2.94, 95% CI=1.39-6.21 for patients with one risk genotype; HR=4.89, 95% CI=2.10-11.3 for patients with two risk genotypes) than those without risk genotypes.
Discussion
The present study revealed that EGFR rs2227983, SMAD7 rs12953717, and TGFBR2 rs6785358 SNPs are associated with CRC susceptibility in germline mutation carriers. Haplotype analysis revealed that the TT haplotype of SMAD7 SNPs was associated with an increased CRC risk compared to the most frequent CT haplotype. Moreover, the presence of at least one risk genotype significantly increased CRC risk.
EGFR plays a crucial role in cell adhesion, proliferation, differentiation, and migration (14). In this study, harboring variant AA genotype of rs2227983 significantly increased CRC risk. Our findings are consistent with Lurje et al. (30) who reported that patients with AA genotype of rs2227983, also termed rs11543848, were associated with poor CRC outcome and shorter progression-free survival than other genotypes. In contrast, another study reported that the wild-type GG genotype of rs2227983 is associated with CRC risk (31). Moreover, patients carrying wild-type genotype have been reported to have a poorer prognosis than those carrying the variant genotype (18, 32). The rs2227983 polymorphism, an arginine-to-lysine amino acid substitution at codon 521 of the EGFR, has been shown to decrease ligand binding affinity, thus attenuating growth stimulation, tyrosine kinase activation and the induction of proto-oncogenes (33). The increased risk of CRC in patients carrying variant genotype of rs2227983 in this study is a novel finding. The mechanism for this observed association is currently unclear. However, it has been indicated that EGFR inhibits TGFB signaling pathway through SMAD7 signaling pathways (10). Disruption of TGFB signaling pathways by SMAD7 leads to various forms of tumorigenesis (34), thus increasing cancer risk. Moreover, other polymorphisms of the EGFR, including rs712829 and rs712830, have also been reported to be associated with CRC susceptibility (32).
In this study, patients with variant T allele of SMAD7 rs12953717 were associated with an increased risk of CRC, corroborating previous studies (20, 35-38). In addition, a meta-analysis reported that SMAD7 rs12953717 was associated with CRC risk (19). In contrast, a cohort study reported a non-significant association between SMAD7 rs12953717 and CRC in germline mutation carriers (39). The non-significant results reported by Wijnen et al. may have been due to a lack of statistical power. SMAD7 encodes an inhibitory protein that acts as an antagonist of TGFB by blocking phosphorylation of receptor-activated SMAD (40). Inhibition of TGFB results in blocking activation of downstream signaling pathways (16), which eventually lead to CRC carcinogenesis. Moreover, Broderick et al. showed that lower median SMAD7 mRNA expression was associated with CRC risk among patients harboring the variant T allele (35), which support our findings.
EGFR, SMAD7, and TGFB polymorphisms and risk of CRC in patients with Lynch syndrome.
To date, only one study has investigated the association between SMAD7 rs2337104 and CRC (23). Akbari et al. reported that the TC genotype and C allele were associated with CRC risk (23), which are consistent with our findings. However, their results were not statistically significant. The rs2337104 is a T>C polymorphism occurring at intron 3 of SMAD7. The biological mechanism underlying the increased CRC risk among the variant C allele remains unclear, as it has rarely been investigated. Due to the low frequency of the minor C allele, our results should be interpreted with caution. Nevertheless, a haplotype analysis was performed and data revealed that the TT haplotype of rs12953717 and rs2337104 SNPs in SMAD7 was associated with an increased CRC risk. This finding confirms that the T allele of SMAD7 rs12953717 is a high-risk allele associated with CRC susceptibility.
To our best knowledge, this is the first study to report an association between TGFBR2 rs6785358 and CRC. The variant GG of rs6785358 was observed to be associated with CRC risk, compared to the A allele. Our findings are consistent with other studies (41, 42); however, these studies were conducted in patients with congenital heart defects and not in patients with CRC. Li et al. suggested that the polymorphisms of rs6785358 result in the decreased transcriptional activity of TGFBR2 and its low protein expression (42), thus altering the signal transduction pathways, which may explain the increased risk of CRC among individuals harboring the variant G allele.
Since CRC is caused by numerous biological interactions of different signaling pathways, we evaluated the joint effect of harboring risk genotype on CRC risk. Our results revealed that carrying at least one risk genotype in EGFR rs2227983, SMAD7 rs12953717, and TGFBR2 rs6785358 significantly increased CRC risk. In addition to the significance of individual SNPs, genetic combination of relevant SNPs may contribute to CRC susceptibility, as demonstrated in this retrospective cohort study.
However, this study was limited by the inability to test other MMR genes including MSH6, PMS2, and EPCAM. Some patients were not willing to be followed-up; hence, some cancer cases were not recorded. In addition, the associations of EGFR, SMAD7, and TGFB SNPs with CRC could not be compared in patients without Lynch syndrome. The main strengths of this study are that all patients included were confirmed to have germline mutation in MLH1 or MSH2. Statistical methods that correctly adjust for ascertainment bias were used; thus, our results are comparable to others. Moreover, all cancer diagnoses were confirmed histologically.
Haplotypes of SMAD7 and TGFBR2 and CRC risk in patients with Lynch syndrome.
Combined effect of EGFR rs2227983, SMAD7 rs12953717, and TGFBR2 rs6785358 polymorphisms and risk of CRC in patients with Lynch syndrome.
In conclusion, EGFR rs2227983, SMAD7 rs12953717, and TGFBR2 rs6785358 SNPs are associated with CRC risk in Chinese patients with Lynch syndrome. Moreover, harboring at least one risk genotype of EGFR rs2227983, SMAD7 rs12953717, and TGFBR2 rs6785358 significantly increases the risk of CRC in patients with Lynch syndrome.
Acknowledgements
The Authors would like to thank all patients and their families for participating in this study, as well as the staff of Chang Gung University and Chang Gung Memorial Hospital. This study was funded by the National Science Council, Executive Yuan, Taiwan (NSC 101-2314-B-038-057), Ministry of Science and Technology, Taiwan (MOST 104-2314-B-038-009), and WanFang Hospital, Chi-Mei Medical Center, and Hualien Tzu-Chi Hospital Joing Cancer Center Grant-Focus on Colon Cancer Research (Health and welfare surcharge of tobacco products: MOHW107-TDU-B-212-114020).
- Received August 15, 2018.
- Revision received September 18, 2018.
- Accepted September 19, 2018.
- Copyright© 2018, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved