Abstract
Background: Genetic alterations occurring in lung cancer are the basis for defining molecular subtypes and essential for targeted therapies. Exhaled breath condensate (EBC) is a form of non-invasive sample that, amongst components, contains DNA from pulmonary tissue. Next-generation sequencing (NGS) was herein used to analyze mutations in EBC from patients with lung cancer. Materials and Methods: EBC was collected from 26 patients with cancer and 20 healthy controls. Amplicon-based sequencing using Ion Ampliseq Colon and Lung Cancer gene panel v2 was applied. Results: The sequencing was successful in 17 patients and 20 controls. EBC from patients revealed 39 hotspot mutations occurring in: adenomatous polyposis coli (APC), v-raf murine sarcoma viral oncogene homolog B (BRAF), discoidin domain receptor tyrosine kinase 2 (DDR2), epidermal growth factor receptor (EGFR), erb-b2 receptor tyrosine kinase 4 (ERBB4), F-box and WD repeat domain containing 7 (FBXW7), fibroblast growth factor receptor 1 (FGFR1), FGFR3 (fibroblast growth factor receptor 3), Kirsten rat sarcoma viral oncogene homolog (KRAS), mitogen-activated protein kinase kinase 1 (MAP2K1), met proto-oncogene (MET), neuroblastoma RAS viral (v-ras) oncogene homolog (NRAS), phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha (PIK3CA), phosphatase and tensin homolog (PTEN), ret proto-oncogene (RET), SMAD family member 4 (SMAD4), serine/threonine kinase 11 (STK11), and tumor protein p53 (TP53) genes. EBC from controls revealed 35 hotspot mutations. The average mutant allele fraction was higher in patients than controls. Conclusion: NGS can identify mutations in EBCs from patients with lung cancer. This could provide a promising non-invasive method for the assessment of gene mutations in lung cancer.
Lung cancer is one of the most common malignancies worldwide. Lung cancer has a poor prognosis and a long-term survival rate that has not improved in the past 20 years (1). One reason for the poor tumor course and outcome is the very late phase at which the malignancy is diagnosed. At this phase, tumoral molecular changes lead to resistance to multiple forms of treatment (2-5). Sufficient tumor tissue biopsies are essential for making appropriate diagnostic and therapeutic decisions during the course of the disease. However, such biopsies are not always easily obtained. Recently, free circulating DNA has been investigated as a non-invasive method for the detection of numerous genetic alterations associated with malignancy and, thus, offers an alternative source of material for early cancer diagnosis (6-9).
Amongst other components, exhaled breath condensate (EBC) contains DNA that seemingly originates from the tumorous tissue in patients with lung carcinoma (10). Gene mutations occurring at tumor protein p53 (TP53), Kirsten rat sarcoma viral oncogene homolog (KRAS), epidermal growth factor receptor (EGFR), and cyclin-dependent kinase inhibitor 2A (CDKN2A) have been studied in EBC from patients with different lung malignancies (11-14). We recently demonstrated that next-generation sequencing (NGS) can be used to study mutations in EBC from healthy individuals (15). In the present study, we investigated the applicability of EBC in patients with cancer.
Materials and Methods
Patients and controls. EBC samples were collected from 26 patients with different lung neoplasms. We included any type of patient with lung neoplasms who were physically fit to give 15 minutes breathing and gave their consent. Seventeen patients had non-small cell lung carcinoma (NSCLC) (eleven adenocarcinoma and six squamous cell carcinoma), six had small cell lung carcinoma, two had pleural mesothelioma, and one patient had a suspected malignancy but did not have histopathological diagnosis (Table I). EBC samples from eight patients were collected during or after treatment (either chemotherapy, radiotherapy, or surgery). EBC from the remaining 18 patients was collected before any specific cancer treatment (Table II). Patients were diagnosed and treated at Helsinki University Hospital. For controls, EBC was collected from 20 healthy individuals including healthy smokers, ex-smokers, and non-smokers, and excluding those with pulmonary infections e.g. common flu. Detailed information and results from controls were reported previously (15). In brief, 10 healthy females (aged 25-56 years) and 10 healthy males (aged 25-68 years) were studied.
The study was approved by the Hospital and Uusimaa (HUS) Ethical Review Board (ethical permission number 253/13/03/01/2015). Written informed consent was obtained from all patients and controls.
EBC sample collection. Each patient provided an EBC sample by breathing through an ECoScreen® (Eric Jaeger, Würzburg, Germany) for 15 minutes. Mean breath volume and breathing frequency were checked every 5 minutes until the end of the collection procedure. Collected EBC samples were transported immediately on ice to the laboratory. Samples were transferred into 2-ml tubes. The volume of EBC was measured for each sample and then samples were stored at −70°C.
DNA extraction. DNA was extracted from the whole EBC sample using QIAamp Circulating Nucleic Acid Kit (Qiagen, Hilden, Germany) according to the manufacturer's instructions. The kit uses RNA carrier and a vacuum for fast and efficient processing of DNA from the whole sample (1-3 ml). DNA was measured using a Qubit® 2.0 Fluorometer and Qubit® dsDNA HS Assay kit (Thermo Fisher Scientific, Waltham, MA, USA).
NGS: Preparation of libraries. Ten nanograms of EBC DNA were used for preparing amplicon libraries using Ion AmpliSeq™ Library kit 2.0 (Thermo Fisher Scientific) according to the manufacturer's instructions. In order to amplify the template, Ion Ampliseq Colon and Lung Cancer panel v2 (Thermo Fisher Scientific) was used. The panel consisted of a primer pool of 92 amplicons covering 504 hotspot mutations in 22 genes frequently mutated in lung cancer. The genes included in this panel are AKT serine/threonine kinase 1 (AKT1), ALK receptor tyrosine kinase (ALK), v-raf murine sarcoma viral oncogene homolog B (BRAF), catenin beta 1 (CTNNB1), discoidin domain receptor tyrosine kinase 2 (DDR2), EGFR, erb-b2 receptor tyrosine kinase 2 (ERBB2), erb-b2 receptor tyrosine kinase 4 (ERBB4), F-box and WD repeat domain containing 7 (FBXW7), fibroblast growth factor receptor 1 (FGFR1), FGFR2, FGFR3, KRAS, mitogen-activated protein kinase kinase 1 (MAP2K1), met proto-oncogene (MET), (Notch homolog 1, translocation-associated (Drosophila) (NOTCH1), neuroblastoma RAS viral (v-ras) oncogene homolog (NRAS), phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha (PIK3CA), phosphatase and tensin homolog (PTEN), SMAD family member 4 (SMAD4), serine/threonine kinase 11 (STK11), and TP53. Only two samples from mesothelioma patients were amplified by using the Ion AmpliSeq Cancer Hotspot Panel v2 (Life Technologies, California, USA). This panel consisted of primer pool for 207 amplicons covering 2800 mutational hotspot regions in 50 genes, including ret proto-oncogene (RET) mutations. The amplified libraries were purified using Agencourt AMPure XP beads (Beckman Coulter Genomics, High Wycombe, UK). Library concentrations were measured using Qubit® dsDNA HS Assay kit and Qubit® 2.0 Fluorometer (Thermo Fisher Scientific). The libraries were stored at −20°C until the sequencing step was performed.
Template preparation and sequencing. The purified libraries were diluted to 100 pM and subsequently amplified on Ion SphereTM particles (Life Technologies). The templates were prepared and enriched using the Ion OneTouch™ 2 System (Life Technologies), an automated emulsion polymerase chain reaction system. Sequencing was performed on the Ion Personal Genome Machine System (PGM™; Life Technologies) using Ion 316™ chips and Ion PGM™ Sequencing Hi-Q view kit v2.
NGS data analysis. Torrent Suite Software v.5.2.2 (Life Technologies) was used to assess run performance and data analysis. The Integrative Genomics Viewer (IGV v 2.4; Broad Institute, Cambridge, MA, USA) was used for visual inspection of the aligned reads. The Variant Caller plug-in (v5.2) was used for variant calling. The same cutoff score as that used in our healthy control study was applied, with a threshold for quality score ≥20 and for mutant allele fraction ≥3% (15). This NGS 3% allele fraction cutoff was tested previously for EGFR mutations in patients with lung cancer when compared with a clinically approved real-time polymerase chain reaction assay (16). Only single nucleotide variants resulting in a non-synonymous amino acid change, a premature stop codon, and indels resulting in either a frameshift or insertion or deletion of amino acids were selected. All single nucleotide variants were analyzed for previously reported hotspot mutations [somatic mutations reported in the Catalogue of Somatic Mutations in Cancer (COSMIC) database] and novel variations such as new mutations detected by NGS, but not reported in either COSMIC (https://cancer.sanger.ac.uk/cosmic) or dbSNP (build 151) databases (https://www.ncbi.nlm.nih.gov/projects/SNP/).
Statistical analysis. Data were analyzed using IBM SPSS advanced statistics version 24 (IBM, Armonk, NY, USA). Comparison between the median EBC DNA concentration between two groups was performed using the Mann–Whitney test (non-parametric t-test). Independent samples t-test was used to compare significant differences in the number of detected mutations between ‘before treatment’ and the ’during/after treatment’ groups. All tests were two-sided and p-values of 0.05 or less were considered statistically significant.
Results
Success rate. DNA was successfully isolated from all 26 patients; NGS was successful in 17 patients. Nine EBC samples failed to amplify during sequencing (Table I). In the previously studied healthy controls, DNA was successfully isolated and sequenced in all 20 participants (15).
Summary of exhaled breath condensate samples collected and sequenced.
Sample collection, sequencing, and detected mutations according to treatment status for all patients.
Hotspot mutations in EBC from patients and controls. For both controls and patients, the same criteria for cutoff were set and mutations with quality ≥20 and allelic frequency ≥3% were considered to be true mutations. By applying these parameters to patient EBC samples, a total of 39 hotspot mutations that were previously reported as somatic mutations in the COSMIC database and 98 novel mutations (i.e. not previously reported) were identified. The most frequent hotspot mutations occurred in the following genes: TP53 (n=6), SMAD4 (n=6), PIK3CA (n=4), KRAS (n=3), BRAF (n=2), ERBB4 (n=2), FBXW7 (n=2), MAP2K1 (n=2), PTEN (n=2), STK11 (n=2), APC (n=1), DDR2 (n=1), EGFR (n=1), FGFR1 (n=1), FGFR3 (n=1), MET (n=1), NRAS (n=1), and RET (n=1).
A total of 35 hotspot variants and 106 novel mutations were detected at an allelic frequency ≥3% in EBC from healthy controls, with hotspot mutations occurring at the following genes: TP53 (11), KRAS (4), NRAS (3), SMAD4 (3), CTNNB1 (2), MET (2), PTEN (2), BRAF (1), DDR2 (1), EGFR (1), ERBB2 (1), FBXW7 (1), FGFR3 (1), NOTCH1 (1), and PIK3CA (1) (15). Detailed information on all hotspot mutations detected in both patients and controls, along with corresponding mutant allele fraction and quality score, are shown in Table III.
Mutations in corresponding tumor tissue. Out of the 17 successfully sequenced patient EBC samples, NGS reports for tumor tissue from routine clinical diagnostics were available for only four patients with adenocarcinomas (Table IV). Tissue NGS in clinical diagnostic laboratory was performed on Ion Torrent PGM using in-house cancer gene panel 1, which included the following genes: EGFR, KIT, KRAS, MET, NRAS, platelet-derived growth factor receptor A (PDGFRA), PIK3CA (all exons), and BRAF (exons 11-15). For patient EP-8, tumor DNA revealed mutations in KRAS exon 3 (c.183A>C) and PDGFRA exon 4 (c.423G>T). PDGFRA was not included in our patient panel; the KRAS mutation was however not identified in EBC sequencing results. In the tumor DNA of patient EP-16, a mutation in MET exon 14 (c.3028+ 3del) was observed, which in our analysis was outside of the amplicon region and was not studied. In the other two patients with KRAS exon 2 (c.35 G>A) mutations in their tumor specimens, no corresponding mutations were seen in their EBC samples (Table IV).
Mutant allele fraction (MAF) and quality score of hotspot mutations detected in patients and controls.
Mutation in relation to treatment status. EBC samples from eight patients were collected during or after treatment (chemotherapy, radiotherapy, or surgery); five out of these eight samples were successfully sequenced. The number of mutations detected in EBC samples was lower in these treated patients compared to the non-treated patients. However, this difference was not statistically significant (p=0.83) (Table II).
Clinical characteristics of patients with successfully sequenced samples.
Discussion
In this study, we showed, to our knowledge for the first time, that NGS can be adopted for analysis of EBC specimens from patients with different lung malignancies.
In our previous study, we optimized the application of NGS to EBC from healthy individuals with a median DNA concentration of 2,240 ng/ml (15). In the current study, we applied the same methodological techniques to EBC from patients with lung cancer to identify driver gene mutations. The success rate in patients was 65.4%; nine samples did not have sufficient sequencing reads even after repeating the sequencing step. The common element between these samples was a low DNA library concentration. Accordingly, we examined DNA concentration as a possible cause for failed amplification. The difference in median DNA concentration between samples with successful sequencing (1,560 ng/ml) and those with failed sequencing (1,380 ng/ml) was statistically significant (p=0.037). When comparing genes frequently mutated in both patients and controls (such as TP53 and KRAS) for hotspot mutations, the average mutant allele fraction was higher in patients than controls (22.9% and 13.6% in TP53 and 11.4% and 4.3% in KRAS in patients and controls, respectively). The presence of mutations in healthy controls does not necessarily have a clinical significance, but instead reflects a physiological homeostatic process and removal of damaged DNA due to continuous cellular turnover in pulmonary tissue (15). Similar findings were highlighted in 2012 by Kordiak et al., who compared the ratio of mutant and wild-type KRAS in EBC from patients with lung cancer and healthy controls. The authors revealed a significantly higher ratio of mutant:wild-type KRAS in patients (0.20±0.03) than in controls (0.028±0.032). They also compared the ratio of mutant:wild-type KRAS between tissue tumors and normal pulmonary parenchyma and found significantly higher values in cancerous tissues (12).
Pattern of hotspot mutations in exhaled breath condensate from patients and controls. See Table III for gene names.
Testing for KRAS mutational status may have a predictive value in different treatment protocols, as this mutation confers resistance to a wide variety of therapeutic agents, including chemotherapy and inhibitors of angiogenesis and EGFR (17). Four patients with adenocarcinomas had NGS results from tumor tissue acquired during routine clinical diagnostics. We compared these results with the corresponding EBC sequencing data. In patient EP-8, KRAS exon 3 mutation (c.183A>C) was detected in the tissue specimen, but was not observed in the patient's EBC. Patients EP-10 and EP-17 had a KRAS exon 2 (c.35G>A) mutation in their tissue specimens collected at the time of diagnosis (6 months and 18 months prior to EBC sampling, respectively) that were not detected in EBC. However, the EBC samples from these patients were collected when they had already undergone cancer treatment. For patient EP-10, EBC was collected during chemotherapy and VEGFR-inhibitor treatment. For patient EP 17, EBC was collected after radical treatment (lobectomy with adjuvant chemotherapy and radiotherapy). It not known whether the absence of mutations in EBC is due to treatment effect or other reasons. A previous reports showed a significant decrease in the mutant:wild-type allelic fraction of KRAS in EBC after tumor resection (12).
Comparing the pattern of hotspot mutations in EBC between patients and controls illustrated that some genes were mutated only in patient EBC samples (APC, FGFR1, RET, MAP2K1, and STK11). One of these mutations was MAP2K1 F53L, which has been reported to be associated with a distinct subset of lung adenocarcinoma and is a possible target for mitogen-activated protein kinase inhibitors (18). Mutations in BRAF, FBXW7, PIK3CA, and SMAD4 were found in both patients and controls but more frequently in patients (Figure 1). PIK3CA gene mutation was found in the patient group at codon 1047 (H1047Y), which is commonly mutated in NSCLC and may mediate resistance to anti-EGFR and other targeted therapies (19, 20).
Although the relevance of the mutational patterns observed in EBC is not yet clear, we feel that this should be reported due to the limited number of such studies on EBC. In addition, hotspot mutations in EBC categorized according to histological tumor types are given in Table V. The presence of hotspot mutations in apparently healthy individuals has also been reported in other studies (21, 22). Accumulation of mutations in somatic cells is a major step in the multistep process of tumorigenesis (23). There is also a correlation between increased mutation rate and increased lifetime incidence of lung cancer. It is believed that every ~30 mutations in the coding region are associated with a 0.2% increase of lifetime incidence of cancer in general (24). In our study, the average number of hotspot mutations in patient EBC samples (2.29) was higher than the average from control EBC samples (1.75). However, this was not statistically significant (p=0.292). Additionally, tissue NGS was not applicable to 15 patients due to the widely accepted clinical practice that only adenocarcinomas, and never smokers/former light smokers with other lung cancer histology are routinely tested. In real-life situations, there is often insufficient tissue material for tumor molecular profiling, which highlights the need for easily accessible samples such as EBC for molecular analysis in such difficult cases. Importantly, EBC might be able to replace the need for tumor re-biopsies in following the disease course and could facilitate detection of new resistance mutations during treatment. EBC may also be an ideal sample for screening of mutations and other genetic alterations in individuals at substantial risk of lung cancer, such as smokers and persons exposed to asbestos.
Hotspot mutations in exhaled breath condensate according to tumor type.
Our study has limitations, as we only had NGS results from corresponding tumor tissue for four patients to compare mutations seen in EBC. Moreover, two of these patients had undergone treatment since their tissue analysis, which makes it difficult to draw any conclusion regarding the concordance of results between EBC and tumor tissue. Further studies on a larger number of patients and comparison of their tumor and EBC mutation profile is required to assess the accuracy of EBC mutation profiling for diagnostic applications.
Conclusion
In the present study, hotspot mutations were detectable in EBC from patients with different types of lung malignancies. The mutant allele fraction for hotspot mutations was higher in EBC samples from patients than from healthy controls. EBC may provide a helpful tool in cancer diagnostics and molecular profiling. Genomic studies on EBC in lung cancer are still at an early phase. Further investigations are required to test the applicability of NGS for clinical use.
Acknowledgements
The Authors would like to thank Minna Ratilainen and Julia Hämäläinen for their invaluable role and help in EBC sample collection. This study was funded by the Sigrid Jusélius Foundation and the Finnish Work Environment Fund. O.Y. is grateful for the financial support received from the Otto A. Malm Foundation.
Footnotes
This article is freely accessible online.
Conflict of Interest
The Authors declare no conflict of interest in regard to this study.
- Received August 27, 2018.
- Revision received September 10, 2018.
- Accepted September 13, 2018.
- Copyright© 2018, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved