Abstract
Background: In patients with branch-duct intraductal papillary mucinous neoplasms (BD-IPMN), we aimed to develop a novel blood-based biomarker utilizing a gene-expression profile for the detection of pancreatic malignancies, such as IPMN-derived carcinoma (IPMC) or pancreatic ductal adenocarcinoma (PDAC). Patients and Methods: We enrolled 40 patients with pancreatic tumors (24 BD-IPMNs, four IPMCs and 12 PDACs) and identified the characteristic gene-expression profiles in pancreatic malignancies. Subsequently, we constructed a gene-expression scoring system for the proper diagnosis of pancreatic malignancies. The result was validated in 14 patients (five IPMNs, three IPMCs and six PDACs). Results: The scoring system utilizing the expression levels of 13 genes showed high diagnostic yield (sensitivity=94.0%, specificity=92.0% and area under the curve=0.94), which was confirmed in the validation set. Furthermore, its diagnostic yield was not reduced even in early-stage pancreatic malignancies (sensitivity=85.0%, specificity=93.0% and area under the curve=0.88). Conclusion: We developed a blood-based gene expression scoring system for cancer screening in patients with BD-IPMNs.
- Intraductal papillary mucinous neoplasms
- IPMN-derived carcinoma
- pancreatic ductal adenocarcinoma
- biomarker
- microarray
Intraductal papillary mucinous neoplasms (IPMNs) can be classified into two types based on their primary location in the pancreatic duct: Main-duct IPMN and branch-duct (BD-IPMN) (1). Each type differs in the risk of malignancy, and the revised 2017 international consensus guideline recommended resection for all surgically fit patients with main duct-IPMN since it has a high risk of malignancy (2). On the other hand, BD-IPMN has a lower frequency of malignancy, and careful follow-up with imaging modalities is generally recommended. However, recent studies reported a surprisingly high cumulative risk of pancreatic malignancies such as concomitant PDAC and IPMN-derived carcinoma (IPMC) during the observational period in patients with BD-IPMN. A recent long-term observational study reported that the overall incidence rates of pancreatic malignancies 5, 10, and 15 years after BD-IPMN diagnosis were up to 3.3%, 6.6% and 15.0%, respectively (3). Therefore, a careful surveillance program that aims to detect pancreatic malignancies should be established for patients with BD-IPMN.
Several guidelines have been developed and are widely used to estimate the malignant potential of BD-IPMN mainly for the detection of IPMC (2, 4, 5). The guidelines have proposed several key imaging features (e.g. mural nodule or dilated pancreatic duct) obtained by high-resolution imaging modalities for risk stratification. However, interpreting the results of these sophisticated modalities (e.g. endoscopic ultrasound or magnetic resonance imaging) requires expertise, and the diagnostic performance of the guidelines was reported to be varied in distinguishing benign IPMN and IPMC (sensitivity range from 7.3 to 83.3% and specificity range from 35.2 to 88.2%) (6-10). Regarding concomitant PDAC, little evidence exists for a screening program.
Highly diagnostic biomarkers that can be obtained in a less invasive manner are an ideal method for cancer screening programs. A considerable number of studies have attempted to develop biomarkers of body fluids (blood, pancreatic juice and cystic fluid) for the risk stratification of IPMN (11-16). A recent systematic review, including 193 studies with 12,297 patients, reported that serum carcinoembryonic antigen (CEA) and carbohydrate antigen 19-9 (CA19-9) had moderate diagnostic accuracy among other biomarkers (of 0.80 and 0.81, respectively) (17). Those serum biomarkers can be obtained less invasively but should be complementary to other imaging modalities for IPMN, considering their low diagnostic yield.
We previously developed a novel gene-expression scoring system that helped in differentiating basaloid squamous cell carcinoma of the esophagus from non-basaloid squamous cell carcinoma with high accuracy (18). In this study, we applied this gene-scoring system using blood specimens for detection of both IPMC and PDAC in patients with BD-IPMN.
Patients and Methods
Patients and diagnosis. To establish a novel gene-expression scoring system for the accurate diagnosis of pancreatic malignancies in patients with BD-IPMN, we prospectively enrolled 86 patients who were diagnosed with pancreatic tumors (32 BD-IPMN, 11 IPMC, 40 PDAC and three other types) by imaging modalities at Fukushima Medical University Hospital between April 2017 and April 2019. Among them, we excluded patients who met the following criteria: (i) Current treatment for malignancies; (ii) presence of concomitant non-pancreatic malignancy; and (iii) age younger than 18 years. Finally, we enrolled 40 patients (24 BD-IPMNs, four IPMCs and 12 PDACs) for the initial study (discovery set). Next, we enrolled 14 consecutive patients diagnosed with pancreatic tumors (five IPMNs, three IPMCs and six PDACs) between April 2019 and December 2019 to validate the results of the scoring system under the same criteria (validation set). The clinical backgrounds of the patients in the training and validation sets are summarized in Table I. In brief, there were significant differences in most variables, and the serum levels of CEA and CA19-9 were lower in patients with IPMN than in patients with pancreatic malignancies in both sets.
The primary endpoint was diagnostic accuracy, which differentiated IPMN and pancreatic malignancies with the gene-expression scoring system. The secondary endpoints were sensitivity and specificity for the gene-expression scoring system and other tumor markers.
The diagnostic criteria of BD-IPMN were as follows: (i) Cystic dilation of the branch duct connected to the main pancreatic duct with/without (ii) secretion of mucin from the major or minor papilla identified by endoscopic retrograde cholangiopancreatography (ERCP) or duodenoscopy (12). Regarding IPMC, the diagnosis was made by histopathology of surgical specimens or clinical features highly suggestive of the malignant transformation of BD-IPMN (progression of BD-IPMN with intramural nodules and invasion to surrounding organs). All PDAC cases were pathologically diagnosed via endoscopic ultrasound-guided fine-needle aspiration. The clinical stage (cStage) of pancreatic malignancies was determined according to the eighth edition of the American Joint Committee on Cancer (AJCC)/Union for International Cancer Control (UICC) staging system (19, 20).
Patient clinical data, including age, sex, location of disease, height, body weight, smoking and drinking habits, family history of pancreatic cancer and serum levels of CEA and CA19-9, were obtained from the electronic medical records. The pathological characteristics of the tumors (e.g. cStage) were also retrieved.
The study protocol conformed to the ethical guidelines of the 1975 Declaration of Helsinki and was approved by the Ethics Committee of Fukushima Medical University (approval no. 1953). Written informed consent was obtained from the patients.
RNA preparation. Aspiration of peripheral blood was conducted before treatment for pancreatic tumors in all cases. Peripheral blood was obtained through a venous catheter. A sample of peripheral blood of 1 to 2 ml was immediately mixed vigorously with three volumes of ISOGEN-LS (Nippon Gene Co., Ltd., Tokyo, Japan). After adding 11 ml of ISOGEN (Nippon Gene) to each sample, the samples were mixed thoroughly, and total RNA extraction was conducted according to the manufacturer's protocol (18).
Comprehensive gene-expression analysis (CGEA). For the DNA microarray that used total RNA, a set of synthetic polynucleotides (80-mers) (MiroDiagnostic, Tokyo, Japan) representing 14,400 species of human transcript sequences was printed on a glass slide using a custom arrayer. For the RNA of the samples, SuperScript II (Invitrogen Life Technologies, Carlsbad, CA, USA) and cyanine 5-dUTP (Perkin-Elmer Inc., Boston, MA, USA) were used to synthesize labeled cDNA from 5 μg of total RNA. Using the same method for the reference RNA, cyanine 3-dUTP (Perkin-Elmer Inc.) was used to synthesize labeled cDNA from 5 μg of Human Universal Reference RNA Type II (MicroDiagonostic).
Hybridization was performed with a labeling and hybridization kit (MicroDiagnostic). Signals were measured using a GenePix 4000B Scanner (Axon Instruments, Inc., Union City, CA, USA) and then processed into the primary expression ratios of the cyanine 5 intensity of each specimen to the cyanine 3 intensity of the human common reference RNA. Each ratio was normalized using GenePix Pro 3.0 software (Axon Instruments, Inc.). The primary expression ratios were converted into log2 values, which were designated as log ratios or converted values. The data were processed using Microsoft Excel software (Microsoft, Bellevue, WA, USA) and the MDI gene-expression analysis software package (MicroDiagnostic).
Hierarchical clustering and construction of the gene expression scoring system. Step 1: Genes with fluorescence intensity values below the detection limit in two or more of the 12 PDAC and 4 IPMC specimens and three or more of the 24 IPMN specimens were excluded. Step 2: The mean of the converted values of the chosen genes were calculated for each group, and the genes that met the following requirement were selected: absolute value of the difference between the mean of 12 PDAC+ four IPMC samples and the mean of 24 IPMN samples ≥0.5. Step 3: t-Test was used to compare the converted values between the PDAC plus IPMC samples and the IPMN samples. The genes with a p<0.01 were selected. Step 4: Hierarchical clustering was performed using the Euclidean distance with EpressionView Pro software (MicroDiagnostic). Step 5: For six genes, the mean of the converted values was higher in the IPMN samples than in the other samples, and the converted values were multiplied by −1. The converted values of the selected genes from all specimens were summed as the gene-expression scores (GSs).
Gene ontology and pathway enrichment analyses. To identify the role of the selected genes from the CGEA, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were performed via the Database for Annotation, Visualization and Integrated Discovery (DAVID 6.8; http://david.abcc.ncifcrf.gov/) (21, 22). GO terms [grouped into the biological process (BP), cellular component (CC) and molecular function (MF) categories] and KEGG pathways with p<0.05, which is equivalent to 1.3 of the enrichment score, were considered significantly enriched in these analyses.
Statistical analysis. Continuous variables (i.e. age, tumor size, serum CEA and CA 19-9 levels and GS) are reported as the means±standard deviation (SD) and were compared using unpaired Student's t-test (two-tailed). Categorical variables (i.e. sex and location of disease) were determined using chi-square statistics. The diagnostic yield of CEA and CA 19-9 levels and GS in distinguishing IPMN from pancreatic malignancies (IPMC and PDAC) was assessed using the area under the receiver operating characteristic (ROC) curve (AUC). Additionally, the combined diagnostic yield of GS plus CEA and GS plus CA19-9 was evaluated. A value of p<0.05 was considered statistically significant. All statistical analyses were implemented in GraphPad Prism 7.0 (GraphPad, San Diego, CA, USA) and SPSS 26 (IBM, Armonk, NY, USA).
Results
Diagnostic yield of the gene-expression scoring system. Figure 1 shows the CGEA results of 40 blood specimens; 11,614 genes were selected in step 1, 29 in step 2, and 13 in step 3. The 13 selected genes are described in Table II. Hierarchical clustering analysis clearly categorized two different clusters: 1) the pancreatic malignancy cluster and 2) the IPMN cluster (Figures 2 and 3A). By calculating the GS, it was possible to distinguish pancreatic malignancy specimens from IPMN specimens using this analysis, as shown in Figures 3B and 4. Pancreatic malignancies had high GSs, while most of the IPMNs tended to have lower scores.
ROC analysis showed that the three markers were able to discriminate patients with pancreatic malignancies from those with IPMN, with an AUC of 0.71 for CEA [95% confidence interval (CI)=0.53-0.90], 0.93 for CA 19-9 (95% CI=0.85-1.00), and 0.94 for GS (95% CI=0.86-1.00) (Figure 5A). In addition, the combined value of CEA plus GS and CA19-9 plus GS showed slight improvement in diagnostic yield, with an AUC of 0.94 for CEA plus GS (95% CI=0.87-1.00) and 0.97 for CA 19-9 plus GS (95% CI=0.92-1.00). As shown in Figure 5A, GS showed the highest diagnostic yield compared with the other two tumor markers at the cut-off value of 14.6, and the sensitivity and specificity were 94.0% and 92.0%, respectively. The combinations, especially of CA19-9 plus GS, also slightly improved the diagnostic yield.
Validation of the gene-expression scoring system. ROC curve analysis of the GS using the validation set showed a slightly reduced diagnostic yield at an optimal cut-off score of 11.5, with an AUC of 0.82 (95% CI=0.58-1.00), a sensitivity of 78.0%, and a specificity of 100% (Figure 5B). However, the combinations of CEA plus GS and CA19-9 plus GS showed similarly high diagnostic yields, with an AUC of 0.98 for CEA plus GS (95% CI=0.91-1.00) and 0.93 for CA 19-9 plus GS (95% CI=0.79-1.00).
Detection of early-stage pancreatic malignancies using the gene scoring system. To clarify whether GS was useful in distinguishing IPMN and cStage I-II pancreatic malignancies, we evaluated the diagnostic yield of the three biomarkers, as shown in Figure 6. For this analysis, we enrolled seven patients with cStage I-II pancreatic malignancies and 29 patients with IPMN from both the discovery and validation sets. ROC analysis showed that these three markers were able to discriminate between patients with cancer and controls, with AUCs of 0.65 for CEA (95% CI=0.35-0.95), 0.79 for CA 19-9 (95% CI=0.62-0.97), and 0.88 (95% CI=0.73-1.03) for GS. As shown in Figure 4, GS showed the highest diagnostic yield compared with the other two tumor markers at the cut-off value of 14.8, and the sensitivity and specificity were 85.0% and 93.0%, respectively.
GO and KEGG pathway enrichment analysis. GO analysis was performed using the DAVID platform. Figure 7 lists the five significantly up-regulated functions in pancreatic malignancies compared to IPMN, divided into the GO BP (GO_BP in Figure 6A) and GO MF (GO_MF in Figure 7A) categories. Figure 7B shows the most significantly downregulated function in pancreatic malignancies compared to IPMN (GO_CC). In brief, the significantly up-regulated functions were related to key regulators of the inflammatory response, such as nuclear factor-kappa B (NF-ĸB) and interferon-gamma (IFNγ). Regarding KEGG pathway analysis, five genes were found to be registered in the KEGG database (Table III). Both KITLG and MS4A1 were involved in the ‘hematopoietic cell lineage’, with a significant enrichment score of 1.6 (p=0.025).
Discussion
Accurate blood-based diagnostic biomarkers are ideal methods for cancer screening programs since they can provide objective data for clinicians in a less invasive manner. We applied a novel gene-scoring system using blood specimens to distinguish both IPMC and PDAC from BD-IPMN. The scoring system, utilizing the expression levels of 13 genes, showed a high diagnostic yield (sensitivity=94.0%, specificity=92.0% and AUC=0.94), which was confirmed in the validation set. Furthermore, its diagnostic yield was not reduced even in early-stage pancreatic malignancies (sensitivity=85.0%, specificity 93.0% and AUC=0.88).
Several studies have been conducted aiming to distinguish benign from malignant IPMN. Since blood-based biomarkers have failed to demonstrate reliable diagnostic yields, recent studies have tended to evaluate samples obtained directly from the pancreatic duct or cystic fluid. According to a systematic review, some biomarkers (e.g. cytology and CEA levels in pancreatic juice or cystic fluid) showed high specificity but the sensitivity was generally low, possibly due to degraded and scant cellular specimens (17). Additionally, endoscopic techniques such as endoscopic ultrasound-guided fine-needle aspiration and ERCP are invasive and may cause life-threatening adverse events such as post-ERCP pancreatitis or tumor cell dissemination. Therefore, blood-based biomarkers have regained attention for the diagnosis of pancreatic neoplasms.
Recent advancements in technology enable us to detect and quantify extremely small amounts of nucleic acids [e.g. deoxyribonucleic acid (DNA) and ribonucleic acid (RNA)] or fractions of them in the blood. Regarding RNA, many researchers have evaluated the diagnostic yield of noncoding RNAs, especially microRNAs (miRs), in the blood to establish new biomarkers for IPMN cancer screening. Komatsu et al. reported that plasma miR-223 levels discriminated the malignant potential between benign IPMN and malignant IPMN, as well as the progressive extent of invasiveness between malignant IPMN and PDAC (23). Another group showed that the combination of imaging features with a cluster of miRs might improve the prediction of malignant IPMN (AUC=0.92) (24). Additionally, circulating cell-free DNA is an attractive target for biomarker research. Berger et al. measured the level of circulating cell-free DNA in patients with IPMN and found that it had a high discriminatory power for control versus IPMN as well as for control versus PDAC; however, the diagnostic performance for IPMN versus PDAC was not so high (75.0% sensitivity and 71.4% specificity) (25). Overall, these biomarkers may be promising but require further improvement.
While RNA isolated directly from whole blood may produce increased noise in gene expression and reduced sensitivity compared with total RNA isolated after various protocols, the advantages of profiling gene expression from whole blood are compelling: i) Cell isolation steps that may incidentally alter gene expression patterns are avoided; ii) easy isolation of mRNA from whole blood facilitates studies; and iii) rare subpopulations of cells remain included in the analysis (26). Therefore, we decided to use whole blood for a blood-based screening system that might be faster and more concise than other methods. A similar method was previously utilized to develop a blood-based mRNA screening system for pancreatic cancer (27). In this study, the authors compared the gene expression between PDAC and healthy controls and developed a 56-gene diagnostic system. Even though it showed moderate diagnostic yield for differentiating PDAC and controls (sensitivity=73.6% and specificity=64.7%), they found that the assessment of changes in mRNA expression in whole blood was a viable alternative screening strategy for PDAC. Unlike their study, we first focused on IPMN and its related pancreatic malignancies since the screening program for the general population has been considered to be inefficient because of the low lifetime risk of PDAC. Indeed, our newly developed gene-scoring system successfully showed a higher diagnostic yield in distinguishing IPMN and pancreatic malignancies than other biomarkers previously reported.
Regarding the biomolecular background of altered gene expression, GO analysis provided some insights. Among the 13 selected genes in the comprehensive gene expression analysis, seven with higher expression in pancreatic malignancies than IPMN were positively related to five significant functions: NF-ĸB transcription factor activity, inflammatory response, IFNγ-mediated signaling pathway, sequence-specific DNA binding transcription factor activity and calcium ion binding. NF-ĸB is a transcription factor that has been well studied and is related to carcinogenesis and tumor progression in several cancer types, including PDAC (28, 29). In the cytoplasm, NF-ĸB is inactivated by the inhibitor proteins IĸB-α and IĸB-β and then activated by upstream signals that the IĸB inhibitor protein allow to dissociate from NF-ĸB. These upstream signals can come from the epidermal growth factor receptor tyrosine kinase, interleukin-1β pathway, tumor necrosis factor-α pathway, CD95/tumor necrosis factor-related apoptosis-inducing ligand pathway, or mitogen-activated protein kinases (30). In fact, evidence supports direct and indirect relationships between the malignant progression of IPMN and NF-ĸB activation (31-33). IFNγ is a cytokine whose biological activity via the Janus kinase/signal transducer and activator of transcription (JAK/STAT) pathway is conventionally associated with cytostatic/cytotoxic and antitumor mechanisms during the cell-mediated adaptive immune response (34). As Fukushima et al. reported, the aberrant methylation of the suppressor of cytokine signaling-1 gene, an inhibitor of the JAK/STAT pathway, was found in invasive IPMN but not in benign IPMN, which is suggestive of a relation between the malignant progression of IPMN and IFNγ. Finally, calcium ions may be uniquely related to IPMN progression via the S100 protein family, a multigene calcium-binding family (35). S100 protein is related to various pathways, such as mitogen-activated protein kinases, phosphatidylinositol 3-kinase, Rho GTPases, NF-ĸB and JAK/STAT. Among various S100 proteins, S100P is reported to be highly expressed in IPMN but not in normal pancreatic ductal epithelium. Moreover, S100P was clearly expressed in the invasive components of IPMNs, including perineural, lymphatic and minimal invasion. In summary, our gene-scoring system comprehensively includes genes that have been suggested to be related to IPMN and its progression.
This study has several limitations. Firstly, this was a single-center, observational study with a small sample size. Secondly, most of the patients did not undergo surgical resection, and the final diagnosis was not confirmed. Considering the high accuracy of computed tomography and magnetic resonance imaging in distinguishing IPMN from other cyst types, we considered that it was possible to diagnose IPMN by imaging modalities. Regarding patients with IPMC, four out of seven patients underwent surgical resection, and pathological diagnosis was confirmed. The other three patients had a distinct intramural mass of more than 10 mm with infiltration to the pancreatic parenchyma or celiac artery, which was highly suggestive of the malignant transformation of BD-IPMN. Thirdly, the GS was measured once in all patients, and changes in the GS along with the malignant transformation of BD-IPMN during surveillance were not evaluated. Further large studies are warranted to confirm the results.
In conclusion, we developed a novel blood-based gene-expression scoring system based on various pro-tumoral molecular activities. To clarify whether this test may be an alternative for conventional tumor markers or imaging modalities in the cancer screening of patients with IPMN, further prospective studies should be undertaken in the nearest future.
Acknowledgements
The Authors greatly appreciate Ms. Chikako Sato and Ms. Rie Hikichi for their skillful assistance in the study.
Footnotes
Authors' Contributions
RS, HT, JI and SW conceived the presented idea. RS, HT, RH, JI and SW developed the theory and performed the computations. RS, HT, JI and SW verified the analytical methods. RS, NK, HI, TT, MS, HA, YS, YO, JN, MT, TK, MH, TH provided medical care to patients. HO supervised the findings of this work. All Authors discussed the results and contributed to the final article.
This article is freely accessible online.
Conflicts of Interest
The Nippon Gene Co., Ltd. provided support in the form of salary for one of the Authors (RH) but did not have any additional roles in the current study. Other Authors have nothing to declare for this study.
- Received September 10, 2020.
- Revision received September 18, 2020.
- Accepted September 21, 2020.
- Copyright© 2020, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved