Abstract
Background/Aim: Epithelial–mesenchymal transition (EMT) is a process co-opted by cancer cells to invade and form metastases. In the present study we analyzed gene expression profiles of primary breast cancer cells in culture in order to highlight genes related to EMT. Materials and Methods: Microarray expression analysis of primary cells isolated from a specimen of a patient with an infiltrating ductal carcinoma of the breast was performed. Real-Time Quantitative Reverse Transcription PCR (qRT-PCR) analyses validated microarray gene expression trends. Results: Thirty-six candidate genes were selected and used to generate a molecular network displaying the tight relationship among them. The most significant Gene Ontology biological processes characterizing this network were involved in cell migration and motility. Conclusion: Our data revealed the involvement of new genes which displayed tight relationships among them, suggesting a molecular network in which they could contribute to control of EMT in breast cancer. This study may offer a basis for understanding complex mechanisms which regulate breast cancer progression and for designing individualized anticancer therapies.
Breast cancer (BC) represents a highly complex group of tumors with distinct biological behaviour, prognosis and response to therapy (1-4). The application of gene expression profiling technology has provided molecular classification of BC into clinically relevant subtypes, allowing for new approaches to predict disease recurrence and response to different treatments, and the likelihood of metastatic progression, and new insights into related signaling pathways (5, 6). Indeed, microarray-based expression studies have showed that BC is an heterogeneous disease at both the molecular and clinical level, which comprises of at least five subtypes with distinct gene expression patterns, associated with different prognoses and outcomes, namely luminal A, luminal B, HER2-overexpressing, basal-like, and normal-like types (7-9). However, histological and molecular parameters currently identified are still insufficient to give a reliable prediction of the clinical outcome for individual patients. Prediction of the metastatic potential of cancer cells is a critical point in the treatment of BC, as well as for other types of tumors. Nowadays, despite progress in our understanding of cellular processes which control the metastatic process (10-13), further characterization of the molecular mechanisms that regulate BC progression and invasion is required in order to improve recovery from BC.
The metastatic program of a carcinoma starts when the neoplastic epithelial cells begin to detach from the tissue of origin, lose their stationary phenotype and acquire motile capability with a mesenchymal phenotype. This process, known as epithelial-to-mesenchymal transition (EMT) which was recognized in the past in morphogenetic steps during development, has more recently been described as a strategy for neoplastic cells, including BC cells, to induce invasion and metastasis (14-18). Findings that reveal the appearance of EMT in vivo are sometimes controversial, probably due to histological preparation of ex vivo tissues that are not sufficiently adequate to reveal the phenotypic and molecular signs of EMT. To overcome these limitations, primary cell cultivation from individual BC tissues is a relevant procedure for the in vitro study of an ample variety of neoplastic cell features, including metastatic traits. Recently, we described an analytical investigation of a neoplastic primary cell population isolated from an infiltrating ductal carcinoma (IDC) of the breast which revealed phenotypic traits and expression of several mesenchymal biomarkers, such as vimentin and alpha-smooth muscle actin (α-SMA), typical of EMT (19). Real-Time Quantitative Reverse Transcription PCR (qRT-PCR) analysis showed a high level of expression of multiple genes associated with EMT, particularly TGF-β, the major inducer of the EMT in BC (20) and some of the most representative targets of the TGF-β signal transduction pathway, supporting the occurrence of the process in cells of our study (19). Although some reports describe microarray analysis investigating EMT gene signatures in vitro by using immortalized BC cell lines (21-23), to our knowledge only one other study has examined EMT transition in BC primary culture by microarray analysis (24). For this reason, this field needs to be further explored. Thus, the purpose of this study was to analyze gene expression profile by cDNA microarray of BC primary culture in order to highlight genes related to EMT in BC.
Materials and Methods
Tissue and cell cultures. An extra portion of breast IDC tissue, unnecessary for histological diagnosis after surgery, was obtained for this study. The patient gave her written informed consent for research participation according to the Helsinki Declaration and the study was approved by the Ethical Committee of the San Raffaele G. Giglio Hospital, Cefalù (number of protocol: C.E.2012/16). The tissue was disaggregated mechanically immediately after surgery, subjected to enzymatic digestion with collagenase/hyaluronidase and the BC primary cells, hereafter named simply BC cells, were isolated by applying differential centrifugation method as previously described (19). The BC cells were grown at 37°C under 5% CO2-95% air atmosphere in a humidified incubator in a selective medium of DMEM with antibiotics supplemented with 10 mM Hepes (Invitrogen, Carlsbad, CA, USA), 1 mg/ml bovine serum albumin (BSA), 10 ng/ml cholera toxin, 0.5 μg/ml hydrocortisone, 5 μg/ml insulin, 5 ng/ml epidermal growth factor (EGF), 5 μg/ml apo-transferrin (all from Sigma-Aldrich, St. Louis, MO, USA). All gene expression experiments were conducted when BC cells were at the fourth growth passage. The non-tumor MCF-10A cell line was used as mammary epithelial reference sample in line with others in vitro BC studies (30-31) and cultured according to instructions (American Type Culture Collection, Manassas, VA, USA). The follow-up of the patient was not part of this study.
Whole-genome cDNA microarray expression analysis. Total RNA was extracted from BC and MCF-10A cells using Trizol and the RNeasy mini kit (Invitrogen) according to the manufacturer's guidelines. RNA concentration and purity were determined spectrophotometrically using the Nanodrop ND-1000 (Thermo Scientific Open Biosystems, Lafayette, CO, USA) and RNA integrity, measured as RNA integrity number (RIN) values, was assessed using a Bioanalyzer 2100 (Agilent Technologies, Santa Clara, CA, USA). Only samples with a maximum RIN of 10 were used for further microarray analysis. Five hundred nanogrammes of total RNA from BC primary culture and MCF-10A cell line were used for cRNA synthesis and labelling according to Agilent Two-Color Microarray-Based Gene Expression Analysis protocol. Samples were labelled with Cy5 and Cy3 dye (Agilent Technologies). Fluorescent complementary cRNA samples (825 ng) were hybridized onto Whole Human Genome 4×44K microarray (Agilent Technologies) GeneChips containing all known genes and transcripts of an entire human genome. Four replicates were performed. Array hybridization was conducted for 17 h at 65°C. Images were achieved by an Agilent's DNA Microarray Scanner with Sure Scan high-Resolution Technology (Agilent Technologies) and analyzed using Feature Extraction expression software (Agilent Technologies, Santa Clara, CA, USA) that found and placed microarray grids, rejected outlier pixels, accurately determined feature intensities and ratios, flagged outlier features, and calculated statistical confidences.
Statistical data analysis, background correction, normalization and summary of expression measure were conducted with GeneSpring GX 10.0.2 software (Agilent Technologies). Data were filtered using two-step procedure: first the entities were filtered based on their flag values P (present) and M (marginal) and then filtered based on their signal intensity values, this enables very low signal values or those that have reached saturation to be removed. Statistically significant differences were computed by the Student's t-test and the significance level was set at p<0.05. The false discovery rate (FDR) was applied as a multiple test correction method. Average gene expression values between experimental groups were compared (on log scale) by means of a modified ANOVA (p<0.05). Genes were identified as being differentially expressed compared to MCF-10A cells if they had a fold-change (FC) in expression of at least 3 and a p-value <0.05.
The data discussed in this publication have been deposited in NCBI's Gene Expression Omnibus (25) and are accessible through GEO Series accession number GSE53175 (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE53175).
Microarray data are available in compliance with Minimum Information About a Microarray Experiment (MIAME) standards.
MetaCore network analyses. Microarray data sets were also analyzed by pathway analysis using the network building tool MetaCore (Thomson Reuters, Philadelphia, PA, USA) consisting of millions of relationships between proteins derived from literature publications about proteins and small molecules (including direct protein interaction, transcriptional regulation, binding, enzyme-substrate, and other structural or functional relationships). Results, i.e. maps of protein lists from the uploaded dataset, were then compared with all the possible pathway maps for all the proteins in the database, and the p-value was calculated based on hypergeometric distribution probability test. The most representative networks significantly changed were selected and analysed.
qRT-PCR analysis. Candidate genes for qRT-PCR analysis were chosen based on microarray results. One microgramme of total RNA was reverse-transcribed into cDNA with SuperScriptII reverse transcriptase according to the manufacturer's specifications (Invitrogen). One microlitre of cDNA (50 ng RNA equivalent) was analyzed by real-time PCR (1 cycle 95°C for 20 s and 40 cycles of 95°C for 3 s, 60°C for 30 s) in triplicate using Fast 7500 Real-Time PCR System (Applied Biosystems). Amplification reactions were performed in a 20 μl reaction volume containing 10 pmoles of each primer and the Fast SYBR Green Master Mix according to the manufacturer's specifications (Applied Biosystems). Reaction specificity was controlled by post-amplification melting curve analysis. The oligonucleotide primers were selected with the Primer3 software (26-27) and tested for human specificity using NCBI database. Primer sequences (forward and reverse) used are listed in Table I. Quantitative data, normalized versus rRNA 18S gene, were analyzed by average of triplicates Ct (cycle threshold) according to the 2–ΔΔct method using SDS software (Applied Biosystems). The data shown were generated from three independent experiments and the values are expressed relative to mRNA levels in the non-tumuorigenic MCF-10A control cells as the mean±SD.
Expression of genes in primary breast cancer cells compared to the Epithelial–Mesenchymal Transition core gene list of Gröger et al. (29).
STRING interaction analyses. In order to analyse potential interactions among selected genes, STRING database of known and predicted protein interactions was used (http://string-db.org). Direct (physical) and indirect (functional) associations derived from four sources: genomic context, high-throughput experiments, conserved coexpression, and previous bibliographical knowledge, were analysed as previously described (3).
Results
Overview of cDNA microarray gene expression. In the present study, a two-color microarray-based gene expression analysis (Agilent Technologies) was conducted on BC primary culture and on MCF-10A cells used as a reference sample. Comparative differential gene expression analysis revealed that 2,982 genes in BC cells had expression levels significantly altered by 3-fold or greater compared to the MCF-10A reference group: 1,553 genes were down-regulated and 1,429 were up-regulated (GSE53175). Altered transcripts were selected and grouped according to their involvement in specific biological pathways using integrated pathway enrichment analysis with GeneGo MetaCore. Datasets were loaded into the MetaCore software and the top enriched canonical metabolic pathways were analyzed. The result of this mapping revealed the involvement of a set of molecules controlling specific process networks such as cytoskeleton remodelling, cell-cycle regulation and cell adhesion in BC primary culture in comparison with the reference sample, processes known to have a role in EMT. In addition, lists of differentially expressed genes were analyzed in order to search for factors potentially involved in EMT or in BC, and new ones to propose as new markers of interest. Thirty-six candidate genes were selected, validated and analyzed using PubMatrix tool (28). In this way, lists of terms such as gene names can be assigned to a genetic, biological, or clinical relevance in a rapid flexible systematic fashion in order to confirm our assumptions.
EMT core list validation. Our microarray data were compared with the ‘EMT core gene list’ recently proposed by Gröger et al. (29). Based on cluster analysis of the gene expression studies (GES) derived from a multitude of cell types and EMT initiation methods, Gröger and colleagues conducted a meta-analysis of gene expression signature defining the EMT during cancer progression. They proposed a meaningful EMT core gene list which describes the majority of the involved genes across the analyzed GES. Cluster analysis of genes shared among at least 10 datasets containing 130 genes, out of which 67 were up- and 63 were down-regulated, were further classified into five categories: (i) cell adhesion and migration, (ii) development, cell differentiation and proliferation, (iii) angiogenesis and wound healing, (iv) metabolism, (v) other or unclassified, according to single enrichment analysis. As shown in Table II and in Figure 1, 91 (~70%) out of these 130 genes were found to be de-regulated in BC primary culture, with the same trend proposed by Gröger et al. (44 down-regulated genes and 47 up-regulated). Thirty five (~27%) genes were not included in our BC microarray dataset, while four genes (~3%) showed an opposite expression trend with respect to Gröger et al.'s list: Fibulin 1 (FBLN1), Latent Transforming growth factor Beta binding Protein 1 (LTBP1) and Tissue Factor Pathway Inhibitor (TFPI) were down-regulated, while Follistatin (FST) was up-regulated in BC cells.
Microarray validation experiments. Firstly, we assayed mRNA levels of the four genes showing an opposite gene expression trend with respect to Gröger's list: FBLN1, LTBP, TFPI and FST genes. qRT-PCR experiments confirmed the microarray expression trend (Tables III and IV).
Moreover, based on the microarray data set, Pubmatrix results and MetaCore analyses, we chose 32 candidate genes and performed qRT-PCR validation experiments (Table III and IV). Genes for validation were chosen based on two considerations: i) factors known to be involved in EMT transition or in BC; ii) lesser known genes involved in these processes to propose as new molecular markers. qRT-PCR analyses were performed in parallel with MCF-10A cells. Genes selected were some of the most representative targets of TGF-β signal transduction pathway, as well as cytoskeleton and adhesion proteins, and downstream targets of this pathway (32, 33). qRT-PCR analysis revealed an increase of mRNA levels of genes known to be involved in mesenchymal progression, EMT transition and in matrix modelling such as Transforming Growth Factor beta 1 (TGFB1), Twist homolog 1 (Drosophila) (TWIST1), Vimentin (VIM), Actin, alpha 2, smooth muscle, aorta (ACTA2), Cadherin 2, type 1, N-cadherin (CDH2), Wingless-type MMTV integration site family, member 5A (WNT5A), Matrix MetalloPeptidase 3 (MMP3), Lysyl Oxidase-Like 2 (LOXL2), Matrix-remodelling Associated 5 (MXRA5), Collagen type VIII alpha 1 (COL8A1), Collagen type VI alpha 3 (COL6A3), Fibronectin 1 (FN1) and connective tissue growth factor (CTGF). Other less well-known genes involved in these processes were also up-regulated: Platelet-derived Growth Factor Receptor, beta polypeptide (PDGFRB), Thrombospondin 1 (THBS1), Fibroblast Activation Protein, alpha (FAP), interleukin-6 (IL6), Keratin-19 (KRT19), Periostin (POSTN), Fibroblast Growth Factor 2 (FGF2), Inhibitor of DNA binding 2 (ID2) and cathepsin K (CTSK) (Table III).
Primer sequences used for qRT-PCR analyses
Microarray data for gene expression of cultured primary breast cancer cells compared with to Groger's EMT core gene list.
On the other hand, down-regulation was observed for some epithelial genes and factors including also several keratins, such as E-cadherin (CDH1), Amphiregulin (AREG), Epithelial Splicing Regulatory Protein 2 (ESRP2), Keratin-6A (KRT6A), Keratin-8 (KRT8), Keratin-5 (KRT5) and Keratin-18 (KRT18). In addition, down-regulation of Fibroblast Growth Factor Binding Protein 1 (FGFBP1), Tumor Protein p63 (TP63) and CD24 signal transducer (CD24) mRNA levels was observed (Table IV). All qRT-PCR validation experiments confirmed Microarray gene expression trends as shown in Table III and IV.
Thirty-six candidate gene interactions. In order to verify potential interactions among markers identified, we generated a molecular network by introducing these 36 genes into STRING, a molecular tool able to elaborate physical and functional associations among proteins (34). Despite data heterogeneity, Figure 2 clearly displays the tight relationships existing among molecules, showing that 34 out of 36 genes are closely connected in a single network. Moreover, the STRING analysis reveals that the most significant GO biological processes characterizing this network (Figure 3) are involved in cell migration and motility, in agreement with behaviour featured in tumor cells wring EMT.
Discussion
EMT is a recognized process involved in the morphogenetic steps during development, but which is also co-opted by various cancer cells to enable them to invade and form metastases at distant sites.
Over the past decade, a number of GES regarding EMT were conducted on a variety of cell types and different modes of EMT induction. Recently, Gröger and colleagues, using a meta-analysis of GES performed on different cell types and treatment modalities, defined a transcriptome signature named EMT core gene list, containing 130 well-known genes involved in EMT, as well as new genes selected among the multitude of GES so far described (29). All BC samples used in those studies were immortalized BC cell lines, which are considered well-established procedures in biomedicine for a more complete understanding of cellular cancer process.
A further contribution to these studies is offered by the primary cell cultures which more tightly maintain tumor characteristics and behaviour in vivo. Indeed, their application in clinical research could represent a powerful system for studying molecular mechanisms of human disease such as BC in order to define individualized anticancer therapies in the future. Recently, we described EMT in a primary cell culture derived from a patient with breast IDC which revealed phenotypic traits and expression of several mesenchymal biomarkers. The EMT phenomenon is difficult to predict due to its dynamic evolution inside tissues. Thus, the aim of the present study was to analyze the gene expression profile by cDNA microarray of BC cells from primary culture in order to identify known and less well-known putative EMT markers of BC. Indeed, a very limited number of reports describe gene expression profile of EMT in BC cell lines (21-23, 35-36), and to our knowledge, only one other study has examined EMT transition in BC primary culture by microarray approach (24).
Up-regulated genes in breast cancer primary culture.
Down-regulated genes in breast cancer primary culture
STRING analysis of known and predicted protein interactions existing among the 36 selected genes closely connected in a single network.
Several bibliographical data are available concerning the role of some of the selected genes in EMT transition such as: ACTA2, CD24, CDH1, CDH2, CTSK, CTGF, FGF2, FN1, ID2, IL6, KRT18, KRT8, LOXL2, MMP3, TP63, POSTN, TGFB1, THBS1, TWIST, VIM and WNT5A. For this reason, their functional role will be not discussed here.
Conversely, no data or few are available on COL6A3, COL8A1, AREG, ESRP2, FAP, PDGFRB, FST, LTBP1, KRT19, KRT5, KRT6A, FBLN1, FGFBP1, MXRA5 and TFPI genes in EMT.
In the tumor microenvironment, both stromal and cancer cells contribute along with various types of extracellular matrix (ECM) proteins to actively remodel the microenvironment and favour tumor growth and metastasis. EMT provides a mechanism where cells adopt a motile phenotype and, regarding this process, ECM proteins are often described: for example increased collagen production is linked with malignancy. Nevertheless, no one has described the role of COL8A1 in EMT and only recent work describes COL6A3 protein as being actively involved in mammary tumour progression through enhancing EMT, fibrosis and chemokine activity, thereby recruiting stromal cells to the tumor microenvironment (37, 38). Both COL6A3 and COL8A1 were up-regulated in our study. Notably, all these activities are associated with acquired drug resistance: Park et al. suggest that this protein is a predictor of chemoresistance to cisplatin in patients with solid tumors, including BC, being able to enhance EMT. Thus, COL6A3 has been highlighted as a promising candidate triggering drug resistance against platinum-based therapeutics, since its levels are vastly increased in cisplatin-resistant cancer cells in vitro. However, the detailed molecular mechanisms underlying these correlations remain elusive (39).
Analysis of the most significant Gene Ontology biological processes highlighted in the STRING network.
AREG, or amphiregulin protein, is an EGF receptor ligand, essential for in vitro proliferation of human keratinocytes under autocrine and growth factor-stimulated conditions (40). AREG was recently described by Yang et al. to be able to mediate EGF-independent growth and malignant behaviour of mammary epithelial cells in a non-cell-autonomous manner. It is a target for a transcriptional co-activator named TAZ factor having a PDZ-binding motif, which in human mammary epithelial cells promotes EMT, but no additional data are available on this topic. Positive correlation between TAZ and AREG proteins in a patient with BC was revealed in vivo. Of particular note, AREG can be secreted into the blood system, and its detection would be non-invasive and quite simple (41, 42).
To date, the epithelial splicing regulatory proteins (ESRPs) are the only known splicing factors that exhibit epithelial cell-type-specific expression and that undergo pronounced changes in expression during the EMT. A loss of ESRP2 expression is a generalized event during the EMT, and was shown by studies in which the induction of EMT by TGFB1, SNAIL, ZEB1, ZEB2, and E-cadherin knockdown was accompanied by a nearly complete abrogation of ESRP expression (22, 43, 44). Cells lacking ESRP1 and ESRP2 lost cell–cell adhesion, acquired a stellate-like shape, and became more motile. Changes in cell morphology and behaviour through abrogation of an epithelial splicing program suggests that the ESRPs regulate transcripts that, while expressed in both epithelial and mesenchymal cells, encode protein isoforms with unique functions specific to each cell type. For these reasons Warzecha and Carstens suggest a model whereby the ESRPs are master regulators of epithelial cell function and morphology (45). Interestly, only two articles describe the relation between ESRPs and BC, showing their down-regulation after EMT (TGFB-induced) during BC progression (46-49), in line with our data.
During EMT, tumor cells interact with activated stromal fibroblasts, also known as cancer-associated fibroblasts, in order to define tumour phenotype development. Quiescent fibroblasts become activated through mutual paracrine effects with tumour cells. Some differentially expressed markers of cancer-associated fibroblasts, including fibroblast activation protein (FAP or seprase) and PDGFRB, were up regulated in BC primary culture cells. Although an influence of these markers on EMT is postulated, their functional importance for modulation of cancer cell phenotype is not fully understood (49, 50).
Currently, no data regarding EMT and FST, LTBP1, KRT19, KRT5, KRT6A, FGFBP1 and TFPI, are available, but a relation between these factors and BC was described. Thus, they could represent new markers of EMT in BC.
In addition, scarce information exists for FBLN1 or MXRA5 genes and EMT or BC. FBLN1 is a secreted glycoprotein that becomes incorporated into a fibrillar extracellular matrix and has been indicated as a prognostic marker for near-term events (2.5 years) for both patients with lymph node-positive and those with lymph node-negative BC (51). Moreover, MXRA5 gene, which encodes for one of the matrix-remodelling associated proteins, has been recently described as a new cancer gene, frequently mutated in non-small cell lung carcinoma (52) and by us described for the first time in EMT and BC. Further analysis should clarify their role on this matter.
Concluding, gene expression data for BC cells of primary culture are in line with those reported in literature concerning EMT in cancer cells, with special focus on BC. In the present work we describe the involvement of novel genes able to guide this process, highlighting a molecular network that could contribute to the control of EMT. EMT markers identified here should act as prognostic indicators, for example in response to drug therapies or to ionizing radiation treatments, in which this transition may play a key role not yet well-known (53, 54).
Conclusion
Our gene expression data show the involvement of novel genes which display tight relationships among them, suggesting a molecular network in which they could contribute to regulation of EMT in BC. We trust that this study may offer a further basis for understanding the complex mechanisms which regulate BC progression, such as the EMT, and for designing individualized anticancer therapies in the future.
Acknowledgements
This work was supported by FIRB/MERIT (RBNE089KHH).
Footnotes
-
↵* These Authors contributed equally to this work.
-
Authors' Contributions
All Authors participated to the conception, design, interpretation, elaboration of the findings of the study, drafting and revising the final elaborate. In particular, VB and LM designed the study: LM isolated and maintained BCpc primary culture and VB performed microarray experiments and gene expression profile analyses. VB, LM, FPC and GIF performed qRT-PCR assays. Professor M.C. Gilardi and Professor C. Messa participated in the elaboration of the findings of the study, drafting and revising the final elaborate. All Authors read and approved the final content of the manuscript.
-
This article is freely accessible online.
-
Competing Interests
The Authors declare that they have no competing interests'.
- Received March 3, 2014.
- Revision received March 17, 2014.
- Accepted March 18, 2014.
- Copyright© 2014 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved