Large meta-analysis of multiple cancers reveals a common, compact and highly prognostic hypoxia metagene

Buffa, F M; Harris, A L; West, C M; Miller, C J

doi:10.1038/sj.bjc.6605450

Download PDF

Genetics and Genomics
Open access
Published: 19 January 2010

Large meta-analysis of multiple cancers reveals a common, compact and highly prognostic hypoxia metagene

F M Buffa¹,
A L Harris¹,
C M West² &
…
C J Miller³

British Journal of Cancer volume 102, pages 428–435 (2010)Cite this article

9763 Accesses
312 Citations
8 Altmetric
Metrics details

Subjects

An Erratum to this article was published on 28 September 2010

An Erratum to this article was published on 07 September 2010

This article has been updated

Abstract

Background:

There is a need to develop robust and clinically applicable gene expression signatures. Hypoxia is a key factor promoting solid tumour progression and resistance to therapy; a hypoxia signature has the potential to be not only prognostic but also to predict benefit from particular interventions.

Methods:

An approach for deriving signatures that combine knowledge of gene function and analysis of in vivo co-expression patterns was used to define a common hypoxia signature from three head and neck and five breast cancer studies. Previously validated hypoxia-regulated genes (seeds) were used to generate hypoxia co-expression cancer networks.

Results:

A common hypoxia signature, or metagene, was derived by selecting genes that were consistently co-expressed with the hypoxia seeds in multiple cancers. This was highly enriched for hypoxia-regulated pathways, and prognostic in multivariate analyses. Genes with the highest connectivity were also the most prognostic, and a reduced metagene consisting of a small number of top-ranked genes, including VEGFA, SLC2A1 and PGAM1, outperformed both a larger signature and reported signatures in independent data sets of head and neck, breast and lung cancers.

Conclusion:

Combined knowledge of multiple genes' function from in vitro experiments together with meta-analysis of multiple cancers can deliver compact and robust signatures suitable for clinical application.

Inferring gene regulatory networks from single-cell multiome data using atlas-scale external data

Article Open access 12 April 2024

Qiuyue Yuan & Zhana Duren

A single-cell atlas enables mapping of homeostatic cellular shifts in the adult human breast

Article Open access 28 March 2024

Austin D. Reed, Sara Pensa, … Walid T. Khaled

Microenvironmental reorganization in brain tumors following radiotherapy and recurrence revealed by hyperplexed immunofluorescence imaging

Article Open access 15 April 2024

Spencer S. Watson, Benoit Duc, … Johanna A. Joyce

Main

Gene-expression studies attempt to extrapolate biologically and clinically relevant hypotheses from gene expression patterns. However, many current studies make little use of existing knowledge such as gene function within specific pathways, and prognostic signatures are often derived with no reference to the functional roles of their components.

One increasingly popular method that aims to make use of prior knowledge is gene set enrichment analysis (GSEA) (Subramanian et al, 2005). It first conducts a supervised analysis by ranking genes according to their ability to discriminate between different sample groups, and then maps them onto previously defined gene sets, typically formed according to common function using annotation sources. The goal is to identify sets containing a statistically significant number of highly ranked genes, and then to use this information to provide functional characterisations for the samples in question. Although powerful, GSEA relies on stratification of the experimental samples into distinct groups, often making it unsuitable for use with heterogeneous clinical data sets.

Another approach often applied to microarray data involves creation of a co-expression network within which each ‘node’ represents a gene, and ‘edges’ are created between genes when their expression patterns are significantly correlated. Co-expression networks have been used to formulate functional and clinical hypotheses from in vivo data (Butte and Kohane, 2003; Hahn and Kern, 2005; Wolfe et al, 2005). A disadvantage with the approach is that it can be susceptible to the multiple testing issues that arise due to the large number of genes represented on a typical microarray. Setting a low threshold for a significant correlation between genes will result in the inclusion of many spurious links, whereas a high threshold will control the false-positive rate at the expense of omitting many genuine edges.

Here we illustrate and validate a network-based approach with parallels to both GSEA and co-expression networks; for a workflow of the method see Supplementary Material and Methods. It can be applied directly to clinical data, even when the samples cannot be partitioned in advance into distinct groups. The algorithm begins with a collection of ‘seed’ genes that are then used as starting point from which to build an association network. Rather than simply connect gene pairs with high correlation between their expression profiles, the approach defines a ‘neighbourhood of co-expression’ around each seed gene, and then connects seeds that have a significant degree of overlap between their neighbourhoods. This approach is relatively robust against the inclusion of spurious edges, as edges are only added when there is consistently high correlation to many intermediate genes that form the intersection between seeds. We previously used a seed-based approach successfully to predict hypoxia-related genes (Winter et al, 2007); this study develops the method in a meta-analysis context to produce robust signatures requiring fewer genes, making them more suitable for clinical use, for example in quantitative RT-PCR analyses of biopsies at presentation.

Hypoxia has a key role in defining the behaviour of many cancers including head and neck squamous cell carcinomas (HNSCCs) (Nordsmark et al, 2005) and breast carcinomas (BCs) (Fox et al, 2007); thus the identification of common hypoxia-regulated genes is important both for understanding of cancer evolution, and for improved prognosis or development of novel therapies. The described approach was applied to a large meta-analysis of HNSCCs and BCs to define successfully a common and robust hypoxia signature.

Materials and methods

Seed clustering

The process begins with k seed genes, ∏={π₁,π₁,…π_k} (‘gene’ is used throughout for convenience, although ‘transcript’ is generally more accurate). Spearman's correlation, ρ, is computed between seeds and genes Y={y₁, y₂,…y_m} in a data set of n samples, X={x₁, x₂,…x_n}. For each seed/gene pair, their ‘affinity’ is defined as:

where θ_t and θ_s define extent and sharpness of the cluster. When θ_s → 0, δ reduces to the step function with δ =0 if ρ²<θ_t, δ=1 if ρ²>θ_t. In this limit, the method is parameter free, and this will be used in this study. θ_t is defined objectively using a probability threshold, α, of observing a given correlation if the null hypothesis (i.e. no association) was true. This needs to be corrected for multiple testing (Hastie et al, 2001) to account for the size of Y; here, α=0.05 after Bonferroni correction was considered. Finally, a membership function is defined:

An increasing γ indicates stronger membership of a gene to a seed cluster.

Shared neighbourhood

The shared neighbourhood, S, between two seeds is defined as:

where γ is the membership (Eq. 2). Two seeds are considered to carry a high degree of related information if their clusters share many genes (high S values). A sign function is also defined:

where sgn(x) is the sign function: sgn(x)=1 if x>0, sgn(x)=−1 if x<0. If two seeds are correlated with their shared features in the same direction, F=1 (seeds are fully concordant); if they are correlated with their shared features in opposite direction, F=−1.

Seed-dependent connectivity

The strength of the relationship between a gene and the whole set of seeds is estimated using the connectivity function:

where γ is defined in Eq. 2 and w are weights that regulate the importance of each seed. In this study, we consider w=1, unless y_i is one of the seeds, or a probe set biding to the same transcript as the seed; in this case, to avoid bias, w=0 for that seed.

A connectivity score is defined as the fractional rank of C; that is the ranking normalised between 0 (lowest C) and 1 (highest C).

Bootstrapping, Monte Carlo and meta-connectivity score

Random sets of seeds are generated by Monte Carlo sampling, clusters are aggregated around them, and C and S are calculated. This procedure is repeated to generate null distributions and it provides an estimate of the probability of observing by chance a given value of C and S.

Bootstrapping is re-sampling with replacement of the original population; it is used to provide maximum likelihood best estimates when an analytical approach is not feasible (Hastie et al, 2001). Here, it is used to provide best estimates and confidence limits for C and S. These are used in a meta-analysis across several data sets to define a meta-connectivity score as:

where R[C(y_i)]_k is the fractional rank of C (Eq. 5), N_d is the number of datasets, σ²_k is the variance of the ranked C, R[C(y_i)]_k, in dataset k for gene y_i.

A common metagene between tumour types is derived by taking the Ĉ scores product, πĈ. This is effectively a rank product, as Ĉ is an average rank (Eq. 6).

Cumulative forest plots based on connectivity score

A summary expression score, E, is defined in each sample as the median of the absolute expression of the genes in the signature. The median is used as summary statistics to reduce the effect of outliers. A cumulative forest plot is defined: genes are added to the signature, one by one, in order of their connectivity, C, score so that genes that are introduced first have the highest connectivity. At each step, a summary expression, E, is derived using the new gene and genes from the previous steps. Samples are then ranked by their E value; this assigns a hypoxia score (HS) from lowest (least hypoxic) to highest (most hypoxic). Hypoxia score is then re-normalised between 0 and 1; introduced into a Cox multivariate analysis that includes the other significant clinical covariates and the hazard ratio (HR) of the HS is calculated.

Data sets, data processing and annotation

NCBI Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/) was searched for gene expression studies in cancer, published in peer-reviewed journals, where microarray were performed on frozen material extracted before chemotherapy, radiotherapy or adjuvant treatment. Eight data sets (Table 1) were selected that used similar platforms (Affymetrix U133A, B and plus2, www.affymetrix.com). Processing was performed using ‘simpleaffy’ (Wilson and Miller, 2005); the ‘gcrma’ function was used to estimate expression values, data were quantile-normalised and logged (base2). Other data sets were identified for validation in which different technologies were used (Table 1); non-Affymetrix data sets were processed as described in the original publications. More details on pre-processing and annotation are given in the Supplementary Methods.

Table 1 Data sets used to train and validate the hypoxia signature

Full size table

Results

Derivation of a hypoxia expression network

A hypoxia expression network was built first in a data set comprising 59 HNSCC tumour samples (Vice 125; Table 1) using well-characterised hypoxia-related genes identified from the literature covering a comprehensive set of hypoxia-induced pathways (set A, Supplementary Table S1). These were adrenomedullin (ADM), adenylate kinase 3-like 1 (AK3L1), BCL2/adenovirus E1B 19 kDa interacting protein 3 (BNIP3), carbonic anhydrase IX (CA9), enolase 1 (ENO1), hexokinase 2 (HK2), lactate dehydrogenase A (LDHA), phosphoglycerate kinase 1 (PGK1), solute carrier family 2 member 1 (SLC2A1) and solute carrier family 2 (VEGFA). The resultant network (Figure 1) was observed to map distinct regions of the Reactome (www.reactome.org) network and several hypoxia-related pathways (Figures 2; Supplementary Figure S1). The method was applied to additional HNSCC and BC training data sets (Table 1) with similar results (Supplementary Table S2).

In the resulting expression networks, high shared neighbourhood, S (Eq. 3), values between seed pairs were generally associated with a high pair-wise correlation. However, this relationship did not always hold. An example is given in Supplementary Figure S2, where genes in a published 245-gene literature list (LL) (Winter et al, 2007) were used as starting seeds. Many of the seeds with high pair-wise S but low correlation appeared in the same KEGG (http://www.genome.jp/kegg/) pathway but could not be detected in a straightforward correlation analysis (Supplementary Figure S2). Furthermore some seeds showed markedly different in vivo and in vitro behaviours; for example, PFKFB3 (set B, Supplementary Table S1) did not have significant overlap with any other seeds, whereas CCNG2 showed a consistent inverse correlation with other seeds (F<0; Eq. 4), supporting results from previous studies (Choi and Chen, 2005). Thus, the method was able to identify seeds that behave differently from their peers; for the rest of this study, only the conservative seed set A was used. This set showed higher pair-wise S values than any other set of randomly selected seeds (repeated 1000 times) from the 245-gene LL.

Seed-dependent connectivity identifies a hypoxia signature

Genes in the co-expression networks were ranked by their connectivity score, C (Eq. 5), and compared with the hypoxia 245-gene LL. As the latter is biased towards up-regulated genes (Harris, 2002), only genes showing consistent positive correlation with the initial seeds were considered. To avoid bias, the initial seeds were excluded from this comparison. The relative proportion of known hypoxia genes increased with increasing connectivity, C, score (Figure 2), confirming its benefit as a metric for predicting functional relationships. Similar results were observed with different clustering and pre-processing methods (Supplementary Figure S3). However, differences were observed between data sets. Much of this inter-experimental variation is likely to reflect differences in both the patient populations and the processing of the biological material. For example, both data sets GSE6791 and GSE3494, which showed a lower level of enrichment for hypoxia genes than others, featured samples with the highest proportions of tumour cells selected either by microdissection or visual scoring.

Next we selected a subset of ‘hub’ genes from the hypoxia network, with the goal of using them as a hypoxia signature. Genes with high connectivity, C (Eq. 5), score (P<0.01, estimated by Monte Carlo simulation) were considered (Supplementary Table S2). Each of these genes had a greater-than-expected overlap with the neighbourhoods of all other genes in the hypoxia network (Supplementary Figure S4). The seeds were only selected if they were hubs with respect to all other seeds. Using the Reactome database, we confirmed that pathways known to be regulated by hypoxia, such as glycolysis, gluconeogenesis, glucose metabolism and Cori cycle (recycling of lactic acid), were consistently over-represented in these genes (Figure 2; Supplementary Table S3). Similarly, GO analysis (http://genecodis.dacya.ucm.es) found over-representation (false discovery rate <0.05) of pathways such as glycolysis, phosphoinositide-mediated signalling, nuclear mRNA splicing, translational initiation, regulation of cell cycle, ubiquitin-dependent protein catabolism, apoptosis and regulation of cell proliferation. Over-represented molecular functions included ATP binding, nucleotide binding, lipoic acid binding, oxidoreductase and L-lactate dehydrogenase activity.

Meta-signature enrichment and the prognostic value of compact signatures

We selected genes that showed consistent high connectivity across data sets and derived meta-signatures for hypoxia in HNSCC and BC. Interestingly, although some of the data sets performed poorly on their own, meta-analysis signatures were robust to their inclusion and performed well (Figures 2B and C).

We assessed the prognostic relevance of meta-signatures in four independent data sets (Table 1). Samples were ranked using a summary expression score, E, of the genes in the signature; this produced a hypoxia score, which assigns a hypoxic status to the tumours in the validation data sets. Multivariate Cox analysis including available clinical factors was carried out using each data set; clinical variables were selected using backward-stepwise maximum likelihood. The HS was introduced into the reduced clinical model to estimate the prognostic significance of the meta-signatures independently from other clinical variables (Supplementary Figure S5 and Table S4).

To address whether smaller signatures with equal prognostic ability could be derived by using a more stringent C score, cumulative forest plots were generated in which genes were introduced into the HS calculation one by one, in decreasing order of their meta-Ĉ score (Supplementary Figure S5). Only a few genes were needed before the HR stabilised and a reduced signature was found to be at least as prognostic as a larger one (Supplementary Figure S5 and Table S4). Interestingly, when genes were introduced into the cumulative plots in random order, rather than by their ranked Ĉ score, more genes were needed to reach equivalent prognostic significance (Supplementary Figure S5).

A common hypoxia metagene across cancer types

Common hubs in HNSCC and BC were selected by considering, for each gene, the product, πĈ, of the Ĉ scores between the HNSCC and BC meta-analyses. A common metagene was derived by considering genes with πĈ>0.5 (Table 2; Supplementary Table S5). This hard cut-off was chosen because a gene with a πĈ score approaching that which would be expected by chance (πĈ≈0.5) in one tumour site would have to achieve a maximal score in the other tumour site to be included.

Table 2 Top-ranked genes of the common hypoxia metagene

Full size table

We investigated in cell lines potential regulation of genes in the common metagene by hypoxia and by HIF1a, the main mediator of the hypoxia response in cancer. We considered two data sets: a hypoxia time course in a panel of epithelial and endothelial non-malignant cells (Chi et al, 2006), and an HIF1a and HIF2a siRNA experiment in MCF7 BC cells (Elvidge et al, 2006) exposed to hypoxia. For details of these data we refer to the original publications. Although differences between cell lines and BC in vivo are expected, a high proportion of genes in the common metagene (38 out of 51) showed either regulation in the hypoxia time course or in the siRNA experiment (Figure 3A, B; Supplementary Table S5). Several of these genes were also predicted as HIF1a targets and showed potential HIF1a binding sites (Supplementary Table S5). Furthermore, 22 had already been found hypoxia regulated by previously published report (Supplementary Table S5). Overall approximately 80% (42 out of 51) of genes in the common metagene were confirmed by at least one validation, several of them by more than one.

The common hypoxia metagene was prognostic in independent data sets of different cancer types (Table 3) and showed greater prognostic power than (1) an in vitro derived hypoxia signature (Chi et al, 2006, 2) the initial seeds and (3) our 99-gene HNSCC hypoxia metagene derived previously (Winter et al, 2007). A signature derived by selecting genes co-expressed with VEGF in BC (Desmedt et al, 2008) had no independent prognostic significance (data not shown), in agreement with the published study. In a further validation using Oncomine (http://www.oncomine.org), all but one of the 15 top-ranked (by πĈ score) genes showed prognostic significance in at least one tumour site (P<0.0001). The only top gene for which prognostic significance was not reported in Oncomine, SLC2A1 (GLUT1), is prognostic in other studies (Oliver et al, 2004).

Table 3 Prognostic significance of the common hypoxia metagene versus other hypoxia signatures

Full size table

Finally, cumulative forest plots based on connectivity score (Figure 3) showed no further improvement in HR after addition of a small number of genes. Although differences were observed between HNSCC, BC and lung cancers, we found in all cases that a common signature reduced to a small number of πĈ score top-ranked genes was at least as prognostic as the full signature (Figure 3C, D; Table 3).

Discussion

Hypoxia is a frequent feature of poor-prognosis tumours, and the identification of common in vivo hypoxia-related genes is desirable both for prognostic stratification of patients and development of novel therapies. Although prognostic markers of hypoxia have been identified, there are discrepancies between studies and powerful methods used in large meta-analyses are needed to define generally applicable signatures. A method is described for defining a hypoxia signature that combines previous knowledge derived from in vitro experiments, with co-expression data produced from in vivo samples. We show that by constructing a gene expression network and then extracting core ‘hub’ (high connectivity) genes it is possible to define signatures that are significantly enriched for phenotype-specific genes, and pathways. Although we have used this method to derive a compact and clinically relevant signature of hypoxia in cancer, the approach is likely to have broader applicability.

Specifically, we used the described method in a meta-analysis of 1136 HNSCCs and BCs to derive tissue-specific and common signatures of hypoxia by including only genes that are consistently useful across multiple experiments or tissue types. The ability of the method to derive highly prognostic hypoxia signatures despite differences between data sets highlights its robustness.

The gene expression network used to construct the signature was found to be biologically relevant and to map to a discrete set of biochemical pathways, which is significantly enriched for hypoxia-regulated genes and pathways. This finding highlights that not only in vitro data can assist understanding of clinical data, but also the reverse, that clinical data can be used to formulate specific biological hypotheses.

Remarkably, a reduced common hypoxia metagene containing as few as three genes, namely VEGFA, SLC2A1 and PGAM1, was as prognostic as a large signature in independent BC and HNSCC series. Furthermore, it was more prognostic than several reported signatures when tested in a set of independent data sets, suggesting a level of general applicability. Specifically, genes with highest connectivity were also the most prognostic across a panel of cancers. This further validates the method, as prognosis was not used to select genes that were only ranked by their connectivity; and this ranking was derived in independent data sets. Although a reduced signature was prognostic in all tumour sites tested, the number of genes before convergence was lower in HNSCC and BC than lung cancer. This offers another positive control as this was a common signature between HNSCC and BC, thus it is expected to reflect their biology to a better extent; however, it also indicates a degree of tumour specificity. The common signature and the tumour-type-specific signatures are being evaluated in prospective prognostic and predictive studies in HNSCC and breast cancer.

In summary, this study uses information from in vitro experiments regarding the function of multiple genes combined with in vivo co-expression patterns to derive a common hypoxia metagene in multiple cancers that is highly prognostic, while being compact and robust.

Change history

16 November 2011
This paper was modified 12 months after initial publication to switch to Creative Commons licence terms, as noted at publication

References

Beer DG, Kardia SL, Huang CC, Giordano TJ, Levin AM, Misek DE, Lin L, Chen G, Gharib TG, Thomas DG, Lizyness ML, Kuick R, Hayasaka S, Taylor JM, Iannettoni MD, Orringer MB, Hanash S (2002) Gene-expression profiles predict survival of patients with lung adenocarcinoma. Nat Med 8: 816–824
Article CAS Google Scholar
Butte AJ, Kohane IS (2003) Relevance networks: a first step towards finding genetic regulatory networks within microarray data. In The Analysis of Gene Expression Data Parmigiani G, Gar-rett ES, Irizarry RA, Zeger S (eds). Springer-Verlag: New York
Google Scholar
Carroll JS, Meyer CA, Song J, Li W, Geistlinger TR, Eeckhoute J, Brodsky AS, Keeton EK, Fertuck KC, Hall GF, Wang Q, Bekiranov S, Sementchenko V, Fox EA, Silver PA, Gingeras TR, Liu XS, Brown M (2006) Genome-wide analysis of estrogen receptor binding sites. Nat Genet 38: 1289–1297
Article CAS Google Scholar
Chi JT, Wang Z, Nuyten DS, Rodriguez EH, Schaner ME, Salim A, Wang Y, Kristensen GB, Helland A, Børresen-Dale AL, Giaccia A, Longaker MT, Hastie T, Yang GP, van de Vijver MJ, Brown PO (2006) Gene expression programs in response to hypoxia: cell type specificity and prognostic significance in human cancers. PLoS Med 3: e47
Article Google Scholar
Choi P, Chen C (2005) Genetic expression profiles and biologic pathway alterations in head and neck squamous cell carcinoma. Cancer 104: 1113–1128
Article CAS Google Scholar
Chung CH, Parker JS, Karaca G, Wu J, Funkhouser WK, Moore D, Butterfoss D, Xiang D, Zanation A, Yin X, Shockley WW, Weissler MC, Dressler LG, Shores CG, Yarbrough WG, Perou CM (2004) Molecular classification of head and neck squamous cell carcinomas using patterns of gene expression. Cancer Cell 5: 489–500
Article CAS Google Scholar
Cromer A, Carles A, Millon R, Ganguli G, Chalmel F, Lemaire F, Young J, Dembélé D, Thibault C, Muller D, Poch O, Abecassis J, Wasylyk B (2004) Identification of genes associated with tumorigenesis and metastatic potential of hypopharyngeal cancer by microarray analysis. Oncogene 23: 2484–2498
Article CAS Google Scholar
Desmedt C, Haibe-Kains B, Wirapati P, Buyse M, Larsimont D, Bontempi G, Delorenzi M, Piccart M, Sotiriou C (2008) Biological processes associated with breast cancer clinical outcome depend on the molecular subtypes. Clin Cancer Res 14: 5158–5165
Article CAS Google Scholar
Elvidge GP, Glenny L, Appelhoff RJ, Ratcliffe PJ, Ragoussis J, Gleadle JM (2006) Concordant regulation of gene expression by hypoxia and 2-oxoglutarate-dependent dioxygenase inhibition: the role of HIF-1alpha, HIF-2alpha, and other pathways. J Biol Chem 281: 15215–15226
Article CAS Google Scholar
Fox SB, Generali DG, Harris AL (2007) Breast tumour angiogenesis. Breast Cancer Res 9: 216
Article Google Scholar
Hahn MW, Kern AD (2005) Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks. Mol Biol Evol 22: 803–806
Article CAS Google Scholar
Harris AL (2002) Hypoxia—a key regulatory factor in tumour growth. Nat Rev Cancer 2: 38–47
Article CAS Google Scholar
Hastie R, Tibshirani J, Friedman H (2001) The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer-Verlag: New York
Book Google Scholar
Loi S, Haibe-Kains B, Desmedt C, Wirapati P, Lallemand F, Tutt AM, Gillet C, Ellis P, Ryder K, Reid JF, Daidone MG, Pierotti MA, Berns EM, Jansen MP, Foekens JA, Delorenzi M, Bontempi G, Piccart MJ, Sotiriou C (2008) Predicting prognosis using molecular profiling in estrogen receptor-positive breast cancer treated with tamoxifen. BMC Genomics 9: 239
Article Google Scholar
Miller LD, Smeds J, George J, Vega VB, Vergara L, Ploner A, Pawitan Y, Hall P, Klaar S, Liu ET, Bergh J (2005) An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival. Proc Natl Acad Sci USA 102: 13550–13555
Article CAS Google Scholar
Nordsmark M, Bentzen SM, Rudat V, Brizel D, Lartigau E, Stadler P, Becker A, Adam M, Molls M, Dunst J, Terris DJ, Overgaard J (2005) Prognostic value of tumor oxygenation in 397 head and neck tumors after primary radiation therapy. An international multi-center study. Radiother Oncol 77: 18–24
Article Google Scholar
Oliver RJ, Woodwards RT, Sloan P, Thakker NS, Stratford IJ, Airley RE (2004) Prognostic value of facilitative glucose transporter Glut-1 in oral squamous cell carcinomas treated by surgical resection; results of EORTC Translational Research Fund studies. Eur J Cancer 40: 503–507
Article CAS Google Scholar
Pyeon D, Newton MA, Lambert PF, den Boon JA, Sengupta S, Marsit CJ, Woodworth CD, Connor JP, Haugen TH, Smith EM, Kelsey KT, Turek LP, Ahlquist P (2007) Fundamental differences in cell cycle deregulation in human papillomavirus-positive and human papillomavirus-negative head/neck and cervical cancers. Cancer Res 67: 4605–4619
Article CAS Google Scholar
Raponi M, Zhang Y, Yu J, Chen G, Lee G, Taylor JM, Macdonald J, Thomas D, Moskaluk C, Wang Y, Beer DG (2006) Gene expression signatures for predicting prognosis of squamous cell and adenocarcinomas of the lung. Cancer Res 66: 7466–7472
Article CAS Google Scholar
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA 102: 15545–15550
Article CAS Google Scholar
van de Vijver MJ, He YD, van't Veer LJ, Dai H, Hart AA, Voskuil DW, Schreiber GJ, Peterse JL, Roberts C, Marton MJ, Parrish M, Atsma D, Witteveen A, Glas A, Delahaye L, van der Velde T, Bartelink H, Rodenhuis S, Rutgers ET, Friend SH, Bernards R (2002) A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med 347: 1999–2009
Article CAS Google Scholar
Wilson CL, Miller CJ (2005) Simpleaffy: a BioConductor package for Affymetrix Quality Control and data analysis. Bioinformatics 21: 3683–3685
Article CAS Google Scholar
Winter SC, Buffa FM, Silva P, Miller C, Valentine HR, Turley H, Shah KA, Cox GJ, Corbridge RJ, Homer JJ, Musgrove B, Slevin N, Sloan P, Price P, West CM, Harris AL (2007) Relation of a hypoxia metagene derived from head and neck cancer to prognosis of multiple cancers. Cancer Res 67: 3441–3449
Article CAS Google Scholar
Wolfe CJ, Kohane IS, Butte AJ (2005) Systematic survey reveals general applicability of ‘guilt-by-association’ within gene coexpression networks. BMC Bioinformatics 6: 227
Article Google Scholar

Download references

Acknowledgements

This work was supported by Cancer Research UK, the EU Integrated Project ACGT (FP6-IST-026996) and 7th Framework Programme METOXIA, the Oxford NIHR Comprehensive Biomedical Research Centre, the Manchester Experimental Cancer Medicine Centre and the Medical Research Council, UK. We thank Carla Moller for useful comments on the paper. Special thanks to all the authors of the studies summarised in Table 1.

Author information

Authors and Affiliations

Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, OX3 9DS, UK
F M Buffa & A L Harris
School of Cancer and Imaging Sciences, The University of Manchester, Manchester, M13 9PT, UK
C M West
Paterson Institute for Cancer Research, The University of Manchester, Manchester, M20 4BX, UK
C J Miller

Authors

F M Buffa
View author publications
You can also search for this author in PubMed Google Scholar
A L Harris
View author publications
You can also search for this author in PubMed Google Scholar
C M West
View author publications
You can also search for this author in PubMed Google Scholar
C J Miller
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to F M Buffa.

Additional information

Conflict of interest

FM Buffa, CJ Miller, CM West and AL Harris are applicants on a patent submission seeking to use a reduced hypoxia signature as a prognostic marker in cancer.

Supplementary Information accompanies the paper on British Journal of Cancer website (http://www.nature.com/bjc)

Supplementary information

Supplementary Figure S1 (PDF 184 kb)

Supplementary Figure S2 (PDF 213 kb)

Supplementary Figure S3 (PDF 73 kb)

Supplementary Figure S4 (PDF 91 kb)

Supplementary Figure S5 (PPT 309 kb)

Supplementary Table S1 (DOC 42 kb)

Supplementary Table S2 (XLS 107 kb)

Supplementary Table S3 (XLS 157 kb)

Supplementary Table S4 (DOC 45 kb)

Supplementary Table S5 (XLS 52 kb)

Suppl_online_material.pdf (PDF 545 kb)

Rights and permissions

From twelve months after its original publication, this work is licensed under the Creative Commons Attribution-NonCommercial-Share Alike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/

Reprints and permissions

About this article

Cite this article

Buffa, F., Harris, A., West, C. et al. Large meta-analysis of multiple cancers reveals a common, compact and highly prognostic hypoxia metagene. Br J Cancer 102, 428–435 (2010). https://doi.org/10.1038/sj.bjc.6605450

Download citation

Received: 08 September 2009
Accepted: 14 October 2009
Published: 19 January 2010
Issue Date: 19 January 2010
DOI: https://doi.org/10.1038/sj.bjc.6605450

Keywords

This article is cited by

Prognostic biomarkers for the response to the radiosensitizer nimorazole combined with RCTx: a pre-clinical trial in HNSCC xenografts
- Lydia Koi
- Verena Bitto
- Michael Baumann
Journal of Translational Medicine (2023)
Prognostic stratification based on HIF-1α signaling for evaluating hypoxia status and immune landscape in hepatocellular carcinoma
- Hongkai Zhuang
- Zedan Zhang
- Yajin Chen
Journal of Big Data (2023)
Tamoxifen-predictive value of gene expression signatures in premenopausal breast cancer: data from the randomized SBII:2 trial
- Christine Lundgren
- Julia Tutzauer
- Lisa Rydén
Breast Cancer Research (2023)
Exploiting convergent phenotypes to derive a pan-cancer cisplatin response gene expression signature
- Jessica A. Scarborough
- Steven A. Eschrich
- Jacob G. Scott
npj Precision Oncology (2023)
How the histological structure of some lung cancers shaped almost 70 years of radiobiology
- Katja R. Worth
- Ioanna Papandreou
- Ester M. Hammond
British Journal of Cancer (2023)

Subjects

Abstract

Background:

Methods:

Results:

Conclusion:

Similar content being viewed by others

Main

Materials and methods

Seed clustering

Shared neighbourhood

Seed-dependent connectivity

Bootstrapping, Monte Carlo and meta-connectivity score

Cumulative forest plots based on connectivity score

Data sets, data processing and annotation

Results

Derivation of a hypoxia expression network

Seed-dependent connectivity identifies a hypoxia signature

Meta-signature enrichment and the prognostic value of compact signatures

A common hypoxia metagene across cancer types

Discussion

Change history

16 November 2011

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Search

Quick links