- Open Access
Identification of key regulators of pancreatic cancer progression through multidimensional systems-level analysis
© Rajamani and Bhasin. 2016
- Received: 11 September 2015
- Accepted: 19 February 2016
- Published: 3 May 2016
Pancreatic cancer is an aggressive cancer with dismal prognosis, urgently necessitating better biomarkers to improve therapeutic options and early diagnosis. Traditional approaches of biomarker detection that consider only one aspect of the biological continuum like gene expression alone are limited in their scope and lack robustness in identifying the key regulators of the disease. We have adopted a multidimensional approach involving the cross-talk between the omics spaces to identify key regulators of disease progression.
Multidimensional domain-specific disease signatures were obtained using rank-based meta-analysis of individual omics profiles (mRNA, miRNA, DNA methylation) related to pancreatic ductal adenocarcinoma (PDAC). These domain-specific PDAC signatures were integrated to identify genes that were affected across multiple dimensions of omics space in PDAC (genes under multiple regulatory controls, GMCs). To further pin down the regulators of PDAC pathophysiology, a systems-level network was generated from knowledge-based interaction information applied to the above identified GMCs. Key regulators were identified from the GMC network based on network statistics and their functional importance was validated using gene set enrichment analysis and survival analysis.
Rank-based meta-analysis identified 5391 genes, 109 miRNAs and 2081 methylation-sites significantly differentially expressed in PDAC (false discovery rate ≤ 0.05). Bimodal integration of meta-analysis signatures revealed 1150 and 715 genes regulated by miRNAs and methylation, respectively. Further analysis identified 189 altered genes that are commonly regulated by miRNA and methylation, hence considered GMCs. Systems-level analysis of the scale-free GMCs network identified eight potential key regulator hubs, namely E2F3, HMGA2, RASA1, IRS1, NUAK1, ACTN1, SKI and DLL1, associated with important pathways driving cancer progression. Survival analysis on individual key regulators revealed that higher expression of IRS1 and DLL1 and lower expression of HMGA2, ACTN1 and SKI were associated with better survival probabilities.
It is evident from the results that our hierarchical systems-level multidimensional analysis approach has been successful in isolating the converging regulatory modules and associated key regulatory molecules that are potential biomarkers for pancreatic cancer progression.
- Systems biology
Cancer is a complex disease that leads to dysregulation of multiple biological processes and pathways at regulatory, transcriptional and translational domains of central dogma. An extensive amount of genomics, proteomics and epigenetics data has been generated to probe the role of molecules from these domains in different pathways and biological processes involved in cancer pathophysiology. Most studies have concentrated on the individual contribution of molecules in disease pathophysiology, ignoring the interactions between omics data from different genomics levels. The generation of integrated system-level networks including the molecules dysregulated at regulatory (miRNA, methylation), transcriptional (genes) and translational (proteins, metabolites) omics levels helps in generating a complex network dysregulated in the disease. The analysis of this complex network can shed light on critical pathways and key molecules driving disease progression.
Pancreatic cancer is a very aggressive form of cancer with poor diagnosis and dismal prognosis. Of the exocrine pancreatic cancers, 95 % are pancreatic ductal adenocarcinoma (PDAC), with increasing occurrence in the last decades . Patients are mostly diagnosed at an advanced stage, resulting in poor response to therapies including surgical resection, leading to very low (6 %) 5-year survival rates . The probable early clinical symptoms like thromboembolism and new-onset type II diabetes occur much earlier, though the symptoms are not necessarily associated with pancreatic cancer. The early stage progress of the disease could be much slower than previously thought, giving patients with early diagnosis a much better survival probability . To date, the diagnosis is based on clinical signs and pathology even though the early symptoms are vague and non-definitive. This raises an urgent need for the development of reliable diagnostic, prognostic and therapeutic biomarkers. The availability of diagnostic biomarkers from peripheral body fluids will enable routine screening owing to the ease of testing as opposed to the highly invasive methods used currently for diagnosis, while the prognostic markers of early stage PDAC will give patients a better treatment plan and survival chance. In this context, intensive literature mining efforts have been made [4, 5] to identify possible candidate biomarkers in PDAC diagnosis and treatment. These provide a good starting point to collate the genes identified as important players in PDAC that could serve as prognostic or diagnostic biomarkers, but also need extensive filtering and unification with regards to experimental conditions under which each study was performed. For example, studies on prognostic or predictive biomarkers identified from survival analysis are generally done on patients with specific mutations or patients following cancer therapy with specific adjuvant therapy. The result from each study varies with respect to the identified predictive molecules owing to differences in design strategy. Also, predictive signatures obtained from studies have good predictive power for the subset of patients with a similar disease profile as in the study, but fail in the other possible disease profiles, namely, different causal mutations or different adjuvant therapy. This calls for biomarker identification based on biological behavior of genes in PDAC, independent of subpopulation disease profiles. It also calls for a method to filter genes manifesting a causal relationship with disease pathophysiology from the genes being affected by advancement of the disease process.
Another area that needs to be addressed in identification of biomarkers in PDAC is the basic biological processes affected in PDAC. It has been shown that dysregulation of multiple genomic spaces occurs in many cancers and the resultant disease is a culmination of multiple mutations; dysregulated control mechanisms, like regulatory microRNAs and epigenetic control; and dysregulated genes and proteins. The role of small non-coding microRNAs (miRNA) in regulating gene expression in cancer is being extensively studied. It has been shown the oncogenes and tumor suppressor genes are targets of differentially expressed miRNAs in solid tumors . Pedersen et al. studied differential DNA methylation between normal and PDAC samples to identify epigenetic markers . To understand the underlying biological process, these changes have to be taken into consideration as a whole, thus accounting for the biological cross-talk between these different functional components.
With the availability of data from different omics studies and large compilations of multi-omics data like Gene Expression Omnibus (GEO) , the International Cancer Genome Consortium (ICGC)  and The Cancer Genome Atlas (TCGA) , it is now possible to compare and contrast cancer profiles from normal profiles in multiple omics dimensions. The complex nature of interactions leading to the pathophysiology of cancer warrants the use of systems biology-based approaches to understand the cause–effect relationship between the dysregulated molecules identified by traditional supervised analysis methods. Biological interaction networks provide insights into the biology that is fundamental to the disease process. Additionally, biological networks can help to identify key genes that are critical in pathophysiological cancer networks, thus leading to biomarker identification. These key genes (hubs) in the network regulate a plethora of important functional genes and their deletion, in many cases, results in lethality . It has also been illustrated that interactive networks encompassing multiple omics domains can provide better insights about key regulatory (KR) molecules than individual dysregulated gene-based analysis, by taking into consideration the gene interactions . In this study, we integrated the information obtained from multiple omics dimensions, namely, the transcriptome, regulatory miRNA and the epigenome, to build a systems-level network to identify the key molecules associated with PDAC. Further functional and survival analysis of the identified KR genes clearly depicted their association with pathways linked to cancer progression and survival. Additional file 1: Figure S1 shows a schematic of the methodology used in this study.
List of pancreatic ductal adenocarcinoma omics datasets used in multidimensional analysis
Affymetrix Human Genome U133 Plus 2.0
Affymetrix Human Gene 1.0 ST
Affymetrix Human Gene 1.0 ST
Illumina HumanHT-12 V4.0 expression bead chip
Febit human miRBase v11
Febit Homo sapiens miRBase 13.0 plus
NanoString nCounter Human miRNA assay (v1)
Illumina HumanMethylation450 Bead Chip
Illumina Infinium Human DNA Methylation 450
Agilent-014850 Whole Human Genome Microarray
Affymetrix Human Genome U133 Plus 2.0 Array
The pancreatic cancer miRNA datasets used in this study are listed in Table 1. Raw data from the Febit platform were normalized using the quantile normalization method after background correction and summarized with replicate median using the linear models for microarray data (limma) Bioconductor package in R. Data from NanoString nCounter were processed using the R package NanoStringNorm  using median background correction and quantile normalization. Agilent data were normalized using the AgiMicroRna package from Bioconductor [17, 18] that includes background correction and quantile normalization. The methylation data on pancreatic cancer were obtained from GEO and TCGA (Table 1). The pre-normalized data were median-centered to remove any batch effects.
The quality of the normalized data from Affymetrix gene expression datasets was assessed using the arrayQualityMetrics package from Bioconductor . Individual array quality was assessed using various quality control plots including M-A plots, and homogeneity between arrays was determined using boxplots and density estimate plots. Outliers were identified using heatmaps and dendrograms based on inter-array expression distances (mean absolute distance of the M-value for each pair of arrays). Datasets that did not show any class-based (i.e., normal or PDAC) clustering or had greater than 10 % outliers were not included in the study.
Unsupervised analysis was performed to further ascertain the quality of datasets and identify outliers without biological relevance, based on inter-array correlations using principal component analysis (PCA; prcomp module from the stats package in R). PCA projects multivariate data objects onto a lower dimensional space while retaining as much of the original variance as possible [20, 21]. PCA methodology captures the inherent gene expression patterns in the data and identifies the correlation among biologically distinct samples.
Supervised analysis to identify differentially expressed molecules
We used the limma package  from the Bioconductor project to identify differentially expressed molecules (genes, miRNA or methylation) between pancreatic cancer and normal samples. The sample groups were compared by fitting a linear model for each variable (normalized expression values) and applying empirical Bayes smoothing to identify differentially expressed molecules. The molecules with absolute fold change ≥ 2 and multiple test corrected (51) P-value < 0.05 were considered significantly differentially expressed in this conventional analysis approach. The list of differentially expressed molecules from each dataset was compared using Venn diagrams created using jvenn  (Additional file 2: Figure S2).
Meta-analysis of data from the epigenome, transcriptome and regulatory miRNA
Meta-analysis was performed to identify the differentially expressed molecules from multiple datasets using a rank-based approach (i.e., RankProd [24, 25]) in the meta-analysis of microarray (MAMA) R package . RankProd is a non-parametric statistical method that uses the ranks of differentially expressed molecules (between conditions compared) to obtain the combined signature from multiple studies. The fold change of genes from individual studies is transformed to rank of genes across studies, and genes that are consistently differentially expressed in multiple studies are ranked highly. The false-positive predictions were restricted to less than 5 % [false discovery rate (FDR) ≤ 0.05], based on 1000 class label-based random permutations. The differentially expressed genes were identified only on the basis of the FDR P-value without any fold change restrictions. The rank products method provides a robust way to overcome heterogeneity among datasets and reduces loss of useful data, which is a major problem associated with the traditional methods of finding consensually differentially expressed genes after strict cutoff-based filtering of differentially expressed genes in individual datasets (see previous section).
Similar rank products-based meta-analysis was performed on multiple regulatory miRNA expression datasets and epigenome datasets to obtain PDAC meta-signatures of regulatory miRNAs and differential methylation, respectively. These gene, miRNA and methylation meta-signatures constitute the multidimensional PDAC signature used for further analysis.
Defining interactions between multiple omics dimensions
To determine the interactions between the epigenome and transcriptome, the genes and methylations sites were annotated using official HUGO gene symbol (version hg19 of the human genome). The interactions between methylation and gene expression signatures were deduced based on consensus genes found in both gene and methylation meta-signatures. These genes, which show differential expression as well as hypo/hypermethylation, are considered to be the genes regulated by methylation (methylation-regulated genes) irrespective of the directionality of gene expression or methylation. Biplots of gene expression changes with either methylation or miRNA expression changes were generated to extract any patterns between expression profiles (Additional file 3: Figure S3).
Transcriptome-regulatory miRNA interactions
The miRNA–gene interactions were determined using the RmiR package  from Bioconductor. The algorithm performs consensus miRNA and target gene interaction prediction using multiple algorithms (e.g., miRBase and targetScan) to reduce the chance of false-positive results. The miRNA and target predictions were performed using PDAC gene expression and miRNA meta-signatures identified from the analysis described above. The patterns of expression between target genes and miRNA were determined by generating biplots on the basis of fold change of differentially expressed targeted genes and interacting miRNAs. It was not clear if all these interactions were functional in PDAC as the known interactions were obtained based on sequence similarity. To obtain functional interactions in PDAC from among all the predicted interactions identified using RmiR analysis, an additional filter was applied based on the correlation found between gene and miRNA expression data from the same tissue samples in a previous multi-omics study . The differentially expressed genes and miRNAs depicting significant correlation at the expression level calculated using the cor function in R (R ≥ 0.8 or R ≤ –0.8) were considered the functional gene–miRNA interactions. Further, the genes that were regulated both by methylation and miRNA were considered genes under multiple regulatory controls (GMCs).
Pathway and functional analyses
To identify biological pathways significantly overrepresented in miRNA-regulated and methylation-regulated genes, pathway enrichment analysis was performed using Ingenuity Pathway Analysis (IPA) (Qiagen) software. The knowledge base of this software consists of functions, pathways and network models derived by systematically exploring the peer-reviewed scientific literature. It calculates the P-value using Fisher’s exact test for each pathway and functional category, according to the fit of the user’s data to IPA databases. The P-value measures how likely the observed association between a specific pathway/function and the dataset would be if it were due only to random chance. The functional categories and pathways with FDR < 0.05 were considered to be significantly associated.
Systems biology analysis
The meta-signatures of miRNA, genes and methylation were utilized in generating the network of interacting molecules from multiple omics domains using Cytoscape 2.8, an open source platform for biomolecular network visualization . The gene interaction network was created using the Michigan Molecular Interactions (MiMI) plugin for Cytoscape . The network was created based on molecular interactions from multiple biological interaction databases like BIND, BioGRID, CCSB at Harvard, cPath, DIP, GO, HPRD, IntAct, InterPro, KEGG and PubMed . The gene–miRNA interactions were obtained using RmiR analysis as previously described and incorporated in the comprehensive network generation process from multidimensional PDAC meta-signatures. The methylation meta-signature was not included at this step because it was redundant to our eventual goal of creating a network of GMCs. Subsequently, a comprehensive GMC (CGMC) network was extracted from the global multidimensional PDAC meta-signature network, containing GMCs along with their first interactive neighbors. The first neighbors were included to build a cohesive network around the GMCs because the GMCs themselves represent isolated hubs. Network-based pathways enrichment analysis was performed using the GeneMANIA  plugin in Cytoscape. Significant pathways associated with subnetworks of selected genes were determined using the FDR-adjusted hypergeometric test-based Q-values reported by GeneMANIA for a query gene-based search for pathways enrichment.
Identification of key regulatory molecules from the CGMC network
The network topological parameters were calculated using the CentiScaPe Plugin  in Cytoscape. Topological analysis was performed on the CGMC network to identify nodes that are critical for network stability. We performed topological analysis using node stress and neighborhood connectivity parameters from CentiScaPe to calculate an average rank (AR) score for each node. Node stress is a node centrality index calculated by measuring the number of shortest paths passing through a node . The neighborhood connectivity of a node n is defined as the average connectivity of all neighbors of n . The AR score was determined by the average rank of the nodes with respect to these two indices. The nodes with AR score cutoff ≤10 were considered to be KR hubs.
Partitioning around medoids analysis
Partitioning around medoids (PAM)  is a robust clustering method for partitioning based on a dissimilarity matrix. We used PAM implementation from the cluster package in R to partition the CGMC network around the identified KR hubs based on Euclidean distance of CGMC network gene expression matrix.
Gene set enrichment analysis
Gene set enrichment analysis (GSEA)  was performed (using the javaGSEA desktop application) on the CGMC network to determine the importance of the KR subnetworks (gene sets) created using PAM analysis. The GSEA algorithm determines whether a gene set is specifically enriched at the leading edges of a reference gene list sorted with respect to specified parameter of interest (class-based t-test statistic). The edges of the ranked list contain the most discriminatory genes in the gene list and the gene sets depicting overlap with these edges are significant. The significance of gene set enrichment was determined on the basis of 1000 class-based permutation tests . Gene sets with multiple test-corrected P-values less than 25 % (FDR ≤ 0.25) were considered significant.
Self organizing map clustering and survival analysis of key regulatory genes
To identify group-dependent patterns from the expression profiles of KR hubs identified using systems-level approaches, the self organizing map (SOM) clustering technique was adopted . SOM allows the grouping of gene expression patterns into an imposed structure in which adjacent clusters are related, thereby identifying sets of samples that follow certain expression patterns across KR hubs. We performed sample-based SOM clustering (som package in R) using Pearson correlation coefficient-based distance metrics to form two sample clusters with very distinct expression profiles of KRs. Survival analysis was performed on these clusters using the Kaplan–Meier analysis from the survival package in R . Survival analysis was also performed on individual KRs. The Kaplan–Meier estimate is a non-parametric maximum likelihood estimate of the survival function created based on the number of survivors and non-survivors at any given time point. The results of the survival analysis were visualized using a Kaplan–Meier survival curve. The significance of difference in survival among different groups was estimated using log-rank testing. The results were considered significant if the P-values from the log rank test were less than 0.075.
Meta-analysis of the epigenome, transcriptome and regulatory miRNA in pancreatic ductal adenocarcinoma
Pancreatic cancer data from multiple biological domains (omics data types) were integrated to generate a multidimensional PDAC signature associated with disease progression. The study included 177 gene expression profiles, 224 miRNA expression profiles and 210 DNA methylation profiles from different pancreatic cancer studies. Details about the datasets from which these profiles were obtained can be found in Table 1. After normalization and preprocessing, the individual datasets were analyzed using unsupervised and supervised analysis methods.
Gene expression meta-signature associated with pancreatic ductal adenocarcinoma
Rank-based meta-analysis using the rank products (RP) method was performed to address this inherent heterogeneity among datasets. RP performs a rank-based comparison of genes from different experiments to identify genes that are consistently ranked high (upregulated) and low (downregulated), clearly depicting advantages over linear modeling-based methods in meta-analysis by better handling noise from heterogeneous datasets. The RP-based method depicts a slightly better overlap among differentially expressed genes identified in meta-analysis (Additional file 2: Figure S2A). Owing to this better overlap and the inherent advantage of RP in handling noise, we adopted RP as opposed to limma for meta-analysis. We obtained a transcriptome-specific signature of 5391 genes that were consistently significantly differentially expressed (FDR ≤ 0.05) in PDAC compared to normal among all the datasets. A subset of top differentially expressed genes obtained from meta-analysis is shown in Fig. 1b. It is evident from the heatmap that differentially expressed genes identified from the meta-analysis are consistently over-expressed or under-expressed across all datasets. Many of these differentially expressed genes were initially unidentified because of filtering criteria, but selection using rank-based meta-analysis retained them appropriately. The most downregulated genes identified included genes involved in basic metabolism and functioning of pancreatic cells, for example, SYCN involved in exocytosis in pancreatic acinar cells, FAM3B involved in controlling basal insulin secretion in pancreatic beta cells, and many enzymes involved in protein processing and amino acid metabolism. The consistently upregulated genes include genes linked to cell cycle progression (e.g., S100A11), cytoskeletal reorganization (e.g., FNBP1 and CAPZB), kinetochore formation and spindle checkpoint (e.g., ZWINT), immune and inflammatory response (e.g., IL2RG, IL8 and CXCL5), angiogenesis (e.g., PLXDC2) and metalloproteinase (e.g., ADAM8). It was previously shown in a meta-analysis of PDAC expression data that many of these genes are indeed differentially expressed in PDAC . Of the 51 genes shown in that study as potential PDAC biomarkers, 38 were also enriched in our meta-analysis results, including S100A11, ADAM8 and IL8.
miRNA meta-signature associated with pancreatic ductal adenocarcinoma
The miRNA meta-analysis identified 109 consistently significantly differentially expressed miRNAs in PDAC compared to normal, with FDR ≤ 0.05 (miRNA PDAC meta-signature). A heatmap of the top differentially expressed miRNAs obtained from meta-analysis is shown in Fig. 2b. The heatmap clearly depicts uniform upregulation or downregulation of different miRNAs across 3 out of 4 datasets. MiRNAs miR-148a and miR-375 were significantly downregulated, whereas miR-93, miR-21, miR-10a, miR-107 and miR-23a were significantly upregulated in PDAC. Sun et al. found a similar dysregulation of miR-148a, miR-375, miR-21 and miR-10a in PDAC . It was shown that miR-10a could be involved in the tumor invasiveness through HOXA1 suppression in PDAC . Our results are consistent with previous findings based on GEO and TCGA data, that miR-21 is a potential biomarker in pancreatic cancer . miR-107, miR-222 and miR-148a are differentially expressed in pancreatic cancer and miR-222 is also known to be associated with poor survival probabilities .
Epigenetics meta-signature associated with pancreatic ductal adenocarcinoma
The multidimensional PDAC signature consists of all significantly differentially expressed genes, miRNAs and methylation sites in PDAC identified from the meta-analyses (FDR ≤ 0.05) as described above.
Understanding gene–miRNA and gene–epigenetic bimodal interactions associated with pancreatic ductal adenocarcinoma
The interaction between PDAC-associated gene expression and methylation changes was deciphered by comparing the differentially expressed and differentially methylated genes on the basis of HUGO gene symbols. Among the PDAC gene meta-signature, 715 genes showed overlap with the methylation meta-signature and were considered to be methylation-regulated genes (Fig. 4a).
The inverse proportionality postulate between control mechanisms and gene expression was tested by generating biplots of gene expression changes with either methylation or miRNA expression changes (Additional file 3: Figure S3). The predicted miRNA–gene interactions showed no direct or inverse correlation between gene and miRNA expression. Methylation biplots showed that hyper-methylation was associated with both upregulation and downregulation of gene expression. On the other hand, hypo-methylated genes (70 out of the 715 methylation-regulated genes) were mostly upregulated at the gene expression level (Fig. 4b). These results reinforce that the control mechanisms on gene expression are complex multi-level interactions and hence it is not possible to find a simple one-to-one relationship at a molecular level.
There were 189 genes that were common between the miRNA-regulated and methylation-regulated genes (Fig. 4a.) and thus are classified as genes under multiple regulatory controls (GMCs). Among the GMCs, the gene expression signature was ~70 % inversely correlated with the methylation signature, as expected, but the miRNA interactions did not show any significant pattern. This is probably because the miRNA–gene interactions were predicted on the basis of sequence similarity but have not been verified experimentally in pancreatic cancer. Furthermore, it has been suggested the gene–miRNA interactions do not necessarily have to be mechanisms for gene regulation, but could instead act as mechanisms for modulating cellular miRNA levels (sponge interactions) . To understand this complexity and identify the functional gene–miRNA interactions in PDAC, we analyzed a PDAC dataset containing sample-matched gene and miRNA expression profiles [28, 46]. We identified the functional gene–miRNA interactions on the basis of correlation analysis of the 49 miRNAs with 1150 predicted gene targets, previously identified from the sequence similarity-based prediction. The analysis identified 36 functionally relevant miRNAs that interact with at least one of the 189 GMCs. A hierarchical network of GMCs and their functionally relevant miRNA regulators is shown in Fig. 4b. The GMC nodes are dually colored for gene expression and methylation, showing largely inverse correlation, and the miRNAs are colored based on their differential expression values.
The results from functional gene–miRNA interaction analysis show that miR-210 and miR-222 are upregulated in PDAC and are correlated with many of the GMCs (Fig. 4b). The role of miR-210 has been extensively studied in connection with hypoxia in the tumor micro-environment in many solid tumors, including pancreatic cancer, and its oncogenic role in cancer progression is being explored . Similarly, miR-222 has been shown to be involved in epithelial-to-mesenchymal transition (EMT) in breast cancers, leading to an aggressive breast cancer phenotype . The miRNAs miR-194 and miR-195, known to be involved in anti-tumor mechanisms in the cell, are downregulated in our PDAC signature and correlated with many of the identified GMCs in our study. For example, miR-194 is an epithelial marker and is known to target many genes involved in the epithelial-to-mesenchymal transition, thereby reversing the transition , whereas miR-195 has been shown to induce apoptosis in human embryonic stem cells of neural origin .
The comparison of methylation-regulated and miRNA-regulated genes on the basis of significantly impacted (-log(P-value) ≥ 10) functional categories revealed a considerable overlap between the two sets. Commonly affected functional categories include cellular growth and proliferation, cellular development, cellular movement, cell death and survival, organismal survival, organismal development, embryonic development, cancer, gastrointestinal disease, and cardiovascular system development (Additional file 4: Figure S4). Methylation-regulated genes uniquely associated with pro-inflammatory and immune response-related pathways like the IL-1, IL-8 and NFκB signaling pathways. They were also enriched in pathways involved in cancer invasiveness and cellular motility, such as leukocyte extravasation, glioma invasiveness and Paxillin signaling pathways (Fig. 4c). The pathways specific to miRNA-regulated genes involved normal developmental pathways that regulate cell survival, cell proliferation and angiogenesis, such as TGFβ, HGF, Wnt and STAT3 signaling pathways (Fig. 4d). Similar to the functional categories, we also observed a significant overlap in enriched canonical pathways between methylation-regulated and miRNA-regulated genes, including PTEN, integrin and axonal guidance signaling pathways (marked with asterisk in Fig. 4c, d).
Systems biology analysis of the network of genes under multiple regulatory controls and identification of key regulatory hubs
One of the major shortcomings of the traditional approach is the assumption that the highly differentially expressed genes play causal roles in disease processes without differentiating driver and passenger gene expression changes. Multiple-systems biology studies have shown that the key driver genes, those playing a causal role in initiating signaling cascades that result in multiple highly differentially expressed genes, are only moderately altered in cancer and are not identified by the conventional approaches based on degree of differential expression . There is also the task of narrowing down to a few critical genes from a long list of differentially expressed genes identified from these approaches. The bimodal filtering of differentially expressed genes based on alteration of upstream regulatory controls developed in this study assists in identifying a set of candidate genes with the fewest false positives and a high potential of association with disease pathophysiology. It is hypothesized that these perturbations that channel through multiple genomic spaces are the ones that perform regulatory bottle-neck roles in driving disease progression and might provide better diagnostic/prognostic or therapeutic biomarkers . We took a systems-level approach to address the scale-free nature of biological systems in which a few key hubs regulate a majority of the other genes in the network . The directionality of flow of perturbations can also be identified from this network-based approach and this is important in isolating the causal changes from all other cancer-associated changes.
KR hubs were identified from the CGMC network through topological analysis on the basis of node interactions. The nodes were ranked based on neighborhood connectivity and node stress. From this analysis, eight KR hubs were identified (AR score ≤ 10), namely E2F3, HMGA2, RASA1, IRS1, NUAK1, ACTN1, SKI and DLL1. The top five hubs according to the AR score are shown in Fig. 6b (I–V). To understand the biological role of these KR hubs, we created KR subnetworks constituting these KR hubs along with their first neighbors and performed functional enrichment analysis of these subnetworks. E2F3 is a top-ranking KR hub with 24 interacting neighbors that were significantly linked with G1/S transition of the mitotic cell cycle (FDR = 0.0021). The NUAK1 subnetwork consisted of 17 neighbors associated with the TLR signaling pathway (FDR = 0.00041). Previous studies have shown that tumor cells express functional TLRs and their activation leads to tumor cell proliferation and resistance to apoptosis . The ACTN1 subnetwork had 22 interacting neighbors significantly associated with cell adhesion and cell junction organization (FDR = 8.57E − 05). The RASA1 hub had 34 interacting neighbors involved in many growth response signaling pathways like ERBB signaling (FDR = 4.71E − 11). SKI interacted with 13 neighboring nodes in the CGMC network and was involved in endocrine system development, pancreas development and negative regulation of TGFβ receptor signaling (FDR = 5.07E − 05).
Enrichment of key regulatory subnetworks in the overall comprehensive genes under multiple regulatory controls network
Gene set enrichment analysis of KR-PAM clusters
Survival analysis of identified key regulators in pancreatic cancer
The significance of the KRs was further tested using survival analysis on the eight identified KRs in two PDAC expression datasets. The samples were partitioned into two groups by the most contrasting expression characteristics for the eight KRs using the SOM clustering approach, and survival analysis was performed on the two clusters (Additional file 6: Figure S6; Ai. and Bi). The results showed that the KRs were able to clearly discriminate between better versus poor survivors (P-values 0.0368–0.0464), indicating their prognostic role in PDAC.
One of the most challenging tasks in the analysis of large-scale omics data is the identification of the causal (KR) changes and differentiating them from bystander effects associated with the disease. Owing to the large number of genes measured in genomic assays, it is difficult to achieve statistical power except for the most highly differentially expressed molecules. In many cases the KRs remain unidentified owing to subtle changes at expression level. Even when identified as being differentially expressed, KRs are hard to characterize based on just their differential expression measure from a long list of altered molecules. However, the alternative approach of identifying regulatory genes from biochemical studies is laborious and time consuming and does not compare to genome-scale experiments in providing comprehensive coverage.
Numerous studies have shown that network-based approaches that take into consideration the contribution of individual genes along with their interactions are an effective solution in identifying KRs of disease pathophysiology . It has been hypothesized that KRs are functional bottlenecks in disease initiation and progression because they regulate the expression of a large number of downstream effector genes . Most of these studies performed the network analysis on gene expression data, ignoring the contribution of upstream regulatory genomics and epigenetics spaces. We hypothesize that molecules whose alterations result in upstream regulatory changes in a feedback manner as well as in important downstream effector changes are critical for disease progression and pathophysiology.
Keeping this in mind, we adopted a multistep approach integrating the cross-talk between different genomics spaces (the epigenome, transcriptome and regulatory miRNAs) as well as an interactive network analysis to identify the key driver genes associated with PDAC. To achieve this, we initially performed rank-based meta-analysis individually on epigenetic, transcriptional and miRNA expression data to define a comprehensive landscape of molecular alterations in PDAC. It is well established that rank-based methods perform more reliably and consistently than non-parametric and parametric methods for meta-analysis of microarray data . In this study we implemented a rank-based meta-analysis approach, which enabled us to identify consistently differentially expressed molecules from multiple studies and increased the statistical power by eliminating false-positive genes from individual studies while retaining consistently differentially expressed molecules with subtle changes.
To generate a comprehensive view of alterations associated with PDAC as well as understanding the cross-talk among different genomic spaces, we adopted a multidimensional modeling-based approach. This step involved the integration of meta-analysis results from each omics dimension (the epigenome, transcriptome and regulatory miRNAs) and is important for understanding the regulatory cascades and cross-talk between gene expression and upstream regulatory mechanisms. Understanding of the role of these epigenetic and post-transcriptional modifications in the development of disease phenotype could be crucial to identify regulatory bottlenecks that play causal roles in the disease. Profound epigenetic changes have been shown to associate with the onset and progression of cancer, and changes in DNA methylation is one of the potent underlying mechanisms [58, 59]. Similarly, significant dysregulation has been reported in regulatory miRNA expression in cancer [60, 61]. In fact, it has been shown that miR-21 and miR-222, which formed part of our multidimensional PDAC meta-signature, are both probable prognostic markers for PDAC . It is important to note that miR-222 was also identified as a KR of PDAC pathophysiology in our systems-level network analysis. In this respect, the interactive network analysis of miRNA subnetworks created from the global PDAC meta-signatures network reiterated the importance of the systems-level approach in understanding the cause–effect relationship between the dysregulated molecules (Additional file 7: Figure S7). The analysis showed that miRNA-221 and miRNA-222 were highly correlated and were at the center of a subnetwork created around miR-222 (Additional file 7: Figure S7B), while the highly dysregulated miRNA-210 was one of the peripheral molecules in its subnetwork (Additional file 7: Figure S7A). In a conventional analysis of dysregulated genes, all three miRNAs would have been weighted equally in their contribution to disease pathophysiology, whereas a systems-level analysis clearly shows that miRNA-221 and miRNA-222 play a more causal role than miRNA-210 in PDAC disease progression. It was also evident from our analysis that miR-194, an epithelial marker that is downregulated in PDAC, is the center of the subnetwork created from miR-210.
It has been postulated that dysregulation in regulatory mechanisms can play important causal roles in disease initiation and/or progression [61, 62]. In many instances, these regulatory layers provide the missing links to rationalize the observed variance seen in gene expression. It is also established that regulatory miRNAs are themselves regulated by epigenetic changes and sponge miRNA targets , which would be another layer of regulatory control that needs to be factored into our multidimensional model in the future. We overcame this additional complexity here by including only the miRNA changes that were correlated with gene expression changes from multi-omics studies with both gene expression and miRNA expression data on matched samples. Overall, in the multidimensional modeling step we identified 189 genes that manifested concerted changes across gene expression and their regulatory mechanisms in pancreatic cancer. As expected, this consensus multidimensional PDAC signature was enriched in important pathways associated with cancer progression, such as cell cycle regulation, cell growth and proliferation, cell motility, cell-based immunity, and inflammatory response.
Further, to understand the cause–effect relationship and the mechanistic overview of PDAC progression, we performed interaction network analysis on GMCs on the basis of interactome information. This multidimensional network was analyzed for topological cues to identify the most probable candidates for KRs of the network and, by extension, key disease modulators. These KR molecules represent regulatory bottlenecks and play a causal role in disease initiation and progression. The presence of these regulatory bottlenecks in establishing disease phenotype has been shown in multiple studies relating to cancer [57, 63–68]. These studies showed that genes with no prior known oncogenic status or known mutations associated with a particular disease could still be influential in controlling many disease-related genes, and thus serve as regulatory bottlenecks that can be identified only by a systems-level approach. In our analysis, ranking based on a node centrality measure (node stress) and neighborhood connectivity identified eight probable regulatory hubs in the network. Thus, our consensus multidimensional approach along with the systems-level network analysis resulted in a dramatic reduction of testable hypotheses from 6863 molecules to eight genes. Finally, the ability to identify the KRs of disease processes independent of sample-matched omics profiles would be a great advantage that allows, in theory, the use of all available genomics data associated with the disease. However, this also imposes a huge responsibility of maintaining consistency in the subtype context across the multiple dimensions of analysis to produce meaningful and concordant results.
In this study we were able to isolate converging regulatory modules and KR molecules associated with PDAC pathophysiology by employing a hierarchical systems-level multidimensional data analysis approach. GSEA on subnetworks of these KRs depicted significant enrichment in the leading edges of the CGMC network genes, suggestive of their important roles in PDAC progression. The identified KRs were able to differentiate poor survivors from better survivors, further strengthening our evidence that they are genes of prognostic value and can be used as probable therapeutic targets in treatment of pancreatic cancer. Data integration methods for multidimensional data are still in their infancy, but multi-omics integration using new experimental and computational approaches is already producing useful functional models and meaningful insights into disease pathophysiology.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Jemal A, Siegel R, Ward E, Hao Y, Xu J, Thun MJ. Cancer statistics, 2009. CA Cancer J Clin. 2009;59(4):225–49. doi:10.3322/caac.20006.View ArticlePubMedGoogle Scholar
- Siegel R, Ma J, Zou Z, Jemal A. Cancer statistics, 2014. CA Cancer J Clin. 2014;64(1):9–29. doi:10.3322/caac.21208.View ArticlePubMedGoogle Scholar
- Nai Q, Luo H, Zhang P, Hossain MA, Gu P, Sidhom IW, et al. How early can pancreatic cancer be recognized? A case report and review of the literature. Case Rep Oncol. 2015;8(1):46–9. doi:10.1159/000375121.View ArticlePubMedPubMed CentralGoogle Scholar
- Bünger S, Laubert T, Roblick UJ, Habermann JK. Serum biomarkers for improved diagnostic of pancreatic cancer: a current overview. J Cancer Res Clin Oncol. 2011;137(3):375–89. doi:10.1007/s00432-010-0965-x.View ArticlePubMedGoogle Scholar
- Jamieson NB, Carter CR, McKay CJ, Oien KA. Tissue biomarkers for prognosis in pancreatic ductal adenocarcinoma: a systematic review and meta-analysis. Clin Cancer Res. 2011;17(10):3316–31. doi:10.1158/1078-0432.CCR-10-3284.View ArticlePubMedGoogle Scholar
- Volinia S, Calin GA, Liu CG, Ambs S, Cimmino A, Petrocca F, et al. A microRNA expression signature of human solid tumors defines cancer gene targets. Proc Natl Acad Sci U S A. 2006;103(7):2257–61. doi:10.1073/pnas.0510565103.View ArticlePubMedPubMed CentralGoogle Scholar
- Pedersen KS, Bamlet WR, Oberg AL, de Andrade M, Matsumoto ME, Tang H, et al. Leukocyte DNA methylation signature differentiates pancreatic cancer patients from healthy controls. PLoS One. 2011;6(3):e18223. doi:10.1371/journal.pone.0018223.View ArticlePubMedPubMed CentralGoogle Scholar
- Edgar R, Domrachev M, Lash AE. Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002;30(1):207–10.View ArticlePubMedPubMed CentralGoogle Scholar
- Zhang J, Baran J, Cros A, Guberman JM, Haider S, Hsu J, et al. International Cancer Genome Consortium Data Portal--a one-stop shop for cancer genomics data. Database (Oxford). 2011;2011:bar026. doi:10.1093/database/bar026.
- Weinstein JN, Collisson EA, Mills GB, Shaw KR, Ozenberger BA, Ellrott K, et al. The Cancer Genome Atlas Pan-Cancer analysis project. Nat Genet. 2013;45(10):1113–20. doi:10.1038/ng.2764.View ArticlePubMedPubMed CentralGoogle Scholar
- Jeong H, Mason SP, Barabási AL, Oltvai ZN. Lethality and centrality in protein networks. Nature. 2001;411(6833):41–2. doi:10.1038/35075138.View ArticlePubMedGoogle Scholar
- Chuang HY, Lee E, Liu YT, Lee D, Ideker T. Network-based classification of breast cancer metastasis. Mol Syst Biol. 2007;3:140. doi:10.1038/msb4100180.View ArticlePubMedPubMed CentralGoogle Scholar
- R Development Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2015. URL https://www.r-project.org/
- Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, et al. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003;4(2):249–64. doi:10.1093/biostatistics/4.2.249.View ArticlePubMedGoogle Scholar
- Du P, Kibbe WA, Lin SM. lumi: a pipeline for processing Illumina microarray. Bioinformatics. 2008;24(13):1547–8. doi:10.1093/bioinformatics/btn224.View ArticlePubMedGoogle Scholar
- Waggott D, Chu K, Yin S, Wouters BG, Liu FF, Boutros PC. NanoStringNorm: an extensible R package for the pre-processing of NanoString mRNA and miRNA data. Bioinformatics. 2012;28(11):1546–8. doi:10.1093/bioinformatics/bts188.View ArticlePubMedPubMed CentralGoogle Scholar
- López-Romero P. Pre-processing and differential expression analysis of Agilent microRNA arrays using the AgiMicroRna Bioconductor library. BMC Genomics. 2011;12:64. doi:10.1186/1471-2164-12-64.View ArticlePubMedPubMed CentralGoogle Scholar
- López-Romero P, González MA, Callejas S, Dopazo A, Irizarry RA. Processing of Agilent microRNA array data. BMC Res Notes. 2010;3:18. doi:10.1186/1756-0500-3-18.View ArticlePubMedPubMed CentralGoogle Scholar
- Kauffmann A, Gentleman R, Huber W. arrayQualityMetrics--a bioconductor package for quality assessment of microarray data. Bioinformatics. 2009;25(3):415–6. doi:10.1093/bioinformatics/btn647.View ArticlePubMedPubMed CentralGoogle Scholar
- Wang C, Rao N, Wang Y. Principal component analysis for exploring gene expression patterns. Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2007;24(4):736–41.PubMedGoogle Scholar
- Yeung KY, Ruzzo WL. Principal component analysis for clustering gene expression data. Bioinformatics. 2001;17(9):763–74.View ArticlePubMedGoogle Scholar
- Smyth GK. Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004;3:Article3. doi:10.2202/1544-6115.1027.
- Bardou P, Mariette J, Escudié F, Djemiel C, Klopp C. jvenn: an interactive Venn diagram viewer. BMC Bioinformatics. 2014;15:293. doi:10.1186/1471-2105-15-293.View ArticlePubMedPubMed CentralGoogle Scholar
- Breitling R, Armengaud P, Amtmann A, Herzyk P. Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments. FEBS Lett. 2004;573(1-3):83–92. doi:10.1016/j.febslet.2004.07.055.View ArticlePubMedGoogle Scholar
- Hong F, Breitling R, McEntee CW, Wittner BS, Nemhauser JL, Chory J. RankProd: a bioconductor package for detecting differentially expressed genes in meta-analysis. Bioinformatics. 2006;22(22):2825–7. doi:10.1093/bioinformatics/btl476.View ArticlePubMedGoogle Scholar
- Ihnatova I. MAMA: Meta-Analysis of MicroArray. R package version 2.2.1. http://cran.r-project.org/package=MAMA. 2013.
- Favero F. RmiR: package to work with miRNAs and miRNA targets with R. R package version 1.18.0. http://www.bioconductor.org/packages/release/bioc/html/RmiR.html. 2015.
- Park M, Kim M, Hwang D, Kim WK, Kim SK, Shin J, et al. Characterization of gene expression and activated signaling pathways in solid-pseudopapillary neoplasm of pancreas. Mod Pathol. 2014;27(4):580–93. doi:10.1038/modpathol.2013.154.View ArticlePubMedGoogle Scholar
- Smoot ME, Ono K, Ruscheinski J, Wang PL, Ideker T. Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics. 2011;27(3):431–2. doi:10.1093/bioinformatics/btq675.View ArticlePubMedPubMed CentralGoogle Scholar
- Tarcea VG, Weymouth T, Ade A, Bookvich A, Gao J, Mahavisno V, et al. Michigan molecular interactions r2: from interacting proteins to pathways. Nucleic Acids Res. 2009;37(Database issue):D642–6. doi:10.1093/nar/gkn722.View ArticlePubMedPubMed CentralGoogle Scholar
- Montojo J, Zuberi K, Rodriguez H, Bader GD, Morris Q. GeneMANIA: Fast gene network construction and function prediction for Cytoscape. F1000Res. 2014;3:153. doi:10.12688/f1000research.4572.1.PubMedPubMed CentralGoogle Scholar
- Scardoni G, Petterlini M, Laudanna C. Analyzing biological network parameters with CentiScaPe. Bioinformatics. 2009;25(21):2857–9. doi:10.1093/bioinformatics/btp517.View ArticlePubMedPubMed CentralGoogle Scholar
- Tong C, Niu J, Dai B, Xie Z. A novel complex networks clustering algorithm based on the core influence of nodes. ScientificWorldJournal. 2014;2014:801854. doi:10.1155/2014/801854.PubMedPubMed CentralGoogle Scholar
- Maslov S, Sneppen K. Specificity and stability in topology of protein networks. Science. 2002;296(5569):910–3. doi:10.1126/science.1065103.View ArticlePubMedGoogle Scholar
- Kaufman L, Rousseeuw PJ. Finding Groups in Data: an Introduction to Cluster Analysis. New York: Wiley; 1990.View ArticleGoogle Scholar
- Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–50. doi:10.1073/pnas.0506580102.View ArticlePubMedPubMed CentralGoogle Scholar
- Hollander M, Wolfe DA. Nonparametric Statistical Methods. 2nd ed. New York: John Wiley & Sons; 1999.Google Scholar
- Tamayo P, Slonim D, Mesirov J, Zhu Q, Kitareewan S, Dmitrovsky E, et al. Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. Proc Natl Acad Sci U S A. 1999;96(6):2907–12.View ArticlePubMedPubMed CentralGoogle Scholar
- Haibe-Kains B, Desmedt C, Sotiriou C, Bontempi G. A comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all? Bioinformatics. 2008;24(19):2200–8. doi:10.1093/bioinformatics/btn374.View ArticlePubMedPubMed CentralGoogle Scholar
- Goonesekere NC, Wang X, Ludwig L, Guda C. A meta analysis of pancreatic microarray datasets yields new targets as cancer genes and biomarkers. PLoS One. 2014;9(4):e93046. doi:10.1371/journal.pone.0093046.View ArticlePubMedPubMed CentralGoogle Scholar
- Sun T, Kong X, Du Y, Li Z. Aberrant microRNAs in pancreatic cancer: researches and clinical implications. Gastroenterol Res Pract. 2014;2014:386561. doi:10.1155/2014/386561.PubMedPubMed CentralGoogle Scholar
- Ohuchida K, Mizumoto K, Lin C, Yamaguchi H, Ohtsuka T, Sato N, et al. MicroRNA-10a is overexpressed in human pancreatic cancer and involved in its invasiveness partially via suppression of the HOXA1 gene. Ann Surg Oncol. 2012;19(7):2394–402. doi:10.1245/s10434-012-2252-3.View ArticlePubMedGoogle Scholar
- Kwon MS, Kim Y, Lee S, Namkung J, Yun T, Yi SG, et al. Integrative analysis of multi-omics data for identifying multi-markers for diagnosing pancreatic cancer. BMC Genomics. 2015;16 Suppl 9:S4. doi:10.1186/1471-2164-16-S9-S4.View ArticlePubMedPubMed CentralGoogle Scholar
- Zhang H, Duan HO, Kirley SD, Zukerberg LR, Wu CL. Aberrant splicing of cables gene, a CDK regulator, in human cancers. Cancer Biol Ther. 2005;4(11):1211–5.View ArticlePubMedGoogle Scholar
- Sumazin P, Yang X, Chiu HS, Chung WJ, Iyer A, Llobet-Navas D, et al. An extensive microRNA-mediated network of RNA-RNA interactions regulates established oncogenic pathways in glioblastoma. Cell. 2011;147(2):370–81. doi:10.1016/j.cell.2011.09.041.View ArticlePubMedPubMed CentralGoogle Scholar
- Lee JH, Kim DG, Bae TJ, Rho K, Kim JT, Lee JJ, et al. CDA: combinatorial drug discovery using transcriptional response modules. PLoS One. 2012;7(8):e42573. doi:10.1371/journal.pone.0042573.View ArticlePubMedPubMed CentralGoogle Scholar
- Qin Q, Furong W, Baosheng L. Multiple functions of hypoxia-regulated miR-210 in cancer. J Exp Clin Cancer Res. 2014;33:50. doi:10.1186/1756-9966-33-50.View ArticlePubMedPubMed CentralGoogle Scholar
- Stinson S, Lackner MR, Adai AT, Yu N, Kim HJ, O’Brien C, et al. miR-221/222 targeting of trichorhinophalangeal 1 (TRPS1) promotes epithelial-to-mesenchymal transition in breast cancer. Sci Signal. 2011;4(186):pt5. doi:10.1126/scisignal.2002258.View ArticlePubMedGoogle Scholar
- Meng Z, Fu X, Chen X, Zeng S, Tian Y, Jove R, et al. miR-194 is a marker of hepatic epithelial cells and suppresses metastasis of liver cancer cells in mice. Hepatology. 2010;52(6):2148–57. doi:10.1002/hep.23915.View ArticlePubMedPubMed CentralGoogle Scholar
- Zhou Y, Jiang H, Gu J, Tang Y, Shen N, Jin Y. MicroRNA-195 targets ADP-ribosylation factor-like protein 2 to induce apoptosis in human embryonic stem cell-derived neural progenitor cells. Cell Death Dis. 2013;4:e695. doi:10.1038/cddis.2013.195.View ArticlePubMedPubMed CentralGoogle Scholar
- Gu J, Tamura M, Yamada KM. Tumor suppressor PTEN inhibits integrin- and growth factor-mediated mitogen-activated protein (MAP) kinase signaling pathways. J Cell Biol. 1998;143(5):1375–83.View ArticlePubMedPubMed CentralGoogle Scholar
- Ramasamy A, Mondry A, Holmes CC, Altman DG. Key issues in conducting a meta-analysis of gene expression microarray datasets. PLoS Med. 2008;5(9):e184. doi:10.1371/journal.pmed.0050184.View ArticlePubMedPubMed CentralGoogle Scholar
- Ping Y, Deng Y, Wang L, Zhang H, Zhang Y, Xu C, et al. Identifying core gene modules in glioblastoma based on multilayer factor-mediated dysfunctional regulatory networks through integrating multi-dimensional genomic data. Nucleic Acids Res. 2015;43(4):1997–2007. doi:10.1093/nar/gkv074.View ArticlePubMedPubMed CentralGoogle Scholar
- Lefebvre C, Rajbhandari P, Alvarez MJ, Bandaru P, Lim WK, Sato M, et al. A human B-cell interactome identifies MYB and FOXM1 as master regulators of proliferation in germinal centers. Mol Syst Biol. 2010;6:377. doi:10.1038/msb.2010.31.View ArticlePubMedPubMed CentralGoogle Scholar
- Craven KE, Gore J, Wilson JL, Korc M. Angiogenic gene signature in human pancreatic cancer correlates with TGF-beta and inflammatory transcriptomes. Oncotarget. 2015;7:323–41. doi:10.18632/oncotarget.6345.Google Scholar
- Huang B, Zhao J, Unkeless JC, Feng ZH, Xiong H. TLR signaling by tumor and immune cells: a double-edged sword. Oncogene. 2008;27(2):218–24. doi:10.1038/sj.onc.1210904.View ArticlePubMedGoogle Scholar
- Chen JC, Alvarez MJ, Talos F, Dhruv H, Rieckhof GE, Iyer A, et al. Identification of causal genetic drivers of human disease through systems-level analysis of regulatory networks. Cell. 2014;159(2):402–14. doi:10.1016/j.cell.2014.09.021.View ArticlePubMedPubMed CentralGoogle Scholar
- Feinberg AP, Vogelstein B. Hypomethylation distinguishes genes of some human cancers from their normal counterparts. Nature. 1983;301(5895):89–92.View ArticlePubMedGoogle Scholar
- Nones K, Waddell N, Song S, Patch AM, Miller D, Johns A, et al. Genome-wide DNA methylation patterns in pancreatic ductal adenocarcinoma reveal epigenetic deregulation of SLIT-ROBO, ITGA2 and MET signaling. Int J Cancer. 2014;135(5):1110–8. doi:10.1002/ijc.28765.View ArticlePubMedGoogle Scholar
- Lu J, Getz G, Miska EA, Alvarez-Saavedra E, Lamb J, Peck D, et al. MicroRNA expression profiles classify human cancers. Nature. 2005;435(7043):834–8. doi:10.1038/nature03702.View ArticlePubMedGoogle Scholar
- Frampton AE, Krell J, Jamieson NB, Gall TM, Giovannetti E, Funel N, et al. microRNAs with prognostic significance in pancreatic ductal adenocarcinoma: a meta-analysis. Eur J Cancer. 2015;51(11):1389–404. doi:10.1016/j.ejca.2015.04.006.View ArticlePubMedGoogle Scholar
- Muñoz-Rodríguez JL, Vrba L, Futscher BW, Hu C, Komenaka IK, Meza-Montenegro MM, et al. Differentially expressed microRNAs in postpartum breast cancer in hispanic women. PLoS One. 2015;10(4), e0124340. doi:10.1371/journal.pone.0124340.View ArticlePubMedPubMed CentralGoogle Scholar
- Aytes A, Mitrofanova A, Lefebvre C, Alvarez MJ, Castillo-Martin M, Zheng T, et al. Cross-species regulatory network analysis identifies a synergistic interaction between FOXM1 and CENPF that drives prostate cancer malignancy. Cancer Cell. 2014;25(5):638–51. doi:10.1016/j.ccr.2014.03.017.View ArticlePubMedPubMed CentralGoogle Scholar
- Chudnovsky Y, Kim D, Zheng S, Whyte WA, Bansal M, Bray MA, et al. ZFHX4 interacts with the NuRD core member CHD4 and regulates the glioblastoma tumor-initiating cell state. Cell Rep. 2014;6(2):313–24. doi:10.1016/j.celrep.2013.12.032.View ArticlePubMedPubMed CentralGoogle Scholar
- Compagno M, Lim WK, Grunn A, Nandula SV, Brahmachary M, Shen Q, et al. Mutations of multiple genes cause deregulation of NF-kappaB in diffuse large B-cell lymphoma. Nature. 2009;459(7247):717–21. doi:10.1038/nature07968.View ArticlePubMedPubMed CentralGoogle Scholar
- Bai J, Li Y, Shao T, Zhao Z, Wang Y, Wu A, et al. Integrating analysis reveals microRNA-mediated pathway crosstalk among Crohn’s disease, ulcerative colitis and colorectal cancer. Mol Biosyst. 2014;10(9):2317–28. doi:10.1039/c4mb00169a.View ArticlePubMedGoogle Scholar
- De Keersmaecker K, Real PJ, Gatta GD, Palomero T, Sulis ML, Tosello V, et al. The TLX1 oncogene drives aneuploidy in T cell transformation. Nat Med. 2010;16(11):1321–7. doi:10.1038/nm.2246.View ArticlePubMedPubMed CentralGoogle Scholar
- Della Gatta G, Palomero T, Perez-Garcia A, Ambesi-Impiombato A, Bansal M, Carpenter ZW, et al. Reverse engineering of TLX oncogenic transcriptional networks identifies RUNX1 as tumor suppressor in T-ALL. Nat Med. 2012;18(3):436–40. doi:10.1038/nm.2610.View ArticlePubMedGoogle Scholar