Panorama: a database for the oncogenic evaluation of somatic mutations in pan-cancer

Abstract

Somatic mutations, key alterations in cancer development, exert differential effects across tissues and biological layers, such as transcriptomes, proteomes, and post-translational modifications (PTMs). Although previous pan-cancer studies have characterized the molecular landscape of cancer, the effects of individual somatic mutations across different tissues remain insufficiently explored. Here, we developed Panorama to evaluate the oncogenic potential of single somatic mutations across all cancer types. We collected cancer proteogenomics or multiomics data from over 10 000 individuals across 19 cancer types. Based on five evaluation criteria, we assessed whether a specific mutation affects the abundance of a particular gene’s transcriptome, proteome, or phosphoproteome; the tumor microenvironment; specific RNA- or protein-based signaling pathways; and outlier-level overexpression of PTMs, aiding in potential drug target identification. By leveraging five oncogenic metrics, Panorama quantifies the oncogenic potential of individual somatic mutations and provides a framework for identifying driver mutations by incorporating their downstream effects. With Panorama, researchers can integrate cancer proteogenomics data, providing a comprehensive approach that enhances our understanding of single somatic mutations in specific tissues. Finally, Panorama was developed as a web-based database to ensure easy access for researchers and is freely available at http://139.150.65.64:8080/or https://github.com/prosium/panorama.

Introduction

Cancer cells, characterized by abnormal proliferation driven by disrupted regulatory mechanisms, are shaped by genomic and epigenomic changes [1, 2]. These alterations act in specific tissues and in ways that depend on biological layers, such as at the DNA, RNA, protein, gene, or pathway levels [3, 4]. The Cancer Genome Atlas (TCGA) studies have enhanced our understanding of the origins and causes of cancer by conducting systematic analyses of genomic changes specific to each cancer type [5–22]. They provided biological insights from multiomics data into precision medicine by depicting the cancer landscape and proposing targeted therapeutic strategies. Several years later, with the launch of the Clinical Proteomic Tumor Analysis Consortium (CPTAC), initiated by advancements in mass spectrometry technology, we now understand the impact of genomic changes on proteins and post-translational modifications (PTMs) in cancer cells [23–39].

Several pan-cancer studies have attempted to harmonize insights from studies on individual cancer types. Using TCGA data, various pan-cancer studies have emerged, including molecular classifications based on cell-of-origin characteristics [4], development of new scores associated with oncogenic differentiation using machine learning [40], identification of oncogenic processes and signaling pathways [3], discovery of driver genes and mutations [41], and perspectives on the immune landscape of cancer [42]. Utilizing CPTAC data, several pan-cancer papers have been published, including studies on the evaluation of oncogenic driver mutations [43], the impact of aberrant methylation patterns [44], the landscape of PTM dysregulation of cancer [45], connections between clinical imaging and proteogenomics [46], the development of data resource package [47], the functional network using machine learning [48], tumor immunity [49], cancer vulnerabilities revealed by whole-genome doubling (WGD) [50], and pan-cancer druggable targets [51]. However, our understanding of the impact of a single somatic mutation across various tissues remains limited, and integrative efforts to analyze consortium-scale data, such as TCGA and CPTAC, remain insufficient.

Therefore, in the current era of cancer proteogenomics, we have integrated data and extended pan-cancer studies to comprehensively evaluate the impact of somatic mutations based on five criteria: (1) association between the mutation and clinical outcomes; (2) dysregulation of gene expression caused by a single somatic mutation at the RNA, protein, and PTMs levels; (3) relationship between the mutation and the tumor microenvironment (TME); (4) dysregulation of signaling pathways induced by the single somatic mutation; and (5) identification of associated therapeutic targets. By integrating data from over 10 000 cancer patients, our curated database evaluates the oncogenic potential of single somatic mutations at the pan-cancer level and identifies putative therapeutic targets for patients harboring specific mutations.

Methods

Data collection

We collected two types of datasets, including proteogenomics and multiomics data (Fig. 1A), derived from several large-scale cancer studies, including TCGA, CPTAC, and Pan-Cancer Analysis of Whole Genomes (PCAWG) (Table S1). For the proteogenomics dataset, data were collected from 11 different cancer types, comprising glioblastoma (GBM) [39], breast carcinoma (BRCA) [21, 31], colon adenocarcinoma (COAD) [25], head and neck squamous cell carcinoma (HNSCC), [32] clear cell renal cell carcinoma [37], hepatocellular carcinoma (HCC) [35], lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LSCC) [26, 34, 52], ovary cancer (OV), [17, 23] pancreas ductal adenocarcinoma [38] (PDAC), prostate adenocarcinoma [53], and endometrial carcinoma [36]. For multiomics dataset, we retrieved data from 12 different cancer types: lower-grade glioma and GBM [18], COAD [20], HNSCC [19], kidney cancer such as chromophobe renal cell carcinoma, clear cell renal cell carcinoma papillary renal cell carcinoma (KICH, KIRC, and KIRP), [5, 15] LUAD and LUSCC [13, 16, 54, 55], PDAC [56], uterine carcinoma, and uterine corpus endometrial carcinoma (UCEC) [8].

Figure 1.

Workflow and scheme for the database. (A) Step 1: Dataset preparations. Data were collected from proteogenomics and multiomics datasets from multiple tissues and biological data types. The number of samples per tissue is shown in parentheses. (B) Step 2: Preprocessing. Data filtering criteria and tools used in the database. (C) Step 3: Evaluation. Five criteria for evaluating a single somatic mutation.

Open in new tab Download slide

Data processing

The data obtained in Step 1 were processed by filtering, normalization, and deconvolution (Fig. 1B). Somatic mutations with frequencies >5% within each cohort were selected for analysis. We used processed data directly from public sources (TCGA, CPTAC, and original publications) and subsequently applied Z-score normalization. Specifically, PTM profiles were normalized using site-level Z-scores, whereas transcriptomic and proteomic data were normalized using gene-level Z-scores. Two-level inference was performed using raw normalized data before Z-score calculation based on either mRNA or protein expression. Immune cell-type deconvolution was performed using xCell [57], CIBERSORT [58], and ESTIMATE [59]. For xCell and ESTIMATE, the R wrapper “immunedeconv” [60] was used with default settings. CIBERSORT was implemented using the “CIBERSORT.R” script and the “LM22.txt” reference profile, both obtained from the official CIBERSORT website. Pathway activity inference was performed using Progeny [61] and Gene Set Variation Analysis (GSVA) [62]. For GSVA, the Single Sample Gene Set Enrichment Analysis (ssGSEA) algorithm [63] was employed using the HALLMARK and KEGG gene matrix transposed files retrieved from the official MsigDB [64] website.

Furthermore, our statistical analysis performed intergroup comparisons after the simple deletion of missing data. While this approach is pragmatic for enabling researchers to easily reproduce our analyses and compare them to the original publications, we acknowledge it may introduce statistical bias. However, a sensitivity analysis confirmed this is a rare event, occurring in only 0.35% (n = 273) of all tested mutation-gene pairs (n = 78 079, P < .05, Fisher’s exact test). In future versions of the database, we plan to provide both the raw and imputed versions to offer greater flexibility. The methods described above (xCell, CIBERSORT, ESTIMATE, Progeny, and GSVA) are standard, well-validated methodologies widely used in pan-cancer analyses [43–45, 49, 65, 66]. We employed default settings as recommended by the tool developers, which align with these established protocols and ensure comparability and reproducibility with previous studies. Applying immune deconvolution tools to proteomic data is an important and emerging field, though a single “gold-standard” method is still in development. However, a recent study by Song et al. [67], while noting the lack of a gold-standard dataset, reported that xCell provides the most robust results when comparing proteome-based predictions to RNA-based benchmarks. Therefore, Panorama provides results from xCell, ESTIMATE, and CIBERSORT to allow for a comprehensive comparison of these methods.

Oncogenicity evaluation

Each mutation was evaluated across five categories: prognosis, abnormal gene expression, TME, pathway activity, and actionability (Fig. 1C). (1) Prognostic biomarker (PB): Prognostic significance was determined if, in any of the following, overall survival, disease-specific survival, disease-free survival, progression-free survival, or relapse-free survival, patients with the mutation had poorer outcomes than those without the mutation. Statistical significance was evaluated using log-rank tests. (2–3) Expression perturbator (EP) and immune modulator (IM): The impact of a mutation on the expression of genes or immune cell populations was evaluated by counting the number of genes or immune cell populations that were relatively increased in the mutation group compared to those in the wild-type (WT) group, with a false discovery rate (FDR) based on Student’s t-test. (4) Pathway effectors (PE): Pathways that exhibited statistically significant increases or decreases in intensity associated with a given mutation compared to WT patients were evaluated. The criterion for significance was FDR based on Student’s t-test. (5) Actionable driver (AD): Identification of therapeutic targets involved in discovering phosphorylation events upregulated to outlier levels relative to changes in RNA expression driven by a somatic mutation.

Comparison method for TP53 mutations at the pan-cancer level

For the PB metric, P-values were transformed using log10. For MA, we calculated the interquartile range (IQR) of the PTM data and defined PTMs exceeding QI + 1.5 × IQR as upper outliers; the number of such PTMs was then normalized by the total PTM count. For EP, the count of genes with a SignedFDR ≥ 1.301 (indicating positive significance) was normalized by the total number of genes. Similarly, for IM and PE, the number of terms with a SignedFDR ≥ 1.301 was normalized by the total number of terms. Each mutation was then ranked as follows: the top two cancer types received five points, the following two types received four points, the following four types received three points, the subsequent two types received two points, and the final two types received one point. Additionally, if a mutation lacked a term or gene with a SignedFDR ≥ 1.301, it was assigned a minimum score of 1.

Driver mutations

Mutations with frequencies of at least 5% were selected for each cohort. For every mutation, we calculated a score for each metric by dividing the number of values with a SignedFDR ≥ 1.301 by the total number of values. For PB metric, we applied the -log10 transformation to the P-values. This process yielded five scores for each mutation. Subsequently, we performed Z-score normalization for each cohort using these five metrics and computed their average, which was defined as the average oncogenic score (AOS). Finally, to distinguish driver mutations from passenger mutations, we performed a robust outlier analysis on the AOS values for each cohort. This analysis employed two simultaneous, standard statistical methods: (1) an IQR method, where outliers were defined as any value exceeding Q3 + 1.5 × IQR, and (2) a Z-score method. For the Z-score method, we defined outliers as mutations with a |Z-score| > 2.5. This threshold was chosen because it is stricter than ± 2SD (which captures ∼95% of the data) and is statistically grounded, identifying approximately the top 1% of outliers. Based on the results of these two tests and a curated list of known driver genes [41], we classified all mutations into a four-tier grading system to systematically assess confidence, as shown in Fig. 4(B): Grade 1 (identified as an outlier by both tests + known driver), Grade 2A (identified by both tests + novel candidate), Grade 2B (identified by one test + known driver), and Grade 3 (identified by one test + novel candidate).

Database construction and web interface

All the raw data were obtained from publicly available sources (Table S1). Additionally, the processed data derived from the raw data are shared in the Panorama GitHub repository (https://github.com/prosium/panorama). Furthermore, for interactive web applications, Panorama was constructed using data science libraries, such as Pandas, NumPy, Plotly, SciPy, and Streamlit. Furthermore, to support bioinformaticians in conducting large-scale analyses locally or in parallel, we have provided all Panorama results in the Parquet format on GitHub.

The Panorama web interface is organized with a logo, a database summary, and brief descriptions of the five functions on the right, while the left side features a menu bar, an overview of cancer data, and release notes (Fig. 2A). Clicking on the menu reveals several options, including the main page (homepage), a database overview (detailed data status), and five functions. Selecting one of the five functions allowed the user to navigate to a page with a brief introduction, dataset selection, result plots, and a download menu (Fig. 2B). This uniform layout across all five functions ensures consistency and enhances usability.

Figure 2.

The web interface of Panorama. (A) Overview of the Panorama web interface, (1) logo of the Panorama database, (2) a brief description and functionalities of the Panorama database, and (3) Panorama menu bar. (B) Detailed view of the Panorama database function interface: (1) a brief description of the function, (2) dataset selection options including tissue type, cancer type, dataset, gene name, and survival type, (3) result plot, and (4) download buttons for the plot and underlying data.

Open in new tab Download slide

Database application

We describe several use cases of the Panorama database below to demonstrate its functionality.

SMARCA4 mutation as a PB in LUAD

Mutations can promote tumor growth, proliferation, and metastatic potential and reduce treatment responsiveness. These effects may be associated with improved or worsened patient survival rates. When the effect of a specific mutation on the prognosis is consistently observed, it serves as a PB. In the CPTAC-LUAD cohort, 11 patients with SMARCA4 mutations showed a significantly poorer prognosis than the 94 WT patients (P = .001) (Fig. 3A). Similarly, in the TCGA-LUAD cohort, 43 patients with SMARCA4 mutations had significantly poorer outcomes than 457 WT patients (P = .048) (Fig. 3A). Given that patients with the SMARCA4 mutation in LUAD consistently exhibit a poor prognosis across two independent cohorts, SMARCA4 can be used as a PB for LUAD, as previously reported [68].

Figure 3.

Use cases of Panorama. (A) Kaplan–Meier survival curves for SMARCA4 mutations in LUAD from CPTAC (left) and TCGA (right) datasets. Log-rank test. (B) Box plots comparing Z-differences in mRNA and protein expression levels for AURKA across multiple breast cancer (BRCA) cohorts. P-value is calculated from the Student’s t-test and converted to FDR using the Benjamini–Hochberg method. (C) Heatmaps illustrating the relationship between immune scores from ESTIMATE and xCell and gene expression changes as Z-differences, with significant associations marked by FDR thresholds. The P-value is calculated from the Student’s t-test and converted to FDR using the Benjamini–Hochberg method. (D) Heatmaps showing the association between mutation status and tumor necrosis factor-alpha (TNFα) and estrogen signaling pathways in BRCA. Expression changes are represented as Z-differences, with significant differences denoted by asterisks indicating FDR thresholds. The P-value is calculated from the Student’s t-test and converted to FDR using the Benjamini–Hochberg method. (E and F) (left) Scatter plots showing the correlation between RNA, proteome, or PTM changes for mutations in two tumor types, with outlier phosphorylation sites highlighted. The right panels display bar plots comparing the signed -log10(FDR) for proteome phosphorylation targets across all tumor types. P-value is calculated from the Student’s t-test and converted to FDR using the Benjamini–Hochberg method.

Open in new tab Download slide

TP53 mutation as an EP in BRCA

One approach to identifying factors that play critical roles in cancer development and progression is to identify genes specifically upregulated or overexpressed in cancer cells, as these are mutation-specific oncogenes that promote tumor formation. We hypothesized that if a gene is upregulated in patients with a mutation, and this upregulation is consistently observed across independent cohorts, the overexpressed gene is not only a target of the specific mutation but also drives cancer development in the context of that mutation.

In the CPTAC-, TCGA-, and PCAWG-BRCA cohorts, the frequently observed TP53 mutation group showed consistent and significant overexpression of AURKA compared to the TP53 WT group at both the transcriptomic and proteomic levels (Fig. 3B). Consistent upregulation of AURKA in association with TP53 mutations provides strong evidence that AURKA plays a critical role in cancer development or progression in the context of TP53 mutation [69].

AKAP13 and KEAP1 mutations as IMs in UCEC and BRCA

Immunotherapy, increasingly used in cancer treatment, is more effective when there is a high infiltration of immune cells, including T cells, in the TME, a condition referred to as an immunologically “hot” state. In contrast, tumors in an immunologically “cold” state, characterized by fewer or suppressed immune cells, are less likely to respond to immunotherapy. We analysed mutations associated with immunologically hot or cold states using immune scores derived from ESTIMATE and xCell.

In the CPTAC- and TCGA-UCEC cohorts, the AKAP13 mutation was associated with elevated immune scores across both tools and independent cohorts (Fig. 3C, top). Conversely, the KEAP1 mutation in the CPTAC- and TCGA-LUAD cohorts was associated with a decreased immune score in both the tools and independent cohorts (Fig. 3C, bottom). The consistent patterns suggest that AKAP13 and KEAP1 mutations can be proposed as immunotherapy markers, as they significantly alter the TME to a hot or cold state, as is well known. These findings can enhance our understanding of immune evasion mechanisms, improve the prediction of immunotherapy responses by cancer type, and provide opportunities to identify new immunotherapeutic targets [70, 71].

TP53 mutation as a PE in BRCA

Measuring activation at the pathway level rather than the expression of a single gene is essential for a systematic understanding of biological mechanisms. We utilized the hallmark gene set and applied GSVA to calculate ssGSEA scores, comparing patients with specific mutations to those with WT. In the CPTAC- and TCGA-BRCA cohorts, patients with TP53 mutations showed significantly increased tumor necrosis factor-alpha (TNFα) signaling at both the transcriptome and proteome levels (Fig. 3D, top). Conversely, TP53 mutations in CPTAC- and TCGA-BRCA cohorts were associated with a significant decrease in estrogen signaling (Fig. 3D, bottom).

The consistent increase in TNFα signaling associated with TP53 mutations suggests that TP53 mutations significantly alter gene expression and signaling pathways. Furthermore, it suggests TNFα signaling as a potential therapeutic target for breast cancer patients harboring TP53 mutations [72].

EGFR and APC mutations as therapeutic targets in LUAD and HNSCC

A significant increase in phosphorylation to outlier levels in patients with specific mutations suggests abnormal activation of the corresponding signaling pathway or protein, indicating its potentially critical role in cancer development or progression. We identified statistically significant phosphorylation events at the outlier level in patients with specific mutations and validated their unique significance across 12 tumor types.

In LUAD, patients with EGFR mutations showed a significantly higher FDR value for Y62 phosphorylation of PTPN11 than for the next-highest phosphorylation, K269, of BLVRA (Fig. 3E, left). At the pan-cancer level, various mutations, including those in FLG, CUBN, and PPFIA2, significantly increased Y62 phosphorylation of PTPN11; however, none of these mutations elevated Y62 phosphorylation of PTPN11 to an outlier level (Fig. 3E, right). In HNSCC, S292 phosphorylation of AMPD2 was observed in patients with APC mutations, and this phosphorylation was the only significantly different event at the outlier level (Fig. 3F). The finding that PTPN11 is a well-established strong potential therapeutic target in LUAD and AMPD2 is a novel potential therapeutic target in HNSCC, despite showing no significant differences at the transcriptomic or proteomic levels (FDR > 0.05), emphasizes the importance of proteogenomic data [34].

Tissue-dependent dynamics of TP53 mutations

Even when an identical TP53 mutation occurs at the same locus within the gene, the functional consequences and impact on tumorigenesis are highly dependent on the tissue context [73]. As shown in Fig. 4(A), we compared the five TP53 mutation functions across various tissues at the pan-cancer level.

Figure 4.

Tissue-specificity of TP53 mutation and candidate driver mutations. (A) Radar plots displaying the evaluation of TP53 mutations across 12 cancer types. Each sector of the radar plot represents a different functional category: PB, EP, IM, PE, and AD. (B) Distribution of the AOS across 12 cancer types. The density plots show the complete distribution of AOS values for all mutations with >5% frequency in each cohort. The overlaid dots represent only those mutations identified as statistical outliers and classified into Grades 1, 2A, 2B, and 3. This four-tier grading system classifies mutations based on their outlier status and prior driver knowledge, as detailed in the section “Methods.” Key Grade 1 driver mutations are labeled with their gene names.

Open in new tab Download slide

In HCC, TP53 mutations are a poor-prognosis biomarker, whereas in LSCC, they are a better-prognosis biomarker; no significant association was observed in the other 10 cancer types. Additionally, more genes showed increased expression in BRCA and HCC compared to other cancers, and immune cell populations were notably elevated in BRCA and COAD. Moreover, HCC and BRCA showed greater upregulation of signaling pathways than other cancer types, while GBM and OV showed a greater number of PTMs reaching outlier levels. Using Panorama to analyse TP53 mutations at the pan-cancer level enables the identification of therapeutic targets and prognostic evaluation for specific cancer types, thereby uncovering tissue-dependent heterogeneity in TP53 mutation mechanisms.

Driver mutation discovery

Identifying driver mutations in cancer data is challenging. Most tools select driver mutations based on factors such as mutation recurrence across the genome, their localization within functional domains, and the surrounding context [74]. We assumed that driver mutations would be associated with a greater influence on prognosis, a higher number of overexpressed genes, overactivated immune cell populations, hyperactivated signaling pathways, and more actionable PTM targets than passenger mutations (see the section “Methods”). Therefore, we calculated the average values of the five Panorama-measured factors for all mutations with a frequency of at least 5% across the 12 tissue types.

At the pan-cancer level, we identified upper outliers (displayed in blue) based on each mutation’s AOS and highlighted the known driver mutations for each cancer type in red (Fig. 4B). Across all 12 cancer types, we divided the known driver mutations into novel candidates. We identified at least one known driver mutation for each cancer type and proposed several novel mutations (Table S2). Notably, although not previously reported, the SLC8A3 mutation in LUAD was associated with significant increases in the immune score, estimate score, and stroma score among patients harboring the mutation. Unlike many existing tools for driver mutation discovery, our approach using AOS enables the detection of novel mutations. Through this approach, we propose a novel method for identifying driver mutations using cancer proteogenomic data that comprehensively considers both mutation frequency and downstream mutation effects.

Discussion and perspectives

In Panorama, we aimed to elucidate the diverse impacts of single somatic mutations on cancer by integrating proteogenomic data, including clinical data, from over 10 000 patients with various cancer types. We performed robust analysis to highlight the intricate biological consequences of specific mutations. Over the years, numerous web-based databases have been developed for comprehensive pan-cancer analyses. These include tools that focus on comparisons between normal and tumor tissues [75], broad analysis platforms in cancer genomics [76], and extensive association analysis tools that leverage proteogenomic data [77]. Panorama’s unique contribution is its mutation-centric group comparison framework. It provides a single, streamlined visualization that allows for a simultaneous comparison of how a single somatic mutation (versus the WT group) affects three layers—RNA, proteome, and phosphoproteome. This function is a core feature of Panorama, as it intuitively reveals how a genomic alteration may bypass the transcriptome and directly enact its function at the proteome or phosphoproteome level, a specific question not directly streamlined by other correlation-based platforms. Furthermore, the ability to analyse and visualize associations with derived data, such as immune signaling or signaling activity, in a single heatmap is a unique advantage of Panorama.

Various genomic alterations, in addition to single somatic mutations—such as copy number or structural changes, WGD, fusions, and aberrant methylation—can underlie the changes observed in our PB, EP, IM, PE, and AD metrics. In particular, the recent study by Chang et al. [50] demonstrated that WGD does not act uniformly but can be classified into different types based on the level of genomic instability, which varies in a cancer-type-specific manner. Crucially, they found that these WGD types are distinctly associated with specific “kinase phosphorylation networks” and cell cycle pathway activation. This finding suggests that large-scale genomic events like WGD may ultimately converge on specific phosphorylation pathways to drive tumorigenesis, highlighting the necessity of evaluating WGD as a potential driver of the “Actionable Driver (AD)” events identified in Panorama. Therefore, we plan to expand Panorama to include various other genomic events, such as WGD, to provide a more comprehensive functional analysis and will also continuously integrate new data from publicly available databases.

Furthermore, co-occurring or mutually exclusive mutations can act as confounding factors when analysing the impact of a single mutation [78, 79]. Panorama provides lists of co-occurring and mutually exclusive mutations across all cohorts to mitigate this issue, enabling a more accurate interpretation of the results (Table S3). Recently, it was shown that even when the same EGFR amplification is found in recurrent GBM, treatment resistance arises from decreased EGFR activation and increased BRAF activation at the protein and phosphorylation levels [80]. Panorama will not be limited to primary cancer, and we plan to add functionality to compare “sensitive vs. resistant” functions. Additionally, among the newly identified subtypes of nonsmall cell lung cancer, some have been defined by phosphorylation features and others by immune scores, which were then in turn used to predict therapeutic response [81]. We are planning future research to move beyond single-mutation analysis and define new subtypes based on the AOS, which reflects all five features of Panorama.

Our comprehensive proteogenomic database, Panorama, integrates analyses from numerous studies to advance our understanding of how single somatic mutations differentially affect cancer progression across diverse tissues. By integrating diverse data types, we identified key PBs, elucidated the mechanisms of immune modulation, and identified potentially ADs. This approach enables precise characterization of somatic mutation functions, facilitates a deeper understanding of cancer heterogeneity, and enhances patient treatment efficacy. Furthermore, it is provided as a web-based database to ensure easy access for researchers.

Conflict of interest

None declared.

Funding

This research was supported by a grant (2017M3A9A7050615) from the Ministry of Science and ICT (MSIT) and the KRIBB Research Initiative Program (KGM5192531).

Data availability

Database files in Parquet format and code are freely available for download on GitHub (https://github.com/prosium/panorama).

References

Hanahan

Hallmarks of cancer: new dimensions

Cancer Discov

2022

;

–

10.1158/2159-8290.CD-21-1059

Hanahan

Weinberg

Hallmarks of cancer: the next generation

Cell

2011

;

144

646

–

10.1016/j.cell.2011.02.013

Sanchez-Vega

Mina

Armenia

et al.

Oncogenic signaling pathways in The Cancer Genome Atlas

Cell

2018

;

173

321

–

337 e10

10.1016/j.cell.2018.03.035

Hoadley

Yau

Hinoue

et al.

Cell-of-origin patterns dominate the molecular classification of 10,000 tumors from 33 types of cancer

Cell

2018

;

173

291

–

304 e6

10.1016/j.cell.2018.03.022

Davis

Ricketts

Wang

et al.

The somatic genomic landscape of chromophobe renal cell carcinoma

Cancer Cell

2014

;

319

–

10.1016/j.ccr.2014.07.014

Ciriello

Gatza

Beck

et al.

Comprehensive molecular portraits of invasive lobular breast cancer

Cell

2015

;

163

506

–

10.1016/j.cell.2015.09.033

Cancer Genome Atlas Research Network

et al.

Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia

N Engl J Med

2013

;

368

2059

–

10.1056/NEJMoa1301689

Crossref

PubMed

WorldCat

Cancer Genome Atlas Research Network

Integrated genomic characterization of endometrial carcinoma

Nature

2013

;

497

–

Cancer Genome Atlas Research Network

et al.

Integrated genomic and molecular characterization of cervical cancer

Nature

2017

;

543

378

–

10.

Cancer Genome Atlas Research Network

The molecular taxonomy of primary prostate cancer

Cell

2015

;

163

1011

–

10.1016/j.cell.2015.10.025

Crossref

PubMed

WorldCat

11.

Cancer Genome Atlas Research Network

Comprehensive molecular characterization of urothelial bladder carcinoma

Nature

2014

;

507

315

–

12.

Cancer Genome Atlas Research Network

Comprehensive molecular characterization of gastric adenocarcinoma

Nature

2014

;

513

202

–

209

13.

Cancer Genome Atlas Research Network

Comprehensive molecular profiling of lung adenocarcinoma

Nature

2014

;

511

543

–

14.

Cancer Genome Atlas Research Network

Integrated genomic characterization of papillary thyroid carcinoma

Cell

2014

;

159

676

–

10.1016/j.cell.2014.09.050

Crossref

PubMed

WorldCat

15.

Cancer Genome Atlas Research Network

Comprehensive molecular characterization of clear cell renal cell carcinoma

Nature

2013

;

499

–

16.

Cancer Genome Atlas Research Network

Comprehensive genomic characterization of squamous cell lung cancers

Nature

2012

;

489

519

–

17.

Cancer Genome Atlas Research Network

Integrated genomic analyses of ovarian carcinoma

Nature

2011

;

474

609

–

18.

Cancer Genome Atlas Research Network

Comprehensive genomic characterization defines human glioblastoma genes and core pathways

Nature

2008

;

455

1061

–

19.

Cancer Genome Atlas Research Network

Comprehensive genomic characterization of head and neck squamous cell carcinomas

Nature

2015

;

517

576

–

20.

Cancer Genome Atlas Research Network

Comprehensive molecular characterization of human colon and rectal cancer

Nature

2012

;

487

330

–

21.

Cancer Genome Atlas Research Network

Comprehensive molecular portraits of human breast tumours

Nature

2012

;

490

–

22.

Brennan

Verhaak

RGW

McKenna

et al.

The somatic genomic landscape of glioblastoma

Cell

2013

;

155

462

–

10.1016/j.cell.2013.09.034

23.

Zhang

Liu

Zhang

et al.

Integrated proteogenomic characterization of human high-grade serous ovarian cancer

Cell

2016

;

166

755

–

10.1016/j.cell.2016.05.069

24.

Zhang

Wang

et al.

Proteogenomic characterization of human colon and rectal cancer

Nature

2014

;

513

382

–

25.

Vasaikar

Huang

Wang

et al.

Proteogenomic analysis of human colon cancer reveals new therapeutic opportunities

Cell

2019

;

177

1035

–

1049 e19

10.1016/j.cell.2019.03.030

26.

Satpathy

Krug

Jean Beltran

et al.

A proteogenomic portrait of lung squamous cell carcinoma

Cell

2021

;

184

4348

–

4371 e40

10.1016/j.cell.2021.07.016

27.

Petralia

Tignor

Reva

et al.

Integrated proteogenomic characterization across major histological types of pediatric brain cancer

Cell

2020

;

183

1962

–

1985 e31

10.1016/j.cell.2020.10.044

28.

Mun

D-G

Bhin

Kim

et al.

Proteogenomic characterization of human early-onset gastric cancer

Cancer Cell

2019

;

111

–

124 e10

10.1016/j.ccell.2018.12.003

29.

Mertins

Mani

Ruggles

et al.

Proteogenomics connects somatic mutations to signalling in breast cancer

Nature

2016

;

534

–

30.

Chen

Hsiao

et al.

Comprehensive proteogenomic characterization of rare kidney tumors

Cell Rep Med

2024

;

101547

10.1016/j.xcrm.2024.101547

31.

Krug

Jaehnig

Satpathy

et al.

Proteogenomic landscape of breast cancer tumorigenesis and targeted therapy

Cell

2020

;

183

1436

–

1456 e31

10.1016/j.cell.2020.10.036

32.

Huang

Chen

Savage

et al.

Proteogenomic insights into the biology and treatment of HPV-negative head and neck squamous cell carcinoma

Cancer Cell

2021

;

361

–

379 e16

10.1016/j.ccell.2020.12.007

33.

Pan

Shah

et al.

Integrated proteomic and glycoproteomic characterization of human high-grade serous ovarian carcinoma

Cell Rep

2020

;

108276

10.1016/j.celrep.2020.108276

34.

Gillette

Satpathy

Cao

et al.

Proteogenomic characterization reveals therapeutic vulnerabilities in lung adenocarcinoma

Cell

2020

;

182

200

–

225 e35

10.1016/j.cell.2020.06.013

35.

Gao

Zhu

Dong

et al.

Integrated proteogenomic characterization of HBV-related hepatocellular carcinoma

Cell

2019

;

179

561

–

577 e22

10.1016/j.cell.2019.08.052

36.

Dou

Kawaler

Cui Zhou

et al.

Proteogenomic characterization of endometrial carcinoma

Cell

2020

;

180

729

–

748 e26

10.1016/j.cell.2020.01.026

37.

Clark

Dhanasekaran

Petralia

et al.

Integrated proteogenomic characterization of clear cell renal cell carcinoma

Cell

2019

;

179

964

–

983 e31

10.1016/j.cell.2019.10.007

38.

Cao

Huang

Cui Zhou

et al.

Proteogenomic characterization of pancreatic ductal adenocarcinoma

Cell

2021

;

184

5031

–

5052 e26

10.1016/j.cell.2021.08.023

39.

Wang

L-B

Karpova

Gritsenko

et al.

Proteogenomic and metabolomic characterization of human glioblastoma

Cancer Cell

2021

;

509

–

528 e20

10.1016/j.ccell.2021.01.006

40.

Malta

Sokolov

Gentles

et al.

Machine learning identifies stemness features associated with oncogenic dedifferentiation

Cell

2018

;

173

338

–

354 e15

10.1016/j.cell.2018.03.034

41.

Bailey

Tokheim

Porta-Pardo

et al.

Comprehensive characterization of cancer driver genes and mutations

Cell

2018

;

173

371

–

385 e18

10.1016/j.cell.2018.02.060

42.

Thorsson

Gibbs

Brown

et al.

The immune landscape of cancer

Immunity

2018

;

812

–

830 e14

10.1016/j.immuni.2018.03.023

43.

Porta-Pardo

Tokheim

et al.

Pan-cancer proteogenomics connects oncogenic drivers to functional states

Cell

2023

;

186

3921

–

3944 e25

10.1016/j.cell.2023.07.014

44.

Liang

W-W

RJ-H

Jayasinghe

et al.

Integrative multi-omic cancer profiling reveals DNA methylation patterns associated with therapeutic vulnerability and cell-of-origin

Cancer Cell

2023

;

1567

–

1585 e7

10.1016/j.ccell.2023.07.013

45.

Geffen

Anand

Akiyama

et al.

Pan-cancer analysis of post-translational modifications reveals shared patterns of protein regulation

Cell

2023

;

186

3945

–

3967 e26

10.1016/j.cell.2023.07.013

46.

Wang

Hong

Demicco

et al.

Deep learning integrates histopathology and proteogenomics at a pan-cancer level

Cell Rep Med

2023

;

101173

10.1016/j.xcrm.2023.101173

47.

Dou

Da Veiga Leprevost

et al.

Proteogenomic data and resources for pan-cancer analysis

Cancer Cell

2023

;

1397

–

406

10.1016/j.ccell.2023.06.009

48.

Shi

Lei

Elizarraras

et al.

Mapping the functional network of human cancer through machine learning and pan-cancer proteogenomics

Nat Cancer

2025

;

205

–

10.1038/s43018-024-00869-z

49.

Petralia

Yaron

et al.

Pan-cancer proteogenomics characterization of tumor immunity

Cell

2024

;

187

1255

–

1277 e27

10.1016/j.cell.2024.01.027

50.

Chang

Kim

S‐J

Hwang

et al.

Pan-cancer proteogenomic landscape of whole-genome doubling reveals putative therapeutic targets in various cancer types

Clin Transl Med

2024

;

e1796

51.

Savage

Lei

et al.

Pan-cancer proteogenomics expands the landscape of therapeutic targets

Cell

2024

;

187

4389

–

4407 e15

10.1016/j.cell.2024.05.039

52.

J-Y

Zhang

Wang

et al.

Integrative proteomic characterization of human lung adenocarcinoma

Cell

2020

;

182

245

–

261 e17

10.1016/j.cell.2020.05.043

53.

Dong

J-Y

Huang

et al.

Integrative proteogenomic profiling of high-risk prostate cancer samples from Chinese patients indicates metabolic vulnerabilities and diagnostic biomarkers

Nat Cancer

2024

;

1427

–

10.1038/s43018-024-00820-2

54.

Chen

Y-J

Roumeliotis

Chang

Y-H

et al.

Proteogenomics of non-smoking lung cancer in East Asia delineates molecular signatures of pathogenesis and progression

Cell

2020

;

182

226

–

244 e17

10.1016/j.cell.2020.06.012

55.

Chen

Yang

Teo

ASM

et al.

Genomic landscape of lung adenocarcinoma in East Asians

Nat Genet

2020

;

177

–

10.1038/s41588-019-0569-6

56.

Cancer Genome Atlas Research Network

Integrated genomic characterization of pancreatic ductal adenocarcinoma

Cancer Cell

2017

;

185

–

203 e13

Crossref

PubMed

WorldCat

57.

Aran

Butte

xCell: digitally portraying the tissue cellular heterogeneity landscape

Genome Biol

2017

;

220

10.1186/s13059-017-1349-1

58.

Newman

Liu

Green

et al.

Robust enumeration of cell subsets from tissue expression profiles

Nat Methods

2015

;

453

–

59.

Yoshihara

Shahmoradgoli

Martínez

et al.

Inferring tumour purity and stromal and immune cell admixture from expression data

Nat Commun

2013

;

2612

60.

Sturm

Finotello

Petitprez

et al.

Comprehensive evaluation of transcriptome-based cell-type quantification methods for immuno-oncology

Bioinformatics

2019

;

i436

–

10.1093/bioinformatics/btz363

61.

Schubert

Klinger

Klünemann

et al.

Perturbation-response genes reveal signaling footprints in cancer gene expression

Nat Commun

2018

;

10.1038/s41467-017-02391-6

62.

Hänzelmann

Castelo

Guinney

GSVA: gene set variation analysis for microarray and RNA-seq data

BMC Bioinf

2013

;

10.1186/1471-2105-14-7

Google Scholar

Crossref

WorldCat

63.

Subramanian

Tamayo

Mootha

et al.

Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles

Proc Natl Acad Sci USA

2005

;

102

15545

–

10.1073/pnas.0506580102

64.

Liberzon

Subramanian

Pinchback

et al.

Molecular signatures database (MSigDB) 3.0

Bioinformatics

2011

;

1739

–

10.1093/bioinformatics/btr260

65.

Battaglia

Mimpen

Traets

JJH

et al.

A pan-cancer analysis of the microbiome in metastatic cancer

Cell

2024

;

187

2324

–

2335 e19

10.1016/j.cell.2024.03.021

66.

Ghoshdastider

Sendoel

Exploring the pan-cancer landscape of posttranscriptional regulation

Cell Rep

2023

;

113172

10.1016/j.celrep.2023.113172

67.

Feng

Calinawan

Pugliese

et al.

Decomprolute is a benchmarking platform designed for multiomics-based tumor deconvolution

Cell Rep Methods

2024

;

100708

10.1016/j.crmeth.2024.100708

68.

Tian

et al.

SMARCA4: current status and future perspectives in non-small-cell lung cancer

Cancer Lett

2023

;

554

216022

10.1016/j.canlet.2022.216022

69.

Sun

Chen

et al.

p53 Mutation directs AURKA overexpression via miR-25 and FBXW7 in prostatic small cell neuroendocrine carcinoma

Mol Cancer Res

2015

;

584

–

10.1158/1541-7786.MCR-14-0277-T

70.

Zavitsanou

A-M

Pillai

Hao

et al.

KEAP1 mutation in lung adenocarcinoma promotes immune evasion and immunotherapy resistance

Cell Rep

2023

;

113295

10.1016/j.celrep.2023.113295

71.

Shibolet

Giallourakis

Rosenberg

et al.

AKAP13, a RhoA GTPase-specific guanine exchange factor, is a novel regulator of TLR2 signaling

J Biol Chem

2007

;

282

35308

–

10.1074/jbc.M704426200

72.

Di Minin

Bellazzo

Dal Ferro

et al.

Mutant p53 reprograms TNF signaling in cancer cells through interaction with the tumor suppressor DAB2IP

Mol Cell

2014

;

617

–

10.1016/j.molcel.2014.10.013

73.

Wang

Guo

Wei

et al.

Targeting p53 pathways: mechanisms, structures, and advances in therapy

Signal Transduct Target Ther

2023

;

10.1038/s41392-023-01347-1

74.

Martínez-Jiménez

Muiños

Sentís

et al.

A compendium of mutational cancer driver genes

Nat Rev Cancer

2020

;

555

–

10.1038/s41568-020-0290-x

75.

Tang

Kang

et al.

GEPIA2: an enhanced web server for large-scale expression profiling and interactive analysis

Nucleic Acids Res

2019

;

W556

–

76.

de Bruijn

Kundra

Mastrogiacomo

et al.

Analysis and visualization of longitudinal genomic and clinical data from the AACR project GENIE biopharma collaborative in cBioPortal

Cancer Res

2023

;

3861

–

10.1158/0008-5472.CAN-23-0816

77.

Vasaikar

Straub

Wang

et al.

LinkedOmics: analyzing multi-omics data within and across 32 cancer types

Nucleic Acids Res

2018

;

D956

–

78.

Skoulidis

Heymach

Co-occurring genomic alterations in non-small-cell lung cancer biology and therapy

Nat Rev Cancer

2019

;

495

–

509

10.1038/s41568-019-0179-8

79.

Skoulidis

Byers

Diao

et al.

Co-occurring genomic alterations define major subsets of KRAS-mutant lung adenocarcinoma with distinct biology, immune profiles, and therapeutic vulnerabilities

Cancer Discov

2015

;

860

–

10.1158/2159-8290.CD-14-1236

80.

Kim

K-H

Migliozzi

Koo

et al.

Integrated proteogenomic characterization of glioblastoma evolution

Cancer Cell

2024

;

358

–

377 e8

10.1016/j.ccell.2023.12.015

81.

Song

Choi

Kim

et al.

Proteogenomic analysis reveals non-small cell lung cancer subtypes predicting chromosome instability, and tumor microenvironment

Nat Commun

2024

;

10164

10.1038/s41467-024-54434-4

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Download all slides

Month:	Total Views:
January 2026	216
February 2026	63
March 2026	83
April 2026	74
May 2026	33

Article Contents

Panorama: a database for the oncogenic evaluation of somatic mutations in pan-cancer

Abstract

Introduction

Methods

Data collection

Data processing

Oncogenicity evaluation

Comparison method for TP53 mutations at the pan-cancer level

Driver mutations

Database construction and web interface

Database application

SMARCA4 mutation as a PB in LUAD

TP53 mutation as an EP in BRCA

AKAP13 and KEAP1 mutations as IMs in UCEC and BRCA

TP53 mutation as a PE in BRCA

EGFR and APC mutations as therapeutic targets in LUAD and HNSCC

Tissue-dependent dynamics of TP53 mutations

Driver mutation discovery

Discussion and perspectives

Conflict of interest

Funding

Data availability

References

Citations

Views

Altmetric

Citing articles via

Latest

Most Read

Most Cited

Article Contents

Panorama: a database for the oncogenic evaluation of somatic mutations in pan-cancer Open Access

Abstract

Introduction

Methods

Data collection

Data processing

Oncogenicity evaluation

Comparison method for TP53 mutations at the pan-cancer level

Driver mutations

Database construction and web interface

Database application

SMARCA4 mutation as a PB in LUAD

TP53 mutation as an EP in BRCA

AKAP13 and KEAP1 mutations as IMs in UCEC and BRCA

TP53 mutation as a PE in BRCA

EGFR and APC mutations as therapeutic targets in LUAD and HNSCC

Tissue-dependent dynamics of TP53 mutations

Driver mutation discovery

Discussion and perspectives

Conflict of interest

Funding

Data availability

References

Citations

Views

Altmetric

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only

Gift article access

Gift article access

Gift article access

Gift article access

Panorama: a database for the oncogenic evaluation of somatic mutations in pan-cancer