Abstract

Somatic mutations, key alterations in cancer development, exert differential effects across tissues and biological layers, such as transcriptomes, proteomes, and post-translational modifications (PTMs). Although previous pan-cancer studies have characterized the molecular landscape of cancer, the effects of individual somatic mutations across different tissues remain insufficiently explored. Here, we developed Panorama to evaluate the oncogenic potential of single somatic mutations across all cancer types. We collected cancer proteogenomics or multiomics data from over 10 000 individuals across 19 cancer types. Based on five evaluation criteria, we assessed whether a specific mutation affects the abundance of a particular gene’s transcriptome, proteome, or phosphoproteome; the tumor microenvironment; specific RNA- or protein-based signaling pathways; and outlier-level overexpression of PTMs, aiding in potential drug target identification. By leveraging five oncogenic metrics, Panorama quantifies the oncogenic potential of individual somatic mutations and provides a framework for identifying driver mutations by incorporating their downstream effects. With Panorama, researchers can integrate cancer proteogenomics data, providing a comprehensive approach that enhances our understanding of single somatic mutations in specific tissues. Finally, Panorama was developed as a web-based database to ensure easy access for researchers and is freely available at http://139.150.65.64:8080/or https://github.com/prosium/panorama.

Introduction

Cancer cells, characterized by abnormal proliferation driven by disrupted regulatory mechanisms, are shaped by genomic and epigenomic changes [1, 2]. These alterations act in specific tissues and in ways that depend on biological layers, such as at the DNA, RNA, protein, gene, or pathway levels [3, 4]. The Cancer Genome Atlas (TCGA) studies have enhanced our understanding of the origins and causes of cancer by conducting systematic analyses of genomic changes specific to each cancer type [5–22]. They provided biological insights from multiomics data into precision medicine by depicting the cancer landscape and proposing targeted therapeutic strategies. Several years later, with the launch of the Clinical Proteomic Tumor Analysis Consortium (CPTAC), initiated by advancements in mass spectrometry technology, we now understand the impact of genomic changes on proteins and post-translational modifications (PTMs) in cancer cells [23–39].

Several pan-cancer studies have attempted to harmonize insights from studies on individual cancer types. Using TCGA data, various pan-cancer studies have emerged, including molecular classifications based on cell-of-origin characteristics [4], development of new scores associated with oncogenic differentiation using machine learning [40], identification of oncogenic processes and signaling pathways [3], discovery of driver genes and mutations [41], and perspectives on the immune landscape of cancer [42]. Utilizing CPTAC data, several pan-cancer papers have been published, including studies on the evaluation of oncogenic driver mutations [43], the impact of aberrant methylation patterns [44], the landscape of PTM dysregulation of cancer [45], connections between clinical imaging and proteogenomics [46], the development of data resource package [47], the functional network using machine learning [48], tumor immunity [49], cancer vulnerabilities revealed by whole-genome doubling (WGD) [50], and pan-cancer druggable targets [51]. However, our understanding of the impact of a single somatic mutation across various tissues remains limited, and integrative efforts to analyze consortium-scale data, such as TCGA and CPTAC, remain insufficient.

Therefore, in the current era of cancer proteogenomics, we have integrated data and extended pan-cancer studies to comprehensively evaluate the impact of somatic mutations based on five criteria: (1) association between the mutation and clinical outcomes; (2) dysregulation of gene expression caused by a single somatic mutation at the RNA, protein, and PTMs levels; (3) relationship between the mutation and the tumor microenvironment (TME); (4) dysregulation of signaling pathways induced by the single somatic mutation; and (5) identification of associated therapeutic targets. By integrating data from over 10 000 cancer patients, our curated database evaluates the oncogenic potential of single somatic mutations at the pan-cancer level and identifies putative therapeutic targets for patients harboring specific mutations.

Methods

Data collection

We collected two types of datasets, including proteogenomics and multiomics data (Fig. 1A), derived from several large-scale cancer studies, including TCGA, CPTAC, and Pan-Cancer Analysis of Whole Genomes (PCAWG) (Table S1). For the proteogenomics dataset, data were collected from 11 different cancer types, comprising glioblastoma (GBM) [39], breast carcinoma (BRCA) [21, 31], colon adenocarcinoma (COAD) [25], head and neck squamous cell carcinoma (HNSCC), [32] clear cell renal cell carcinoma [37], hepatocellular carcinoma (HCC) [35], lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LSCC) [26, 34, 52], ovary cancer (OV), [17, 23] pancreas ductal adenocarcinoma [38] (PDAC), prostate adenocarcinoma [53], and endometrial carcinoma [36]. For multiomics dataset, we retrieved data from 12 different cancer types: lower-grade glioma and GBM [18], COAD [20], HNSCC [19], kidney cancer such as chromophobe renal cell carcinoma, clear cell renal cell carcinoma papillary renal cell carcinoma (KICH, KIRC, and KIRP), [5, 15] LUAD and LUSCC [13, 16, 54, 55], PDAC [56], uterine carcinoma, and uterine corpus endometrial carcinoma (UCEC) [8].

Workflow and scheme for the database. (A) Step 1: Dataset preparations. Data were collected from proteogenomics and multiomics datasets from multiple tissues and biological data types. The number of samples per tissue is shown in parentheses. (B) Step 2: Preprocessing. Data filtering criteria and tools used in the database. (C) Step 3: Evaluation. Five criteria for evaluating a single somatic mutation.
Figure 1.

Workflow and scheme for the database. (A) Step 1: Dataset preparations. Data were collected from proteogenomics and multiomics datasets from multiple tissues and biological data types. The number of samples per tissue is shown in parentheses. (B) Step 2: Preprocessing. Data filtering criteria and tools used in the database. (C) Step 3: Evaluation. Five criteria for evaluating a single somatic mutation.

Data processing

The data obtained in Step 1 were processed by filtering, normalization, and deconvolution (Fig. 1B). Somatic mutations with frequencies >5% within each cohort were selected for analysis. We used processed data directly from public sources (TCGA, CPTAC, and original publications) and subsequently applied Z-score normalization. Specifically, PTM profiles were normalized using site-level Z-scores, whereas transcriptomic and proteomic data were normalized using gene-level Z-scores. Two-level inference was performed using raw normalized data before Z-score calculation based on either mRNA or protein expression. Immune cell-type deconvolution was performed using xCell [57], CIBERSORT [58], and ESTIMATE [59]. For xCell and ESTIMATE, the R wrapper “immunedeconv” [60] was used with default settings. CIBERSORT was implemented using the “CIBERSORT.R” script and the “LM22.txt” reference profile, both obtained from the official CIBERSORT website. Pathway activity inference was performed using Progeny [61] and Gene Set Variation Analysis (GSVA) [62]. For GSVA, the Single Sample Gene Set Enrichment Analysis (ssGSEA) algorithm [63] was employed using the HALLMARK and KEGG gene matrix transposed files retrieved from the official MsigDB [64] website.

Furthermore, our statistical analysis performed intergroup comparisons after the simple deletion of missing data. While this approach is pragmatic for enabling researchers to easily reproduce our analyses and compare them to the original publications, we acknowledge it may introduce statistical bias. However, a sensitivity analysis confirmed this is a rare event, occurring in only 0.35% (n = 273) of all tested mutation-gene pairs (n = 78 079, P < .05, Fisher’s exact test). In future versions of the database, we plan to provide both the raw and imputed versions to offer greater flexibility. The methods described above (xCell, CIBERSORT, ESTIMATE, Progeny, and GSVA) are standard, well-validated methodologies widely used in pan-cancer analyses [43–45, 49, 65, 66]. We employed default settings as recommended by the tool developers, which align with these established protocols and ensure comparability and reproducibility with previous studies. Applying immune deconvolution tools to proteomic data is an important and emerging field, though a single “gold-standard” method is still in development. However, a recent study by Song et al. [67], while noting the lack of a gold-standard dataset, reported that xCell provides the most robust results when comparing proteome-based predictions to RNA-based benchmarks. Therefore, Panorama provides results from xCell, ESTIMATE, and CIBERSORT to allow for a comprehensive comparison of these methods.

Oncogenicity evaluation

Each mutation was evaluated across five categories: prognosis, abnormal gene expression, TME, pathway activity, and actionability (Fig. 1C). (1) Prognostic biomarker (PB): Prognostic significance was determined if, in any of the following, overall survival, disease-specific survival, disease-free survival, progression-free survival, or relapse-free survival, patients with the mutation had poorer outcomes than those without the mutation. Statistical significance was evaluated using log-rank tests. (2–3) Expression perturbator (EP) and immune modulator (IM): The impact of a mutation on the expression of genes or immune cell populations was evaluated by counting the number of genes or immune cell populations that were relatively increased in the mutation group compared to those in the wild-type (WT) group, with a false discovery rate (FDR) based on Student’s t-test. (4) Pathway effectors (PE): Pathways that exhibited statistically significant increases or decreases in intensity associated with a given mutation compared to WT patients were evaluated. The criterion for significance was FDR based on Student’s t-test. (5) Actionable driver (AD): Identification of therapeutic targets involved in discovering phosphorylation events upregulated to outlier levels relative to changes in RNA expression driven by a somatic mutation.

Comparison method for TP53 mutations at the pan-cancer level

For the PB metric, P-values were transformed using log10. For MA, we calculated the interquartile range (IQR) of the PTM data and defined PTMs exceeding QI + 1.5 × IQR as upper outliers; the number of such PTMs was then normalized by the total PTM count. For EP, the count of genes with a SignedFDR ≥ 1.301 (indicating positive significance) was normalized by the total number of genes. Similarly, for IM and PE, the number of terms with a SignedFDR ≥ 1.301 was normalized by the total number of terms. Each mutation was then ranked as follows: the top two cancer types received five points, the following two types received four points, the following four types received three points, the subsequent two types received two points, and the final two types received one point. Additionally, if a mutation lacked a term or gene with a SignedFDR ≥ 1.301, it was assigned a minimum score of 1.

Driver mutations

Mutations with frequencies of at least 5% were selected for each cohort. For every mutation, we calculated a score for each metric by dividing the number of values with a SignedFDR ≥ 1.301 by the total number of values. For PB metric, we applied the -log10 transformation to the P-values. This process yielded five scores for each mutation. Subsequently, we performed Z-score normalization for each cohort using these five metrics and computed their average, which was defined as the average oncogenic score (AOS). Finally, to distinguish driver mutations from passenger mutations, we performed a robust outlier analysis on the AOS values for each cohort. This analysis employed two simultaneous, standard statistical methods: (1) an IQR method, where outliers were defined as any value exceeding Q3 + 1.5 × IQR, and (2) a Z-score method. For the Z-score method, we defined outliers as mutations with a |Z-score| > 2.5. This threshold was chosen because it is stricter than ± 2SD (which captures ∼95% of the data) and is statistically grounded, identifying approximately the top 1% of outliers. Based on the results of these two tests and a curated list of known driver genes [41], we classified all mutations into a four-tier grading system to systematically assess confidence, as shown in Fig. 4(B): Grade 1 (identified as an outlier by both tests + known driver), Grade 2A (identified by both tests + novel candidate), Grade 2B (identified by one test + known driver), and Grade 3 (identified by one test + novel candidate).

Database construction and web interface

All the raw data were obtained from publicly available sources (Table S1). Additionally, the processed data derived from the raw data are shared in the Panorama GitHub repository (https://github.com/prosium/panorama). Furthermore, for interactive web applications, Panorama was constructed using data science libraries, such as Pandas, NumPy, Plotly, SciPy, and Streamlit. Furthermore, to support bioinformaticians in conducting large-scale analyses locally or in parallel, we have provided all Panorama results in the Parquet format on GitHub.

The Panorama web interface is organized with a logo, a database summary, and brief descriptions of the five functions on the right, while the left side features a menu bar, an overview of cancer data, and release notes (Fig. 2A). Clicking on the menu reveals several options, including the main page (homepage), a database overview (detailed data status), and five functions. Selecting one of the five functions allowed the user to navigate to a page with a brief introduction, dataset selection, result plots, and a download menu (Fig. 2B). This uniform layout across all five functions ensures consistency and enhances usability.

The web interface of Panorama. (A) Overview of the Panorama web interface, (1) logo of the Panorama database, (2) a brief description and functionalities of the Panorama database, and (3) Panorama menu bar. (B) Detailed view of the Panorama database function interface: (1) a brief description of the function, (2) dataset selection options including tissue type, cancer type, dataset, gene name, and survival type, (3) result plot, and (4) download buttons for the plot and underlying data.
Figure 2.

The web interface of Panorama. (A) Overview of the Panorama web interface, (1) logo of the Panorama database, (2) a brief description and functionalities of the Panorama database, and (3) Panorama menu bar. (B) Detailed view of the Panorama database function interface: (1) a brief description of the function, (2) dataset selection options including tissue type, cancer type, dataset, gene name, and survival type, (3) result plot, and (4) download buttons for the plot and underlying data.

Database application

We describe several use cases of the Panorama database below to demonstrate its functionality.

SMARCA4 mutation as a PB in LUAD

Mutations can promote tumor growth, proliferation, and metastatic potential and reduce treatment responsiveness. These effects may be associated with improved or worsened patient survival rates. When the effect of a specific mutation on the prognosis is consistently observed, it serves as a PB. In the CPTAC-LUAD cohort, 11 patients with SMARCA4 mutations showed a significantly poorer prognosis than the 94 WT patients (P = .001) (Fig. 3A). Similarly, in the TCGA-LUAD cohort, 43 patients with SMARCA4 mutations had significantly poorer outcomes than 457 WT patients (P = .048) (Fig. 3A). Given that patients with the SMARCA4 mutation in LUAD consistently exhibit a poor prognosis across two independent cohorts, SMARCA4 can be used as a PB for LUAD, as previously reported [68].

Use cases of Panorama. (A) Kaplan–Meier survival curves for SMARCA4 mutations in LUAD from CPTAC (left) and TCGA (right) datasets. Log-rank test. (B) Box plots comparing Z-differences in mRNA and protein expression levels for AURKA across multiple breast cancer (BRCA) cohorts. P-value is calculated from the Student’s t-test and converted to FDR using the Benjamini–Hochberg method. (C) Heatmaps illustrating the relationship between immune scores from ESTIMATE and xCell and gene expression changes as Z-differences, with significant associations marked by FDR thresholds. The P-value is calculated from the Student’s t-test and converted to FDR using the Benjamini–Hochberg method. (D) Heatmaps showing the association between mutation status and tumor necrosis factor-alpha (TNFα) and estrogen signaling pathways in BRCA. Expression changes are represented as Z-differences, with significant differences denoted by asterisks indicating FDR thresholds. The P-value is calculated from the Student’s t-test and converted to FDR using the Benjamini–Hochberg method. (E and F) (left) Scatter plots showing the correlation between RNA, proteome, or PTM changes for mutations in two tumor types, with outlier phosphorylation sites highlighted. The right panels display bar plots comparing the signed -log10(FDR) for proteome phosphorylation targets across all tumor types. P-value is calculated from the Student’s t-test and converted to FDR using the Benjamini–Hochberg method.
Figure 3.

Use cases of Panorama. (A) Kaplan–Meier survival curves for SMARCA4 mutations in LUAD from CPTAC (left) and TCGA (right) datasets. Log-rank test. (B) Box plots comparing Z-differences in mRNA and protein expression levels for AURKA across multiple breast cancer (BRCA) cohorts. P-value is calculated from the Student’s t-test and converted to FDR using the Benjamini–Hochberg method. (C) Heatmaps illustrating the relationship between immune scores from ESTIMATE and xCell and gene expression changes as Z-differences, with significant associations marked by FDR thresholds. The P-value is calculated from the Student’s t-test and converted to FDR using the Benjamini–Hochberg method. (D) Heatmaps showing the association between mutation status and tumor necrosis factor-alpha (TNFα) and estrogen signaling pathways in BRCA. Expression changes are represented as Z-differences, with significant differences denoted by asterisks indicating FDR thresholds. The P-value is calculated from the Student’s t-test and converted to FDR using the Benjamini–Hochberg method. (E and F) (left) Scatter plots showing the correlation between RNA, proteome, or PTM changes for mutations in two tumor types, with outlier phosphorylation sites highlighted. The right panels display bar plots comparing the signed -log10(FDR) for proteome phosphorylation targets across all tumor types. P-value is calculated from the Student’s t-test and converted to FDR using the Benjamini–Hochberg method.

TP53 mutation as an EP in BRCA

One approach to identifying factors that play critical roles in cancer development and progression is to identify genes specifically upregulated or overexpressed in cancer cells, as these are mutation-specific oncogenes that promote tumor formation. We hypothesized that if a gene is upregulated in patients with a mutation, and this upregulation is consistently observed across independent cohorts, the overexpressed gene is not only a target of the specific mutation but also drives cancer development in the context of that mutation.

In the CPTAC-, TCGA-, and PCAWG-BRCA cohorts, the frequently observed TP53 mutation group showed consistent and significant overexpression of AURKA compared to the TP53 WT group at both the transcriptomic and proteomic levels (Fig. 3B). Consistent upregulation of AURKA in association with TP53 mutations provides strong evidence that AURKA plays a critical role in cancer development or progression in the context of TP53 mutation [69].

AKAP13 and KEAP1 mutations as IMs in UCEC and BRCA

Immunotherapy, increasingly used in cancer treatment, is more effective when there is a high infiltration of immune cells, including T cells, in the TME, a condition referred to as an immunologically “hot” state. In contrast, tumors in an immunologically “cold” state, characterized by fewer or suppressed immune cells, are less likely to respond to immunotherapy. We analysed mutations associated with immunologically hot or cold states using immune scores derived from ESTIMATE and xCell.

In the CPTAC- and TCGA-UCEC cohorts, the AKAP13 mutation was associated with elevated immune scores across both tools and independent cohorts (Fig. 3C, top). Conversely, the KEAP1 mutation in the CPTAC- and TCGA-LUAD cohorts was associated with a decreased immune score in both the tools and independent cohorts (Fig. 3C, bottom). The consistent patterns suggest that AKAP13 and KEAP1 mutations can be proposed as immunotherapy markers, as they significantly alter the TME to a hot or cold state, as is well known. These findings can enhance our understanding of immune evasion mechanisms, improve the prediction of immunotherapy responses by cancer type, and provide opportunities to identify new immunotherapeutic targets [70, 71].

TP53 mutation as a PE in BRCA

Measuring activation at the pathway level rather than the expression of a single gene is essential for a systematic understanding of biological mechanisms. We utilized the hallmark gene set and applied GSVA to calculate ssGSEA scores, comparing patients with specific mutations to those with WT. In the CPTAC- and TCGA-BRCA cohorts, patients with TP53 mutations showed significantly increased tumor necrosis factor-alpha (TNFα) signaling at both the transcriptome and proteome levels (Fig. 3D, top). Conversely, TP53 mutations in CPTAC- and TCGA-BRCA cohorts were associated with a significant decrease in estrogen signaling (Fig. 3D, bottom).

The consistent increase in TNFα signaling associated with TP53 mutations suggests that TP53 mutations significantly alter gene expression and signaling pathways. Furthermore, it suggests TNFα signaling as a potential therapeutic target for breast cancer patients harboring TP53 mutations [72].

EGFR and APC mutations as therapeutic targets in LUAD and HNSCC

A significant increase in phosphorylation to outlier levels in patients with specific mutations suggests abnormal activation of the corresponding signaling pathway or protein, indicating its potentially critical role in cancer development or progression. We identified statistically significant phosphorylation events at the outlier level in patients with specific mutations and validated their unique significance across 12 tumor types.

In LUAD, patients with EGFR mutations showed a significantly higher FDR value for Y62 phosphorylation of PTPN11 than for the next-highest phosphorylation, K269, of BLVRA (Fig. 3E, left). At the pan-cancer level, various mutations, including those in FLG, CUBN, and PPFIA2, significantly increased Y62 phosphorylation of PTPN11; however, none of these mutations elevated Y62 phosphorylation of PTPN11 to an outlier level (Fig. 3E, right). In HNSCC, S292 phosphorylation of AMPD2 was observed in patients with APC mutations, and this phosphorylation was the only significantly different event at the outlier level (Fig. 3F). The finding that PTPN11 is a well-established strong potential therapeutic target in LUAD and AMPD2 is a novel potential therapeutic target in HNSCC, despite showing no significant differences at the transcriptomic or proteomic levels (FDR > 0.05), emphasizes the importance of proteogenomic data [34].

Tissue-dependent dynamics of TP53 mutations

Even when an identical TP53 mutation occurs at the same locus within the gene, the functional consequences and impact on tumorigenesis are highly dependent on the tissue context [73]. As shown in Fig. 4(A), we compared the five TP53 mutation functions across various tissues at the pan-cancer level.

Tissue-specificity of TP53 mutation and candidate driver mutations. (A) Radar plots displaying the evaluation of TP53 mutations across 12 cancer types. Each sector of the radar plot represents a different functional category: PB, EP, IM, PE, and AD. (B) Distribution of the AOS across 12 cancer types. The density plots show the complete distribution of AOS values for all mutations with >5% frequency in each cohort. The overlaid dots represent only those mutations identified as statistical outliers and classified into Grades 1, 2A, 2B, and 3. This four-tier grading system classifies mutations based on their outlier status and prior driver knowledge, as detailed in the section “Methods.” Key Grade 1 driver mutations are labeled with their gene names.
Figure 4.

Tissue-specificity of TP53 mutation and candidate driver mutations. (A) Radar plots displaying the evaluation of TP53 mutations across 12 cancer types. Each sector of the radar plot represents a different functional category: PB, EP, IM, PE, and AD. (B) Distribution of the AOS across 12 cancer types. The density plots show the complete distribution of AOS values for all mutations with >5% frequency in each cohort. The overlaid dots represent only those mutations identified as statistical outliers and classified into Grades 1, 2A, 2B, and 3. This four-tier grading system classifies mutations based on their outlier status and prior driver knowledge, as detailed in the section “Methods.” Key Grade 1 driver mutations are labeled with their gene names.

In HCC, TP53 mutations are a poor-prognosis biomarker, whereas in LSCC, they are a better-prognosis biomarker; no significant association was observed in the other 10 cancer types. Additionally, more genes showed increased expression in BRCA and HCC compared to other cancers, and immune cell populations were notably elevated in BRCA and COAD. Moreover, HCC and BRCA showed greater upregulation of signaling pathways than other cancer types, while GBM and OV showed a greater number of PTMs reaching outlier levels. Using Panorama to analyse TP53 mutations at the pan-cancer level enables the identification of therapeutic targets and prognostic evaluation for specific cancer types, thereby uncovering tissue-dependent heterogeneity in TP53 mutation mechanisms.

Driver mutation discovery

Identifying driver mutations in cancer data is challenging. Most tools select driver mutations based on factors such as mutation recurrence across the genome, their localization within functional domains, and the surrounding context [74]. We assumed that driver mutations would be associated with a greater influence on prognosis, a higher number of overexpressed genes, overactivated immune cell populations, hyperactivated signaling pathways, and more actionable PTM targets than passenger mutations (see the section “Methods”). Therefore, we calculated the average values of the five Panorama-measured factors for all mutations with a frequency of at least 5% across the 12 tissue types.

At the pan-cancer level, we identified upper outliers (displayed in blue) based on each mutation’s AOS and highlighted the known driver mutations for each cancer type in red (Fig. 4B). Across all 12 cancer types, we divided the known driver mutations into novel candidates. We identified at least one known driver mutation for each cancer type and proposed several novel mutations (Table S2). Notably, although not previously reported, the SLC8A3 mutation in LUAD was associated with significant increases in the immune score, estimate score, and stroma score among patients harboring the mutation. Unlike many existing tools for driver mutation discovery, our approach using AOS enables the detection of novel mutations. Through this approach, we propose a novel method for identifying driver mutations using cancer proteogenomic data that comprehensively considers both mutation frequency and downstream mutation effects.

Discussion and perspectives

In Panorama, we aimed to elucidate the diverse impacts of single somatic mutations on cancer by integrating proteogenomic data, including clinical data, from over 10 000 patients with various cancer types. We performed robust analysis to highlight the intricate biological consequences of specific mutations. Over the years, numerous web-based databases have been developed for comprehensive pan-cancer analyses. These include tools that focus on comparisons between normal and tumor tissues [75], broad analysis platforms in cancer genomics [76], and extensive association analysis tools that leverage proteogenomic data [77]. Panorama’s unique contribution is its mutation-centric group comparison framework. It provides a single, streamlined visualization that allows for a simultaneous comparison of how a single somatic mutation (versus the WT group) affects three layers—RNA, proteome, and phosphoproteome. This function is a core feature of Panorama, as it intuitively reveals how a genomic alteration may bypass the transcriptome and directly enact its function at the proteome or phosphoproteome level, a specific question not directly streamlined by other correlation-based platforms. Furthermore, the ability to analyse and visualize associations with derived data, such as immune signaling or signaling activity, in a single heatmap is a unique advantage of Panorama.

Various genomic alterations, in addition to single somatic mutations—such as copy number or structural changes, WGD, fusions, and aberrant methylation—can underlie the changes observed in our PB, EP, IM, PE, and AD metrics. In particular, the recent study by Chang et al. [50] demonstrated that WGD does not act uniformly but can be classified into different types based on the level of genomic instability, which varies in a cancer-type-specific manner. Crucially, they found that these WGD types are distinctly associated with specific “kinase phosphorylation networks” and cell cycle pathway activation. This finding suggests that large-scale genomic events like WGD may ultimately converge on specific phosphorylation pathways to drive tumorigenesis, highlighting the necessity of evaluating WGD as a potential driver of the “Actionable Driver (AD)” events identified in Panorama. Therefore, we plan to expand Panorama to include various other genomic events, such as WGD, to provide a more comprehensive functional analysis and will also continuously integrate new data from publicly available databases.

Furthermore, co-occurring or mutually exclusive mutations can act as confounding factors when analysing the impact of a single mutation [78, 79]. Panorama provides lists of co-occurring and mutually exclusive mutations across all cohorts to mitigate this issue, enabling a more accurate interpretation of the results (Table S3). Recently, it was shown that even when the same EGFR amplification is found in recurrent GBM, treatment resistance arises from decreased EGFR activation and increased BRAF activation at the protein and phosphorylation levels [80]. Panorama will not be limited to primary cancer, and we plan to add functionality to compare “sensitive vs. resistant” functions. Additionally, among the newly identified subtypes of nonsmall cell lung cancer, some have been defined by phosphorylation features and others by immune scores, which were then in turn used to predict therapeutic response [81]. We are planning future research to move beyond single-mutation analysis and define new subtypes based on the AOS, which reflects all five features of Panorama.

Our comprehensive proteogenomic database, Panorama, integrates analyses from numerous studies to advance our understanding of how single somatic mutations differentially affect cancer progression across diverse tissues. By integrating diverse data types, we identified key PBs, elucidated the mechanisms of immune modulation, and identified potentially ADs. This approach enables precise characterization of somatic mutation functions, facilitates a deeper understanding of cancer heterogeneity, and enhances patient treatment efficacy. Furthermore, it is provided as a web-based database to ensure easy access for researchers.

Conflict of interest

None declared.

Funding

This research was supported by a grant (2017M3A9A7050615) from the Ministry of Science and ICT (MSIT) and the KRIBB Research Initiative Program (KGM5192531).

Data availability

Database files in Parquet format and code are freely available for download on GitHub (https://github.com/prosium/panorama).

References

1.

Hanahan
 
D
.
Hallmarks of cancer: new dimensions
.
Cancer Discov
.
2022
;
12
:
31
46
.

2.

Hanahan
 
D
,
Weinberg
 
RA
.
Hallmarks of cancer: the next generation
.
Cell
.
2011
;
144
:
646
74
.

3.

Sanchez-Vega
 
F
,
Mina
 
M
,
Armenia
 
J
 et al.  
Oncogenic signaling pathways in The Cancer Genome Atlas
.
Cell
.
2018
;
173
:
321
337 e10
.

4.

Hoadley
 
KA
,
Yau
 
C
,
Hinoue
 
T
 et al.  
Cell-of-origin patterns dominate the molecular classification of 10,000 tumors from 33 types of cancer
.
Cell
.
2018
;
173
:
291
304 e6
.

5.

Davis
 
CF
,
Ricketts
 
CJ
,
Wang
 
M
 et al.  
The somatic genomic landscape of chromophobe renal cell carcinoma
.
Cancer Cell
.
2014
;
26
:
319
30
.

6.

Ciriello
 
G
,
Gatza
 
ML
,
Beck
 
AH
 et al.  
Comprehensive molecular portraits of invasive lobular breast cancer
.
Cell
.
2015
;
163
:
506
19
.

7.

Cancer Genome Atlas Research Network
 et al.  
Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia
.
N Engl J Med
.
2013
;
368
:
2059
74
.

8.

Cancer Genome Atlas Research Network
.
Integrated genomic characterization of endometrial carcinoma
.
Nature
.
2013
;
497
:
67
73
.

9.

Cancer Genome Atlas Research Network
 et al.  
Integrated genomic and molecular characterization of cervical cancer
.
Nature
.
2017
;
543
:
378
84
.

10.

Cancer Genome Atlas Research Network
.
The molecular taxonomy of primary prostate cancer
.
Cell
.
2015
;
163
:
1011
25
.

11.

Cancer Genome Atlas Research Network
.
Comprehensive molecular characterization of urothelial bladder carcinoma
.
Nature
.
2014
;
507
:
315
22
.

12.

Cancer Genome Atlas Research Network
.
Comprehensive molecular characterization of gastric adenocarcinoma
.
Nature
.
2014
;
513
:
202
209
.

13.

Cancer Genome Atlas Research Network
.
Comprehensive molecular profiling of lung adenocarcinoma
.
Nature
.
2014
;
511
:
543
50
.

14.

Cancer Genome Atlas Research Network
.
Integrated genomic characterization of papillary thyroid carcinoma
.
Cell
.
2014
;
159
:
676
90
.

15.

Cancer Genome Atlas Research Network
.
Comprehensive molecular characterization of clear cell renal cell carcinoma
.
Nature
.
2013
;
499
:
43
49
.

16.

Cancer Genome Atlas Research Network
.
Comprehensive genomic characterization of squamous cell lung cancers
.
Nature
.
2012
;
489
:
519
25
.

17.

Cancer Genome Atlas Research Network
.
Integrated genomic analyses of ovarian carcinoma
.
Nature
.
2011
;
474
:
609
15
.

18.

Cancer Genome Atlas Research Network
.
Comprehensive genomic characterization defines human glioblastoma genes and core pathways
.
Nature
.
2008
;
455
:
1061
68
.

19.

Cancer Genome Atlas Research Network
.
Comprehensive genomic characterization of head and neck squamous cell carcinomas
.
Nature
.
2015
;
517
:
576
82
.

20.

Cancer Genome Atlas Research Network
.
Comprehensive molecular characterization of human colon and rectal cancer
.
Nature
.
2012
;
487
:
330
37
.

21.

Cancer Genome Atlas Research Network
.
Comprehensive molecular portraits of human breast tumours
.
Nature
.
2012
;
490
:
61
70
.

22.

Brennan
 
CW
,
Verhaak
 
RGW
,
McKenna
 
A
 et al.  
The somatic genomic landscape of glioblastoma
.
Cell
.
2013
;
155
:
462
77
.

23.

Zhang
 
H
,
Liu
 
T
,
Zhang
 
Z
 et al.  
Integrated proteogenomic characterization of human high-grade serous ovarian cancer
.
Cell
.
2016
;
166
:
755
65
.

24.

Zhang
 
B
,
Wang
 
J
,
Wang
 
X
 et al.  
Proteogenomic characterization of human colon and rectal cancer
.
Nature
.
2014
;
513
:
382
87
.

25.

Vasaikar
 
S
,
Huang
 
C
,
Wang
 
X
 et al.  
Proteogenomic analysis of human colon cancer reveals new therapeutic opportunities
.
Cell
.
2019
;
177
:
1035
1049 e19
.

26.

Satpathy
 
S
,
Krug
 
K
,
Jean Beltran
 
PM
 et al.  
A proteogenomic portrait of lung squamous cell carcinoma
.
Cell
.
2021
;
184
:
4348
4371 e40
.

27.

Petralia
 
F
,
Tignor
 
N
,
Reva
 
B
 et al.  
Integrated proteogenomic characterization across major histological types of pediatric brain cancer
.
Cell
.
2020
;
183
:
1962
1985 e31
.

28.

Mun
 
D-G
,
Bhin
 
J
,
Kim
 
S
 et al.  
Proteogenomic characterization of human early-onset gastric cancer
.
Cancer Cell
.
2019
;
35
:
111
124 e10
.

29.

Mertins
 
P
,
Mani
 
DR
,
Ruggles
 
KV
 et al.  
Proteogenomics connects somatic mutations to signalling in breast cancer
.
Nature
.
2016
;
534
:
55
62
.

30.

Li
 
GX
,
Chen
 
L
,
Hsiao
 
Y
 et al.  
Comprehensive proteogenomic characterization of rare kidney tumors
.
Cell Rep Med
.
2024
;
5
:
101547
.

31.

Krug
 
K
,
Jaehnig
 
EJ
,
Satpathy
 
S
 et al.  
Proteogenomic landscape of breast cancer tumorigenesis and targeted therapy
.
Cell
.
2020
;
183
:
1436
1456 e31
.

32.

Huang
 
C
,
Chen
 
L
,
Savage
 
SR
 et al.  
Proteogenomic insights into the biology and treatment of HPV-negative head and neck squamous cell carcinoma
.
Cancer Cell
.
2021
;
39
:
361
379 e16
.

33.

Hu
 
Y
,
Pan
 
J
,
Shah
 
P
 et al.  
Integrated proteomic and glycoproteomic characterization of human high-grade serous ovarian carcinoma
.
Cell Rep
.
2020
;
33
:
108276
.

34.

Gillette
 
MA
,
Satpathy
 
S
,
Cao
 
S
 et al.  
Proteogenomic characterization reveals therapeutic vulnerabilities in lung adenocarcinoma
.
Cell
.
2020
;
182
:
200
225 e35
.

35.

Gao
 
Q
,
Zhu
 
H
,
Dong
 
L
 et al.  
Integrated proteogenomic characterization of HBV-related hepatocellular carcinoma
.
Cell
.
2019
;
179
:
561
577 e22
.

36.

Dou
 
Y
,
Kawaler
 
EA
,
Cui Zhou
 
D
 et al.  
Proteogenomic characterization of endometrial carcinoma
.
Cell
.
2020
;
180
:
729
748 e26
.

37.

Clark
 
DJ
,
Dhanasekaran
 
SM
,
Petralia
 
F
 et al.  
Integrated proteogenomic characterization of clear cell renal cell carcinoma
.
Cell
.
2019
;
179
:
964
983 e31
.

38.

Cao
 
L
,
Huang
 
C
,
Cui Zhou
 
D
 et al.  
Proteogenomic characterization of pancreatic ductal adenocarcinoma
.
Cell
.
2021
;
184
:
5031
5052 e26
.

39.

Wang
 
L-B
,
Karpova
 
A
,
Gritsenko
 
MA
 et al.  
Proteogenomic and metabolomic characterization of human glioblastoma
.
Cancer Cell
.
2021
;
39
:
509
528 e20
.

40.

Malta
 
TM
,
Sokolov
 
A
,
Gentles
 
AJ
 et al.  
Machine learning identifies stemness features associated with oncogenic dedifferentiation
.
Cell
.
2018
;
173
:
338
354 e15
.

41.

Bailey
 
MH
,
Tokheim
 
C
,
Porta-Pardo
 
E
 et al.  
Comprehensive characterization of cancer driver genes and mutations
.
Cell
.
2018
;
173
:
371
385 e18
.

42.

Thorsson
 
V
,
Gibbs
 
DL
,
Brown
 
SD
 et al.  
The immune landscape of cancer
.
Immunity
.
2018
;
48
:
812
830 e14
.

43.

Li
 
Y
,
Porta-Pardo
 
E
,
Tokheim
 
C
 et al.  
Pan-cancer proteogenomics connects oncogenic drivers to functional states
.
Cell
.
2023
;
186
:
3921
3944 e25
.

44.

Liang
 
W-W
,
Lu
 
RJ-H
,
Jayasinghe
 
RG
 et al.  
Integrative multi-omic cancer profiling reveals DNA methylation patterns associated with therapeutic vulnerability and cell-of-origin
.
Cancer Cell
.
2023
;
41
:
1567
1585 e7
.

45.

Geffen
 
Y
,
Anand
 
S
,
Akiyama
 
Y
 et al.  
Pan-cancer analysis of post-translational modifications reveals shared patterns of protein regulation
.
Cell
.
2023
;
186
:
3945
3967 e26
.

46.

Wang
 
JM
,
Hong
 
R
,
Demicco
 
EG
 et al.  
Deep learning integrates histopathology and proteogenomics at a pan-cancer level
.
Cell Rep Med
.
2023
;
4
:
101173
.

47.

Li
 
Y
,
Dou
 
Y
,
Da Veiga Leprevost
 
F
 et al.  
Proteogenomic data and resources for pan-cancer analysis
.
Cancer Cell
.
2023
;
41
:
1397
406
.

48.

Shi
 
Z
,
Lei
 
JT
,
Elizarraras
 
JM
 et al.  
Mapping the functional network of human cancer through machine learning and pan-cancer proteogenomics
.
Nat Cancer
.
2025
;
6
:
205
22
.

49.

Petralia
 
F
,
Ma
 
W
,
Yaron
 
TM
 et al.  
Pan-cancer proteogenomics characterization of tumor immunity
.
Cell
.
2024
;
187
:
1255
1277 e27
.

50.

Chang
 
E
,
Kim
 
S‐J
,
Hwang
 
HS
 et al.  
Pan-cancer proteogenomic landscape of whole-genome doubling reveals putative therapeutic targets in various cancer types
.
Clin Transl Med
.
2024
;
14
:
e1796
.

51.

Savage
 
SR
,
Yi
 
X
,
Lei
 
JT
 et al.  
Pan-cancer proteogenomics expands the landscape of therapeutic targets
.
Cell
.
2024
;
187
:
4389
4407 e15
.

52.

Xu
 
J-Y
,
Zhang
 
C
,
Wang
 
X
 et al.  
Integrative proteomic characterization of human lung adenocarcinoma
.
Cell
.
2020
;
182
:
245
261 e17
.

53.

Dong
 
B
,
Xu
 
J-Y
,
Huang
 
Y
 et al.  
Integrative proteogenomic profiling of high-risk prostate cancer samples from Chinese patients indicates metabolic vulnerabilities and diagnostic biomarkers
.
Nat Cancer
.
2024
;
5
:
1427
47
.

54.

Chen
 
Y-J
,
Roumeliotis
 
TI
,
Chang
 
Y-H
 et al.  
Proteogenomics of non-smoking lung cancer in East Asia delineates molecular signatures of pathogenesis and progression
.
Cell
.
2020
;
182
:
226
244 e17
.

55.

Chen
 
J
,
Yang
 
H
,
Teo
 
ASM
 et al.  
Genomic landscape of lung adenocarcinoma in East Asians
.
Nat Genet
.
2020
;
52
:
177
86
.

56.

Cancer Genome Atlas Research Network
.
Integrated genomic characterization of pancreatic ductal adenocarcinoma
.
Cancer Cell
.
2017
;
32
:
185
203 e13
.

57.

Aran
 
D
,
Hu
 
Z
,
Butte
 
AJ
.
xCell: digitally portraying the tissue cellular heterogeneity landscape
.
Genome Biol
.
2017
;
18
:
220
.

58.

Newman
 
AM
,
Liu
 
CL
,
Green
 
MR
 et al.  
Robust enumeration of cell subsets from tissue expression profiles
.
Nat Methods
.
2015
;
12
:
453
57
.

59.

Yoshihara
 
K
,
Shahmoradgoli
 
M
,
Martínez
 
E
 et al.  
Inferring tumour purity and stromal and immune cell admixture from expression data
.
Nat Commun
.
2013
;
4
:
2612
.

60.

Sturm
 
G
,
Finotello
 
F
,
Petitprez
 
F
 et al.  
Comprehensive evaluation of transcriptome-based cell-type quantification methods for immuno-oncology
.
Bioinformatics
.
2019
;
35
:
i436
45
.

61.

Schubert
 
M
,
Klinger
 
B
,
Klünemann
 
M
 et al.  
Perturbation-response genes reveal signaling footprints in cancer gene expression
.
Nat Commun
.
2018
;
9
:
20
.

62.

Hänzelmann
 
S
,
Castelo
 
R
,
Guinney
 
J
.
GSVA: gene set variation analysis for microarray and RNA-seq data
.
BMC Bioinf
.
2013
;
14
:
7
.

63.

Subramanian
 
A
,
Tamayo
 
P
,
Mootha
 
VK
 et al.  
Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles
.
Proc Natl Acad Sci USA
.
2005
;
102
:
15545
50
.

64.

Liberzon
 
A
,
Subramanian
 
A
,
Pinchback
 
R
 et al.  
Molecular signatures database (MSigDB) 3.0
.
Bioinformatics
.
2011
;
27
:
1739
40
.

65.

Battaglia
 
TW
,
Mimpen
 
IL
,
Traets
 
JJH
 et al.  
A pan-cancer analysis of the microbiome in metastatic cancer
.
Cell
.
2024
;
187
:
2324
2335 e19
.

66.

Ghoshdastider
 
U
,
Sendoel
 
A
.
Exploring the pan-cancer landscape of posttranscriptional regulation
.
Cell Rep
.
2023
;
42
:
113172
.

67.

Feng
 
S
,
Calinawan
 
A
,
Pugliese
 
P
 et al.  
Decomprolute is a benchmarking platform designed for multiomics-based tumor deconvolution
.
Cell Rep Methods
.
2024
;
4
:
100708
.

68.

Tian
 
Y
,
Xu
 
L
,
Li
 
X
 et al.  
SMARCA4: current status and future perspectives in non-small-cell lung cancer
.
Cancer Lett
.
2023
;
554
:
216022
.

69.

Li
 
Z
,
Sun
 
Y
,
Chen
 
X
 et al.  
p53 Mutation directs AURKA overexpression via miR-25 and FBXW7 in prostatic small cell neuroendocrine carcinoma
.
Mol Cancer Res
.
2015
;
13
:
584
91
.

70.

Zavitsanou
 
A-M
,
Pillai
 
R
,
Hao
 
Y
 et al.  
KEAP1 mutation in lung adenocarcinoma promotes immune evasion and immunotherapy resistance
.
Cell Rep
.
2023
;
42
:
113295
.

71.

Shibolet
 
O
,
Giallourakis
 
C
,
Rosenberg
 
I
 et al.  
AKAP13, a RhoA GTPase-specific guanine exchange factor, is a novel regulator of TLR2 signaling
.
J Biol Chem
.
2007
;
282
:
35308
17
.

72.

Di Minin
 
G
,
Bellazzo
 
A
,
Dal Ferro
 
M
 et al.  
Mutant p53 reprograms TNF signaling in cancer cells through interaction with the tumor suppressor DAB2IP
.
Mol Cell
.
2014
;
56
:
617
29
.

73.

Wang
 
H
,
Guo
 
M
,
Wei
 
H
 et al.  
Targeting p53 pathways: mechanisms, structures, and advances in therapy
.
Signal Transduct Target Ther
.
2023
;
8
:
92
.

74.

Martínez-Jiménez
 
F
,
Muiños
 
F
,
Sentís
 
I
 et al.  
A compendium of mutational cancer driver genes
.
Nat Rev Cancer
.
2020
;
20
:
555
72
.

75.

Tang
 
Z
,
Kang
 
B
,
Li
 
C
 et al.  
GEPIA2: an enhanced web server for large-scale expression profiling and interactive analysis
.
Nucleic Acids Res
.
2019
;
47
:
W556
60
.

76.

de Bruijn
 
I
,
Kundra
 
R
,
Mastrogiacomo
 
B
 et al.  
Analysis and visualization of longitudinal genomic and clinical data from the AACR project GENIE biopharma collaborative in cBioPortal
.
Cancer Res
.
2023
;
83
:
3861
67
.

77.

Vasaikar
 
SV
,
Straub
 
P
,
Wang
 
J
 et al.  
LinkedOmics: analyzing multi-omics data within and across 32 cancer types
.
Nucleic Acids Res
.
2018
;
46
:
D956
63
.

78.

Skoulidis
 
F
,
Heymach
 
JV
.
Co-occurring genomic alterations in non-small-cell lung cancer biology and therapy
.
Nat Rev Cancer
.
2019
;
19
:
495
509
.

79.

Skoulidis
 
F
,
Byers
 
LA
,
Diao
 
L
 et al.  
Co-occurring genomic alterations define major subsets of KRAS-mutant lung adenocarcinoma with distinct biology, immune profiles, and therapeutic vulnerabilities
.
Cancer Discov
.
2015
;
5
:
860
77
.

80.

Kim
 
K-H
,
Migliozzi
 
S
,
Koo
 
H
 et al.  
Integrated proteogenomic characterization of glioblastoma evolution
.
Cancer Cell
.
2024
;
42
:
358
377 e8
.

81.

Song
 
KJ
,
Choi
 
S
,
Kim
 
K
 et al.  
Proteogenomic analysis reveals non-small cell lung cancer subtypes predicting chromosome instability, and tumor microenvironment
.
Nat Commun
.
2024
;
15
:
10164
.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data