Comprehensive data resources and analytical tools for pathological association of aminoacyl tRNA synthetases with cancer

Abstract

Mammalian cells have cytoplasmic and mitochondrial aminoacyl-tRNA synthetases (ARSs) that catalyze aminoacylation of tRNAs during protein synthesis. Despite their housekeeping functions in protein synthesis, ARSs and ARS-interacting multifunctional proteins (AIMPs) have recently been shown to play important roles in disease pathogenesis through their interactions with disease-related molecules. However, data resources and analytical tools that can be used to examine disease associations of ARS/AIMPs are lacking. Here, we developed an Integrated Database for ARSs (IDA), a resource database including cancer genomic/proteomic and interaction data of ARS/AIMPs. IDA includes mRNA expression, somatic mutation, copy number variation and phosphorylation data of ARS/AIMPs and their interacting proteins in various cancers. IDA further includes an array of analytical tools for exploration of disease association of ARS/AIMPs, identification of disease-associated ARS/AIMP interactors and reconstruction of ARS-dependent disease-perturbed network models. Therefore, IDA provides both comprehensive data resources and analytical tools for understanding potential roles of ARS/AIMPs in cancers.

Database URL: http://ida.biocon.re.kr/, http://ars.biocon.re.kr/

Introduction

Cells have an array of cytoplasmic and mitochondrial aminoacyl-tRNA synthetases (ARSs) that are involved in cellular protein synthesis. ARSs catalyze the ligation of amino acids to their cognate tRNAs during protein synthesis. Thus, ARSs have been considered as housekeepers involved in protein synthesis, being less sensitive to system perturbation compared with signal mediators or transcriptional factors that are fully dedicated to system control. However, recently, aberrant expression, mislocalization and variant formation of ARSs have been observed in various cancer cells ( 1 ). For this reason, more attention is being paid to potential roles of ARS/ARS-interacting multifunctional proteins (AIMPs) in system regulation and pathogenesis of various diseases.

Mammalian ARSs contain additional domains attached to their catalytic domains, compared with their prokaryotic counterparts ( 2 ). Using these additional domains, they interact with other molecules to form diverse complexes, thereby affecting activities of disease-related cellular processes ( 3 ). More intriguingly, several mammalian ARSs form a macromolecular protein complex with three non-enzymatic factors named AIMPs ( 4 , 5 ). Although AIMPs are not enzymes like ARSs, they are considered members of the ARS community because they play a critical scaffolding role in the structural integrity of the multi-tRNA synthetase complex (MSC). Thus, we consider the three AIMPs together with the 20 cytosolic ARSs as the same community group. A growing volume of evidence shows that ARS/AIMPs are closely associated with disease pathogenesis through their interactions with disease-related molecules ( 1 , 6 ). These data indicate that alterations in interactions of ARS/AIMPs, together with abnormal expression and mislocalization of ARS/AIMPs, can lead to perturbation of disease-related cellular networks.

Huge amounts of genomic, transcriptomic, proteomic and interaction data for diverse diseases have been accumulated. Recently, the importance of ARS/AIMPs in cancer pathogenesis has been addressed in a series of publications ( 7–14 ). Thus, we previously reviewed potential roles of ARS/AIMPs in various cancers by reanalyzing previously published global datasets ( 1 ). However, exploration of abnormal expression and interaction of ARS/AIMPs in these diseases is limited due to the lack of comprehensive resources that can be used to effectively navigate and explore alterations in genomic, transcriptomic and proteomic data and also in interactions of ARS/AIMPs in various diseases. Furthermore, there is a severe lack of analytical tools that can be used to analyze associations of ARS/AIMPs with human diseases and to reconstruct ARS/AIMP-dependent disease-perturbed network models. Here, we present an Integrated Database for ARS/AIMPs (IDA) that provides (i) genomic, transcriptomic and proteomic data [mRNA expression, somatic mutation, copy number variation (CNV) and phosphorylation data] of ARS/AIMPs and their interactors in various cancers and (ii) protein–protein interactions (PPIs) of ARS/AIMPs.

Materials and methods

Database and website infrastructure

IDA is based on a relational database management system for data storage and integration with external data sources. Gene expression and protein-interaction data are stored in a MySQL database. Identifiers of all probes on the microarray types for gene expression data, Ensembl and dbSNP identifiers for genomic mutation data, International Protein Index or UniProtKB/Swiss-Prot identifiers for protein expression or post-translational modifications (PTMs) and all identifiers for PPIs in the interactome databases have been converted into NCBI Entrez identifiers. Using the relationships among different identifiers, we have optimized the database schema ( Supplementary Figure S1 ) to improve the response time of the database when gene/protein expression, genomic mutation, PTM and PPI data are searched for. Moreover, IDA provides a web-based user interface for data exploration and visualization. Cancer-associated differential expression, mutation and CNVs of ARS/AIMPs and their first and second neighbors are summarized using several visualization tools: two-way heat maps, a pinwheel-shaped heat map and bar graphs, respectively. These data for individual genes can be visualized using profile plots or extracted in a tabular format. Finally, two network analysis tools, ‘Cancer-associated network’ and ‘Functional modules’, are provided. The IDA web site is built with PHP 5.3.3, SWFObject v1.5, MATLAB R2011a, MySQL 5.1.71, CentOS release 6.5 and Apache/2.2.15 (Unix).

Identification of differentially expressed genes

IDA includes 40 cancer gene expression datasets. For the 36 cancer datasets generated from Affymetrix microarray platforms, the log₂ intensities in each gene expression dataset were normalized using the GC-RMA normalization method ( 15 ). For the other four datasets generated from Illumina, ABI and custom microarray platforms, the log₂ intensities were normalized using the quantile normalization method ( 16 ). For each dataset, the expressed genes were then identified using a Gaussian mixture model method as previously described ( 17 ). Briefly, in the mixture modeling, the distribution of the observed probe intensity (probeset intensity for Affymetrix data) was fitted by two Gaussian probability density functions: one for expressed genes and the other for non-expressed genes. We then selected as expressed genes those whose maximum intensities across all samples in the dataset were higher than the threshold intensity at which the two fitted Gaussian probability density functions meet. To identify differentially expressed genes (DEGs) between cancer and normal samples, an integrative statistical hypothesis test was performed ( 18 , 19 ). Briefly, Student's t-test and the log₂-median-ratio test were applied to calculate T values and log₂-median-ratios for all the genes. Empirical distributions of the null hypothesis were then estimated by performing random permutations of samples and then applying kernel density estimation ( 20 ) to the T values and log₂-median-ratios resulting from the random permutations. Next, the adjusted P values of each gene for the individual tests were computed by a two-tailed test using the empirical distributions. The P values from the two tests were then combined using Stouffer’s method ( 19 ). The DEGs were identified as the genes with P ≤ 0.05. Finally, as the representative P values of the genes in multiple datasets of each cancer type, the combined P values of genes in individual datasets were summarized using Stouffer’s method ( 19 ).
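The two P-value steps described above (a two-tailed empirical P value from permuted statistics, followed by Stouffer combination) can be illustrated with a minimal sketch. It is not the IDA implementation: the function names are hypothetical, the null distribution is used directly rather than smoothed by kernel density estimation, and the unweighted one-sided form of Stouffer's method is assumed.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, mean 0 and sd 1


def empirical_two_tailed_p(observed, null_values):
    """Two-tailed empirical P value of `observed` against permuted statistics.

    Simplification: raw permutation values are counted directly; the text
    describes an additional kernel density smoothing step that is omitted.
    """
    extreme = sum(1 for v in null_values if abs(v) >= abs(observed))
    # Add-one correction keeps the P value strictly above zero.
    return (extreme + 1) / (len(null_values) + 1)


def stouffer_combine(p_values, eps=1e-12):
    """Combine P values by unweighted Stouffer's method (assumed variant)."""
    z_scores = []
    for p in p_values:
        p = min(max(p, eps), 1.0 - eps)  # keep inv_cdf inside (0, 1)
        z_scores.append(_STD_NORMAL.inv_cdf(1.0 - p))
    z_combined = sum(z_scores) / sqrt(len(z_scores))
    return 1.0 - _STD_NORMAL.cdf(z_combined)


if __name__ == "__main__":
    # Hypothetical P values from the t-test and the log2-median-ratio test.
    print(stouffer_combine([0.05, 0.05]))
```

The same `stouffer_combine` call could be reused to summarize the per-dataset P values of a gene within one cancer type, which is how the text describes the second use of Stouffer's method.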

Deregulation and co-association scores for nodes and edges in the network models

For the nodes (ARS/AIMPs and their first cancer-associated neighbors) and edges in the ARS/AIMP cancer-associated network, two types of cancer association scores were estimated. First, the deregulation score of each node denotes the representative degree of deregulation in the 10 cancer types for which gene expression data were collected. The combined P values of each node in the 10 cancer types were first calculated. These P values were then transformed to Z values using the inverse standard normal cumulative distribution. For each node, the cumulative sum of the Z values of its direct interactors was calculated in each cancer type ( 21 ). Finally, the average of the cumulative sums of Z values over the 10 cancer types was computed as the deregulation score. Second, the co-association score between an ARS/AIMP and its first neighbor, representing the extent of co-deregulation in the 10 cancer types, was estimated as ‘the number of cancer types where the interacting ARS/AIMP-first neighbor pair is commonly differentially expressed’ divided by ‘the total number of cancer types’.
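A small sketch may make the two scores concrete. It assumes that the cumulative sum runs over the direct interactors only (the text leaves open whether the node itself is included) and that P values are converted with Z = Φ⁻¹(1 − P). All function and variable names are hypothetical.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()


def p_to_z(p, eps=1e-12):
    """Convert a P value to a Z value via the inverse standard normal CDF."""
    p = min(max(p, eps), 1.0 - eps)  # guard against P values of exactly 0 or 1
    return _STD_NORMAL.inv_cdf(1.0 - p)


def deregulation_score(node, neighbors, p_by_cancer):
    """Average, over cancer types, of the summed Z values of direct interactors.

    neighbors:   dict mapping node -> iterable of direct interactors
    p_by_cancer: dict mapping cancer type -> {gene: combined P value}
    """
    per_cancer_sums = []
    for p_values in p_by_cancer.values():
        total = sum(p_to_z(p_values[n]) for n in neighbors[node] if n in p_values)
        per_cancer_sums.append(total)
    return sum(per_cancer_sums) / len(per_cancer_sums)


def co_association_score(gene_a, gene_b, degs_by_cancer):
    """Fraction of cancer types in which both genes are differentially expressed.

    degs_by_cancer: dict mapping cancer type -> set of DEGs in that cancer type
    """
    shared = sum(1 for degs in degs_by_cancer.values()
                 if gene_a in degs and gene_b in degs)
    return shared / len(degs_by_cancer)
```

For instance, with four cancer types of which two list both members of a pair as DEGs, `co_association_score` returns 0.5.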

Identification of key modules in the ARS/AIMP network model

Key modules in the ARS/AIMP network were identified using the previously reported random walk with restart (RWR) algorithm ( 22 ). Briefly, in the RWR algorithm, we used all nodes in the network as the seeds and assigned equal initial probabilities (p^t at t = 0) to the seeds. Random walks then started with a transition probability of 0.75 and stopped when the L₁ norm of the difference between p^t and p^(t+1) (random walk probabilities of the nodes at t and t + 1, respectively) became <10⁻⁶. To evaluate the significance of p^t, we repeated the random walk process for randomized networks in which the edges were randomly permuted, estimated an empirical null distribution of p^t from the randomized networks and computed P values of p^t for the nodes in the real network by a right-sided test using the empirical null distribution. From the ARS/AIMP network, we finally identified a set of subnetworks including the nodes with P < 0.05 and defined the subnetworks as key modules.
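The iteration can be sketched as follows. This is an illustration under stated assumptions, not the code behind IDA: the value 0.75 is interpreted as the probability of moving to a neighbor (so the restart probability is 0.25), edges are unweighted, and the function name and the `max_iter` safeguard are hypothetical. The permutation-based significance test is not shown.

```python
def random_walk_with_restart(adjacency, seeds, walk_prob=0.75,
                             tol=1e-6, max_iter=10000):
    """Iterate p(t+1) = walk_prob * W p(t) + (1 - walk_prob) * p(0).

    adjacency: dict mapping node -> list of neighbors (undirected graph, with
               every neighbor also present as a key)
    seeds:     collection of seed nodes, given equal initial probability
    """
    nodes = sorted(adjacency)
    p0 = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    p = dict(p0)
    for _ in range(max_iter):
        # Restart term: a fraction of the mass returns to the seeds.
        nxt = {n: (1.0 - walk_prob) * p0[n] for n in nodes}
        for u in nodes:
            nbrs = adjacency[u]
            if not nbrs:
                nxt[u] += walk_prob * p[u]  # isolated node keeps its mass
                continue
            share = walk_prob * p[u] / len(nbrs)
            for v in nbrs:
                nxt[v] += share
        l1_diff = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if l1_diff < tol:  # stopping rule: L1 norm below 1e-6
            break
    return p


if __name__ == "__main__":
    # Toy path graph a - b - c with all nodes used as seeds.
    toy = {"a": ["b"], "b": ["a", "c"], "c": ["b"]}
    print(random_walk_with_restart(toy, seeds=set(toy)))
```

In the toy path graph the middle node ends with the highest probability, which is the behavior the module search relies on: well-connected nodes accumulate probability mass.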

Identification of evolutionarily conserved motifs in the ARS/AIMP network model

To identify evolutionarily conserved regulatory motifs, we first mapped the nodes in the ARS/AIMP network model for each species to the ortholog groups in eggNOG ( 23 ) and then removed redundant interactions and self-interactions arising from many-to-one mapping. In the network model for each species, regulatory motifs were searched using the ortholog groups, and the motifs shared among the species were identified as evolutionarily conserved ones. The interactions among the ortholog groups in the conserved motifs were then converted to all the possible protein interactions. After the conversion, only the motifs composed of interactions existing in the real interactome were selected. Finally, among them, the motifs including at least one of the ARS/AIMPs were selected as the final list of evolutionarily conserved motifs.
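The first steps of this procedure (ortholog-group mapping, removal of self-interactions and duplicates, and intersection of motifs across species) can be sketched as below. The motif type is an assumption: fully connected three-node motifs are used as a stand-in because the motif classes searched are not specified here. Identifiers such as `OG1` are placeholders, and the later steps (conversion back to protein interactions and filtering for ARS/AIMPs) are not shown.

```python
from itertools import combinations


def to_group_edges(ppis, protein_to_group):
    """Map protein-level PPIs to ortholog-group edges.

    Self-interactions created by many-to-one mapping are dropped, and storing
    edges as frozensets in a set removes redundant interactions.
    """
    edges = set()
    for a, b in ppis:
        ga, gb = protein_to_group.get(a), protein_to_group.get(b)
        if ga is None or gb is None or ga == gb:
            continue
        edges.add(frozenset((ga, gb)))
    return edges


def triangles(edges):
    """Enumerate fully connected three-node motifs (assumed motif type)."""
    nbrs = {}
    for edge in edges:
        a, b = tuple(edge)
        nbrs.setdefault(a, set()).add(b)
        nbrs.setdefault(b, set()).add(a)
    found = set()
    for node, linked in nbrs.items():
        for x, y in combinations(sorted(linked), 2):
            if y in nbrs.get(x, ()):
                found.add(frozenset((node, x, y)))
    return found


def conserved_motifs(edges_by_species, reference="human"):
    """Motifs of the reference species that recur in each other species."""
    ref_motifs = triangles(edges_by_species[reference])
    return {species: ref_motifs & triangles(edges)
            for species, edges in edges_by_species.items()
            if species != reference}
```

Working at the ortholog-group level is what makes motifs from different species directly comparable; the price is that each conserved motif must afterwards be checked against the real protein-level interactome, as the text describes.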

Results and discussion

Comprehensive data resources of ARS/AIMPs and their interactors in IDA

Cancer-associated genes

Recently, huge amounts of global genomic, transcriptomic and proteomic data for cells or tissues of various cancers have been generated and deposited into numerous databases. Although these data can be useful to understand the association of ARS/AIMPs with cancers, different types of global data have been stored in separate databases, which makes analyses of ARS/AIMPs inefficient. To resolve this problem, we collected genomic, transcriptomic, proteomic and interaction data of ARS/AIMPs and their interactors from numerous databases and integrated them into IDA. To facilitate analyses of the association of ARS/AIMPs with cancers, we first defined cancer-associated genes (CAGs) using the NCI cancer gene index ( Figure 1 , upper left). Among 6955 human genes in the NCI cancer gene index, we selected 3501 CAGs with the following attributes: (i) cancer associations of the genes should be validated by manual curation of the literature; (ii) the genes should have no conflicting indications regarding their cancer associations; (iii) cancer associations of the genes should be experimentally validated in human samples and (iv) the genes should include one of the following role codes: Chemical_or_Drug_Has_Mechanism_Of_Action, Chemical_or_Drug_Has_Study_Therapeutic_Use_For, Chemical_or_Drug_FDA_Approved_for_Disease, Chemical_or_Drug_Has_Accepted_Therapeutic_Use_For, Gene_Malfunction_Associated_With_Disease and Gene_Anormaly_has_Disease-Related_Function.
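The four selection criteria amount to a record filter, sketched below. The record layout (`gene`, `manually_curated`, `conflicting`, `validated_in_human`, `role_codes`) is hypothetical and does not reflect the actual file format of the cancer gene index. As a simplification, a gene is kept if any one of its records passes all four checks. The role codes are spelled as given in the text.

```python
ACCEPTED_ROLE_CODES = frozenset({
    "Chemical_or_Drug_Has_Mechanism_Of_Action",
    "Chemical_or_Drug_Has_Study_Therapeutic_Use_For",
    "Chemical_or_Drug_FDA_Approved_for_Disease",
    "Chemical_or_Drug_Has_Accepted_Therapeutic_Use_For",
    "Gene_Malfunction_Associated_With_Disease",
    "Gene_Anormaly_has_Disease-Related_Function",
})


def passes_cag_criteria(record):
    """Apply criteria (i)-(iv) to one record (hypothetical field names)."""
    return (
        record["manually_curated"]                                  # (i)
        and not record["conflicting"]                               # (ii)
        and record["validated_in_human"]                            # (iii)
        and bool(ACCEPTED_ROLE_CODES & set(record["role_codes"]))   # (iv)
    )


def select_cags(records):
    """Return the sorted gene symbols that satisfy all four criteria."""
    return sorted({r["gene"] for r in records if passes_cag_criteria(r)})
```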

Figure 1.

Data resources in IDA. IDA contains CAGs and genomic, transcriptomic, proteomic and PPI data for ARS/AIMPs and their first and second neighbor CAGs. First, IDA provides 3501 CAGs selected from the 6955 genes in the NCI cancer gene index based on the filtering criteria (top left). See text for the criteria. Second, IDA also provides genomic data (CNVs and somatic mutations) for ARS/AIMPs and their neighbors in diverse cancer types, which were collected from CanGEM, Tumorscape, CCLE and COSMIC (bottom left). Third, IDA contains gene expression data of ARS/AIMPs and their neighbors for 10 cancer types, as well as P values and log₂-fold-changes that represent differential expression of the genes in the 10 cancer types (bottom right). IDA further contains proteomic data (protein abundance and PTMs) collected from dbDEPC, PHOSIDA and PhosphoSitePlus. Finally, IDA provides PPIs of ARS/AIMPs and their neighbors collected from 15 interactome databases.

CAGs interacting with ARS/AIMPs

Mammalian ARS/AIMPs have additional domains attached to their catalytic domains ( 2 ). They interact with other molecules through the additional domains, thereby affecting activities of cancer-related cellular processes ( 3 ). We therefore next identified the CAGs that interact with ARS/AIMPs among the 3501 CAGs. To this end, we collected 112 596 PPIs from the following 15 interactome databases ( Figure 1 , upper right): (i) curated PPIs for each of which experimental evidence was previously reported from human protein reference database ( 24 ), Kyoto Encyclopedia of Genes and Genomes ( 25 ), Biomolecular interaction network database (BIND) ( 26 ), Database of Interacting Proteins ( 27 ), Biological General Repository for Interaction Datasets ( 28 ), Reactome ( 29 ), Molecular INTeraction database ( 30 ), IntAct ( 31 ), InnateDB ( 32 ), Search Tool for the Retrieval of Interacting Genes/Proteins (STRING, v9.0) ( 33 ) and PharmDB ( 34 ) and (ii) predicted PPIs based on paralog- and ortholog-based PPI prediction and text mining from STRING, BIND, interologous interaction database (I2D) ( 35 ), online predicted human interaction database ( 36 ), human PPI map (HiMAP) ( 37 ) and human PPI prediction ( 38 ). In addition, the resources that integrate PPIs in individual interactome databases, such as PSICQUIC (EBI) ( 39 ) and NCBI PPI databases, were also used. On the basis of the collected PPIs, we identified 123 first neighbor CAGs and 1293 second neighbor CAGs of ARS/AIMPs among the 3501 CAGs.
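Identifying first and second neighbor CAGs from a pooled PPI list is a two-step neighborhood expansion, sketched below. One point is an assumption rather than something stated in the text: second neighbors are reached only through first neighbors that are themselves CAGs. The function name and gene symbols are placeholders.

```python
def neighbor_cags(ppis, seeds, cags):
    """Return (first, second) neighbor CAG sets for the seed ARS/AIMPs.

    ppis:  iterable of (gene_a, gene_b) pairs pooled from all databases
    seeds: ARS/AIMP gene symbols
    cags:  set of cancer-associated genes
    """
    adjacency = {}
    for a, b in ppis:
        if a == b:
            continue  # ignore self-interactions
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)

    seeds = set(seeds)
    first = set()
    for seed in seeds:
        first |= adjacency.get(seed, set())
    first = (first & cags) - seeds

    second = set()
    for gene in first:  # assumption: expand only through first-neighbor CAGs
        second |= adjacency.get(gene, set())
    second = (second & cags) - first - seeds
    return first, second
```

Because the adjacency map is built from sets, an interaction reported by several databases is counted once, which matters when 15 sources are pooled.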

Transcriptomic data of ARS/AIMPs and their interacting CAGs

Alteration of expression levels of the genes in cancers suggests their associations with a broad spectrum of cancer pathophysiological processes. To examine cancer-associated alteration in gene expression, huge amounts of gene expression profiles from diverse types of cancers have been collected ( 40–43 ). Thus, we collected 40 cancer expression profiles for 10 representative cancer types (pancreatic, prostate, lung, breast, colon, kidney, head and neck, hematopoietic and lymphoid, liver and gastric cancer) ( Figure 1 , bottom right). The 164 cancer gene expression datasets were initially collected from the Gene Expression Omnibus ( 44 ) and ArrayExpress ( 45 ). Among the 164 datasets, 40 were then selected based on the following criteria: (i) each dataset should include more than 10 normal and 10 cancer samples and (ii) each cancer type should include at least two datasets ( Supplementary Table S1 ).
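The two dataset selection criteria can be written as a short filter. This is a sketch under assumptions: "more than 10" is read as strictly greater than 10 for both sample groups, the sample-count filter is applied before the per-cancer-type count, and the dictionary keys (`n_normal`, `n_cancer`, `cancer_type`) are hypothetical.

```python
from collections import defaultdict


def select_datasets(datasets, min_samples=10, min_per_type=2):
    """Keep datasets with enough samples whose cancer type has enough datasets."""
    # Criterion (i): more than `min_samples` normal and cancer samples each.
    eligible = [d for d in datasets
                if d["n_normal"] > min_samples and d["n_cancer"] > min_samples]

    # Criterion (ii): at least `min_per_type` eligible datasets per cancer type.
    by_type = defaultdict(list)
    for d in eligible:
        by_type[d["cancer_type"]].append(d)
    return [d for group in by_type.values()
            if len(group) >= min_per_type for d in group]
```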

In each dataset, DEGs between cancerous and normal samples were identified using the integrative statistical method previously reported ( P < 0.05) ( 17 ), as described in Materials and Methods. For each cancer type, we then computed representative P values of the genes as the combined P values of the genes in multiple datasets of the cancer type ( 19 ) using Stouffer’s method (Materials and Methods). Also, to summarize expression changes in the 10 types of cancers, the representative log₂-fold-changes of the genes were computed as their averaged log₂-fold-changes. For comparative analysis of ARS/AIMPs and their interacting CAGs, we identified 1874 non-CAGs, as negative controls, which were not included in the cancer gene index and also showed no significant cancer-associated alterations in gene expression profiles (i.e. combined P < 0.05 in none of the 10 types of cancers). The comparison of the representative log₂-fold-changes revealed that ARS/AIMPs and their first and second neighbors in the 10 types of cancers showed significant changes in their expression levels, compared with non-CAGs ( Figure 2 ).

Figure 2.

Differential expression of ARS/AIMPs and their neighbors. IDA provides a functionality for exploration of differential expression of the four groups of the genes in the 10 cancer types: (i) ARS/AIMPs, (ii) and (iii) their first and second neighbor CAGs and (iv) non-CAGs. The color bar for each group shows sorted log₂-fold-changes of the genes in the group. Compared with non-CAGs, the ARS/AIMPs and their interactors showed more significant differential expression in the 10 cancer types. The t-test was used to compute P values representing the significance of the differences between the mean log₂-fold-change values of ARS/AIMPs and non-CAGs. Color bar denotes the gradient of log₂-fold-changes of the genes. HLT: hematopoietic and lymphoid tissue.

Genomic data of ARS/AIMPs and their interacting CAGs

Recently, a number of studies have shown that somatic mutations and CNVs are associated with cancer-related pathophysiological processes ( 40–43 , 46 ). Thus, to understand cancer-related alterations of ARS/AIMPs and their interactors in somatic mutations and CNVs, we first obtained CNV data for nine cancer types (skin, gastric, colon, head and neck, hematopoietic and lymphoid, lung, bone, soft tissue and uterine) from CanGEM ( 47 ) and also from Tumorscape ( 48 ) for 14 cancer types (leukemia and breast, colorectal, esophageal squamous, lung, liver, ovarian, prostate and renal cancers and medulloblastoma, melanoma, myeloproliferative disorder, glioma and sarcoma) ( Figure 1 , bottom left; Supplementary Table S2 ). Frequencies of gain and loss for each gene in individual types of cancers were pre-computed as described previously ( 47 ) and stored in IDA. We then obtained somatic mutations for diverse cancer types from the Cancer Cell Line Encyclopedia (CCLE) ( 49 ) and the Catalogue of Somatic Mutations in Cancer (COSMIC) ( 50 ). IDA contains a total of 709 008 somatic mutations including missense, indel, frameshift and nonsense mutations (1452 for ARS/AIMPs, 587 125 for their interacting CAGs and 120 431 for non-CAGs).

Using these data, we compared the degrees of CNV alteration in ARS/AIMPs, first and second neighbor CAGs and non-CAGs ( Figure 3 A). ARS/AIMPs and their interactors showed significant gains and losses of copy numbers in nine representative cancers, whereas little variation was observed in non-CAGs. We further examined gains and losses of ARS/AIMPs using the CNV data obtained from CCLE. In total, 18 cases of TARS amplifications were found in lung cancer cells ( Figure 3 B). High frequencies of NARS deletions were observed in several cancer types, such as large intestine, lung and pancreas cancer cells ( Figure 3 C). Notably, ∼23% of esophagus cancer cells contained NARS copy number deletions.
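As an illustration of what a per-gene gain or loss frequency means, the sketch below computes the fraction of samples carrying a gain or a loss for each gene. It assumes that copy number calls are already available per sample as +1 (gain), -1 (loss) or 0 (neutral); how those calls are derived from raw data follows the cited procedure and is not reproduced here. The function name and input layout are hypothetical.

```python
def cnv_frequencies(calls_by_gene):
    """Fraction of samples with a copy number gain or loss, per gene.

    calls_by_gene: dict mapping gene -> list of per-sample calls, where
                   +1 = gain, -1 = loss and 0 = no change (assumed encoding)
    """
    frequencies = {}
    for gene, calls in calls_by_gene.items():
        n_samples = len(calls)
        if n_samples == 0:
            continue  # no samples, so no frequency can be defined
        frequencies[gene] = {
            "gain": sum(1 for c in calls if c > 0) / n_samples,
            "loss": sum(1 for c in calls if c < 0) / n_samples,
        }
    return frequencies
```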

Figure 3.

Alterations of copy numbers of ARS/AIMPs and their neighbors. ( A ) Average frequencies of gains (amplification) and losses (deletion) of copy numbers in the four groups of the genes in nine representative cancer types using CNV data in CanGEM: (i) ARS/AIMPs, (ii) and (iii) first and second neighbor CAGs and (iv) non-CAGs. The frequency was defined by the number of amplification cases for the genes in each group divided by the total number of the genes in the group. ( B ) Amplification frequencies of ARS/AIMPs in lung cancer cell lines using CNV data in CCLE. ( C ) Heat map showing deletion frequencies of ARS/AIMPs in 16 different types of cancer cell lines using CNV data in CCLE. Color bar shows the gradient of deletion frequency. CNS, central nervous system; HLT, hematopoietic and lymphoid tissue.

We also compared somatic mutations of ARS/AIMPs, first and second neighbor CAGs and non-CAGs ( Figure 4 ). The CAG neighbors showed higher frequencies of somatic mutations (average relative frequency = 5.3) than non-CAGs (average relative frequency = 0.35) in all the cancer types examined. Interestingly, the mutation rates of ARS/AIMPs (average relative frequency = 0.35) were also lower than those of their CAG neighbors, although ARS/AIMPs showed significant cancer-related alterations in their expression ( Figure 2 ). Nonetheless, several ARS/AIMPs showed relatively high frequencies of missense/in-frame indel mutations in several cancer types, such as endometrium and prostate cancers (average relative frequencies = 0.78 and 0.74, respectively). For example, a total of 209 missense/in-frame indel mutations of ARS/AIMPs were found in endometrium cancers. In particular, among these mutations, nearly 80 missense mutations of MARS, CARS, LARS and EPRS were found in the endometrium cancers. Also, the same type of mutations, including both missense and in-frame deletion mutations of KARS, was observed with a relative frequency of 0.74 in prostate cancers.

Figure 4.

Somatic mutations of ARS/AIMPs and their neighbors. Average numbers of somatic mutations in the four groups of the genes in 16 cancer types using somatic mutation data in CCLE: (i) ARS/AIMPs, (ii) and (iii) first and second neighbor CAGs and (iv) non-CAGs. The mutations were categorized into the following groups: (i) missense/in-frame InDels, (ii) frameshift, (iii) nonsense, (iv) gain of sequence, (v) complex, (vi) silent and (vii) unknown mutations. Color bar denotes the gradient of the number of each group of mutations.

Proteomic data of ARS/AIMPs and their interacting CAGs

Post-translational modifications (PTMs) of ARS/AIMPs and their interacting CAGs, such as phosphorylation and ubiquitination, can modulate the interactions of ARS/AIMPs with CAGs, thereby affecting non-canonical functions of ARS/AIMPs in disease-related cellular processes. For example, EPRS, KRS and AIMP2 are dissociated from MSCs by phosphorylation and regulate inflammation, metastasis and apoptosis, respectively ( 51–53 ). Thus, we collected PTMs of ARS/AIMPs and their interactors, as well as their protein abundances, in diverse types of cancers ( Figure 1 , bottom right). First, protein abundance data in 20 cancer types (brain, breast, cervical, colorectal, esophageal, gall bladder, gastric, head and neck, liver, lung, ovarian, pancreatic, prostate, renal, skin, testicular and thyroid cancers and leukemia, lymphoma and sarcoma) were obtained from dbDEPC ( 54 ). Second, PTM data of ARS/AIMPs and their interactors were obtained from the phosphorylation site database (PHOSIDA) and PhosphoSitePlus ( 55 , 56 ). A total of 375 phosphorylations for ARS/AIMPs were found in several cancer cells including cervical, breast and colorectal cancer and leukemia cells. Additionally, IDA includes 84 acetylations, 410 ubiquitinations, 4 methylations and 1 sumoylation for ARS/AIMPs.

Analytical tools of ARS/AIMPs and their interactors in IDA

Web-based search and exploration tools

The goal of IDA is to provide not only molecular and interaction data of ARS/AIMPs but also analytical tools that generate network models and network-driven hypotheses for understanding cancer-associated functions of ARS/AIMPs. IDA provides a search tool (‘Detailed search about ARS’ in Figure 5 , center) that allows users to explore the collected genomic, transcriptomic and proteomic data for ARS/AIMPs. For example, when the genomic data for DARS were searched, the search output showed the genomic, proteomic and interaction data of DARS in IDA. The genomic data showed the gains of copy numbers in soft tissue cancers, but losses of copy numbers in gastric, head and neck and soft tissue cancers ( Figure 5 , top right). The proteomic data showed the five types of PTMs for DARS (phosphorylation, acetylation, methylation, ubiquitination and sumoylation), as well as the PTM sites ( Figure 5 , middle right). For example, the phosphorylation data showed five serine (S112, S146, S238, S249 and S427), three tyrosine (Y24, Y239 and Y245) and one threonine (T52) phosphorylation sites. Further detailed information on the PTMs can be obtained through the links to the original PTM databases.

Figure 5.

Analytical tools in IDA. IDA provides analytical tools for the search of ARS/AIMPs and their interactors (right), reconstruction of ARS/AIMP network models (top left) and identification of key functional modules in the network (bottom left). These example networks in the left panel were generated using ‘Cancer-associated network’ (top) and ‘Functional modules’ (bottom), respectively, in ‘Analysis’ of ‘Network’ menu. KARS was entered as an input, and a network (top left) was generated by clicking ‘Submit’ button after selecting first and second neighbor CAGs. The main page shows the interface for the search of ARS/AIMPs and their interacting CAGs (center). The exploration panels (right) show the search outputs: (i) genomic data of KARS (gains and losses of CNVs), (ii) proteomic data of KARS (PTMs) and (iii) PPIs of KARS. Detailed information for PTMs and CAG interactors of ARS/AIMPs can be found through the links to the original databases. The network modeling and analysis tools (left) show the ARS/AIMP network model and key functional modules identified from the network model. Colored nodes represent ARS/AIMPs (dark orange) and their first neighbors (orange). Node sizes represent the average deregulation scores in the 10 cancer types (see Materials and Methods).

Moreover, another search tool (‘ARS-CAG interactome search’ in Figure 5 , center) allows users to explore the CAGs with which ARS/AIMPs interact. For example, the search output for KARS showed a list of CAGs interacting with KARS ( Figure 5 , bottom right). This list of interactors of KARS can be used to study their KRS-related functions and pathological implications such as cell migration (MAPK11/12/14) ( 52 , 57 ), immune response (MITF) ( 57 ) and amyotrophic lateral sclerosis (SOD1) ( 58 , 59 ), supporting the utility of the interactors of ARS/AIMPs. In addition, the first and second CAG neighbors of ARS/AIMPs can be explored using the links to the list of interacting CAGs or the search tool for the CAGs (‘Detailed search about CAG’ in Figure 5 , center). All the genomic and proteomic data of the CAGs can be explored, similar to those of ARS/AIMPs shown in Figure 5 . Finally, the interactome of a list of genes with the molecules in IDA can be obtained using ‘Cancer-associated network’ in ‘Analysis’ of the ‘Network’ menu after entering the symbols of the genes (‘Select All’ group option).

Cancer-associated network models for ARS/AIMPs

An important goal of the tools in IDA is to develop cancer-associated network models for ARS/AIMPs by integrating both transcriptomic and interaction data and to generate hypotheses for mechanisms of ARS/AIMPs in cancer-related pathophysiological processes based on the network models. To this end, IDA first provides the tools to develop cancer-associated network models for ARS/AIMPs (‘Cancer-associated network’ in ‘Analysis’ of ‘Network’ menu). The network model was generated using ARS/AIMPs and their first neighbor CAGs and then visualized using Cytoscape ( 60 ) ( Figure 5 , top left). The nodes (ARS/AIMPs and their first neighbor CAGs) in the network model can be organized into functional modules in each of which the nodes have the same gene ontology biological processes ( Figure 6 A; ‘Cancer-associated network of ARS/AIMPs and first neighbor CAGs’ in ‘Output’ of ‘Network’ menu). The nodes in the network model can have different degrees of contributions to cancer-related pathophysiological processes. To represent the different degrees of node contributions to the cancer-related processes, we estimated deregulation score indicating representative degree of deregulation in the 10 cancer types for which gene expression data were collected (Materials and Methods). The interactions (edges) between ARS/AIMPs and CAGs in the network model can have different degrees of associations with cancers. To represent the different degree of cancer associations for each pair of ARS/AIMP and its CAG interactor, we estimated the co-association score for the interacting pair that represents the extent of co-deregulation in the 10 cancer types. The network model can reveal a set of the linked nodes with high deregulation and co-association scores, which can be considered as a functional module that can significantly contribute to cancer-related pathophysiological processes represented in the network model.

Figure 6.

Evolutionarily conserved motifs in the ARS/AIMP network model. ( A ) ARS/AIMPs cancer network model showing 23 ARS/AIMPs and 123 first neighbor CAGs. Colored nodes represent ARS/AIMPs (dark orange) and their first neighbors (orange). Node sizes represent the average deregulation scores in the 10 cancer types (Materials and Methods). Nodes were grouped, such that the nodes involved in the same cellular process, according to gene ontology biological process annotations, belonged to a functional module denoted by the orange background. ( B ) Two key functional modules identified using the random walk-based method. ( C ) Evolutionarily conserved motifs in the ARS/AIMP network models constructed in human, fly, bacteria and yeast. The motifs conserved between human and the other three species were shown.

Key functional modules of ARS/AIMPs in the network model

To understand functions of ARS/AIMPs defined by their interactions with the CAGs, it is important to identify the sets of densely connected ARS/AIMP-CAG nodes with high deregulation and co-association scores, which are referred to as key functional modules of ARS/AIMPs. These modules should include at least one ARS/AIMP and its interactions with the CAGs to examine cancer-related functions of ARS/AIMPs. Thus, IDA provides a random walk-based method to effectively identify the key ARS/AIMP-CAG functional modules (‘Functional modules’ in ‘Analysis’ of ‘Network’ menu). This method performs random walks based on the co-association scores and then identifies a set of nodes, including at least one ARS/AIMP, with high residence probabilities after a certain period of time. For the network model in Figure 6 A, the random walk-based method generated two key functional ARS/AIMP-CAG modules ( Figure 6 B; ‘Functional modules for cancer-associated network of ARS/AIMPs and first neighbor CAGs’ in ‘Output’ of ‘Network’ menu). These modules (GARS and MARS subnetworks) show the links of the ARSs to (i) the NFKB pathway through their interactions with NFKB2, IKBKG/E and MCC ( 61 , 62 ) and (ii) transcriptional regulators (ELF3 and BRF2). This suggests potential roles of GARS and MARS in NFKB signaling and transcriptional regulation. Furthermore, these modules show the possible association of the ARSs with the immune response (HLA-B and TRAF6) and cell migration (CD44 and MCC).

Evolutionary analysis of the cancer-associated network for ARS/AIMPs

In addition to the random walk-based method described above, evolutionary analysis can be applied to the cancer-associated network model for ARS/AIMPs to identify core regulatory motifs of ARS/AIMPs, on the assumption that core regulatory motifs are conserved across species. To this end, we collected PPIs in four species, Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens (Supplementary Table S3), and then reconstructed ARS/AIMP network models using the first and second neighbors of ARS/AIMPs in each species. Next, we identified evolutionarily conserved network motifs recurring in the network models of the individual species (‘Evolutionarily conserved motifs’ in ‘Analysis’ of ‘Network’ menu). Among the identified motifs, we focused on those conserved in human to understand the roles of ARSs in human networks (‘Evolutionarily conserved motifs of ARS/AIMPs and first and second neighbor CAGs’ in ‘Output’ of ‘Network’ menu). The identified motifs were mostly conserved between two species: Yeast-Human (4 motifs), Fly-Human (4 motifs) and Bacteria-Human (2 motifs) (Figure 6C).

These evolutionarily conserved motifs suggest that the components in the motifs may form functional complexes. For instance, the motif consisting of MARS, AIMP1 and QARS was suggested to be conserved among yeast, fly and human. These three proteins are known components that form the MSC with other ARSs in human (63). In human, the structural or functional association of AIMP1 with QARS and MARS was experimentally demonstrated (4, 64). A primitive form of the complex, consisting of MARS, Arc1p (homolog of AIMP1) and EARS, was found in yeast (65), and the functional significance of this complex was studied in depth (66). Potential interactions among these three proteins were also suggested in fly (67), indicating that the functional implications of the fly complex can be inferred from the in-depth studies conducted in other organisms. Thus, conserved motif analysis can be useful for predicting how these ARS-containing macromolecular complexes have evolved in structure and function from unicellular to multicellular organisms.

The nodes in the conserved motifs are involved in a broad spectrum of cellular processes, including cell cycle and DNA repair, protein modification (e.g. neddylation) and localization, signal transduction and response to stress (e.g. NFKB pathway). These results indicate that ARSs may play roles in these processes. Interestingly, the link of ARS/AIMPs to the NFKB pathway was predicted by both the random walk-based method and the evolutionary analysis of ARS/AIMP networks. The many motifs conserved in higher organisms, as well as the broad spectrum of cellular processes associated with these motifs, are consistent with previous findings that these proteins can acquire evolutionarily new functions by the addition of new domains (68–70).
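The conserved-motif search described in this section can be sketched as follows. This is only an illustration under stated assumptions: motifs are restricted to fully connected triplets (triangles), proteins are compared across species through a precomputed ortholog-group table, and all protein names, group identifiers and function names are hypothetical. The motif definition and orthology mapping actually used by IDA are given in Materials and Methods and may differ.

```python
from itertools import combinations


def triangles(edges):
    """Return every fully connected triplet in an undirected PPI edge list."""
    adj = {}
    for u, v in edges:
        if u == v:
            continue  # ignore self-interactions
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    found = set()
    for u in adj:
        # Two neighbors of u that also interact with each other close a triangle.
        for v, w in combinations(sorted(adj[u]), 2):
            if w in adj[v]:
                found.add(frozenset((u, v, w)))
    return found


def ortholog_motifs(edges, orthologs):
    """Express each triangle as a set of ortholog-group IDs.

    Triangles with an unmapped protein, or with two members in the same
    ortholog group, are dropped.
    """
    motifs = set()
    for tri in triangles(edges):
        groups = frozenset(orthologs[p] for p in tri if p in orthologs)
        if len(groups) == 3:
            motifs.add(groups)
    return motifs


def conserved_with_reference(networks, orthologs, reference="human"):
    """For each non-reference species, return the motifs shared with the reference."""
    ref = ortholog_motifs(networks[reference], orthologs[reference])
    return {
        species: ref & ortholog_motifs(networks[species], orthologs[species])
        for species in networks
        if species != reference
    }


# Toy input: hypothetical proteins and ortholog groups (OG1..OG4).
networks = {
    "human": [("hA", "hB"), ("hB", "hC"), ("hA", "hC"), ("hC", "hD")],
    "yeast": [("yA", "yB"), ("yB", "yC"), ("yA", "yC")],
    "fly": [("fA", "fB"), ("fB", "fC")],
}
orthologs = {
    "human": {"hA": "OG1", "hB": "OG2", "hC": "OG3", "hD": "OG4"},
    "yeast": {"yA": "OG1", "yB": "OG2", "yC": "OG3"},
    "fly": {"fA": "OG1", "fB": "OG2", "fC": "OG3"},
}
conserved = conserved_with_reference(networks, orthologs)
```

Here the human triangle is recovered in the toy yeast network but not in the toy fly network, where one interaction is missing. Working at the level of ortholog groups rather than protein names is what allows networks from different species to be compared directly.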

Conclusions

Several studies have revealed diverse implications of ARS/AIMPs in human diseases. However, there is still a lack of data resources and tools that can be used to analyze the potential functional roles of ARS/AIMPs in human diseases. IDA was developed to provide a broad spectrum of genomic (somatic mutations and CNVs), transcriptomic (mRNA expression), proteomic (protein abundance and PTMs) and interactome (PPIs) data from 25 existing DBs (Figure 1) for ARS/AIMPs and their interactors. However, IDA is not simply an assembly of the information available in the 25 DBs with intact or federated links to the original data. The collected data were reprocessed, designed and integrated in such a way as to facilitate integrative analyses (analyses of cancer-related changes in gene and protein abundances and genomic variations, network analysis, and functional module and evolutionary motif analyses) and explorative analyses (navigation and visualization of the information) of cancer-related ARS data. When cancer-related alterations in abundances or genomic variations are discovered for ARS/AIMPs and/or their interactors, IDA can be used to evaluate whether the discovery has already been investigated further. Furthermore, IDA provides a battery of analytical tools for the integrative analyses, together with the outputs of those analyses. The outputs (e.g. network modules) can be used to generate hypotheses about the functions of ARS/AIMPs in cancers, or about the mechanisms underlying such functions, thereby providing new insights into cancer-related functions of ARS/AIMPs. Currently, IDA provides cancer-related data only and is thus limited to investigating the association of ARS/AIMPs with cancers. However, it will be expanded to include genomic, transcriptomic and proteomic data collected from metabolic and neurological diseases. Recently, a number of studies have shown important roles of ARS/AIMPs in the pathogenesis of diverse diseases. Therefore, IDA can serve as a comprehensive resource of data and tools for studies of ARS/AIMPs, facilitating the elucidation of their unknown functions in disease pathogenesis.

Funding

This work was supported by the Global Frontier Project funded by the Ministry of Science, ICT and Future Planning and the Basic Science Research Program funded by the Ministry of Education (NRF-M3A6A4-2010-0029785 and NRF-2013R1A1A2058353), and by grants from the Institute for Basic Science (CA1308), the Next-Generation BioGreen 21 Program (No. PJ009072) and the POSCO Research Fund (Project No. 2013Y008). Funding for open access charge: Global Frontier Project (NRF-M3A6A4-2010-0029785).

Supplementary Data

Supplementary data are available at Database Online.

Conflict of interest. None declared.

References

1. Kim, You, Hwang (2011) Aminoacyl-tRNA synthetases and tumorigenesis: more than housekeeping. Nat. Rev. Cancer, 708–718.
2. Park S.G., Ewalt K.L., Kim (2005) Functional expansion of aminoacyl-tRNA synthetases and their interacting factors: new perspectives on housekeepers. Trends Biochem. Sci., 569–574.
3. Park S.G., Schimmel, Kim (2008) Aminoacyl tRNA synthetases and their connections to disease. Proc. Natl Acad. Sci. USA, 105, 11043–11049.
4. Han J.M., Lee M.J., Park S.G., et al. (2006) Hierarchical network between the components of the multi-tRNA synthetase complex: implications for complex formation. J. Biol. Chem., 281, 38663–38667.
5. Park S.G., Choi E.C., Kim (2010) Aminoacyl-tRNA synthetase-interacting multifunctional proteins (AIMPs): a triad for cellular homeostasis. IUBMB Life, 296–302.
6. Yao, Fox P.L. (2013) Aminoacyl-tRNA synthetases in medicine and disease. EMBO Mol. Med., 332–343.
7. Sen, Zhou, Ripmaster, et al. (1997) Expression of a gene encoding a tRNA synthetase-like protein is enhanced in tumorigenic human myeloid leukemia cells and is cell cycle stage- and differentiation-dependent. Proc. Natl Acad. Sci. USA, 6164–6169.
8. Park M.C., Kang, Jin, et al. (2012) Secreted human glycyl-tRNA synthetase implicated in defense against ERK-activated tumorigenesis. Proc. Natl Acad. Sci. USA, 109, E640–E647.
9. Wellman T.L., Eckenstein, Wong, et al. (2014) Threonyl-tRNA synthetase overexpression correlates with angiogenic markers and progression of human ovarian cancer. BMC Cancer, 620.
10. Kim B.H., Jung W.Y., Lee, et al. (2014) Lysyl-tRNA synthetase (KRS) expression in gastric carcinoma and tumor-associated inflammation. Ann. Surg. Oncol., 2020–2027.
11. Park S.W., Kim S.S., Yoo N.J., et al. (2010) Frameshift mutation of MARS gene encoding an aminoacyl-tRNA synthetase in gastric and colorectal carcinomas with microsatellite instability. Gut Liver, 430–431.
12. Choi J.W., J.Y., Kundu J.K., et al. (2009) Multidirectional tumor-suppressive activity of AIMP2/p38 and the enhanced susceptibility of AIMP2 heterozygous mice to carcinogenesis. Carcinogenesis, 1638–1644.
13. Choi J.W., Kim D.G., Lee A.E., et al. (2011) Cancer-associated splicing variant of tumor suppressor AIMP2/p38: pathological implication in tumorigenesis. PLoS Genet., e1001351.
14. Park B.J., Y.S., Park S.Y., et al. (2006) AIMP3 haploinsufficiency disrupts oncogene-induced p53 activation and genomic stability. Cancer Res., 6913–6918.
15. Irizarry R.A., Gentleman, et al. (2004) A model-based background adjustment for oligonucleotide expression arrays. J. Am. Stat. Assoc., 909–917.
16. Bolstad B.M., Irizarry R.A., Astrand, et al. (2003) A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics, 185–193.
17. Lee H.J., Suk J.E., Patrick, et al. (2010) Direct transfer of alpha-synuclein from neuron to astroglia causes inflammatory responses in synucleinopathies. J. Biol. Chem., 285, 9262–9272.
18. Chae, Ahn B.Y., Byun, et al. (2013) A systems approach for decoding mitochondrial retrograde signaling pathways. Sci. Signal, rs4.
19. Hwang, Rust A.G., Ramsey, et al. (2005) A data integration methodology for systems biology. Proc. Natl Acad. Sci. USA, 102, 17296–17301.
20. Bowman A.W., Azzalini (1997) Applied Smoothing Techniques for Data Analysis: the Kernel Approach with S-Plus Illustrations. Oxford University Press, Oxford, New York.
21. Toronen, Ojala P.J., Marttinen, et al. (2009) Robust extraction of functional signals from gene set analysis using a generalized threshold free scoring function. BMC Bioinformatics, 307.
22. Kohler, Bauer, Horn, et al. (2008) Walking the interactome for prioritization of candidate disease genes. Am. J. Hum. Genet., 949–958.
23. Powell, Szklarczyk, Trachana, et al. (2012) eggNOG v3.0: orthologous groups covering 1133 organisms at 41 different taxonomic ranges. Nucleic Acids Res., D284–D289.
24. Peri, Navarro J.D., Kristiansen T.Z., et al. (2004) Human protein reference database as a discovery resource for proteomics. Nucleic Acids Res., D497–D501.
25. Ogata, Goto, Sato, et al. (1999) KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res.
26. Bader G.D., Betel, Hogue C.W. (2003) BIND: the biomolecular interaction network database. Nucleic Acids Res., 248–250.
27. Xenarios, Rice D.W., Salwinski, et al. (2000) DIP: the database of interacting proteins. Nucleic Acids Res., 289–291.
28. Stark, Breitkreutz B.J., Reguly, et al. (2006) BioGRID: a general repository for interaction datasets. Nucleic Acids Res., D535–D539.
29. Matthews, Gopinath, Gillespie, et al. (2009) Reactome knowledgebase of human biological pathways and processes. Nucleic Acids Res., D619–D622.
30. Chatr-aryamontri, Ceol, Palazzi L.M., et al. (2007) MINT: the Molecular INTeraction database. Nucleic Acids Res., D572–D574.
31. Kerrien, Alam-Faruque, Aranda, et al. (2007) IntAct–open source resource for molecular interaction data. Nucleic Acids Res., D561–D565.
32. Lynn D.J., Winsor G.L., Chan, et al. (2008) InnateDB: facilitating systems-level analyses of the mammalian innate immune response. Mol. Syst. Biol., 218.
33. Snel, Lehmann, Bork, et al. (2000) STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene. Nucleic Acids Res., 3442–3444.
34. Lee H.S., Bae, Lee J.H., et al. (2012) Rational drug repositioning guided by an integrated pharmacological network of protein, disease and drug. BMC Syst. Biol.
35. Brown K.R., Jurisica (2007) Unequal evolutionary conservation of human protein interactions in interologous networks. Genome Biol., R95.
36. Brown K.R., Jurisica (2005) Online predicted human interaction database. Bioinformatics, 2076–2082.
37. Rhodes D.R., Tomlins S.A., Varambally, et al. (2005) Probabilistic model of the human protein-protein interaction network. Nat. Biotechnol., 951–959.
38. McDowall M.D., Scott M.S., Barton G.J. (2009) PIPs: human protein-protein interaction prediction database. Nucleic Acids Res., D651–D656.
39. Aranda, Blankenburg, Kerrien, et al. (2011) PSICQUIC and PSISCORE: accessing and scoring molecular interactions. Nat. Methods, 528–529.
40. Cancer Genome Atlas (2012) Comprehensive molecular portraits of human breast tumours. Nature, 490.
41. Cancer Genome Atlas (2012) Comprehensive molecular characterization of human colon and rectal cancer. Nature, 487, 330–337.
42. Cancer Genome Atlas Research (2012) Comprehensive genomic characterization of squamous cell lung cancers. Nature, 489, 519–525.
43. Cancer Genome Atlas Research (2014) Comprehensive molecular characterization of urothelial bladder carcinoma. Nature, 507, 315–322.
44. Barrett, Troup D.B., Wilhite S.E., et al. (2009) NCBI GEO: archive for high-throughput functional genomic data. Nucleic Acids Res., D885–D890.
45. Parkinson, Kapushesky, Shojatalab, et al. (2007) Array Express–a public database of microarray experiments and gene expression profiles. Nucleic Acids Res., D747–D750.
46. Cancer Genome Atlas Research (2013) Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia. N. Engl. J. Med., 368, 2059–2074.
47. Scheinin, Myllykangas, Borze, et al. (2008) CanGEM: mining gene copy number changes in cancer. Nucleic Acids Res., D830–D835.
48. Beroukhim, Mermel C.H., Porter, et al. (2010) The landscape of somatic copy-number alteration across human cancers. Nature, 463, 899–905.
49. Barretina, Caponigro, Stransky, et al. (2012) The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature, 483, 603–607.
50. Forbes S.A., Bindal, Bamford, et al. (2011) COSMIC: mining complete cancer genomes in the catalogue of somatic mutations in cancer. Nucleic Acids Res., D945–D950.
51. Arif, Jia, Mukhopadhyay, et al. (2009) Two-site phosphorylation of EPRS coordinates multimodal regulation of noncanonical translational control activity. Mol. Cell, 164–180.
52. Kim D.G., Choi J.W., Lee J.Y., et al. (2012) Interaction of two translational components, lysyl-tRNA synthetase and p40/37LRP, in plasma membrane promotes laminin-dependent cell migration. FASEB J., 4142–4159.
53. Han J.M., Park B.J., Park S.G., et al. (2008) AIMP2/p38, the scaffold for the multi-tRNA synthetase complex, responds to genotoxic stresses via p53. Proc. Natl Acad. Sci. USA, 105, 11206–11211.
54. Zhang, et al. (2012) dbDEPC 2.0: updated database of differentially expressed proteins in human cancers. Nucleic Acids Res., D964–D971.
55. Gnad, Gunawardena, Mann (2011) PHOSIDA 2011: the posttranslational modification database. Nucleic Acids Res., D253–D260.
56. Hornbeck P.V., Chabra, Kornhauser J.M., et al. (2004) PhosphoSite: a bioinformatics resource dedicated to physiological protein phosphorylation. Proteomics, 1551–1561.
57. Park S.G., Kim H.J., Min Y.H., et al. (2005) Human lysyl-tRNA synthetase is secreted to trigger proinflammatory response. Proc. Natl Acad. Sci. USA, 102, 6356–6361.
58. Kunst C.B., Mezey, Brownstein M.J., et al. (1997) Mutations in SOD1 associated with amyotrophic lateral sclerosis cause novel protein interactions. Nat. Genet.
59. Kawamata, Magrane, Kunst, et al. (2008) Lysyl-tRNA synthetase is a target for mutant SOD1 toxicity in mitochondria. J. Biol. Chem., 283, 28321–28328.
60. Shannon, Markiel, Ozier, et al. (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res., 2498–2504.
61. Sigglekow N.D., Pangon, Brummer, et al. (2012) Mutated in colorectal cancer protein modulates the NFkappaB pathway. Anticancer Res.
62. Bouwmeester, Bauch, Ruffner, et al. (2004) A physical and functional map of the human TNF-alpha/NF-kappa B signal transduction pathway. Nat. Cell Biol., –105.
63. Kim J.H., Han J.M., Kim (2014) Protein-protein interactions and multi-component complexes of aminoacyl-tRNA synthetases. Top. Curr. Chem., 344, 119–144.
64. Kim, Jin K.S., et al. (2014) Structure of the ArgRS-GlnRS-AIMP1 complex and its implications for mammalian translation. Proc. Natl Acad. Sci. USA, 111, 15084–15089.
65. Koehler, Round, Simader, et al. (2013) Quaternary structure of the yeast Arc1p-aminoacyl-tRNA synthetase complex in solution and its compaction upon binding of tRNAs. Nucleic Acids Res., 667–676.
66. Frechin, Enkler, Tetaud, et al. (2014) Expression of nuclear and mitochondrial genes encoding ATP synthase is synchronized by disassembly of a multisynthetase complex. Mol. Cell, 763–776.
67. Guruharsha K.G., Rual J.F., Zhai, et al. (2011) A protein complex network of Drosophila melanogaster. Cell, 147, 690–703.
68. Koonin E.V., Aravind, Kondrashov A.S. (2000) The impact of comparative genomics on our understanding of evolution. Cell, 101, 573–576.
69. Lander E.S., Linton L.M., Birren, et al. (2001) Initial sequencing and analysis of the human genome. Nature, 409, 860–921.
70. Pawson, Nash (2003) Assembly of cell regulatory systems through protein interaction domains. Science, 300, 445–452.

Author notes

# These authors contributed equally to this work.

Citation details: Lee,J.-H., You,S., Hyeon,D.Y., et al. Comprehensive data resources and analytical tools for pathological association of aminoacyl tRNA synthetases with cancer. Database (2015) Vol. 2015: article ID bav022; doi:10.1093/database/bav022

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
