Abstract

Gram-negative bacteria utilize specialized secretion systems to deliver effectors into the cytoplasm of eukaryotic cells, where they manipulate host cellular functions to promote infection. These interactions play a crucial role in bacterial pathogenesis and pose significant public health challenges. Understanding of the protein–protein interactions (PPIs) between effectors and host proteins is essential for deciphering infection mechanisms and developing novel therapeutic strategies. However, existing databases on pathogen–host interactions often lack comprehensive coverage of secreted effectors and their host targets, limiting their utility in studying bacterial infections. Furthermore, current resources are often incomplete, poorly annotated, and lack predictive capabilities. To address these limitations, we have developed InTxDB, a comprehensive database dedicated to interactions involving bacterial type I-X secreted effectors and their host counterparts. InTxDB currently includes 1829 experimentally validated interaction pairs from 100 bacterial species, with detailed annotations on protein sequences, functions, subcellular localizations, secretion signals, local structural and functional properties, structures, and interaction sites. By filling a critical gap in existing databases, InTxDB provides a well-curated and functionally enriched resource to advance research on bacterial pathogenesis and host–pathogen interactions.

Introduction

Gram-negative bacteria utilize specialized secretion systems to transport proteins across their cell membranes, with at least 10 distinct systems identified [1–3]. These secretion systems can be classified into two main types. The first type, such as the type III secretion system (T3SS), enables the direct translocation of substrate proteins into eukaryotic host cells in a single step [4]. The second type, including the Sec and Tat pathways, initially transports substrate proteins to the periplasm, followed by a secondary mechanism that transfer across the outer membrane or into the host cell [5]. The translocated substrate proteins, known as effectors, play a critical role in bacterial infection and pathogenicity by manipulating host cellular processes to promote survival and proliferation [1, 5].

Once translocated into the host cytoplasm, bacterial effectors disrupt essential signalling pathways, leading to disease. For example, the T3SS effector YopJ, secreted by Yersinia pestis, inhibits the MAPK and NF-κB signalling pathways, suppressing the host inflammatory response and allowing immune detection [6]. The T4SS effector CagA, produced by Helicobacter pylori, binds to the SH2 domain of host proteins, activating SHP-2 phosphatase and inducing sustained ERK/MAPK signalling pathway, which promotes abnormal gastric epithelial cell proliferation and contributes to gastric cancer development [7]. Additionally, the T6SS effector VgrG, secreted by Pseudomonas aeruginosa, promotes inflammasome assembly and activates the NLRP3 signalling pathway, triggering the release of pro-inflammatory cytokines such as IL-1β, leading to a strong inflammatory response [8].

Over the past few decades, experimental techniques such as co-immunoprecipitation (Co-IP) and high-throughput methods like mass spectrometry (MS) and yeast two-hybrid (Y2H) [9–11] have significantly expanded our understanding of protein–protein interactions (PPIs) between gram-negative bacterial effectors and host proteins. These techniques curated in several host–pathogen protein interaction databases [12–14]. For example, PHISTO integrates host–pathogen PPIs and provides function annotations of pathogen-targeted human proteins [12]. STRING offers extensive PPI datasets with confidence score to assess interaction reliability [13]. IntAct provides detailed biological context and functional annotations for each interaction [14]. PHI-base specializes in pathogen–host interactions with documented phenotypic outcomes [15]. HoPaCI-DB focuses specifically on Pseudomonas and Coxiella effector systems [16], while DBSecSys focuses on Burkholderia effectors [17]. For secretion system effectors, dedicated databases exist including BastionHub for T1SE–T4SE and T6SE [18], T3DB for type III secretion systems [19], and SecretoMyc for mycobacterial effectors [20]. However, current databases collectively lack comprehensive coverage of all bacterial secretion system types (T1SE–T10SE), and they often lack atomic-level details, such as binding sites and structural information (Table 1).

Table 1.

Comparative analysis of major host–pathogen interaction and secretion system databases

DatabaseScope of interactionsMethodStructural dataSite informationSubcellular localizationInteraction networkPredicted interfaces
PHISTOPathogen–hostExperimental dataNoNoNoNoNo
HPIDBPathogen–hostExperimental dataNoNoNoNoNo
MINTPathogen–hostExperimental dataNoNoNoNoNo
PHI-basePathogen–hostExperimental dataNoNoNoNoNo
IntActPathogen–hostExperimental dataNoYesNoYesNo
HoPaCI-dbPseudomonas aeruginosa and Coxiella burnetiid –hostExperimental dataNoNoYesYesNo
BastionHubT1SE–T4SE, 6SEExperimental dataNoNoNoNoYes
T3DBT3SEExperimental dataNoNoNoNoYes
SecretoMycMycobacterial effectorsExperimental dataNoNoNoNoNo
DBSecSysBurkholderia mallei–host and Burkholderia pseudomalleil–hostExperimental data and predicted dataNoNoNoNoNo
DatabaseScope of interactionsMethodStructural dataSite informationSubcellular localizationInteraction networkPredicted interfaces
PHISTOPathogen–hostExperimental dataNoNoNoNoNo
HPIDBPathogen–hostExperimental dataNoNoNoNoNo
MINTPathogen–hostExperimental dataNoNoNoNoNo
PHI-basePathogen–hostExperimental dataNoNoNoNoNo
IntActPathogen–hostExperimental dataNoYesNoYesNo
HoPaCI-dbPseudomonas aeruginosa and Coxiella burnetiid –hostExperimental dataNoNoYesYesNo
BastionHubT1SE–T4SE, 6SEExperimental dataNoNoNoNoYes
T3DBT3SEExperimental dataNoNoNoNoYes
SecretoMycMycobacterial effectorsExperimental dataNoNoNoNoNo
DBSecSysBurkholderia mallei–host and Burkholderia pseudomalleil–hostExperimental data and predicted dataNoNoNoNoNo
Table 1.

Comparative analysis of major host–pathogen interaction and secretion system databases

DatabaseScope of interactionsMethodStructural dataSite informationSubcellular localizationInteraction networkPredicted interfaces
PHISTOPathogen–hostExperimental dataNoNoNoNoNo
HPIDBPathogen–hostExperimental dataNoNoNoNoNo
MINTPathogen–hostExperimental dataNoNoNoNoNo
PHI-basePathogen–hostExperimental dataNoNoNoNoNo
IntActPathogen–hostExperimental dataNoYesNoYesNo
HoPaCI-dbPseudomonas aeruginosa and Coxiella burnetiid –hostExperimental dataNoNoYesYesNo
BastionHubT1SE–T4SE, 6SEExperimental dataNoNoNoNoYes
T3DBT3SEExperimental dataNoNoNoNoYes
SecretoMycMycobacterial effectorsExperimental dataNoNoNoNoNo
DBSecSysBurkholderia mallei–host and Burkholderia pseudomalleil–hostExperimental data and predicted dataNoNoNoNoNo
DatabaseScope of interactionsMethodStructural dataSite informationSubcellular localizationInteraction networkPredicted interfaces
PHISTOPathogen–hostExperimental dataNoNoNoNoNo
HPIDBPathogen–hostExperimental dataNoNoNoNoNo
MINTPathogen–hostExperimental dataNoNoNoNoNo
PHI-basePathogen–hostExperimental dataNoNoNoNoNo
IntActPathogen–hostExperimental dataNoYesNoYesNo
HoPaCI-dbPseudomonas aeruginosa and Coxiella burnetiid –hostExperimental dataNoNoYesYesNo
BastionHubT1SE–T4SE, 6SEExperimental dataNoNoNoNoYes
T3DBT3SEExperimental dataNoNoNoNoYes
SecretoMycMycobacterial effectorsExperimental dataNoNoNoNoNo
DBSecSysBurkholderia mallei–host and Burkholderia pseudomalleil–hostExperimental data and predicted dataNoNoNoNoNo

To address this gap, we have developed InTxDB (http://bis.zju.edu.cn/InTxDB), a dedicated database for gram-negative bacteria type I-X secreted effectors and their host interactions. InTxDB currently contains 1829 experimentally validated interaction pairs from 100 bacterial species, with comprehensive annotations on protein sequences, functions, subcellular localizations, secretion signals, local structural and functional properties, structures, and interaction sites. By providing a well-curated and functionally enriched dataset, InTxDB serves as a valuable resource for studying bacterial pathogenesis and host–pathogen interactions, facilitating further research and therapeutic development.

Materials and methods

Collection of experimentally verified effector-human PPI data

We collected experimentally validated PPIs from three public databases: HPIDB [21], PHISTO [12], and IntAct [14], as well as recently published literature (Fig. 1). First, we obtained experimentally validated effectors from two public databases: SecReT4 [22] and TxSEdb (http://61.160.194.165:3080/TxSEdb/,unpublished data). Next, the gene names of these effectors and their corresponding bacterial species were used as search terms to retrieve relevant PPI-related literature. Candidate articles from the search results were manually screened and curated, leading to the final collection of experimentally verified PPIs.

The workflow of InTxDB. The construction of InTxDB involves three main stages: data collection, data processing, and web construction. Data collection integrates three components: TxSE-human PPIs from databases and literature, structural data from PDB and AlphaFold DB, and interaction networks derived from compiled PPIs and STRING. Data processing includes transmembrane helix, signal peptide prediction (TMHMM2.0, SignalP6.0) and analysis of protein’s local structural and functional properties(ESM2), protein docking simulations (ZDOCK, HDOCK, AlphaFold3), and interaction site analysis (PDBePISA). The web platform offers an interactive interface with advanced browsing, search, and computational analysis tools for predicting and studying bacterial effector–host interactions, serving as a comprehensive resource for researchers.
Figure 1.

The workflow of InTxDB. The construction of InTxDB involves three main stages: data collection, data processing, and web construction. Data collection integrates three components: TxSE-human PPIs from databases and literature, structural data from PDB and AlphaFold DB, and interaction networks derived from compiled PPIs and STRING. Data processing includes transmembrane helix, signal peptide prediction (TMHMM2.0, SignalP6.0) and analysis of protein’s local structural and functional properties(ESM2), protein docking simulations (ZDOCK, HDOCK, AlphaFold3), and interaction site analysis (PDBePISA). The web platform offers an interactive interface with advanced browsing, search, and computational analysis tools for predicting and studying bacterial effector–host interactions, serving as a comprehensive resource for researchers.

Three-dimensional complex structures of effector-human PPIs

The three-dimensional structures of bacterial effectors and human proteins were sourced from the Protein Data Bank (PDB) [23] databases and AlphaFold [24] databases (Fig. 1). Selection criteria mandated that the protein structures must contain full-length sequences. In cases where the required structures were unavailable in the PDB databases, they were obtained from the AlphaFold database.

The three-dimensional structures of effector-human PPIs were generated through docking simulations using three software tools: ZDOCK [25], HDOCK [26], and AlphaFold3 [27]. ZDOCK and HDOCK performed docking using PDB files, whereas AlphaFold3 utilized protein sequences. Protein sequences were retrieved from UniProt [28] and NCBI [29] databases. For proteins with sequences exceeding 1500 amino acids in length, only the first 1500 amino acids were selected for docking. This truncation strategy, while ensuring computational feasibility, inevitably introduces certain limitations: potential interaction sites located in the C-terminal regions of very large proteins may be omitted, leading to incomplete structural coverage and possibly underestimating biologically relevant binding events. As such, these results should be interpreted with caution, and future updates of our database will consider domain-based [30] or overlapping-fragment docking [31] strategies to mitigate this limitation.

Prediction of effector transmembrane helices, signalling peptides, and subcellular localization

Transmembrane helices were predicted using TMHMM 2.0 [32] software, and signal peptides along with their cleavage sites were predicted using SignalP 6.0 [33] software. Subcellular localization data were retrieved from the UniProt database. For proteins with unknown subcellular localization, predictions were generated using PSORTb v3.0 [34] and DeepLocPro [35].

Interaction interface analysis

Interfacing residues are predicted using PDBePISA [36]. Residue importance was analysed using PDBePISA, and the BURIEDSURFACEAREASCORE values were normalized by first subtracting the mean from each value and then dividing by the standard deviation.

Network construction

STRING (https://string-db.org/) [13] is a database that systematically collects and integrates PPIs, including physical interactions and functional associations. All interaction networks associated with the specified protein were downloaded from the STRING database by selecting the protein species and inputting the protein sequence or protein name. The user can freely set the confidence level to select the desired interaction network. The resulting interaction dataset was exported in a compatible TSV format for further organization and network integration.

Analysis of protein’s local structural and functional properties

We integrated the ESM2 model [37] to compute per-residue embeddings across each protein sequence. These embeddings were generated using the ESM2-large model [37], producing a 1280-dimensional vector for every amino acid position. The resulting values are visualized as an interactive plot directly on individual protein pages, where users can dynamically explore regions of high or low embedding magnitude corresponding to putative functional domains, conserved sites, or disordered regions.

Database construction

InTxDB utilizes Django as the backend web framework. A MySQL service is employed to store data tables, including the basic information of bacterial effectors and host PPI entries. Network visualization is interactively presented using Cytoscape.js [38] and ECharts.js [39] plugins. The Mol* (/'molstar/) [40], a modern web-based open-source tool, is used for the visualization and analysis of large-scale molecular data, particularly for displaying three-dimensional protein structures.

Results

Overall description of InTxDB

InTxDB is a comprehensive database dedicated to interactions between gram-negative bacterial type I-X secreted effectors (TxSEs) and their human host proteins. It currently contains 1829 experimentally validated interaction pairs from 100 bacterial species (Table 2). Each entry is extensively annotated with information on protein sequences, functions, subcellular localizations, secretion signals, local structural and functional properties, structural data, and interaction sites.

Table 2.

The number of PPIs and the number of bacteria effectors, bacterial species, and human proteins of each interaction

TypeNumber of bacterial effectorsNumber of human proteinNumber of bacterial speciesNumber of PPIs
T1SE11241640
T2SE4536
T3SE58767161228
T4SE5827624414
T5SE30372862
T6SE310110
T7SE11331138
T8SE4235
T9SE5638
T10SE311318
TypeNumber of bacterial effectorsNumber of human proteinNumber of bacterial speciesNumber of PPIs
T1SE11241640
T2SE4536
T3SE58767161228
T4SE5827624414
T5SE30372862
T6SE310110
T7SE11331138
T8SE4235
T9SE5638
T10SE311318
Table 2.

The number of PPIs and the number of bacteria effectors, bacterial species, and human proteins of each interaction

TypeNumber of bacterial effectorsNumber of human proteinNumber of bacterial speciesNumber of PPIs
T1SE11241640
T2SE4536
T3SE58767161228
T4SE5827624414
T5SE30372862
T6SE310110
T7SE11331138
T8SE4235
T9SE5638
T10SE311318
TypeNumber of bacterial effectorsNumber of human proteinNumber of bacterial speciesNumber of PPIs
T1SE11241640
T2SE4536
T3SE58767161228
T4SE5827624414
T5SE30372862
T6SE310110
T7SE11331138
T8SE4235
T9SE5638
T10SE311318

The database architecture consists of three main components: data collection, data processing, and web construction (Fig. 1). Data collection involves extracting experimentally validated PPIs from literature and databases such as IntAct [14], PHISTO [12], and HPIDB3.0 [21], while protein structures are sourced from RCSB PDB [23] and AlphaFold DB [24]. Data processing includes transmembrane helix and signal peptide prediction (TMHMM2.0 [32], SignalP6.0 [33]), analysis of protein’s local structural and functional properties (ESM2 [37]), protein docking simulations (ZDOCK [25], HDOCK [26], AlphaFold3 [27]), and interaction site analysis (PDBePISA [36]). Finally, in the web construction phase, users can explore interaction data through a searchable and interactive interface, with tools such as Diamond [41] and HPInet [42], alongside statistical insights into bacterial species, taxonomy, and interaction distributions.

By integrating high-confidence experimental data, structural predictions, and computational tools, InTxDB serves as a valuable resource for studying bacterial pathogenesis and host–pathogen interactions.

User interface of InTxDB

InTxDB provides a user-friendly interface for exploring effector proteins and their interactions through multiple search options. Figure 2 illustrates the home page, which features a quick search bar, allowing users to retrieve proteins using gene names, protein names, or protein IDs. Additionally, the home page presents statistical summaries of effectors and their interactions, along with a visual representation of the interaction network, offering an intuitive overview of the database’s content and connectivity.

InTxDB database home page and statistics of datasets.
Figure 2.

InTxDB database home page and statistics of datasets.

Browse and data exploration

InTxDB offers a user-friendly interface with an intuitive filtering system. Users can refine search results using filters on the left sidebar, including bacterial species, secretion system type, and protein length, enabling quick access to relevant information (Fig. 3A).

Two modules of InTxDB. (A) Browse module: Users can filter interaction data using options such as bacterial species, secretion system type, and protein length to quickly locate relevant information. (B) Detailed information page: Displays comprehensive details on selected interactions, including protein sequences, structures, interaction networks, and docking predictions, providing insights into bacterial effector–host interactions.
Figure 3.

Two modules of InTxDB. (A) Browse module: Users can filter interaction data using options such as bacterial species, secretion system type, and protein length to quickly locate relevant information. (B) Detailed information page: Displays comprehensive details on selected interactions, including protein sequences, structures, interaction networks, and docking predictions, providing insights into bacterial effector–host interactions.

Upon selecting an entry, users are directed to a detailed information page, organized into four main sections: Bacterial Protein, Human Protein, PPI (Protein–Protein Interaction), and Docking Details (Fig. 3B). These sections can be easily accessed via the sidebar navigation:

  • Bacterial Protein and Human Protein sections: Display fundamental details, including protein structures. The Bacterial Protein section provides additional annotations such as signal peptides and transmembrane helices.

  • PPI section: Presents the interaction network, offering insights into the relationships between effectors and human proteins, along with detailed information for each node.

  • Docking Details section: Showcases predicted docking generated by ZDOCK [25], HDOCK [26], and AlphaFold3 [27], along with interaction site analysis from PDBePISA [36] and an assessment of residue importance for both interacting proteins.

Species–specific interaction networks

InTxDB offers species-specific interaction networks, allowing users to explore bacterial effector–host interactions with enhanced filtering options. Users can refine results using the filter bar in the top left, selecting criteria such as species, interaction detection method, and effector type. Filtered entries can be toggled between ‘Interactions’ and ‘Interactors’, providing flexibility in data visualization. Each interaction or protein entry links to a detailed information page, enabling deeper insights into bacterial pathogenesis (Fig. 4).

Species of InTxDB. InTxDB provides species-specific interaction networks, allowing users to explore bacterial effector–host interactions across different bacterial species. Users can filter results by species, interaction detection method, and effector type, with options to toggle between ‘Interactions’ and ‘Interactors’ for a more detailed analysis of the dataset.
Figure 4.

Species of InTxDB. InTxDB provides species-specific interaction networks, allowing users to explore bacterial effector–host interactions across different bacterial species. Users can filter results by species, interaction detection method, and effector type, with options to toggle between ‘Interactions’ and ‘Interactors’ for a more detailed analysis of the dataset.

Tool

InTxDB provides two integrated data analysis tools to facilitate interaction prediction and sequence alignment. The Diamond [41] tool enables users to perform sequence alignment with customizable parameters and provides downloadable results (Fig. 5A). The HPInet Prediction tool [42] allows users to analyse interactions based on bacterial protein sequences, human protein sequences, or predefined PPI lists. The tool generates network structure visualizations and residue importance heatmaps, offering insights into key interaction sites (Fig. 5B). These tools enhance the functionality of InTxDB, supporting computational predictions and deeper analysis of bacterial effector–host interactions.

Tools of InTxDB. (A) Diamond tool: Provides sequence alignment capabilities with customizable parameters, allowing users to analyse and download alignment results. (B) HPInet Prediction tool: Predicts bacterial effector–host interactions based on sequence data or predefined PPI lists, generating network visualizations and residue importance heatmaps for deeper interaction analysis.
Figure 5.

Tools of InTxDB. (A) Diamond tool: Provides sequence alignment capabilities with customizable parameters, allowing users to analyse and download alignment results. (B) HPInet Prediction tool: Predicts bacterial effector–host interactions based on sequence data or predefined PPI lists, generating network visualizations and residue importance heatmaps for deeper interaction analysis.

Statistical insights

The Statistics section displays the distribution of bacterial species whose secretion effectors interact with host proteins, the classification of bacterial proteins, and the distribution of the four main subcellular localizations of human proteins that interact with effector proteins from different bacterial secretion systems. Additionally, functional enrichment analysis was performed on the human proteins, highlighting significantly overrepresented biological processes in Gene Ontology (GO), molecular pathways in KEGG, and regulatory processes in Reactome, thereby providing insights into their functional roles during pathogen–host interaction.

Application in biological research

InTxDB serves as a valuable resource for biological research, such as investigating how LtxA, a leukotoxin secreted by Aggregatibacter actinomycetemcomitans, contributes to disease. By searching LtxA in InTxDB, users can identify its interaction with Integrin beta-2, a key immune cell receptor. Through PPI network analysis, it can be inferred that LtxA disrupts immune cell function, supported by previous findings [43]. Additionally, docking results help predict potential interaction targets, which may aid in developing novel biological therapies for conditions such as leukaemia, AIDS, and autoimmune disorders [44–46].

Discussion

InTxDB provides a comprehensive resource for understanding the complex interactions between bacterial secretion effectors and host proteins, which are central to the pathogenesis of many bacterial infections. Unlike existing databases like IntAct [14] and PHISTO [12], which cover general host–pathogen interactions, InTxDB specifically focuses on bacterial secretion system effectors and their human targets. Additionally, it offers three-dimensional protein structures, docking models, and interaction site analyses, facilitating a deeper understanding of microbial virulence mechanisms and the identification of potential therapeutic targets.

Recent studies have emphasized the significance of bacterial effector proteins in manipulating host cellular processes to promote infection [47–50]. Among these, the type III secretion system (T3SS) effectors from Salmonella spp. play a pivotal role in hijacking host cell machinery to facilitate bacterial survival and replication [48–50]. For instance, the T3SS effector SopE2 reprograms macrophage metabolism, enabling Salmonella typhimurium to enhance intracellular replication and virulence [48]. Similar interactions have been studied for many other bacterial pathogens, demonstrating the widespread role of T3SS effectors in manipulating host cellular functions to promote infection [49]. InTxDB offers detailed data on these interactions, enabling researchers to explore not only the effectors involved but also their molecular mechanisms. The classification of bacterial proteins and their corresponding effector types can help researchers identify potential virulence factors and therapeutic targets [51, 52].

The HPInet prediction tool in InTxDB enhances the ability to predict PPI by leveraging known sequence data [42]. By integrating experimental data with predictive modelling, InTxDB enables users to make informed hypotheses about the roles of uncharacterized effectors or host proteins in bacterial infections.

Moreover, InTxDB provides subcellular localization data for human proteins interacting with bacterial effectors, which is crucial for understanding how these interactions influence cellular processes. Recent studies have demonstrated that the localization of host proteins plays a key role in bacterial infection strategies [53, 54]. For instance, the nuclear localization of the T4SS effector protein CBU0781 (AnkG) from Coxiella burnetii is essential for inhibiting apoptosis [54]. The anti-apoptotic activity of AnkG is regulated by p32-mediated intracellular transport, highlighting the importance of p32 localization in suppressing apoptosis during bacterial infection [54]. The subcellular localization statistics provided by InTxDB contribute to this growing body of knowledge. Understanding where these interactions occur within host cells is critical for designing therapeutic strategies that can block bacterial entry or disrupt bacterial manipulation of host cellular functions.

While the current version of InTxDB focuses exclusively on experimentally validated interactions to ensure high data reliability, we recognize the important potential of computational approaches to expand our understanding of host–pathogen interactions. In forthcoming updates, we plan to implement domain-based [30] or overlapping-fragment docking [31] to achieve more comprehensive structural coverage of these complexes. Additionally, we will introduce a dedicated module for high-confidence, computationally predicted interactions generated using state-of-the-art tools like AlphaFold3 [27] or AlphaFold-Multimer [55]. To further enhance biological relevance, predicted docking interfaces will be systematically analysed and mapped to known mutation sites, thereby enriching the interpretation of disease-associated variants. The database will continue to grow through the incorporation of newly published effector–host interaction datasets and expanded taxonomic coverage of gram-negative bacterial pathogens.

In conclusion, InTxDB serves as an essential resource for researchers in microbiology, immunology, and drug discovery. By providing access to high-quality datasets on bacterial effector–host protein interactions, along with powerful sequence alignment and predictive modelling tools, InTxDB accelerates research into bacterial pathogenesis and supports the discovery of novel therapeutic targets for combating bacterial infections.

Conclusions

InTxDB provides a comprehensive and user-friendly platform for exploring the interactions between bacterial secretion effectors and host proteins, facilitating a deeper understanding of host–pathogen interactions. The database offers a robust suite of tools, including sequence alignment and prediction features such as Diamond [41] and HPInet [42]. These tools empower researchers to easily access, query, and analyse complex interaction data, uncovering valuable insights into the molecular mechanisms underlying bacterial pathogenesis. The inclusion of key statistics on bacterial species distribution, protein classification, and subcellular localization of human proteins further enriches the database, offering a holistic view of bacterial effector–host protein interactions. InTxDB is a valuable resource for the scientific community, enabling both basic and applied research in microbial pathogenesis, drug discovery, and therapeutic intervention strategies.

Author contributions

M.C. and Yueming Hu conceived and supervised the project. Y.Z., Yueming Hu, L.L., and M.C. coordinated the project. Y.Z. contributed to dataset collection. Y.Z., Yueming Hu, and L.L. provided codes, models, and software tools. Y.Z. and L.L. developed the website. Y.Z., Yueming Hu, L.L., E.L., S.L., H.Y., S.Z., H.C., Q.F., Yanshi Hu, and M.C. wrote the first draft of this manuscript. Y.Z., Yueming Hu, L.L., and M.C. revised the manuscript accordingly.

Conflict of interest

None declared.

Funding

This work was supported by the National Key Research and Development Program of China [2023YFE0112300]; National Natural Sciences Foundation of China [32070677, 32270709, 32261133526]; 151 Talent Project, and Science and Technology Innovation Leader of Zhejiang Province [2022R52035]; and Jiangsu Collaborative Innovation Center for Modern Crop Production and Collaborative Innovation Center for Modern Crop Production co-sponsored by province and ministry.

Data availability

The online version of InTxDB is freely accessible at http://bis.zju.edu.cn/InTxDB.

References

1.

Costa
 
TR
,
Felisberto-Rodrigues
 
C
,
Meir
 
A
 et al.  
Secretion systems in gram-negative bacteria: structural and mechanistic insights
.
Nat Rev Micro
.
2015
;
13
:
343
59
.

2.

Paillat
 
M
,
Lunar Silva
 
I
,
Cascales
 
E
 et al.  
A journey with type IX secretion system effectors: selection, transport, processing and activities
.
Microbiology (Reading)
.
2023
;
169
:
001320
.

3.

Palmer
 
T
,
Finney
 
AJ
,
Saha
 
CK
 et al.  
A holin/peptidoglycan hydrolase-dependent protein secretion system
.
Mol Microbiol
.
2021
;
115
:
345
55
.

4.

Cornelis
 
G
.
The type III secretion injectisome
.
Nat Rev Micro
.
2006
;
4
:
811
25
.

5.

Green
 
ER
,
Mecsas
 
J
.
Bacterial secretion systems: an overview
.
Microbiol Spectr
.
2016
;
4
:
10.1128
.

6.

Mittal
 
R
,
Peak-Chew
 
SY
,
Sade
 
RS
 et al.  
The acetyltransferase activity of the bacterial toxin YopJ of Yersinia is activated by eukaryotic host cell inositol hexakisphosphate
.
J Biol Chem
.
2010
;
285
:
19927
34
.

7.

Hatakeyama
 
M
.
Helicobacter pylori CagA and gastric cancer: a paradigm for hit-and-run carcinogenesis
.
Cell Host Microbe
.
2014
;
15
:
306
16
.

8.

Sana
 
TG
,
Hachani
 
A
,
Bucior
 
I
 et al.  
The second type VI secretion system of Pseudomonas aeruginosa strain PAO1 is regulated by quorum sensing and Fur and modulates internalization in epithelial cells
.
J Biol Chem
.
2012
;
287
:
27095
105
.

9.

Tan
 
L
,
Yammani
 
RR
.
Co-immunoprecipitation-blotting: analysis of protein-protein interactions
.
Methods Mol Biol
.
2022
;
2413
:
145
54
.

10.

Richards
 
AL
,
Eckhardt
 
M
,
Krogan
 
NJ
.
Mass spectrometry-based protein-protein interaction networks for the study of human diseases
.
Mol Syst Biol
.
2021
;
17
:
e8792
.

11.

Duarte
 
CEM
,
Euclydes
 
NC
.
Protein-protein interaction via two-hybrid assay in yeast
.
Methods Mol Biol
.
2024
;
2724
:
193
210
.

12.

Durmuş Tekir
 
S
,
Çakır
 
T
,
Ardiç
 
E
 et al.  
PHISTO: pathogen-host interaction search tool
.
Bioinformatics
.
2013
;
29
:
1357
58
.

13.

Szklarczyk
 
D
,
Kirsch
 
R
,
Koutrouli
 
M
 et al.  
The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest
.
Nucleic Acids Res
.
2023
;
51
:
D638
46
.

14.

Orchard
 
S
,
Ammari
 
M
,
Aranda
 
B
 et al.  
The MIntAct project—IntAct as a common curation platform for 11 molecular interaction databases
.
Nucleic Acids Res
.
2014
;
42
:
D358
63
.

15.

Urban
 
M
,
Cuzick
 
A
,
Seager
 
J
 et al.  
PHI-base—the multi-species pathogen-host interaction database in 2025
.
Nucleic Acids Res
.
2025
;
53
:
D826
38
.

16.

Bleves
 
S
,
Dunger
 
I
,
Walter
 
MC
 et al.  
HoPaCI-DB: host-Pseudomonas and Coxiella interaction database
.
Nucleic Acids Res
.
2014
;
42
:
D671
6
.

17.

Memišević
 
V
,
Kumar
 
K
,
Zavaljevski
 
N
 et al.  
DBSecSys 2.0: a database of Burkholderia mallei and Burkholderia pseudomallei secretion systems
.
BMC Bioinf
.
2016
;
17
:
387
.

18.

Wang
 
J
,
Li
 
J
,
Hou
 
Y
 et al.  
BastionHub: a universal platform for integrating and analyzing substrates secreted by gram-negative bacteria
.
Nucleic Acids Res
.
2021
;
49
:
D651
59
.

19.

Wang
 
Y
,
Huang
 
H
,
Sun
 
M
 et al.  
T3DB: an integrated database for bacterial type III secretion system
.
BMC Bioinf
.
2012
;
13
:
66
.

20.

Gracy
 
J
,
Vallejos-Sanchez
 
K
,
Cohen-Gonsaud
 
M
.
SecretoMyc, a web-based database on mycobacteria secreted proteins and structure-based homology identification using bio-informatics tools
.
Tuberculosis (Edinb)
.
2023
;
141
:
102375
.

21.

Ammari
 
MG
,
Gresham
 
CR
,
McCarthy
 
FM
 et al.  
HPIDB 2.0: a curated database for host-pathogen interactions
.
Database (Oxford)
.
2016
;
2016
:
baw103
.

22.

Bi
 
D
,
Liu
 
L
,
Tai
 
C
 et al.  
SecReT4: a web-based bacterial type IV secretion system resource
.
Nucleic Acids Res
.
2013
;
41
:
D660
5
.

23.

Rose
 
PW
,
Prlić
 
A
,
Altunkaya
 
A
 et al.  
The RCSB protein data bank: integrative view of protein, gene and 3D structural information
.
Nucleic Acids Res
.
2017
;
45
:
D271
81
.

24.

Varadi
 
M
,
Anyango
 
S
,
Deshpande
 
M
 et al.  
AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models
.
Nucleic Acids Res
.
2022
;
50
:
D439
44
.

25.

Pierce
 
BG
,
Wiehe
 
K
,
Hwang
 
H
 et al.  
ZDOCK server: interactive docking prediction of protein-protein complexes and symmetric multimers
.
Bioinformatics
.
2014
;
30
:
1771
73
.

26.

Yan
 
Y
,
Zhang
 
D
,
Zhou
 
P
 et al.  
HDOCK: a web server for protein-protein and protein-DNA/RNA docking based on a hybrid strategy
.
Nucleic Acids Res
.
2017
;
45
:
W365
73
.

27.

Abramson
 
J
,
Adler
 
J
,
Dunger
 
J
 et al.  
Accurate structure prediction of biomolecular interactions with AlphaFold 3
.
Nature
.
2024
;
630
:
493
500
.

28.

UniProt Consortium
.
UniProt: a worldwide hub of protein knowledge
.
Nucleic Acids Res
.
2019
;
47
:
D506
15
.

29.

Sayers
 
EW
,
Bolton
 
EE
,
Brister
 
JR
 et al.  
Database resources of the national center for biotechnology information
.
Nucleic Acids Res
.
2022
;
50
:
D20
26
.

30.

Christoffer
 
C
,
Kihara
 
D
.
Domain-based protein docking with extremely large conformational changes
.
J Mol Biol
.
2022
;
434
:
167820
.

31.

Sarfaraz
 
S
,
Muneer
 
I
,
Liu
 
H
.
Combining fragment docking with graph theory to improve ligand docking for homology model structures
.
J Comput Aided Mol Des
.
2020
;
34
:
1237
59
.

32.

Krogh
 
A
,
Larsson
 
B
,
von Heijne
 
G
 et al.  
Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes
.
J Mol Biol
.
2001
;
305
:
567
80
.

33.

Nielsen
 
H
,
Teufel
 
F
,
Brunak
 
S
 et al.  
SignalP: the evolution of a web server
.
Methods Mol Biol
.
2024
;
2836
:
331
67
.

34.

Yu
 
NY
,
Wagner
 
JR
,
Laird
 
MR
 et al.  
PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes
.
Bioinformatics
.
2010
;
26
:
1608
15
.

35.

Moreno
 
J
,
Nielsen
 
H
,
Winther
 
O
 et al.  
Predicting the subcellular location of prokaryotic proteins with DeepLocPro
.
Bioinformatics
.
2024
;
40
:
btae677
.

36.

Krissinel
 
E
,
Henrick
 
K
.
Inference of macromolecular assemblies from crystalline state
.
J Mol Biol
.
2007
;
372
:
774
97
.

37.

Lin
 
Z
,
Akin
 
H
,
Rao
 
R
 et al.  
Evolutionary-scale prediction of atomic-level protein structure with a language model
.
Science
.
2023
;
379
:
1123
30
.

38.

Franz
 
M
,
Lopes
 
CT
,
Fong
 
D
 et al.  
Cytoscape.js 2023 update: a graph theory library for visualization and analysis
.
Bioinformatics
.
2023
;
39
:
btad031
.

39.

Li
 
D
,
Mei
 
H
,
Shen
 
Y
 et al.  
ECharts: a declarative framework for rapid construction of web-based visualization
.
Vis Inform
.
2018
;
2
:
136
46
.

40.

Sehnal
 
D
,
Bittrich
 
S
,
Deshpande
 
M
 et al.  
Mol* Viewer: modern web app for 3D visualization and analysis of large biomolecular structures
.
Nucleic Acids Res
.
2021
;
49
:
W431
37
.

41.

Buchfink
 
B
,
Reuter
 
K
,
Drost
 
HG
.
Sensitive protein alignments at tree-of-life scale using DIAMOND
.
Nat Methods
.
2021
;
18
:
366
68
.

42.

Hu
 
Y
,
Liu
 
L
,
Zhu
 
Y
 et al.  
HPInet: interpretable prediction of host-pathogen protein-protein interactions using a transformer-based neural network
.
bioRxiv
 
2025
.
https://doi.org/2025.08.02.668281

43.

Kelk
 
P
,
Johansson AClaesson
 
R
,
Hänström
 
L
 et al.  
Caspase 1 involvement in human monocyte lysis induced by Actinobacillus actinomycetemcomitans leukotoxin
.
Infect Immun
.
2003
;
71
:
4448
55
.

44.

Kachlany
 
SC
,
Schwartz
 
AB
,
Balashova
 
NV
 et al.  
Anti-leukemia activity of a bacterial toxin with natural specificity for LFA-1 on white blood cells
.
Leuk Res
.
2010
;
34
:
777
85
.

45.

Hioe
 
CE
,
Tuen
 
M
,
Vasiliver-Shamis
 
G
 et al.  
HIV envelope gp120 activates LFA-1 on CD4 T-lymphocytes and increases cell susceptibility to LFA-1-targeting leukotoxin (LtxA)
.
PLoS One
.
2011
;
6
:
e23202
.

46.

Orenz
 
HM
,
Grunke
 
M
,
Hieronymus
 
T
., et al.  
In vitro apoptosis and expression of apoptosis-related molecules in lymphocytes from patients with systemic lupus erythematosus and other autoimmune diseases
.
Arthritis Rheum
.
1997
;
40
:
306
17
.

47.

Hatakeyama
 
M
.
Oncogenic mechanisms of the Helicobacter pylori CagA protein
.
Nat Rev Cancer
.
2004
;
4
:
688
94
.

48.

Jiang
 
L
,
Wang
 
P
,
Song
 
X
 et al.  
Salmonella typhimurium reprograms macrophage metabolism via T3SS effector SopE2 to promote intracellular replication and virulence
.
Nat Commun
.
2021
;
12
:
879
.

49.

Swietnicki
 
W
.
Secretory system components as potential prophylactic targets for bacterial pathogens
.
Biomolecules
.
2021
;
11
:
892
.

50.

Navarro
 
L
,
Koller
 
A
,
Nordfelth
 
R
 et al.  
Identification of a molecular target for the Yersinia protein kinase A
.
Mol Cell
.
2007
;
26
:
465
77
.

51.

Sanya
 
DRA
,
Onésime
 
D
,
Vizzarro
 
G
 et al.  
Recent advances in therapeutic targets identification and development of treatment strategies towards P seudomonas aeruginosa infections
.
BMC Microbiol
.
2023
;
23
:
86
.

52.

Qin
 
S
,
Xiao
 
W
,
Zhou
 
C
 et al.  
Pseudomonas aeruginosa: pathogenesis, virulence factors, antibiotic resistance, interaction with host, technology advances and emerging therapeutics
.
Signal Transduct Target Ther
.
2022
;
7
:
199
.

53.

Hicks
 
SW
,
Galán
 
JE
.
Exploitation of eukaryotic subcellular targeting mechanisms by bacterial effectors
.
Nat Rev Micro
.
2013
;
11
:
316
26
.

54.

Eckart
 
RA
,
Bisle
 
S
,
Schulze-Luehrmann
 
J
 et al.  
Antiapoptotic activity of Coxiella burnetii effector protein AnkG is controlled by p32-dependent trafficking
.
Infect Immun
.
2014
;
82
:
2763
71
.

55.

Evans
 
R
,
O’Neill
 
M
,
Pritzel
 
A
 et al.  
Protein complex prediction with AlphaFold-Multimer
.
bioRxiv
.
2021
.

Author notes

Y.Z., L.L., and Y.H. contributed equally to this study.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.