BEAN 2.0: an integrated web resource for the identification and functional analysis of type III secreted effectors

Author Notes

Abstract

Gram-negative pathogenic bacteria inject type III secreted effectors (T3SEs) into host cells to sabotage their immune signaling networks. Because T3SEs constitute a meeting-point of pathogen virulence and host defense, they are of keen interest to host–pathogen interaction research community. To accelerate the identification and functional understanding of T3SEs, we present BEAN 2.0 as an integrated web resource to predict, analyse and store T3SEs. BEAN 2.0 includes three major components. First, it provides an accurate T3SE predictor based on a hybrid approach. Using independent testing data, we show that BEAN 2.0 achieves a sensitivity of 86.05% and a specificity of 100%. Second, it integrates a set of online sequence analysis tools. Users can further perform functional analysis of putative T3SEs in a seamless way, such as subcellular location prediction, functional domain scan and disorder region annotation. Third, it compiles a database covering 1215 experimentally verified T3SEs and constructs two T3SE-related networks that can be used to explore the relationships among T3SEs. Taken together, by presenting a one-stop T3SE bioinformatics resource, we hope BEAN 2.0 can promote comprehensive understanding of the function and evolution of T3SEs.

Database URL:http://systbio.cau.edu.cn/bean/

Introduction

T3SEs are proteins secreted by Gram-negative pathogenic bacteria to interfere with host immune signaling networks ( 1 , 2 ). They are secreted into host cells through type-III secretion systems (T3SSs) ( 1 ), which are encoded by animal and plant pathogenic bacteria, such as Salmonella typhi , Escherichia coli O157:H7, Yersina enterocolitica , Pseudomonas syringae pv tomato DC3000 and Ralstonia solanacearum . By blocking the immune signaling pathways at specific subcellular locations, these cytotoxic proteins are thought to assist pathogenic bacteria in evading the attacks from immune systems ( 3 , 4 ). T3SEs have evolved diverse functional domains ( 5 ) to mimic the functions of host cell proteins or covalently modify them ( 2 , 4 ). However, the biochemical mechanism used by T3SEs can be very different from their counterparts in host cells. For example, Shigella T3SE OspF can irreversibly remove phosphate group from phosphothreonine residue in mitogen-activated protein kinases Erk1/2 or p38 by converting threonine into dehydrobutyrine ( 6 ). This strategy has not been found in enzymes from eukaryotic cells. Moreover, the evolution of T3SEs seems also unusual. A previous study ( 7 ) showed that pathogenic bacteria can generate new T3SEs through terminal reassortment of existing T3SE sequences, implying complex evolutionary relationships among T3SEs. Recently, it has been realized that protein intrinsic disorder regions, flexible segments without fixed 3D structure, are evolutionary hallmarks of T3SEs ( 8 ).

The unique functions of T3SEs make them not only the powerful weapons of pathogens but also useful probes for researchers to investigate mechanisms of host immunity. Systematical characterization of the repertoires of T3SEs in pathogenic bacteria is helpful to identify the main virulence strategy commonly adopted by different pathogenic bacteria as well as the evolutionary relationships among different T3SEs ( 9–11 ). With the rapid development of high-throughput sequencing technologies, more and more genomes of pathogenic bacteria have been fully sequenced ( 11 ). There is an unprecedented requirement for bioinformatics tools/resources that can accurately identify and conveniently analyze T3SEs from these genomic data. To this end, a few state-of-the-art bioinformatics methods have been developed to predict T3SEs ( 12–19 ). Meanwhile, there also exist several excellent T3SE databases (e.g. T3SEdb ( 20 ), Effective ( 21 ) and T3DB ( 22 )), although they are mainly designed to store/predict T3SE sequences and provide limited analysis tools to further annotate T3SEs. Moreover, the relationships among different T3SEs are hardly explored by them. With the accumulation of more T3SE data, we anticipate the development of more comprehensive T3SE web resources is still highly required.

We previously developed a machine-learning predictor BEAN (Bacterial Effector ANalyzer) to identify T3SEs from pathogen genomes. In this predictor, the compositions of evolutionarily conserved amino acid (AA) pairs ( 23 ) were used to represent N-terminal secretion signals in T3SEs ( 17 ). Since BEAN was released in 2013, its web server has predicted >35 000 protein sequences submitted by users from ∼30 countries. Despite BEAN having shown good performance, there is still room for improvement. Indeed, some useful information was overlooked in the original version of BEAN. First, traditional sequence alignment-based search usually gives a reliable prediction if the query protein is very similar to a known T3SE. Second, the unique functional domains harboring on T3SEs can also be useful to discriminate T3SEs and non-T3SEs. Third, although the type III secretion signal ( 1 ) is believed to reside within the N-terminal of T3SEs in most cases, C-terminal is also required for the secretion of some T3SEs. For example, the C-terminal region (residues from 321 to 409) of Salmonella T3SE SipC is essential for its translocation into HeLa cells ( 24 ). The six residues (519–524) of C-terminal is required for efficient secretion of T3SE Tir in E. coli (EHEC O157:H7) ( 25 ). Other cases include Salmonella T3SEs SifA ( 26 ) and SipB ( 27 ).

Here, we developed BEAN 2.0 as an integrative web resource of T3SEs ( Figure 1 ). In addition to integrating the above information to improve the accuracy of T3SE prediction, BEAN 2.0 also provided multiple functional analysis tools to assist users in annotating putative T3SEs conveniently. Moreover, BEAN 2.0 compiled 1215 verified T3SEs from 221 pathogenic bacteria into a database and it also provided two networks that can be interactively visualized to explore the relationships among different T3SEs. Through providing a one-stop bioinformatics service, we hope BEAN 2.0 can accelerate the identification and analysis of new T3SEs.

Figure 1.

Overview of the resources in BEAN 2.0.

Open in new tab Download slide

Materials and Methods

Data collection

We collected 1202 T3SEs from the Uniprot database (version of 2014.01). CD-hit ( http://weizhong-lab.ucsd.edu/cd-hit/ ) with the sequence identity cutoff of 40% was used to remove similar sequences. As a result, 249 T3SEs were retained. Among them, six T3SEs have sequence lengths <100, which were further skipped. To obtain a 1:2 ratio of positive to negative samples, 486 negative samples were randomly selected from the non-T3SE dataset used in our previous work ( 17 ), which was compiled from eight well-studied Gram-negative bacterial proteomes by several criteria. The pairwise sequence identity among negative samples was also controlled as ≤40% using CD-hit. Finally, we obtained 243 T3SEs and 486 non-T3SEs, which constitute the major benchmark dataset in this work. The partition of the benchmark dataset across 5-fold cross-validation test, the independent test and the genome-wide test, will be detailed in the following sections.

Overall workflow of T3SE prediction in BEAN 2.0

Using a similar strategy as described by Kumar et al. ( 28 ) in mitochondrial protein prediction, BEAN 2.0 consists of three components: sequence alignment-based predictor, domain-based predictor and machine-learning predictor. The compiled non-redundant benchmark dataset is used to derive this system.

A query protein sequence will firstly be processed by the sequence alignment-based predictor. It tries to search for the most similar sequence of the query one in the training data using BLAST. If a highly similar sequence is found, the corresponding label (T3SE or non-T3SE) will be assigned to the query protein. If no similar sequence is detected with threshold E-value ≤0.01, the query protein will be switched to the domain-based predictor. The domain-based predictor scans the query protein sequence (E-value ≤1e-5) and compares its Pfam domains with the domains compiled from the training dataset. First, we collected the Pfam domains of the training dataset. Then we divided the domains into three types: (i) exclusively T3SE domains only observed in T3SEs; (ii) exclusively non-T3SE domains only observed in non-T3SEs and (iii) shared domains observed in both of T3SE and non-T3SE. If the query protein harbors T3SE-exclusive domains (non-T3SE-exclusive domains), it will be predicted as a T3SE (non-T3SE); otherwise, it will be further processed by the machine-learning predictor. The parameters used in BLAST and Pfam scan were preliminarily optimized according to the suggestions in Kumar et al. ’s work ( 28 ).

In the machine-learning predictor, the homologs of the query protein are searched firstly through HHblits ( 29 ) for constructing a sequence profile. Then, the resulting profile will be divided into three parts: N-terminal 2–51 AAs, 52–121 AAs and C-terminal 50 AAs. Since the first N-terminal AA is methionine in most bacterial protein sequences, it is ignored in this step. A 1600-dimension feature vector is extracted using the profile-based k -spaced amino acid pair composition (HH-CKSAAP) ( 17 ) to represent each of the three parts. The only exception is when the length of the middle part is <70 AAs, a 1600-dimension zero vector is used to encode this part. Then, the 4800-dimension feature vector is input into a linear Support Vector Machine model to predict the label of the query protein. The linear SVM model was also learned from the training dataset and the corresponding SVM parameters were the same as our previous work ( 17 ).

Evaluation the performance of BEAN 2.0

To assess the performance improvement of the hybrid strategy, we conducted a 5-fold cross-validation test on five models, including BEAN, BEAN*, BEAN*+BLAST, BEAN*+Pfam and BEAN 2.0. Note that BEAN stands for the original BEAN method, BEAN* represents the model trained by 4800-dimension vectors, BEAN*+BLAST means the combination of BEAN* and sequence alignment-based predictor, and BEAN*+Pfam indicates the combination of BEAN* and domain-based predictor. We randomly divided the benchmark dataset into a training dataset containing 200 T3SEs and 400 non-T3SEs and an independent test set containing 43 T3SEs and 86 non-T3SEs. The 5-fold cross-validation test on the training dataset was carried out. We further compared BEAN 2.0 with four existing T3SE predictors through the independent test. We trained a BEAN 2.0 model based on the whole training dataset and used it to predict the independent dataset. The standalone programs of EffectiveT3 ( 13 ), BPBAac ( 16 ) and T3_MM ( 18 ) were downloaded from their websites to predict the independent test dataset. Since the standalone version of ANN ( 14 ) is not available, we used the web server directly to do the prediction.

We collected the T3SEs on two independent genome datasets from a plant pathogen Pseudomonas syringae pv. phaseolicola 1448a ( P. syringae ) ( 29 ) and an animal pathogen E. coli O157:H7 ( E. coli ) ( 30 ). We chose these two pathogens because most of T3SEs in their genomes have been systematically screened. To ensure fair comparison, we removed the T3SEs of the query species from our training set. We collected 23 and 48 known T3SEs in 5045 proteins of P. syringae and 5255 proteins of E.coli . After removing the protein sequences containing < 100 residues, we obtained 22 and 45 known T3SEs in 4626 proteins of P. syringae and 4475 proteins of E.coli . The four existing T3SE predictors were also used to screen T3SEs from these two genomes.

Performance assessment parameters

In the 5-fold cross-validation test or the independent test, we used five parameters [sensitivity, specificity, accuracy, F1-score and Matthew correlation coefficient] to evaluate the performance. These parameters are defined as:

\begin{matrix} specificity = \frac{TN}{TN+FP} \\ sensitivity = \frac{T P}{T P + F N} \\ accuracy = \frac{T P + T N}{T P + F P + T N + F N} \\ F 1 - score = \frac{2 \times T P}{2 \times T P + F P + F N} \\ M C C = \frac{T P \times T N - F N \times F P}{\sqrt{(T P + F N) \times (T N + F P) \times (T P + F P) \times (T N + F N)}} \end{matrix}

where TP, FP, TN and FN stand for the number of true positives, false positives, true negatives and false negatives.

With respect to genome-wide tests, we mainly used the median of the known T3SEs’ ranks to compare different predictors. We also listed the ranks of known T3SEs in the whole genomes of these two species. For BEAN 2.0, we prioritized all positive prediction results in the whole genome as: sequence alignment-based prediction > domain-based prediction > machine-learning prediction. The prediction results from the same method were further sorted according to E-values (sequence alignment-based prediction, and domain-based prediction) or prediction scores (machine-learning prediction). For the other four existing predictors, their output scores were used to rank prediction results.

Construction of BEAN 2.0 web server

The core algorithm was developed by PERL program and the construction of web server was based on LAMP (Linux+Apache+MySQL+PHP), an open-source software frequently used to build high-availability websites. The prediction model used in the website was trained with the whole benchmark dataset (i.e. all 243 T3SEs and 486 non-T3SEs). The 1202 T3SEs originally collected for constructing the BEAN 2.0 model and the other 13 T3SEs newly collected during the development of the web server were included in our T3SE database.

Results and Discussion

Accurate prediction of T3SEs using BEAN 2.0

To test the usefulness of the hybrid approach ( Supplementary Figure S1 ), we compared the performance of our original BEAN and the prediction models integrating with sequence alignment, domain information or the sequence composition beyond N-terminal. We found these three types of information can stably improve the sensitivity of prediction. Further test showed that the performance improvement of the hybrid approach should not be attributed to the similarity of N-terminal sequences in our training data ( Supplementary Table S1 ).Through the 5-fold cross-validation test on the training dataset, BEAN 2.0 achieved a sensitivity of 92.00% and a specificity of 97.00% in comparison to the 80.00% and 96.75% of our original BEAN model ( Supplementary Figures S2 and Supplementary Data ). We also demonstrated there is no significant overfitting caused by the high feature dimensionality in BEAN 2.0 ( Supplementary Figure S4 and Table S2 ). Although many existing predictors only use the N-terminal signal sequence, our results indicate that from the practical point of view the other sequence regions can also facilitate the identification of T3SEs.

The performance of BEAN 2.0 was compared with five existing methods through the independent dataset ( Figure 2 ). As shown in Figure 2 , BEAN 2.0 achieved the best accuracy value of 95.35%, which is 13.18%, 7.75%, 10.85%, 12.40% and 3.88% higher than EffectiveT3 ( 13 ), BPBAac ( 16 ), ANN ( 14 ), T3_MM ( 18 ) and BEAN. The good performance of BEAN 2.0 was also confirmed in other five different independent dataset selection scenarios ( Supplementary Table S3 ). Generally, BEAN 2.0 reveals a very stable improvement in specificity and sensitivity.

Figure 2.

The performance of different T3SE predictors on the independent dataset.

Open in new tab Download slide

Moreover, we tested these methods on two independent genome datasets from a plant pathogen P. syringae and an animal pathogen E. coli . For 22 and 45 known T3SEs in 4626 proteins of P. syringae and 4475 proteins of E. coli (>100 AAs), BEAN 2.0 successfully predicted 21 and 33 of them in the top 50 ranked candidates according to their scores. The median of known T3SEs’ ranks in BEAN 2.0 results is 18.5 and 35 for P. syringae and E. coli , respectively. Regarding the other methods, T3_MM gave the second best result with a median rank of 37.5 in P. syringae , and BEAN gave the second best result with a median rank of 132 in E. coli . The obvious advantages shown in the genome-wide T3SE predictions indicate that BEAN 2.0 has a practical applicability in T3SE screening (Detailed comparison results are provided in Supplementary Tables S4–S6 ).

The major performance improvement of BEAN2.0, especially for the increased sensitivity, can be ascribed to the following two aspects. First, BEAN2.0 takes the sequence composition of the middle and C-terminal into account with the purpose of incorporating more possible type-III secretion-related information, such as chaperone-binding sites and potential C-terminal secretion signal. The information may increase the chance of detecting more T3SEs. Second, although the overall performance of BEAN is better than BLAST or Pfam, it has also some drawbacks. BEAN tries to model the sequence composition of all the training data and learns the most stable characteristics that can be used to distinguish T3SEs and non-T3SEs. This process strengthens the common distinctive characteristics, but it also weakens the ones that can only be observed in a handful T3SEs. As sequence alignment-based methods, on the contrary, BLAST and Pfam can sensitively detect some unique sequence patterns shared by a few T3SE proteins. Due to the methodological complementary between BEAN and BLAST/Pfam, the integration of them can result in a substantial performance improvement.

An integrative T3SE web resource

An integrative analysis platform is valuable to further investigate potential functions of bacterial secretion proteins ( 31 ). Therefore, in addition to T3SE prediction, BEAN 2.0 also provides other three types bioinformatics resources to facilitate T3SE function research, including a sequence annotation suite, a curated T3SE database and two functional relationship networks constructed from the known T3SEs.

BEAN 2.0 uses sequences in FASTA format to predict T3SE candidates. Users can submit 200 sequences in one job at most. The job name is required and the email address is a voluntary choice. If the email address is provided, BEAN 2.0 will send an email containing the URL of the prediction result when the job is finished. Generally, it takes ∼3 min to conduct T3SE prediction for a query sequence. For genome-wide T3SE prediction, a command-line version can be downloaded and deployed on users’ local machine and run in the multi-threading mode.

The prediction results can be directly transferred to sequence annotation suite for subcellular location prediction, domain annotation or long disorder region detection. The default subcellular location predictor is the widely used TargetP ( 32 ). But considering only three different location information (mitochondrion, chloroplast and secretion proteins) can be given in TargetP, we also provide an alternative choice using Cell-PLoc package ( 33 ) whose prediction covers up to 22 subcellular locations. For domain annotation and protein disorder region analysis, Pfam ( 34 ) and IUPred ( 35 ) were used. The users are also allowed to submit new sequences in FASTA format through ‘Analysis’ button. Sequence annotation results will be shown in interactive figures and tables. All of the results are allowed for downloading. Registered users can keep their query sequences confidential and manage their jobs. Although we encourage users to register with BEAN 2.0, the web server also allows anonymous use.

T3SE databases and T3SE-related networks

Our database includes 1215 curated T3SEs from 221 pathogenic bacteria ( Figure 3 A and B), displaying T3SE name, source organism, sequence length, experimental status, Pfam domain and subcellular locations in host. Users can search them through an interactive Javascript table. To facilitate the database update, we also encourage users to contribute newly discovered T3SEs to our database through ‘Contribution’ dialog box.

Figure 3.

The composition of T3SE database in BEAN 2.0. ( A ) The proportion of T3SEs from animal and plant bacteria. ( B ) The functional categories of T3SE domains. ( C ) The T3SE network (i.e. Effector-Net). Two T3SEs are connected if they share a common Pfam domain. T3SEs from plant bacteria are colored as green, while T3SEs from animal bacteria are colored as purple. ( D ) T3SEs from both plant and animal bacteria are connected through YopJ domain (Pfam ID: PF03421).

Open in new tab Download slide

In addition, BEAN 2.0 provides the T3SE-specific domain information extracted from known T3SE. The information is shown in a form, including the exclusively T3SE Pfam domains and its possible function.

Inspired by a previous study that used protein homology networks to visualize the evolution of T3SSs ( 36 ), we constructed two interactive networks (Effector-Net and Domain-Net) to visualize the relationships among T3SEs and T3SE domains, respectively ( Figure 3 C and Supplementary Figures S5 and Supplementary Data ). In Effector-Net, each T3SE is represented as a node and two T3SEs are connected if they share a common domain. The nodes marked in green (purple) represent that they are secreted by plant pathogens (animal pathogens). The current Effector-Net covers 261 non-redundant T3SEs and 832 edges. These 261 T3SEs consist of 52 connected subnetworks. In Domain-Net, each Pfam domain harboring on known T3SEs is represented as a node and two domains are connected if they concurrence on a T3SE. The current Domain-Net included 74 different Pfam domains and 59 edges among them. Most domains do not connect with any other, indicating that they are exclusively observed in only one non-redundant T3SE. The most highly connected node in Domain-Net corresponds to a putative Pfam-B domain (Pfam ID: PB005666), which connects with six domains including TTSSLRR, NEL, Lipase_GDSL, Sif, Tox_PLDMTX and Pfam-B domain PB006720. With the accumulation of known T3SE data, we speculate these networks will be more complete. By combining with network analysis tools, they can become useful tools for analyzing the functional and evolutionary relationship among T3SEs. For example, even though most domains are specific to animal or plant pathogen T3SEs, there are also some domains such as YopJ domain (Pfam ID: PF03421), which has a serine/threonine acetyltransferase activity and can block host immune signaling by inhibiting kinase phosphorylation ( 37 ), shared by both plant pathogen and animal pathogen T3SEs ( Figure 3 D). This observation suggests that acetylation of host kinase is a prevalent strategy used by pathogens.

Conclusions

Here, we present BEAN 2.0 as an accurate, practical and convenient bioinformatics platform for T3SE research ( Supplementary Figure S7 ). First, BEAN 2.0 provides a highly accurate predictor. While machine-learning T3SE predictors have been developed in the past several years, we show traditional sequence alignment and domain analysis can substantially improve prediction accuracy if they are integrated with machine-learning predictors in a rational way. Second, BEAN 2.0 integrates with other bioinformatics tools providing a comprehensive analysis platform to timely annotate T3SEs in the three most important aspects: subcellular locations in host cell, disorder regions and functional domains. Finally, BEAN 2.0 stores 1215 verified T3SEs and allows users to explore their relationships through two interactive T3SE-related networks. With the rapid progress of genome sequencing, the validated T3SEs are accumulating at unprecedented rates. There is a higher requirement for systematically summarizing these T3SEs and extracting new knowledge about the evolution of T3SEs. The data resources deposited in BEAN 2.0 represent our first step in the direction.

Funding

This work was supported by grants from the National Natural Science Foundation of China (31271414 and 31471249).

Conflict of interest . None declared.

References

Galán

J.E.

Wolf-Watz

(

2006

)

Protein delivery into eukaryotic cells by type III secretion machines

Nature

444

567

–

573

Galán

J.E.

(

2009

)

Common themes in the design and function of bacterial effectors

Cell Host Microbe

571

–

579

Brodsky

I.E.

Medzhitov

(

2009

)

Targeting of immune signalling networks by bacterial pathogens

Nat. Cell Biol.

521

–

526

Ribet

Cossart

(

2010

)

Pathogen-mediated posttranslational modifications: a re-emerging field

Cell

143

694

–

702

Dean

(

2011

)

Functional domains and motifs of bacterial type III effector proteins and their roles in infection

FEMS Microbiol. Rev.

1100

–

1125

Zhou

et al. . (

2007

)

The phosphothreonine lyase activity of a bacterial type III effector family

Science

315

1000

–

1003

Stavrinides

Guttman

D.S.

(

2006

)

Terminal reassortment drives the quantum evolution of type III effectors in bacterial pathogens

PLoS Pathog.

e104

Marín

Uversky

V.N.

Ott

(

2013

)

Intrinsic disorder in pathogen effectors: protein flexibility as an evolutionary hallmark in a molecular arms race

Plant Cell

3153

–

3157

Shames

S.R.

Finlay

B.B.

(

2012

)

Bacterial effector interplay: a new way to view effector function

Trends Microbiol.

214

–

219

Lindeberg

Cunnac

Collmer

(

2012

)

Pseudomonas syringae type III effector repertoires: last words in endless arguments

Trends Microbiol.

199

–

208

Baltrus

D.A.

Nishimura

M.T.

Romanchuk

et al. . (

2011

)

Dynamic evolution of pathogenicity revealed by sequencing and comparative genomics of 19 Pseudomonas syringae isolates

PLoS Pathog.

e1002132

Samudrala

Heffron

McDermott

J.E.

(

2009

)

Accurate prediction of secreted substrates and identification of a conserved putative secretion signal for type III secretion systems

PLoS Pathog.

e1000375

Arnold

Brandmaier

Kleine

et al. . (

2009

)

Sequence-based prediction of type III secreted proteins

PLoS Pathog.

e1000376

Löwer

Schneider

(

2009

)

Prediction of type III secretion signals in genomes of Gram-negative bacteria

PLoS One

e5917

Yang

Zhao

Morgan

et al. . (

2010

)

Computational prediction of type III secreted proteins from Gram-negative bacteria

BMC Bioinformatics

S47

Wang

Zhang

Sun

M.-A.

et al. . (

2011

)

High-accuracy prediction of bacterial type III secreted effectors based on position-specific amino acid composition profiles

Bioinformatics

777

–

784

Dong

Zhang

Y.-J.

Zhang

(

2013

)

Using weakly conserved motifs hidden in secretion signals to identify type-III effectors from bacterial pathogen genomes

PLoS One

e56632

Wang

Sun

Bao

et al. . (

2013

)

T3_MM: a Markov model effectively classifies bacterial type III secretion signals

PLoS One

e58173

Yang

Guo

Luo

et al. . (

2013

)

Effective identification of Gram-negative bacterial type III secreted effectors using position-specific residue conservation profiles

PLoS One

e84439

Tay

Govindarajan

Khan

et al. . (

2010

)

T3SEdb: data warehousing of virulence effectors secreted by the bacterial Type III secretion system

BMC Bioinformatics

Jehl

M.-A.

Arnold

Rattei

(

2011

)

Effective—a database of predicted secreted bacterial proteins

Nucleic Acids Res.

D591

–

D595

Wang

Huang

Sun

et al. . (

2012

)

T3DB: an integrated database for bacterial type III secretion system

BMC Bioinformatics

Chen

Kurgan

L.A.

Ruan

(

2008

)

Prediction of protein structural class using novel evolutionary collocation-based sequence representation

J. Comput. Chem

1596

–

1604

Chang

Chen

Zhou

(

2005

)

Delineation and characterization of the actin nucleation and effector translocation activities of Salmonella SipC

Mol. Microbiol.

1379

–

1389

Allen-Vercoe

Toh

M.C.W.

Waddell

et al. . (

2005

)

A carboxy-terminal domain of Tir from enterohemorrhagic Escherichia coli O157:H7 (EHEC O157:H7) required for efficient type III secretion

FEMS Microbiol. Lett.

243

355

–

364

Kim

B.H.

Kim

H.G.

Kim

J.S.

et al. . (

2007

)

Analysis of functional domains present in the N-terminus of the SipB protein

Microbiology

153

2998

–

3008

Brown

N.F.

Szeto

Jiang

et al. . (

2006

)

Mutational analysis of Salmonella translocated effector members SifA and SopD2 reveals domains implicated in translocation, subcellular localization and function

Microbiology

152

2323

–

2343

Kumar

Verma

Raghava

G.P.S.

(

2006

)

Prediction of mitochondrial proteins using support vector machine and hidden Markov model

J. Biol. Chem.

281

5357

–

5363

Remmert

Biegert

Hauser

et al. . (

2012

)

HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment

Nat. Methods

173

–

175

Google Scholar

Crossref

WorldCat

Zumaquero

Macho

A.P.

Rufián

J.S.

et al. . (

2010

)

Analysis of the role of the type III effector inventory of Pseudomonas syringae pv. phaseolicola 1448a in interaction with the plant

J. Bacteriol.

192

4474

–

4488

Tobe

Beatson

S.A.

Taniguchi

et al. . (

2006

)

An extensive repertoire of type III secretion effectors in Escherichia coli O157 and the role of lambdoid phages in their dissemination

Proc. Natl. Acad. Sci. U.S.A.

103

14941

–

14946

Liu

Tai

et al. . (

2013

)

SecReT4: a web-based bacterial type IV secretion system resource

Nucleic Acids Res.

D660

–

D665

Emanuelsson

Brunak

von Heijne

et al. . (

2007

)

Locating proteins in the cell using TargetP, SignalP and related tools

Nat. Protoc.

953

–

971

Chou

K.-C.

Shen

H.-B.

(

2008

)

Cell-PLoc: a package of Web servers for predicting subcellular localization of proteins in various organisms

Nat. Protoc.

153

–

162

Finn

R.D.

Bateman

Clements

et al. . (

2014

)

Pfam: the protein families database

Nucleic Acids Res.

D222

–

D230

Dosztányi

Csizmok

Tompa

et al. . (

2005

)

IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content

Bioinformatics

3433

–

3434

Medini

Covacci

Donati

(

2006

)

Protein homology network families reveal step-wise diversification of type III and type IV secretion systems

PLoS Comput. Biol.

e173

Mukherjee

Keitany

et al. . (

2006

)

Yersinia YopJ acetylates and inhibits kinase activation by blocking phosphorylation

Science

312

1211

–

1214

Author notes

^† These authors contributed equally to the work.

Citation details: Dong,X., Lu,X. and Zhang,Z. BEAN 2.0: an integrated web resource for the identification and functional analysis of type III secreted effectors. Database (2015) Vol. 2015: article ID bav064; doi:10.1093/database/bav064

This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/ ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Download all slides

Month:	Total Views:
November 2016	2
December 2016	5
January 2017	6
February 2017	5
March 2017	4
April 2017	5
May 2017	6
June 2017	6
July 2017	9
August 2017	8
September 2017	3
October 2017	3
November 2017	5
December 2017	25
January 2018	19
February 2018	17
March 2018	21
April 2018	23
May 2018	10
June 2018	18
July 2018	20
August 2018	24
September 2018	13
October 2018	7
November 2018	19
December 2018	14
January 2019	13
February 2019	6
March 2019	31
April 2019	30
May 2019	20
June 2019	25
July 2019	20
August 2019	16
September 2019	18
October 2019	19
November 2019	8
December 2019	27
January 2020	13
February 2020	16
March 2020	19
April 2020	12
May 2020	14
June 2020	26
July 2020	14
August 2020	34
September 2020	46
October 2020	28
November 2020	27
December 2020	20
January 2021	29
February 2021	16
March 2021	23
April 2021	22
May 2021	14
June 2021	5
July 2021	20
August 2021	26
September 2021	17
October 2021	27
November 2021	37
December 2021	11
January 2022	26
February 2022	20
March 2022	25
April 2022	21
May 2022	3
June 2022	17
July 2022	16
August 2022	12
September 2022	32
October 2022	20
November 2022	19
December 2022	11
January 2023	9
February 2023	9
March 2023	13
April 2023	20
May 2023	4
June 2023	7
July 2023	6
August 2023	15
September 2023	17
October 2023	12
November 2023	11
December 2023	32
January 2024	25
February 2024	25
March 2024	13
April 2024	14
May 2024	14
June 2024	7
July 2024	13
August 2024	22
September 2024	9
October 2024	10
November 2024	16
December 2024	12
January 2025	10
February 2025	14
March 2025	14
April 2025	6
May 2025	11
June 2025	15
July 2025	12
August 2025	17
September 2025	17
October 2025	10
November 2025	8
December 2025	14
January 2026	10
February 2026	7
March 2026	20
April 2026	6
May 2026	1

Article Contents

BEAN 2.0: an integrated web resource for the identification and functional analysis of type III secreted effectors

Abstract

Introduction

Materials and Methods

Data collection

Overall workflow of T3SE prediction in BEAN 2.0

Evaluation the performance of BEAN 2.0

Performance assessment parameters

Construction of BEAN 2.0 web server

Results and Discussion

Accurate prediction of T3SEs using BEAN 2.0

An integrative T3SE web resource

T3SE databases and T3SE-related networks

Conclusions

Funding

References

Author notes

Supplementary data

Citations

Views

Altmetric

Citing articles via

Latest

Most Read

Most Cited

Article Contents

BEAN 2.0: an integrated web resource for the identification and functional analysis of type III secreted effectors Open Access

Abstract

Introduction

Materials and Methods

Data collection

Overall workflow of T3SE prediction in BEAN 2.0

Evaluation the performance of BEAN 2.0

Performance assessment parameters

Construction of BEAN 2.0 web server

Results and Discussion

Accurate prediction of T3SEs using BEAN 2.0

An integrative T3SE web resource

T3SE databases and T3SE-related networks

Conclusions

Funding

References

Author notes

Supplementary data

Citations

Views

Altmetric

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only

Gift article access

Gift article access

Gift article access

Gift article access

BEAN 2.0: an integrated web resource for the identification and functional analysis of type III secreted effectors