TRGdb: a universal resource for the exploration of taxonomically restricted genes in bacteria Open Access
