A database resource and online analysis tools for coronaviruses on a historical and global scale

PubMed

21.

Sayers

E.W.

,

Beck

J.

,

Brister

J.R.

et al. (

2020

)

Database resources of the National Center for Biotechnology Information

.

Nucleic Acids Res.

,

48

,

D9

–

D16

.

22.

Madeira

F.

,

Park

Y.M.

,

Lee

J.

et al. (

2019

)

The EMBL-EBI search and sequence analysis tools APIs in 2019

.

Nucleic Acids Res.

,

47

,

W636

–

W641

.

23.

Yue

H.

,

Shu

D.

,

Wang

M.

et al. (

2018

)

Genome-wide identification and expression analysis of the HD-zip gene family in wheat (Triticum aestivum L.)

.

Genes

,

9

,

70

.

24.

She

R.

,

Chu

J.S.

,

Wang

K.

et al. (

2009

)

GenBlastA: enabling BLAST to identify homologous gene sequences

.

Genome Res.

,

19

,

143

–

149

.

25.

Li

W.

and

Godzik

A.

(

2006

)

Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences

.

Bioinformatics

,

22

,

1658

–

1659

.

26.

Shu

Y.

and

McCauley

J.

(

2017

)

GISAID: global initiative on sharing all influenza data - from vision to reality

.

Euro Surveill.

,

22

, 30494.

27.

Patient

S.

,

Wieser

D.

,

Kleen

M.

et al. (

2008

)

UniProtJAPI: a remote API for accessing UniProt data

.

Bioinformatics

,

24

,

1321

–

1322

.

28.

Berman

H.M.

,

Westbrook

J.

,

Feng

Z.

et al. (

2000

)

The Protein Data Bank

.

Nucleic Acids Res.

,

28

,

235

–

242

.

29.

Burley

S.K.

,

Berman

H.M.

,

Bhikadiya

C.

et al. (

2019

)

RCSB Protein Data Bank: biological macromolecular structures enabling research and education in fundamental biology, biomedicine, biotechnology and energy

.

Nucleic Acids Res.

,

47

,

D464

–

D474

.

30.

Thakur

A.

,

Rajput

A.

and

Kumar

M.

(

2016

)

MSLVP: prediction of multiple subcellular localization of viral proteins using a support vector machine

.

Mol. Biosyst.

,

12

,

2572

–

2586

.

31.

Krogh

A.

,

Larsson

B.

,

von Heijne

G.

et al. (

2001

)

Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes

.

J. Mol. Biol.

,

305

,

567

–

580

.

32.

Hung

C.L.

,

Lin

Y.S.

,

Lin

C.Y.

et al. (

2015

)

CUDA ClustalW: an efficient parallel algorithm for progressive multiple sequence alignment on multi-GPUs

.

Comput. Biol. Chem.

,

58

,

62

–

68

.

33.

Harris

R.S.

(

2007

)

Improved pairwise alignment of genomic DNA

.

Ph.D. Thesis

.

Pennsylvania State University

.

34.

Edgar

R.C.

(

2004

)

MUSCLE: a multiple sequence alignment method with reduced time and space complexity

.

BMC Bioinform.

,

5

, 113.

35.

Edgar

R.C.

(

2004

)

MUSCLE: multiple sequence alignment with high accuracy and high throughput

.

Nucleic Acids Res.

,

32

,

1792

–

1797

.

36.

Price

M.N.

,

Dehal

P.S.

and

Arkin

A.P.

(

2010

)

FastTree 2—approximately maximum-likelihood trees for large alignments

.

PLoS One

,

5

, e9490.

37.

Hutter

S.

,

Vilella

A.J.

and

Rozas

J.

(

2006

)

Genome-wide DNA polymorphism analyses using VariScan

.

BMC Bioinform.

,

7

, 409.

38.

Vilella

A.J.

,

Blanco-Garcia

A.

,

Hutter

S.

et al. (

2005

)

VariScan: Analysis of evolutionary patterns from large-scale DNA sequence polymorphism data

.

Bioinformatics

,

21

,

2791

–

2793

.

39.

DeGiorgio

M.

,

Huber

C.D.

,

Hubisz

M.J.

et al. (

2016

)

SweepFinder2: increased sensitivity, robustness and flexibility

.

Bioinformatics

,

32

,

1895

–

1897

.

40.

Holsinger

K.E.

and

Weir

B.S.

(

2009

)

Genetics in geographically structured populations: defining, estimating and interpreting F(ST)

.

Nat. Rev. Genet.

,

10

,

639

–

650

.

41.

Fumagalli

M.

,

Vieira

F.G.

,

Korneliussen

T.S.

et al. (

2013

)

Quantifying population genetic differentiation from next-generation sequencing data

.

Genetics

,

195

,

979

–

992

.

42.

Zhu

Z.

,

Guan

Z.

,

Liu

G.

et al. (

2019

)

SGID: a comprehensive and interactive database of the silkworm

.

Database

,

2019

, baz134.

43.

Zhu

Z.

and

Meng

G.

(

2019

)

ASFVdb: An integrative resource for genomics and proteomics analyses of African swine fever

.

Database, 2019, baaa023

.

44.

Zhu

Z.

,

Wang

Y.

,

Zhou

X.

et al. (

2020

)

SWAV: a web-based visualization browser for sliding window analysis

.

Sci. Rep.

,

10

, 149.

45.

Yachdav

G.

,

Wilzbach

S.

,

Rauscher

B.

et al. (

2016

)

MSAViewer: interactive JavaScript visualization of multiple sequence alignments

.

Bioinformatics

,

32

,

3501

–

3503

.

PubMed

46.

Shank

S.D.

,

Weaver

S.

and

Kosakovsky Pond

S.L.

(

2018

)

phylotree.js - a JavaScript library for application development and interactive data visualization in phylogenetics

.

BMC Bioinform.

,

19

, 276.

47.

Kent

W.J.

(

2002

)

BLAT—the BLAST-like alignment tool

.

Genome Res.

,

12

,

656

–

664

.

48.

Johnson

M.

,

Zaretskaya

I.

,

Raytselis

Y.

et al. (

2008

)

NCBI BLAST: a better web interface

.

Nucleic Acids Res.

,

36

,

W5

–

W9

.

49.

Wang

J.

,

Youkharibache

P.

,

Zhang

D.

et al. (

2020

)

iCn3D, a web-based 3D viewer for sharing 1D/2D/3D representations of biomolecular structures

.

Bioinformatics

,

36

,

131

–

135

.

50.

Lu

R.

,

Zhao

X.

,

Li

J.

et al. (

2020

)

Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding

.

Lancet

,

395

,

565

–

574

.

51.

Hu

B.

,

Zeng

L.P.

,

Yang

X.L.

et al. (

2017

)

Discovery of a rich gene pool of bat SARS-related coronaviruses provides new insights into the origin of SARS coronavirus

.

PLoS Pathog.

,

13

, e1006698.

10.1101/2020.02.17.951335

52.

Niederwerder

M.C.

and

Hesse

R.A.

(

2018

)

Swine enteric coronavirus disease: a review of 4 years with porcine epidemic diarrhoea virus and porcine deltacoronavirus in the United States and Canada

.

Transbound Emerg. Dis.

,

65

,

660

–

675

.

53.

Xiao

K.

,

Zhai

J.

,

Feng

Y.

et al. (

2020

)

Isolation and characterization of 2019-nCoV-like coronavirus from Malayan Pangolins

. bioRxiv. doi:

Crossref

54.

Ye

J.

,

Fang

L.

,

Zheng

H.

et al. (

2006

)

WEGO: a web tool for plotting GO annotations

.

Nucleic Acids Res.

,

34

,

W293

–

W297

.

55.

Watterson

G.A.

(

1975

)

On the number of segregating sites in genetical models without recombination

.

Theor. Popul. Biol.

,

7

,

256

–

276

.

56.

Tajima

F.

(

1989

)

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism

.

Genetics

,

123

,

585

–

595

.

57.

Nielsen

R.

,

Williamson

S.

,

Kim

Y.

et al. (

2005

)

Genomic scans for selective sweeps using SNP data

.

Genome Res.

,

15

,

1566

–

1575

.

58.

Zhu

L.

and

Bustamante

C.D.

(

2005

)

A composite-likelihood approach for detecting directional selection from DNA sequence data

.

Genetics

,

170

,

1411

–

1421

.

59.

Chen

Y.

,

Guo

Y.

,

Pan

Y.

et al. (

2020

)

Structure analysis of the receptor binding of 2019-nCoV

.

Biochem. Biophys. Res. Commun

,

525

,

135

–

140

.

Crossref

10.1101/2020.02.05.20020107

60.

Cai

G.

(

2020

)

Bulk and single-cell transcriptomics identify tobacco-use disparity in lung gene expression of ACE2, the receptor of 2019-nCov

.

medRxiv

. doi: