Diseases 2.0: a weekly updated database of disease–gene associations from text mining and data integration Open Access

10.1093/bioinformatics/btz070

Comeau

D.C.

Wei

C.-H.

Islamaj Doǧan

et al. (

2019

)

PMC text mining subset in BioC: about three million full-text articles and growing

Bioinformatics

3533

–

3535

. doi:

Pandi

M.-T.

van der Spek

P.J.

Koromina

et al. (

2020

)

A novel text-mining approach for retrieving pharmacogenomics associations from the literature

Front. Pharmacol.

, 602030. doi:

10.3389/fphar.2020.602030

Karadeniz

Hur

et al. (

2015

)

Literature mining and ontology based analysis of host-Brucella gene–gene interaction network

Front. Microbiol.

, 1386. doi:

10.3389/fmicb.2015.01386

Qin

Yao

and

Xia

(

2021

)

A novel metric to quantify the effect of pathway enrichment evaluation with respect to biomedical text-mined terms: development and feasibility study

JMIR Med. Inform.

, e28247. doi:

10.2196/28247

10.1186/s12859-018-2048-y

Simmons

Singhal

and

(

2016

)

Text mining for precision medicine: bringing structure to EHRs and biomedical literature to understand genes and health

Adv. Exp. Med. Biol.

939

139

–

166

Zhou

and

B.-Q.

(

2018

)

The research on gene-disease association based on text-mining of PubMed

BMC Bioinformatics

, 37. doi:

Czarnecki

and

Shepherd

A.J.

(

2014

)

Mining Biological Networks from Full-Text Articles

Springer

New York, NY

pp. 135

–

145

Google Preview

10.1093/bioinformatics/btn469

10.

Jenssen

T.K.

Laegreid

Komorowski

et al. (

2001

)

A literature network of human genes for high-throughput analysis of gene expression

Nat. Genet.

–

. doi:

11.

Tsuruoka

Tsujii

and

Ananiadou

(

2008

)

FACTA: a text search engine for finding associated biomedical concepts

Bioinformatics

2559

–

2560

. doi:

12.

The UniProt Consortium

. (

2018

)

UniProt: the universal protein knowledgebase

Nucleic Acids Res.

D158

–

D169

13.

Amberger

J.S.

Bocchini

C.A.

Scott

A.F.

et al. (

2018

)

OMIM.org: leveraging knowledge across phenotype–gene relationships

Nucleic Acids Res.

D1038

–

D1043

. doi:

14.

Fomous

Mitchell

J.A.

and

McCray

(

2006

)

Genetics home reference: helping patients understand the role of genetics in health and disease

Community Genet.

274

–

278

PubMed

10.1002/0471142905.hg1011s57

15.

Forbes

Bhamra

Bamford

et al. (

2008

)

The Catalogue of Somatic Mutations in Cancer (COSMIC)

Curr. Protoc. Hum. Genet

. doi:

10.1038/s41568-020-0290-x

16.

Martínez-Jiménez

Muiños

Sentís

et al. (

2020

)

A compendium of mutational cancer driver genes

Nat. Rev. Cancer

555

–

572

. doi:

17.

Rouillard

A.D.

Gundersen

G.W.

Fernandez

N.F.

et al. (

2016

)

The Harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins

Database

, baw100. doi:

10.1093/database/baw100

18.

Beck

Shorter

and

Brookes

A.J.

(

2019

)

GWAS Central: a comprehensive resource for the discovery and comparison of genotype and phenotype data from genome-wide association studies

Nucleic Acids Res.

D933

–

D940

10.1093/bioinformatics/btab427

19.

M.J.

Liu

Wang

et al. (

2015

)

GWASdb v2: an update database for human genetic variants identified by genome-wide association studies

Nucleic Acids Res.

D869

–

D876

. doi:

20.

Frazer

K.A.

Murray

S.S.

Schork

N.J.

et al. (

2009

)

Human genetic variation and its contribution to complex traits

Nat. Rev. Genet.

241

–

251

. doi:

21.

Pallejà

Horn

Eliasson

et al. (

2011

)

DistiLD database: diseases and traits in linkage disequilibrium blocks

Nucleic Acids Res.

D1036

–

D1040

. doi:

22.

Yang

J.J.

Grissa

Lambert

C.G.

et al. (

2021

)

TIGA: target illumination GWAS analytics

Bioinformatics

3865

–

3873

. doi:

23.

Rappaport

Twik

Plaschkes

et al. (

2016

)

MalaCards: an amalgamated human disease compendium with diverse clinical and genetic annotation and structured search

Nucleic Acids Res.

D877

–

D887

. doi:

24.

Piñero

Ramírez-Anguita

J.M.

Saüch-Pitarch

et al. (

2019

)

The DisGeNET knowledge platform for disease genomics: 2019 update

Nucleic Acids Res.

D845

–

D855

10.1080/13506129.2019.1603143

25.

Sheils

T.K.

Mathias

S.L.

Kelleher

K.J.

et al. (

2020

)

TCRD and Pharos 2021: mining the human proteome for disease biology

Nucleic Acids Res.

D1334

–

D1346

. doi:

26.

Ochoa

Hercules

Carmona

et al. (

2021

)

Open Targets Platform: supporting systematic drug–target identification and prioritisation

Nucleic Acids Res.

D1302

–

D1310

. doi:

27.

Szklarczyk

Gable

A.L.

Lyon

et al. (

2018

)

STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets

Nucleic Acids Res.

D607

–

D613

. doi:

28.

Schriml

L.M.

Mitraka

Munro

et al. (

2018

)

Human Disease Ontology 2018 update: classification, content and workflow expansion

Nucleic Acids Res.

D955

–

D962

. doi:

29.

Nastou

K.C.

Nasi

G.I.

Tsiolaki

P.L.

et al. (

2019

)

AmyCo: the amyloidoses collection

Amyloid

112

–

117

. doi:

30.

Hutchins

B.I.

Yuan

Anderson

J.M.

et al. (

2016

)

Relative Citation Ratio (RCR): a new metric that uses citation rates to measure influence at the article level

PLoS Biol.

–

. doi:

10.1371/journal.pbio.1002541

31.

Doǧan

R.I.

Wilbur

W.J.

and

Comeau

D.C.

(

2014

)

BioC and simplified use of the PMC open access dataset for biomedical text mining

. Proceedings of the 4th Workshop on Building and Evaluating Resources for Health and Biomedical Text Processing.

32.

Chawla

D.S.

(

2020

)

A single ‘paper mill’ appears to have churned out 400 papers, sleuths find

Science

. 10.1126/science.abb4930.

10.1371/journal.pone.0065390

33.

Joulin

Grave

Bojanowski

et al. (

2017

)

Bag of Tricks for Efficient Text Classification

. 10.18653/V1/E17-2068.

34.

Flicek

Ahmed

Amode

M.R.

et al. (

2012

)

Ensembl 2013

Nucleic Acids Res.

D48

–

D55

. doi:

35.

Gray

K.A.

Daugherty

L.C.

Gordon

S.M.

et al. (

2012

)

Genenames.org: the HGNC resources in 2013

Nucleic Acids Res.

D545

–

D552

. doi:

36.

Pafilis

Pletscher-Frankild

Fanini

et al. (

2013

)

The SPECIES and ORGANISMS resources for fast and accurate identification of taxonomic names in text

PLoS One

, e65390. doi:

10.1038/d41586-021-00733-5

37.

Else

and

Van Noorden

(

2021

)

The fight against fake-paper factories that churn out sham science

Nature

591

516

–

519

. doi:

38.

Stelzer

Rosen

Plaschkes

et al. (

2016

)

The GeneCards suite: from gene data mining to disease genome sequence analyses

Curr. Protoc. Bioinform.

1.30.1

–

1.30.33

10.1093/bioinformatics/btx200

39.

Cannon

D.C.

Yang

J.J.

Mathias

S.L.

et al. . (

2017

)

TIN-X: target importance and novelty explorer

Bioinformatics

2601

–

2603

. doi:

40.

Lachmann

Schilder

B.M.

Wojciechowicz

M.L.

et al. (

2019

)

Geneshot: search engine for ranking genes from arbitrary text queries

Nucleic Acids Res.

W571

–

W577

. doi:

41.

Rouillard

A.D.

Gundersen

G.W.

Fernandez

N.F.

et al. (

2016

)

The harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins

Database

2016

, baw100. doi:

10.1093/database/baw100