New reasons for biologists to write with a formal language Open Access

Boole

(

1854

)

An Investigation of the Laws of Thought

Walton & Maberly

London and Cambridge, UK

Winston

J.E.

(

2018

)

Twenty-first century biological nomenclature—the enduring power of names

Integr. Comp. Biol.

1122

–

1131

PubMed

Woodger

J.H.

(

1929

)

Biological Principles: A Critical Study

Routledge & Kegan Paul Ltd

London

Nicholson

D.J.

and

Gawne

(

2014

)

Rethinking Woodger’s legacy in the philosophy of biology

J. Hist. Biol.

243

–

292

Hirschman

Morgan

A.A.

and

Yeh

A.S.

(

2002

)

Rutabaga by any other name: extracting biological names

J. Biomed. Inform.

247

–

259

Chen

Liu

and

Friedman

(

2005

)

Gene name ambiguity of eukaryotic nomenclatures

Bioinformatics

248

–

256

10.

Rodriguez-Esteban

and

Jiang

(

2017

)

Differential gene expression in disease: a comparison between high-throughput studies and the literature

BMC Med. Genomics

, 59.

11.

Jonnalagadda

Tari

Hakenberg

et al. (

2009

)

Towards effective sentence simplification for automatic processing of biomedical text

. In: Proceedings of Human Language Technologies.

Association for Computational Linguistics, New York, NY, USA, Boulder, Colorado

, pp.

177

–

180

12.

Bornmann

and

Mutz

(

2015

)

Growth rates of modern science: a bibliometric analysis based on the number of publications and cited references

J. Assoc. Inf. Sci. Technol.

2215

–

2222

Crossref

13.

Else

(

2020

)

How a torrent of COVID science changed research publishing—in seven charts

Nature

588

, 553.

14.

Brainard

(

2020

)

Scientists are drowning in COVID-19 papers. Can new tools keep them afloat?

Science

15.

Baumgartner

W.A.

Jr,

Cohen

K.B.

Fox

L.M.

et al. (

2007

)

Manual curation is not sufficient for annotation of genomic databases

Bioinformatics

i41

–

i48

16.

Rodriguez-Esteban

(

2015

)

Biocuration with insufficient resources and fixed timelines

Database (Oxford)

2015

, bav116.

17.

Chandras

Weaver

Zouberakis

et al. (

2009

)

Models for financial sustainability of biological databases and resources

Database (Oxford)

2009

, bap017.

18.

Reiser

Berardini

T.Z.

et al. (

2016

)

Sustainable funding for biocuration: The Arabidopsis Information Resource (TAIR) as a case study of a subscription-based funding model

Database (Oxford)

2016

, baw018.

19.

Karp

P.D.

(

2016

)

How much does curation cost?

Database (Oxford)

2016

, baw110.

20.

Karp

P.D.

(

2016

)

Can we replace curation with information extraction software?

Database (Oxford)

2016

, baw150.

21.

Poux

Arighi

C.N.

Magrane

et al. (

2017

)

On expert curation and scalability: UniProtKB/Swiss-Prot as a case study

Bioinformatics

3454

–

3460

22.

Bourne

P.E.

Lorsch

J.R.

and

Green

E.D.

(

2015

)

Perspective: sustaining the big-data ecosystem

Nature

527

S16

–

23.

Rodriguez-Esteban

(

2016

) Understanding human disease knowledge through text mining: what is text mining? In:

Loging

(ed).

Bioinformatics and Computational Biology in Drug Discovery and Development

Cambridge University Press

Cambridge, UK

, pp. 47–62.

24.

Zhu

and

Zheng

(

2020

)

Biomedical event extraction with a novel combination strategy based on hybrid deep neural networks

BMC Bioinform.

, 47.

25.

Percha

and

Altman

R.B.

(

2018

)

A global network of biomedical relationships derived from text

Bioinformatics

2614

–

2624

26.

Mehryary

Kaewphan

Hakala

et al. (

2016

)

Filtering large-scale event collections using a combination of supervised and unsupervised learning for event trigger classification

J. Biomed. Semant.

, 27.

27.

Gyori

B.M.

Bachman

J.A.

Subramanian

et al. (

2017

)

From word models to executable models of signaling networks using automated assembly

Mol. Syst. Biol.

, 954.

28.

Huang

C.C.

and

(

2016

)

Community challenges in biomedical text mining over 10 years: success, failure and the future

Brief. Bioinform.

132

–

144

29.

Kim

J.D.

Ohta

and

Tsujii

(

2008

)

Corpus annotation for mining biomedical events from literature

BMC Bioinform.

, 10.

30.

Thompson

Nawaz

McNaught

et al. (

2011

)

Enriching a biomedical event corpus with meta-knowledge annotation

BMC Bioinform.

, 393.

31.

Cao

Lin

Han

et al. (

2021

)

Knowledgeable or educated guess? Revisiting language models as knowledge bases

. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Association for Computational Linguistics, New York, NY, USA.

32.

Wang

Liu

and

Zhang

(

2021

)

Can generative pre-trained language models serve as knowledge bases for closed-book QA?

In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Association for Computational Linguistics, New York, NY, USA.

33.

Hogan

Blomqvist

Cochez

et al. (

2021

)

Knowledge graphs

ACM Comput. Surv.

, 1–37.

34.

Hoyt

C.T.

Domingo-Fernández

Aldisi

et al. (

2019

)

Re-curation and rational enrichment of knowledge graphs in Biological Expression Language

Database

2019

, baz068.

35.

Sun

Wang

Feng

et al. (

2021

)

ERNIE 3.0: large-scale knowledge enhanced pre-training for language understanding and generation

. arXiv: 2107.02137.

36.

Fei

Ren

Zhang

et al. (

2021

)

Enriching contextualized language model from knowledge graph for biomedical information extraction

Brief. Bioinform.

, bbaa110.

37.

Yuan

Liu

Tan

et al. (

2021

)

Improving biomedical pretrained language models with knowledge

. In: Proceedings of the 20th Workshop on Biomedical Language Processing, Association for Computational Linguistics, New York, NY, USA, pp.

180

–

190

38.

Zhao

Jiang

et al. (

2020

)

A novel method for multiple biomedical events extraction with reinforcement learning and knowledge bases

. In: 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), IEEE Computer Society, Los Alamitos, CA, USA, pp.

402

–

407

39.

Balabin

Hoyt

C.T.

Birkenbihl

et al. (

2022

)

STonKGs: a sophisticated transformer trained on biomedical text and knowledge graphs

Bioinformatics

: 1648–56.

40.

Weikum

Dong

Razniewski

et al. (

2021

)

Machine knowledge: creation and curation of comprehensive knowledge bases

Found. Trends® Databases

. arXiv, 11564.

41.

Yun

Zhang

et al. (

2021

)

Knowledge modeling: a survey of processes and techniques

Int. J. Intell. Syst.

1686

–

1720

Crossref

42.

Wang

de Melo

et al. (

2016

)

Visualizing and curating knowledge graphs over time and space

. In:

Proceedings of ACL-2016 System Demonstrations. Association for Computational Linguistics

. Berlin, Germany, pp. 25–30.

43.

Leaman

Wei

C.H.

Allot

et al. (

2020

)

Ten tips for a text-mining-ready article: how to improve automated discoverability and interpretability

PLoS Biol.

, e3000716.

44.

Fujiyoshi

Bruford

E.A.

Mroz

et al. (

2021

)

Opinion: standardizing gene product nomenclature-a call to action

Proc. Natl. Acad. Sci. U. S. A.

118

, e2025207118.

. https://biological-expression-language.github.io/.

45.

Biological Expression Language

46.

Woodger

J.H.

(

1952

)

Biology and Language

The University Press

Cambridge

47.

Fabbrizzi

(

2008

)

Communicating about matter with symbols: evolving from alchemy to chemistry

J. Chem. Educ.

, 1501.

48.

Krallinger

Leitner

Rabal

et al. (

2015

)

CHEMDNER: the drugs and chemical names extraction challenge

J. Cheminform.

, S1.

49.

Crossland

M.P.

(

1962

)

Historical Studies in the Language of Chemistry

Heinemann

London

50.

Strömbäck

and

Lambrix

(

2005

)

Representations of molecular pathways: an evaluation of SBML, PSI MI and BioPAX

Bioinformatics

4401

–

4407

51.

Le Novère

Hucka

et al. (

2009

)

The systems biology graphical notation

Nat. Biotechnol.

735

–

741

52.

Demir

Cary

M.P.

Paley

et al. (

2010

)

The BioPAX community standard for pathway data sharing

Nat. Biotechnol.

935

–

942

53.

Schwartz

and

Nenadic

(

2013

)

PathNER: a tool for systematic identification of biological pathway mentions in the literature

BMC Syst. Biol.

, S2.

54.

Takada

and

Aggarwal

B.B.

(

2004

)

TNF activates Syk protein tyrosine kinase leading to TNF-induced MAPK activation, NF-kappaB activation, and apoptosis

J. Immunol.

173

1066

–

1077

55.

Cokol

Rodriguez-Esteban

and

Rzhetsky

(

2007

)

A recipe for high impact

Genome Biol.

, 406.