Gold-standard ontology-based anatomical annotation in the CRAFT Corpus

3

Friedman

C.

,

Rindflesch

T.C.

,

Corn

M.

(

2013

)

Natural language processing: state of the art and prospects for significant progress, a workshop sponsored by the National Library of Medicine

.

J. Biomed. Inf

.,

46

,

765

–

773

.

4

Cohen

K.B.

,

Demner-Fushman

D.

(

2014

)

Biomedical Natural Language Processing

.

John Benjamins

,

Amsterdam

.

5

Holzinger

A.

,

Schanti

J.

,

Schroettner

M.

et al. (

2014

) Biomedical text mining: state-of-the-art, open problems and future challenges. In:

Holzinger

A.

,

Jurisica

I.

(eds).

Interactive Knowledge Discovery and Data Mining in Biomedical Informatics

.

Springer, Berlin Heidelberg

, pp.

271

–

300

.

6

Ivanović

M.

,

Budimac

Z.

(

2014

)

An overview of ontologies and data sources in medical domains

.

Expert Syst. Appl

.,

41

,

5158

–

5166

.

7

Hoehndorf

R.

,

Schofield

P.N.

,

GKoutos

G.V.

(

2015

)

The role of ontologies in biological and biomedical research: a functional perspective

.

Briefings Bioinf

.,

16

,

1069

–

1080

.

8

Smith

B.

,

Ashburner

M.

,

Rosse

C.

et al. (

2007

)

The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration

.

Nat. Biotechnol

.,

25

,

1251

–

1255

.

9

Neves

M.

(

2014

)

An analysis on the entity annotations in biological corpora

.

F1000 Res

.,

3

,

96.

10

Wissler

L.

,

Almashraee

M.

,

Monett

D.

,

Paschke

A.

(

2014

) The gold standard in corpus annotation. IEEE Ger. Stud. Conf. 2014.

11

Bada

M.

,

Eckert

M.

,

Evans

D.

et al. (

2012

)

Concept annotation in the CRAFT corpus

.

BMC Bioinf

.,

13

,

161.

12

Verspoor

K.

,

Cohen

K.B.

,

Lanfranchi

A.

et al. (

2012

)

A corpus of full-text journal articles is a robust evaluation tool for revealing differences in performance of biomedical natural language processing tools

.

BMC Bioinf

.,

13

,

207.

13

Funk

C.

,

Baumgartner

W.

Jr.,

Garcia

B.

et al. (

2014

)

Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters

.

BMC Bioinf

.,

15

,

59

.

14

Liu

H.

,

Christiansen

T.

,

Baumgartner

W.A.

Jr,

Verspoor

K.

(

2012

)

BioLemmatizer: a lemmatization tool for morphological processing of biomedical text

.

J. Biomed. Semantics

,

3

,

3.

15

Chae

J.

,

Jung

Y.

,

Lee

T.

et al. (

2014

)

Identifying non-elliptical entity mentions in a coordinated NP with ellipses

.

J. Biomed. Inf

.,

47

,

139

–

152

.

16

Campos

D.

,

Matos

S.

,

Oliveira

J.

(

2013

)

A modular framework for biomedical concept recognition

.

BMC Bioinf

.,

14

,

281.

http://dx.doi.org/10.1093/bioinformatics/btt317

17

Nunes

T.

,

Campos

D.

,

Matos

S.

,

Oliveira

J.

(

2013

)

BeCAS: biomedical concept recognition services and visualization

.

Bioinformatics

,

29

,

1915

–

1916

.

18

Groza

T.

,

Verspoor

K.

(

2015

)

Assessing the impact of case sensitivity and term information gain on biomedical concept recognition

.

PLOS One

,

10

,

e0119091.

19

Funk

C.S.

,

Cohen

K.B.

,

Hunter

L.E.

,

Verspoor

K.

(

2016

)

Gene Ontology synonym generation rules lead to increased performance in biomedical concept recognition

.

J. Biomed. Semantics

,

7

,

52.

20

Tsai

C.T.

,

Roth

D.

(

2016

)

Concept grounding to multiple knowledge bases via indirect supervision

.

Trans. Assoc. Comput. Linguist

.,

4

,

141

–

154

.

21

Tseytlin

E.

,

Mitchell

K.

,

Legowski

E.

et al. (

2016

)

NOBLE–flexible concept recognition for large-scale biomedical natural language processing

.

BMC Bioinf

.,

17

,

32.

22

Campos

D.

,

Lourenço

J.

,

Matos

S.

,

Oliveira

J.L.

(

2014

)

Egas: a collaborative and interactive document curation platform

.

Database

,

2014

,

bau048.

23

Hsu

Y.Y.

,

Kao

H.Y.

(

2015

)

Curatable named-entity recognition using semantic relations

.

IEEE/ACM Trans. Comput. Biol. Bioinf

.,

12

,

785

–

792

.

24

Collier

N.

,

Tran

M.V.

,

Le

H.Q.

et al. (

2013

)

Learning to recognize phenotype candidates in the auto-immune literature using SVM re-ranking

.

PLOS One

,

8

,

e72965.

25

Song

M.

,

Kim

W.C.

,

Lee

D.

et al. (

2015

)

PKDE4J: entity and relation extraction for public knowledge discovery. J

.

Biomed. Inf

.,

57

,

320

–

332

.

26

Funk

C.S.

,

Kahanda

I.

,

Ben-Hur

A.

,

Verspoor

K.M.

(

2015

)

Evaluating a variety of text-mined features for automatic protein function with GOstruct

.

J. Biomed. Semantics

,

6

,

9.

27

Kim

J.D.

,

Cohen

K.B.

,

Kim

J.J.

(

2015

)

PubAnnotation-query: a search tool for corpora with multi-layers of annotation

.

BMC Proc

.,

9

,

A3.

28

Eshleman

R.M.

,

Yang

H.

,

Levine

B.

(

2015

)

Structuring unstructured clinical narratives in OpenMRS with medical concept extraction

.

IEEE Int. Conf. Bioinf. Biomed

.,

764

–

769

.

29

Gerner

M.

,

Nenadic

G.

,

Bergman

C.M.

(

2010

)

An exploration of mining gene expression mentions and their anatomical locations from biomedical text

.

Proc. 2010 Workshop Biomed. Nat. Lang. Process. ACL

,

2010

,

72

–

80

.

30

Gerner

M.

,

Sarafraz

F.

,

Bergman

C.M.

,

Nenadic

G.

(

2012

)

BioContext: an integrated text mining system for large-scale extraction and contextualization of biomolecular events

.

Bioinformatics

,

28

,

2154

–

2161

.

31

Neves

M.

,

Damaschun

A.

,

Kurtz

A.

,

Leser

U.

(

2012

)

Annotating and evaluating text for stem cell research

. In:

Proceedings of the Third Workshop on Building and Evaluating Resources for Biomedical Text Mining (BioTxtM)

,

Istanbul

,

Turkey

, Vol. 2012.

32

Ohta

T.

,

Pyysalo

S.

,

Tsujii

J.

,

Ananiadou

S.

(

2012

)

Open-domain anatomical entity mention detection

. In:

Proceedings of the Workshop on Detecting Structure in Scholarly Discourse

,

Jeju, Republic of Korea, Association for Computational Linguistics

, pp.

27

–

36

.

http://dx.doi.org/10.1093/bioinformatics/btt580

33

Pyysalo

S.

,

Ohta

T.

,

Miwa

M.

et al. (

2012

)

Event extraction across multiple levels of biological organization

.

Bioinformatics

,

28

,

i575

–

i581

.

34

Pyysalo

S.

,

Ananiadou

S.

(

2014

)

Anatomical entity mention detection at literature scale

.

Bioinformatics

,

30

,

868

–

875

.

35

Skeppstedt

M.

,

Kvist

M.

,

Nilsson

G.

,

Dalianis

H.

(

2014

)

Automatic recognition of disorders, findings, pharmaceuticals and body structures from clinical text: an annotation and machine learning study

.

J. Biomed. Inf

.,

49

,

148

–

158

.

36

Xu

Y.

,

Hua

J.

,

Ni

Z.

et al. (

2014

)

Anatomical entity recognition with a hierarchical framework augmented by external resources

.

PLOS One

,

9

,

e108396.

37

Pyysalo

S.

,

Ohta

T.

,

Rak

R.

et al. (

2015

)

Overview of the cancer genetics and pathway curation tasks of BioNLP shared task 203

.

BMC Bioinf

.,

16

,

S2.

38

Mungall

C.J.

,

Torniai

C.

,

Gkoutos

G.V.

et al. (

2012

)

Uberon, an integrative multi-species anatomy ontology

.

Genome Biol

.,

13

,

R5.

39

Gennari

J.H.

,

Musen

M.A.

,

Fergerson

R.W.

et al. (

2003

)

The evolution of Protégé: an environment for knowledge-based systems development

.

Int. J. Hum.-Comp. Stud

.,

58

,

89

–

123

.

40

Ogren

P.V.

(

2006

)

Knowtator: a Protégé plug-in for annotated corpus construction

. In:

Proceedings of the 2006 Conference on North American Chapter of the Association for Computational Linguistics-Human Language Technologies

,

New York, NY, USA, Association for Computational Linguistics

, pp.

273

–

275

.

41

Bada

M.

,

Eckert

M.

,

Palmer

M.

,

Hunter

L.E.

(

2010

)

An overview of the CRAFT concept annotation guidelines

. In:

Proceedings of the Linguistic Annotation Workshop IV, Association for Computational Linguistics Conference

,

Uppsala, Sweden, Association for Computational Linguistics

, pp.

207

–

211

.

42

The Gene Ontology Consortium

. (

2000

)

Gene ontology: tool for the unification of biology

.

Nat. Genet

.,

25

,

25

–

29

.

PubMed

43

Munn

K.

,

Smith

B.

(eds). (

2008

)

Applied Ontology: An Introduction

.

Ontos Verlag

.

44

Verspoor

C.M.

,

Joslyn

C.

,

Papcun

G.J.

(

2003

)

The gene ontology as a source of lexical semantic knowledge for a biological natural language processing application

. In:

Proceedings of the SIGIR 2003 Workshop on Text Analysis and Search for Bioinformatics

,

Toronto

,

Canada

, pp.

51

–

56

.

45

Beisswanger

E.

,

Poprat

M.

,

Hahn

U.

(

2008

) Lexical properties of OBO ontology class names and synonyms. In: Proceedings of 3rd International Symposium on Semantic Mining in Biomedicine,

Turku, Finland, Turku Centre for Computer Science (TUCS)

.

46

Hripcsak

G.

,

Rothschild

A.S.

(

2005

)

Agreement, the F-measure, and reliability in information retrieval

.

J. Am. Med. Inf. Assoc

.,

12

,

296

–

298

.

47

Kim

J.-D.

,

Ohta

T.

,

Tateisi

Y.

,

Tsujii

J.

(

2003

)

GENIA corpus—a semantically annotated corpus for bio-textmining

.

Bioinformatics

,

19

, 180

–

182

.

48

Campillos

L.

,

Deléger

L.

,

Grouin

C.

et al. (

2017

)

A French clinical corpus with comprehensive semantic annotations: development of the Medical Entity and Relation LIMSI ANNOTATED Text corpus (MERLOT)

.

Lang. Resour. Eval

., pp.

1

–

31

, doi: 10.1007/s10579-017-9382-y.

49

Albright

D.

,

Lanfranchi

A.

,

Fredriksen

A.

et al. (

2013

)

Towards comprehensive syntactic and semantic annotations of the clinical narrative

.

J. Am. Med. Inf. Assoc

.,

20

,

922

–

930

.

50

Bard

J.

,

Rhee

S.Y.

,

Ashburner

M.

(

2005

)

An ontology for cell types

.

Genome Biol

.,

6

,

R21.

51

Bekhuis

T.

(

2006

)

Conceptual biology, hypothesis discovery, and text mining: Swanson’s legacy

.

Biomed. Digital Libr

.,

3

,

2.

52

Rebholz-Schuhmann

D.

,

Oelrich

A.

,

Hoehndorf

R.

(

2012

)

Text-mining solutions for biomedical research: enabling integrative biology

.

Nat. Rev. Genet

.,

13

,

829

–

839

.

53

Mungall

C.J.

,

Bada

M.

,

Berardini

T.Z.

et al. (

2011

)

Cross-product extensions of the gene ontology

.

J. Biomed. Inf

.,

44

,

80

–

86

.

http://dx.doi.org/10.1002/dvg.22878

54

Manda

P.

,

Balhoff

J.P.

,

Lapp

H.

et al. (

2015

)

Using the phenoscape knowledgebase to relate genetic perturbations to phenotypic evolution

.

Genesis

,

53

,

561

–

571

.

55

Mungall

C.J.

,

McMurry

J.A.

,

Köhler

S.

et al. (

2017

)

The Monarch Initiative: an integrative data and analytic platform connecting phenotypes to genotypes across species

.

Nucleic Acids Res

.,

45

,

D712

–

D722

.

56

Komljenovic

A.

,

Roux

J.

,

Robinson-Rechavi

M.

,

Bastian

F.B.

(

2016

)

BgeeDB, an R package for retrieval of curated expression datasets and for gene list expression localization enrichment tests

.

F1000 Res

.,

5

,

2748.

57

Rosse

C.

,

Mejino

J.L.V.

Jr. (

2008

) The foundational model of anatomy ontology. In:

Davidson

B.A.

,

Baldock

R.

(eds).

Anatomy Ontologies for Bioinformatics: Principles and Practice

.

Springer

,

New York

, pp.

59

–

117

.

58

Lipscomb

C.E.

(

2000

)

Medical subject headings (MeSH)

.

Bull. Med. Libr. Assoc

.,

88

,

265

–

266

.

PubMed

59

Cornet

R.

,

de Keizer

N.

(

2008

)

Forty years of SNOMED: a literature review

.

BMC Med. Inf. Decis. Making

,

8

,

S2.

60

Lindberg

D.A.B.

,

Humphreys

B.L.

,

McCray

A.T.

(

1993

)

The unified medical language system

.

Yearb. Med. Inf

.,

1993

,

41

–

51

.

61

Hayamizu

T.F.

,

Mangan

M.

,

Corradi

J.P.

et al. (

2005

)

The Adult Mouse Anatomical Dictionary: a tool for annotating and integrating data

.

Genome Biol

.,

6

,

R29.

62

Yoder

M.J.

,

Mikó

I.

,

Seltmann

K.C.

et al. (

2010

)

A gross anatomy ontology for hymenoptera

.

PLOS One

,

5

,

e15991.

63

Ohta

T.

,

Tateisi

Y.

,

Kim

J.-D.

et al. (

2001

)

Ontology based corpus annotation and tools

.

Genome Inf

.,

12

,

469

–

470

.

64

Tanenblatt

M.

,

Coden

A.

,

Sominsky

I.

(

2010

) The Concept Mapper approach to named entity recognition. In: Proceedings of 7th International Conference on Language Resources and Evaluation (LREC),

Valletta, Malta, European Langauge Resources Assocation

.

65

Campos

D.

,

Matos

S.

,

Oliveira

J.L.

(

2012

) Biomedical named entity recognition: a survey of machine-learning tools. In:

Sakurai

S.

(ed).

Theory and Applications for Advanced Text Mining

. InTech.

Google Preview

66

Bada

M.

(

2014

)

Mapping of biomedical text to concepts of lexicons, terminologies, and ontologies

.

Meth. Mol. Biol

.,

1159

,

33

–

46

.