Abstract

This paper describes our biomedical relation extraction system, developed for Track 1 (the BioRED Track) of the BioCreative VIII challenge, which focuses on relation extraction from biomedical literature. Our system employs an ensemble learning method, leveraging the PubTator API in conjunction with multiple pretrained bidirectional encoder representations from transformers (BERT) models. Various preprocessing inputs are incorporated, encompassing prompt questions, entity ID pairs, and co-occurrence contexts. To enhance model comprehension, special tokens and boundary tags are added. Specifically, we utilize PubMedBERT alongside the Max Rule ensemble mechanism to combine outputs from diverse classifiers. Our results surpass the official baseline score, providing a strong reference point for evaluating performance on this task. Moreover, our study introduces and demonstrates the effectiveness of a data-centric approach, emphasizing the significance of prioritizing high-quality data instances in enhancing model performance and robustness.

Introduction

Biomedical literature is the foundational source of cutting-edge biomedical research findings. It encompasses a wide range of meticulously documented statistical and biological studies, distilled into simplified relationships between entities. With the exponential growth of biomedical literature, relation extraction (RE) techniques become increasingly indispensable for managing and processing this abundance of data efficiently. RE methods identify entity pairs engaged in relationships and attribute specific relation types to them, thus enabling the seamless conversion of unstructured text into structured knowledge.

BioCreative [1] (Critical Assessment of Information Extraction Systems in Biology) is an initiative that has been assessing the state of the art in biomedical text mining and information extraction for many years. Each year it holds a challenge consisting of a variety of tasks. In this paper, we describe our approach to Track 1, the BioRED [2] Track, which is composed of two subtasks. The first subtask develops methods for extracting relations; the objective is to detect relationships and distinguish those that represent novel findings from those that restate background knowledge or other previously available information. The second subtask requires an end-to-end system that identifies the relationships in free text.

Using ensemble learning and bidirectional encoder representations from transformers (BERT), we investigate which preprocessing steps provide the highest performance for the system.

Related works

In biomedical natural language processing, it is crucial to extract relations between entities. One approach involves defining relation candidates comprising entity pairs and their co-occurrence context, treating it as a text classification task. Traditional methods rely on rule-based or manually crafted features, while deep learning methods, offering automatic feature learning, show better accuracy due to their multilayer structure and ability to handle noise and uncertainty. However, deep learning’s performance suffers from limited supervised data, particularly in medical texts with sensitive information.

Pretrained language models (PLMs) like GPT [3] and BERT [4] excel in NLP tasks by leveraging large-scale unlabeled data during pretraining and fine-tuning with labeled data. Yet, general PLMs may falter in specialized domains like medicine. Domain-specific PLMs like PubMedBERT [5], pretrained on relevant corpora, outperform general ones in medical applications.

Fine-tuning PLMs for specific tasks faces challenges due to discrepancies between pretraining objectives and fine-tuning setups, often leading to negative transfer and overfitting. Moreover, the need for labeled data limits performance in the presence of sample imbalance. Prompt tuning addresses these issues by aligning downstream tasks with pretraining objectives, improving efficiency and interpretability. It transforms tasks into cloze tasks using templates with [MASK] tokens; the PLM predicts the masked word, which a verbalizer [6] function maps to a specific category. Typically, templates feature one [MASK] token and are phrased as natural English. Ullah et al. [7] show that prompt-based fine-tuning improves the accuracy of text classification over standard fine-tuning approaches.
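For illustration, the following minimal sketch shows this cloze formulation: a template with a [MASK] token is filled with an entity pair, the PLM predicts the masked word, and a verbalizer maps that word to a class label. The template, verbalizer entries, and entity names here are our own illustrative choices, not those of the cited works.

```python
# Minimal sketch of prompt tuning as a cloze task; template, verbalizer
# entries, and entity names are illustrative only.
template = "What is [MASK] between {e1} and {e2} ?"

# Verbalizer: maps the word predicted at [MASK] to a class label.
verbalizer = {
    "association": "Association",
    "nothing": "None",
}

def build_prompt(e1: str, e2: str) -> str:
    """Fill the cloze template for one candidate entity pair."""
    return template.format(e1=e1, e2=e2)

prompt = build_prompt("entity1", "entity2")
# In practice the PLM predicts the masked word; a stand-in is used here.
predicted_word = "association"
label = verbalizer.get(predicted_word, "None")
print(prompt, "->", label)
```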

Several strategies have been explored to address labeled data imbalances in text classification. These include various data augmentation techniques as discussed by Azam et al. [8]. These approaches leverage pretrained language models, fine-tuning them on smaller datasets to boost their effectiveness in specific tasks. In the biomedical field, Lai et al. [9] have successfully employed a data-centric approach, demonstrating that merging datasets can enhance performance across five distinct relation extraction tasks, yielding promising outcomes.

Ensemble learning methods show promise in enhancing text classification performance. Ensemble learning is a meta-learning method that combines predictions from multiple models to achieve better predictive performance. In natural language processing (NLP), class imbalance problems are apparent in classification tasks such as sentiment classification, spam detection, and fake news detection, and ensemble learning methods have been shown to be effective in addressing these challenges. Additionally, pretrained models such as BERT [4] and GPT [3] provide robust word embeddings and have been used for text completion and for dialogue and chat models in NLP tasks. Furthermore, combining ensemble learning with data augmentation can effectively augment training data to address class imbalance in NLP problems. Khan et al. [10] indicate that ensemble learning combined with appropriate data augmentation techniques can significantly enhance performance on imbalanced datasets.

Material and methods

Data and tools

Dataset: the organizers provided the BioRED [2] corpus, a collection of 600 PubMed articles with manual annotations of biomedical concepts and binary relationships by domain experts. Of these, 500 articles are used for training and 100 for validation; a separate set of 100 abstracts is used for leaderboard evaluation.

PubTator Central API: A web service provided by the National Center for Biotechnology Information (NCBI). PubTator is a biomedical literature annotation tool that facilitates the extraction of key information from scientific articles. It automatically annotates documents with various biomedical entities, such as genes, diseases, chemicals, and more.

Data-centric approach

We use a data-centric approach to improve the RE task by adding eight datasets collected in BioREx by Lai et al. [9] alongside BioRED [2]: AIMed [11], DrugProt [12], DDI [13], HPRD50 [14], BC5CDR [15], EMU [16], PharmGKB [17], and DisGeNet [18].

Preprocessing

We improve model performance by using various preprocessing inputs, including prompt questions, entity ID pairs, and co-occurrence contexts. As shown in Table 1, special tokens and boundary tags are added to enhance model understanding.

Table 1.

The different preprocessing sentence pair inputs

| Sentence pairs | Sentence 1 (relation type task) | Sentence 1 (novelty task) | Sentence 2 (relation type & novelty tasks) |
| --- | --- | --- | --- |
| Baseline (Tag, Entity) | – | – | Association between promoter −1607 polymorphism of <entity_type1> entity1 </entity_type1> and <entity_type2> entity2 </entity_type2> in Southern Chinese … |
| Pairs1 (Tag, Entity) | <entity_type1> entity1 </entity_type1> [MASK] <entity_type2> entity2 </entity_type2> ? | <entity_type1> entity1 </entity_type1> [MASK] <entity_type2> entity2 </entity_type2> ? | Association between promoter −1607 polymorphism of <entity_type1> entity1 </entity_type1> and <entity_type2> entity2 </entity_type2> in Southern Chinese … |
| Pairs2 (Tag, Entity) | What is [MASK] between <entity_type1> entity1 </entity_type1> and <entity_type2> entity2 </entity_type2> ? | What is [MASK] between <entity_type1> entity1 </entity_type1> and <entity_type2> entity2 </entity_type2> ? | Association between promoter −1607 polymorphism of <entity_type1> entity1 </entity_type1> and <entity_type2> entity2 </entity_type2> in Southern Chinese … |
| Pairs3 (Tag, Entity ID) | <entity_type1> entity_id1 </entity_type1> and <entity_type2> entity_id2 </entity_type2> | <entity_type1> entity_id1 </entity_type1> and <entity_type2> entity_id2 </entity_type2> | Association between promoter −1607 polymorphism of <entity_type1> entity_id1 </entity_type1> and <entity_type2> entity_id2 </entity_type2> in Southern Chinese … |
| Pairs4 (Tag, Entity ID) | What is the relation type between <entity_type1> entity_id1 </entity_type1> and <entity_type2> entity_id2 </entity_type2> ? | What is the novelty type between <entity_type1> entity_id1 </entity_type1> and <entity_type2> entity_id2 </entity_type2> ? | Association between promoter −1607 polymorphism of <entity_type1> entity_id1 </entity_type1> and <entity_type2> entity_id2 </entity_type2> in Southern Chinese … |

Table 1 shows the preprocessing pipelines that improve model performance. For each instance, we provide two sentences: the first (sentence1) is an entity ID pair or a generated prompt question that provides semantic similarity for its corresponding pair; the second (sentence2) is the co-occurrence context, in which each entity is replaced by its corresponding entity ID and placed between two boundary tags (e.g. <GeneOrGeneProduct> and </GeneOrGeneProduct> for genes). These tags are kept as single tokens by adding them to the pretrained language model’s vocabulary. Moreover, we include the special tokens “[CLS]” and “[SEP]” at the beginning of the instance and between sentence1 and sentence2, respectively, following the standard practice for classification with pretrained BERT models.
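As a concrete illustration, the sketch below assembles one such sentence pair with the Hugging Face tokenizer. The checkpoint name and the entity IDs are assumptions for the example; the boundary tags, prompt style, and [CLS]/[SEP] layout follow Table 1.

```python
from transformers import AutoTokenizer

# Assumed PubMedBERT checkpoint name; entity IDs below are illustrative.
MODEL = "microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract-fulltext"
tokenizer = AutoTokenizer.from_pretrained(MODEL)

# Keep boundary tags as single tokens by adding them to the vocabulary.
tokenizer.add_tokens([
    "<GeneOrGeneProduct>", "</GeneOrGeneProduct>",
    "<DiseaseOrPhenotypicFeature>", "</DiseaseOrPhenotypicFeature>",
])

# Sentence 1: a pairs4-style prompt question over the entity ID pair.
sentence1 = ("What is the relation type between <GeneOrGeneProduct> 4318 "
             "</GeneOrGeneProduct> and <DiseaseOrPhenotypicFeature> D001249 "
             "</DiseaseOrPhenotypicFeature> ?")

# Sentence 2: the co-occurrence context, entities replaced by their IDs
# and wrapped in boundary tags.
sentence2 = ("Association between promoter -1607 polymorphism of "
             "<GeneOrGeneProduct> 4318 </GeneOrGeneProduct> and "
             "<DiseaseOrPhenotypicFeature> D001249 </DiseaseOrPhenotypicFeature> "
             "in Southern Chinese ...")

# The tokenizer prepends [CLS] and places [SEP] between the two sentences.
encoded = tokenizer(sentence1, sentence2, truncation=True, max_length=512)
print(tokenizer.convert_ids_to_tokens(encoded["input_ids"])[:10])
```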

This study uses the state-of-the-art pretrained model PubMedBERT [5], which specializes in the medical domain, and fine-tunes it on the BioRED [2] dataset to form a text classification model. In fine-tuning, we attend to two aspects: the relation type and the novelty of the relation. Each is treated as a separate task with its own classifier. The two classifiers are trained independently and provide confidence scores for each class, which measure the model’s confidence in the predicted class for each instance.
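A minimal sketch of this two-classifier setup, assuming the Hugging Face interface; the checkpoint name and label counts (the eight BioRED relation types plus “None”, and the three novelty labels) are assumptions for the example.

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

MODEL = "microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract-fulltext"
tokenizer = AutoTokenizer.from_pretrained(MODEL)
tokenizer.add_tokens(["<GeneOrGeneProduct>", "</GeneOrGeneProduct>"])  # etc.

# One independently fine-tuned classification head per task.
relation_model = AutoModelForSequenceClassification.from_pretrained(
    MODEL, num_labels=9)   # eight BioRED relation types + "None" (assumed)
novelty_model = AutoModelForSequenceClassification.from_pretrained(
    MODEL, num_labels=3)   # "Novel", "No", "None"
for model in (relation_model, novelty_model):
    model.resize_token_embeddings(len(tokenizer))  # account for added tags
    model.eval()

batch = tokenizer("What is the relation type between ... ?",
                  "Association between promoter -1607 polymorphism of ...",
                  return_tensors="pt", truncation=True)

# Softmax over the logits yields the per-class confidence scores that
# the ensemble step aggregates later.
with torch.no_grad():
    relation_conf = torch.softmax(relation_model(**batch).logits, dim=-1)
    novelty_conf = torch.softmax(novelty_model(**batch).logits, dim=-1)
```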

Deep learning model

The system used in subtask 1 is based on the open-source BioRED [2] RE system implementation. Each relation candidate instance consists of two biomedical entities and their co-occurrence context. However, entity spans may cover multiple mentions, and relations involving more than two entity IDs must be expanded into additional instances. The objective of this task is to classify the instances into predefined relation types or mark them as unrelated (i.e. “None”). The work also distinguishes novel findings from existing information using the labels “Novel,” “No” (not a novel finding), or “None” (negative example). The model’s architecture is shown in Fig. 1.

Figure 1. Model architecture.

For subtask 2’s end-to-end system, we leveraged PubTator’s API [19] to retrieve biomedical concepts and entity IDs during preprocessing. These concepts and IDs were then standardized to match the dataset formats. To standardize the output provided by the PubTator API, certain terms were mapped to more general categories: “ProteinMutation,” “DNAMutation,” and “SNP” were transformed into “SequenceVariant”; “Chemical” became “ChemicalEntity”; “Disease” became “DiseaseOrPhenotypicFeature”; “Gene” became “GeneOrGeneProduct”; and “Species” became “OrganismTaxon.” Entity IDs were also transformed, including removing prefixes like “MESH:” and “tmVar:” and removing hyphens. We also generalized certain notations, such as replacing “CVCL:” with “CVCL_” and “RS#” with “RS.” We then made predictions on the processed output with the pretrained models from subtask 1, and performance was further improved by applying an ensemble learning method. The workflow is illustrated in Fig. 2.

Figure 2. Workflow of our approach.
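The sketch below illustrates this normalization step; the mapping table follows the rules just described, while the function name and the regular expression are our own illustration.

```python
import re

# Mapping from PubTator entity types to BioRED categories, following the
# normalization rules described above.
TYPE_MAP = {
    "ProteinMutation": "SequenceVariant",
    "DNAMutation": "SequenceVariant",
    "SNP": "SequenceVariant",
    "Chemical": "ChemicalEntity",
    "Disease": "DiseaseOrPhenotypicFeature",
    "Gene": "GeneOrGeneProduct",
    "Species": "OrganismTaxon",
}

def normalize_entity(entity_type: str, entity_id: str) -> tuple[str, str]:
    """Map one PubTator annotation to the BioRED type and ID conventions."""
    entity_type = TYPE_MAP.get(entity_type, entity_type)
    # Strip prefixes such as "MESH:" and "tmVar:" and remove hyphens.
    entity_id = re.sub(r"^(MESH:|tmVar:)", "", entity_id).replace("-", "")
    # Generalize notations, e.g. "CVCL:" -> "CVCL_" and "RS#" -> "RS".
    entity_id = entity_id.replace("CVCL:", "CVCL_").replace("RS#", "RS")
    return entity_type, entity_id

print(normalize_entity("Disease", "MESH:D001249"))
# -> ('DiseaseOrPhenotypicFeature', 'D001249')
```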

Ensemble model

The Max Rule ensemble method enhances system reliability and quality. It aggregates confidence scores from multiple models, each trained with a different preprocessing input, and selects the class with the highest confidence score as the final output, improving the robustness and accuracy of the classification system. However, there are situations where a relation is found between two entities but the novelty is categorized as “None.” In this scenario, we replace the novelty type with “Novel,” indicating the presence of a new finding. From these predictions, we generate the submission file.
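A minimal sketch of the Max Rule and the novelty post-processing, with made-up confidence scores and an abbreviated label set:

```python
import numpy as np

def max_rule(scores: np.ndarray, labels: list[str]) -> str:
    """Max Rule: take, for each class, the highest confidence across all
    models, then return the class with the overall maximum."""
    return labels[scores.max(axis=0).argmax()]

# Confidence scores from classifiers trained on different preprocessing
# inputs, shape (n_models, n_classes); values here are made up.
relation_scores = np.array([[0.10, 0.70, 0.20],
                            [0.05, 0.60, 0.35]])
novelty_scores = np.array([[0.55, 0.25, 0.20],
                           [0.40, 0.35, 0.25]])

relation = max_rule(relation_scores,
                    ["None", "Association", "Positive_Correlation"])
novelty = max_rule(novelty_scores, ["None", "No", "Novel"])

# Post-processing described above: a relation was predicted but novelty
# came back "None", so report the pair as a novel finding.
if relation != "None" and novelty == "None":
    novelty = "Novel"
print(relation, novelty)  # Association Novel
```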

Ablation studies and discussion

Assessment scores are computed on the BioCreative VIII BioRED Track CodaLab platform. Subtask 1 encompasses three benchmark schemas: (i) entity pair: extracting the concept identifiers within the relation, (ii) entity pair + relation type: identifying the specific relation type for the extracted pairs, and (iii) entity pair + relation type + novelty: annotating the novelty for the extracted pairs. Subtask 2 introduces two additional benchmark schemas: (i) biological concepts (NER): recognizing the biomedical named entities, and (ii) ID: extracting the entity IDs.

In our investigation, we systematically explored the influence of various model input formats on the performance of our text classification system. Initially, we introduced prompt sentences by integrating boundary tags and the token “[MASK]” between entity pairs (designated as pairs1). However, this modification did not yield statistically significant changes in performance. Subsequently, we represented the prompt directly as a question (pairs2), which resulted in notable performance gains of 2%, 3%, and 2% over the baseline on the entity pair, relation type, and novelty benchmarks, respectively.

We further normalized entity IDs by substituting entity names with unique identifiers within text instances (pairs3 and pairs4). While this adjustment did not significantly change the relation type and novelty metrics, pairs4 achieved a 2% increase over the baseline on the entity pair benchmark.

Table 2 illustrates the significant impact of diverse information input strategies on our model’s performance across various metrics. In pursuit of model optimization, we meticulously curated sets of sentences demonstrating robust performance in both relationship identification and novelty detection. Subsequent application of ensemble learning techniques to these select sets led to a substantial improvement in our model’s efficacy. This approach emerged as a pivotal factor in achieving enhanced results, underscoring the importance of leveraging collaborative methodologies in text analysis endeavors.

Table 2.

Performance on the test dataset of subtask 1

| Inputs | Entity pair (P / R / F) | + Relation type (P / R / F) | + Novelty (P / R / F) |
| --- | --- | --- | --- |
| Baseline | 0.7107 / 0.7686 / 0.7385 | 0.5012 / 0.5419 / 0.5207 | 0.5325 / 0.5758 / 0.5533 |
| Pairs1 | 0.7379 / 0.7412 / 0.7395 | 0.5223 / 0.5247 / 0.5235 | 0.5544 / 0.5569 / 0.5556 |
| Pairs2 | 0.7327 / 0.7697 / 0.7508 | 0.5246 / 0.5511 / 0.5375 | 0.5502 / 0.5780 / 0.5637 |
| Pairs3 | 0.7419 / 0.7259 / 0.7338 | 0.5445 / 0.5328 / 0.5386 | 0.5624 / 0.5502 / 0.5562 |
| Pairs4 | 0.7423 / 0.7619 / 0.7520 | 0.5257 / 0.5396 / 0.5326 | 0.5513 / 0.5658 / 0.5585 |
| + Ensemble | 0.7754 / 0.7488 / 0.7619 | 0.5769 / 0.5570 / 0.5668 | 0.6073 / 0.5864 / 0.5967 |
| + Data-centric | **0.8002** / 0.7579 / **0.7785** | **0.6120** / **0.5796** / **0.5953** | **0.6219** / **0.5891** / **0.6050** |

P is precision, R is recall, and F is the F1 score. Bold text represents the highest score for the main task.


Furthermore, our observations highlight the efficacy of a data-centric approach in improving overall model performance. By prioritizing the acquisition of high-quality data and employing collaborative techniques, we can effectively navigate the complexities inherent in text classification tasks, resulting in increased accuracy and efficacy.

Top performing systems

Official results [20, 21] show that PubMedBERT alone achieves an F1 score of 0.7429 for Entity Pair identification, 0.5193 for Entity Pair + Relation Type classification, and 0.5625 for Entity Pair + Novelty Type classification. BioREx alone performs even better on Entity Pair identification (0.7568) and Entity Pair + Relation Type classification (0.5689). In contrast, GPT-3.5 and GPT-4 lag significantly behind these models.

Top-performing teams in the competition employed diverse strategies. Team 129, for instance, developed a multitask system using PubMedBERT for context encoding and incorporating coreference resolution, entity pair identification, novelty identification, and relation extraction tasks. Their approach also leveraged adversarial training and model ensembles to boost performance and robustness, achieving F1 scores of 0.7559, 0.5667, and 0.5920, respectively, for the three tasks. This highlights the limitations of relying solely on a single preprocessing method, as our own system also benefited from an ensemble approach similar to Team 129.

Team 114 took a different approach, designing a unified model with BioLinkBERT to jointly classify relations and detect novelty. They employed techniques like surrounding entities with special tokens, utilizing a multihead attention layer, combining loss functions, negative sampling, and ensembling. Their best model achieved F1 scores of 0.7427, 0.5476, and 0.5854. While generally outperforming our single preprocessing method, their results fell short of our ensemble method. However, given their model’s strength, incorporating an ensemble technique could potentially further improve their performance.

Interestingly, Team 118 was the only team utilizing BioREx. However, their decision to augment training data with outputs from large language models like ChatGPT, Claude, and BingChat may have introduced noise, hindering BioREx’s performance (F1 scores of 0.7346, 0.5531, and 0.5645). Conversely, our new data-centric system with BioREx outperforms even our ensemble approach with PubMedBERT.

Conclusion

Performance was significantly affected by different prompts and text representations: entity pair F1 improved from 73.85% with the naïve input to 75.20% with prompts and alternative text representations, and novelty improved by 2%. Even with simple prompts and text representations, the diverse configurations maintained sufficient diversity for ensemble techniques to be applied effectively, yielding a notable improvement of ∼8% in novelty evaluation performance. Moreover, the model demonstrated even greater efficacy with the data-centric approach: performance metrics improved by an additional 1%, further underlining the value of this methodology in enhancing overall model performance.

Limitation and future work

To predict novelty, we rely on the relation-type prediction. If the relation type is “None” (no relation, a negative example), the novelty prediction is ignored, which can produce false negatives. Conversely, when the novelty prediction is “None” but the predicted relation type is not “None,” we classify the novelty as “Novel” (a novel finding), which also affects performance. In future research, we intend to apply other techniques, such as a hierarchical Bayesian approach, to address the false negative issue.

Conflict of interest:

None declared.

Funding

This research was supported by the Ministry of Science and Technology, Taiwan (112–2221-E-008-062-MY3).

Data availability

The BioRED track dataset is available at https://ftp.ncbi.nlm.nih.gov/pub/lu/BC8-BioRED-track/. The BioRED evaluation leaderboard is available at https://codalab.lisn.upsaclay.fr/competitions/16381.

References

1. Arighi C. BioCreative - BioCreative VIII Challenge and Workshop. n.d. https://biocreative.bioinformatics.udel.edu/events/biocreative-viii/biocreative-viii-challenge/ (29 September 2023, date last accessed).

2. Luo L, Lai P-T, Wei C-H et al. BioRED: a rich biomedical relation extraction dataset. Brief Bioinform 2022;23:bbac282.

3. Radford A, Narasimhan K, Salimans T et al. Improving language understanding by generative pre-training. OpenAI, 2018.

4. Devlin J, Chang M, Lee K et al. BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), 2019, 4171-86.

5. Gu Y, Tinn R, Cheng H et al. Domain-specific language model pretraining for biomedical natural language processing. ACM Trans Comput Healthcare 2022;3:1-23.

6. Schick T, Schütze H. Exploiting cloze-questions for few-shot text classification and natural language inference. 2021.

7. Ullah F, Azam U, Faheem A et al. Comparing prompt-based and standard fine-tuning for Urdu text classification. 2023.

8. Azam U, Rizwan H, Karim A. Exploring data augmentation strategies for hate speech detection in Roman Urdu. In: Proceedings of the Thirteenth Language Resources and Evaluation Conference, pp. 4523-31. Marseille, France: European Language Resources Association, 2022.

9. Lai P-T, Wei C-H, Luo L et al. BioREx: improving biomedical relation extraction by leveraging heterogeneous datasets. J Biomed Inform 2023;146:104487.

10. Khan AA, Chaudhari O, Chandra R. A review of ensemble learning and data augmentation models for class imbalanced problems: combination, implementation and evaluation. Expert Syst Appl 2023;244:122778.

11. Bunescu R, Ge R, Kate RJ et al. Comparative experiments on learning information extractors for proteins and their interactions. Artif Intell Med 2005;33:139-55.

12. Miranda A, Mehryary F, Luoma J et al. Overview of DrugProt BioCreative VII track: quality evaluation and large scale text mining of drug-gene/protein relations. In: Proceedings of the Seventh BioCreative Challenge Evaluation Workshop, pp. 11-21. 2021.

13. Herrero-Zazo M, Segura-Bedmar I, Martínez P et al. The DDI corpus: an annotated corpus with pharmacological substances and drug–drug interactions. J Biomed Inform 2013;46:914-20.

14. Fundel K, Küffner R, Zimmer R. RelEx—relation extraction using dependency parse trees. Bioinformatics 2007;23:365-71.

15. Wei C-H, Peng Y, Leaman R et al. Assessing the state of the art in biomedical relation extraction: overview of the BioCreative V chemical-disease relation (CDR) task. Database 2016;2016:baw032.

16. Doughty E, Kertesz-Farkas A, Bodenreider O et al. Toward an automatic method for extracting cancer- and other disease-related point mutations from the biomedical literature. Bioinformatics 2011;27:408-15.

17. Thorn CF, Klein TE, Altman RB. PharmGKB: the pharmacogenomics knowledge base. Pharmacogenomics: Methods and Protocols 2013;1015:311-20.

18. Piñero J, Ramírez-Anguita JM, Saüch-Pitarch J et al. The DisGeNET knowledge platform for disease genomics: 2019 update. Nucleic Acids Res 2020;48:D845-D855.

19. U.S. National Library of Medicine. PubTator Central API - NCBI - NLM - NIH. National Center for Biotechnology Information, n.d. https://www.ncbi.nlm.nih.gov/research/pubtator/api.html (29 September 2023, date last accessed).

20. Islamaj R, Arighi C, Campbell I et al. Proceedings of the BioCreative VIII challenge and workshop: curation and evaluation in the era of generative models. In: AMIA 2023 Annual Symposium. New Orleans, USA: Zenodo, 2023. doi: 10.5281/zenodo.10103191.

21. Islamaj R, Lai P-T, Wei C-H et al. The overview of the BioRED (Biomedical Relation Extraction Dataset) track at BioCreative VIII. Database 2024;2024:baae069.