Enriching Contextualized Representations with Biomedical Ontologies: Extending KnowBert to UMLS - CEA - Université Paris-Saclay Accéder directement au contenu
Communication Dans Un Congrès Année : 2022

Enriching Contextualized Representations with Biomedical Ontologies: Extending KnowBert to UMLS

Résumé

Currently, biomedical document processing is mostly human work. Software solutions which attempt to alleviate this burden exist but generally do not perform well enough to be helpful in many applications. Concurrently, there exist projects which organize concepts in the biomedical field. Therefore, we seek to leverage existing structured knowledge resources to improve biomedical language modeling. In this paper, we provide an implementation integrating the UMLS knowledgebase into a BERT-based language model, aiming to improve its performance in biomedical Named Entity Recognition. To achieve this, we extend KnowBert, a recently developed technique for integrating knowledge into language models. Preliminary results reveal the challenges of applying KnowBert to the biomedical domain given the number and subtlety of different concepts in UMLS. Going forward, addressing these challenges and combining this with other approaches such as BioBERT may help expand the range of usefully automatable biomedical language processing tasks.
Fichier principal
Vignette du fichier
KnowBert_UMLS_NoteSpringer.pdf (700.63 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

cea-04563039 , version 1 (29-04-2024)

Identifiants

Citer

Guilhem Piat, Nasredine Semmar, Alexandre Allauzen, Hassane Essafi, Julien Tourille. Enriching Contextualized Representations with Biomedical Ontologies: Extending KnowBert to UMLS. Intelligent Computing Proceedings of the 2022 Computing Conference, Jul 2022, Londres, United Kingdom. pp.760-773, ⟨10.1007/978-3-031-10464-0_52⟩. ⟨cea-04563039⟩
0 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More