Transformer-based approach for symptom recognition and multilingual linking

Authors
Vassileva, Sylvia [1]
Grazhdanski, Georgi [1]
Koychev, Ivan [1]
Boytcheva, Svetla [1, 2]
Affiliations
[1] Sofia Univ St Kliment Ohridski, Fac Math & Informat, Blvd James Bourchier 5, Sofia 1164, Bulgaria
[2] Ontotext, Ul Nikola Gabrovski 79, Sofia 1700, Bulgaria
DOI
10.1093/database/baae090
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
This paper presents a transformer-based approach for symptom Named Entity Recognition (NER) in Spanish clinical texts and multilingual entity linking on the SympTEMIST dataset. For Spanish NER, we fine-tune a RoBERTa-based token-level classifier with Bidirectional Long Short-Term Memory (BiLSTM) and conditional random field (CRF) layers on an augmented training set, achieving an F1 score of 0.73. Entity linking is performed via a hybrid approach that combines dictionaries with candidate generation from a knowledge base containing Unified Medical Language System (UMLS) aliases using the cross-lingual SapBERT, followed by reranking of the top candidates using GPT-3.5. The entity linking approach achieves a consistent accuracy of 0.73 across multiple languages on the SympTEMIST multilingual dataset and an accuracy of 0.6123 on the Spanish entity linking task, surpassing the current top score for this subtask.
Database URL: https://github.com/svassileva/symptemist-multilingual-linking
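The linking pipeline described in the abstract (dictionary lookup, then candidate generation over knowledge-base aliases, then reranking of the top candidates) can be illustrated with a minimal sketch. This is not the authors' implementation. The character-trigram `embed` function is a toy stand-in for SapBERT embeddings, the optional `rerank` callable is a hypothetical placeholder for the GPT-3.5 step, and the concept identifiers and alias lists are invented for illustration only.

```python
from collections import Counter
from math import sqrt
from typing import Callable, Dict, List, Optional, Tuple

Candidate = Tuple[str, float]  # (concept id, similarity score)


def embed(text: str) -> Counter:
    """Toy stand-in for a SapBERT embedding: padded character trigram counts."""
    padded = f"  {text.lower().strip()}  "
    return Counter(padded[i:i + 3] for i in range(len(padded) - 2))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def generate_candidates(mention: str, kb: Dict[str, str], k: int = 3) -> List[Candidate]:
    """Return the top-k concepts, scoring each concept by its best-matching alias."""
    query = embed(mention)
    best: Dict[str, float] = {}
    for alias, concept in kb.items():
        score = cosine(query, embed(alias))
        if score > best.get(concept, -1.0):
            best[concept] = score
    # Sort by descending score; break ties by concept id for determinism.
    return sorted(best.items(), key=lambda item: (-item[1], item[0]))[:k]


def link(
    mention: str,
    dictionary: Dict[str, str],
    kb: Dict[str, str],
    rerank: Optional[Callable[[str, List[Candidate]], str]] = None,
    k: int = 3,
) -> Optional[str]:
    """Hybrid linking: exact dictionary hit first, else candidates plus optional rerank."""
    key = mention.lower().strip()
    if key in dictionary:
        return dictionary[key]
    candidates = generate_candidates(mention, kb, k)
    if not candidates:
        return None
    if rerank is not None:
        # Hypothetical hook: in the paper this role is played by GPT-3.5.
        return rerank(mention, candidates)
    return candidates[0][0]


# Invented multilingual aliases and concept ids (assumptions, not real UMLS data).
KB = {
    "fever": "CONCEPT_FEVER",
    "fiebre": "CONCEPT_FEVER",
    "pyrexia": "CONCEPT_FEVER",
    "headache": "CONCEPT_HEADACHE",
    "dolor de cabeza": "CONCEPT_HEADACHE",
    "cough": "CONCEPT_COUGH",
    "tos": "CONCEPT_COUGH",
}
DICTIONARY = {"cefalea": "CONCEPT_HEADACHE"}

if __name__ == "__main__":
    print(link("Cefalea", DICTIONARY, KB))      # resolved by the dictionary
    print(link("fiebre alta", DICTIONARY, KB))  # resolved by nearest alias
```

Scoring each concept by its best alias is one plausible way to let aliases in several languages share a single concept entry. A real system would replace `embed` with dense vectors from the actual model and a nearest-neighbour index.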
Pages: 12