A method for automatic detection of acronyms in texts and building a dataset for acronym disambiguation

被引:3
|
作者
Azimi, Sasan [1 ]
Veisi, Hadi [1 ]
Amouie, Reyhaneh [1 ]
机构
[1] Univ Tehran, Tehran, Iran
关键词
Acronym disambiguation; Tech-mining; Text Mining; Natural Language Processing;
D O I
10.1109/icspis48872.2019.9066084
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, there is an increasing tendency for using acronyms in technical texts, which has led to ambiguous acronyms with different possible expansions. Diversity of expansions of a single acronym makes recognizing its expansion a challenging task. Replacing acronyms with incorrect expansions will lead to problems in text mining procedures, namely text normalization, summarization, machine translation, and tech-mining. Tech-mining involves exploring and analyzing technical texts to recognize the relations between technologies. This paper is aimed at proposing a method for building a dataset that meets the requirements for training acronym disambiguation models in technical texts. In this paper, challenges in automatic acronym disambiguation are presented. We have proposed a method for building the dataset and the accuracy of the acronym disambiguation model is 86%.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] eHomeSeniors Dataset: An Infrared Thermal Sensor Dataset for Automatic Fall Detection Research
    Riquelme, Fabian
    Espinoza, Cristina
    Rodenas, Tomas
    Minonzio, Jean-Gabriel
    Taramasco, Carla
    SENSORS, 2019, 19 (20)
  • [22] Automatic Detection of Necrotizing Fasciitis: A Dataset and Early Results
    Das, Anik
    Amin, Sumaiya
    Hughes, James Alexander
    2021 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2021, : 68 - 75
  • [23] UMAFall: A Multisensor Dataset for the Research on Automatic Fall Detection
    Casilari, Eduardo
    Santoyo-Ramon, Jose A.
    Cano-Garcia, Jose M.
    14TH INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS AND PERVASIVE COMPUTING (MOBISPC 2017) / 12TH INTERNATIONAL CONFERENCE ON FUTURE NETWORKS AND COMMUNICATIONS (FNC 2017) / AFFILIATED WORKSHOPS, 2017, 110 : 32 - 39
  • [24] An Automatic Pavement Crack Detection System with FocusCrack Dataset
    Yan, Xinyun
    Shi, Shang
    Xu, Xiaohu
    He, Zhengran
    Zhou, Xiaofeng
    Wang, Chishe
    Lu, Zhiyi
    2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
  • [25] Automatic Detection and Classification of Information Events in Media Texts
    Khoroshilov, Al-dr A.
    Musabaev, R. R.
    Kozlovskaya, Ya. D.
    Nikitin, Yu. A.
    Khoroshilov, A. A.
    AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2020, 54 (04) : 202 - 214
  • [26] Automatic Detection of Stop Words for Texts in the Uzbek Language
    Madatov K.
    Bekchanov S.
    Vičič J.
    Informatica (Slovenia), 2023, 47 (02): : 143 - 150
  • [27] Automatic Detection and Classification of Information Events in Media Texts
    Al-dr A. Khoroshilov
    R. R. Musabaev
    Ya. D. Kozlovskaya
    Yu. A. Nikitin
    A. A. Khoroshilov
    Automatic Documentation and Mathematical Linguistics, 2020, 54 : 202 - 214
  • [28] Automatic Algerian Sarcasm Detection from Texts and Images
    Bousmaha, Kheira Zineb
    Hamadouche, Khaoula
    Djouabi, Hadjer
    Hadrich-Belguith, Lamia
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (07)
  • [29] Automatic Generation of Point Cloud Synthetic Dataset for Historical Building Representation
    Pierdicca, Roberto
    Mameli, Marco
    Malinverni, Eva Savina
    Paolanti, Marina
    Frontoni, Emanuele
    AUGMENTED REALITY, VIRTUAL REALITY, AND COMPUTER GRAPHICS, PT I, 2019, 11613 : 203 - 219
  • [30] Semi-Automatic Dataset Annotation Applied to Automatic Violent Message Detection
    Botella-Gil, Beatriz
    Sepulveda-Torres, Robiert
    Bonet-Jover, Alba
    Martinez-Barco, Patricio
    Saquete, Estela
    IEEE ACCESS, 2024, 12 : 19651 - 19664