Classification of Arabic healthcare questions based on word embeddings learned from massive consultations: a deep learning approach

Authors
Hossam Faris
Maria Habib
Mohammad Faris
Alaa Alomari
Pedro A. Castillo
Manal Alomari
Affiliations
[1] The University of Jordan, King Abdullah II School for Information Technology
[2] Altibbi
[3] ETSIIT-CITIC
[4] University of Granada
Keywords
Altibbi; BiLSTM; Deep learning; Long short-term memory; LSTM; Medical question classification; Word2Vec
Abstract
Automated question classification is a fundamental component of automated question-answering systems, which play a critical role in promoting medical and healthcare services. Developing an automated question classification system depends heavily on natural language processing and data mining techniques. Question classification methods based on classical machine learning techniques are limited in capturing the hidden relationships among features, as well as in handling complex languages and very large-scale datasets. Therefore, this paper proposes a deep learning approach for question classification, since deep learning methods can extract implicit, hidden relationships and automatically generate dense representations of features. The proposed question classification model is based on unidirectional and bidirectional long short-term memory networks (LSTM and BiLSTM), which were developed primarily to handle the Arabic language in the field of healthcare. The features are represented using a domain-specific word embedding model (Word2Vec) that is trained on around 1.5 million medical consultations from Altibbi. Altibbi is a telemedicine company that serves as the case study and as the source for collecting and curating the data. The proposed deep learning approach is a multi-class classification algorithm that automatically maps questions into 15 categories of medical specialities. The proposed model is evaluated using several evaluation metrics, including accuracy, precision, recall, and F1-score. Notably, the proposed model achieved a classification accuracy of 87.2%.
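The abstract names accuracy, precision, recall, and F1-score as the evaluation metrics for the 15-class task. As a minimal sketch of how such metrics are commonly computed for a multi-class classifier, the snippet below uses macro averaging. The averaging scheme, the helper name `multiclass_metrics`, and the three speciality labels are assumptions made for illustration; the abstract does not state them, and this is not the authors' evaluation code.

```python
from collections import Counter


def multiclass_metrics(y_true, y_pred):
    """Return accuracy and macro-averaged precision, recall, and F1.

    Hypothetical helper: macro averaging is an assumption, since the
    abstract does not say how per-class scores were aggregated.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")

    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class gains a false positive
            fn[truth] += 1   # true class gains a false negative

    def safe_div(num, den):
        # Score a class as 0.0 when its denominator is empty.
        return num / den if den else 0.0

    precisions, recalls, f1s = [], [], []
    for label in labels:
        precision = safe_div(tp[label], tp[label] + fp[label])
        recall = safe_div(tp[label], tp[label] + fn[label])
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(safe_div(2 * precision * recall, precision + recall))

    n = len(labels)
    return {
        "accuracy": sum(tp.values()) / len(y_true),
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


# Toy data with made-up speciality labels (not taken from the paper).
y_true = ["cardiology", "cardiology", "dermatology",
          "dermatology", "pediatrics", "pediatrics"]
y_pred = ["cardiology", "dermatology", "dermatology",
          "dermatology", "pediatrics", "cardiology"]

metrics = multiclass_metrics(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Macro averaging weights every speciality equally, so a rare category affects the score as much as a common one; a weighted or micro average would be an equally plausible reading of the reported figures.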
Pages: 1811 - 1827
Page count: 16
Related papers
50 in total
  • [1] Classification of Arabic healthcare questions based on word embeddings learned from massive consultations: a deep learning approach
    Faris, Hossam
    Habib, Maria
    Faris, Mohammad
    Alomari, Alaa
    Castillo, Pedro A.
    Alomari, Manal
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 13 (04) : 1811 - 1827
  • [2] Arabic Sentiment Analysis Based on Word Embeddings and Deep Learning
    Elhassan, Nasrin
    Varone, Giuseppe
    Ahmed, Rami
    Gogate, Mandar
    Dashtipour, Kia
    Almoamari, Hani
    El-Affendi, Mohammed A.
    Al-Tamimi, Bassam Naji
    Albalwy, Faisal
    Hussain, Amir
    COMPUTERS, 2023, 12 (06)
  • [3] Arabic Text Classification Based on Word and Document Embeddings
    El Mahdaouy, Abdelkader
    Gaussier, Eric
    El Alaoui, Said Ouatik
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 32 - 41
  • [4] Genre Classification using Word Embeddings and Deep Learning
    Kumar, Akshi
    Rajpal, Arjun
    Rathore, Dushyant
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 2142 - 2146
  • [5] Arabic Quran Verses Authentication Using Deep Learning and Word Embeddings
    Touati-Hamad, Zineb
    Laouar, Mohamed Ridda
    Bendib, Issam
    Hakak, Saqib
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2022, 19 (04) : 681 - 688
  • [6] A survey of word embeddings based on deep learning
    Wang, Shirui
    Zhou, Wenan
    Jiang, Chao
    COMPUTING, 2020, 102 (03) : 717 - 740
  • [8] A Deep Learning Approach for Arabic Manuscripts Classification
    Al-homed, Lutfieh S.
    Jambi, Kamal M.
    Al-Barhamtoshy, Hassanin M.
    SENSORS, 2023, 23 (19)
  • [9] A Deep Learning Approach for Arabic Text Classification
    Sundus, Katrina
    Al-Haj, Fatima
    Hammo, Bassam
    2019 2ND INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2019, : 258 - 264
  • [10] Contextual Semantic Embeddings Based on Transformer Models for Arabic Biomedical Questions Classification
    Talghalit, Ismail Ait
    Alami, Hamza
    El Alaoui, Said Ouatik
    HighTech and Innovation Journal, 2024, 5 (04) : 1024 - 1037