Classification of Arabic healthcare questions based on word embeddings learned from massive consultations: a deep learning approach

被引:0
|
作者
Hossam Faris
Maria Habib
Mohammad Faris
Alaa Alomari
Pedro A. Castillo
Manal Alomari
机构
[1] The University of Jordan,King Abdullah II School for Information Technology
[2] Altibbi,undefined
[3] ETSIIT-CITIC,undefined
[4] University of Granada,undefined
关键词
Altibbi; BiLSTM; Deep learning; Long short-term memory; LSTM; Medical question classification; Word2Vec;
D O I
暂无
中图分类号
学科分类号
摘要
Automated question classification is a fundamental component of automated question-answering systems, which plays a critical role in promoting medical and healthcare services. Developing an automated question classification system depends heavily on natural language processing and data mining techniques. Question classification methods based on classical machine learning techniques face limitations in capturing the hidden relationships of features, as well as, handling complex languages and very large-scale datasets. Therefore, this paper proposes a deep learning approach for question classification, since deep learning methods have the powerful capability to extract implicit, hidden relationships and automatically generate dense representations of features. The proposed question classification model depends on unidirectional and bidirectional long short-term memory networks (LSTM and BiLSTM), which essentially developed to handle the Arabic language in the field of healthcare. The features are represented and created using a domain-specific word embedding model (Word2Vec) that is constructed by training around 1.5 million medical consultations from Altibbi company. Altibbi is a telemedicine company that is used as a case study and a source for curating and collecting the data. The proposed deep learning approach is a multi-class classification algorithm that automatically labels and maps the questions into 15 categories of medical specialities. The proposed deep learning model is evaluated using several evaluation metrics, including accuracy, precision, recall, and F1-score. Markedly, the proposed model achieved a superb classification capacity in terms of classification accuracy rate, which gained 87.2%.
引用
收藏
页码:1811 / 1827
页数:16
相关论文
共 50 条
  • [31] Multi-Label Arabic Text Classification Based On Deep Learning
    Alsukhni, Batool
    2021 12TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2021, : 475 - 477
  • [32] Learning from Few Samples: Lexical Substitution with Word Embeddings for Short Text Classification
    Elekes, Abel
    Di Stefano, Antonino Simone
    Schaeler, Martin
    Boehm, Klemens
    Keller, Matthias
    2019 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL 2019), 2019, : 111 - 119
  • [33] Classification of tastants: A deep learning based approach
    Dutta, Prantar
    Jain, Deepak
    Gupta, Rakesh
    Rai, Beena
    MOLECULAR INFORMATICS, 2023, 42 (12)
  • [34] Zero-Shot Learning Based Approach For Medieval Word Recognition Using Deep-Learned Features
    Chanda, Sukalpa
    Baas, Jochem
    Haitink, Daniel
    Hamel, Sebastien
    Stutzmann, Dominique
    Schomaker, Lambert
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 345 - 350
  • [35] Deep Learning based Classification for Healthcare Data Analysis System
    Irfan, Muhammad
    Hameed, Ibrahim A.
    PROCEEDINGS OF 4TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC ADVANCE IN BEHAVIORAL, ECONOMIC, SOCIOCULTURAL COMPUTING (BESC), 2017,
  • [36] Impact of Stemming and Word Embedding on Deep Learning-Based Arabic Text Categorization
    Almuzaini, Huda Abdulrahman
    Azmi, Aqil M.
    IEEE ACCESS, 2020, 8 : 127913 - 127928
  • [37] A New Approach using Deep Learning and Reinforcement Learning in HealthCare: Skin Cancer Classification
    Yousra, Dahdouh
    Abdelhakim, Anouar Boudhir
    Mohamed, Ben Ahmed
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (05) : 557 - 564
  • [38] Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network
    Asroni, Asroni
    Ku-Mahamud, Ku Ruhana
    Damarjati, Cahya
    Slamat, Hasan Basri
    BAGHDAD SCIENCE JOURNAL, 2021, 18 (02) : 925 - 936
  • [39] Hybrid deep learning model for Arabic text classification based on mutual information
    Abdulghani, Farah A.
    Abdullah, Nada A. Z.
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2022, 43 (08): : 1901 - 1908
  • [40] An optimized hybrid deep learning model based on word embeddings and statistical features for extractive summarization
    Wazery, Yaser M.
    Saleh, Marwa E.
    Ali, Abdelmgeid A.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (07)