Classification of Arabic healthcare questions based on word embeddings learned from massive consultations: a deep learning approach

Cited by: 0

Authors:

Hossam Faris

Maria Habib

Mohammad Faris

Alaa Alomari

Pedro A. Castillo

Manal Alomari

Affiliations:

[1] The University of Jordan, King Abdullah II School for Information Technology

[2] Altibbi

[3] ETSIIT-CITIC

[4] University of Granada

Source:

Journal of Ambient Intelligence and Humanized Computing | 2022 / Vol. 13

Keywords:

Altibbi; BiLSTM; Deep learning; Long short-term memory; LSTM; Medical question classification; Word2Vec

DOI:

Not available

Abstract:

Automated question classification is a fundamental component of automated question-answering systems, which play a critical role in promoting medical and healthcare services. Developing an automated question classification system depends heavily on natural language processing and data mining techniques. Question classification methods based on classical machine learning techniques are limited in capturing hidden relationships among features, as well as in handling complex languages and very large-scale datasets. This paper therefore proposes a deep learning approach for question classification, since deep learning methods can extract implicit, hidden relationships and automatically generate dense feature representations. The proposed question classification model relies on unidirectional and bidirectional long short-term memory networks (LSTM and BiLSTM) and was developed primarily to handle the Arabic language in the healthcare domain. The features are represented using a domain-specific word embedding model (Word2Vec) trained on around 1.5 million medical consultations from Altibbi, a telemedicine company that serves as the case study and the source of the data. The proposed approach is a multi-class classification algorithm that automatically maps questions into 15 categories of medical specialities. The model is evaluated using several metrics, including accuracy, precision, recall, and F1-score. Notably, the proposed model achieved a classification accuracy of 87.2%.
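
The abstract names accuracy, precision, recall, and F1-score as evaluation metrics but does not say how the per-class values are combined across the 15 specialities. The following is a minimal sketch, assuming macro averaging (an unweighted mean over classes), which is one common choice and not necessarily the one the authors used. The function name `classification_metrics` and the toy speciality labels are hypothetical and serve only as illustration; they are not data from the paper.

```python
from collections import Counter


def classification_metrics(y_true, y_pred):
    """Return accuracy and macro-averaged precision, recall, and F1.

    Assumption: macro averaging, i.e. every class counts equally
    regardless of how many questions it contains.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")

    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class gains a false positive
            fn[truth] += 1   # true class gains a false negative

    precisions, recalls, f1s = [], [], []
    for label in labels:
        p_den = tp[label] + fp[label]
        r_den = tp[label] + fn[label]
        precision = tp[label] / p_den if p_den else 0.0
        recall = tp[label] / r_den if r_den else 0.0
        pr_sum = precision + recall
        f1 = 2 * precision * recall / pr_sum if pr_sum else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)

    n = len(labels)
    return {
        "accuracy": sum(tp.values()) / len(y_true),
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


# Toy example with three hypothetical speciality labels (illustrative only).
y_true = ["cardiology", "cardiology", "dermatology",
          "dermatology", "dentistry", "dentistry"]
y_pred = ["cardiology", "dermatology", "dermatology",
          "dermatology", "dentistry", "cardiology"]

metrics = classification_metrics(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

With imbalanced classes, macro averages can differ noticeably from accuracy, which is one reason to report all four metrics rather than accuracy alone.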

Pages: 1811-1827

Number of pages: 16
