Hybrid deep learning model for Arabic text classification based on mutual information

被引:7
|
作者
Abdulghani, Farah A. [1 ]
Abdullah, Nada A. Z. [1 ]
机构
[1] Univ Baghdad, Coll Sci, Dept Comp, Baghdad, Iraq
来源
关键词
Arabic text classification; Deep learning; Mutual information; C-LSTM;
D O I
10.1080/02522667.2022.2060910
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Text categorization refers to the process of grouping text or documents into classes or categories according to their content, which is a significant task in natural language processing. The majority of the present work focused on English text, with a few experiments on Arabic text. The text classification process consists of many steps, from preprocessing documents (removing stop words and stem method), to feature extraction and classification phase. A new improved approach for Arabic text categorization was proposed using mutual information in a hybrid deep learning model for classification. To test the proposed model, two datasets of Arabic documents are employed. The experimental results demonstrate that employing the proposed mutual information exceeds other prior techniques in terms of performance. In Akhbarona corpus, the Multi-Layer Perceptron achieved a minimum accuracy of 96.09%, while the hybrid Convolution-Long Short-Term Memory had a performance level of 99.28%. In Khaleej corpus, the Gated Recurrent Unit had the maximum accuracy of 98.23%, while Multi-Layer Perceptron had the lowest accuracy of 97.23%
引用
收藏
页码:1901 / 1908
页数:8
相关论文
共 50 条
  • [41] A Hybrid Deep Learning Technique for Personality Trait Classification From Text
    Ahmad, Hussain
    Asghar, Muhammad Usama
    Asghar, Muhammad Zubair
    Khan, Aurangzeb
    Mosavi, Amir H.
    [J]. IEEE ACCESS, 2021, 9 : 146214 - 146232
  • [42] Mutual Information based hybrid model and deep learning for Acute Lymphocytic Leukemia detection in single cell blood smear images
    Jha, Krishna Kumar
    Dutta, Himadri Sekhar
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2019, 179
  • [43] Multimodal Deep Learning using Images and Text for Information Graphic Classification
    Kim, Edward
    McCoy, Kathleen F.
    [J]. ASSETS'18: PROCEEDINGS OF THE 20TH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY, 2018, : 143 - 148
  • [44] Classification of Image and Text Data Using Deep Learning-Based LSTM Model
    Yechuri, Praveen Kumar
    Ramadass, Suguna
    [J]. TRAITEMENT DU SIGNAL, 2021, 38 (06) : 1809 - 1817
  • [45] Modified Pointwise Mutual Information-Based Feature Selection for Text Classification
    Georgieva-Trifonova, Tsvetanka
    [J]. PROCEEDINGS OF THE FUTURE TECHNOLOGIES CONFERENCE (FTC) 2021, VOL 2, 2022, 359 : 333 - 353
  • [46] An Efficient Hybrid Model for Arabic Text Recognition
    Lamtougui, Hicham
    El Moubtahij, Hicham
    Fouadi, Hassan
    Satori, Khalid
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (02): : 2871 - 2888
  • [47] Feature Selection for Text Classification Using Mutual Information
    Sel, Ilhami
    Karci, Ali
    Hanbay, Davut
    [J]. 2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP 2019), 2019,
  • [48] Fault Diagnosis Algorithm Based on Mutual Information and Deep Learning
    Shen Yang
    Zhu Lin
    Guo Jian
    Zhou Chuan
    Chen Qingwei
    Cheng Yong
    [J]. 2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 546 - 551
  • [49] A Novel Hybrid Model Based on Machine and Deep Learning Techniques for the Classification of Microalgae
    Kaya, Volkan
    Akgul, Smail
    Tanir, Ozge Zencir
    [J]. PHYTON-INTERNATIONAL JOURNAL OF EXPERIMENTAL BOTANY, 2023, 92 (09) : 2519 - 2534
  • [50] Machine learning algorithms in Arabic Text Classification: A Review
    Aboalnaser, Sara A.
    [J]. 12TH INTERNATIONAL CONFERENCE ON THE DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2019), 2019, : 290 - 295