Accuracy Evaluation of Arabic Text Classification

被引:0
|
作者
Sayed, Mostafa [1 ]
Salem, Rashed [2 ]
Khedr, Ayman E. [3 ]
机构
[1] Beni Suef Univ, Fac Comp & Informat, Bani Suwayf, Egypt
[2] Menoufia Univ, Fac Comp & Informat, Menoufia, Egypt
[3] Future Univ Egypt, Fac Comp & Informat, Cairo, Egypt
关键词
Arabic Text Classification; Machine Learning techniques; Deep Learning; Support Vector Machine; K-Nearest Neighbor;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Categorization of Arabic text is a significant challenge nowadays owing to the richness of text that occurs through various modules. Also, the Arabic language is considered the fifth spoken one. During the last decade, scholars incubated few concerns about this regard comparing with English language. The objective behind this investigation is to perform and evaluate new mechanism relating to different techniques of machine learning specifically for classifying Arabic text in fresh different data set. Preprocessing steps along with the representation pattern of text are essential for handling text without artifacts. We use a binary term occurrence matrix as mutual information for feature vector representation method. This paper evaluates the outcomes of classification via using Deep learning, K-Nearest Neighbor, Support Vector Machine and Naive Bayes classifiers in similarity text level and N-gram level. It has been extracted out the outcomes that the Deep learning achieves better performance compared to itself in case of increasing similarity level and N-gram level.
引用
收藏
页码:365 / 370
页数:6
相关论文
共 50 条
  • [21] NADA: New Arabic Dataset for Text Classification
    Alalyani, Nada
    Marie-Sainte, Souad Larabi
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (09) : 206 - 212
  • [22] Evaluating Various Tokenizers for Arabic Text Classification
    Zaid Alyafeai
    Maged S. Al-shaibani
    Mustafa Ghaleb
    Irfan Ahmad
    [J]. Neural Processing Letters, 2023, 55 : 2911 - 2933
  • [23] Arabic text classification based on analogical proportions
    Bounhas, Myriam
    Elayeb, Bilel
    Chouigui, Amina
    Hussain, Amir
    Cambria, Erik
    [J]. EXPERT SYSTEMS, 2024,
  • [24] Evaluating Various Tokenizers for Arabic Text Classification
    Alyafeai, Zaid
    Al-shaibani, Maged S.
    Ghaleb, Mustafa
    Ahmad, Irfan
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (03) : 2911 - 2933
  • [25] Named entity recognition and classification for text in arabic
    Abuleil, S
    Evens, M
    [J]. INTELLIGENT AND ADAPTIVE SYSTEMS AND SOFTWARE ENGINEERING, 2004, : 89 - 94
  • [26] Enhanced Arabic information retrieval system based on Arabic text classification
    Ghwanmeh, Sameh
    Kanaan, Ghassan
    Al-Shalabi, Riyad
    Ababneh, Ahmad
    [J]. 2007 INNOVATIONS IN INFORMATION TECHNOLOGIES, VOLS 1 AND 2, 2007, : 527 - +
  • [27] Arabic Text Summary Evaluation Method
    Ellouze, Samira
    Jaoua, Maher
    Belguith, Lamia Hadrich
    [J]. SUSTAINABLE ECONOMIC GROWTH, EDUCATION EXCELLENCE, AND INNOVATION MANAGEMENT THROUGH VISION 2020, VOLS I-VII, 2017, : 3532 - 3541
  • [28] The Effect of Stemming on Arabic Text Classification: An Empirical Study
    Wahbeh, Abdullah
    Al-Kabi, Mohammed
    Al-Radaideh, Qasem
    Al-Shawakfa, Emad
    Alsmadi, Izzat
    [J]. INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2011, 1 (03) : 54 - 70
  • [29] Standard and Dialectal Arabic Text Classification for Sentiment Analysis
    Maghfour, Mohcine
    Elouardighi, Abdeljalil
    [J]. MODEL AND DATA ENGINEERING, MEDI 2018, 2018, 11163 : 282 - 291
  • [30] Arabic text classification using deep learning models
    Elnagar, Ashraf
    Al-Debsi, Ridhwan
    Einea, Omar
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (01)