Subsequence Kernels-Based Arabic Text Classification

被引:0
|
作者
Nehar, Attia [2 ]
Benmessaoud, Abdelkader [1 ]
Cherroun, Hadda [1 ]
Ziadi, Djelloul [3 ]
机构
[1] Univ Amar Telidji, Lab Informat & Math, Laghouat, Algeria
[2] Univ Ziane Achour, Djelfa, Algeria
[3] Normandie Univ, Lab LITIS, EA 4108, Rouen, France
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Kernel methods have known huge success in machine learning. This success is mainly due to their flexibility to deal with high dimensionality of the feature space of complex data such as graphs, trees or textual data. In the field of text classification (TC) their performances have supplanted traditional algorithms. For textual data, different kernels were introduced (P-spectrum, AII-Sub-sequences, Gap-Weighted Subsequences kernel,...) to improve the performance of TC systems. In this paper, we carried out a system for Arabic TC which supports aspects of order and co-occurrence of words within a text. Transducers, specific automata, are used to represent documents. Such representation allows an efficient implementation of subsequence kernel. An empirical study is conducted to evaluate the ATC system on the large SPA corpus. Results show an improvement of the classification in terms of precision.
引用
收藏
页码:206 / 213
页数:8
相关论文
共 50 条
  • [1] Rational kernels for Arabic Root Extraction and Text Classification
    Nehar, Attia
    Ziadi, Djelloul
    Cherroun, Hadda
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2016, 28 (02) : 157 - 169
  • [2] Weighted Radial Basis Function Kernels-Based Support Vector Machines for Multispectral Image Classification
    Chen, Shih-Yu
    Ouyang, Yen Chieh
    Chang, Chein-I
    2012 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2012, : 4339 - 4342
  • [3] Compression-Based Arabic Text Classification
    Ta'amneh, Haneen
    Abu Keshek, Ehsan
    Issa, Manar Bani
    Al-Ayyoub, Mahmoud
    Jararweh, Yaser
    2014 IEEE/ACS 11TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2014, : 594 - 600
  • [4] Arabic Text Classification based on Semantic Relations
    Hijazi, Musab
    Zeki, Akram
    Ismail, Amelia
    INTERNATIONAL JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE, 2022, 17 (02): : 937 - 946
  • [5] Arabic text classification based on analogical proportions
    Bounhas, Myriam
    Elayeb, Bilel
    Chouigui, Amina
    Hussain, Amir
    Cambria, Erik
    EXPERT SYSTEMS, 2024, 41 (10)
  • [6] Enhanced Arabic information retrieval system based on Arabic text classification
    Ghwanmeh, Sameh
    Kanaan, Ghassan
    Al-Shalabi, Riyad
    Ababneh, Ahmad
    2007 INNOVATIONS IN INFORMATION TECHNOLOGIES, VOLS 1 AND 2, 2007, : 527 - +
  • [7] Representative Kernels-Based CNN for Faster Transmission in Federated Learning
    Li, Wei
    Shen, Zichen
    Liu, Xiulong
    Wang, Mingfeng
    Ma, Chao
    Ding, Chuntao
    Cao, Jiannong
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 13062 - 13075
  • [8] Arabic Text Mining Using Rule Based Classification
    Thabtah, Fadi
    Gharaibeh, Omar
    Al-Zubaidy, Rashid
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2012, 11 (01)
  • [9] Arabic Text Classification Based on Word and Document Embeddings
    El Mahdaouy, Abdelkader
    Gaussier, Eric
    El Alaoui, Said Ouatik
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 32 - 41
  • [10] Classification of Cyberbullying Text in Arabic
    Rachid, Benaissa Azzeddine
    Azza, Harbaoui
    Ben Ghezala, Hajjami Henda
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,