Deep Learning for Hindi Text Classification: A Comparison

被引:9
|
作者
Joshi, Ramchandra [1 ]
Goel, Purvi [2 ]
Joshi, Raviraj [2 ]
机构
[1] Pune Inst Comp Technol, Dept Comp Engn, Pune, Maharashtra, India
[2] Indian Inst Technol Madras, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India
关键词
Natural language processing; Convolutional neural networks; Recurrent neural networks; Sentence embedding; Hindi text classification;
D O I
10.1007/978-3-030-44689-5_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural Language Processing (NLP) and especially natural language text analysis have seen great advances in recent times. Usage of deep learning in text processing has revolutionized the techniques for text processing and achieved remarkable results. Different deep learning architectures like CNN, LSTM, and very recent Transformer have been used to achieve state of the art results variety on NLP tasks. In this work, we survey a host of deep learning architectures for text classification tasks. The work is specifically concerned with the classification of Hindi text. The research in the classification of morphologically rich and low resource Hindi language written in Devanagari script has been limited due to the absence of large labeled corpus. In this work, we used translated versions of English data-sets to evaluate models based on CNN, LSTM and Attention. Multilingual pre-trained sentence embeddings based on BERT and LASER are also compared to evaluate their effectiveness for the Hindi language. The paper also serves as a tutorial for popular text classification techniques.
引用
收藏
页码:94 / 101
页数:8
相关论文
共 50 条
  • [41] Comparative Analysis of Deep Learning Models for Myanmar Text Classification
    Phyu, Myat Sapal
    Nwet, Khin Thandar
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2020), PT I, 2020, 12033 : 76 - 85
  • [42] Multimodal deep learning for dementia classification using text and audio
    Lin, Kaiying
    Washington, Peter Y.
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01):
  • [43] Research on the method of educational text classification based on deep learning
    Wang, Yuqin
    [J]. INTERNATIONAL JOURNAL OF CONTINUING ENGINEERING EDUCATION AND LIFE-LONG LEARNING, 2022, 32 (03) : 313 - 326
  • [44] A Proposed Deep Learning based Framework for Arabic Text Classification
    Sayed, Mostafa
    Abdelkader, Hatem
    Khedr, Ayman E.
    Salem, Rashed
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (08) : 305 - 313
  • [45] Deep Learning for Extreme Multi-label Text Classification
    Liu, Jingzhou
    Chang, Wei-Cheng
    Wu, Yuexin
    Yang, Yiming
    [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 115 - 124
  • [46] Emotionally charged text classification with deep learning and sentiment semantic
    Jeow Li Huan
    Arif Ahmed Sekh
    Chai Quek
    Dilip K. Prasad
    [J]. Neural Computing and Applications, 2022, 34 : 2341 - 2351
  • [47] Deep Learning-based Text Classification: A Comprehensive Review
    Minaee, Shervin
    Kalchbrenner, Nal
    Cambria, Erik
    Nikzad, Narjes
    Chenaghlu, Meysam
    Gao, Jianfeng
    [J]. ACM COMPUTING SURVEYS, 2022, 54 (03)
  • [48] Deep Text Prior: Weakly Supervised Learning for Assertion Classification
    Liventsev, Vadim
    Fedulova, Irina
    Dylov, Dmitry
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 243 - 257
  • [49] Survey of Short Text Classification Methods Based on Deep Learning
    Gan, Yating
    An, Jianye
    Xu, Xue
    [J]. Computer Engineering and Applications, 2024, 59 (04) : 43 - 53
  • [50] A deep learning approach for image and text classification using neutrosophy
    Wajid M.A.
    Zafar A.
    Wajid M.S.
    [J]. International Journal of Information Technology, 2024, 16 (2) : 853 - 859