A Comparative Study on Various Text Classification Methods

Cited by: 4
Authors
Khanna, Samarth [1]
Tiwari, Bishnu [1]
Das, Priyanka [1]
Das, Asit Kumar [1]
Affiliations
[1] Indian Inst Engn Sci & Technol, Sibpur, Howrah, India
Keywords
Text classification; Featurization; Classifiers; Receiver operating characteristics curve; KNN
DOI
10.1007/978-981-15-2449-3_46
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As the channels of information exchange have multiplied rapidly, text now spreads both faster and more widely. Text has consequently become an indispensable input to all kinds of decision-making, so it is imperative to analyse the methods that help make sense of it as efficiently as possible. We attempt this by discussing various tools that make the task more productive. We analyse the relationship between the way an algorithm works and how it performs on datasets with different types of featurization. The featurization techniques analysed are bag of words/N-grams, Tf-Idf vectorization, average Word2Vec and Tf-Idf Word2Vec.
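The featurization techniques named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy corpus, the two-dimensional `toy_vectors` (standing in for trained Word2Vec embeddings), the unsmoothed `log(N / df)` form of Idf, and all function names are assumptions made for illustration only.

```python
import math
from collections import Counter

# Hypothetical toy corpus, not taken from the paper.
docs = [
    "the movie was good",
    "the movie was bad",
    "good food and good service",
]
tokenized = [d.split() for d in docs]

def ngrams(tokens, n):
    """Return the n-grams of a token list as space-joined strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def bag_of_words(tokens, n=1):
    """Bag of words / N-grams: count each n-gram in one document."""
    return Counter(ngrams(tokens, n))

def idf(term, corpus):
    """Unsmoothed inverse document frequency, log(N / df) (an assumed variant)."""
    df = sum(1 for tokens in corpus if term in tokens)
    return math.log(len(corpus) / df)

def tf_idf(tokens, corpus):
    """Tf-Idf weight per term, with tf taken as relative frequency."""
    counts = Counter(tokens)
    return {t: (c / len(tokens)) * idf(t, corpus) for t, c in counts.items()}

# Hypothetical 2-dimensional vectors standing in for trained Word2Vec embeddings.
toy_vectors = {"good": [1.0, 0.0], "bad": [-1.0, 0.0], "movie": [0.0, 1.0]}

def average_vector(tokens, vectors):
    """Average Word2Vec: unweighted mean of the vectors of known tokens."""
    known = [vectors[t] for t in tokens if t in vectors]
    return [sum(col) / len(known) for col in zip(*known)]

def tf_idf_weighted_vector(tokens, vectors, corpus):
    """Tf-Idf Word2Vec: mean of word vectors weighted by each term's Tf-Idf."""
    weights = tf_idf(tokens, corpus)
    known = [t for t in weights if t in vectors]
    total = sum(weights[t] for t in known)
    if total == 0:
        # Every known term occurs in all documents; fall back to the plain mean.
        return average_vector(tokens, vectors)
    dim = len(vectors[known[0]])
    return [sum(weights[t] * vectors[t][i] for t in known) / total
            for i in range(dim)]

if __name__ == "__main__":
    print(bag_of_words(tokenized[2]))
    print(bag_of_words(tokenized[0], n=2))
    print(tf_idf(tokenized[1], tokenized))
    print(average_vector(tokenized[1], toy_vectors))
    print(tf_idf_weighted_vector(tokenized[1], toy_vectors, tokenized))
```

In the second document the rarer word "bad" receives a larger Tf-Idf weight than "movie", so the weighted vector leans further toward "bad" than the plain average does. That difference is the practical distinction between average Word2Vec and Tf-Idf Word2Vec.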
Pages: 539-549
Number of pages: 11