A Comparative Study on Various Text Classification Methods

Cited by: 4
Authors
Khanna, Samarth [1]
Tiwari, Bishnu [1]
Das, Priyanka [1]
Das, Asit Kumar [1]
Affiliation
[1] Indian Inst Engn Sci & Technol, Sibpur, Howrah, India
Keywords
Text classification; Featurization; Classifiers; Receiver operating characteristics curve; KNN
DOI
10.1007/978-981-15-2449-3_46
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
With the rapid growth of channels for information exchange, text now spreads both faster and more widely. As a result, text has become an indispensable part of all kinds of decision-making, and it has become imperative to analyse the methods that can help make sense of it as efficiently as possible. We attempt this by discussing various tools that make the task more productive. We analyse the relationship between the way an algorithm works and how it performs on data sets with different types of featurization. The featurization techniques considered include bag of words/N-grams, TF-IDF vectorization, average Word2Vec and TF-IDF Word2Vec.
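The featurization techniques named in the abstract can be illustrated with a minimal, standard-library-only sketch. This is not the authors' implementation: the function names (`ngrams`, `bag_of_words`, `tf_idf`, `average_vector`) are hypothetical, the TF-IDF variant shown is the plain unsmoothed one, and the two-dimensional `toy_embeddings` are made-up stand-ins for vectors that a trained Word2Vec model would supply.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return the n-grams of a token list as space-joined strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def bag_of_words(docs, n=1):
    """Count n-gram occurrences per document over a shared, sorted vocabulary."""
    counts = [Counter(ngrams(d.lower().split(), n)) for d in docs]
    vocab = sorted(set().union(*counts))
    return vocab, [[c[t] for t in vocab] for c in counts]


def tf_idf(docs):
    """Unsmoothed TF-IDF: tf = count / document length, idf = log(N / df)."""
    vocab, bow = bag_of_words(docs)
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = [sum(1 for row in bow if row[j] > 0) for j in range(len(vocab))]
    idf = [math.log(n_docs / d) for d in df]
    rows = []
    for row in bow:
        total = sum(row) or 1  # guard against empty documents
        rows.append([(c / total) * w for c, w in zip(row, idf)])
    return vocab, rows, dict(zip(vocab, idf))


def average_vector(tokens, embeddings, weights=None):
    """Average the word vectors of a document.

    With weights=None this is a plain average (average Word2Vec style).
    Passing per-term IDF weights gives a TF-IDF weighted average, because
    repeated tokens contribute once per occurrence.
    Tokens without an embedding are skipped.
    """
    dim = len(next(iter(embeddings.values())))
    acc, norm = [0.0] * dim, 0.0
    for t in tokens:
        if t not in embeddings:
            continue
        w = 1.0 if weights is None else weights.get(t, 0.0)
        acc = [a + w * x for a, x in zip(acc, embeddings[t])]
        norm += w
    return [a / norm for a in acc] if norm else acc


# Toy corpus and hypothetical embeddings (assumptions for illustration only).
docs = ["the cat sat", "the dog sat", "the cat ran"]
toy_embeddings = {"cat": [1.0, 0.0], "dog": [0.0, 1.0], "sat": [1.0, 1.0]}

vocab, bow = bag_of_words(docs)
_, tfidf_rows, idf = tf_idf(docs)
plain = average_vector(docs[1].split(), toy_embeddings)
weighted = average_vector(docs[1].split(), toy_embeddings, weights=idf)
print(vocab, bow[0], tfidf_rows[0], plain, weighted)
```

In this sketch a word that occurs in every document (here "the") receives an IDF of zero, so it contributes nothing to either the TF-IDF vector or the weighted average, while rarer words pull the weighted average towards their own vectors.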
Pages: 539-549
Page count: 11