Text Document Classification

Author
Novovicova, Jana [1]
Affiliation
[1] UTIA, CRCIM, Prague, Czech Republic
Source
ERCIM NEWS | 2005 / Issue 62
Keywords
DOI
Not available
Chinese Library Classification
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Over the last twenty years, the number of text documents in digital form has grown enormously. As a consequence, it is of great practical importance to be able to organize and classify documents automatically. Research into text classification aims to partition unstructured sets of documents into groups that describe the contents of the documents. There are two main variants of text classification: text clustering and text categorization. The former is concerned with finding a latent group structure in the set of documents, while the latter (also known as text classification) can be seen as the task of structuring the repository of documents according to a group structure that is known in advance.
Pages: 53-54
Page count: 2