Text Document Classification

被引:0
|
作者
Novovicova, Jana [1 ]
机构
[1] UTIA, CRCIM, Prague, Czech Republic
来源
ERCIM NEWS | 2005年 / 62期
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
During the last twenty years the number of text documents in digital form has grown enormously in size. As a consequence, it is of great practical importance to be able to automatically organize and classify documents. Research into text classification aims to partition unstructured sets of documents into groups that describe the contents of the documents. There are two main variants of text classification: text clustering and text categorization. The former is concerned with finding a latent group structure in the set of documents, while the latter (also known as text classification) can be seen as the task of structuring the repository of documents according to a group structure that is known in advance.
引用
收藏
页码:53 / 54
页数:2
相关论文
共 50 条
  • [1] Text classification with document embeddings
    Huang, Chaochao (chaochaohuang12@fudan.edu.cn), 1600, Springer Verlag (8801):
  • [2] Text Classification with Document Embeddings
    Huang, Chaochao
    Qiu, Xipeng
    Huang, Xuanjing
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2014, 2014, 8801 : 131 - 140
  • [3] Discriminative features for text document classification
    Torkkola, K
    PATTERN ANALYSIS AND APPLICATIONS, 2003, 6 (04) : 301 - 308
  • [4] Discriminative features for text document classification
    K. Torkkola
    Formal Pattern Analysis & Applications, 2004, 6 : 301 - 308
  • [5] Text Document Classification and Pattern Recognition
    Wu, Qin
    Fuller, Eddie
    Zhang, Cun-Quan
    2009 INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING, 2009, : 405 - 410
  • [6] Incremental learning for text document classification
    Chen, ZhiHang
    Huang, Liping
    Murphey, Yi L.
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 2591 - 2596
  • [7] Text Graph Transformer for Document Classification
    Zhang, Haopeng
    Zhang, Jiawei
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 8322 - 8327
  • [8] Protein classification based on text document classification techniques
    Cheng, BYM
    Carbonell, JG
    Klein-Seetharaman, J
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 58 (04) : 955 - 970
  • [9] Text document classification using swarm intelligence
    Vizine, AL
    de Castro, LN
    Gudwin, RR
    2005 INTERNATIONAL CONFERENCE ON INTEGRATION OF KNOWLEDGE INTENSIVE MULTI-AGENT SYSTEMS: KIMAS'05: MODELING, EXPLORATION, AND ENGINEERING, 2005, : 134 - 139
  • [10] A New Method of Automatic Text Document Classification
    Yatsko, V. A.
    AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2021, 55 (03) : 122 - 133