Text Document Classification

Author
Novovicova, Jana [1]
Affiliation
[1] UTIA, CRCIM, Prague, Czech Republic
Source
ERCIM NEWS | 2005 / Issue 62
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
During the last twenty years, the number of text documents in digital form has grown enormously. As a consequence, it is of great practical importance to be able to organize and classify documents automatically. Research into text classification aims to partition unstructured sets of documents into groups that describe the contents of the documents. There are two main variants of text classification: text clustering and text categorization. The former is concerned with finding a latent group structure in the set of documents, while the latter (often itself called text classification) can be seen as the task of structuring the repository of documents according to a group structure that is known in advance.
Pages: 53-54
Page count: 2