Text Document Classification and Pattern Recognition

被引:2
|
作者
Wu, Qin [1 ]
Fuller, Eddie [1 ]
Zhang, Cun-Quan [1 ]
机构
[1] W Virginia Univ, Dept Math, Morgantown, WV 26506 USA
关键词
D O I
10.1109/ASONAM.2009.21
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this extended abstract, a novel approach is proposed for text pattern recognition. Instead of the traditional models which are mainly based on the frequency of keywords for text document classification, we introduce a new graph theory model which is constructed based on both information about frequency and position of keywords. We applied this new idea to the detection of fraudulent emails written by the same person, and plagiarized publications. The results on these case studies show that this new method performs much better than traditional methods.
引用
收藏
页码:405 / 410
页数:6
相关论文
共 50 条
  • [1] Pattern Document Weight Discovery For Text Classification Mining
    Brindha, S.
    Prabha, K.
    Sukumaran, S.
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND ELECTRONICS SYSTEMS (ICCES), 2016, : 651 - 655
  • [2] Text Document Classification
    Novovicova, Jana
    [J]. ERCIM NEWS, 2005, (62): : 53 - 54
  • [3] Text classification with document embeddings
    [J]. Huang, Chaochao (chaochaohuang12@fudan.edu.cn), 1600, Springer Verlag (8801):
  • [4] Text Classification with Document Embeddings
    Huang, Chaochao
    Qiu, Xipeng
    Huang, Xuanjing
    [J]. CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2014, 2014, 8801 : 131 - 140
  • [5] A Similarity Function for Feature Pattern Clustering and High Dimensional Text Document Classification
    Kotte, Vinay Kumar
    Rajavelu, Srinivasan
    Rajsingh, Elijah Blessing
    [J]. FOUNDATIONS OF SCIENCE, 2020, 25 (04) : 1077 - 1094
  • [6] A Similarity Function for Feature Pattern Clustering and High Dimensional Text Document Classification
    Vinay Kumar Kotte
    Srinivasan Rajavelu
    Elijah Blessing Rajsingh
    [J]. Foundations of Science, 2020, 25 : 1077 - 1094
  • [7] AUTOMATIC DOCUMENT CLASSIFICATION-SYSTEM USING PATTERN-RECOGNITION TECHNIQUES
    HAMILL, KA
    ZAMORA, A
    [J]. PROCEEDINGS OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1978, 15 : 152 - 155
  • [8] Discriminative features for text document classification
    Torkkola, K
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2003, 6 (04) : 301 - 308
  • [9] Discriminative features for text document classification
    K. Torkkola
    [J]. Formal Pattern Analysis & Applications, 2004, 6 : 301 - 308
  • [10] Incremental learning for text document classification
    Chen, ZhiHang
    Huang, Liping
    Murphey, Yi L.
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 2591 - 2596