Text representation combining syntax in vector space model

Cited by: 0
Authors
Liu P.-Y. [1, 2]
Yang Y.-Z. [1, 2]
Zhao J. [1, 2]
Affiliations
[1] School of Information Science and Engineering, Shandong Normal University
[2] Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology
Keywords
Feature item; Phrase; Syntactic cues
DOI
10.4156/aiss.vol3.issue7.30
Abstract
To improve the semantic description of feature items in the vector space model (VSM) and to overcome the assumption that semantic units are independent of one another, this paper proposes a phrase-based feature granulation method. The method considers how text is represented and how feature items are organized: it identifies base phrases through syntactic cues, builds a relation tree containing the feature items and the head verb, and then replaces the words in the bag-of-words (BOW) representation with base phrases. Experimental results indicate that the new approach improves classifier performance, strengthens the relationships between terms, overcomes the mutual independence of feature items, and remains effective even when the number of feature items is small.
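The core idea of replacing single words with base phrases can be illustrated with a minimal sketch. This is not the authors' implementation: the toy part-of-speech lexicon `TOY_POS`, the grouping rule (maximal adjective/noun runs that end in a noun), and the function names `base_phrases`, `phrase_bag`, and `word_bag` are all assumptions made for illustration, and the relation tree with the head verb is omitted.

```python
from collections import Counter

# Hypothetical toy POS lexicon (assumption); a real system would rely on
# a tagger or parser to supply the syntactic cues.
TOY_POS = {
    "the": "DET", "a": "DET",
    "vector": "NOUN", "space": "NOUN", "model": "NOUN",
    "text": "NOUN", "classifier": "NOUN", "features": "NOUN",
    "new": "ADJ", "sparse": "ADJ",
    "improves": "VERB", "uses": "VERB",
}


def base_phrases(tokens):
    """Group maximal runs of ADJ/NOUN tokens that end in a NOUN into phrases."""
    phrases, run = [], []

    def flush():
        # Drop trailing non-noun tokens so every phrase ends in a noun.
        while run and TOY_POS.get(run[-1]) != "NOUN":
            run.pop()
        if run:
            phrases.append(" ".join(run))
        run.clear()

    for tok in tokens:
        if TOY_POS.get(tok) in ("ADJ", "NOUN"):
            run.append(tok)
        else:
            flush()  # any other tag (or unknown word) closes the current run
    flush()
    return phrases


def word_bag(text):
    """Plain bag-of-words: every token is an independent feature."""
    return Counter(text.lower().split())


def phrase_bag(text):
    """Phrase-based bag: base phrases replace the individual words."""
    return Counter(base_phrases(text.lower().split()))


sentence = "The vector space model improves the new text classifier"
print(word_bag(sentence))
print(phrase_bag(sentence))
```

In the word-level bag, "vector", "space", and "model" are three unrelated features; in the phrase-level bag they form the single feature "vector space model", which is one simple way to see how phrases reduce the independence between feature items.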
Pages: 251-259
Number of pages: 8