A new similarity computing method based on concept similarity in Chinese text processing

被引:0
|
作者
PENG Jing1
2 Department of Science and Technology
机构
基金
中国博士后科学基金; 北京市自然科学基金; 中国国家自然科学基金;
关键词
concept similarity; similarity computing; vector space; inner product space;
D O I
暂无
中图分类号
TP391.1 [文字信息处理];
学科分类号
081203 ; 0835 ;
摘要
The paper proposes a new text similarity computing method based on concept similarity in Chinese text processing. The new method converts text to words vector space model at first,and then splits words into a set of concepts. Through computing the inner products between concepts,it obtains the similarity between words. The new method computes the similarity of text based on the similarity of words at last. The contributions of the paper include:1) propose a new computing formula between words;2) propose a new text similarity computing method based on words similarity;3) successfully use the method in the application of similarity computing of WEB news;and 4) prove the validity of the method through extensive experiments.
引用
收藏
页码:1215 / 1230
页数:16
相关论文
共 50 条
  • [1] A new similarity computing method based on concept similarity in Chinese text processing
    Peng Jing
    Yang DongQing
    Tang ShiWei
    Wang TengJiao
    Gao Jun
    [J]. SCIENCE IN CHINA SERIES F-INFORMATION SCIENCES, 2008, 51 (09): : 1215 - 1230
  • [2] A new similarity computing method based on concept similarity in Chinese text processing
    Jing Peng
    DongQing Yang
    ShiWei Tang
    TengJiao Wang
    Jun Gao
    [J]. Science in China Series F: Information Sciences, 2008, 51
  • [3] A New Method of Computing Chinese Word Similarity Based on Statistics
    Zhang, Bo
    Hong, Lei
    Song, Shubin
    He, Liang
    Li, Guorong
    [J]. 2012 FIFTH INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING (BIFE), 2012, : 43 - 46
  • [4] Concept Bag: A New Method for Computing Concept Similarity in Biomedical Data
    Bradshaw, Richard L.
    Gouripeddi, Ramkiran
    Facelli, Julio C.
    [J]. BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2019), PT II, 2019, 11466 : 15 - 23
  • [5] Text similarity computing based on standard deviation
    Liu, T
    Guo, J
    [J]. ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 456 - 464
  • [6] Computing Text Similarity Based on HNC Theory
    Wei, Xiangfeng
    Zang, Hanfen
    Zhang, Quan
    [J]. RECENT ADVANCES OF ASIAN LANGUAGE PROCESSING TECHNOLOGIES, 2008, : 150 - 154
  • [7] An Improved Method of Computing Chinese Sentence Similarity
    Wang, Lu
    He, Zhongshi
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND COGNITIVE INFORMATICS, 2015, : 29 - 33
  • [8] The Similarity of Text Based on Hierarchical Network of Concept
    Xu Xiaoqing
    Wang Jingzhong
    [J]. PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL I, 2009, : 304 - +
  • [9] A concept similarity based text classification algorithm
    Peng, Jing
    Yang, Dong-qing
    Tang, Shi-Wei
    Gao, Jun
    Zhang, Peng-yi
    Fu, Yan
    [J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2007, : 535 - 539
  • [10] A New Method of calculating the Concept Similarity
    Yang, Guowei
    Chen, Min
    Zhang, Xiaofeng
    [J]. MECHATRONICS AND INDUSTRIAL INFORMATICS, PTS 1-4, 2013, 321-324 : 1951 - 1956