String Similarity Computing Based on Position And Cosine

Cited by: 0
Authors
Cheng, Na [1]
Yu, Zhongqing [1, 2]
Wang, Kaixi [1, 2]
Affiliations
[1] Qingdao Univ, Coll Comp Sci & Technol, Qingdao, Shandong, Peoples R China
[2] Qingdao Univ, Coll Data Sci & Software Engn, Qingdao, Shandong, Peoples R China
Keywords
angle cosine; position encoding; approximately duplicate records; data cleaning; products select;
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
E-business platforms need to support product selection based on product features and cost performance, and data must also be cleaned during production and sales, so computing the similarity between products is important. This paper proposes a new method for computing string similarity: each string is segmented into words, the word positions are numbered, and the string is converted into a vector. The similarity between two strings is then the cosine of the angle between their vectors. Experiments show that the method avoids the extreme (maximum or minimum) values returned by LCS and GST. The proposed method also improves the accuracy of similarity calculation.
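The abstract does not give the exact encoding, so the following is only a minimal Python sketch of the general idea (segment, number positions, vectorize, take the cosine). The function names `position_vector` and `position_cosine_similarity`, whitespace segmentation, lowercasing, and the choice to encode each word by its first 1-based position (0 when absent) are all assumptions made for illustration, not the authors' published scheme.

```python
import math


def position_vector(words, vocabulary):
    """Encode `words` as a vector over `vocabulary`.

    Assumption: each component is the 1-based position of the word's
    first occurrence in `words`, or 0 if the word does not occur.
    """
    first_pos = {}
    for index, word in enumerate(words, start=1):
        first_pos.setdefault(word, index)
    return [first_pos.get(word, 0) for word in vocabulary]


def position_cosine_similarity(a, b):
    """Cosine of the angle between the position vectors of two strings."""
    # Assumption: segmentation is a plain whitespace split on lowercased text.
    words_a, words_b = a.lower().split(), b.lower().split()
    vocabulary = sorted(set(words_a) | set(words_b))
    vec_a = position_vector(words_a, vocabulary)
    vec_b = position_vector(words_b, vocabulary)

    dot = sum(x * y for x, y in zip(vec_a, vec_b))
    norm_a = math.sqrt(sum(x * x for x in vec_a))
    norm_b = math.sqrt(sum(y * y for y in vec_b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty string has no direction to compare
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    print(position_cosine_similarity("red cotton shirt", "red cotton shirt"))
    print(position_cosine_similarity("red cotton shirt", "shirt cotton red"))
    print(position_cosine_similarity("red cotton shirt", "blue wool hat"))
```

Under these assumptions, identical strings score 1, strings with no shared words score 0, and the same words in a different order score in between (10/14 for the reordered pair above), which is where position numbering differs from a plain bag-of-words cosine.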
Pages: 256-261
Page count: 6