Vector Based Privacy-Preserving Document Similarity with LSA

被引:0
|
作者
Yu, Xiaojie [1 ,2 ]
Chen, Xiaojun [1 ]
Shi, Jinqiao [1 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
关键词
privacy-preserving; document similarity; latent semantic analysis;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Document similarity is the foundation of many intelligent data processing systems, such as information retrieval, text classification and clustering. However, traditional document similarity algorithms are challenged by the privacy-preserving problem. Recently, privacy-preserving document similarity approaches are provided to solve this problem and there are two kinds of approaches which are vector based protocols and set based protocols. Existing vector based protocols mainly use vector space model for document similarity computation. But vector space model has deficiencies to compute document similarity effectively and efficiently. In this paper, we focus on vector based document similarity and present a novel protocol with latent semantic analysis. Experimental evaluation shows that our protocol has better accuracy and performance than existing protocols.
引用
收藏
页码:1383 / 1387
页数:5
相关论文
共 50 条
  • [1] Privacy-Preserving Similarity-Based Text Retrieval
    Pang, Hweehwa
    Shen, Jialie
    Krishnan, Ramayya
    [J]. ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2010, 10 (01)
  • [2] Privacy-Preserving Outsourced Similarity Search
    Kozak, Stepan
    Novak, David
    Zezula, Pavel
    [J]. JOURNAL OF DATABASE MANAGEMENT, 2014, 25 (03) : 48 - 71
  • [3] STRIP: Privacy-Preserving Vector-Based Routing
    Henecka, Wilko
    Roughan, Matthew
    [J]. 2013 21ST IEEE INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP), 2013,
  • [4] A Practical Privacy-Preserving Algorithm for Document Data
    Mimoto, Tomoaki
    Kiyomoto, Shinsaku
    Kitamura, Koji
    Miyaji, Atsuko
    [J]. 2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 1377 - 1384
  • [5] Efficient privacy-preserving similar document detection
    Murugesan, Mummoorthy
    Jiang, Wei
    Clifton, Chris
    Si, Luo
    Vaidya, Jaideep
    [J]. VLDB JOURNAL, 2010, 19 (04): : 457 - 475
  • [6] Privacy-preserving similarity coefficients for binary data
    Wong, Kok-Seng
    Kim, Myung Ho
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2013, 65 (09) : 1280 - 1290
  • [7] Efficient privacy-preserving similar document detection
    Mummoorthy Murugesan
    Wei Jiang
    Chris Clifton
    Luo Si
    Jaideep Vaidya
    [J]. The VLDB Journal, 2010, 19 : 457 - 475
  • [8] Privacy-Preserving Classification Scheme Based on Support Vector Machine
    Mao, Qinyu
    Chen, Yange
    Duan, Pu
    Zhang, Benyu
    Hong, Zhiyong
    Wang, Baocang
    [J]. IEEE SYSTEMS JOURNAL, 2022, 16 (04): : 5906 - 5916
  • [9] Privacy preserving scheme for document similarity detection
    Abdulsada, Ayad, I
    Al-Darraji, Salah
    Honi, Dhafer G.
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2022, 30 (03) : 609 - 628
  • [10] Privacy-Preserving Collaborative Filtering Protocol based on Similarity between Items
    Tada, Minako
    Kikuchi, Hiroaki
    Puntheeranurak, Sutheera
    [J]. 2010 24TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2010, : 573 - 578