Privacy-Preserving Similarity-Based Text Retrieval

被引:38
|
作者
Pang, Hweehwa [1 ]
Shen, Jialie [1 ]
Krishnan, Ramayya [2 ]
机构
[1] Singapore Management Univ, Sch Informat Syst, Singapore, Singapore
[2] Carnegie Mellon Univ, Sch Informat Syst & Management, Pittsburgh, PA 15213 USA
关键词
Security; Privacy of search queries; security in text retrieval; singular value decomposition; COMPRESSION;
D O I
10.1145/1667067.1667071
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Users of online services are increasingly wary that their activities could disclose confidential information on their business or personal activities. It would be desirable for an online document service to perform text retrieval for users, while protecting the privacy of their activities. In this article, we introduce a privacy-preserving, similarity-based text retrieval scheme that (a) prevents the server from accurately reconstructing the term composition of queries and documents, and (b) anonymizes the search results from unauthorized observers. At the same time, our scheme preserves the relevance-ranking of the search server, and enables accounting of the number of documents that each user opens. The effectiveness of the scheme is verified empirically with two real text corpora.
引用
收藏
页数:39
相关论文
共 50 条
  • [1] Verifiable Privacy-Preserving Multi-Keyword Text Search in the Cloud Supporting Similarity-Based Ranking
    Sun, Wenhai
    Wang, Bing
    Cao, Ning
    Li, Ming
    Lou, Wenjing
    Hou, Y. Thomas
    Li, Hui
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (11) : 3025 - 3035
  • [2] Privacy-preserving personalized similarity retrieval of large CT image sequences
    Jiang, Nan
    Zhuang, Yi
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (13) : 20407 - 20429
  • [3] Quantitative Similarity-based Evaluation of Text Retrieval Algorithms
    Didari, Parastoo
    Babai, Behrad
    Shakery, Azadeh
    [J]. 2009 14TH INTERNATIONAL COMPUTER CONFERENCE, 2009, : 264 - 269
  • [4] Privacy-preserving personalized similarity retrieval of large CT image sequences
    Nan Jiang
    Yi Zhuang
    [J]. Multimedia Tools and Applications, 2023, 82 : 20407 - 20429
  • [5] Vector Based Privacy-Preserving Document Similarity with LSA
    Yu, Xiaojie
    Chen, Xiaojun
    Shi, Jinqiao
    [J]. 2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 1383 - 1387
  • [6] Privacy-Preserving Outsourced Similarity Search
    Kozak, Stepan
    Novak, David
    Zezula, Pavel
    [J]. JOURNAL OF DATABASE MANAGEMENT, 2014, 25 (03) : 48 - 71
  • [7] Privacy-Preserving Text Similarity via Non-Prefix-Free Codes
    Kulekci, M. Oguzhan
    Habib, Ismail
    Aghabaiglou, Amir
    [J]. SIMILARITY SEARCH AND APPLICATIONS (SISAP 2019), 2019, 11807 : 94 - 102
  • [8] Proactive Privacy-preserving Learning for Retrieval
    Zhang, Peng-Fei
    Huang, Zi
    Xu, Xin-Shun
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 3369 - 3376
  • [9] Privacy-Preserving Text Mining as a Service
    Costantino, Gianpiero
    La Marra, Antonio
    Martinelli, Fabio
    Saracino, Andrea
    Sheikhalishahi, Mina
    [J]. 2017 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2017, : 890 - 897
  • [10] Privacy-preserving Neural Representations of Text
    Coavoux, Maximin
    Narayan, Shashi
    Cohen, Shay B.
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 1 - 10