Chinese-keyword Fuzzy Search and Extraction over Encrypted Patent Documents

被引:0
|
作者
Ding, Wei [1 ]
Liu, Yongji [1 ]
Zhang, Jianfeng [2 ]
机构
[1] China Def Sci & Technol Informat Ctr, Beijing 100036, Peoples R China
[2] Natl Univ Def Technol, Changsha 410073, Hunan, Peoples R China
基金
中国博士后科学基金;
关键词
Chinese Keywords; Fuzzy Search; Extraction; Encrypted Documents;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud storage for information sharing is likely indispensable to the future national defence library in China e.g., for searching national defence patent documents, while security risks need to be maximally avoided using data encryption. Patent keywords are the high-level summary of the patent document, and it is significant in practice to efficiently extract and search the key words in the patent documents. Due to the particularity of Chinese keywords, most existing algorithms in English language environment become ineffective in Chinese scenarios. For extracting the keywords from patent documents, the manual keyword extraction is inappropriate when the amount of files is large. An improved method based on the term frequency-inverse document frequency (TF-IDF) is proposed to auto-extract the keywords in the patent literature. The extracted keyword sets also help to accelerate the keyword search by linking finite keywords with a large amount of documents. Fuzzy keyword search is introduced to further increase the search efficiency in the cloud computing scenarios compared to exact keyword search methods. Based on the Chinese Pinyin similarity, a Pinyin-Gram-based algorithm is proposed for fuzzy search in encrypted Chinese environment, and a keyword trapdoor search index structure based on the n-ary tree is designed. Both the search efficiency and accuracy of the proposed scheme are verified through computer experiments.
引用
收藏
页码:168 / 176
页数:9
相关论文
共 50 条
  • [31] Privacy-Preserving Multi-Keyword Fuzzy Search over Encrypted Data in the Cloud
    Wang, Bing
    Yu, Shucheng
    Lou, Wenjing
    Hou, Y. Thomas
    2014 PROCEEDINGS IEEE INFOCOM, 2014, : 2112 - 2120
  • [32] An improved multi-keyword fuzzy search scheme based on BloomFilter over encrypted text
    Wu X.
    Yu N.-H.
    Zhang W.-M.
    Kongzhi yu Juece/Control and Decision, 2019, 34 (01): : 97 - 104
  • [33] A shareable keyword search over encrypted data in cloud computing
    Xu, Li
    Weng, Chi-Yao
    Yuan, Lun-Pin
    Wu, Mu-En
    Tso, Raylin
    Sun, Hung-Min
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (03): : 1001 - 1023
  • [34] Similarity of Private Keyword Search over Encrypted Document Collection
    Elmehdwi, Yousef
    Jiang, Wei
    Hurson, Ali
    ADVANCES IN COMPUTERS, VOL 94, 2014, 94 : 71 - 102
  • [35] Concisely Indexed Multi-Keyword Rank Search on Encrypted Cloud Documents
    Chin, Tai-Lin
    Shih, Wan-Ni
    APPLIED SCIENCES-BASEL, 2021, 11 (23):
  • [36] Efficient and Expressive Keyword Search Over Encrypted Data in Cloud
    Cui, Hui
    Wan, Zhiguo
    Deng, Robert H.
    Wang, Guilin
    Li, Yingjiu
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2018, 15 (03) : 409 - 422
  • [37] A shareable keyword search over encrypted data in cloud computing
    Li Xu
    Chi-Yao Weng
    Lun-Pin Yuan
    Mu-En Wu
    Raylin Tso
    Hung-Min Sun
    The Journal of Supercomputing, 2018, 74 : 1001 - 1023
  • [38] Secure Ranked Keyword Search over Encrypted Cloud Data
    Wang, Cong
    Cao, Ning
    Li, Jin
    Ren, Kui
    Lou, Wenjing
    2010 INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS ICDCS 2010, 2010,
  • [39] A SURVEY OF MULTI KEYWORD SEARCH OVER THE ENCRYPTED DATA IN CLOUD
    Sathya, S.
    Gayathri, J.
    Radhika, D.
    IIOAB JOURNAL, 2016, 7 (09) : 600 - 607
  • [40] Preferred Keyword Search over Encrypted Data in Cloud Computing
    Shen, Zhirong
    Shu, Jiwu
    Xue, Wei
    2013 IEEE/ACM 21ST INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2013, : 207 - 212