Chinese-keyword Fuzzy Search and Extraction over Encrypted Patent Documents

Cited by: 0
Authors
Ding, Wei [1]
Liu, Yongji [1]
Zhang, Jianfeng [2]
Affiliations
[1] China Def Sci & Technol Informat Ctr, Beijing 100036, Peoples R China
[2] Natl Univ Def Technol, Changsha 410073, Hunan, Peoples R China
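Funding
China Postdoctoral Science Foundation
Keywords
Chinese Keywords; Fuzzy Search; Extraction; Encrypted Documents
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract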
Cloud storage for information sharing is likely to be indispensable to China's future national defence library, for example for searching national defence patent documents, and data encryption is needed to minimize the associated security risks. Patent keywords are a high-level summary of a patent document, so extracting and searching them efficiently is of practical importance. Because of the particular characteristics of Chinese keywords, most existing algorithms designed for English-language environments are ineffective in Chinese scenarios. Manual keyword extraction is impractical when the number of files is large, so an improved method based on term frequency-inverse document frequency (TF-IDF) is proposed to extract keywords from patent literature automatically. The extracted keyword sets also accelerate keyword search by linking a finite set of keywords to a large number of documents. Fuzzy keyword search is introduced to further increase search efficiency in cloud computing scenarios compared with exact keyword search methods. Based on Chinese Pinyin similarity, a Pinyin-Gram-based algorithm is proposed for fuzzy search in an encrypted Chinese environment, and a keyword trapdoor search index structure based on an n-ary tree is designed. Both the search efficiency and the accuracy of the proposed scheme are verified through computer experiments.
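As a rough illustration of two ideas named in the abstract, the sketch below scores keywords with plain TF-IDF and compares two Pinyin strings by the overlap of their character grams. It is not the paper's method: the record does not describe the improved TF-IDF variant or the exact Pinyin-Gram algorithm, so the unmodified TF-IDF formula, the bigram size, the Jaccard measure, and all function names here are assumptions. Documents are assumed to be already segmented into words, keywords are assumed to be already converted to Pinyin, and encryption, trapdoors, and the n-ary tree index are left out.

```python
import math
from collections import Counter


def tfidf_keywords(docs, top_k=3):
    """Return the top_k highest-scoring terms for each tokenized document.

    Assumption: plain TF-IDF, not the improved variant from the paper.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    results = []
    for doc in docs:
        tf = Counter(doc)
        scores = {
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in tf.items()
        }
        # Sort by descending score, breaking ties alphabetically.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        results.append(ranked[:top_k])
    return results


def pinyin_grams(pinyin, n=2):
    """Return the set of character n-grams of a Pinyin string."""
    return {pinyin[i:i + n] for i in range(len(pinyin) - n + 1)}


def gram_similarity(a, b, n=2):
    """Jaccard similarity between the n-gram sets of two Pinyin strings.

    Assumption: Jaccard overlap stands in for the paper's similarity measure.
    """
    ga, gb = pinyin_grams(a, n), pinyin_grams(b, n)
    if not ga and not gb:
        return 1.0
    return len(ga & gb) / len(ga | gb)


if __name__ == "__main__":
    # Hypothetical pre-segmented documents.
    docs = [
        ["加密", "检索", "加密", "专利"],
        ["专利", "检索", "云"],
        ["专利", "索引"],
    ]
    print(tfidf_keywords(docs, top_k=1))
    # "jiami" (加密) versus "jiemi" (解密) share two of six distinct bigrams.
    print(gram_similarity("jiami", "jiemi"))
```

A term that occurs in every document, such as 专利 in the sample data, gets a score of zero, which is why it never ranks first. In a fuzzy search, a query could be treated as a candidate match when its gram similarity to an indexed keyword exceeds a chosen threshold; that threshold is a tuning parameter not given in this record.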
Pages: 168-176
Number of pages: 9