Encoding Document Semantic into Binary Codes Space

被引:0
|
作者
Yu, Zheng [1 ]
Zhao, Xiang [2 ]
Wang, Liping [1 ]
机构
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Natl Univ Def Technol, Changsha,, Hunan, Peoples R China
来源
WEB-AGE INFORMATION MANAGEMENT, WAIM 2014 | 2014年 / 8485卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We develop a deep neural network model to encode document semantic into compact binary codes with the elegant property that semantically similar documents have similar embedding codes. The deep learning model is constructed with three stacked auto-encoders. The input of the lowest auto-encoder is the representation of word-count vector of a document, while the learned hidden features of the deepest auto-encoder are thresholded to be binary codes to represent the document semantic. Retrieving similar document is very efficient by simply returning the documents whose codes have small Hamming distances to that of the query document. We illustrate the effectiveness of our model on two public real datasets - 20NewsGroup and Wikipedia, and the experiments demonstrate that the compact binary codes sufficiently embed the semantic of documents and bring improvement in retrieval accuracy.
引用
收藏
页码:535 / 539
页数:5
相关论文
共 50 条
  • [1] Learning Semantic Binary Codes by Encoding Attributes for Image Retrieval
    Luo, Jianwei
    Jiang, Zhiguo
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 279 - 284
  • [2] Semantic Binary Codes
    Bondugula, Sravanthi
    Davis, Larry S.
    ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 371 - 375
  • [3] Optimal encoding of binary cyclic codes
    Chen, Houshou
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2006, E89B (12) : 3280 - 3287
  • [4] Watermarking in binary document images using fractal codes
    Daraee, Fatemeh
    Mozaffari, Saeed
    PATTERN RECOGNITION LETTERS, 2014, 35 : 120 - 129
  • [5] Convolutional Encoding of Some Binary Quadratic Residue Codes
    Lee, Chong-Dao
    Truong, Trieu-Kien
    Chen, Yan-Haw
    IMECS 2009: INTERNATIONAL MULTI-CONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2009, : 319 - +
  • [6] SEMANTIC SPACE AND ENCODING SPACE IN SHORT-TERM-MEMORY
    WEEKS, DG
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1976, 8 (05) : 356 - 358
  • [7] New identifying codes in the binary Hamming space
    Charon, Irene
    Cohen, Gerard
    Hudry, Olivier
    Lobstein, Antoine
    EUROPEAN JOURNAL OF COMBINATORICS, 2010, 31 (02) : 491 - 501
  • [8] Efficient Systematic Encoding of Non-binary VT Codes
    Abroshan, Mahed
    Venkataramanan, Ramji
    Guillen i Fabregas, Albert
    2018 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2018, : 91 - 95
  • [9] Encoding changing country codes for the Semantic Web with ISO 3166 and SKOS
    Voss, Jakob
    METADATA AND SEMANTICS, 2009, : 211 - 221
  • [10] Modeling Semantic Encoding in a Common Neural Representational Space
    Van Uden, Cara E.
    Nastase, Samuel A.
    Connolly, Andrew C.
    Ma Feilong
    Hansen, Isabella
    Gobbini, M. Ida
    Haxby, James V.
    FRONTIERS IN NEUROSCIENCE, 2018, 12