Convolutional Neural Networks for Text Hashing

被引:0
|
作者
Xu, Jiaming [1 ]
Wang, Peng [1 ]
Tian, Guanhua [1 ]
Xu, Bo [1 ]
Zhao, Jun [2 ]
Wang, Fangyuan [1 ]
Hao, Hongwei [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] NLPR, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hashing, as a popular approximate nearest neighbor search, has been widely used for large-scale similarity search. Recently, a spectrum of machine learning methods are utilized to learn similarity-preserving binary codes. However, most of them directly encode the explicit features, keywords, which fail to preserve the accurate semantic similarities in binary code beyond keyword matching, especially on short texts. Here we propose a novel text hashing framework with convolutional neural networks. In particular, we first embed the keyword features into compact binary code with a locality preserving constraint. Meanwhile word features and position features are together fed into a convolutional network to learn the implicit features which are further incorporated with the explicit features to fit the pre-trained binary code. Such base method can be successfully accomplished without any external tags/labels, and other three model variations are designed to integrate tags/labels. Experimental results show the superiority of our proposed approach over several state-of-the-art hashing methods when tested on one short text dataset as well as one normal text dataset.
引用
收藏
页码:1369 / 1375
页数:7
相关论文
共 50 条
  • [1] Text normalization with convolutional neural networks
    Yolchuyeva, Sevinj
    Nemeth, Geza
    Gyires-Toth, Balint
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (03) : 589 - 600
  • [2] Text detection with convolutional neural networks
    Delakis, Manolis
    Garcia, Christophe
    [J]. VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2008, : 290 - 294
  • [3] An Image Perceptual Hashing Algorithm Based on Convolutional Neural Networks
    Yang, Meihong
    Qi, Baolin
    Xian, Yongjin
    Li, Jian
    [J]. DIGITAL FORENSICS AND WATERMARKING, IWDW 2023, 2024, 14511 : 95 - 108
  • [4] Reading Text in the Wild with Convolutional Neural Networks
    Max Jaderberg
    Karen Simonyan
    Andrea Vedaldi
    Andrew Zisserman
    [J]. International Journal of Computer Vision, 2016, 116 : 1 - 20
  • [5] On the Interpretation of Convolutional Neural Networks for Text Classification
    Xu, Jincheng
    Du, Qingfeng
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2252 - 2259
  • [6] Reading Text in the Wild with Convolutional Neural Networks
    Jaderberg, Max
    Simonyan, Karen
    Vedaldi, Andrea
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 116 (01) : 1 - 20
  • [7] Convolutional Neural Networks for Financial Text Regression
    Dereli, Nesat
    Saraclar, Murat
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, 2019, : 331 - 337
  • [8] Convolutional Recurrent Neural Networks for Text Classification
    Wang, Ruishuang
    Li, Zhao
    Cao, Jian
    Chen, Tong
    Wang, Lei
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [9] Recurrent Convolutional Neural Networks for Text Classification
    Lai, Siwei
    Xu, Liheng
    Liu, Kang
    Zhao, Jun
    [J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2267 - 2273
  • [10] Convolutional Recurrent Neural Networks for Text Classification
    Lyu, Shengfei
    Liu, Jiaqi
    [J]. JOURNAL OF DATABASE MANAGEMENT, 2021, 32 (04) : 65 - 82