Building siamese attention-augmented recurrent convolutional neural networks for document similarity scoring

被引:9
|
作者
Han, Sifei [1 ]
Shi, Lingyun [1 ]
Richie, Russell [1 ]
Tsui, Fuchiang R. Rich [1 ,2 ]
机构
[1] Childrens Hosp Philadelphia, Dept Biomed & Hlth Informat, Tsui Lab, 2716 South St, Philadelphia, PA 19104 USA
[2] Univ Penn, Perelman Sch Med, 3400 Spruce St,Suite 680 Dulles, Philadelphia, PA USA
基金
美国国家科学基金会;
关键词
Attention neural network; Deep learning; Machine learning; Natural language processing; Information retrieval; Text similarity;
D O I
10.1016/j.ins.2022.10.032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatically measuring document similarity is imperative in natural language process-ing, with applications ranging from recommendation to duplicate document detection. State-of-the-art approach in document similarity commonly involves deep neural net-works, yet there is little study on how different architectures may be combined. Thus, we introduce the Siamese Attention-augmented Recurrent Convolutional Neural Network (S-ARCNN) that combines multiple neural network architectures. In each subnet-work of S-ARCNN, a document passes through a bidirectional Long Short-Term Memory (bi-LSTM) layer, which sends representations to local and global document modules. A local document module uses convolution, pooling, and attention layers, whereas a global document module uses last states of the bi-LSTM. Both local and global features are con-catenated to form a single document representation. Using the Quora Question Pairs data -set, we evaluated S-ARCNN, Siamese convolutional neural networks (S-CNNs), Siamese LSTM, and two BERT models. While S-CNNs (82.02% F1) outperformed S-ARCNN (79.83% F1) overall, S-ARCNN slightly outperformed S-CNN on duplicate question pairs with more than 50 words (39.96% vs. 39.42% accuracy). With the potential advantage of S-ARCNN for processing longer documents, S-ARCNN may help researchers identify collaborators with similar research interests, help editors find potential reviewers, or match resumes with job descriptions.(c) 2022 Published by Elsevier Inc.
引用
收藏
页码:90 / 102
页数:13
相关论文
共 50 条
  • [31] Emotional Talking Head Generation based on Memory-Sharing and Attention-Augmented Networks
    Wang, Jianrong
    Zhao, Yaxin
    Liu, Li
    Xu, Tianyi
    Li, Qi
    Li, Sen
    INTERSPEECH 2023, 2023, : 2 - 6
  • [32] SIMILARITY METRIC BASED ON SIAMESE NEURAL NETWORKS FOR VOICE CASTING
    Gresse, Adrien
    Quillot, Mathias
    Dufour, Richard
    Labatut, Vincent
    Bonastre, Jean-Francois
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6585 - 6589
  • [33] Deep Image Matching Based on Siamese Convolutional Neural Networks
    Dou, J.
    Tu, Z.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2025, 35 (01) : 1 - 15
  • [34] Detection of Image Manipulations Using Siamese Convolutional Neural Networks
    Mazumdar, Aniruddha
    Singh, Jaya
    Tomar, Yosha Singh
    Bora, P. K.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 226 - 233
  • [35] Siamese Convolutional Neural Networks for Remote Sensing Scene Classification
    Liu, Xuning
    Zhou, Yong
    Zhao, Jiaqi
    Yao, Rui
    Liu, Bing
    Zheng, Yi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (08) : 1200 - 1204
  • [36] SINGING STYLE INVESTIGATION BY RESIDUAL SIAMESE CONVOLUTIONAL NEURAL NETWORKS
    Wang, Cheng-i
    Tzanetakis, George
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 116 - 120
  • [37] Face Verification Using Convolutional Neural Networks with Siamese Architecture
    Bukovcikova, Zuzana
    Sopiak, Dominik
    Oravec, Milos
    Pavlovicova, Jarmila
    PROCEEDINGS OF 2017 INTERNATIONAL SYMPOSIUM ELMAR, 2017, : 205 - 208
  • [38] Detecting Object Defects with Fusioning Convolutional Siamese Neural Networks
    Nagy, Amr M.
    Czuni, Laszlo
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 157 - 163
  • [39] Automated diagnosis of bone metastasis based on multi-view bone scans using attention-augmented deep neural networks
    Pi, Yong
    Zhao, Zhen
    Xiang, Yongzhao
    Li, Yuhao
    Cai, Huawei
    Yi, Zhang
    MEDICAL IMAGE ANALYSIS, 2020, 65
  • [40] Extended Siamese Convolutional Neural Networks for Discriminative Feature Learning
    Lee, Sangyun
    Hong, Sungjun
    INTERNATIONAL JOURNAL OF FUZZY LOGIC AND INTELLIGENT SYSTEMS, 2022, 22 (04) : 339 - 349