Building siamese attention-augmented recurrent convolutional neural networks for document similarity scoring

被引:9
|
作者
Han, Sifei [1 ]
Shi, Lingyun [1 ]
Richie, Russell [1 ]
Tsui, Fuchiang R. Rich [1 ,2 ]
机构
[1] Childrens Hosp Philadelphia, Dept Biomed & Hlth Informat, Tsui Lab, 2716 South St, Philadelphia, PA 19104 USA
[2] Univ Penn, Perelman Sch Med, 3400 Spruce St,Suite 680 Dulles, Philadelphia, PA USA
基金
美国国家科学基金会;
关键词
Attention neural network; Deep learning; Machine learning; Natural language processing; Information retrieval; Text similarity;
D O I
10.1016/j.ins.2022.10.032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatically measuring document similarity is imperative in natural language process-ing, with applications ranging from recommendation to duplicate document detection. State-of-the-art approach in document similarity commonly involves deep neural net-works, yet there is little study on how different architectures may be combined. Thus, we introduce the Siamese Attention-augmented Recurrent Convolutional Neural Network (S-ARCNN) that combines multiple neural network architectures. In each subnet-work of S-ARCNN, a document passes through a bidirectional Long Short-Term Memory (bi-LSTM) layer, which sends representations to local and global document modules. A local document module uses convolution, pooling, and attention layers, whereas a global document module uses last states of the bi-LSTM. Both local and global features are con-catenated to form a single document representation. Using the Quora Question Pairs data -set, we evaluated S-ARCNN, Siamese convolutional neural networks (S-CNNs), Siamese LSTM, and two BERT models. While S-CNNs (82.02% F1) outperformed S-ARCNN (79.83% F1) overall, S-ARCNN slightly outperformed S-CNN on duplicate question pairs with more than 50 words (39.96% vs. 39.42% accuracy). With the potential advantage of S-ARCNN for processing longer documents, S-ARCNN may help researchers identify collaborators with similar research interests, help editors find potential reviewers, or match resumes with job descriptions.(c) 2022 Published by Elsevier Inc.
引用
收藏
页码:90 / 102
页数:13
相关论文
共 50 条
  • [21] Enhancing resolution uniformity of spectral line confocal 3D sensors through integration of vision transformer with perpendicular attention-augmented parallel convolutional neural networks
    Wang, Shuai
    Zheng, Zechen
    Diao, Kuan
    Liu, Xiaojun
    MEASUREMENT, 2025, 249
  • [22] Automated fundus ultrasound image classification based on siamese convolutional neural networks with multi-attention
    Jiachen Tan
    Yongquan Dong
    Junchi Li
    BMC Medical Imaging, 23
  • [23] DIRECTION FINDING USING CONVOLUTIONAL NEURAL NETWORKS and CONVOLUTIONAL RECURRENT NEURAL NETWORKS
    Uckun, Fehmi Ayberk
    Ozer, Hakan
    Nurbas, Ekin
    Onat, Emrah
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [24] Automated fundus ultrasound image classification based on siamese convolutional neural networks with multi-attention
    Tan, Jiachen
    Dong, Yongquan
    Li, Junchi
    BMC MEDICAL IMAGING, 2023, 23 (01)
  • [25] Convolutional Neural Networks with Recurrent Neural Filters
    Yang, Yi
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 912 - 917
  • [26] Enhanced fault diagnosis of rolling bearings using attention-augmented separable residual networks
    Liang, Chuang
    Mu, Xuelin
    Zhang, Xiaoguang
    Lu, Fanfan
    Wang, Chengcheng
    Shao, Yubo
    ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2025, 61
  • [27] An Attention Module for Convolutional Neural Networks
    Zhu, Baozhou
    Hofstee, Peter
    Lee, Jinho
    Al-Ars, Zaid
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 167 - 178
  • [28] Reparameterized attention for convolutional neural networks
    Wu, Yiming
    Li, Ruixiang
    Yu, Yunlong
    Li, Xi
    PATTERN RECOGNITION LETTERS, 2022, 164 : 89 - 95
  • [29] Semi-Supervised Attention-Augmented Convolutional Autoencoder for Radar-Based Human Activity Recognition
    Campbell, Christopher
    Ahmad, Fauzia
    RADAR SENSOR TECHNOLOGY XXVI, 2022, 12108
  • [30] Siamese Neural Networks Method for Semantic Requirements Similarity Detection
    Alnajem, Nojoom A.
    Binkhonain, Manal
    Hossain, M. Shamim
    IEEE ACCESS, 2024, 12 : 140932 - 140947