Building siamese attention-augmented recurrent convolutional neural networks for document similarity scoring

被引:9
|
作者
Han, Sifei [1 ]
Shi, Lingyun [1 ]
Richie, Russell [1 ]
Tsui, Fuchiang R. Rich [1 ,2 ]
机构
[1] Childrens Hosp Philadelphia, Dept Biomed & Hlth Informat, Tsui Lab, 2716 South St, Philadelphia, PA 19104 USA
[2] Univ Penn, Perelman Sch Med, 3400 Spruce St,Suite 680 Dulles, Philadelphia, PA USA
基金
美国国家科学基金会;
关键词
Attention neural network; Deep learning; Machine learning; Natural language processing; Information retrieval; Text similarity;
D O I
10.1016/j.ins.2022.10.032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatically measuring document similarity is imperative in natural language process-ing, with applications ranging from recommendation to duplicate document detection. State-of-the-art approach in document similarity commonly involves deep neural net-works, yet there is little study on how different architectures may be combined. Thus, we introduce the Siamese Attention-augmented Recurrent Convolutional Neural Network (S-ARCNN) that combines multiple neural network architectures. In each subnet-work of S-ARCNN, a document passes through a bidirectional Long Short-Term Memory (bi-LSTM) layer, which sends representations to local and global document modules. A local document module uses convolution, pooling, and attention layers, whereas a global document module uses last states of the bi-LSTM. Both local and global features are con-catenated to form a single document representation. Using the Quora Question Pairs data -set, we evaluated S-ARCNN, Siamese convolutional neural networks (S-CNNs), Siamese LSTM, and two BERT models. While S-CNNs (82.02% F1) outperformed S-ARCNN (79.83% F1) overall, S-ARCNN slightly outperformed S-CNN on duplicate question pairs with more than 50 words (39.96% vs. 39.42% accuracy). With the potential advantage of S-ARCNN for processing longer documents, S-ARCNN may help researchers identify collaborators with similar research interests, help editors find potential reviewers, or match resumes with job descriptions.(c) 2022 Published by Elsevier Inc.
引用
收藏
页码:90 / 102
页数:13
相关论文
共 50 条
  • [1] Remaining Useful Life Estimation of Aircraft Engines Using Siamese Attention-Augmented Quantum Convolutional Neural Networks
    Ali, Al-Moayed Zaid Abdulrazaq Ali
    Abdulaziz, Al-Qubati Mohammed Ahmed
    Ahmed, Al-Jonaid Amjad Mohammed
    Wang, Cheng Long
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024, 2024, : 1366 - 1371
  • [2] Audio classification using attention-augmented convolutional neural network
    Wu, Yu
    Mao, Hua
    Yi, Zhang
    KNOWLEDGE-BASED SYSTEMS, 2018, 161 : 90 - 100
  • [3] An Attention-augmented Fully Convolutional Neural Network for Monaural Speech Enhancement
    Xu, Zezheng
    Jiang, Ting
    Li, Chao
    Yu, Jiacheng
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [4] A Trimodel SAR Semisupervised Recognition Method Based on Attention-Augmented Convolutional Networks
    Yan, Sifan
    Zhang, Yaotian
    Gao, Fei
    Sun, Jinping
    Hussain, Amir
    Zhou, Huiyu
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 9566 - 9583
  • [5] CSI-Fingerprinting Indoor Localization via Attention-Augmented Residual Convolutional Neural Network
    Zhang, Bowen
    Sifaou, Houssem
    Li, Geoffrey Ye
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (08) : 5583 - 5597
  • [6] Enhancing Plant Disease Detection Using Attention-Augmented Residual Networks and Faster Region-Convolutional Networks
    Sathya, K.
    Balakrishnan, Arunkumar
    Baskaran, P.
    Ramamoorthy, Arun Kumar
    IEEE ACCESS, 2025, 13 : 48625 - 48642
  • [7] Siamese Convolutional Neural Networks to Quantify Crack Pattern Similarity in Masonry Facades
    Rozsas, Arpad
    Slobbe, Arthur
    Huizinga, Wyke
    Kruithof, Maarten
    Pillai, Krishna Ajithkumar
    Kleijn, Kelvin
    Giardina, Giorgia
    INTERNATIONAL JOURNAL OF ARCHITECTURAL HERITAGE, 2023, 17 (01) : 147 - 169
  • [8] An Attention-Augmented Convolutional Neural Network With Focal Loss for Mixed-Type Wafer Defect Classification
    Batool, Uzma
    Shapiai, Mohd Ibrahim
    Mostafa, Salama A.
    Ibrahim, Mohd Zamri
    IEEE ACCESS, 2023, 11 : 108891 - 108905
  • [9] Attention Augmented Convolutional Networks
    Bello, Irwan
    Zoph, Barret
    Vaswani, Ashish
    Shlens, Jonathon
    Le, Quoc V.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3285 - 3294
  • [10] Applying Siamese Hierarchical Attention Neural Networks for multi-document summarization
    Angel Gonzalez, Jose
    Delonca, Julien
    Sanchis, Emilio
    Garcia-Granada, Fernando
    Segarra, Encarna
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2019, (63): : 111 - 118