A Fully Attention-Based Information Retriever

被引:0
|
作者
Correia, Alvaro H. C. [1 ]
Silva, Jorge L. M. [1 ]
Martins, Thiago de C. [1 ]
Cozman, Fabio G. [1 ]
机构
[1] Univ Sao Paulo, Escola Politecn, Sao Paulo, Brazil
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recurrent neural networks are now the state-of-the-art in natural language processing because they can build rich contextual representations and process texts of arbitrary length. However, recent developments on attention mechanisms have equipped feedforward networks with similar capabilities, hence enabling faster computations due to the increase in the number of operations that can be parallelized. We explore this new type of architecture in the domain of question-answering and propose a novel approach that we call Fully Attention Based Information Retriever (FABIR). We show that FABIR achieves competitive results in the Stanford Question Answering Dataset (SQuAD) while having fewer parameters and being faster at both learning and inference than rival methods.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Attention-based texture segregation
    Thomas V. Papathomas
    Andrei Gorea
    Akos Feher
    Tiffany E. Conway
    [J]. Perception & Psychophysics, 1999, 61 : 1399 - 1410
  • [32] Attention-based visual processes
    Cavanagh, P
    [J]. INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1996, 31 (3-4) : 3363 - 3363
  • [33] ATTENTION-BASED MOTION PERCEPTION
    CAVANAGH, P
    [J]. SCIENCE, 1992, 257 (5076) : 1563 - 1565
  • [34] Attention-based quantum tomography
    Cha, Peter
    Ginsparg, Paul
    Wu, Felix
    Carrasquilla, Juan
    McMahon, Peter L.
    Kim, Eun-Ah
    [J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (01):
  • [35] A Comparative Study of Information Extraction Strategies Using an Attention-Based Neural Network
    Tarride, Solene
    Lemaitre, Aurelie
    Couasnon, Bertrand
    Tardivel, Sophie
    [J]. DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 644 - 658
  • [36] DAWE: A Double Attention-Based Word Embedding Model with Sememe Structure Information
    Li, Shengwen
    Chen, Renyao
    Wan, Bo
    Gong, Junfang
    Yang, Lin
    Yao, Hong
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (17):
  • [37] Attention-based information composition for multicontext-aware recommendation in ubiquitous computing
    Kim, Sungrim
    Kwon, Joonhee
    [J]. SMART SENSING AND CONTEXT, PROCEEDINGS, 2006, 4272 : 230 - 233
  • [38] Structured Information Extraction of Pathology Reports with Attention-based Graph Convolutional Network
    Wu, Jialun
    Tang, Kaiwen
    Zhang, Haichuan
    Wang, Chunbao
    Li, Chen
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2395 - 2402
  • [39] An Attention-Based Model for Travel Energy Consumption of Electric Vehicle with Traffic Information
    Li, Shen
    Zhang, Hailong
    Tan, Huachun
    Zhong, Zhiyu
    Jiang, Zhuxi
    [J]. ADVANCES IN CIVIL ENGINEERING, 2021, 2021
  • [40] GNSS jamming detection using attention-based mutual information feature selection
    Ali Reda
    Tamer Mekkawy
    [J]. Discover Applied Sciences, 6