End-to-End Transformer-Based Models in Textual-Based NLP

Cited by: 27
Authors
Rahali, Abir [1 ]
Akhloufi, Moulay A. [1 ]
Affiliation
[1] Univ Moncton, Dept Comp Sci, Percept Robot & Intelligent Machines Res Grp PRIME, Moncton, NB E1A 3E9, Canada
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC);
Keywords
Transformers; deep learning; natural language processing; transfer learning; PRE-TRAINED BERT; PREDICTION; SYSTEMS;
DOI
10.3390/ai4010004
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory];
Discipline codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Transformer architectures are highly expressive because they use self-attention mechanisms to encode long-range dependencies in the input sequences. In this paper, we present a literature review on Transformer-based (TB) models, providing a detailed overview of each model in comparison to the Transformer's standard architecture. This survey focuses on TB models used in the field of Natural Language Processing (NLP) for textual-based tasks. We begin with an overview of the fundamental concepts at the heart of the success of these models. Then, we classify them based on their architecture and training mode. We compare the advantages and disadvantages of popular techniques in terms of architectural design and experimental value. Finally, we discuss open research directions and potential future work to help solve current TB application challenges in NLP.
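The self-attention mechanism the abstract refers to is, at its core, scaled dot-product attention from the original Transformer: every position attends to every other position, so long-range dependencies are encoded in a single step. A minimal NumPy sketch (illustrative only, not code from the surveyed paper):

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    # Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # pairwise similarity between all positions
    # numerically stable softmax over the key axis
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V  # each output is a weighted mix of all value vectors

# toy example: a sequence of 4 positions with model dimension 8
rng = np.random.default_rng(0)
X = rng.standard_normal((4, 8))
out = scaled_dot_product_attention(X, X, X)  # self-attention: Q = K = V = X
print(out.shape)  # (4, 8)
```

Because the attention weights connect every pair of positions directly, the path length between any two tokens is constant, which is the property that makes these architectures effective on long-range dependencies.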
Pages: 54-110
Page count: 57
Related papers
50 records in total
  • [1] Transformer-based end-to-end scene text recognition
    Zhu, Xinghao
    Zhang, Zhi
    PROCEEDINGS OF THE 2021 IEEE 16TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2021), 2021, : 1691 - 1695
  • [2] Transformer-Based End-to-End Anatomical and Functional Image Fusion
    Zhang, Jing
    Liu, Aiping
    Wang, Dan
    Liu, Yu
    Wang, Z. Jane
    Chen, Xun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [3] Transformer-based End-to-End Object Detection in Aerial Images
    Vo, Nguyen D.
    Le, Nguyen
    Ngo, Giang
    Doan, Du
    Le, Do
    Nguyen, Khang
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 1072 - 1079
  • [4] A Transformer-Based End-to-End Automatic Speech Recognition Algorithm
    Dong, Fang
    Qian, Yiyang
    Wang, Tianlei
    Liu, Peng
    Cao, Jiuwen
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1592 - 1596
  • [5] SymFormer: End-to-End Symbolic Regression Using Transformer-Based Architecture
    Vastl, Martin
    Kulhanek, Jonas
    Kubalik, Jiri
    Derner, Erik
    Babuska, Robert
    IEEE ACCESS, 2024, 12 : 37840 - 37849
  • [6] An End-to-End Transformer-Based Automatic Speech Recognition for Qur'an Reciters
    Hadwan, Mohammed
    Alsayadi, Hamzah A.
    AL-Hagree, Salah
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (02): : 3471 - 3487
  • [7] Transformer-Based End-to-End Speech Translation With Rotary Position Embedding
    Li, Xueqing
    Li, Shengqiang
    Zhang, Xiao-Lei
    Rahardja, Susanto
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 371 - 375
  • [8] End-to-end information fusion method for transformer-based stereo matching
    Xu, Zhenghui
    Wang, Jingxue
    Guo, Jun
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (06)
  • [9] Transformer-based Long-context End-to-end Speech Recognition
    Hori, Takaaki
    Moritz, Niko
    Hori, Chiori
    Le Roux, Jonathan
    INTERSPEECH 2020, 2020, : 5011 - 5015
  • [10] On-device Streaming Transformer-based End-to-End Speech Recognition
    Oh, Yoo Rhee
    Park, Kiyoung
    INTERSPEECH 2021, 2021, : 967 - 968