Automatic Question Generation using RNN-based and Pre-trained Transformer-based Models in Low Resource Indonesian Language

Cited by: 2
Authors
Vincentio, Karissa [1 ]
Suhartono, Derwin [1 ]
Affiliation
[1] Bina Nusantara Univ, Sch Comp Sci, Comp Sci Dept, Jakarta 11530, Indonesia
Keywords
natural language processing; natural language generation; automatic question generation; recurrent neural network; long short-term memory; gated recurrent unit; transformer; fine-tuning
DOI
10.31449/inf.v46i7.4236
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
Although Indonesian is the fourth most frequently used language on the internet, NLP for Indonesian has not been studied intensively. One NLP application classified as an NLG task is Automatic Question Generation (AQG). The task has generally been performed well using rule-based and cloze-test approaches, but these depend heavily on the defined rules. While such approaches are suitable for small-scale automated question generation systems, they become less efficient as the scale of the system grows. Many NLG model architectures have recently shown significantly improved performance over earlier architectures, such as generative pre-trained transformers, text-to-text transfer transformers, bidirectional auto-regressive transformers, and many more. Previous studies on AQG in Indonesian were built on RNN-based architectures such as GRU and LSTM, as well as the Transformer. The performance of the models from previous studies is compared with state-of-the-art models: the multilingual models mBART and mT5, and the monolingual models IndoBART and IndoGPT. As a result, the fine-tuned IndoBART performed significantly better than both BiGRU and BiLSTM on the SQuAD dataset. The fine-tuned IndoBART also performed better on most metrics on the TyDiQA dataset, which has a smaller population than the SQuAD dataset.
Pages: 103-118
Page count: 16