Using of Transformers Models for Text Classification to Mobile Educational Applications

Cited by: 2
Authors
Garrido, Anabel Pilicita [1]
Arias, Enrique Barra [1]
Affiliations
[1] Univ Politecn Madrid, Madrid, Spain
Keywords
Bit error rate; Transformers; Internet; Training; Text categorization; Recurrent neural networks; IEEE transactions; Natural Language Processing; Multiclass Text Classification; Bidirectional Encoder Representations from Transformers;
DOI
10.1109/TLA.2023.10172138
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In Q2 2022, educational apps were the second most popular category on the Google Play Store, accounting for 10.47% of the apps available worldwide. This work explores the use of five pre-trained BERT-based models built on the Transformer architecture to classify mobile educational applications. The five models, selected according to the knowledge field, are bert-base-cased, bert-base-uncased, roberta-base, albert-base-v2, and distilbert-base-uncased. The study uses a dataset of educational apps from Google Play, which was enriched with each app's description and category because it lacked this information. For every model, a tokenizer was applied and the model was fine-tuned for the classification task. After training, a testing phase was performed; the models went through four training epochs to obtain better results: roberta-base reached 81% accuracy, bert-base-cased 80%, bert-base-uncased 79%, albert-base-v2 78%, and distilbert-base-uncased 76%.
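The workflow summarized in the abstract (tokenize, fine-tune for four epochs, then measure test accuracy) might be sketched roughly as follows with the Hugging Face `transformers` library. This is an illustrative sketch, not the authors' code: the helper names `build_label_maps`, `accuracy`, and `fine_tune`, the `max_length` of 256, the batch size of 16, and the `out` directory are all assumptions, and the dataset itself is not reproduced here.

```python
from typing import Dict, Sequence, Tuple

# The five checkpoints named in the abstract.
MODEL_NAMES = [
    "bert-base-cased",
    "bert-base-uncased",
    "roberta-base",
    "albert-base-v2",
    "distilbert-base-uncased",
]
NUM_EPOCHS = 4  # the abstract reports four training epochs


def build_label_maps(categories: Sequence[str]) -> Tuple[Dict[str, int], Dict[int, str]]:
    """Map each distinct category name to an integer id and back."""
    names = sorted(set(categories))
    label2id = {name: i for i, name in enumerate(names)}
    id2label = {i: name for name, i in label2id.items()}
    return label2id, id2label


def accuracy(predicted: Sequence[int], expected: Sequence[int]) -> float:
    """Fraction of predictions that match the expected labels."""
    if len(predicted) != len(expected) or not expected:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(p == e for p, e in zip(predicted, expected)) / len(expected)


def fine_tune(model_name, train_texts, train_cats, test_texts, test_cats,
              output_dir="out"):
    """Hypothetical helper: fine-tune one checkpoint, return test accuracy."""
    # Heavy dependencies are imported lazily so the helpers above work alone.
    import torch
    from transformers import (AutoModelForSequenceClassification,
                              AutoTokenizer, Trainer, TrainingArguments)

    label2id, id2label = build_label_maps(list(train_cats) + list(test_cats))
    tokenizer = AutoTokenizer.from_pretrained(model_name)

    class AppDataset(torch.utils.data.Dataset):
        """App descriptions tokenized once, paired with category ids."""

        def __init__(self, texts, cats):
            # max_length=256 is an assumed value, not taken from the paper.
            self.enc = tokenizer(list(texts), truncation=True,
                                 padding=True, max_length=256)
            self.labels = [label2id[c] for c in cats]

        def __len__(self):
            return len(self.labels)

        def __getitem__(self, i):
            item = {k: torch.tensor(v[i]) for k, v in self.enc.items()}
            item["labels"] = torch.tensor(self.labels[i])
            return item

    model = AutoModelForSequenceClassification.from_pretrained(
        model_name, num_labels=len(label2id),
        id2label=id2label, label2id=label2id)

    def compute_metrics(eval_pred):
        predicted = eval_pred.predictions.argmax(axis=-1)
        return {"accuracy": accuracy(predicted.tolist(),
                                     eval_pred.label_ids.tolist())}

    # Batch size 16 is an assumed value, not taken from the paper.
    args = TrainingArguments(output_dir=output_dir,
                             num_train_epochs=NUM_EPOCHS,
                             per_device_train_batch_size=16)
    trainer = Trainer(model=model, args=args,
                      train_dataset=AppDataset(train_texts, train_cats),
                      eval_dataset=AppDataset(test_texts, test_cats),
                      compute_metrics=compute_metrics)
    trainer.train()
    return trainer.evaluate()["eval_accuracy"]
```

Calling `fine_tune(name, ...)` once per entry in `MODEL_NAMES` with the same train/test split would give a per-model comparison of the kind reported in the abstract; the figures obtained would depend on the data and hyperparameters used.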
Pages: 730-736
Number of pages: 7