Research on automatic pilot repetition generation method based on deep reinforcement learning

被引:2
|
作者
Pan, Weijun [1 ]
Jiang, Peiyuan [1 ]
Li, Yukun [1 ]
Wang, Zhuang [1 ]
Huang, Junxiang [2 ]
机构
[1] Civil Aviat Flight Univ China, Coll Air Traff Management, Air Traff Control Automation Lab, Deyang, Peoples R China
[2] East China Air Traff Management Bur, Dept Safety Management, Xiamen Air Traff Management Stn, Xiamen, Peoples R China
基金
中国国家自然科学基金;
关键词
controller training; transfer learning; text generation; reinforcement learning; generalization; RECOGNITION; EXTRACTION; AGENT;
D O I
10.3389/fnbot.2023.1285831
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Using computers to replace pilot seats in air traffic control (ATC) simulators is an effective way to improve controller training efficiency and reduce training costs. To achieve this, we propose a deep reinforcement learning model, RoBERTa-RL (RoBERTa with Reinforcement Learning), for generating pilot repetitions. RoBERTa-RL is based on the pre-trained language model RoBERTa and is optimized through transfer learning and reinforcement learning. Transfer learning is used to address the issue of scarce data in the ATC domain, while reinforcement learning algorithms are employed to optimize the RoBERTa model and overcome the limitations in model generalization caused by transfer learning. We selected a real-world area control dataset as the target task training and testing dataset, and a tower control dataset generated based on civil aviation radio land-air communication rules as the test dataset for evaluating model generalization. In terms of the ROUGE evaluation metrics, RoBERTa-RL achieved significant results on the area control dataset with ROUGE-1, ROUGE-2, and ROUGE-L scores of 0.9962, 0.992, and 0.996, respectively. On the tower control dataset, the scores were 0.982, 0.954, and 0.982, respectively. To overcome the limitations of ROUGE in this field, we conducted a detailed evaluation of the proposed model architecture using keyword-based evaluation criteria for the generated repetition instructions. This evaluation criterion calculates various keyword-based metrics based on the segmented results of the repetition instruction text. In the keyword-based evaluation criteria, the constructed model achieved an overall accuracy of 98.8% on the area control dataset and 81.8% on the tower control dataset. In terms of generalization, RoBERTa-RL improved accuracy by 56% compared to the model before improvement and achieved a 47.5% improvement compared to various comparative models. These results indicate that employing reinforcement learning strategies to enhance deep learning algorithms can effectively mitigate the issue of poor generalization in text generation tasks, and this approach holds promise for future application in other related domains.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Automatic View Generation with Deep Learning and Reinforcement Learning
    Yuan, Haitao
    Li, Guoliang
    Feng, Ling
    Sun, Ji
    Han, Yue
    [J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1501 - 1512
  • [2] Virtual Generation Alliance Automatic Generation Control Based on Deep Reinforcement Learning
    Li, Jiawen
    Yu, Tao
    [J]. IEEE ACCESS, 2020, 8 : 182204 - 182217
  • [3] Deep Reinforcement Learning for Automatic Thumbnail Generation
    Li, Zhuopeng
    Zhang, Xiaoyan
    [J]. MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 41 - 53
  • [4] Automatic Cell Rotation Method Based on Deep Reinforcement Learning
    Gong, Huiying
    Zhang, Yujie
    Liu, Yaowei
    Zhao, Qili
    Zhao, Xin
    Sun, Mingzhu
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 5452 - 5458
  • [5] DEEP REINFORCEMENT LEARNING-BASED AUTOMATIC TEST PATTERN GENERATION
    Li, Wenxing
    Lyu, Hongqin
    Liang, Shengwen
    Liu, Zizhen
    Lin, Ning
    Wang, Zhongrui
    Tian, Pengyu
    Wang, Tiancheng
    Li, Huawei
    [J]. CONFERENCE OF SCIENCE & TECHNOLOGY FOR INTEGRATED CIRCUITS, 2024 CSTIC, 2024,
  • [6] Research on Automatic Dance Generation System Based on Deep Learning
    Lan, Jia
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [7] Deep Reinforcement Learning for Automatic Generation Control of Wind Farms
    Vijayshankar, Sanjana
    Stanfel, Paul
    King, Jennifer
    Spyrou, Evangelia
    Johnson, Kathryn
    [J]. 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1796 - 1802
  • [8] Automatic depth matching method of well log based on deep reinforcement learning
    Xiong, Wenjun
    Xiao, Lizhi
    Yuan, Jiangru
    Yue, Wenzheng
    [J]. PETROLEUM EXPLORATION AND DEVELOPMENT, 2024, 51 (03) : 634 - 646
  • [9] Research on Robot Intelligent Control Method Based on Deep Reinforcement Learning
    Rao, Shu
    [J]. 2022 6TH INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, ISCSIC, 2022, : 221 - 225
  • [10] Automatic Ultrasound Guidance Based on Deep Reinforcement Learning
    Jarosik, Piotr
    Lewandowski, Marcin
    [J]. 2019 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2019, : 475 - 478