Leveraging Knowledge and Reinforcement Learning for Enhanced Reliability of Language Models

被引:1
|
作者
Tyagi, Nancy [1 ]
Sarkar, Surjodeep [1 ]
Gaur, Manas [1 ]
机构
[1] Univ Maryland Baltimore Cty, Baltimore, MD 21250 USA
关键词
Natural Language Processing; Language Models; Ensemble; Reinforcement Learning; Knowledge Infusion; Reliability;
D O I
10.1145/3583780.3615273
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Natural Language Processing (NLP) community has been using crowd-sourcing techniques to create benchmark datasets such as General Language Understanding and Evaluation (GLUE) for training modern Language Models (LMs) such as BERT. GLUE tasks measure the reliability scores using inter-annotator metrics - Cohen's Kappa (kappa). However, the reliability aspect of LMs has often been overlooked. To counter this problem, we explore a knowledge-guided LM ensembling approach that leverages reinforcement learning to integrate knowledge from ConceptNet and Wikipedia as knowledge graph embeddings. This approach mimics human annotators resorting to external knowledge to compensate for information deficits in the datasets. Across nine GLUE datasets, our research shows that ensembling strengthens reliability and accuracy scores, outperforming state-of-the-art.
引用
收藏
页码:4320 / 4324
页数:5
相关论文
共 50 条
  • [41] Large Language Models Are Semi-Parametric Reinforcement Learning Agents
    Zhang, Danyang
    Chen, Lu
    Zhang, Situo
    Xu, Hongshen
    Zhao, Zihan
    Yu, Kai
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [42] Reinforcement Learning With Large Language Models (LLMs) Interaction For Network Services
    Du, Hongyang
    Zhang, Ruichen
    Niyato, Dusit
    Kang, Jiawen
    Xiong, Zehui
    Kim, Dong In
    2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 799 - 803
  • [43] Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning
    Carta, Thomas
    Romac, Clement
    Wolf, Thomas
    Lamprier, Sylvain
    Sigaud, Olivier
    Oudeyer, Pierre-Yves
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [44] Lexical Error Guard: Leveraging Large Language Models for Enhanced ASR Error Correction
    Si, Mei
    Cobas, Omar
    Fababeir, Michael
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2024, 6 (04): : 2435 - 2446
  • [45] Leveraging Large Language Models for Enhanced Classification and Analysis: Fire Incidents Case Study
    Alkhammash, Eman H.
    FIRE-SWITZERLAND, 2025, 8 (01):
  • [46] Leveraging Procedural Generation to Benchmark Reinforcement Learning
    Cobbe, Karl
    Hesse, Christopher
    Hilton, Jacob
    Schulman, John
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [47] Leveraging Procedural Generation to Benchmark Reinforcement Learning
    Cobbe, Karl
    Hesse, Christopher
    Hilton, Jacob
    Schulman, John
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [48] Leveraging More of Biology in Evolutionary Reinforcement Learning
    Gasperov, Bruno
    Durasevic, Marko
    Jakobovic, Domagoj
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2024, PT II, 2024, 14635 : 91 - 114
  • [49] Reinforcement Learning for Reliability Optimisation
    Saka, Prasuna
    Banerjee, Ansuman
    THIRTEENTH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING ADVANCES (ICSEA 2018), 2018, : 25 - 32
  • [50] Knowledge Transfer between Multi-granularity Models for Reinforcement Learning
    Wang, Lan
    Tang, Kaiqiang
    Xin, Bo
    Chen, Chunlin
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 2881 - 2886