Natural Language Instruction-following with Task-related Language Development and Translation

被引:0
|
作者
Pang, Jing-Cheng [1 ]
Yang, Xinyu [1 ]
Yang, Si-Hang [1 ]
Chen, Xiong-Hui [1 ]
Yu, Yang [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Polixir Technol, Nanjing, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural language-conditioned reinforcement learning (RL) enables agents to follow human instructions. Previous approaches generally implemented language-conditioned RL by providing the policy with human instructions in natural language (NL) and training the policy to follow instructions. In this is outside-in approach, the policy must comprehend the NL and manage the task simultaneously. However, the unbounded NL examples often bring much extra complexity for solving concrete RL tasks, which can distract policy learning from completing the task. To ease the learning burden of the policy, we investigate an inside-out scheme for natural language-conditioned RL by developing a task language (TL) that is task-related and easily understood by the policy, thus reducing the policy learning burden. Besides, we employ a translator to translate natural language into the TL, which is used in RL to achieve efficient policy training. We implement this scheme as TALAR (TAsk Language with predicAte Representation) that learns multiple predicates to model object relationships as the TL. Experiments indicate that TALAR not only better comprehends NL instructions but also leads to a better instruction-following policy that significantly improves the success rate over baselines and adapts to unseen expressions of NL instruction. Besides, the TL is also an effective sub-task abstraction compatible with hierarchical RL.
引用
收藏
页数:31
相关论文
共 50 条
  • [31] ON-LINE TRANSLATION OF NATURAL LANGUAGE QUESTIONS INTO ARTIFICIAL LANGUAGE QUERIES
    KELLOGG, CH
    INFORMATION STORAGE AND RETRIEVAL, 1968, 4 (03): : 287 - &
  • [32] Task-related modulation of early cortical responses during language production: An event-related synthetic aperture Magnetometry study
    Herdman, Anthony T.
    Pang, Elizabeth W.
    Ressel, Volker
    Gaetz, William
    Cheyne, Douglas
    CEREBRAL CORTEX, 2007, 17 (11) : 2536 - 2543
  • [33] Combining Functional Neuroimaging with Off-line Brain Stimulation: Modulation of Task-related Activity in Language Areas
    Andoh, Jamila
    Paus, Tomas
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2011, 23 (02) : 349 - 361
  • [34] Effects of early child-care on cognition, language, and task-related behaviours at 18 months: An English study
    Sylva, Kathy
    Stein, Alan
    Leach, Penelope
    Barnes, Jacqueline
    Malmberg, Lars-Erik
    BRITISH JOURNAL OF DEVELOPMENTAL PSYCHOLOGY, 2011, 29 (01) : 18 - 45
  • [35] Natural Language Ontology of Action: A Gap with Huge Consequences for Natural Language Understanding and Machine Translation
    Moneglia, Massimo
    HUMAN LANGUAGE TECHNOLOGY CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, 2014, 8387 : 379 - 395
  • [36] Prior language knowledge and intercomprehension at the first encounter of Italian as an additional language: A translation task
    Smidfelt, Linda
    Van De Weijer, Joost
    MODERNA SPRAK, 2019, 113 (01): : 1 - 24
  • [37] Language and translation as management problems: A new task for education
    Lambert, J
    TEACHING TRANSLATION AND INTERPRETING 3: NEW HORIZONS, 1996, 16 : 271 - 293
  • [38] Multi-Task Learning for Multiple Language Translation
    Dong, Daxiang
    Wu, Hua
    He, Wei
    Yu, Dianhai
    Wang, Haifeng
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 1723 - 1732
  • [39] An Evolutionary Method for Natural Language to SQL Translation
    Afonso, Alexandre
    Brito, Leonardo
    Vale, Oto
    SIMULATED EVOLUTION AND LEARNING, PROCEEDINGS, 2008, 5361 : 432 - +
  • [40] Handbook of Natural Language Processing and Machine Translation
    Rossi, Kimmo
    MACHINE TRANSLATION, 2013, 27 (01) : 73 - 76