Self-Regulated Interactive Sequence-to-Sequence Learning

Cited by: 0
Authors
Kreutzer, Julia [1]
Riezler, Stefan [1, 2]
Affiliations
[1] Heidelberg Univ, Computat Linguist, Heidelberg, Germany
[2] Heidelberg Univ, IWR, Heidelberg, Germany
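Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract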
Not all types of supervision signals are created equal: Different types of feedback have different costs and effects on learning. We show how self-regulation strategies that decide when to ask for which kind of feedback from a teacher (or from oneself) can be cast as a learning-to-learn problem leading to improved cost-aware sequence-to-sequence learning. In experiments on interactive neural machine translation, we find that the self-regulator discovers an epsilon-greedy strategy for the optimal cost-quality trade-off by mixing different feedback types including corrections, error markups, and self-supervision. Furthermore, we demonstrate its robustness under domain shift and identify it as a promising alternative to active learning.
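The epsilon-greedy behaviour mentioned in the abstract can be illustrated with a small sketch. This is not the authors' self-regulator, which the abstract describes as learned; it is a plain epsilon-greedy bandit over the three feedback types named above. The function names, the per-type quality gains and costs, and the reward shape (quality gain divided by cost plus one) are all hypothetical values chosen for illustration.

```python
import random

# Feedback types named in the abstract.
FEEDBACK_TYPES = ["correction", "error_markup", "self_supervision"]


def select_feedback(values, epsilon, rng):
    """Explore a random feedback type with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        return rng.choice(list(values))
    return max(values, key=values.get)


def update_value(values, counts, action, reward):
    """Keep an incremental running mean of the rewards seen per feedback type."""
    counts[action] += 1
    values[action] += (reward - values[action]) / counts[action]


def cost_aware_reward(quality_gain, cost):
    """Hypothetical trade-off: quality gain discounted by the feedback cost."""
    return quality_gain / (cost + 1.0)


def simulate(steps=500, epsilon=0.1, seed=0):
    """Run the bandit on made-up (quality gain, cost) profiles."""
    rng = random.Random(seed)
    # Assumed numbers, not taken from the paper.
    profile = {
        "correction": (1.0, 5.0),        # most informative, most expensive
        "error_markup": (0.6, 1.0),      # cheaper, weaker signal
        "self_supervision": (0.1, 0.0),  # free, weakest signal
    }
    values = {name: 0.0 for name in FEEDBACK_TYPES}
    counts = {name: 0 for name in FEEDBACK_TYPES}
    for _ in range(steps):
        action = select_feedback(values, epsilon, rng)
        gain, cost = profile[action]
        noisy_gain = gain + rng.gauss(0.0, 0.05)
        update_value(values, counts, action, cost_aware_reward(noisy_gain, cost))
    return values, counts


if __name__ == "__main__":
    values, counts = simulate()
    print(values)
    print(counts)
```

Setting epsilon to 0 makes the choice purely greedy, while larger values spend more of the budget exploring feedback types that currently look worse.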
Pages: 303-315
Page count: 13