A Weakly Supervised Method for Topic Segmentation and Labeling in Goal-oriented Dialogues via Reinforcement Learning

被引:0
|
作者
Takanobu, Ryuichi [1 ]
Huang, Minlie [1 ]
Zhao, Zhongzhou [2 ]
Li, Fenglin [2 ]
Chen, Haiqing [2 ]
Zhu, Xiaoyan [1 ]
Nie, Liqiang [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci, Conversat AI Grp, AI Lab,Beijing Natl Res Ctr Informat Sci & Techno, Beijing, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Shandong Univ, Jinan, Peoples R China
基金
美国国家科学基金会;
关键词
TEXT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Topic structure analysis plays a pivotal role in dialogue understanding. We propose a reinforcement learning (RL) method for topic segmentation and labeling in goal-oriented dialogues, which aims to detect topic boundaries among dialogue utterances and assign topic labels to the utterances. We address three common issues in the goal-oriented customer service dialogues: informality, local topic continuity, and global topic structure. We explore the task in a weakly supervised setting and formulate it as a sequential decision problem. The proposed method consists of a state representation network to address the informality issue, and a policy network with rewards to model local topic continuity and global topic structure. To train the two networks and offer a warm-start to the policy, we firstly use some keywords to annotate the data automatically. We then pre-train the networks on noisy data. Henceforth, the method continues to refine the data labels using the current policy to learn better state representations on the refined data for obtaining a better policy. Results demonstrate that this weakly supervised method obtains substantial improvements over state-of-the-art baselines.
引用
收藏
页码:4403 / 4410
页数:8
相关论文
共 50 条
  • [1] A Goal-Oriented Specification Language for Reinforcement Learning
    Schwan, Simon
    Kloes, Verena
    Glesner, Sabine
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, MDAI 2023, 2023, 13890 : 169 - 180
  • [2] GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning
    Liu, Jianfeng
    Pan, Feiyang
    Luo, Ling
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1793 - 1796
  • [3] Goal-Oriented Reinforcement Learning in THz-Enabled UAV-Aided Network Using Supervised Learning
    Termehchi, Atefeh
    Bao, Tingnan
    Syed, Aisha
    Kennedy, William Sean
    Erol-Kantarci, Melike
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 : 5027 - 5036
  • [4] A goal-oriented reinforcement learning for optimal drug dosage control
    Zhang, Qian
    Li, Tianhao
    Li, Dengfeng
    Lu, Wei
    ANNALS OF OPERATIONS RESEARCH, 2024, 338 (2-3) : 1403 - 1423
  • [5] Incorporating Joint Embeddings into Goal-Oriented Dialogues with Multi-task Learning
    Kassawat, Firas
    Chaudhuri, Debanjan
    Lehmann, Jens
    SEMANTIC WEB, ESWC 2019, 2019, 11503 : 225 - 239
  • [6] Goal-oriented Knowledge Reuse via Curriculum Evolution for Reinforcement Learning-based Adaptation
    Li, Jialong
    Zhang, Mingyue
    Mao, Zhenyu
    Zhao, Haiyan
    Jin, Zhi
    Honiden, Shinichi
    Tei, Kenji
    2022 29TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, APSEC, 2022, : 189 - 198
  • [7] Graph Enhanced Hierarchical Reinforcement Learning for Goal-oriented Learning Path Recommendation
    Li, Qingyao
    Xia, Wei
    Yin, Li'ang
    Shen, Jian
    Rui, Renting
    Zhang, Weinan
    Chen, Xianyu
    Tang, Ruiming
    Yu, Yong
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 1318 - 1327
  • [8] Weakly Supervised Disentangled Representation for Goal-Conditioned Reinforcement Learning
    Qian, Zhifeng
    You, Mingyu
    Zhou, Hongjun
    He, Bin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02): : 2202 - 2209
  • [9] Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues
    Jung, Youngsoo
    Lee, Jongmin
    Kim, Kee-Eung
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7994 - 8001
  • [10] Lesion region segmentation via weakly supervised learning
    Ran Yi
    Rui Zeng
    Yang Weng
    Minjing Yu
    YuKun Lai
    YongJin Liu
    Quantitative Biology, 2022, 10 (03) : 239 - 252