A Weakly Supervised Method for Topic Segmentation and Labeling in Goal-oriented Dialogues via Reinforcement Learning

被引:0
|
作者
Takanobu, Ryuichi [1 ]
Huang, Minlie [1 ]
Zhao, Zhongzhou [2 ]
Li, Fenglin [2 ]
Chen, Haiqing [2 ]
Zhu, Xiaoyan [1 ]
Nie, Liqiang [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci, Conversat AI Grp, AI Lab,Beijing Natl Res Ctr Informat Sci & Techno, Beijing, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Shandong Univ, Jinan, Peoples R China
基金
美国国家科学基金会;
关键词
TEXT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Topic structure analysis plays a pivotal role in dialogue understanding. We propose a reinforcement learning (RL) method for topic segmentation and labeling in goal-oriented dialogues, which aims to detect topic boundaries among dialogue utterances and assign topic labels to the utterances. We address three common issues in the goal-oriented customer service dialogues: informality, local topic continuity, and global topic structure. We explore the task in a weakly supervised setting and formulate it as a sequential decision problem. The proposed method consists of a state representation network to address the informality issue, and a policy network with rewards to model local topic continuity and global topic structure. To train the two networks and offer a warm-start to the policy, we firstly use some keywords to annotate the data automatically. We then pre-train the networks on noisy data. Henceforth, the method continues to refine the data labels using the current policy to learn better state representations on the refined data for obtaining a better policy. Results demonstrate that this weakly supervised method obtains substantial improvements over state-of-the-art baselines.
引用
收藏
页码:4403 / 4410
页数:8
相关论文
共 50 条
  • [21] Forced convection heat transfer control for cylinder via closed-loop continuous goal-oriented reinforcement learning
    Liu, Yangwei
    Wang, Feitong
    Zhao, Shihang
    Tang, Yumeng
    PHYSICS OF FLUIDS, 2024, 36 (11)
  • [22] Supervised Learning of Internal Models for Autonomous Goal-Oriented Robot Navigation using Reservoir Computing
    Antonelo, Eric A.
    Schrauwen, Benjamin
    2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 2959 - 2964
  • [23] HiTKG: Towards Goal-Oriented Conversations via Multi-Hierarchy Learning
    Ni, Jinjie
    Pandelea, Vlad
    Young, Tom
    Zhou, Haicang
    Cambria, Erik
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11112 - 11120
  • [24] Weakly Supervised Semantic Segmentation via Adversarial Learning of Classifier and Reconstructor
    Kweon, Hyeokjun
    Yoon, Sung-Hoon
    Yoon, Kuk-Jin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11329 - 11339
  • [25] Goal-Oriented Navigation with Avoiding Obstacle based on Deep Reinforcement Learning in Continuous Action Space
    Hien, Pham Xuan
    Kim, Gon-Woo
    2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 8 - 11
  • [26] Weakly Supervised Brain Lesion Segmentation via Attentional Representation Learning
    Wu, Kai
    Du, Bowen
    Luo, Man
    Wen, Hongkai
    Shen, Yiran
    Feng, Jianfeng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT III, 2019, 11766 : 211 - 219
  • [27] Multi-objective reinforcement learning in process control: A goal-oriented approach with adaptive thresholds
    Li, Dazi
    Gu, Wentao
    Song, Tianheng
    JOURNAL OF PROCESS CONTROL, 2023, 129
  • [28] Deep Reinforcement Learning for Weakly-Supervised Lymph Node Segmentation in CT Images
    Li, Zhe
    Xia, Yong
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (03) : 774 - 783
  • [29] An optic disk semantic segmentation method based on weakly supervised learning
    Pan, Feng
    Lu, Zheng
    Chen, Dali
    Xue, Dingyu
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 4791 - 4794
  • [30] An Efficient Weakly-Supervised Learning Method for Optic Disc Segmentation
    Wen, Yang
    Chen, Leiting
    Qiao, Lifeng
    Zhou, Chuan
    Xi, Shuo
    Guo, Rui
    Deng, Yu
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 835 - 842