A Weakly Supervised Method for Topic Segmentation and Labeling in Goal-oriented Dialogues via Reinforcement Learning

被引:0
|
作者
Takanobu, Ryuichi [1 ]
Huang, Minlie [1 ]
Zhao, Zhongzhou [2 ]
Li, Fenglin [2 ]
Chen, Haiqing [2 ]
Zhu, Xiaoyan [1 ]
Nie, Liqiang [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci, Conversat AI Grp, AI Lab,Beijing Natl Res Ctr Informat Sci & Techno, Beijing, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Shandong Univ, Jinan, Peoples R China
基金
美国国家科学基金会;
关键词
TEXT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Topic structure analysis plays a pivotal role in dialogue understanding. We propose a reinforcement learning (RL) method for topic segmentation and labeling in goal-oriented dialogues, which aims to detect topic boundaries among dialogue utterances and assign topic labels to the utterances. We address three common issues in the goal-oriented customer service dialogues: informality, local topic continuity, and global topic structure. We explore the task in a weakly supervised setting and formulate it as a sequential decision problem. The proposed method consists of a state representation network to address the informality issue, and a policy network with rewards to model local topic continuity and global topic structure. To train the two networks and offer a warm-start to the policy, we firstly use some keywords to annotate the data automatically. We then pre-train the networks on noisy data. Henceforth, the method continues to refine the data labels using the current policy to learn better state representations on the refined data for obtaining a better policy. Results demonstrate that this weakly supervised method obtains substantial improvements over state-of-the-art baselines.
引用
收藏
页码:4403 / 4410
页数:8
相关论文
共 50 条
  • [31] Goal-Oriented Communications in Federated Learning via Feedback on Risk-Averse Participation
    Pandey, Shashi Raj
    Van Phuc Bui
    Popovski, Petar
    2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
  • [32] Optimizing passengers' experience: A goal-oriented reinforcement learning speed control approach for urban railway trains
    Liu, Wangyang
    Feng, Qingsheng
    Li, Hong
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART F-JOURNAL OF RAIL AND RAPID TRANSIT, 2024, 238 (10) : 1283 - 1295
  • [33] Structural Optimization for Asymmetrical Inline Topology Filter With Transmission Zeros Using Goal-Oriented Reinforcement Learning
    Leong, Kiet Yew
    Soeung, Socheatra
    Cheab, Sovuthy
    Lu, Cheng-Kai
    IEEE ACCESS, 2024, 12 : 111386 - 111399
  • [34] Weakly Supervised Few-Shot Segmentation via Meta-Learning
    Gama, Pedro H. T.
    Oliveira, Hugo
    Marcato Jr, Jose
    dos Santos, Jefersson A.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1784 - 1797
  • [35] Boundary-RL: Reinforcement Learning for Weakly-Supervised Prostate Segmentation in TRUS Images
    Yi, Weixi
    Stavrinides, Vasilis
    Baum, Zachary M. C.
    Yang, Qianye
    Barratt, Dean C.
    Clarkson, Matthew J.
    Hu, Yipeng
    Saeed, Shaheer U.
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT I, 2024, 14348 : 277 - 288
  • [36] Multi-Task Learning of System Dialogue Act Selection for Supervised Pretraining of Goal-Oriented Dialogue Policies
    McLeod, Sarah
    Kruijff-Korbayova, Ivana
    Kiefer, Bernd
    20TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2019), 2019, : 411 - 417
  • [37] Fast interactive medical image segmentation with weakly supervised deep learning method
    Girum, Kibrom Berihu
    Crehange, Gilles
    Hussain, Raabid
    Lalande, Alain
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2020, 15 (09) : 1437 - 1444
  • [38] Fast interactive medical image segmentation with weakly supervised deep learning method
    Kibrom Berihu Girum
    Gilles Créhange
    Raabid Hussain
    Alain Lalande
    International Journal of Computer Assisted Radiology and Surgery, 2020, 15 : 1437 - 1444
  • [39] Weakly Supervised Reinforcement Learning for Autonomous Highway Driving via Virtual Safety Cages
    Kuutti, Sampo
    Bowden, Richard
    Fallah, Saber
    SENSORS, 2021, 21 (06) : 1 - 16
  • [40] A pseudo-labeling based weakly supervised segmentation method for few-shot texture images
    Han, Yuexing
    Li, Ruiqi
    Wang, Bing
    Ruan, Liheng
    Chen, Qiaochuan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238