Large Scale Multi-Actor Generative Dialog Modeling

被引:0
|
作者
Boyd, Alex [1 ]
Puri, Raul [2 ]
Shoeybi, Mohammad [2 ]
Patwary, Mostofa [2 ]
Catanzaro, Bryan [2 ]
机构
[1] Univ Calif Irvine, Dept Stat, Irvine, CA 92697 USA
[2] NVIDIA, Santa Clara, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Non-goal oriented dialog agents (i.e. chat-bots) aim to produce varying and engaging conversations with a user; however, they typically exhibit either inconsistent personality across conversations or the average personality of all users. This paper addresses these issues by controlling an agent's persona upon generation via conditioning on prior conversations of a target actor. In doing so, we are able to utilize more abstract patterns within a person's speech and better emulate them in generated responses. This work introduces the GENERATIVE CONVERSATION CONTROL model, an augmented and fine-tuned GPT-2 language model that conditions on past reference conversations to probabilistically model multi-turn conversations in the actor's persona. We introduce an accompanying data collection procedure to obtain 10.3M conversations from 6 months worth of Reddit comments. We demonstrate that scaling model sizes from 117M to 8.3B parameters yields an improvement from 23.14 to 13.14 perplexity on 1.7M held out Reddit conversations. Increasing model scale yielded similar improvements in human evaluations that measure preference of model samples to the held out target distribution in terms of realism (31% increased to 37% preference), style matching (37% to 42%), grammar and content quality (29% to 42%), and conversation coherency (32% to 40%). We find that conditionally modeling past conversations improves perplexity by 0.47 in automatic evaluations. Through human trials we identify positive trends between conditional modeling and style matching and outline steps to further improve persona control.
引用
收藏
页码:66 / 84
页数:19
相关论文
共 50 条
  • [31] Couples' adjustment to retirement: A multi-actor panel study
    van Solinge, H
    Henkens, K
    [J]. JOURNALS OF GERONTOLOGY SERIES B-PSYCHOLOGICAL SCIENCES AND SOCIAL SCIENCES, 2005, 60 (01): : S11 - S20
  • [32] QUALITATIVE REASONING WITH BLUFF AND BELIEFS IN A MULTI-ACTOR ENVIRONMENT
    LELOUCHE, R
    DOUBLAIT, S
    [J]. INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1992, 36 (02): : 149 - 165
  • [33] Simulating the value of collaboration in multi-actor conservation planning
    Gordon, Ascelin
    Bastin, Lucy
    Langford, William T.
    Lechner, Alex M.
    Bekessy, Sarah A.
    [J]. ECOLOGICAL MODELLING, 2013, 249 : 19 - 25
  • [34] Simulating the value of collaboration in multi-actor conservation planning
    Gordon, A.
    Langford, W. T.
    Bastin, L.
    Lechner, A. M.
    Bekessy, S. A.
    [J]. 19TH INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION (MODSIM2011), 2011, : 2233 - 2239
  • [35] Capabilities supporting digital servitization: A multi-actor perspective
    Marcon, Erico
    Marcon, Arthur
    Ayala, Nestor F.
    Frank, Alejandro G.
    Story, Vicky
    Burton, Jamie
    Raddats, Chris
    Zolkiewski, Judy
    [J]. INDUSTRIAL MARKETING MANAGEMENT, 2022, 103 : 97 - 116
  • [36] A Framework for Managing Data in Multi-actor Fabrication Processes
    Skoury, Lior
    Amtsberg, Felix
    Yang, Xiliu
    Wagner, Hans Jakob
    Menges, Achim
    Wortmann, Thomas
    [J]. TOWARDS RADICAL REGENERATION, 2023, : 601 - 615
  • [37] Modelling of multi-actor logistic chains with resources mutualization
    Hiohi, Laurenţiu
    Costescu, Dorinela
    Olteanu, Sergiu
    [J]. UPB Scientific Bulletin, Series D: Mechanical Engineering, 2016, 78 (03): : 31 - 42
  • [38] Multi-actor activity detection by modeling object relationships in extended videos based on deep learning
    Zhang, Binyu
    Wan, Junfeng
    Zhao, Yanyun
    Tong, Zhihang
    Du, Yunhao
    [J]. Engineering Applications of Artificial Intelligence, 2022, 114
  • [39] Multi-actor activity detection by modeling object relationships in extended videos based on deep learning
    Zhang, Binyu
    Wan, Junfeng
    Zhao, Yanyun
    Tong, Zhihang
    Du, Yunhao
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
  • [40] A multi-actor multi-criteria analysis of the performance of global cities
    Kourtit, Katima
    Macharis, Cathy
    Nijkamp, Peter
    [J]. APPLIED GEOGRAPHY, 2014, 49 : 24 - 36