Large-Scale Multimodal Movie Dialogue Corpus

Cited by: 3
Authors
Yasuhara, Ryu [1]
Inoue, Masashi [1]
Suga, Ikuya [1]
Kosaka, Tetsuo [1]
Affiliation
[1] Yamagata Univ, 3-16,4 Jyonan, Yonezawa, Yamagata, Japan
Keywords
Dialogue; Multimodal; Corpus; Movie; Film; VAD; DNN
DOI
10.1145/2993148.2998523
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present an outline of our newly created multimodal dialogue corpus, which is constructed from public domain movies. Dialogues in movies are useful sources for analyzing human communication patterns. In addition, they can be used to train machine-learning-based dialogue processing systems. However, movie files are expensive to process and contain large portions of non-dialogue segments. Therefore, we created a corpus that contains only the dialogue segments of movies. The corpus contains 165,368 dialogue segments taken from 1,722 movies. These dialogues are segmented automatically using deep neural network-based voice activity detection (VAD) combined with filtering rules. Our corpus can reduce the human workload and machine-processing effort required to analyze human dialogue behavior in movies.
Pages: 414-415
Number of pages: 2