Multi-speaker experimental designs: Methodological considerations

被引:5
|
作者
Offrede, Tom [1 ]
Fuchs, Susanne [2 ]
Mooshammer, Christine [1 ]
机构
[1] Humboldt Univ, Inst Deutsch Sprache & Linguist, Dorotheenstr 24, D-10117 Berlin, Germany
[2] Leibniz Ctr Gen Linguist ZAS, Berlin, Germany
来源
LANGUAGE AND LINGUISTICS COMPASS | 2021年 / 15卷 / 12期
基金
欧盟地平线“2020”;
关键词
VENTILATION; GUIDELINES; TUTORIAL; VOICE;
D O I
10.1111/lnc3.12443
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Research on language use has become increasingly interested in the multimodal and interactional aspects of language - theoretical models of dialogue, such as the Communication Accommodation Theory and the Interactive Alignment Model are examples of this. In addition, researchers have started to give more consideration to the relationship between physiological processes and language use. This article aims to contribute to the advancement in studies of physiological and/or multimodal language use in naturalistic settings. It does so by providing methodological recommendations for such multi-speaker experimental designs. It covers the topics of (a) speaker preparation and logistics, (b) experimental tasks and (c) data synchronisation and post-processing. The types of data that will be considered in further detail include audio and video, electroencephalography, respiratory data and electromagnetic articulography. This overview with recommendations is based on the answers to a questionnaire that was sent amongst the members of the Horizon 2020 research network 'Conversational Brains, several researchers in the field and interviews with three additional experts.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Improving Multi-Speaker Tacotron with Speaker Gating Mechanisms
    Zhao, Wei
    Xu, Li
    He, Ting
    [J]. PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7498 - 7503
  • [2] Multi-array multi-speaker tracking
    Potamitis, I
    Tremoulis, G
    Fakotakis, N
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2003, 2807 : 206 - 213
  • [3] A hybrid approach to speaker recognition in multi-speaker environment
    Trivedi, J
    Maitra, A
    Mitra, SK
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 272 - 275
  • [4] Automatic speaker clustering from multi-speaker utterances
    McLaughlin, J
    Reynolds, D
    Singer, E
    O'Leary, GC
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 817 - 820
  • [5] Automatic speaker clustering from multi-speaker utterances
    MIT Lincoln Lab, Lexington, United States
    [J]. ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, (817-820):
  • [6] Speaker Clustering with Penalty Distance for Speaker Verification with Multi-Speaker Speech
    Das, Rohan Kumar
    Yang, Jichen
    Li, Haizhou
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1630 - 1635
  • [7] Multi-Speaker Meeting Audio Segmentation
    Nwe, Tin Lay
    Dong, Minghui
    Khine, Swe Zin Kalayar
    Li, Haizhou
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2522 - 2525
  • [8] JOINTLY RECOGNIZING MULTI-SPEAKER CONVERSATIONS
    Ji, Gang
    Bilmes, Jeff
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5110 - 5113
  • [9] Speaker conditioned acoustic modeling for multi-speaker conversational ASR
    Chetupalli, Srikanth Raj
    Ganapathy, Sriram
    [J]. INTERSPEECH 2022, 2022, : 3834 - 3838
  • [10] Multi-Speaker Dialogue for Vehicular Navigation and Assistance
    Hsien-Chang Wang
    Jhing-Fa Wang
    [J]. International Journal of Speech Technology, 2004, 7 (2-3) : 231 - 244