Multi-speaker experimental designs: Methodological considerations

Cited by: 5

Authors:

Offrede, Tom [1]

Fuchs, Susanne [2]

Mooshammer, Christine [1]

Affiliations:

[1] Humboldt Univ, Inst Deutsch Sprache & Linguist, Dorotheenstr 24, D-10117 Berlin, Germany

[2] Leibniz Ctr Gen Linguist ZAS, Berlin, Germany

Source:

LANGUAGE AND LINGUISTICS COMPASS | 2021 / Vol. 15 / Issue 12

Funding:

EU Horizon 2020;

Keywords:

VENTILATION; GUIDELINES; TUTORIAL; VOICE;

DOI:

10.1111/lnc3.12443

Chinese Library Classification (CLC) number:

H [Language and Writing];

Discipline classification code:

05 ;

Abstract:

Research on language use has become increasingly interested in the multimodal and interactional aspects of language. Theoretical models of dialogue, such as Communication Accommodation Theory and the Interactive Alignment Model, are examples of this. In addition, researchers have started to give more consideration to the relationship between physiological processes and language use. This article aims to contribute to the advancement of studies of physiological and/or multimodal language use in naturalistic settings. It does so by providing methodological recommendations for such multi-speaker experimental designs. It covers the topics of (a) speaker preparation and logistics, (b) experimental tasks and (c) data synchronisation and post-processing. The types of data considered in further detail include audio and video, electroencephalography, respiratory data and electromagnetic articulography. This overview with recommendations is based on answers to a questionnaire sent to members of the Horizon 2020 research network 'Conversational Brains' and to several researchers in the field, and on interviews with three additional experts.


Pages: 24
