Speech Rate Calculations with Short Utterances: A Study from a Speech-to-Speech, Machine Translation Mediated Map Task

被引：0

作者：

Akira, Hayakawa ^{[1
]}

Vogel, Carl ^{[1
]}

Luz, Saturnino ^{[2
]}

Campbell, Nick ^{[1
]}

机构：

[1] Trinity Coll Dublin, Sch Comp Sci & Stat, Dublin, Ireland

[2] Univ Edinburgh, Usher Inst Populat Hlth Sci & Informat, Edinburgh, Midlothian, Scotland

来源：

PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018) | 2018年

基金：

爱尔兰科学基金会;

关键词：

speech rate; utterance duration comparison; task oriented dialogues;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

The motivation for this paper is to present a way to verify if an utterance within a corpus is pronounced at a fast or slow pace. An alternative method to the well-known Word-Per-Minute (wpm) method for cases where this approach is not applicable. For long segmentations, such as the full introduction section of a speech or presentation, the measurement of wpm is a viable option. For short comparisons of the same single word or multiple syllables, Syllables-Per-Second (sps) is also a viable option. However, when there are multiple short utterances that are frequent in task oriented dialogues or natural free flowing conversation, such as those of the direct Human-to-Human dialogues of the HCRC Map Task corpus or the computer mediated inter-lingual dialogues of the ILMT-s2s corpus, it becomes difficult to obtain a meaningful value for the utterance speech rate. In this paper we explain the method used to provide a alternative speech rate value to the utterance of the ILMT-s2s corpus and the HCRC Map Task corpus.

引用

页码：3176 / 3183

页数：8

共 40 条

[1] A Speech-to-Speech, Machine Translation Mediated Map Task: An Exploratory Study
Cerrato, Loredana
Akira, Hayakawa
Campbell, Nick
Luz, Saturnino
[J]. FUTURE AND EMERGENT TRENDS IN LANGUAGE TECHNOLOGY, FETLT 2015, 2016, 9577 : 53 - 64
[2] Speech Rate Comparison when Talking to a System and Talking to a Human: A study from a Speech-to-Speech, Machine Translation mediated Map Task
Akira, Hayakawa
Vogel, Carl
Luz, Saturnino
Campbell, Nick
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3286 - 3290
[3] Talking to a System and Talking to a Human: A study from a Speech-to-Speech, Machine Translation mediated Map Task
Akira, Hayakawa
Luz, Saturnino
Campbell, Nick
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1422 - 1426
[4] Perception Changes With and Without a Video Channel: A Study from a Speech-to-Speech, Machine Translation Mediated Map Task
Akira, Hayakawa
Vogel, Carl
Campbell, Nick
Luz, Saturnino
[J]. 2017 8TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2017, : 401 - 406
[5] Towards Machine Speech-to-speech Translation
Satoshi, Nakamura
Sudoh, Katsuhito
Sakti, Sakriani
[J]. TRADUMATICA-TRADUCCIO I TECNOLOGIES DE LA INFORMACIO I LA COMUNICACIO, 2019, (17): : 81 - 87
[6] Impacts of machine translation and speech synthesis on speech-to-speech translation
Hashimoto, Kei
Yamagishi, Junichi
Byrne, William
King, Simon
Tokuda, Keiichi
[J]. SPEECH COMMUNICATION, 2012, 54 (07) : 857 - 866
[7] AN ANALYSIS OF MACHINE TRANSLATION AND SPEECH SYNTHESIS IN SPEECH-TO-SPEECH TRANSLATION SYSTEM
Hashimoto, Kei
Yamagishi, Junichi
Byrne, William
King, Simon
Tokuda, Keiichi
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5108 - 5111
[8] Semantic transfer in speech-to-speech machine translation
Abb, B
Buschbeck-Wolf, B
Tschernitschek, C
[J]. NATURAL LANGUAGE PROCESSING AND SPEECH TECHNOLOGY: RESULTS OF THE 3RD KONVENS CONFERENCE, 1996, : 123 - 136
[9] INTENT TRANSFER IN SPEECH-TO-SPEECH MACHINE TRANSLATION
Anumanchipalli, Gopala Krishna
Oliveira, Luis C.
Black, Alan W.
[J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 153 - 158
[10] Enriching machine-mediated speech-to-speech translation using contextual information
Sridhar, Vivek Kumar Rangarajan
Bangalore, Srinivas
Narayanan, Shrikanth
[J]. COMPUTER SPEECH AND LANGUAGE, 2013, 27 (02): : 492 - 508

← 1 2 3 4 →