A Speech-to-Speech, Machine Translation Mediated Map Task: An Exploratory Study

Authors
Cerrato, Loredana [1]
Akira, Hayakawa [1]
Campbell, Nick [1]
Luz, Saturnino [2]
Affiliations
[1] Trinity Coll Dublin, Sch Comp Sci & Stat, ADAPT Ctr, Dublin, Ireland
[2] Univ Edinburgh, Usher Inst Populat Hlth Sci & Informat, Edinburgh, Midlothian, Scotland
Keywords
Interlingual speech-to-speech translation; Repair strategies; Speaker alignment; Adaptation; User evaluation;
DOI
10.1007/978-3-319-33500-1_5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The aim of this study is to investigate how the language technologies of Automatic Speech Recognition (ASR), Machine Translation (MT), and Text To Speech (TTS) synthesis affect users during an interlingual interaction. In this paper, we describe the prototype system used for data collection, give details of the collected data, and report the results of a usability test run to assess how users of the interlingual system evaluate the interactions in a collaborative map task. We use widely adopted usability evaluation measures (ease of use, effectiveness, and user satisfaction) and look at both qualitative and quantitative measures. Results indicate that both users taking part in the dialogues (instruction giver and follower) found the system similarly satisfactory in terms of ease of learning, ease of use, and pleasantness, even though they were less satisfied with its effectiveness in supporting the task. Users employed different strategies to adapt to the shortcomings of the technology, such as hyper-articulation and rewording of utterances in response to ASR errors. We also report the results of a comparison of the map task in two different settings: one that includes a constant video stream ("video-on") and one that does not ("no-video"). Surprisingly, users consistently rated the no-video setting better.
Pages: 53-64
Number of pages: 12