Evaluating the Performance of ASR Systems for TV Interactions in Several Domestic Noise Scenarios

被引:0
|
作者
Beca, Pedro [1 ]
Abreu, Jorge [1 ]
Santos, Rita [1 ]
Rodrigues, Ana [1 ]
机构
[1] Univ Aveiro, Dept Commun & Arts, Digimedia, Aveiro, Portugal
关键词
Natural language interaction; ASR evaluation; TV interaction; Automatic speech recognition; AUTOMATIC SPEECH RECOGNITION;
D O I
10.1007/978-3-030-23862-9_12
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Voice interaction with the television is becoming a reality on domestic environments. However, one of the factors that influences the correct operation of these systems is the background noise that obstructs the performance of the automatic speech recognition (ASR) component. In order to further understand this issue, the paper presents an analysis of the performance of three ASR systems (Bing Speech API, Google API, and Nuance ASR) in several domestic noise scenarios resembling the interaction with the TV on a domestic context. A group of 36 users was asked to utter sentences based on TV requests, where the sentences' corpus comprised typical phrases used when interacting with the TV. To better know the behavior, performance and robustness of each ASR to noise, the tests were carried out with three recording devices placed at different distances from the user. Google ASR proved to be the most robust to noise with a higher recognition precision, followed by Bing Speech and Nuance. The results obtained showed that ASR systems performance is globally quite robust but tends to deteriorate with domestic background noise. Future replications of the evaluation setup will allow the evaluation of ASR solutions in other scenarios.
引用
收藏
页码:162 / 175
页数:14
相关论文
共 5 条
  • [1] Objective evaluation of noise reduction performance in TV-systems
    Puttenstein, JG
    Heynderickx, I
    de Haan, G
    [J]. 2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2002, : 69 - 72
  • [2] Analyzing the performance of ASR systems The effects of noise, distance to the device, age and gender
    Rodrigues, Ana
    Santos, Rita
    Abreu, Jorge
    Beca, Pedro
    Almeida, Pedro
    Fernandes, Silvia
    [J]. PROCEEDINGS OF THE XX INTERNATIONAL CONFERENCE ON HUMAN-COMPUTER INTERACTION (INTERACCION'2019), 2019,
  • [3] Transport modeling and multivariate adaptive regression splines for evaluating performance of ASR systems in freshwater aquifers
    Forghani, Ali
    Peralta, Richard C.
    [J]. JOURNAL OF HYDROLOGY, 2017, 553 : 540 - 548
  • [4] Evaluating the impact of consumer behaviour on the performance of domestic solar water heating systems in South Africa
    Ijumba, Pamela
    Sebitosi, Adoniya Ben
    [J]. JOURNAL OF ENERGY IN SOUTHERN AFRICA, 2010, 21 (01) : 25 - 34
  • [5] Evaluating the Performance of State-of-the-Art ASR Systems on Non-Native English using Corpora with Extensive Language Background Variation
    Hollands, Samuel
    Blackburn, Daniel
    Christensen, Heidi
    [J]. INTERSPEECH 2022, 2022, : 3958 - 3962