GEOGRAPHIC LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION

被引：0

作者：

Xiao, Xiaoqiang ^{[1
]}

Chen, Hong ^{[1
]}

Zylak, Mark ^{[1
]}

Sosa, Daniela ^{[1
]}

Desu, Suma ^{[1
]}

Krishnamoorthy, Mahesh ^{[1
]}

Liu, Daben ^{[1
]}

Paulik, Matthias ^{[1
]}

Zhang, Yuchen ^{[1
]}

机构：

[1] Apple Inc, Cupertino, CA 95014 USA

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年

关键词：

speech recognition; language model; Geo-LM; class LM; Combine Statistical Area; MOBILE; VOICE;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we propose improving automatic speech recognition (ASR) accuracy for local points of interest (POI) by leveraging a geo-specific language model (Geo-LM). Geographic regions are defined according to U.S. Census Bureau Combined Statistical Areas. Depending on the user's associated geographic region, for each user a class based Geo-LM is constructed dynamically within a difference-LM based weighted finite state transducer (WFST) system. The benefits of this approach include: improved accuracy for local POI name recognition, flexibility in training, and efficient LM construction at runtime. Our experiments show that the proposed Geo-LM achieves an average of over 18% relative word error rate (WER) reduction on the tasks of local POI search, with no degradation to the general accuracy and very limited latency increase, compared to the baseline nationwide general LM. In addition to accuracy improvement, we also discuss optimization of runtime efficiency.

引用

页码：6124 / 6128

页数：5

共 50 条

[21] CONVERTING NEURAL NETWORK LANGUAGE MODELS INTO BACK-OFF LANGUAGE MODELS FOR EFFICIENT DECODING IN AUTOMATIC SPEECH RECOGNITION
Arisoy, Ebru
Chen, Stanley F.
Ramabhadran, Bhuvana
Sethy, Abhinav
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8242 - 8246
[22] Creating Language and Acoustic Models using Kaldi to Build An Automatic Speech Recognition System for Kannada Language
Yadava, Thimmaraja G.
Jayanna, H. S.
[J]. 2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 161 - 165
[23] gram Approximation of Latent Words Language Models for Domain Robust Automatic Speech Recognition
Masumura, Ryo
Asami, Taichi
Oba, Takanobu
Masataki, Hirokazu
Sakauchi, Sumitaka
Takahashi, Satoshi
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (10): : 2462 - 2470
[24] Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition
Chen, Xie
Liu, Xunying
Wang, Yongqiang
Gales, Mark J. F.
Woodland, Philip C.
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 2146 - 2157
[25] End-To-End deep neural models for Automatic Speech Recognition for Polish Language
Pondel-Sycz, Karolina
Pietrzak, Agnieszka Paula
Szymla, Julia
[J]. INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2024, 70 (02) : 315 - 321
[26] Application of Morphosyntactic and Class-Based Language Models in Automatic Speech Recognition of Polish
Smywinski-Pohl, Alexsander
Ziolko, Bartosz
[J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2016, 25 (02)
[27] IMPROVED TEXT NORMALIZATION AND LANGUAGE MODELS FOR SPEED'S AUTOMATIC SPEECH RECOGNITION SYSTEM
Manolache, Cristian
Georgescu, Alexandru-Lucian
Cucu, Horia
Mititelu, Verginica Barbu
Burileanu, Corneliu
[J]. PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE LINGUISTIC RESOURCES AND TOOLS FOR NATURAL LANGUAGE PROCESSING, 2020, : 115 - 128
[28] A study of neural network Russian language models for automatic continuous speech recognition systems
I. S. Kipyatkova
A. A. Karpov
[J]. Automation and Remote Control, 2017, 78 : 858 - 867
[29] A study of neural network Russian language models for automatic continuous speech recognition systems
Kipyatkova, I. S.
Karpov, A. A.
[J]. AUTOMATION AND REMOTE CONTROL, 2017, 78 (05) : 858 - 867
[30] Domain Adaptation Based on Mixture of Latent Words Language Models for Automatic Speech Recognition
Masumura, Ryo
Asami, Taichi
Oba, Takanobu
Masataki, Hirokazu
Sakauchi, Sumitaka
Ito, Akinori
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (06): : 1581 - 1590

← 1 2 3 4 5 →