IMPROVING LANGUAGE IDENTIFICATION FOR MULTILINGUAL SPEAKERS

Cited by: 0

Authors:

Titus, Andrew [1]

Silovsky, Jan [1]

Chen, Nanxin [1,2]

Hsiao, Roger [1]

Young, Mary [1]

Ghoshal, Arnab [1]

Affiliations:

[1] Apple, Cupertino, CA 95014 USA

[2] Johns Hopkins Univ, Baltimore, MD 21218 USA

Source:

2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020

Keywords:

Language identification; multilingual; RECOGNITION;

DOI:

10.1109/icassp40776.2020.9053057

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Spoken language identification (LID) technologies have improved in recent years from discriminating largely distinct languages to discriminating highly similar languages or even dialects of the same language. One aspect that has been mostly neglected, however, is discrimination of languages for multilingual speakers, even though these speakers are a primary target audience of many systems that utilize LID technologies. As we show in this work, LID systems can have a high average accuracy for most combinations of languages while greatly underperforming for others when accented speech is present. We address this by using coarser-grained targets for the acoustic LID model and integrating its outputs with interaction context signals in a context-aware model to tailor the system to each user. This combined system achieves an average 97% accuracy across all language combinations while improving worst-case accuracy by over 60% relative to our baseline.
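
The idea of combining acoustic LID outputs with per-user context, as described in the abstract, can be illustrated with a minimal sketch. This is not the authors' model: it assumes a simple product-of-probabilities rule, and the function name `fuse_with_context`, the language codes, and all numbers are hypothetical, chosen only to show how a user-specific prior can shift an uncertain acoustic decision.

```python
def fuse_with_context(acoustic_posteriors, context_prior):
    """Combine acoustic LID scores with a per-user context prior.

    Assumption: a naive product of the two distributions followed by
    renormalization. The paper's context-aware model is not specified
    in this record and may work differently.
    """
    # Only score languages present in both distributions.
    languages = acoustic_posteriors.keys() & context_prior.keys()
    unnormalized = {
        lang: acoustic_posteriors[lang] * context_prior[lang]
        for lang in languages
    }
    total = sum(unnormalized.values())
    if total == 0:
        raise ValueError("no language has non-zero combined probability")
    return {lang: score / total for lang, score in unnormalized.items()}


# Hypothetical example: accented speech makes the acoustic model lean
# slightly toward Spanish, but this user's history is mostly English.
acoustic = {"en": 0.4, "es": 0.6}
user_prior = {"en": 0.9, "es": 0.1}

fused = fuse_with_context(acoustic, user_prior)
best_language = max(fused, key=fused.get)
print(best_language, fused)
```

In this toy setting the context prior outweighs the weak acoustic preference, which is the kind of per-user tailoring the abstract refers to; a real system would presumably learn the combination rather than fix it by hand.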

Pages: 8284-8288

Page count: 5
