CROSS-LINGUAL TOPIC PREDICTION FOR SPEECH USING TRANSLATIONS

被引:0
|
作者
Bansal, Sameer [1 ]
Kamper, Herman [2 ]
Lopez, Adam [1 ]
Goldwater, Sharon [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[2] Stellenbosch Univ, Dept E&E Engn, Stellenbosch, South Africa
关键词
speech translation; low-resource speech processing; speech classification; unwritten languages; SPOKEN CONTENT; RECOGNITION; RETRIEVAL; LANGUAGE;
D O I
10.1109/icassp40776.2020.9054169
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Given a large amount of unannotated speech in a low-resource language, can we classify the speech utterances by topic? We consider this question in the setting where a small amount of speech in the low-resource language is paired with text translations in a high-resource language. We develop an effective cross-lingual topic classifier by training on just 20 hours of translated speech, using a recent model for direct speech-to-text translation. While the translations are poor, they are still good enough to correctly classify the topic of 1-minute speech segments over 70% of the time-a 20% improvement over a majority-class baseline. Such a system could be useful for humanitarian applications like crisis response, where incoming speech in a foreign low-resource language must be quickly assessed for further action.
引用
收藏
页码:8164 / 8168
页数:5
相关论文
共 50 条
  • [1] Cross-lingual Link Prediction Using Multimodal Relational Topic Models
    Sakata, Yosuke
    Eguchi, Koji
    [J]. 2016 IEEE/ACIS 15TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2016, : 951 - 958
  • [2] Cross-Lingual Latent Topic Extraction
    Zhang, Duo
    Mei, Qiaozhu
    Zhai, ChengXiang
    [J]. ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2010, : 1128 - 1137
  • [3] Cross-lingual embeddings with auxiliary topic models
    Zhou, Dong
    Peng, Xiaoya
    Li, Lin
    Han, Jun-mei
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 190
  • [4] Investigation of Cross-lingual Depression Prediction Possibilities Based on Speech Processing
    Kiss, Gabor
    Vicsi, Klara
    [J]. 2017 8TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2017, : 97 - 101
  • [5] Effectiveness of Automatic Translations for Cross-Lingual Ontology Mapping
    Abu Helou, Mamoun
    Palmonari, Matteo
    Jarrar, Mustafa
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2016, 55 : 165 - 208
  • [6] Cross-lingual Dialog Model for Speech to Speech Translation
    Ettelaie, Emil
    Georgiou, Panayiotis G.
    Narayanan, Shrikanth
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1173 - 1176
  • [7] Cross-Lingual Automatic Speech Recognition Using Tandem Features
    Lal, Partha
    King, Simon
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (12): : 2506 - 2515
  • [8] Improving hate speech detection using Cross-Lingual Learning
    Firmino, Anderson Almeida
    Baptista, Claudio de Souza
    de Paiva, Anselmo Cardoso
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [9] Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech
    Wester, Mirjam
    Liang, Hui
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2492 - 2495
  • [10] Cross-lingual Lexical Sememe Prediction
    Qi, Fanchao
    Lin, Yankai
    Sun, Maosong
    Zhu, Hao
    Xie, Ruobing
    Liu, Zhiyuan
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 358 - 368