CROSS-LINGUAL TOPIC PREDICTION FOR SPEECH USING TRANSLATIONS

被引:0
|
作者
Bansal, Sameer [1 ]
Kamper, Herman [2 ]
Lopez, Adam [1 ]
Goldwater, Sharon [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[2] Stellenbosch Univ, Dept E&E Engn, Stellenbosch, South Africa
关键词
speech translation; low-resource speech processing; speech classification; unwritten languages; SPOKEN CONTENT; RECOGNITION; RETRIEVAL; LANGUAGE;
D O I
10.1109/icassp40776.2020.9054169
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Given a large amount of unannotated speech in a low-resource language, can we classify the speech utterances by topic? We consider this question in the setting where a small amount of speech in the low-resource language is paired with text translations in a high-resource language. We develop an effective cross-lingual topic classifier by training on just 20 hours of translated speech, using a recent model for direct speech-to-text translation. While the translations are poor, they are still good enough to correctly classify the topic of 1-minute speech segments over 70% of the time-a 20% improvement over a majority-class baseline. Such a system could be useful for humanitarian applications like crisis response, where incoming speech in a foreign low-resource language must be quickly assessed for further action.
引用
收藏
页码:8164 / 8168
页数:5
相关论文
共 50 条
  • [31] Neural topic-enhanced cross-lingual word embeddings for CLIR
    Zhou, Dong
    Qu, Wei
    Li, Lin
    Tang, Mingdong
    Yang, Aimin
    [J]. INFORMATION SCIENCES, 2022, 608 : 809 - 824
  • [32] Cross-lingual Contextualized Topic Models with Zero-shot Learning
    Bianchi, Federico
    Terragni, Silvia
    Hovy, Dirk
    Nozza, Debora
    Fersini, Elisabetta
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1676 - 1683
  • [33] A word embedding-based approach to cross-lingual topic modeling
    Chang, Chia-Hsuan
    Hwang, San-Yih
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (06) : 1529 - 1555
  • [34] CROSS-LINGUAL AND MULTILINGUAL SPEECH EMOTION RECOGNITION ON ENGLISH AND FRENCH
    Neumann, Michael
    Ngoc Thang Vu
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5769 - 5773
  • [35] Cross-lingual Speech Emotion Recognition through Factor Analysis
    Desplanques, Brecht
    Demuynck, Kris
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3648 - 3652
  • [36] Identifying Agreement/Disagreement in Conversational Speech: A Cross-lingual Study
    Wang, Wen
    Precoda, Kristin
    Richey, Colleen
    Raymond, Geoffrey
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3100 - +
  • [37] Cross-Lingual Sentiment Relation Capturing for Cross-Lingual Sentiment Analysis
    Chen, Qiang
    Li, Wenjie
    Lei, Yu
    Liu, Xule
    Luo, Chuwei
    He, Yanxiang
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 54 - 67
  • [38] Design of a Speech Corpus for Research on Cross-Lingual Prosody Transfer
    Secujski, Milan
    Gerazov, Branislav
    Csapo, Tamas Gabor
    Delic, Vlado
    Garner, Philip N.
    Gjoreski, Aleksandar
    Guennec, David
    Ivanovski, Zoran
    Melov, Aleksandar
    Nemeth, Geza
    Stojkovic, Ana
    Szaszak, Gyoergy
    [J]. SPEECH AND COMPUTER, 2016, 9811 : 199 - 206
  • [39] Cross-lingual Detection of Dysphonic Speech for Dutch and Hungarian Datasets
    Sztaho, David
    Tulics, Miklos Gabriel
    Qi, Jinzi
    Van Hamme, Hugo
    Vicsi, Klara
    [J]. BIOSIGNALS: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES - VOL 4: BIOSIGNALS, 2022, : 215 - 220
  • [40] Semi-supervised cross-lingual speech emotion recognition
    Agarla, Mirko
    Bianco, Simone
    Celona, Luigi
    Napoletano, Paolo
    Petrovsky, Alexey
    Piccoli, Flavio
    Schettini, Raimondo
    Shanin, Ivan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237