Investigation of Cross-lingual Depression Prediction Possibilities Based on Speech Processing

被引:0
|
作者
Kiss, Gabor [1 ]
Vicsi, Klara [1 ]
机构
[1] Budapest Univ Technol & Econ, Dept Telecommun & Media Informat, Budapest, Hungary
关键词
depression; classification; regression; cross-lingual;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this study a cross-lingual investigation is presented about prediction possibilities of depression on the base of speech processing. Our examination was performed on three European languages: German, Hungarian and Italian. Those acoustic features were selected, as input vector of the predictor, which correlate with the severity of depression in quasi language-independent way. Several mono and cross-lingual experiments were conducted. The method is even capable of predicting the severity of depression in the case of a language not used during the training of the model. The experiments clearly show that multilingual depression recognition can be achieved, and it should be possible to construct an automated diagnostic tool for detecting depression, or for patient monitoring, in a cross-lingual way.
引用
收藏
页码:97 / 101
页数:5
相关论文
共 50 条
  • [1] CROSS-LINGUAL TOPIC PREDICTION FOR SPEECH USING TRANSLATIONS
    Bansal, Sameer
    Kamper, Herman
    Lopez, Adam
    Goldwater, Sharon
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8164 - 8168
  • [2] CROSS-LINGUAL TRANSFER FOR SPEECH PROCESSING USING ACOUSTIC LANGUAGE SIMILARITY
    Wu, Peter
    Shi, Jiatong
    Zhong, Yifan
    Watanabe, Shinji
    Black, Alan W.
    [J]. 2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 1050 - 1057
  • [3] Investigation of the Accuracy of Depression Prediction Based on Speech Processing
    Kiss, Gabor
    Jenei, Attila Zoltan
    [J]. 2020 43RD INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2020, : 129 - 132
  • [4] Cross-lingual Dialog Model for Speech to Speech Translation
    Ettelaie, Emil
    Georgiou, Panayiotis G.
    Narayanan, Shrikanth
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1173 - 1176
  • [5] Mono- and multi-lingual depression prediction based on speech processing
    Kiss G.
    Vicsi K.
    [J]. International Journal of Speech Technology, 2017, 20 (04) : 919 - 935
  • [6] Cross-lingual Lexical Sememe Prediction
    Qi, Fanchao
    Lin, Yankai
    Sun, Maosong
    Zhu, Hao
    Xie, Ruobing
    Liu, Zhiyuan
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 358 - 368
  • [7] Cross-Lingual Speech-to-Text Summarization
    Pontes, Elvys Linhares
    Gonzalez-Gallardo, Carlos-Emiliano
    Torres-Moreno, Juan-Manuel
    Huet, Stephane
    [J]. MULTIMEDIA AND NETWORK INFORMATION SYSTEMS, 2019, 833 : 385 - 395
  • [8] Speech Emotion Recognition with Cross-lingual Databases
    Chiou, Bo-Chang
    Chen, Chia-Ping
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 558 - 561
  • [9] IMPROVING LUXEMBOURGISH SPEECH RECOGNITION WITH CROSS-LINGUAL SPEECH REPRESENTATIONS
    Le Minh Nguyen
    Nayak, Shekhar
    Coler, Matt
    [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 792 - 797
  • [10] CROSS-LINGUAL SPEAKER ADAPTATION FOR HMM-BASED SPEECH SYNTHESIS
    Wu, Yi-Jian
    King, Simon
    Tokuda, Keiichi
    [J]. 2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 9 - 12