Predicting Biological Signals from Speech: Introducing a Novel Multimodal Dataset and Results

被引:2
|
作者
Baird, Alice [1 ]
Amiriparian, Shahin [1 ]
Berschneider, Miriam [1 ]
Schmitt, Maximilian [1 ]
机构
[1] Augsburg Univ, Augsburg, Germany
来源
2019 IEEE 21ST INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2019) | 2019年
关键词
RECOGNITION; RESPONSES;
D O I
10.1109/mmsp.2019.8901758
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent years, diagnosis and awareness of mental health conditions, e.g., chronic stress, have been increasing globally. Biological signals can be an effective way to monitor such conditions, yet acquisition can be cumbersome and invasive. Alternatively, acoustic features offer non-invasive and efficient monitoring of an array of health and wellbeing characteristics. This study presents the BioSpeech Database (BIOS-DB), a novel database of audio and biological signals - blood volume pulse (BVP) and skin conductance (SC) - from 55 individuals speaking aloud in front of others, whilst having their emotional state annotated in real time. Through a variation of conventional and state-of-the-art approaches, initial experiments have shown for the first time that acoustic features can be applied for the task of BVP prediction. Notably, using deep representations of audio and a sequence-to-sequence auto-encoders with a GRU-RNN as a time-dependent regressor achieved at best 0.075 and 0.123 RMSE for [0; 1] normalised BVP and SC, respectively.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] A novel multimodal EEG-image fusion approach for emotion recognition: introducing a multimodal KMED dataset
    Bahar Hatipoglu Yilmaz
    Cemal Kose
    Cagatay Murat Yilmaz
    Neural Computing and Applications, 2025, 37 (6) : 5187 - 5202
  • [2] Speech production under stress for machine learning: multimodal dataset of 79 cases and 8 signals
    Pesan, Jan
    Jurik, Vojtech
    Ruzickova, Alexandra
    Svoboda, Vojtech
    Janousek, Oto
    Nemcova, Andrea
    Bojanovska, Hana
    Aldabaghova, Jasmina
    Kyslik, Filip
    Vodickova, Katerina
    Sodomova, Adela
    Bartys, Patrik
    Chudy, Peter
    Cernocky, Jan
    SCIENTIFIC DATA, 2024, 11 (01)
  • [3] Introducing DeReKoGram: A Novel Frequency Dataset with Lemma and Part-of-Speech Information for German
    Wolfer, Sascha
    Koplenig, Alexander
    Kupietz, Marc
    Mueller-Spitzer, Carolin
    DATA, 2023, 8 (11)
  • [4] Unlocking Human-Robot Dynamics: Introducing SenseCobot, a Novel Multimodal Dataset on Industry 4.0
    Borghi, Simone
    Zucchi, Federica
    Prati, Elisa
    Ruo, Andrea
    Villani, Valeria
    Sabattini, Lorenzo
    Peruzzini, Margherita
    PROCEEDINGS OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024, 2024, : 880 - 884
  • [5] Speech-Based Classification of Defensive Communication: A Novel Dataset and Results
    Amiriparian, Shahin
    Christ, Lukas
    Kushtanova, Regina
    Gerczuk, Maurice
    Teynor, Alexandra
    Schuller, Bjoern W.
    INTERSPEECH 2023, 2023, : 2703 - 2707
  • [6] A Novel Method for Epoch Extraction from Speech Signals
    Kaushik, Lakshmish
    O'Shaughnessy, Douglas
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2847 - 2850
  • [7] Voice pathology detection and classification from speech signals and EGG signals based on a multimodal fusion method
    Geng, Lei
    Shan, Hongfeng
    Xiao, Zhitao
    Wang, Wei
    Wei, Mei
    BIOMEDICAL ENGINEERING-BIOMEDIZINISCHE TECHNIK, 2021, 66 (06): : 613 - 625
  • [8] Introducing the Urdu-Sindhi Speech Emotion Corpus: A Novel Dataset of Speech Recordings for Emotion Recognition for Two Low-Resource Languages
    Syed, Zafi Sherhan
    Memon, Sajjad Ali
    Shah, Muhammad Shehram
    Syed, Abbas Shah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (04) : 805 - 810
  • [9] A Novel Hierarchical Framework for Measuring the Complexity and Irregularity of Multimodal Speech Signals and Its Application in the Assessment of Speech Impairment in Amyotrophic Lateral Sclerosis
    Rong, Panying
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2021, 64 (08): : 2996 - 3014
  • [10] A Novel System for Recognizing Recording Devices from Recorded Speech Signals
    Bao, Yongqiang
    Shao, Qi
    Zhang, Xuxu
    Jiang, Jiahui
    Xie, Yue
    Liu, Tingting
    Xu, Weiye
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 65 (03): : 2557 - 2570