THE IN-THE-WILD SPEECH MEDICAL CORPUS

被引:4
|
作者
Correia, Joana [1 ,2 ]
Teixeira, Francisco [2 ]
Botelho, Catarina [2 ]
Trancoso, Isabel [2 ]
Raj, Bhiksha [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Lisbon, INESC ID, Lisbon, Portugal
关键词
Speech affecting diseases; pathological speech; in-the-wild; i-vectors; x-vectors; PARKINSONS-DISEASE;
D O I
10.1109/ICASSP39728.2021.9414230
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic detection of speech affecting (SA) diseases has received significant attention, particularly in clinical scenarios. However, the same task in in-the-wild conditions is often neglected, in part, due to the lack of appropriate datasets. In this work, we present the in-the-Wild Speech Medical (WSM) Corpus, a collection of in-the-wild videos, featuring subjects potentially affected by a SA disease - specifically, depression or Parkinson's disease. The WSM Corpus contains a total 928 videos, and over 131 hours of speech. Each video is accompanied by a crowdsourced annotation for perceived age/gender, and self-reported health status of the speaker. The WSM Corpus is balanced over all the labels. In this work we present a detailed description of the collection, and annotation processes of the WSM corpus. Furthermore, we present present several baseline systems for the detection of SA diseases using speech alone, thus motivating the use of this type of in-the-wild data in paralinguistic audiovisual tasks.
引用
收藏
页码:6973 / 6977
页数:5
相关论文
共 50 条
  • [21] Exploring HTTP Header Manipulation In-The-Wild
    Tyson, Gareth
    Huang, Shan
    Cuadrado, Felix
    Castro, Ignacio
    Perta, Vasile C.
    Sathiaseelan, Arjuna
    Uhlig, Steve
    PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, : 451 - 458
  • [22] Estimating Correspondences of Deformable Objects "In-the-wild"
    Zhou, Yuxiang
    Antonakos, Epameinondas
    Alabort-i-Medina, Joan
    Roussos, Anastasios
    Zafeiriou, Stefanos
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5791 - 5801
  • [23] WildLight: In-the-wild Inverse Rendering with a Flashlight
    Cheng, Ziang
    Li, Junxuan
    Li, Hongdong
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4305 - 4314
  • [24] Facial Expression Recognition for In-the-wild Videos
    Liu, Hanyu
    Zeng, Jiabei
    Shan, Shiguang
    2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 615 - 618
  • [25] Cascaded Video Generation for Videos In-the-Wild
    Castrejon, Lluis
    Ballas, Nicolas
    Courville, Aaron
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2385 - 2392
  • [26] Studying MarathonLive: Consent for In-The-Wild Research
    Anstead, Edward
    Flintham, Martin
    Benford, Steve
    PROCEEDINGS OF THE 2014 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING (UBICOMP'14 ADJUNCT), 2014, : 665 - 670
  • [27] Fast In-the-Wild Hair Segmentation and Color Classification
    Ileni, Tudor Alexandru
    Borza, Diana Laura
    Darabant, Adrian Sergiu
    VISAPP: PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 4, 2019, : 59 - 66
  • [28] In-the-wild Drowsiness Detection from Facial Expressions
    Joshi, Ajjen
    Kyal, Survi
    Banerjee, Sandipan
    Mishra, Taniya
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 207 - 212
  • [29] Active Orientation Models for Face Alignment In-the-Wild
    Tzimiropoulos, Georgios
    Alabort-i-Medina, Joan
    Zafeiriou, Stefanos P.
    Pantic, Maja
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2014, 9 (12) : 2024 - 2034
  • [30] Audio Visual Recognition of Spontaneous Emotions In-the-Wild
    Xia, Xiaohan
    Guo, Liyong
    Jiang, Dongmei
    Pei, Ercheng
    Yang, Le
    Sahli, Hichem
    PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 692 - 706