THE IN-THE-WILD SPEECH MEDICAL CORPUS

被引:4
|
作者
Correia, Joana [1 ,2 ]
Teixeira, Francisco [2 ]
Botelho, Catarina [2 ]
Trancoso, Isabel [2 ]
Raj, Bhiksha [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Lisbon, INESC ID, Lisbon, Portugal
关键词
Speech affecting diseases; pathological speech; in-the-wild; i-vectors; x-vectors; PARKINSONS-DISEASE;
D O I
10.1109/ICASSP39728.2021.9414230
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic detection of speech affecting (SA) diseases has received significant attention, particularly in clinical scenarios. However, the same task in in-the-wild conditions is often neglected, in part, due to the lack of appropriate datasets. In this work, we present the in-the-Wild Speech Medical (WSM) Corpus, a collection of in-the-wild videos, featuring subjects potentially affected by a SA disease - specifically, depression or Parkinson's disease. The WSM Corpus contains a total 928 videos, and over 131 hours of speech. Each video is accompanied by a crowdsourced annotation for perceived age/gender, and self-reported health status of the speaker. The WSM Corpus is balanced over all the labels. In this work we present a detailed description of the collection, and annotation processes of the WSM corpus. Furthermore, we present present several baseline systems for the detection of SA diseases using speech alone, thus motivating the use of this type of in-the-wild data in paralinguistic audiovisual tasks.
引用
收藏
页码:6973 / 6977
页数:5
相关论文
共 50 条
  • [41] Optimization problems for fast AAM fitting in-the-wild
    Tzimiropoulos, Georgios
    Pantic, Maja
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 593 - 600
  • [42] Deformable Models of Ears in-the-wild for Alignment and Recognition
    Zhou, Yuxiang
    Zaferiou, Stefanos
    2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 626 - 633
  • [43] Examining Participant Adherence with Wearables in an In-the-Wild Setting
    Nolasco, Hannah R.
    Vargo, Andrew
    Bohley, Niklas
    Brinkhaus, Christian
    Kise, Koichi
    SENSORS, 2023, 23 (14)
  • [44] Reconstructing Objects in-the-wild for Realistic Sensor Simulation
    Yang, Ze
    Manivasagam, Sivabalan
    Chen, Yun
    Wang, Jingkang
    Hu, Rui
    Urtasun, Raquel
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 11661 - 11668
  • [45] Is Gender 'In-the-Wild' Inference Really a Solved Problem?
    Roxo T.
    Proenca H.
    IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021, 3 (04): : 573 - 582
  • [46] Investigation Into Deep Learning Methods for In-the-Wild Applications
    Land, Haley B.
    Price, Stanton R.
    Carley, Samantha S.
    Butler, Samantha J.
    Price, Steven R.
    AUTONOMOUS SYSTEMS:SENSORS, PROCESSING, AND SECURITY FOR GROUND, AIR, SEA, AND SPACE VEHICLES AND INFRASTRUCTURE 2024, 2024, 13052
  • [47] Shapes and Context: In-the-Wild Image Synthesis & Manipulation
    Banssal, Aayush
    Sheikh, Yaser
    Ramanan, Deva
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2312 - 2321
  • [48] Generating Realistic Images from In-the-wild Sounds
    Lee, Taegyeong
    Kang, Jeonghun
    Kim, Hyeonyu
    Kim, Taehwan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7126 - 7136
  • [49] Unaligned Supervision for Automatic Music Transcription in-the-Wild
    Maman, Ben
    Bermano, Amit H.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [50] Valence and Arousal Estimation In-The-Wild with Tensor Methods
    Mitenkova, Anna
    Kossaifi, Jean
    Panagakis, Yannis
    Pantic, Maja
    2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019), 2019, : 48 - 54