Semi-Structural Interview-Based Chinese Multimodal Depression Corpus Towards Automatic Preliminary Screening of Depressive Disorders

Cited by: 11
Authors
Zou, Bochao [1]
Han, Jiali [2]
Wang, Yingxue [3]
Liu, Rui [2]
Zhao, Shenghui [4]
Feng, Lei [2]
Lyu, Xiangwen [3]
Ma, Huimin [1]
Affiliations
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Capital Med Univ, Natl Clin Res Ctr Mental Disorders, Beijing Anding Hosp, Beijing 100088, Peoples R China
[3] Natl Engn Lab Risk Percept & Prevent, Beijing 100041, Peoples R China
[4] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Depression; Interviews; Feature extraction; Behavioral sciences; Visualization; Hospitals; Acoustics; Affective computing; depressive disorder; multimodal corpus; semi-structural interview; DETECTING DEPRESSION; RISK-ASSESSMENT; SEVERITY; DIAGNOSIS; VALIDITY; SPEECH;
DOI
10.1109/TAFFC.2022.3181210
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Depression is a common psychiatric disorder worldwide. In China, however, a considerable number of patients with depression go undiagnosed, and most of them are unaware of their condition. Despite increasing efforts, automatic depression screening from behavioral indicators has not yet been achieved. A major limitation is the lack of an available multimodal depression corpus in Chinese, which matters because linguistic knowledge is crucial in clinical practice. We therefore first carried out a comprehensive survey with psychiatrists from a renowned psychiatric hospital to identify key interview topics that are highly related to the diagnosis of depression. A semi-structural interview study was then conducted over a year with subjects who had undergone clinical diagnosis and professional assessment. Visual, acoustic, and textual features were extracted and compared between the two groups, and statistically significant differences were observed in all three modalities. Benchmark evaluations of both single-modal and multimodal fusion methods for depression assessment were also performed; a multimodal transformer-based fusion approach achieved the best performance. Finally, the proposed Chinese Multimodal Depression Corpus (CMDC) was made publicly available after de-identification and annotation. We hope that the release of this corpus will promote research progress and practical applications of automatic depression screening.
Pages: 2823-2838
Number of pages: 16