Abstract
Depression is a mental health problem that affects human mood and the ability to function properly. Currently, the assessment is mainly based on subjective questionnaires and clinical opinions. This work aims to infer the depression level through analysing multimodal data, as a tool to support medical experts and patients in depression screening and diagnosis. We introduce a fusion method to estimate the Beck Depression Inventory II scores using multiple inputs: facial images, video-based blood volume pulse signals, and speech data. Each modality has its own regression model, based on the ResNet-50 architecture. Our approach leverages the synchrony between regression scores of these models to produce the fusion values. Specifically, we calculate the Pearson correlation coefficient and the dynamic time warping distance between sliding windows of the score sequences to find the optimal segments for fusion. We evaluate our method on the dataset of the fourth Audio-Visual Emotion Recognition Challenge (AVEC 2014). We achieve a Mean Absolute Error of 6.08 and a Root Mean Squared Error of 8.60, which are lower than those of each single-modality model.
Original language | English |
---|---|
Title of host publication | UbiComp/ISWC 2022 Adjunct - Proceedings of the 2022 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2022 ACM International Symposium on Wearable Computers |
Publisher | Association for Computing Machinery ACM |
Pages | 198-201 |
Number of pages | 4 |
ISBN (Electronic) | 9781450394239 |
DOIs | |
Publication status | Published - 11 Sept 2022 |
MoE publication type | A4 Article in a conference publication |
Event | 2022 ACM International Joint Conference on Pervasive and Ubiquitous Computing and the 2022 ACM International Symposium on Wearable Computers, UbiComp/ISWC 2022 - Cambridge, United Kingdom Duration: 11 Sept 2022 → 15 Sept 2022 |
Conference
Conference | 2022 ACM International Joint Conference on Pervasive and Ubiquitous Computing and the 2022 ACM International Symposium on Wearable Computers, UbiComp/ISWC 2022 |
---|---|
Country/Territory | United Kingdom |
City | Cambridge |
Period | 11/09/22 → 15/09/22 |
Funding
This research has been supported by the Academy of Finland 6G Flagship program under Grant 346208 and PROFI5 HiDyn under Grant 32629, and the InSecTT project, which is funded under the European ECSEL Joint Undertaking (JU) program under grant agreement No 876038.
Keywords
- ensemble methods
- multimodal data
- wellbeing