[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2910674.2935856acmotherconferencesArticle/Chapter ViewAbstractPublication PagespetraConference Proceedingsconference-collections
short-paper

Fusing active orientation models and mid-term audio features for automatic depression estimation

Published: 29 June 2016 Publication History

Abstract

In this paper, we predict a human's depression level in the BDI-II scale, using facial and voice features. Active orientation models (AOM) and several voice features were extracted from the video and audio modalities. Long-term and mid-term features were computed and a fusion is performed in the feature space. Videos from the Depression Recognition Sub-Challenge of the 2014 Audio-Visual Emotion Challenge and Workshop (AVEC 2014) were used and support vector regression models were trained to predict the depression level. We demonstrated that the fusion of AOMs with audio features leads to better performance compared to individual modalities. The obtained regression results indicate the robustness of the proposed technique, under different settings, as well as an RMSE improvement compared to the AVEC 2014 video baseline.

References

[1]
World Health Organization. Depression - a hidden burden. www.who.int/mediacentre/factsheets/fs369. Last accessed: 05.01.2016.
[2]
Almaev, T. R., and Valstar, M. F. Local gabor binary patterns from three orthogonal planes for automatic facial expression recognition. In ACII (2013).
[3]
Beck, A. T., Steer, R. A., Ball, R., and Ranieri, W. Comparison of Beck Depression Inventories -IA and -II in psychiatric outpatients. Journal of Personality Assessment (1996).
[4]
Belhumeur, P. N., Jacobs, D. W., Kriegman, D. J., and Kumar, N. Localizing parts of faces using a consensus of exemplars. TPAMI (2013).
[5]
Cohn, J. F., Kruez, T. S., Matthews, I., Yang, Y., Nguyen, M. H., Padilla, M. T., Zhou, F., and La Torre, F. D. Detecting depression from facial actions and vocal prosody. In ACII Workshops (2009).
[6]
Gupta, R., Malandrakis, N., Xiao, B., Guha, T., Van Segbroeck, M., Black, M., Potamianos, A., and Narayanan, S. Multimodal prediction of affective dimensions and depression in human-computer interactions. In ACM International AVEC Workshop (2014).
[7]
Jain, V., Crowley, J. L., Dey, A. K., and Lux, A. Depression estimation using audiovisual features and fisher vector encoding. In ACM International AVEC Workshop (2014).
[8]
Jan, A., Meng, H., Gaus, Y. F. A., Zhang, F., and Turabzadeh, S. Automatic depression scale prediction using facial expression dynamics and regression. In ACM International AVEC Workshop (2014).
[9]
Kaya, H., Eyben, F., Salah, A. A., and Schuller, B. CCA based feature selection with application to continuous depression recognition from acoustic speech features. In ICASSP (2014).
[10]
Pérez Espinosa, H., Escalante, H. J., Villaseñor Pineda, L., Montes-y Gómez, M., Pinto-Avedaño, D., and Reyez-Meza, V. Fusing affective dimensions and audio-visual features from segmented video for depression recognition: INAOE-BUAP's. In ACM International AVEC Workshop (2014).
[11]
Senoussaoui, M., Sarria-Paja, M., Santos, J. a. F., and Falk, T. H. Model fusion for multimodal depression classification and level detection. In ACM International AVEC Workshop (2014).
[12]
Tzimiropoulos, G., Alabort-i Medina, J., Zafeiriou, S., and Pantic, M. Generic active appearance models revisited. In ACCV. 2013.
[13]
Valstar, M., Schuller, B., Smith, K., Almaev, T., Eyben, F., Krajewski, J., Cowie, R., and Pantic, M. AVEC 2014: 3D dimensional affect and depression recognition challenge. In ACM International AVEC Workshop (2014).
[14]
Williamson, J. R., Quatieri, T. F., Helfer, B. S., Ciccarelli, G., and Mehta, D. D. Vocal and facial biomarkers of depression based on motor incoordination and timing. In ACM International AVEC Workshop (2014).

Cited By

View all
  • (2019)Automatic Assessment of Depression Based on Visual Cues: A Systematic ReviewIEEE Transactions on Affective Computing10.1109/TAFFC.2017.272403510:4(445-470)Online publication date: 1-Oct-2019
  • (2017)Towards More Robust Automatic Facial Expression Recognition in Smart EnvironmentsProceedings of the 10th International Conference on PErvasive Technologies Related to Assistive Environments10.1145/3056540.3056546(37-44)Online publication date: 21-Jun-2017
  1. Fusing active orientation models and mid-term audio features for automatic depression estimation

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    PETRA '16: Proceedings of the 9th ACM International Conference on PErvasive Technologies Related to Assistive Environments
    June 2016
    455 pages
    ISBN:9781450343374
    DOI:10.1145/2910674
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 June 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Depression estimation
    2. active orientation models
    3. audio-visual fusion

    Qualifiers

    • Short-paper
    • Research
    • Refereed limited

    Conference

    PETRA '16

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 10 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2019)Automatic Assessment of Depression Based on Visual Cues: A Systematic ReviewIEEE Transactions on Affective Computing10.1109/TAFFC.2017.272403510:4(445-470)Online publication date: 1-Oct-2019
    • (2017)Towards More Robust Automatic Facial Expression Recognition in Smart EnvironmentsProceedings of the 10th International Conference on PErvasive Technologies Related to Assistive Environments10.1145/3056540.3056546(37-44)Online publication date: 21-Jun-2017

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media