Detecting depression: A comparison between spontaneous and read speech

Sharifa Alghowinem, Roland Goecke, Michael Wagner, Julien Epps, Michael Breakspear, Gordon Parker

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    99 Citations (Scopus)

    Abstract

    Major depressive disorders are mental disorders of high prevalence, leading to a high impact on individuals, their families, society and the economy. In order to assist clinicians to better diagnose depression, we investigate an objective diagnostic aid using affective sensing technology with a focus on acoustic features. In this paper, we hypothesise that (1) classifying the general characteristics of clinical depression using spontaneous speech will give better results than using read speech, (2) that there are some acoustic features that are robust and would give good classification results in both spontaneous and read, and (3) that a 'thin-slicing' approach using smaller parts of the speech data will perform similarly if not better than using the whole speech data. By examining and comparing recognition results for acoustic features on a real-world clinical dataset of 30 depressed and 30 control subjects using SVM for classification and a leave-one-out cross-validation scheme, we found that spontaneous speech has more variability, which increases the recognition rate of depression. We also found that jitter, shimmer, energy and loudness feature groups are robust in characterising both read and spontaneous depressive speech. Remarkably, thin-slicing the read speech, using either the beginning of each sentence or the first few sentences performs better than using all reading task data.

    Original languageEnglish
    Title of host publication2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings
    Pages7547-7551
    Number of pages5
    DOIs
    Publication statusPublished - 18 Oct 2013
    Event2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Vancouver, BC, Canada
    Duration: 26 May 201331 May 2013

    Publication series

    NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
    ISSN (Print)1520-6149

    Conference

    Conference2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
    Country/TerritoryCanada
    CityVancouver, BC
    Period26/05/1331/05/13

    Fingerprint

    Dive into the research topics of 'Detecting depression: A comparison between spontaneous and read speech'. Together they form a unique fingerprint.

    Cite this