Exploring sub-band cepstral distances for more robust speaker classification

Takashi Osanai, Yuko Kinoshita, Frantz Clermont

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    This paper presents the first of two-part exploration into the potential of parametric cepstral distance (PCD) as a forensic voice comparison feature, based on Japanese vowel data collected from 306 male native speakers under microphone and mobile transmission conditions. The behaviours of PCDs were closely examined by altering sub-band settings, and we found the behaviour of PCDs to correspond well to what is known about formants, which suggests that PCDs are relatable to articulatory gestures. Comparison between sub-band and full-band PCD revealed that limiting the band range to a specific frequency region makes the feature more robust against channel mismatch, encouraging further examination of this potential feature.
    Original languageEnglish
    Title of host publicationProceedings of the 17th Australasian International Conference on Speech Science and Technology
    EditorsJ Epps, J Wolfe, J Smith & C Jones
    Place of PublicationAustralia
    PublisherThe Australasian Speech Science and Technology Association, Inc.
    Pages41-44
    EditionPeer reviewed
    ISBN (Print)2207-1296
    Publication statusPublished - 2018
    Event17th Australasian International Conference on Speech Science and Technology - Sydney, Australia, Australia
    Duration: 1 Jan 2018 → …

    Conference

    Conference17th Australasian International Conference on Speech Science and Technology
    Country/TerritoryAustralia
    Period1/01/18 → …
    OtherDecember 4-7 2018

    Fingerprint

    Dive into the research topics of 'Exploring sub-band cepstral distances for more robust speaker classification'. Together they form a unique fingerprint.

    Cite this