I-vector speaker verification for speech degraded by narrowband and wideband channels

Laura Fernández Gallardo, Michael Wagner, Sebastian Möller

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    17 Citations (Scopus)

    Abstract

    Voice biometrics are frequently exposed to channel degradations of transmitted speech and to channel mismatch between enrollment and test utterances, which cause speaker recognition systems to perform poorly. In this paper, the influence of channel bandwidth and speech coding on speaker verification is assessed employing the state-of-the-art i-vector technique. Our focus is on the possible benefits of enhanced wideband over narrowband and on the effects of codec mismatch and bandwidth mismatch. Our results on subsets of the NIST SRE (Speaker Recognition Evaluation) 2010 and of the TIMIT corpus show that the performance with wideband data is significantly better than that employing narrowband signals for matched and codec-mismatched conditions. In the presence of bandwidth mismatch, a relative improvement of 40-70% can be obtained by downsampling the wideband signal to 8 kHz.

    Original languageEnglish
    Title of host publicationProceedings of 11th ITG Symposium on Speech Communication
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    ISBN (Electronic)9783800736409
    Publication statusPublished - 2014
    Event11th ITG Symposium on Speech Communication - Erlangen, Germany
    Duration: 24 Sept 201426 Sept 2014

    Publication series

    NameProceedings of 11th ITG Symposium on Speech Communication

    Conference

    Conference11th ITG Symposium on Speech Communication
    Country/TerritoryGermany
    CityErlangen
    Period24/09/1426/09/14

    Fingerprint

    Dive into the research topics of 'I-vector speaker verification for speech degraded by narrowband and wideband channels'. Together they form a unique fingerprint.

    Cite this