I-vector speaker verification based on phonetic information under transmission channel effects

Laura Fernández Gallardo*, Michael Wagner, Sebastian Möller

*Corresponding author for this work

    Research output: Contribution to journalConference articlepeer-review

    7 Citations (Scopus)

    Abstract

    Past studies have shown evidence of important speakerspecific content in the higher frequencies of the spectrum, which are filtered out by narrowband channels. Besides, wideband transmissions, which are gaining ground over narrowband communications, offer an extended range of frequencies which account not only for better speech quality and intelligibility, but also for an improved speaker recognition performance. In this work, different phoneme classes (fricatives, nasals, and vowels) were removed from speech of different bandwidths, and a series of i-vector based speaker verification experiments were conducted. Our results show that the performance enhancement with clean wideband speech with respect to clean narrowband speech is principally due to the presence of unvoiced fricative consonants. The effects of codec schemes of different bandwidths on the aforementioned speech are discussed.

    Original languageEnglish
    Pages (from-to)696-700
    Number of pages5
    JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
    Publication statusPublished - 2014
    Event15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014 - Singapore, Singapore
    Duration: 14 Sept 201418 Sept 2014

    Fingerprint

    Dive into the research topics of 'I-vector speaker verification based on phonetic information under transmission channel effects'. Together they form a unique fingerprint.

    Cite this