TY - GEN
T1 - I-vector speaker verification for speech degraded by narrowband and wideband channels
AU - Gallardo, Laura Fernández
AU - Wagner, Michael
AU - Möller, Sebastian
N1 - Publisher Copyright: © VDE VERLAG GMBH Berlin Offenbach.
PY - 2014
Y1 - 2014
N2 - Voice biometrics are frequently exposed to channel degradations of transmitted speech and to channel mismatch between enrollment and test utterances, which cause speaker recognition systems to perform poorly. In this paper, the influence of channel bandwidth and speech coding on speaker verification is assessed employing the state-of-the-art i-vector technique. Our focus is on the possible benefits of enhanced wideband over narrowband and on the effects of codec mismatch and bandwidth mismatch. Our results on subsets of the NIST SRE (Speaker Recognition Evaluation) 2010 and of the TIMIT corpus show that the performance with wideband data is significantly better than that employing narrowband signals for matched and codec-mismatched conditions. In the presence of bandwidth mismatch, a relative improvement of 40-70% can be obtained by downsampling the wideband signal to 8 kHz.
UR - http://www.scopus.com/inward/record.url?scp=84939494888&partnerID=8YFLogxK
M3 - Conference contribution
T3 - Proceedings of 11th ITG Symposium on Speech Communication
BT - Proceedings of 11th ITG Symposium on Speech Communication
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 11th ITG Symposium on Speech Communication
Y2 - 24 September 2014 through 26 September 2014
ER -