TY - JOUR
T1 - Advantages of wideband over narrowband channels for speaker verification employing MFCCs and LFCCs
AU - Gallardo, Laura Fernández
AU - Wagner, Michael
AU - Möller, Sebastian
N1 - Publisher Copyright:
Copyright © 2014 ISCA.
PY - 2014
Y1 - 2014
N2 - Wideband communications permit the transmission of an extended frequency range compared to the traditional narrowband. While benefits for automatic speaker recognition can be expected, the extent of the contribution of the additional bandwidth in wideband is still unclear. This work compares the i-vector speaker verification performances employing speech signals of 0-4 kHz, 4-8 kHz, and 0-8 kHz and different sets of cepstral features extracted using linearlyand a mel-spaced filterbanks. Analyses of clean speech and of speech transmitted through commonly employed codecs are conducted separately for male and for female speech. Our evaluation on two different datasets shows the improved speaker verification performance with the extended bandwidth, and also that the linear scale can lead to better results for narrowband signals. The advantages of linear- over mel-scaled features for wideband depend on the speakers' gender and on the channel distortion.
AB - Wideband communications permit the transmission of an extended frequency range compared to the traditional narrowband. While benefits for automatic speaker recognition can be expected, the extent of the contribution of the additional bandwidth in wideband is still unclear. This work compares the i-vector speaker verification performances employing speech signals of 0-4 kHz, 4-8 kHz, and 0-8 kHz and different sets of cepstral features extracted using linearlyand a mel-spaced filterbanks. Analyses of clean speech and of speech transmitted through commonly employed codecs are conducted separately for male and for female speech. Our evaluation on two different datasets shows the improved speaker verification performance with the extended bandwidth, and also that the linear scale can lead to better results for narrowband signals. The advantages of linear- over mel-scaled features for wideband depend on the speakers' gender and on the channel distortion.
KW - Channel degradation
KW - LFCC
KW - MFCC
KW - Speaker verification
UR - http://www.scopus.com/inward/record.url?scp=84910092167&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:84910092167
SN - 2308-457X
SP - 1115
EP - 1119
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014
Y2 - 14 September 2014 through 18 September 2014
ER -