TY - JOUR
T1 - A Likelihood ratio-based forensic voice comparison in microphone vs. mobile mismatched conditions using Japanese /ai/
AU - Carne, Michael J.
N1 - Publisher Copyright:
Copyright © 2015 ISCA.
PY - 2015
Y1 - 2015
N2 - This paper describes a likelihood ratio-based forensic voice comparison experiment in microphone versus mobile channel mismatched conditions using parametric representations of formant trajectories. Cubic polynomial coefficients of /ai/ from non-contemporaneous recordings of 30 Japanese male speakers are used to derive multivariate likelihood ratios. The results are evaluated separately for a matched and mismatched group to determine the effect of the mismatch on system performance. A calibrated cross-validated log-likelihood ratio cost (Cllr) of 0.93 is achieved for the F-pattern of /ai/ representing an 18% reduction in system validity relative to the matched group. Separate testing involving only F1 and F2 features evinces a smaller (10%) reduction; suggesting F3 may be more impacted by channel differences. Spectral analysis of F3 indicates this stems from formant tracking errors due to weak signal energy in transmission. As such, F3 in /ai/ should be excluded from analysis where it is poorly preserved. Given the relatively small percentage reductions in validity, it is concluded that /ai/ may be reasonably robust to the mismatch. However, poor performance in optimal conditions (Cllr = 0.77) suggests it may not be a particularly useful parameter in the first place. Limitations to the current study are also discussed.
AB - This paper describes a likelihood ratio-based forensic voice comparison experiment in microphone versus mobile channel mismatched conditions using parametric representations of formant trajectories. Cubic polynomial coefficients of /ai/ from non-contemporaneous recordings of 30 Japanese male speakers are used to derive multivariate likelihood ratios. The results are evaluated separately for a matched and mismatched group to determine the effect of the mismatch on system performance. A calibrated cross-validated log-likelihood ratio cost (Cllr) of 0.93 is achieved for the F-pattern of /ai/ representing an 18% reduction in system validity relative to the matched group. Separate testing involving only F1 and F2 features evinces a smaller (10%) reduction; suggesting F3 may be more impacted by channel differences. Spectral analysis of F3 indicates this stems from formant tracking errors due to weak signal energy in transmission. As such, F3 in /ai/ should be excluded from analysis where it is poorly preserved. Given the relatively small percentage reductions in validity, it is concluded that /ai/ may be reasonably robust to the mismatch. However, poor performance in optimal conditions (Cllr = 0.77) suggests it may not be a particularly useful parameter in the first place. Limitations to the current study are also discussed.
KW - Channel mismatch
KW - Forensic voice comparison
KW - Formants
KW - Likelihood ratio
KW - Mobile phones
UR - http://www.scopus.com/inward/record.url?scp=84959110301&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:84959110301
SN - 2308-457X
VL - 2015-January
SP - 3471
EP - 3475
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015
Y2 - 6 September 2015 through 10 September 2015
ER -