Abstract
This study sets out to find the most reliable method for log-likelihood-ratio (LLR) calculation under severe data scarcity, which is typical of forensic voice comparison casework. We compared the performance of three types of speaker modelling, namely a single Gaussian model, Gaussian Mixture Models (GMMs) of different complexity, and a Multivariate Kernel Density (MVKD) model, using two- and three-dimensional formant frequency feature vectors extracted from /iː/ vowels. We varied the number of tokens in the offender dataset from 2 to 6. We found that calibration was critical for dependable evaluation of all the systems tested and that the MVKD model outperformed the Gaussian models in most cases.
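As an illustration of the simplest of the three approaches, the sketch below computes an LLR from single Gaussian models. It is a minimal example under stated assumptions, not the procedure used in the paper: the function names, the synthetic two-dimensional "formant" values, and the choice to average the per-token log ratio over the offender tokens are all hypothetical. The resulting score is uncalibrated.

```python
import numpy as np

def gaussian_logpdf(x, mean, cov):
    """Natural-log density of a multivariate Gaussian at each row of x."""
    x = np.atleast_2d(x)
    d = x.shape[1]
    diff = x - mean
    # Squared Mahalanobis distance per row, via a linear solve rather than an explicit inverse.
    maha = np.einsum("ij,ij->i", diff, np.linalg.solve(cov, diff.T).T)
    _, logdet = np.linalg.slogdet(cov)
    return -0.5 * (d * np.log(2 * np.pi) + logdet + maha)

def single_gaussian_llr(offender, suspect, background):
    """Mean per-token natural-log LR (assumption: averaging over offender tokens).

    Numerator: Gaussian fitted to the suspect tokens (same-speaker hypothesis).
    Denominator: Gaussian fitted to the background tokens (different-speaker hypothesis).
    """
    s_mean, s_cov = suspect.mean(axis=0), np.cov(suspect, rowvar=False)
    b_mean, b_cov = background.mean(axis=0), np.cov(background, rowvar=False)
    log_num = gaussian_logpdf(offender, s_mean, s_cov)
    log_den = gaussian_logpdf(offender, b_mean, b_cov)
    return float(np.mean(log_num - log_den))

# Synthetic data for illustration only (hypothetical F1/F2 values in Hz).
rng = np.random.default_rng(0)
background = rng.normal([300.0, 2300.0], [40.0, 200.0], size=(500, 2))
suspect = rng.normal([280.0, 2400.0], [15.0, 60.0], size=(30, 2))
offender_same = rng.normal([280.0, 2400.0], [15.0, 60.0], size=(4, 2))  # 4 tokens
offender_diff = rng.normal([340.0, 2100.0], [15.0, 60.0], size=(4, 2))  # 4 tokens

llr_same = single_gaussian_llr(offender_same, suspect, background)
llr_diff = single_gaussian_llr(offender_diff, suspect, background)
print(f"same-speaker LLR: {llr_same:.2f}, different-speaker LLR: {llr_diff:.2f}")
```

With only a handful of offender tokens the score can vary considerably from sample to sample, which is one reason a separate calibration step matters before such scores are interpreted as likelihood ratios.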
Original language | English |
---|---|
Title of host publication | Interspeech 2014 |
Place of Publication | Singapore |
Publisher | International Speech Communication Association |
Pages | 16-19 |
Edition | Peer Reviewed |
Publication status | Published - 2014 |
Event | Annual Conference of the International Speech Communication Association INTERSPEECH 2014 - Singapore, Singapore Duration: 1 Jan 2014 → … http://www.isca-speech.org/archive/interspeech_2014 |
Conference
Conference | Annual Conference of the International Speech Communication Association INTERSPEECH 2014 |
---|---|
Country/Territory | Singapore |
Period | 1/01/14 → … |
Other | September 14-18 2014 |