Abstract
The performances of two different procedures for calculating the likelihood ratio (LR) for forensic text comparison (FTC) are empirically compared. One is the multivariate kernel density (MVKD) procedure with so-called lexical features. The MVKD procedure has been successfully applied to various types of evidence, including texts. The other is the procedure based on character N-grams. N-gram is a widely-used, robust probabilistic language model. The effectiveness of character N-grams has been reported in authorship analysis, however, to the best of my knowledge, it has not yet been applied to LR-based FTC. In this study, the log-likelihood-ratio-cost (Cllr), which is an appropriate assessment metric for LR-based systems, is used to assess the performance of the two procedures. It will be reported that the MVKD procedure outperforms the character N-gram procedure. Through the comparison of the two procedures, this study also demonstrates how the weight of evidence (= LR) can be estimated from text messages.
Original language | English |
---|---|
Title of host publication | The 45th Australian Linguistic Society Conference Proceedings - 2014 |
Editors | Mark Harvey and Alexis Antonia |
Place of Publication | Newcastle, Australia |
Publisher | University of Newcastle |
Pages | 131-152 |
Edition | peer reviewed |
ISBN (Print) | 9780994150707 |
Publication status | Published - 2014 |
Event | Australian Linguistic Society Conference Proceedings - ALS 2014 - University of Newcastle Duration: 1 Jan 2014 → … http://nova.newcastle.edu.au/vital/access/manager/Repository/uon:21580?queryType=vitalDismax&query=Australian+Linguistic+Society |
Conference
Conference | Australian Linguistic Society Conference Proceedings - ALS 2014 |
---|---|
Period | 1/01/14 → … |
Other | December 10-12 2014 |
Internet address |