A Likelihood Ratio-Based Forensic Text Comparison in SMS Messages: A Fused System with Lexical Features and N-Grams

    Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

    Abstract

    This chapter is built on two studies: Ishihara (2011) "A Forensic Authorship Classification in SMS Messages: A Likelihood Ratio-Based Approach Using N-Grams" and Ishihara (2012) "A Forensic Text Comparison in SMS Messages: A Likelihood Ratio Approach with Lexical Features.” They are two of the first Likelihood Ratio (LR)-based forensic text comparison studies in forensic authorship analysis. The author attribution was modelled using N-grams in the former, whereas it was modelled using so-called lexical features in the latter. In the current study, the LRs obtained from these separate experiments are fused using a logistic regression fusion technique, and the author reports how much improvement in performance the fusion brings to the LR-based forensic text comparison system. The performance of the fused system is assessed based on the magnitude of the fused LRs using the log-likelihood-ratio cost (Cllr). The strength of the fused LRs is graphically presented in Tippett plots and compared with those of the original LRs. The chapter demonstrates that the fused system outperforms the original systems.
    Original languageEnglish
    Title of host publicationAnalyzing Security, Trust, and Crime in the Digital World
    EditorsHamid R. Nemati
    Place of PublicationHershey PA: USA
    PublisherIGI Global Books
    Pages208-224
    Volume1
    ISBN (Print)9781466648579
    Publication statusPublished - 2014

    Fingerprint

    Dive into the research topics of 'A Likelihood Ratio-Based Forensic Text Comparison in SMS Messages: A Fused System with Lexical Features and N-Grams'. Together they form a unique fingerprint.

    Cite this