A Forensic Authorship Classification in SMS Messages: A Likelihood Ratio Based Approach Using N-gram

Shunichi Ishihara*

*Corresponding author for this work

    Research output: Contribution to conferencePaperpeer-review

    23 Citations (Scopus)

    Abstract

    Due to its convenience and low-cost, short message service (SMS) has been a very popular medium for communication for quite some time. Unfortunately, however, SMS messages are sometimes used in illicit acts, such as communication between drug dealers and buyers, extortion, fraud, scam, hoax, false reports of terrorist threats, and many more. This study is a forensic study on the authorship classification of SMS messages in the Likelihood Ration (LR) framework with the N-gram modelling technique. The aims of this study are to investigate 1) how accurately it is possible to classify the authors of SMS messages; 2) what degree of strength of evidence (LR) can be obtained from SMS messages and 3) how the classification performance and the LRs are affected by the sample size for modelling. The resultant LRs are calibrated by means of the logistic regress calibration technique. The results of the classification tests will be rigorously assessed from different angles, using the techniques proposed for automatic speaker recognition and forensic voice comparison.

    Original languageEnglish
    Pages47-56
    Number of pages10
    Publication statusPublished - 2011
    Event9th Australasian Language Technology Association Workshop, ALTA 2011 - Canberra, Australia
    Duration: 1 Dec 20112 Dec 2011

    Conference

    Conference9th Australasian Language Technology Association Workshop, ALTA 2011
    Country/TerritoryAustralia
    CityCanberra
    Period1/12/112/12/11

    Fingerprint

    Dive into the research topics of 'A Forensic Authorship Classification in SMS Messages: A Likelihood Ratio Based Approach Using N-gram'. Together they form a unique fingerprint.

    Cite this