Constructive Visual Analytics for Text Similarity Detection

A. Abdul-Rahman, G. Roe, M. Olsen, C. Gladstone, R. Whaling, N. Cronk, R. Morrissey, M. Chen

    Research output: Contribution to journalArticlepeer-review

    23 Citations (Scopus)

    Abstract

    Detecting similarity between texts is a frequently encountered text mining task. Because the measurement of similarity is typically composed of a number of metrics, and some measures are sensitive to subjective interpretation, a generic detector obtained using machine learning often has difficulties balancing the roles of different metrics according to the semantic context exhibited in a specific collection of texts. In order to facilitate human interaction in a visual analytics process for text similarity detection, we first map the problem of pairwise sequence comparison to that of image processing, allowing patterns of similarity to be visualized as a 2D pixelmap. We then devise a visual interface to enable users to construct and experiment with different detectors using primitive metrics, in a way similar to constructing an image processing pipeline. We deployed this new approach for the identification of commonplaces in 18th-century literary and print culture. Domain experts were then able to make use of the prototype system to derive new scholarly discoveries and generate new hypotheses.

    Original languageEnglish
    Pages (from-to)237-248
    Number of pages12
    JournalComputer Graphics Forum
    Volume36
    Issue number1
    DOIs
    Publication statusPublished - 1 Jan 2017

    Fingerprint

    Dive into the research topics of 'Constructive Visual Analytics for Text Similarity Detection'. Together they form a unique fingerprint.

    Cite this