Query-biased summaries for tabular data

Vincent Au, Paul Thomas, Gaya K. Jayasinghe

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    4 Citations (Scopus)

    Abstract

    Government, research, and academic data portals publish a large amount of public data, but present tools make discovery difficult. In particular, search results do not support a user's decision whether or not to commit to a download of what might be a large data set. We describe a method for producing query-biased summaries of tabular data, which aims to support a user's download decision-or even to answer the question on the spot, with no further interaction. The method infers simple types in the data and query; automatically refines queries, where that makes sense; extracts relevant subsets of the complete table; and generates both graphical and tabular summaries of what remains. A small-scale user study suggests this both helps users identify useful results (fewer false negatives), and reduces wasted downloads (fewer false positives).

    Original languageEnglish
    Title of host publicationADCS 2016 - Proceedings of the 21st Australasian Document Computing Symposium
    EditorsSarvnaz Karimi, Mark Carman
    PublisherAssociation for Computing Machinery (ACM)
    Pages69-72
    Number of pages4
    ISBN (Electronic)9781450348652
    DOIs
    Publication statusPublished - 5 Dec 2016
    Event21st Australasian Document Computing Symposium, ADCS 2016 - Caulfield, Australia
    Duration: 6 Dec 20167 Dec 2016

    Publication series

    NameACM International Conference Proceeding Series

    Conference

    Conference21st Australasian Document Computing Symposium, ADCS 2016
    Country/TerritoryAustralia
    CityCaulfield
    Period6/12/167/12/16

    Fingerprint

    Dive into the research topics of 'Query-biased summaries for tabular data'. Together they form a unique fingerprint.

    Cite this