Towards searching amongst tables

Paul Thomas, Rollin Omari, Tom Rowlands

    Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

    4 Citations (Scopus)

    Abstract

    An increasing number of data sets are being published online, in institutional or government repositories as well as by individual researchers, journalists, and others. These data are often represented as tables of various kinds; however, repositories have poor search over and inside tables. It is difficult for a user to tell from a repository's portal whether a useful dataset is available, and this problem is only likely to get worse. We describe this problem, and demonstrate that the naïve approach of full-text search is not appropriate. We describe an alternative, based on inferring types of data and indexing columns as a unit, and demonstrate some improvements in early success, especially when long captions are not available.

    Original language: English
    Title of host publication: ADCS 2015 - Proceedings of the 20th Australasian Document Computing Symposium
    Editors: Sarvnaz Karimi, Laurence A. F. Park
    Publisher: Association for Computing Machinery
    ISBN (Electronic): 9781450340403
    Publication status: Published - 8 Dec 2015
    Event: 20th Australasian Document Computing Symposium, ADCS 2015 - Parramatta, Australia
    Duration: 8 Dec 2015 – 9 Dec 2015

    Publication series

    Name: ACM International Conference Proceeding Series
    Volume: 08-09-Dec-2015

    Conference

    Conference: 20th Australasian Document Computing Symposium, ADCS 2015
    Country/Territory: Australia
    City: Parramatta
    Period: 8/12/15 – 9/12/15
