A toolbox approach to flexible and efficient data mining

Ole M. Nielsen*, Peter Christen, Markus Hegland, Tatiana Semenova, Timothy Hancock

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    2 Citations (Scopus)

    Abstract

    This paper describes a flexible and efficient toolbox based on the scripting language Python, capable of handling common tasks in data mining. Using either a relational database or flat files the toolbox gives the user a uniform view of a data collection. Two core features of the toolbox are caching of database queries and parallelism within a collection of independent queries. Our toolbox provides a number of routines for basic data mining tasks on top of which the user can add more functions - mainly domain and data collection dependent - for complex and time consuming data mining tasks.

    Original languageEnglish
    Title of host publicationAdvances in Knowledge Discovery and Data Mining - 5th Pacific-Asia Conference, PAKDD 2001, Proceedings
    EditorsDavid Cheung, Graham J. Williams, Qing Li
    PublisherSpringer Verlag
    Pages124-135
    Number of pages12
    ISBN (Print)3540419101, 9783540419105
    DOIs
    Publication statusPublished - 2001
    Event5th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2001 - Kowloon, Hong Kong
    Duration: 16 Apr 200118 Apr 2001

    Publication series

    NameLecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
    Volume2035
    ISSN (Print)0302-9743

    Conference

    Conference5th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2001
    Country/TerritoryHong Kong
    CityKowloon
    Period16/04/0118/04/01

    Fingerprint

    Dive into the research topics of 'A toolbox approach to flexible and efficient data mining'. Together they form a unique fingerprint.

    Cite this