An efficient alternative to SVM based recursive feature elimination with applications in natural language processing and bioinformatics

Justin Bedo, Conrad Sanderson, Adam Kowalczyk

    Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

    30 Citations (Scopus)

    Abstract

    The SVM based Recursive Feature Elimination (RFE-SVM) algorithm is a popular technique for feature selection, used in natural language processing and bioinformatics. Recently it was demonstrated that a small regularisation constant C can considerably improve the performance of RFE-SVM on microarray datasets. In this paper we show that further improvements are possible if the explicitly computable limit C → 0 is used. We prove that in this limit most forms of SVM and ridge regression classifiers scaled by the factor 1/C converge to a centroid classifier. As this classifier can be used directly for feature ranking, in the limit we can avoid the computationally demanding recursion and convex optimisation in RFE-SVM. Comparisons on two text based author verification tasks and on three genomic microarray classification tasks indicate that this straightforward method can surprisingly obtain comparable (at times superior) performance and is about an order of magnitude faster.
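
The feature-ranking idea in the abstract can be illustrated with a minimal sketch. It assumes a two-class problem with labels +1 and -1, takes the centroid classifier's weight vector to be the difference between the two class means, and ranks features by the absolute value of their weight. The ranking criterion, the function names `centroid_weights` and `rank_features`, and the toy data are assumptions made for illustration; they are not taken from the paper.

```python
from typing import List, Sequence


def centroid_weights(X: Sequence[Sequence[float]], y: Sequence[int]) -> List[float]:
    """Return the difference between the +1 class centroid and the -1 class centroid."""
    pos = [row for row, label in zip(X, y) if label == 1]
    neg = [row for row, label in zip(X, y) if label == -1]
    if not pos or not neg:
        raise ValueError("both classes need at least one sample")
    n_features = len(X[0])
    # Per-feature mean of each class.
    mu_pos = [sum(row[j] for row in pos) / len(pos) for j in range(n_features)]
    mu_neg = [sum(row[j] for row in neg) / len(neg) for j in range(n_features)]
    return [p - n for p, n in zip(mu_pos, mu_neg)]


def rank_features(X: Sequence[Sequence[float]], y: Sequence[int]) -> List[int]:
    """Return feature indices ordered from largest to smallest |weight| (assumed criterion)."""
    w = centroid_weights(X, y)
    return sorted(range(len(w)), key=lambda j: abs(w[j]), reverse=True)


# Toy data (invented): feature 1 separates the classes, features 0 and 2 barely do.
X = [
    [1.0, 5.0, 0.0],
    [1.2, 4.0, 0.1],
    [0.9, -5.0, 0.0],
    [1.1, -4.0, 0.2],
]
y = [1, 1, -1, -1]

ranking = rank_features(X, y)
top_k = ranking[:1]  # keep only the highest-ranked feature
```

Because the weights come from a single pass over the data, no recursion or convex optimisation is involved, which is the source of the speed advantage the abstract describes.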

    Original language: English
    Title of host publication: AI 2006
    Subtitle of host publication: Advances in Artificial Intelligence - 19th Australian Joint Conference on Artificial Intelligence, Proceedings
    Publisher: Springer Verlag
    Pages: 170-180
    Number of pages: 11
    ISBN (Print): 9783540497875
    Publication status: Published - 2006
    Event: 19th Australian Joint Conference on Artificial Intelligence, AI 2006 - Hobart, TAS, Australia
    Duration: 4 Dec 2006 to 8 Dec 2006

    Publication series

    Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume: 4304 LNAI
    ISSN (Print): 0302-9743
    ISSN (Electronic): 1611-3349

    Conference

    Conference: 19th Australian Joint Conference on Artificial Intelligence, AI 2006
    Country/Territory: Australia
    City: Hobart, TAS
    Period: 4/12/06 to 8/12/06
