BenchMake: turn any scientific data set into a reproducible benchmark

A. S. Barnard*

*Corresponding author for this work

    Research output: Contribution to journalArticlepeer-review

    1 Citation (Scopus)

    Abstract

    Benchmark data sets are a cornerstone of machine learning development and applications, ensuring new methods are robust, reliable and competitive. The relative rarity of benchmark sets in computational science, due to the uniqueness of the problems and the pace of change in the associated domains, makes evaluating new innovations difficult for computational scientists. In this paper a new tool is developed and tested to potentially turn any of the increasing numbers of scientific data sets made openly available into a benchmark accessible to the community. BenchMake uses non-negative matrix factorization to deterministically identify and isolate challenging edge cases on the convex hull (the smallest convex set that contains all existing data instances) and partitions a required fraction of matched data instances into a testing set that maximizes divergence and statistical significance, across tabular, graph, image, signal and textual modalities. BenchMake splits are compared to establish splits and random splits using ten publicly available benchmark sets from different areas of science, with different sizes, shapes, distributions.

    Original languageEnglish
    Article number030502
    Pages (from-to)1-11
    Number of pages11
    JournalMachine Learning: Science and Technology
    Volume6
    Issue number3
    DOIs
    Publication statusPublished - 30 Sept 2025

    Fingerprint

    Dive into the research topics of 'BenchMake: turn any scientific data set into a reproducible benchmark'. Together they form a unique fingerprint.

    Cite this