A comparison of machine learning algorithms and human listeners in the identification of phonemic contrasts

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    To elucidate the processes by which automatic speech recognition (ASR) architectures reach transcription decisions, our study compared human and ASR responses to stimuli with manipulated cues for stop manner (burst, silence, and vocalic onset) and voicing (voice onset time, aspiration amplitude, and vocalic onset). Fourteen participants and two ASR systems completed a forced-response identification task. Results indicated that the cues were of perceptual significance for human participants, and though weighted differently, significant predictors of ASR output. This demonstrated that ASR systems may be relying on the same key acoustic information as do human listeners for phonemic classification.
    Original languageEnglish
    Title of host publicationProceedings of the Eighteenth Australasian International Conference on Speech Science and Technology
    Place of PublicationCanberra, Australia
    PublisherThe Australasian Speech Science and Technology Association, Inc.
    Pages41-45
    Publication statusPublished - 2022
    Event18th Australasian International Conference on Speech Science and Technology - Canberra, Australia
    Duration: 13 Dec 202216 Dec 2022
    https://sst2022.com/proceedings/

    Conference

    Conference18th Australasian International Conference on Speech Science and Technology
    Abbreviated titleSST2022
    Country/TerritoryAustralia
    CityCanberra
    Period13/12/2216/12/22
    OtherThe Australasian Speech Science and Technology Association and the Australian National University are pleased to host the 18th Australasian International Conference on Speech Science and Technology (SST2022).

    SST is an international interdisciplinary conference designed to foster collaboration among speech scientists, engineers, psycholinguists, audiologists, linguists, speech/language pathologists and industrial partners, and welcomes submissions from all areas of speech science and technology.
    Internet address

    Fingerprint

    Dive into the research topics of 'A comparison of machine learning algorithms and human listeners in the identification of phonemic contrasts'. Together they form a unique fingerprint.

    Cite this