Assessment of L2 intelligibility: Comparing L1 listeners and automatic speech recognition

Solène Inceoglu, Wen Hsin Chen, Hyojung Lim

    Research output: Contribution to journalArticlepeer-review

    13 Citations (Scopus)

    Abstract

    An increasing number of studies are exploring the benefits of automatic speech recognition (ASR)-based dictation programs for second language (L2) pronunciation learning (e.g. Chen, Inceoglu & Lim, 2020; Liakin, Cardoso & Liakina, 2015; McCrocklin, 2019), but how ASR recognizes accented speech and the nature of the feedback it provides to language learners is still largely under-researched. The current study explores whether the intelligibility of L2 speakers differs when assessed by native (L1) listeners versus ASR technology, and reports on the types of intelligibility issues encountered by the two groups. Twelve L1 listeners of English transcribed 48 isolated words targeting the -i/ and /æ-ϵ/ contrasts and 24 short sentences that four Taiwanese intermediate learners of English had produced using Google's ASR dictation system. Overall, the results revealed lower intelligibility scores for the word task (ASR: 40.81%, L1 listeners: 38.62%) than the sentence task (ASR: 75.52%, L1 listeners: 83.88%), and highlighted strong similarities in the error types - and their proportions - identified by ASR and the L1 listeners. However, despite similar recognition scores, correlations indicated that the ASR recognition of the L2 speakers' oral productions mirrored the L1 listeners' judgments of intelligibility in the word and sentence tasks for only one speaker, with significant positive correlations for one additional speaker in each task. This suggests that the extent to which ASR approaches L1 listeners at recognizing accented speech may depend on individual speakers and the type of oral speech.

    Original languageEnglish
    Pages (from-to)89-104
    Number of pages16
    JournalReCALL
    Volume35
    Issue number1
    DOIs
    Publication statusPublished - 18 Jan 2023

    Fingerprint

    Dive into the research topics of 'Assessment of L2 intelligibility: Comparing L1 listeners and automatic speech recognition'. Together they form a unique fingerprint.

    Cite this