Abstract
Speech visualisations are known to help language learners to acquire correct pronunciation and promote a better study experience. We present a two-step approach based on two established techniques to display tongue tip movements of an acoustic speech signal on a vowel space plot. First, we use Energy Entropy Ratio to extract vowels; and then, we apply the Linear Predictive Coding root method to estimate Formant 1 and Formant 2. We invited and collected acoustic data from one Modern Standard Arabic (MSA) lecturer and four MSA students. Our proof of concept was able to reflect differences between the tongue tip movements in a native MSA speaker to those of a MSA language learner at a vocabulary level. This paper addresses principle methods for generating features that reflect bio-physiological features of speech and thus, facilitates an approach that can be generally adapted to languages other than MSA.
| Original language | English |
|---|---|
| Pages (from-to) | 53-63 |
| Journal | Proceedings of the Australasian Language Technology Workshop |
| Volume | 18 |
| Publication status | Published - 2020 |
| Event | 18th Annual Workshop of the Australasian Language Technology Association, ALTA 2020 - Virtual, Online Duration: 14 Jan 2021 → 15 Jan 2021 |
Fingerprint
Dive into the research topics of 'An Automatic Vowel Space Generator for Language Learners’ Pronunciation Acquisition and Correction'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver