Abstract
Phonetic information is well-known to be unevenly encoded throughout vowel spectra, implying the existence of sub-band regions sensitive to that information. This work exploits bandlimited cepstral coefficients (BLCCs) to locate such regions and quantify their sensitivity through vowel classification. BLCCs are acoustic parameters representing sub-band spectra; their extraction involves a linear transformation of full-band CCs with flexible sub-band selection. Here, 18 sub-bands spanning the full band [0-4 kHz] and their respective BLCCs are used to classify Japanese vowels from 306 native male speakers. Classification accuracy is high in sub-bands where phonetic differences between vowels are the most significant. Such subbands are mainly in the low frequency range as expected, but do not exclusively align with formant regions. These findings suggest that BLCCs are potentially very useful for gaining detailed phonetic insights with flexible sub-band focus and efficient computation.
Original language | English |
---|---|
Pages | 1535-1539 |
Number of pages | 5 |
DOIs | |
Publication status | Published - 2024 |
Event | Interspeech 2024 - Kos, Greece Duration: 1 Sept 2024 → 5 Sept 2024 |
Conference
Conference | Interspeech 2024 |
---|---|
Country/Territory | Greece |
City | Kos |
Period | 1/09/24 → 5/09/24 |