The sub-band cepstrum as a tool for locating local spectral regions of phonetic sensitivity: A first attempt with multi-speaker vowel data

Research output: Contribution to conference › Paper › peer-review

Abstract

Phonetic information is well known to be unevenly encoded throughout vowel spectra, implying the existence of sub-band regions sensitive to that information. This work exploits band-limited cepstral coefficients (BLCCs) to locate such regions and quantify their sensitivity through vowel classification. BLCCs are acoustic parameters representing sub-band spectra; their extraction involves a linear transformation of full-band cepstral coefficients (CCs) with flexible sub-band selection. Here, 18 sub-bands spanning the full band [0-4 kHz] and their respective BLCCs are used to classify Japanese vowels from 306 native male speakers. Classification accuracy is high in sub-bands where phonetic differences between vowels are most significant. Such sub-bands lie mainly in the low-frequency range, as expected, but do not align exclusively with formant regions. These findings suggest that BLCCs are potentially very useful for gaining detailed phonetic insights with flexible sub-band focus and efficient computation.
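
The idea that sub-band cepstra follow from full-band CCs by a linear transformation can be illustrated with a minimal sketch. It is not the paper's formula. It assumes the common convention that a real cepstrum c_0..c_N defines the log spectrum c_0 + 2·Σ c_n·cos(nω), stretches the chosen band onto [0, π], and takes cosine-series coefficients of the result by midpoint quadrature. The function name `blcc_matrix`, the 8 kHz sampling rate (so that 4 kHz is the Nyquist frequency), the grid size, and the example coefficient values are all hypothetical.

```python
import numpy as np

def blcc_matrix(f_lo, f_hi, n_full, n_sub, fs=8000.0, n_grid=2048):
    """Return A such that b = A @ c maps full-band cepstral coefficients
    c[0..n_full] to sub-band coefficients b[0..n_sub] for [f_lo, f_hi] Hz.

    Assumed convention (not taken from the paper):
        log-spectrum(w) = c[0] + 2 * sum_{n>=1} c[n] * cos(n * w)
    """
    w_lo = 2.0 * np.pi * f_lo / fs            # band edges in radians/sample
    w_hi = 2.0 * np.pi * f_hi / fs
    # Midpoint grid on [0, pi] for the stretched sub-band axis.
    theta = (np.arange(n_grid) + 0.5) * np.pi / n_grid
    # Corresponding full-band frequencies inside the selected band.
    omega = w_lo + theta * (w_hi - w_lo) / np.pi

    # Log-spectrum basis evaluated inside the band: columns 1, 2cos(w), ...
    full_basis = 2.0 * np.cos(np.outer(omega, np.arange(n_full + 1)))
    full_basis[:, 0] = 1.0
    # Cosine analysis basis on the stretched axis: rows cos(k * theta).
    sub_basis = np.cos(np.outer(np.arange(n_sub + 1), theta))

    # Midpoint-rule approximation of (1/pi) * integral over [0, pi].
    return sub_basis @ full_basis / n_grid

# Hypothetical full-band coefficients c[0..4], for illustration only.
c = np.array([1.0, 0.5, -0.3, 0.2, 0.1])
A = blcc_matrix(500.0, 1500.0, n_full=len(c) - 1, n_sub=3)
b = A @ c   # sub-band coefficients for the 500-1500 Hz band
```

Under these assumptions the matrix depends only on the band edges and the coefficient orders, so it can be computed once per sub-band and reused for every frame and speaker. When the band covers the whole 0-4 kHz range, the matrix reduces to the identity and the full-band coefficients are returned unchanged.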
Original language: English
Pages: 1535-1539
Number of pages: 5
Publication status: Published - 2024
Event: Interspeech 2024 - Kos, Greece
Duration: 1 Sept 2024 - 5 Sept 2024

Conference

Conference: Interspeech 2024
Country/Territory: Greece
City: Kos
Period: 1/09/24 - 5/09/24
