Spatial feature learning for robust binaural sound source localization using a composite feature vector

Xiang Wu, Dumidu S. Talagala, Wen Zhang, Thushara D. Abhayapala

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    7 Citations (Scopus)

    Abstract

    The performance of binaural speech source localization systems can be significantly impacted by an imperfect selection of spatial localization cues, due to the limited bandwidth of speech, and the effects of noise. In order to mitigate these impacts, this paper presents a novel method that combines a deterministic localization approach with a spatial feature learning process. Here, we (i) obtain a composite feature vector derived from analysing the mutual information between different spatial cues and (ii) estimate the optimum feature combination that minimizes the angular localization error in three dimensional space. The performance of the proposed mutual information based feature learning approach is evaluated for a range of speech inputs and noise conditions. We also demonstrate that the proposed approach improves the localization accuracy and its robustness, with respect to traditional localization algorithms, especially in the relatively low signal-to-noise ratio localization scenarios.

    Original languageEnglish
    Title of host publication2016 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 - Proceedings
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages6320-6324
    Number of pages5
    ISBN (Electronic)9781479999880
    DOIs
    Publication statusPublished - 18 May 2016
    Event41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 - Shanghai, China
    Duration: 20 Mar 201625 Mar 2016

    Publication series

    NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
    Volume2016-May
    ISSN (Print)1520-6149

    Conference

    Conference41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016
    Country/TerritoryChina
    CityShanghai
    Period20/03/1625/03/16

    Fingerprint

    Dive into the research topics of 'Spatial feature learning for robust binaural sound source localization using a composite feature vector'. Together they form a unique fingerprint.

    Cite this