Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/81365
Title: Time-shifting based primary-ambient extraction for spatial audio reproduction
Authors: He, Jianjun
Gan, Woon-Seng
Tan, Ee-Leng
Keywords: Primary-ambient extraction (PAE)
Principal component analysis (PCA)
Spatial audio
Spatial cues
Issue Date: 2015
Source: He, J., Gan, W.-S., & Tan, E.-L. (2015). Time-Shifting Based Primary-Ambient Extraction for Spatial Audio Reproduction. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 23(10), 1576-1588.
Series/Report no.: IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)
Abstract: One of the key issues in spatial audio analysis and reproduction is to decompose a signal into primary and ambient components based on their directional and diffuse spatial features, respectively. Existing approaches employed in primary-ambient extraction (PAE), such as principal component analysis (PCA), are mainly based on a basic stereo signal model. The performance of these PAE approaches has not been well studied for the input signals that do not satisfy all the assumptions of the stereo signal model. In practice, one such case commonly encountered is that the primary components of the stereo signal are partially correlated at zero lag, referred to as the primary-complex case. In this paper, we take PCA as a representative of existing PAE approaches and investigate the performance degradation of PAE with respect to the correlation of the primary components in the primary-complex case. A time-shifting technique is proposed in PAE to alleviate the performance degradation due to the low correlation of the primary components in such stereo signals. This technique involves time-shifting the input signal according to the estimated inter-channel time difference of the primary component prior to the signal decomposition using conventional PAE approaches. To avoid the switching artifacts caused by the varied time-shifting in successive time frames, overlapped output mapping is suggested. Based on the results from our experiments, PAE approaches with the proposed time-shifting technique are found to be superior to the conventional PAE approaches in terms of extraction accuracy and spatial accuracy.
URI: https://hdl.handle.net/10356/81365
http://hdl.handle.net/10220/39538
ISSN: 2329-9290
DOI: http://dx.doi.org/10.1109/TASLP.2015.2439577
Rights: © 2015 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/TASLP.2015.2439577].
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:EEE Journal Articles

Files in This Item:
File Description SizeFormat 
Time-shifting based primary-ambient extraction for spatial audio reproduction.pdf1.09 MBAdobe PDFThumbnail
View/Open

Google ScholarTM

Check

Altmetric

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.