mirage

A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system.

DSpace/Manakin Repository

 

Search DR-NTU


Advanced Search Subject Search

Browse

My Account

A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system.

Show simple item record

dc.contributor.author Shao, Yu.
dc.contributor.author Chang, Chip Hong.
dc.date.accessioned 2009-08-05T01:01:02Z
dc.date.available 2009-08-05T01:01:02Z
dc.date.copyright 2007
dc.date.issued 2009-08-05T01:01:02Z
dc.identifier.citation Shao, Y., & Chang, C. H. (2007). A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system. IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics, 37(4), 877-889.
dc.identifier.issn 1083-4419
dc.identifier.uri http://hdl.handle.net/10220/6027
dc.description.abstract We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-tonoise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized time–frequency subtraction algorithm, which advantageously exploits the wavelet multirate signal representation to preserve the critical transient information. Simultaneous masking and temporal masking of the human auditory system are modeled by the perceptual wavelet packet transform via the frequency and temporal localization of speech components. The wavelet coefficients are used to calculate the Bark spreading energy and temporal spreading energy, from which a time–frequency masking threshold is deduced to adaptively adjust the subtraction parameters of the proposed method. An unvoiced speech enhancement algorithm is also integrated into the system to improve the intelligibility of speech. Through rigorous objective and subjective evaluations, it is shown that the proposed speech enhancement system is capable of reducing noise with little speech degradation in adverse noise environments and the overall performance is superior to several competitive methods.
dc.format.extent 13 p.
dc.language.iso en
dc.relation.ispartofseries IEEE transactions on systems, man, and cybernetics-part B : cybernetics
dc.rights IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics © copyright 2007 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder. http://www.ieee.org/portal/site.
dc.subject DRNTU::Engineering::Electrical and electronic engineering.
dc.title A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system.
dc.type Journal Article
dc.contributor.school School of Electrical and Electronic Engineering
dc.identifier.doi http://dx.doi.org/10.1109/TSMCB.2007.895365
dc.description.version Published version

Files in this item

Files Size Format View Description
equency subtrac ... human auditory system.pdf 1.286Mb PDF View/Open Published version

This item appears in the following Collection(s)

Show simple item record

Statistics

Total views

All Items Views
A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system. 278

Total downloads

All Bitstreams Views
equency subtraction method for robust speech enhancement based on wavelet filter bank modeling of human auditory system.pdf 351

Top country downloads

Country Code Views
United States of America 111
India 46
China 39
Taiwan 27
Vietnam 24

Top city downloads

city Views
Mountain View 57
Hanoi 20
Singapore 8
Taipei 8
Bellevue 7