Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/158000
Title: Probably pleasant? A neural-probabilistic approach to automatic masker selection for urban soundscape augmentation
Authors: Ooi, Kenenth
Watcharasupat, Karn N.
Lam, Bhan
Ong, Zhen-Ting
Gan, Woon-Seng 
Keywords: Engineering::Electrical and electronic engineering
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Science::Physics::Acoustics
Social sciences::Psychology::Applied psychology
Issue Date: 2022
Source: Ooi, K., Watcharasupat, K. N., Lam, B., Ong, Z. & Gan, W. (2022). Probably pleasant? A neural-probabilistic approach to automatic masker selection for urban soundscape augmentation. 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2022), 8887-8891. https://dx.doi.org/10.1109/ICASSP43922.2022.9746897
Project: CoT-V4-2020-1
GCP205231017
Abstract: Soundscape augmentation, which involves the addition of sounds known as “maskers” to a given soundscape, is a human-centric urban noise mitigation measure aimed at improving the overall soundscape quality. However, the choice of maskers is often predicated on laborious processes and is inflexible to the time-varying nature of real-world soundscapes. Owing to the perceptual uniqueness of each soundscape and the inherent subjectiveness of human perception, we propose a probabilistic perceptual attribute predictor (PPAP) that predicts parameters of random distributions as outputs instead of a single deterministic value. Using the PPAP, we developed a novel automatic masker selection system (AMSS), which selects optimal masker candidates based on the predicted distribution of the ISO 12913-3 Pleasantness score for a given soundscape. Via a large-scale listening test with 300 participants, we collected 12600 subjective responses, each to a unique augmented soundscape, to train the PPAP models in a 5-fold cross-validation scheme. Using a convolutional recurrent neural network backbone and experimenting with several variants of the attention mechanism for the PPAP, we evaluated the proposed system on a blind test set with 48 unseen augmented soundscapes to assess the effectiveness of the probabilistic output scheme over traditional deterministic systems.
URI: https://hdl.handle.net/10356/158000
ISBN: 978-1-6654-0540-9
ISSN: 2379-190X
DOI: 10.1109/ICASSP43922.2022.9746897
DOI (Related Dataset): 10.21979/N9/YSJQKD
Rights: © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/ICASSP43922.2022.9746897.
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:EEE Conference Papers

Files in This Item:
File Description SizeFormat 
_ICASSP22__SiNAPS_Masker_Selection (2).pdfFinal Accepted Version489.33 kBAdobe PDFView/Open

SCOPUSTM   
Citations 50

1
Updated on Nov 24, 2022

Page view(s)

105
Updated on Dec 1, 2022

Download(s) 50

54
Updated on Dec 1, 2022

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.