Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/179928
Title: USN: a robust imitation learning method against diverse action noise
Authors: Yu, Xingrui
Han, Bo
Tsang, Ivor Wai-Hung
Keywords: Computer and Information Science
Issue Date: 2024
Source: Yu, X., Han, B. & Tsang, I. W. (2024). USN: a robust imitation learning method against diverse action noise. Journal of Artificial Intelligence Research, 79, 1237-1280. https://dx.doi.org/10.1613/jair.1.15819
Project: SMI-2022-MTP-06 
Journal: Journal of Artificial Intelligence Research 
Abstract: Learning from imperfect demonstrations is a crucial challenge in imitation learning (IL). Unlike existing works that still rely on the enormous effort of expert demonstrators, we consider a more cost-effective option for obtaining a large number of demonstrations. That is, hire annotators to label actions for existing image records in realistic scenarios. However, action noise can occur when annotators are not domain experts or encounter confusing states. In this work, we introduce two particular forms of action noise, i.e., state-independent and state-dependent action noise. Previous IL methods fail to achieve expert-level performance when the demonstrations contain action noise, especially the state-dependent action noise. To mitigate the harmful effects of action noises, we propose a robust learning paradigm called USN (Uncertainty-aware Sample-selection with Negative learning). The model first estimates the predictive uncertainty for all demonstration data and then selects samples with high loss based on the uncertainty measures. Finally, it updates the model parameters with additional negative learning on the selected samples. Empirical results in Box2D tasks and Atari games show that USN consistently improves the final rewards of behavioral cloning, online imitation learning, and offline imitation learning methods under various action noises. The ratio of significant improvements is up to 94.44%. Moreover, our method scales to conditional imitation learning with real-world noisy commands in urban driving.
URI: https://hdl.handle.net/10356/179928
ISSN: 1076-9757
DOI: 10.1613/jair.1.15819
Schools: School of Computer Science and Engineering 
Organisations: Centre for Frontier AI Research, A*STAR 
Singapore Institute of High-Performance Computing, A*ATAR 
Rights: © 2024 The Authors. Published by AI Access Foundation under Creative Commons Attribution License CC BY 4.0.
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Journal Articles

Files in This Item:
File Description SizeFormat 
15819wPg#s.pdf1.56 MBAdobe PDFThumbnail
View/Open

Page view(s)

60
Updated on Mar 16, 2025

Download(s)

9
Updated on Mar 16, 2025

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.