Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/170579
Full metadata record
DC FieldValueLanguage
dc.contributor.authorPateria, Shubhamen_US
dc.contributor.authorSubagdja, Budhitamaen_US
dc.contributor.authorTan, Ah-Hweeen_US
dc.contributor.authorQuek, Chaien_US
dc.date.accessioned2023-09-19T08:52:55Z-
dc.date.available2023-09-19T08:52:55Z-
dc.date.issued2023-
dc.identifier.citationPateria, S., Subagdja, B., Tan, A. & Quek, C. (2023). Value-based subgoal discovery and path planning for reaching long-horizon goals. IEEE Transactions On Neural Networks and Learning Systems. https://dx.doi.org/10.1109/TNNLS.2023.3240004en_US
dc.identifier.issn2162-237Xen_US
dc.identifier.urihttps://hdl.handle.net/10356/170579-
dc.description.abstractLearning to reach long-horizon goals in spatial traversal tasks is a significant challenge for autonomous agents. Recent subgoal graph-based planning methods address this challenge by decomposing a goal into a sequence of shorter-horizon subgoals. These methods, however, use arbitrary heuristics for sampling or discovering subgoals, which may not conform to the cumulative reward distribution. Moreover, they are prone to learning erroneous connections (edges) between subgoals, especially those lying across obstacles. To address these issues, this article proposes a novel subgoal graph-based planning method called learning subgoal graph using value-based subgoal discovery and automatic pruning (LSGVP). The proposed method uses a subgoal discovery heuristic that is based on a cumulative reward (value) measure and yields sparse subgoals, including those lying on the higher cumulative reward paths. Moreover, LSGVP guides the agent to automatically prune the learned subgoal graph to remove the erroneous edges. The combination of these novel features helps the LSGVP agent to achieve higher cumulative positive rewards than other subgoal sampling or discovery heuristics, as well as higher goal-reaching success rates than other state-of-the-art subgoal graph-based planning methods.en_US
dc.language.isoenen_US
dc.relation.ispartofIEEE Transactions on Neural Networks and Learning Systemsen_US
dc.rights© 2023 IEEE. All rights reserved.en_US
dc.subjectEngineering::Computer science and engineeringen_US
dc.titleValue-based subgoal discovery and path planning for reaching long-horizon goalsen_US
dc.typeJournal Articleen
dc.contributor.schoolSchool of Computer Science and Engineeringen_US
dc.identifier.doi10.1109/TNNLS.2023.3240004-
dc.identifier.pmid37022814-
dc.identifier.scopus2-s2.0-85148420396-
dc.subject.keywordsMotion Planningen_US
dc.subject.keywordsPath Planningen_US
item.fulltextNo Fulltext-
item.grantfulltextnone-
Appears in Collections:SCSE Journal Articles

Page view(s)

35
Updated on Nov 29, 2023

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.