Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/179825
Title: Metric-semantic map representation for natural-language-based navigation
Authors: Liu, Ruimeng
Keywords: Computer and Information Science
Engineering
Issue Date: 2024
Publisher: Nanyang Technological University
Source: Liu, R. (2024). Metric-semantic map representation for natural-language-based navigation. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/179825
Abstract: The widespread of Large Language Models has sparked a rapid development of general embodied Artificial Intelligence (AI). Many methods to integrate LLMs with robot agent for downstream tasks like manipulation or navigation have been proposed. LLMs have demonstrated exceptionally strong capabilities in understanding, planning, and reasoning to work as the brain of a robot agent. However, grounding the model and deploying it in real world remain challeng- ing. Motivated by the need for more adaptable and intelligent robotic navigation systems, we present a novel and efficient system designed to complete naviga- tion tasks under natural language instructions. We introduce a GPU-accelerating volumetric semantic map for scene representation and environment perception. We use prompts that combine world perception and robot capabilities, enabling LLMs to generate executable plans directly as programs, and build a close- loop robot control framework that enhances task planning and error manage- ment through an integrated feedback structure at both high and low levels. Our framework demonstrates state of the art success rates in zero-shot goal naviga- tion tasks in real-world experiments.
URI: https://hdl.handle.net/10356/179825
Schools: School of Electrical and Electronic Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Theses

Files in This Item:
File Description SizeFormat 
LiuRuimeng_MSc_Dissertation_Modified.pdf
  Restricted Access
14.77 MBAdobe PDFView/Open

Page view(s)

118
Updated on Mar 21, 2025

Download(s)

2
Updated on Mar 21, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.