Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/175084
Title: Embodied object hunt
Authors: Kam, Rainer I-Wen
Keywords: Computer and Information Science
Issue Date: 2024
Publisher: Nanyang Technological University
Source: Kam, R. I. (2024). Embodied object hunt. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175084
Project: SCSE23-0037 
Abstract: This study investigates the use of multimodal encoders in the Embodied Object Hunt task. The motivation behind this approach is recent developments in joint multimodal encoders such as CLIP that are able to extract common features between images and text. This ability is ideal for tasks combining imagery and text, such as the Embodied Object Hunt using visual observations and textual input prompts. This study also explores using intrinsic curiosity rewards to supplement agent learning, encouraging agents to explore their environment and facilitate learning. This study compares agents trained using CLIP embeddings and intrinsic curiosity and those without, and analyzes the key differences between their training results. The results of this study can be used to understand the effectiveness and feasibility of using different approaches to train embodied agents, serving as an exploratory study that future improvements can be based upon.
URI: https://hdl.handle.net/10356/175084
Schools: School of Computer Science and Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Kam Rainer I-Wen_Final Year Report.pdf
  Restricted Access
5.57 MBAdobe PDFView/Open

Page view(s)

80
Updated on Mar 14, 2025

Download(s)

2
Updated on Mar 14, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.