Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/184080
Title: Real-time visual SLAM for embedded systems
Authors: Tan, Edward Beng Wai
Keywords: Computer and Information Science
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Tan, E. B. W. (2025). Real-time visual SLAM for embedded systems. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184080
Project: CCDS24-0791
Abstract: vSLAM (Visual Simultaneous Localization And Mapping) is a key step that enables many autonomous robots to self-navigate and map a given space. Recently, there has been a shift to incorporate Deep Learning (DL) based methods into the vSLAM pipeline due to their high performance. An important part of vSLAM, known as feature extraction, is responsible for consistently extracting salient regions in the camera images over time, in order to feed downstream matching and tracking algorithms. DL based feature extractors face two main challenges: robustness of the features and latency. This thesis makes the following contributions. The first contribution focuses on quantization and power efficiency optimization of SuperPoint, a DL based feature extractor. It is shown that SuperPoint can be quantized to half precision and executed on a neural accelerator with up to 40% reduction in power consumption with almost no effect on SLAM tracking performance. The second contribution propose hardware-aware inference time optimizations of ViT-based feature extractors. The proposed methods achieve approximately 12.5% prune rate at negligible performance degradation. The pruning methods can be further coupled with hardware-aware latency/accuracy trade- off search algorithms, to exploit non-linearity characteristics that leads to 7% faster inference for a single pruned token. Next, the proposed image-adaptive learned pruning mechanism demonstrates up to 20% speedup by selectively forwarding less important tokens. It is also demonstrated that lightweight ViT backbones like EfficientFormer can achieve high performance on the feature extraction task.
URI: https://hdl.handle.net/10356/184080
Schools: College of Computing and Data Science 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
UTF-8FYP Report (Examiner's Copy) 170325.pdf
  Restricted Access
5.21 MBAdobe PDFView/Open

Page view(s)

58
Updated on May 7, 2025

Download(s)

4
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.