Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/184048
Title: Enhancing stereo vision estimation with custom Berhu-Gradient loss
Authors: Goh, Glendon Xian Hao
Keywords: Computer and Information Science
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Goh, G. X. H. (2025). Enhancing stereo vision estimation with custom Berhu-Gradient loss. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184048
Project: CCDS24-0229
Abstract: Stereo vision and disparity estimation play a critical role in allowing machines to perceive depth, with its applications spanning from various real-time systems such as autonomous driving, robotics to medical imaging. The practicality of these algorithms in real-time systems are hence tied closely to its efficiency and accuracy. As such, exploration have been done to employ deep learning for stereo vision, aimed at improving the performance in terms of accuracy and efficiency. Deep learning models generally outperform traditional stereo vision techniques in handling challenging regions like occlusions and texture-less regions. Despite these advancements, optimising disparity estimation models for higher accuracy and efficiency is still an ongoing challenge. Thus, we would like to explore a novel loss function and evaluate its effects on the accuracy of disparity estimation. This research aims to analyse the accuracy of the resulting model from our novel loss function. To achieve this, we train a baseline model with Huber loss function and compare it against our model which is trained on Berhu-Gradient loss function. The methodology involves modifying the loss function from a smooth L1 function (Huber Loss) to a hybrid between Berhu Loss and Gradient Loss. We will compare the results of the baseline model against our model and discuss the implications of the new loss function. The evaluation metric used includes end-point error (EPE) and disparity accuracy thresholds to assess improvements in depth estimation performance. To further improve on the accuracy, an ablation study is done to fine tune hyperparameters within the new hybrid function.
URI: https://hdl.handle.net/10356/184048
Schools: College of Computing and Data Science 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Fypreport2.pdf
  Restricted Access
11.8 MBAdobe PDFView/Open

Page view(s)

28
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.