Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/50632
Title: Vergence control for a biologically inspired binocular active vision system
Authors: Zhang, Xuejie
Keywords: DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Issue Date: 2010
Source: Zhang, X. (2010). Vergence control for a biologically inspired binocular active vision system. Doctoral thesis, Nanyang Technological University, Singapore.
Abstract: The human naturally possesses a robust and effective binocular vision system that utilizes saccade and vergence eye movements to explore the visual environment. This thesis studies how low-level visual functions such as vergence and saccade can provide a basis to build an active vision system that is able to robustly support scene exploration. Several available vision techniques were examined for suitability of application into human-like binocular active vision systems. A biologically inspired binocular active vision proof-of-concept platform was designed and built. This thesis focuses on the development of biologically inspired vergence control models. Vergence is the capability of human vision to gaze two eyes on a single spot in space. Through the study on the human vision system, several vergence control models were developed for the binocular vision platform. An initial vergence control model based on edge features in log polar space was first developed to achieve vergence in a simplified environment. Through further research and experimentation, investigations progressed onto a pyramidal vergence control model using disparity energy neurons. This second model utilizes fixed tuning disparity energy neurons and a pyramidal image structure to estimate disparity, resulting in near real time vergence control. Subsequently a pyramidal area correlation model was developed for simple vergence capabilities. As the search for a better solution prevailed, the model evolved into a disparity estimation model using a pyramidal fusion of log polar images. This final model superseded the previous models and exhibited robustness, providing a fundamental basis for developing real world applications for binocular foveation without object registration techniques or very accurate camera lens calibrations. Besides the developed vergence control models, the breadth of supporting technologies included the building of mechanisms that simulated saccadic eye movements through the use of image segmentation and visual attention approaches. Through the study and development of the biologically inspired components, the developed binocular vision system was able to select fixation points, reconstruct depth of the attended positions through vergence and build a simple fixation based spatial understanding of the environment. This thesis is presented with a balance of research rigour in the vergence domain and a broad-based understanding of human visual concepts dealing with viable computational solutions to saccade and vergence.
URI: https://hdl.handle.net/10356/50632
DOI: 10.32657/10356/50632
Schools: School of Computer Engineering 
Research Centres: Emerging Research Lab 
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Theses

Files in This Item:
File Description SizeFormat 
ZhangXuejie_Thesis.pdf8.17 MBAdobe PDFThumbnail
View/Open

Page view(s) 50

517
Updated on Sep 6, 2024

Download(s) 20

253
Updated on Sep 6, 2024

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.