Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/153862
Title: Key frame extraction from a big dataset
Authors: Agarwal, Mehal
Keywords: Engineering::Electrical and electronic engineering
Issue Date: 2021
Publisher: Nanyang Technological University
Source: Agarwal, M. (2021). Key frame extraction from a big dataset. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/153862
Project: B3365-202
Abstract: An autonomous vehicle is an automobile platform capable of sensing and reacting to its immediate environment in an attempt to eliminate the need for human drivers. In autonomous driving, taking decisions like overtaking a vehicle or defining a route requires environmental perception, localization, and planning. In particular, object detection is one of the core modules in an autonomous vehicle as perception plays a central role in many tasks ranging from localization to obstacle avoidance and general motion planning. To enable multi-modal perception, autonomous vehicles use several on-board sensors such as cameras, radars, and lidars which provide data which are utilized by deep-learning based object detectors. A large, diverse and accurately labeled dataset is essential for perception tasks like object detection. However, the problem encountered is that the costs pertaining to human annotation of a large dataset is very expensive even for large companies and results in diminishing returns. This project aims to solve this problem by designing a keyframe filter package to extract keyframes from a big dataset and the extracted useful images are sent to annotators for manual labeling. The approach to perform keyframe extraction explored in this project uses heuristics to determine keyframes in a dataset by performing 2D multi-label tagging of images. The multi-label tagging of images is implemented using one of the state-of-the-art object detection frameworks, Faster Region-based Convolutional Neural Network (Faster-RCNN). This project proposes a novel addition to improve the Faster-RCNN model by including the visibility detection feature. The keyframe filter package permits the use of a subset of the raw data for annotation while optimizing the model performance and reducing the costs incurred.
URI: https://hdl.handle.net/10356/153862
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
FYP Report.pdf
  Restricted Access
Final FYP Report2.65 MBAdobe PDFView/Open

Page view(s)

13
Updated on Jan 23, 2022

Download(s)

6
Updated on Jan 23, 2022

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.