Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/55020
Title: Visual event recognition
Authors: Gong, Li.
Keywords: DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Issue Date: 2013
Abstract: This report summarizes the work that has been done in the final year project of recognizing visual events in videos. It starts with image recognition, in which im- ages are represented in spatial pyramids. Such representations are then input into SVM and KNN for recognition. In video recognition, bag of words and special- ized Gaussian Mixture Models are employed to represent videos, and respective distance calculation is used to measure video-to-video distance. These distance matrices are then input into SVM for recognition using different kernel types. Also, four domain adaptation methods are implemented to recognize Kodak con- sumer videos using Youtube videos. Adaptive multiple kernel learning achieves the best and improves the mean average precision from 44.33% to 61.40%. Last but not least, a web-based demo system is implemented in two modes to visually demonstrate the underlying recognition system.
URI: http://hdl.handle.net/10356/55020
Schools: School of Computer Engineering 
Research Centres: Centre for Multimedia and Network Technology 
Rights: Nanyang Technological University
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
FinalReportV3.pdf
  Restricted Access
6.23 MBAdobe PDFView/Open

Page view(s) 50

491
Updated on May 7, 2025

Download(s)

20
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.