Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/55020
Title: | Visual event recognition | Authors: | Gong, Li. | Keywords: | DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition | Issue Date: | 2013 | Abstract: | This report summarizes the work that has been done in the final year project of recognizing visual events in videos. It starts with image recognition, in which im- ages are represented in spatial pyramids. Such representations are then input into SVM and KNN for recognition. In video recognition, bag of words and special- ized Gaussian Mixture Models are employed to represent videos, and respective distance calculation is used to measure video-to-video distance. These distance matrices are then input into SVM for recognition using different kernel types. Also, four domain adaptation methods are implemented to recognize Kodak con- sumer videos using Youtube videos. Adaptive multiple kernel learning achieves the best and improves the mean average precision from 44.33% to 61.40%. Last but not least, a web-based demo system is implemented in two modes to visually demonstrate the underlying recognition system. | URI: | http://hdl.handle.net/10356/55020 | Schools: | School of Computer Engineering | Research Centres: | Centre for Multimedia and Network Technology | Rights: | Nanyang Technological University | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | SCSE Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
FinalReportV3.pdf Restricted Access | 6.23 MB | Adobe PDF | View/Open |
Page view(s) 50
491
Updated on May 7, 2025
Download(s)
20
Updated on May 7, 2025
Google ScholarTM
Check
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.