Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/156866
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | U S Vaitesswar | en_US |
dc.date.accessioned | 2022-04-26T06:25:10Z | - |
dc.date.available | 2022-04-26T06:25:10Z | - |
dc.date.issued | 2022 | - |
dc.identifier.citation | U S Vaitesswar (2022). Skeleton-based human action recognition with graph neural networks. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156866 | en_US |
dc.identifier.uri | https://hdl.handle.net/10356/156866 | - |
dc.description.abstract | Skeleton-based action recognition is a long-standing task in computer vision that aims to distinguish human actions by identifying their characteristic patterns in the input data. Most existing GCN-based models developed for this task model the skeleton graph as either directed or undirected. These models also restrict the receptive field in the temporal domain to a fixed range, which significantly limits their expressiveness. Therefore, MMGCN, a mixed graph network comprising both directed and undirected graph networks with a multi-range temporal module, is proposed. In this way, the model can benefit from the different interpretations of the same action offered by the different graphs. In addition, the multi-range temporal module enhances the model’s expressiveness because it can choose the appropriate receptive field for each layer, allowing the model to adapt dynamically to the input data. With this lightweight MMGCN model, it is shown that deep learning models can learn the underlying patterns in the data and model large receptive fields without additional semantics or high model complexity. Finally, despite its low model complexity, the model achieved state-of-the-art results on the benchmark datasets NTU-RGB+D, NTU-RGB+D 120, Skeleton-Kinetics and Northwestern-UCLA, demonstrating its effectiveness. An additional study was conducted to weigh the importance of model complexity (i.e. a more nuanced architecture) against ensemble model learning (i.e. multiple input streams). The insights derived from this study will be useful for future models developed for the skeleton-based action recognition task. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Nanyang Technological University | en_US |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). | en_US |
dc.subject | Engineering::Computer science and engineering | en_US |
dc.title | Skeleton-based human action recognition with graph neural networks | en_US |
dc.type | Thesis-Master by Research | en_US |
dc.contributor.supervisor | Yeo Chai Kiat | en_US |
dc.contributor.school | School of Computer Science and Engineering | en_US |
dc.description.degree | Master of Engineering | en_US |
dc.identifier.doi | 10.32657/10356/156866 | - |
dc.contributor.supervisoremail | ASCKYEO@ntu.edu.sg | en_US |
item.fulltext | With Fulltext | - |
item.grantfulltext | open | - |
Appears in Collections: | SCSE Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
MEng Thesis - Revised.pdf | | 1.84 MB | Adobe PDF | View/Open |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.