Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/172664
Title: | Entry-flipped transformer for inference and prediction of participant behavior | Authors: | Hu, Bo Cham, Tat-Jen |
Keywords: | Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision | Issue Date: | 2022 | Source: | Hu, B. & Cham, T. (2022). Entry-flipped transformer for inference and prediction of participant behavior. 17th European Conference on Computer Vision (ECCV 2022), 439-456. https://dx.doi.org/10.1007/978-3-031-19772-7_26 | Conference: | 17th European Conference on Computer Vision (ECCV 2022) | Abstract: | Some group activities, such as team sports and choreographed dances, involve closely coupled interaction between participants. Here we investigate the tasks of inferring and predicting participant behavior, in terms of motion paths and actions, under such conditions. We narrow the problem to that of estimating how a set target participants react to the behavior of other observed participants. Our key idea is to model the spatio-temporal relations among participants in a manner that is robust to error accumulation during frame-wise inference and prediction. We propose a novel Entry-Flipped Transformer (EF-Transformer), which models the relations of participants by attention mechanisms on both spatial and temporal domains. Unlike typical transformers, we tackle the problem of error accumulation by flipping the order of query, key, and value entries, to increase the importance and fidelity of observed features in the current frame. Comparative experiments show that our EF-Transformer achieves the best performance on a newly-collected tennis doubles dataset, a Ceilidh dance dataset, and two pedestrian datasets. Furthermore, it is also demonstrated that our EF-Transformer is better at limiting accumulated errors and recovering from wrong estimations. | URI: | https://hdl.handle.net/10356/172664 | ISBN: | 9783031197710 | DOI: | 10.1007/978-3-031-19772-7_26 | Schools: | School of Computer Science and Engineering | Rights: | © 2022 Association for Computing Machinery. All rights reserved. | Fulltext Permission: | none | Fulltext Availability: | No Fulltext |
Appears in Collections: | SCSE Conference Papers |
SCOPUSTM
Citations
50
3
Updated on Sep 10, 2024
Page view(s)
96
Updated on Sep 15, 2024
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.