Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/172187
Title: | Temporal sentence grounding in videos: a survey and future directions | Authors: | Zhang, Hao Sun, Aixin Jing, Wei Zhou, Joey Tianyi |
Keywords: | Engineering::Computer science and engineering | Issue Date: | 2023 | Source: | Zhang, H., Sun, A., Jing, W. & Zhou, J. T. (2023). Temporal sentence grounding in videos: a survey and future directions. IEEE Transactions On Pattern Analysis and Machine Intelligence, 45(8), 10443-10465. https://dx.doi.org/10.1109/TPAMI.2023.3258628 | Project: | A18A1b0045 | Journal: | IEEE Transactions on Pattern Analysis and Machine Intelligence | Abstract: | Temporal sentence grounding in videos (TSGV), a.k.a., natural language video localization (NLVL) or video moment retrieval (VMR), aims to retrieve a temporal moment that semantically corresponds to a language query from an untrimmed video. Connecting computer vision and natural language, TSGV has drawn significant attention from researchers in both communities. This survey attempts to provide a summary of fundamental concepts in TSGV and current research status, as well as future research directions. As the background, we present a common structure of functional components in TSGV, in a tutorial style: from feature extraction from raw video and language query, to answer prediction of the target moment. Then we review the techniques for multimodal understanding and interaction, which is the key focus of TSGV for effective alignment between the two modalities. We construct a taxonomy of TSGV techniques and elaborate the methods in different categories with their strengths and weaknesses. Lastly, we discuss issues with the current TSGV research and share our insights about promising research directions. | URI: | https://hdl.handle.net/10356/172187 | ISSN: | 0162-8828 | DOI: | 10.1109/TPAMI.2023.3258628 | Schools: | School of Computer Science and Engineering | Research Centres: | S-Lab | Rights: | © 2023 IEEE. All rights reserved. | Fulltext Permission: | none | Fulltext Availability: | No Fulltext |
Appears in Collections: | SCSE Journal Articles |
SCOPUSTM
Citations
20
30
Updated on Mar 17, 2025
Page view(s)
171
Updated on Mar 17, 2025
Google ScholarTM
Check
Altmetric
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.