Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/178457
Title: Exploring the effectiveness of video perceptual representation in blind video quality assessment
Authors: Liao, Liang
Xu, Kangmin
Wu, Haoning
Chen, Chaofeng
Sun, Wenxiu
Yan, Qiong
Lin, Weisi
Keywords: Computer and Information Science
Issue Date: 2022
Source: Liao, L., Xu, K., Wu, H., Chen, C., Sun, W., Yan, Q. & Lin, W. (2022). Exploring the effectiveness of video perceptual representation in blind video quality assessment. 30th ACM International Conference on Multimedia (MM '22), 837-846. https://dx.doi.org/10.1145/3503161.3547849
Conference: 30th ACM International Conference on Multimedia (MM '22)
Abstract: With the rapid growth of in-The-wild videos taken by non-specialists, blind video quality assessment (VQA) has become a challenging and demanding problem. Although lots of efforts have been made to solve this problem, it remains unclear how the human visual system (HVS) relates to the temporal quality of videos. Meanwhile, recent work has found that the frames of natural video transformed into the perceptual domain of the HVS tend to form a straight trajectory of the representations. With the obtained insight that distortion impairs the perceived video quality and results in a curved trajectory of the perceptual representation, we propose a temporal perceptual quality index (TPQI) to measure the temporal distortion by describing the graphic morphology of the representation. Specifically, we first extract the video perceptual representations from the lateral geniculate nucleus (LGN) and primary visual area (V1) of the HVS, and then measure the straightness and compactness of their trajectories to quantify the degradation in naturalness and content continuity of video. Experiments show that the perceptual representation in the HVS is an effective way of predicting subjective temporal quality, and thus TPQI can, for the first time, achieve comparable performance to the spatial quality metric and be even more effective in assessing videos with large temporal variations. We further demonstrate that by combining with NIQE, a spatial quality metric, TPQI can achieve top performance over popular in-The-wild video datasets. More importantly, TPQI does not require any additional information beyond the video being evaluated and thus can be applied to any datasets without parameter tuning. Source code is available at https://github.com/UoLMM/TPQI-VQA.
URI: https://hdl.handle.net/10356/178457
ISBN: 9781450392037
DOI: 10.1145/3503161.3547849
Schools: College of Computing and Data Science 
School of Computer Science and Engineering 
Research Centres: S-Lab
Rights: © 2022 Association for Computing Machinery. All rights reserved.
Fulltext Permission: none
Fulltext Availability: No Fulltext
Appears in Collections:CCDS Conference Papers

SCOPUSTM   
Citations 20

15
Updated on Sep 8, 2024

Page view(s)

54
Updated on Sep 8, 2024

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.