Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/140153
Title: Optimal region selection for stereoscopic video subtitle insertion
Authors: Yue, Guanghui
Hou, Chunping
Lei, Jianjun
Fang, Yuming
Lin, Weisi
Keywords: Engineering::Computer science and engineering
Issue Date: 2017
Source: Yue, G., Hou, C., Lei, J., Fang, Y., & Lin, W. (2018). Optimal region selection for stereoscopic video subtitle insertion. IEEE Transactions on Circuits and Systems for Video Technology, 28(11), 3141-3153. doi:10.1109/TCSVT.2017.2739756
Journal: IEEE Transactions on Circuits and Systems for Video Technology
Abstract: Stereoscopic subtitle insertion is a fundamental and essential element in stereoscopic film and TV industry. However, little work has been dedicated to the optimal region selection for stereoscopic subtitle insertion. In addition, there is no public database reported for the performance evaluation of it. In this paper, we build the first large-scale video database (TJU3D) for stereoscopic video subtitle insertion, which includes 50 video sequences with rich screen scenes. Compared with 2D subtitle region selection, there are several problems we have to consider in stereoscopic subtitle region selection: 1) the subtitle should avoid depth cue collision and occlusion from objects in stereoscopic video sequences; 2) the disparity value of the subtitle must be minimized to reduce visual discomfort; and 3) the temporal coherence constraint must be considered during region selection for subtitles in video sequences. By considering these constraints, we propose an optimal region selection algorithm for stereoscopic subtitle insertion. First, we compute the disparity map of each video frame in video sequences. For each frame, the optimal position and disparity value of the subtitle are determined by a subtitle region selection algorithm, which contains two parts (i.e., the coarse selection and fine selection). After that, by considering the temporal consistency between adjacent frames, the position and disparity value of each frame are further classified and processed in order to avoid the subtitle jitter. We evaluate the proposed method on TJU3D video database through two visual discomfort prediction metrics and one subjective experiment. To further verify the effectiveness of the proposed method, we also validate the performance of the proposed method on video comfort assessment database, i.e., IEEE-SA Stereo Database. Experimental results demonstrate that the visual discomfort is greatly reduced when using the proposed method compared with the basic method.
URI: https://hdl.handle.net/10356/140153
ISSN: 1051-8215
DOI: 10.1109/TCSVT.2017.2739756
Rights: © 2017 IEEE. All rights reserved.
Fulltext Permission: none
Fulltext Availability: No Fulltext
Appears in Collections:SCSE Journal Articles

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.