An Efficient and Lightweight Structure for Spatial-Temporal Feature Extraction in Video Super Resolution

Xiaonan He, Yukun Xia, Yuansong Qiao, Brian Lee, Yuhang Ye

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Video Super Resolution (VSR) model based on deep convolutional neural network (CNN) uses multiple Low-Resolution (LR) frames as input and has a strong ability to recover High-Resolution (HR) frames and maintain video temporal information. However, to realize the above advantages, VSR must consider both spatial and temporal information to improve the perceived quality of the output video, leading to expensive operations such as cross-frame convolution. Therefore, how to balance the output video quality and computational cost is a worthy issue to be studied. To address the above problem, we propose an efficient and lightweight multi-scale 3D video super-resolution scheme that arranges 3D convolution features extraction blocks using a U-Net structure to achieve multi-scale feature extraction in both spatial and temporal dimensions. Quantitative and qualitative evaluation results on public video datasets show that compared to other simple cascaded spatial-temporal feature extraction structures, an U-Net structure achieves comparable texture details and temporal consistency while with a significant reduction in computation costs and latency.

Original languageEnglish
Title of host publicationAdvances in Computer Graphics - 40th Computer Graphics International Conference, CGI 2023, Proceedings
EditorsBin Sheng, Lei Bi, Jinman Kim, Nadia Magnenat-Thalmann, Daniel Thalmann
PublisherSpringer Science and Business Media Deutschland GmbH
Pages362-374
Number of pages13
ISBN (Print)9783031500688
DOIs
Publication statusPublished - 2024
Event40th Computer Graphics International Conference, CGI 2023 - Shanghai, China
Duration: 28 Aug 20231 Sep 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14495
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference40th Computer Graphics International Conference, CGI 2023
Country/TerritoryChina
CityShanghai
Period28/08/231/09/23

Keywords

  • 3D convolution
  • Efficiency
  • U-Net
  • Video Super Resolution

Fingerprint

Dive into the research topics of 'An Efficient and Lightweight Structure for Spatial-Temporal Feature Extraction in Video Super Resolution'. Together they form a unique fingerprint.

Cite this