Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

A Video Frame Extrapolation Scheme Using Deep Learning-Based Uni-Directional Flow Estimation and Pixel Warpingopen access

Authors
Ban, Tae-Won
Issue Date
Sep-2023
Publisher
Institute of Electrical and Electronics Engineers Inc.
Keywords
Bidirectional control; deep learning; Estimation; Extrapolation; flow estimation; flow network; Generative adversarial networks; Real-time systems; Streaming media; Training; Video frame extrapolation; video frame prediction
Citation
IEEE Access, v.11, pp 1 - 1
Pages
1
Indexed
SCIE
SCOPUS
Journal Title
IEEE Access
Volume
11
Start Page
1
End Page
1
URI
https://scholarworks.gnu.ac.kr/handle/sw.gnu/68225
DOI
10.1109/ACCESS.2023.3319660
ISSN
2169-3536
Abstract
This paper investigates video frame extrapolation, which can predict future frames from current and past frames. Although there have been many studies on video frame extrapolation in recent years, most of them suffer from the unsatisfactory image quality of the predicted frames such as severe blurring because it is difficult to predict the movement of future pixels for multi-modal video frames, especially with fast changing frames. An additional process such as frame alignment or recurrent prediction can improve the quality of the predicted frames, but it hinders real-time extrapolation. Motivated by the significant progress in video frame interpolation using deep learning-based flow estimation, a simplified video frame extrapolation scheme using deep learning-based uni-directional flow estimation is proposed to reduce the processing time compared to conventional video frame extrapolation schemes without compromising the image quality of the predicted frames. In the proposed scheme, the uni-directional flow is first estimated from the current and past frames through a flow network consisting of four flow blocks and the current frame is forward-warped through the estimated flow to predict a future frame. The proposed flow network is trained and evaluated using the Vimeo-90K triplet dataset. The performance of the proposed scheme is analyzed using the trained flow network in terms of prediction time as well as the similarity between predicted and ground truth frames such as the structural similarity index measure and mean absolute error of pixels, and compared to that of the state-of-the-art schemes such as Iterative and cycleGAN schemes. Extensive experiments show that the proposed scheme improves prediction quality by 2.1% and reduces prediction time by 99.7% compared to the state-of-the-art scheme. Author
Files in This Item
There are no files associated with this item.
Appears in
Collections
해양과학대학 > 지능형통신공학과 > Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Ban, Tae Won photo

Ban, Tae Won
IT공과대학 (AI정보공학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE