A Video Frame Extrapolation Scheme Using Deep Learning-Based Uni-Directional Flow Estimation and Pixel Warpingopen access
- Authors
- Ban, Tae-Won
- Issue Date
- Sep-2023
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- Bidirectional control; deep learning; Estimation; Extrapolation; flow estimation; flow network; Generative adversarial networks; Real-time systems; Streaming media; Training; Video frame extrapolation; video frame prediction
- Citation
- IEEE Access, v.11, pp 1 - 1
- Pages
- 1
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE Access
- Volume
- 11
- Start Page
- 1
- End Page
- 1
- URI
- https://scholarworks.gnu.ac.kr/handle/sw.gnu/68225
- DOI
- 10.1109/ACCESS.2023.3319660
- ISSN
- 2169-3536
- Abstract
- This paper investigates video frame extrapolation, which can predict future frames from current and past frames. Although there have been many studies on video frame extrapolation in recent years, most of them suffer from the unsatisfactory image quality of the predicted frames such as severe blurring because it is difficult to predict the movement of future pixels for multi-modal video frames, especially with fast changing frames. An additional process such as frame alignment or recurrent prediction can improve the quality of the predicted frames, but it hinders real-time extrapolation. Motivated by the significant progress in video frame interpolation using deep learning-based flow estimation, a simplified video frame extrapolation scheme using deep learning-based uni-directional flow estimation is proposed to reduce the processing time compared to conventional video frame extrapolation schemes without compromising the image quality of the predicted frames. In the proposed scheme, the uni-directional flow is first estimated from the current and past frames through a flow network consisting of four flow blocks and the current frame is forward-warped through the estimated flow to predict a future frame. The proposed flow network is trained and evaluated using the Vimeo-90K triplet dataset. The performance of the proposed scheme is analyzed using the trained flow network in terms of prediction time as well as the similarity between predicted and ground truth frames such as the structural similarity index measure and mean absolute error of pixels, and compared to that of the state-of-the-art schemes such as Iterative and cycleGAN schemes. Extensive experiments show that the proposed scheme improves prediction quality by 2.1% and reduces prediction time by 99.7% compared to the state-of-the-art scheme. Author
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - 해양과학대학 > 지능형통신공학과 > Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.