Paper
22 July 2024 DVT: transformer-based style transfer from reality to art
Qinghao Zhang, Shangjiaqi Hao, Jiaming Zhou
Author Affiliations +
Proceedings Volume 13222, International Conference on Signal Processing and Communication Security (ICSPCS 2024); 1322206 (2024) https://doi.org/10.1117/12.3038752
Event: Third International Conference on Signal Processing and Communication Security (ICSPCS 2024), 2024, Chengdu, China
Abstract
In advancing Neural Style Transfer (NST) for video applications, this study introduces a novel Transformer-based framework that enhances temporal consistency and computational speed. The approach utilizes a Vision Transformer (ViT) architecture, adapted to exploit the temporal dynamics inherent in video sequences. To achieve seamless style consistency across frames while ensuring real-time performance, we incorporate a hybrid Transformer-Residual learning module coupled with a bi-directional frame recurrence strategy. Our DVT: Dual-frame Video Transfer also integrates advanced model acceleration techniques, including network pruning and knowledge distillation, to foster an efficient processing environment capable of real-time stylization. Through rigorous experimental validation on diverse video datasets, our model outperforms current NST techniques, delivering superior stylization quality, improved temporal stability, and heightened computational efficiency. The findings highlight the pivotal components of our architecture.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Qinghao Zhang, Shangjiaqi Hao, and Jiaming Zhou "DVT: transformer-based style transfer from reality to art", Proc. SPIE 13222, International Conference on Signal Processing and Communication Security (ICSPCS 2024), 1322206 (22 July 2024); https://doi.org/10.1117/12.3038752
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Education and training

Optical flow

RGB color model

Video processing

Visualization

Design

Back to Top