1 June 2022 Content-based coding tree unit level rate-quantization model for intra-coding in high efficiency video coding standard using convolutional neural network
Yaser Rahimi, Mehdi Rezaei, Pouria Jafari
Author Affiliations +
Abstract

In almost all video applications, a video rate control algorithm (RCA) is used by the encoder. The RCA tunes the quantization parameter (QP) to match the encoded bit rate to the available capacity of the communication channel or storage media. Conventional RCAs usually utilize a rate-quantization (R-Q) or a rate-distortion (R-D) model for rate control. A content-based R-Q model for intra coding tree units (CTUs) of the high-efficiency video coding standard is proposed. The model is a convolutional neural network that observes pixels of a CTU and its intraprediction reference pixels and it estimates required bit counts for intracoding the CTU for all QP values simultaneously. The proposed model can be easily used by any video RCA. A given RCA just selects a proper QP for which the estimated bit counts are closer to the allocated bit budget. The evaluation results show a high accuracy for the model. According to simulation results, the mean absolute normalized bit error at CTU level is 19.66% and it decreases to 6.85% at the frame level. Compared with similar networks, the proposed structure has a very low computational complexity.

© 2022 SPIE and IS&T 1017-9909/2022/$28.00 © 2022 SPIE and IS&T
Yaser Rahimi, Mehdi Rezaei, and Pouria Jafari "Content-based coding tree unit level rate-quantization model for intra-coding in high efficiency video coding standard using convolutional neural network," Journal of Electronic Imaging 31(3), 033026 (1 June 2022). https://doi.org/10.1117/1.JEI.31.3.033026
Received: 2 February 2022; Accepted: 6 May 2022; Published: 1 June 2022
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Video coding

Copper

Computer programming

Neurons

Video compression

Convolutional neural networks

RELATED CONTENT

Pseudomodeling of MPEG-based variable-bit-rate video
Proceedings of SPIE (December 27 1999)
Traffic models and admission control for VBR video
Proceedings of SPIE (November 04 1996)

Back to Top