Content-based coding tree unit level rate-quantization model for intra-coding in high efficiency video coding standard using convolutional neural network

Yaser Rahimi; Mehdi Rezaei; Pouria Jafari

doi:10.1117/1.JEI.31.3.033026

Journal of Electronic Imaging, Vol. 31, Issue 3, 033026 (June 2022). https://doi.org/10.1117/1.JEI.31.3.033026

Abstract

In almost all video applications, the encoder uses a video rate control algorithm (RCA). The RCA tunes the quantization parameter (QP) to match the encoded bit rate to the available capacity of the communication channel or storage medium. Conventional RCAs usually rely on a rate-quantization (R-Q) or a rate-distortion (R-D) model for rate control. A content-based R-Q model for intra-coded coding tree units (CTUs) of the high-efficiency video coding standard is proposed. The model is a convolutional neural network that takes the pixels of a CTU and its intraprediction reference pixels as input and simultaneously estimates the bit counts required to intracode the CTU at every QP value. The proposed model can be used by any video RCA: the RCA simply selects the QP whose estimated bit count is closest to the allocated bit budget. The evaluation results show that the model is highly accurate. According to simulation results, the mean absolute normalized bit error is 19.66% at the CTU level and decreases to 6.85% at the frame level. Compared with similar networks, the proposed structure has very low computational complexity.
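The QP selection step described in the abstract can be illustrated with a minimal sketch. It assumes the model returns one estimated bit count per QP in the HEVC range 0 to 51, and it assumes the normalized bit error is the absolute difference divided by the actual bit count; the abstract does not give the exact definition. The function names and the synthetic bit curve are hypothetical and are not taken from the paper.

```python
def select_qp(estimated_bits, bit_budget):
    """Return the QP whose estimated bit count is closest to the budget.

    estimated_bits[qp] is the (hypothetical) model output for that QP.
    """
    return min(range(len(estimated_bits)),
               key=lambda qp: abs(estimated_bits[qp] - bit_budget))


def mean_abs_normalized_error(actual_bits, estimated_bits):
    """Mean of |estimate - actual| / actual (assumed definition)."""
    errors = [abs(e - a) / a for a, e in zip(actual_bits, estimated_bits)]
    return sum(errors) / len(errors)


# Synthetic stand-in for the network output: bits halve every 6 QP steps.
# This curve is an illustrative assumption, not data from the paper.
estimated = [6000.0 * 2.0 ** (-qp / 6.0) for qp in range(52)]

chosen_qp = select_qp(estimated, bit_budget=1500.0)
print(chosen_qp)
```

Because the estimates for all QP values are available at once, the selection is a single pass over 52 numbers and needs no iterative re-encoding.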

Citation

Yaser Rahimi, Mehdi Rezaei, and Pouria Jafari, "Content-based coding tree unit level rate-quantization model for intra-coding in high efficiency video coding standard using convolutional neural network," Journal of Electronic Imaging 31(3), 033026 (1 June 2022). https://doi.org/10.1117/1.JEI.31.3.033026

Received: 2 February 2022; Accepted: 6 May 2022; Published: 1 June 2022

JOURNAL ARTICLE
19 PAGES

KEYWORDS

Video

Video coding

Copper

Computer programming

Neurons

Video compression

Convolutional neural networks
