Fundamentals 17 min read

H.265 Video Encoding Principles and Technical Framework

The article explains H.265 (HEVC) encoding fundamentals, describing its two‑layer VCL/NAL architecture, quad‑tree CTU hierarchy, intra‑ and inter‑frame prediction, DCT/DST transform coding, quantization, CABAC entropy coding, and loop‑filtering techniques that together achieve roughly 50 % bitrate reduction while preserving quality.

Baidu Geek Talk
Baidu Geek Talk
Baidu Geek Talk
H.265 Video Encoding Principles and Technical Framework

This article provides a comprehensive overview of H.265 video encoding principles and technical framework. H.265 (also known as HEVC) was developed to provide better compression efficiency than its predecessor H.264, achieving approximately 50% bitrate reduction while maintaining the same video quality.

The article begins by explaining the fundamental purpose of video encoding - to compress raw video by removing redundant information from spatial, temporal, coding, and visual perspectives. It then introduces the H.265 encoding framework, which consists of two main layers: the Video Coding Layer (VCL) and the Network Abstraction Layer (NAL). The VCL handles video compression and syntax definition, while the NAL manages data encapsulation and network transmission.

The article details the block-based structure of H.265, explaining how video frames are divided into Coding Tree Units (CTUs), which can be further subdivided into Coding Units (CUs), Prediction Units (PUs), and Transform Units (TUs) using a quad-tree structure. This hierarchical approach allows for more efficient encoding of different video content types.

Prediction techniques are thoroughly covered, including intra-frame prediction (using neighboring pixels within the same frame) and inter-frame prediction (using motion estimation and compensation across frames). The article explains different prediction modes, block sizes, and the use of I-frames, P-frames, and B-frames in video compression.

Transform coding is discussed, focusing on how spatial domain signals are converted to frequency domain using Discrete Cosine Transform (DCT) and Discrete Sine Transform (DST) for specific block sizes. The article explains how this transformation helps concentrate residual data and reduce redundancy.

Quantization is covered as a lossy compression technique that reduces data by dividing transform coefficients by quantization step sizes. The article explains scalar quantization and the relationship between quantization parameters (QP) and step sizes.

Entropy coding is explained through Huffman coding and Arithmetic coding (specifically CABAC used in H.265). The article provides detailed examples of how these variable-length coding schemes assign shorter codes to more frequent symbols.

Finally, the article discusses loop filtering techniques including Deblocking Filter (DBF) and Sample Adaptive Offset (SAO) that help reduce artifacts like blocking effects and ringing effects in reconstructed video.

The comprehensive technical explanation provides a solid foundation for understanding H.265 video encoding principles and their practical applications in modern video compression.

video encodingH.265video compressionHEVCentropy codingloop filteringmotion estimationquad-tree structuretransform coding
Baidu Geek Talk
Written by

Baidu Geek Talk

Follow us to discover more Baidu tech insights.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.