Basics of MPEG

April 04, 2012

Basics of MPEG
Picture sizes: up to 4095 x 4095
Most algorithms are for the CCIR 601 format for video frames
Y-Cb-Cr color space
NTSC: 525 lines per frame at 60 fps, 720 x 480 pixel luminance frame, 360 x 480 pixel chrominance frame
PAL: 625 lines per frame at 50 fps, 720 x 576 pixel luminance frame, 360 x 576 pixel chrominance frame
SIF (source input format) for digital TV
Luminance resolution: 360 x 240 pixels at 30 fps or 360 x 288 pixels at 25 fps
Chrominance resolution: half the luminance resolution in both dimensions

Detour: Motion Vectors with Subpixel Accuracy

Find motion vector (u,v) with integer pixel accuracy
Let the MAE be m0
Compute the MAE at its 4-neighbor pixels (m1 .. m4)
Horizontal pixels
Model with the function p(i)=a|i-b|+c
If 2(m3 – m0) < (m4 – m0), the i coordinate is to the left of the center
If (m3 – m0) > 2(m4 – m0), the i coordinate is to the right of the center
Otherwise it is along the center line
Similarly for the vertical direction

Basics of MPEG

•Types of pictures

–I (intra) frame

•compressed using only intraframe coding

•Moderate compression but faster random access

–P (predicted) frame

•Coded with motion compression using past I frames or P frames

•Can be used as reference pictures for additional motion compensation

–B (bidirectional) frame

•Coded by motion compensation by either past or future I or P frames

–D (DC) frame

•Limited use: encodes only DC components of intraframe coding

MPEG: Video Encoding

•The MPEG standards

–do not define an encoding process

–define syntax of the coded stream

– define a decoding process

MPEG: Video Encoding

Some highlights:
Interframe predictive coding (P-pictures)
For each macroblock the motion estimator produces the best matching macroblock
The two macroblocks are subtracted and the difference is DCT coded
Interframe interpolative coding (B-pictures)
The motion vector estimation is performed twice
The encoder forms a prediction error macroblock from either or from their average
The prediction error is encoded using a block-based DCT
The encoder needs to reorder pictures because B-frames always arrive late

MPEG: Structure of the Coded Bit-Stream

Sequence layer: picture dimensions, pixel aspect ratio, picture rate, minimum buffer size, DCT quantization matrices
GOP layer: will have one I picture, start with I or B picture, end with I or P picture, has closed GOP flag, timing info, user data
Picture layer: temporal ref number, picture type, synchronization info, resolution, range of motion vectors
Slices: position of slice in picture, quantization scale factor
Macroblock: position, H and V motion vectors, which blocks are coded and transmitted

Search This Blog

world at a GlanCE

Basics of MPEG

Comments

Post a Comment

Popular posts from this blog

Handling of Skew

Fragment-and-Replicate Join

USER INTERFACE DESIGN FOR ANNA UNIVERSITY SYLLABUS