Specifications

A Digital Video Primer

Most codecs compress video using intraframe compression. With intraframe, or spatial, com-

pression, each frame of video is compressed separately. Many video compression schemes start

by discarding color detail in the picture. As long as this type of compression is not too severe, it

is generally acceptable.

A number of codecs also use interframe, or temporal, compression. is type of compression

takes advantage of the fact that any given frame of video is oen very similar to the frames before

and aer it. Instead of storing all complete frames, interframe compression saves just the image

data that is dierent, by generating three types of frames:

• I frames (which serve as the keyframes) contain a full representation of a frame of video and

use intraframe compression. I frames preserve more information than P or B frames and are,

therefore, the largest, in terms of the amount of data needed to describe them.

• P frames are predicted frames, computed from previous frames, and each may require less than

a tenth of the data needed for an I frame.

• B frames, or bidirectional frames, are interpolated from previous frames and those that follow.

B frames can be even smaller than P frames.

A typical sequence might look something like this:

I B B P B B P B B P B B P P B

How each frame is compressed depends on the type of content. If the content is fairly static (for

example, a talking head shot against a plain, still background with not much changing from

frame to frame), then few I frames will be needed, and the video can be compressed into a rela-

tively small amount of data. But if the content is action-oriented (for example, a soccer game, in

which either the action or the background moves or changes rapidly or dramatically from frame

to frame), then more I frames are required, a greater amount of data is needed to maintain good

quality, and the video cannot be compressed as much.

B and P frames contain only those portions of the adjacent I frames that change, therefore reducing the

amount of data required for a video. If more than half of a frame changes, an I frame is automatically

generated.

DV25 compression

DV25 is the compression format used for the standard DV format employed by most consumer

and prosumer camcorders. DV25 is compressed at a xed rate of 5:1 and delivers video data at 25

megabits per second (Mbps). Audio and control information is also included in the data stream,

so the total data rate in bytes is about 3.6 million bytes (megabytes or MB) per second. is

means that one hour of DV25-compressed footage will require about 13 billion bytes (gigabytes

or GB) of storage. It is impressive to realize that each 60-minute mini-DV cassette is actually

13GB of oine storage. DV25 compression uses 4:1:1 color sampling. e audio is uncompressed,

and there are two stereo audio pairs. e audio can be digitized at either 12 bits with a sampling

rate of 32kHz or 16 bits with a sampling rate of 48kHz. You should generally use the highest

quality setting (16 bit, 48kHz).