Zoom out Search Issue

ManualsBrandsContents Manualsaudio & home theatreZoom in

111

112

113

114

115

116

117

118

119

120

IEEE SIGNAL PROCESSING MAGAZINE [109] MARCH 2015

as critical as the use of nonindividualized HRTFs [13]. Thus,

equalization using individual HPTFs is strongly recommended.

Another difficulty in carrying out accurate headphone equaliza-

tion is the variability of the HPTFs with repositioning. The effect of

repositioning of headphones is lower at low frequencies but dis-

plays high standard deviations up to 10 dB at high frequencies

[37]. Kulkarni et al. [37] observed that equalization based on a sin-

gle measurement may become worse than no equalization at all.

The positional dependency has no specific solution and its effect

can only be reduced by taking the average of a number of trials as

a representative HPTF [37]. Thus, to create a convincing immer-

sive sound environment, use of individualized HRTFs and individ-

ualized equalization is entailed, which may not be viable all the

time. To reduce the dependency on individualized equalization,

Sunder et al. [33] designed a Type-2 equalization technique for the

playback through frontal projection headphone, which is indepen-

dent of the headphone-ear coupling. Unlike the conventional

equalization technique, Type-2 equalization compensates only for

the distortion due to the emitter, thereby preserving the individual

pinna cues due to frontal projection.

The other type of equalization is the “decoupled” equalization

technique, and it is the most commonly used method of equaliza-

tion for rendering music. In this technique, the binaural record-

ing [(BIR) or HRTFs] as well as the headphone are equalized

using a reference sound field (REF) (e.g., FF, DF, etc.) [36]. If the

REF of the recording environment is well known and reproduced

reliably, this method of equalization can result in a very natural

perception of sound similar to the nondecoupled equalization

technique. This method of equalization is mainly carried out to

make the binaural recordings compatible with stereophonic (con-

ventional microphone) recordings in terms of timbral quality.

If the recording is binaural, then a reference field equalized

binaural recording (BIR/REF) achieves a sound quality equivalent

to a conventional microphone recording. When the equalized

recording is played from a reference field equalized headphone

(HPTF/REF), the perceived timbre of the spatial sound would be as

natural as the original binaural recording. Individualized binaural

recordings are thus necessary to experience the true immersive-

ness of sound without any timbral coloration and spatial degrada-

tion. Note that for rendering conventional stereo recorded music,

it is sufficient to carry out just the headphone equalization using

an appropriate reference field. Some of the commonly used refer-

ence fields are:

■ Free-field (FF) equalization: With the aim to replicate the ear

signals produced by frontal loudspeakers, the target response of

FF equalization is the HRTF of frontal incidence. Hammershoi

et al. proposed an FF equalization curve, which has additional

high frequency energy above 3 kHz to approximate listening to

stereo loudspeakers in the FF [4]. A FF equalized headphone

can reproduce a frontal sound with natural sound quality but

colors the sound that originates from other directions. More-

over, it is important to note that there are large interindividual

variations in the FF equalization filters [38].

■ Diffuse-field (DF) equalization: In this case, the target

response for equalization is the DF response, i.e., the average

of the HRTFs of all measured directions in horizontal plane.

The interindividual variations are reduced drastically due to

the averaging effect [38]. Thus, the DF target response can be

achieved universally over a great number of individuals.

Møller [35] identified certain headphones which are already

DF equalized and recommended such type of headphones for

stereo listening.

■ Other target responses: A typical listening room is not

completely diffuse but it can be considered somewhere

between a FF and a DF. Møller [38] illustrated other alterna-

tive target responses which are partially diffuse by applying

unequal weighting to different directions within ± 45° azi-

muth and elevation. Other researchers also modified the DF

equalization filters with the help of certain parametric fil-

ters and found that the subjects generally preferred the tar-

get response with a 3 kHz peak lower in amplitude than in

the DF response for both music and speech [4]. Recent

[TABLE 4] EQUALIZATION TECHNIQUES FOR DIFFERENT PLAYBACK MODES (BINAURAL, STEREOPHONY).

MODE OF

EQUALIZATION AIM

TYPES OF

EQUALIZATION AND

TARGET RESPONSE CHARACTERISTICS

NONDECOUPLED

(BINAURAL)

SPECTRUM AT

EARDRUM IS THE

INDIVIDUAL HRTF

FEATURES

CONVENTIONAL

EQUALIZATION (FLAT

TARGET RESPONSE)

■ FOR CONVENTIONAL HEADPHONES. THE SPECTRUM AT

THE EARDRUM HAS INDIVIDUAL FEATURES (IF INDIVIDUALIZED HRTF IS USED)

■ DEPENDENT ON THE INDIVIDUAL’S UNIQUE PINNA FEATURES

TYPE-2 EQUALIZATION

[33]

■ FOR FRONTAL PROJECTION HEADPHONES. THE SPECTRUM AT EARDRUM

AUTOMATICALLY MODELS THE INDIVIDUAL PINNA SPECTRAL CUES

■ REMOVES ONLY THE DISTORTION DUE TO THE HEADPHONE EMITTER

■ INDEPENDENT OF THE IDIOSYNCRATIC FEATURES OF THE EAR

DECOUPLED

(BINAURAL,

STEREOPHONY)

EMULATE

THE MOST NATURAL

REPRODUCTION

CLOSER TO THE

PERCEPTION IN A

REFERENCE FIELD

FF EQUALIZATION [38] ■ TARGET RESPONSE IS THE FF RESPONSE CORRESPONDING TO

THE FRONTAL INCIDENCE

DF EQUALIZATION [38] ■ TARGET RESPONSE IS THE DF RESPONSE

■ LESSER INTERINDIVIDUAL VARIABILITY

DF TARGET RESPONSE

BASED ON MØLLER [38]

■ TARGET RESPONSE BASED ON AVERAGE OF HRTFS BETWEEN ± 45 DEGREES

AZIMUTH AND ELEVATION WITH UNEQUAL WEIGHTING

DF TARGET RESPONSE

BASED ON LORHO [4]

■ REDUCED A 3-KHZ PEAK FROM ABOUT 12 DB TO 3 DB OF DF RESPONSE

RR_G AND RR1_G [4] ■ RR_G: BASED ON THE IMPULSE RESPONSE OF HARMAN

REFERENCE LISTENING ROOM

■ RR1_G HAS LESSER BASS AND TREBLE

THE WORLD’S NEWSSTAND

THE WORLD’S NEWSSTAND