Equivalent Rectangular Bandwidth

It also turns out that a first-order conformal map (bilinear
transform) can provide a good match to the ERB scale
[269] as well. Moore and Glasberg
[177] have revised Zwicker's loudness model to
better explain (1) how equal-loudness contours change as a function of
level, (2) why loudness remains constant as the bandwidth of a
fixed-intensity sound increases up to the critical bandwidth, and (3)
the loudness of partially masked sounds. The modification that is
relevant here is the replacement of the Bark scale by the
*equivalent rectangular bandwidth* (ERB) scale. The ERB of the
auditory filter is assumed to be closely related to the critical
bandwidth, but it is measured using the *notched-noise* method
[205,206,251,181,87]
rather than on classical masking experiments involving a narrow-band
masker and probe tone
[306,307,304]. As a result,
the ERB is said not to be affected by the detection of beats or
intermodulation products between the signal and masker. Since this
scale is defined analytically, it is also more smoothly behaved than
the Bark scale data.

At moderate sound levels, the ERB in Hz is defined by [177]

where is center-frequency in Hz, normally in the range 100 Hz to 10 kHz. The ERB is generally narrower than the classical critical bandwidth (CB), being about % of center frequency at high frequencies, and leveling off to about Hz at low frequencies. The classical CB, on the other hand, is approximately % of center frequency, leveling off to Hz below Hz. An overlay of ERB and CB bandwidths is shown in Fig.E.10. Also shown is the approximate classical CB bandwidth, as well as a more accurate analytical expression for Bark bandwidth vs. Hz [3]. Finally, note that the frequency interval [ Hz, kHz] corresponds to good agreement between the psychophysical ERB and the directly physical audio filter bandwidths defined in terms of

The *ERB scale* is defined as the number of ERBs below
each frequency

for in Hz [177]. An overlay of the normalized Bark and ERB frequency warpings is shown in Fig.E.11. The ERB warping is determined by scaling the inverse of (E.5), evaluated along a uniform frequency grid from zero to the number of ERBs at half the sampling rate, so that dc maps to zero and half the sampling rate maps to .

Proceeding in the same manner as for the Bark-scale case, allpass coefficients giving a best approximation to the ERB-scale warping were computed for sampling rates near twice the Bark band edge frequencies (chosen to facilitate comparison between the ERB and Bark cases). The resulting optimal map coefficients are shown in Fig.E.12. The allpass parameter increases with increasing sampling rate, as in the Bark-scale case, but it covers a significantly narrower range, as a comparison with Fig.E.3 shows. Also, the Chebyshev solution is now systematically larger than the least-squares solutions, and the least-squares and weighted equation-error cases are no longer essentially identical. The fact that the arctangent formula is optimized for the Chebyshev case is much more evident in the error plot of Fig.E.12b than it was in Fig.E.3b for the Bark warping parameter.

The peak and rms mapping errors are plotted versus sampling rate in Fig.E.13. Compare these results for the ERB scale with those for the Bark scale in Fig.E.4. The ERB map errors are plotted in Barks to facilitate comparison. The rms error of the conformal map fit to the ERB scale increases nearly linearly with log-sampling-rate. The ERB-scale error increases very smoothly with frequency while the Bark-scale error is non-monotonic (see Fig.E.4). The smoother behavior of the ERB errors appears due in part to the fact that the ERB scale is defined analytically while the Bark scale is defined more directly in terms of experimental data: The Bark-scale fit is so good as to be within experimental deviation, while the ERB-scale fit has a much larger systematic error component. The peak error in Fig.E.13 also grows close to linearly on a log-frequency scale and is similarly two to three times the Bark-scale errors of Fig.E.4.

The frequency mapping errors are plotted versus frequency in Fig.E.14 for a sampling rate of kHz. Unlike the Bark-scale case in Fig.E.5, there is now a visible difference between the weighted equation-error and optimal least-squares mappings for the ERB scale. The figure shows also that the peak error when warping to an ERB scale is about three times larger than the peak error when warping to the Bark scale, growing from 0.64 Barks to 1.9 Barks. The locations of the peak errors are also at lower frequencies (moving from 1.3 and 8.8 kHz in the Bark-scale case to 0.7 and 8.2 kHz in the ERB-scale case).

[How to cite this work] [Order a printed hardcopy] [Comment on this page via email]

Copyright ©

Center for Computer Research in Music and Acoustics (CCRMA), Stanford University