Moore and Glasberg [19] have revised Zwicker's
loudness model to better explain (1) how equal-loudness contours
change as a function of level, (2) why loudness remains constant as
the bandwidth of a fixed-intensity sound increases up to the critical
bandwidth, and (3) the loudness of partially masked sounds. The
modification that is relevant here is the replacement of the Bark
scale by the *equivalent rectangular bandwidth* (ERB) scale. The
ERB of the auditory filter is assumed to be closely related to the
critical bandwidth, but it is measured using the *notched-noise*
method
[27,28,31,22,5]
rather than on classical masking experiments involving a narrowband
masker and probe tone
[41,42,39]. As a
result, the ERB is said not to be affected by the detection of beats
or intermodulation products between the signal and masker. Since this
scale is defined analytically, it is also more smoothly behaved than
the Bark scale data.

At moderate sound levels, the ERB in Hz is defined by
[19]

The *ERB scale* is defined as the number of ERBs below
each frequency [19]:

Proceeding in the same manner as for the Bark-scale case, allpass coefficients giving a best approximation to the ERB-scale warping were computed for sampling rates near twice the Bark band edge frequencies (chosen to facilitate comparison between the ERB and Bark cases). The resulting optimal map coefficients are shown in Fig.13. The allpass parameter increases with increasing sampling rate, as in the Bark-scale case, but it covers a significantly narrower range, as a comparison with Fig.4 shows. Also, the Chebyshev solution is now systematically larger than the least-squares solutions, and the least-squares and weighted equation-error cases are no longer essentially identical. The fact that the arctangent formula is optimized for the Chebyshev case is much more evident in the error plot of Fig.13b than it was in Fig.4b for the Bark warping parameter.

The peak and rms mapping errors are plotted versus sampling rate in Fig.14. Compare these results for the ERB scale with those for the Bark scale in Fig.5. The ERB map errors are plotted in Barks to facilitate comparison. The rms error of the conformal map fit to the ERB scale increases nearly linearly with log-sampling-rate. The ERB-scale error increases very smoothly with frequency while the Bark-scale error is non-monotonic (see Fig.5). The smoother behavior of the ERB errors appears due in part to the fact that the ERB scale is defined analytically while the Bark scale is defined more directly in terms of experimental data: The Bark-scale fit is so good as to be within experimental deviation, while the ERB-scale fit has a much larger systematic error component.

The peak error in Fig.14 also grows close to linearly on a log-frequency scale and is similarly two to three times the Bark-scale errors of Fig.5.

The frequency mapping errors are plotted versus frequency in Fig.15 for a sampling rate of 31 kHz. Unlike the Bark-scale case in Fig.6, there is now a visible difference between the weighted equation-error and optimal least-squares mappings for the ERB scale. The figure shows also that the peak error when warping to an ERB scale is about three times larger than the peak error when warping to the Bark scale, growing from 0.64 Barks to 1.9 Barks. The locations of the peak errors are also at lower frequencies (moving from 1.3 and 8.8 kHz in the Bark-scale case to 0.7 and 8.2 kHz in the ERB-scale case).

Download bbt.pdf

[Comment on this page via email]

Copyright ©

Center for Computer Research in Music and Acoustics (CCRMA), Stanford University