Next  |  Prev  |  Up  |  Top  |  Index  |  JOS Index  |  JOS Pubs  |  JOS Home  |  Search

``Allpass'' Frequency Warping

In 1971, Oppenheim, Johnson, and Steiglitz proposed forming an FFT filter bank with non-uniformly spaced bins by taking the FFT of the outputs of a cascade chain of first-order allpass filters [174]. The allpass filters serve to implement ``frequency-dependent unit delays,'' thus effectively altering the signal sampling-rate as a function of frequency.

In 1980, ``warped linear prediction'' was proposed by Strube [243] for obtaining better formant models of speech: The frequency axis ``seen'' by LPC is made to approximate a Bark scaleF.2using the first-order allpass transformation. It was noted in [243] that setting the allpass coefficient to $ 0.47$ gave a ``very good approximation to the subjective Bark scale based on the critical bands of the ear'' at a 10 kHz sampling rate. It was concluded that low-order LPC was helped significantly by the frequency warping, because the first and second formants of speech become well separated on a Bark scale and therefore better resolved by a low-order predictor. However, higher order LPC fits could actually be made worse, e.g., due to splitting of the first formant as a result of four poles being used in the LPC fit instead of two.

In 1983, the Bark bilinear transformation was also developed independently for audio digital filter design [225]. In that work, the frequency response fit was carried out over an approximate Bark scale provided by the allpass transformation. The allpass coefficient $ \rho $ was optimized as a function of sampling rate using the method of bisection under a least-squares norm on the error between the allpass and Bark frequency warpings. The root mean square errors were found to range from $ 0.0034f_s$ at $ f_s=6$  kHz to $ 0.0068f_s$ at $ f_s=27$  kHz, where $ f_s$ denotes the sampling rate. The frequency warp dictated by the optimal allpass transformation $ {\cal A}_{\rho }$ determined an interpolated resampling of the desired filter frequency response $ H(e^{j\omega})$ which converted its support to an approximate Bark scale $ H(e^{j\omega })=H[{\cal A}_{\rho }(e^{ja(\omega )})]$ . Any filter design method could then be carried out to give an optimal match $ H^*[e^{ja(\omega )}]$ over the warped, sampled frequency response. Many filter design methods were compared and evaluated with respect to their audio quality. Finally, the optimal warped filter $ H^*(\zeta )$ was unwarped by applying the inverse allpass transformation $ {\cal A}_{-\rho }$ to the warped filter transfer function using polynomial manipulations to obtain $ H^*[{\cal A}_{-\rho }(z)]$ .

The first-order allpass transformation has been used traditionally in digital filter design to scale the cut-off frequency of digital lowpass and highpass filters, preserving optimality in the Chebyshev sense [179,38]. Higher order allpass transformations have been used to convert lowpass or highpass prototype filters into multiple bandpass/bandstop filters [164]. Allpass transformations of order greater than one appear not to have been used in frequency warping applications, since allpass transformations of order $ N$ map the unit circle to $ N$ traversals of the unit circle, and a one-to-one mapping of the unit circle to itself is desired.F.3

In 1994 [115], an allpass coefficient of $ 0.62$ was used to generate a frequency warping closely approximating the Bark scale for a sampling rate of 22 kHz. Experiments comparing the performance of warped LPC and ``normal'' LPC for speech coding and speech recognition applications showed that warped LPC required less than half the predictor model order for comparable performance.

More recently, the first-order allpass transformation was used to implement audio-warped filters directly in the warped domain [101,102]. In this application, a digital filter is designed over the warped frequency axis, and in its implementation, each delay element is replaced by a first-order allpass filter which implements the unwarping on the fly. Advantages of this scheme include (a) reducing the necessary filter order by a factor of 5 to 10 (more than compensating for the increased cost of implementing a delay element as a first-order allpass filter), (b) avoiding numerical failures which can occur (even in double-precision floating point) when attempting to unwarp very high-order filters (e.g., much larger than 30), and (c) providing a dynamic warping modulation control which tends to act as a frequency-scaling parameter associated with ``acoustic size'' and is therefore musically useful.

The critical feature of the first-order conformal map in the $ z$ plane is that it does not increase filter order; it is the most general order-preserving frequency-warping transformation for rational digital filters. In view of this constraint, it is remarkable indeed that a ``free'' filter transformation such as this can so closely match the Bark frequency scale.


Next  |  Prev  |  Up  |  Top  |  Index  |  JOS Index  |  JOS Pubs  |  JOS Home  |  Search

[How to cite this work] [Order a printed hardcopy]

``Spectral Audio Signal Processing'', by Julius O. Smith III, (August 2008 Draft).
Copyright © 2008-08-13 by Julius O. Smith III
Center for Computer Research in Music and Acoustics (CCRMA),   Stanford University
CCRMA  [Automatic-links disclaimer]