The STFT can be viewed as a function of either frame-time or bin-frequency . We will develop both points of view in this book.

At each frame time
, the STFT can be regarded as producing a
*Fourier transform* centered around that time. As
advances, a sequence of spectral transforms is obtained. This is
depicted graphically in Fig.9.1, and it forms the basis of the
*overlap-add method* for Fourier analysis, modification, and
resynthesis [9]. It is also the basis for
*transform coders* [16,284].

In an exact Fourier duality, each bin
of the STFT can
be regarded as a sample of the complex signal at the output of a
lowpass filter whose input is
. As discussed
in §9.1.2, this signal is obtained from
by
*frequency-shifting* it so that frequency
is translated
down to **0**
Hz. For each value of
, the time-domain signal
, for
, is the output of
the
th ``filter bank channel,'' for
. In this
``filter bank'' interpretation, the hop size
can be interpreted as
the *downsampling factor* applied to each bin-filter output, and
the analysis window
is seen as the *impulse
response of the anti-aliasing filter* used prior to downsampling. The
window transform
is also the frequency response of each
channel filter (translated to dc). This point of view is depicted
graphically in Fig.9.2 and elaborated further in Chapter 9.

[How to cite this work] [Order a printed hardcopy] [Comment on this page via email]

Copyright ©

Center for Computer Research in Music and Acoustics (CCRMA), Stanford University