Next  |  Prev  |  Up  |  Top  |  Index  |  JOS Index  |  JOS Pubs  |  JOS Home  |  Search


Convolving with Long Signals

We saw that we can perform efficient convolution of two finite-length sequences using the Fast Fourier Transform. There are some situations, however, in which it is impractical to use a single FFT for each convolution operand:

Direct convolution does not have these problems. For example, given a causal finite-impulse resonse (FIR) $ h$ of length $ L$ , we need only store the past $ L-1$ samples of the input signal $ x$ to calculate the next output sample, since

\begin{eqnarray*}
y(n) &=& (h\ast x)(n) = \sum_{m=0}^n h(m)x(n-m)\\
&=& h(0)x(n) + h(1)x(n-1)
+\cdots+ h(L-1) x(n-L+1)
\end{eqnarray*}

Thus, at every time $ n$ , the output $ y(n)$ can be computed as a linear combination of the current input sample $ x(n)$ and the current filter state $ \{x(n-1),\ldots,x(n-L+1)\}$ .

To obtain the benefit of high-speed FFT convolution when the input signal is very long, we simply chop up the input signal $ x$ into blocks, and perform convolution on each block separately. The output is then the sum of the separately filtered blocks. The blocks overlap because of the ``ringing'' of the filter. For a zero-phase filter, each block overlaps with both of its neighboring blocks. For causal filters, each block overlaps only with its neighbor to the right (the next block in time). The fact that signal blocks overlap and must be added together (instead of simply abutted) is the source of the name overlap-add method for FFT convolution of long sequences [8,10].

The idea of processing input blocks separately can be extended also to both operands of a convolution (both $ x$ and $ h$ in $ x\ast h$ ). The details are a straightforward extension of the single-block-signal case discussed below.

When simple FFT convolution is being performed between a signal $ x$ and FIR filter $ h$ , there is no reason to use a window function on each input block. In these cases, we may consider that the input signal is processed under a rectangular window. If the window length is $ M$ , it may advance $ M$ samples for each successive frame (hop size = $ M$ samples). In this case, the input blocks do not overlap. On the other hand, when spectral modifications are to be performed, or if the filter is changing each frame ( $ h\leftarrow h_m$ where $ m$ is the frame number), then there are good reasons to use a non-rectangular window function and a smaller hop size, as we will develop below.



Subsections
Next  |  Prev  |  Up  |  Top  |  Index  |  JOS Index  |  JOS Pubs  |  JOS Home  |  Search

[How to cite this work] [Order a printed hardcopy]

``Spectral Audio Signal Processing'', by Julius O. Smith III, (August 2008 Draft).
Copyright © 2008-08-13 by Julius O. Smith III
Center for Computer Research in Music and Acoustics (CCRMA),   Stanford University
CCRMA  [Automatic-links disclaimer]