Computing Vocoder Parameters

Computing Vocoder Parameters

To compute the amplitude at the output of the th subband, we can apply an envelope follower. Classically, such as in the original vocoder, this can be done by full-wave rectification and subsequent low pass filtering, as shown in Fig.G.10. This produces an approximation of the average power in each subband.

$\begin{psfrags} % latex2html id marker 43360\psfrag{x} []{ \LARGE$ x_k(t)$\ }\psfrag{xkt} []{ \LARGE$ \tilde{x}_k(t)$\ }\psfrag{xk} []{ \LARGE$ x_k(t)$\ }\psfrag{output} []{ \LARGE$ y_k=h*\tilde{x}_k $\ }\begin{figure}[htbp] \includegraphics[width=\textwidth ]{eps/envelope} \caption{Classic method for amplitude envelope extraction in continuous-time analog circuits.} \end{figure} \end{psfrags}$

In digital signal processing, we can do much better than the classical amplitude-envelope follower: We can measure instead the instantaneous amplitude of the (assumed quasi sinusoidal) signal in each filter band using so-called analytic signal processing (introduced in §4.6). For this, we generalize (G.3) to the real-part of the corresponding analytic signal:

$\displaystyle x_k(t)\eqsp a_k(t)\cos[ \omega_kt + \phi_k(t) ] \eqsp$ $\displaystyle \mbox{re\ensuremath{\left\{a_k(t)e^{j\phi_k(t)} e^{j\omega_k t}\right\}}}$ $\displaystyle \isdefs$ $\displaystyle \mbox{re\ensuremath{\left\{x^a_k(t)\right\}}}$ $\displaystyle \protect$

(G.5)

In general, when both amplitude and phase are needed, we must compute two real signals for each vocoder channel:

$\displaystyle a_k(t)$	$\displaystyle =$	$\displaystyle \vert x_k^a(t) \vert$ (instantaneous amplitude)
$\displaystyle \phi_k(t)$	$\displaystyle =$	$\displaystyle \angle x_k^a(t) - \omega_kt$ (instantaneous phase)
	$\displaystyle =$	$\displaystyle \tan^{-1} \left[ \frac{\mbox{im\ensuremath{\left\{x_k^a(t)\right\}}}} {\mbox{re\ensuremath{\left\{x_k^a(t)\right\}}}} \right] - \omega_kt \protect$	(G.6)

We call the instantaneous amplitude at time for both and . The function $a_k(\cdot)$ as a whole is called the amplitude envelope of the th channel output. The instantaneous phase at time is $\phi_k(t)$ , and its time-derivative is instantaneous frequency.

In order to determine these signals, we need to compute the analytic signal from its real part . Ideally, the imaginary part of the analytic signal is obtained from its real part using the Hilbert transform (§4.6), as shown in Fig.G.11.

$\begin{psfrags} % latex2html id marker 43396\psfrag{x} []{ \LARGE$ x_k(t)$\ }\psfrag{Rex} []{ \LARGE$ \mbox{re\ensuremath{\left\{x_k^a(t)\right\}}}$\ }\psfrag{Imx} []{ \LARGE$ \mbox{im\ensuremath{\left\{x_k^a(t)\right\}}}$\ }\begin{figure}[htbp] \includegraphics[width=0.8\twidth]{eps/hilbert} \caption{Creating an analytic signal from its real part using the Hilbert transform (\textit{cf.}{} \sref {hilbert}).} \end{figure} \end{psfrags}$

Using the Hilbert-transform filter, we obtain the analytic signal in ``rectangular'' (Cartesian) form:

$\displaystyle x_k^a(t) =$ $\displaystyle \mbox{re\ensuremath{\left\{x_k^a(t)\right\}}}$ $\displaystyle + j\,$ $\displaystyle \mbox{im\ensuremath{\left\{x_k^a(t)\right\}}}$

(G.7)

To obtain the instantaneous amplitude and phase, we simply convert each complex value of to polar form

$\displaystyle x_k^a(t) \eqsp a_k(t)\,e^{j[ \omega_kt +\phi_k(t)] }$

(G.8)

as given by (G.6).

Subsections

[How to cite this work] [Order a printed hardcopy] [Comment on this page via email]

``Spectral Audio Signal Processing'', by Julius O. Smith III, W3K Publishing, 2011, ISBN 978-0-9745607-3-1.
Copyright © 2022-02-28 by Julius O. Smith III
Center for Computer Research in Music and Acoustics (CCRMA), Stanford University

Computing Vocoder Parameters

``Spectral Audio Signal Processing'', by Julius O. Smith III, W3K Publishing, 2011, ISBN 978-0-9745607-3-1. Copyright © 2022-02-28 by Julius O. Smith III Center for Computer Research in Music and Acoustics (CCRMA), Stanford University

``Spectral Audio Signal Processing'', by Julius O. Smith III, W3K Publishing, 2011, ISBN 978-0-9745607-3-1.
Copyright © 2022-02-28 by Julius O. Smith III
Center for Computer Research in Music and Acoustics (CCRMA), Stanford University