The spectral envelope obtained by cepstral windowing is defined as
(11.2) |
(11.3) |
The log-magnitude spectrum of is thus lowpass filtered (the real cepstrum of is ``liftered'') to obtain a smooth spectral envelope. For periodic signals, should be set below the period in samples.
Cepstral coefficients are typically used in speech recognition to characterize spectral envelopes, capturing primarily the formants (spectral resonances) of speech [227]. In audio applications, a warped frequency axis, such as the ERB scale (Appendix E), Bark scale, or mel frequency scale is typically preferred. Mel Frequency Cepstral Coefficients (MFCC) appear to remain quite standard in speech-recognition front ends, and they are often used to characterize steady-state spectral timbre in Music Information Retrieval (MIR) applications.