Next  |  Prev  |  Top  |  JOS Index  |  JOS Pubs  |  JOS Home  |  Search


Results and Conclusions

The coder has been tested and developed mainly with about 10 second sampels from music CD's, most of which are in the table below. After finishing the development, the MPEG-2 testfile music.wav, containing many hard instruments, was tested. Some of the instruments were treated well, and some, like for example the castanets sounded rather bad. The castanetes of course require some kind of pre-echo detection to sound good. Some examples take from that music.wav are shown below.

All of the following samples can be found in wav format on the web, at http://ccrma.stanford.edu/~bosse/. Note that no rate controlling module is developed, and thus bitrates vary a lot with the type of signal. From the values below, one can deduce that a fair perceptual lossless coding can be achieved at about 128 kbit/second for stereo data and 75 kbit/second for mono.



Audio Stream Bits/s M/S Apparent artifacts
Mono audio      
mixed.wav 67 kb/s M  
jacob.wav 71 kb/s M  
cardigans.wav 73 kb/s M  
strings.wav 58 kb/s M  
Stereo audio      
music.wav 106 kb/s S Especially triangle and castanets
tpd.wav 118 kb/s S  
jacob.wav 124 kb/s S  
castanets.wav 103 kb/s S Very audible preecho
instruments.wav 108 kb/s S  
oasis.wav 118 kb/s S The ``s'' in ``sunday'
Low bitrate      
oasis.wav 89 kb/s S Easy to detect
jacob.wav 83 kb/s S Easy to detect


M/S means mono/stereo. The low bitrate signals are coded with a masking threshold multiplied by a factor $\frac{1}{0.5}$ and $\frac{1}{0.3}$ respectively.

The encoder described in this report is apparently rather undeveloped. To improve the coder, I would like to add some kind of transient coding, for example using wavelets. Transform-wavelet hybrid coders (see e.g [11]) has become more popular and show good results. Also, an adaptive prediction in either the time- or transform domain would decrease bitrate in stationary signals (although some of this is exploited by the KLT).

This project did not result in a coder with many new features, but in some experience and knowledege for me in the field of high fidelity perceptual audio coders.


Next  |  Prev  |  Top  |  JOS Index  |  JOS Pubs  |  JOS Home  |  Search

Download bosse.pdf

``An Experimental High Fidelity Perceptual Audio Coder'', by Bosse Lincoln<bosse@ccrma.stanford.edu>, (Final Project, Music 420, Winter '97-'98).
Copyright © 2006-01-03 by Bosse Lincoln<bosse@ccrma.stanford.edu>
Center for Computer Research in Music and Acoustics (CCRMA),   Stanford University
CCRMA  [Automatic-links disclaimer]