Music 422/EE 367C: Perceptual Audio Coding

Prof. Marina Bosi

Topics and Tentative Schedule, Winter 2007-2008

Below is a tentative schedule, subject to update. Required readings from the course textbook (M. Bosi and R. E. Goldberg, "Introduction to Digital Audio Coding and Standards", Kluwer Academic Publishers, Dec 2002) are referenced for each week. As a general rule, readings should be always done according to topics and prior to class.  Unless otherwise specified the class will meet in the CCRMA classroom at the Knoll from 2:15pm until 4:05pm.
Date Topic
Reading Due
1/11  Course Overview and Audio Signal Representation
Chapters 1 and 3

1/18  Quantization
Chapters 2 HW1
1/22
Time to Frequency Mapping I
(8:45am - 10:00am)


Chapter 4
This class is optional


1/25  Time to Frequency Mapping II
Chapter 5 HW2
2/1  Introduction to Psychoacoustics
Chapters 6 and 7 HW3
2/8  Bit Allocation and Basic Building Blocks of an Audio Codec
Chapters 8 and 9 HW4
2/14 Audio Codec Evaluation *
Chapter 10 HW5

(*) This class will be held at Dolby Laboratories in San Francisco (2:30 pm - 5:30 pm) on Thursday instead of usual Friday classtime.
2/22  Overview of MPEG and MPEG-1 Audio Coding
Chapter 11 HW6 (Project Proposal)
2/29  Overview of MPEG-2 and MPEG-4 Audio Coding
Chapters 12, 13 and 15 HW7
3/7  Overview of Other Coding Standards (AC-3, etc.)
Chapter 14 Project due 5pm*
3/14  Project Presentations





Topic outline:

  1. Overview
    1. Course Introduction
    2. Administrative Information
    3. What is an Audio Coder?
    4. Coding Goals
    5. The Simplest Coder: PCM
    6. Types of PCM Coding Errors
    7. A More Complex Coder: Transform Coding
    8. Why do we need more complex coders?
  2. Representation of Audio Signals
    1. Notation and Basics
    2. Time and Frequency Domain Representation of Audio Signals
    3. Sampling of Audio Signals
    4. Choice of Representation for Band-Limited Signals of Finite Duration
  3. Quantization of Audio Signals
    1. Review Binary Numbers
    2. Quantization
    3. Quantization Error
    4. Block Floating Point Quantization
    5. Entropy Coding
  4. Introduction to Psychoacoustics
    1. Sound Pressure Levels
    2. Processing in the Auditory System
    3. Threshold in Quiet
    4. Critical Bands (Bark Scale)
    5. Masking
    6. Models and Modeling
  5. Time to Frequency Mapping of Audio Signals
    1. Discrete Fourier Transform
    2. Time-Frequency Analysis (STFT)
    3. Fast Algorithms (FFT/IFFT)
    4. Filterbanks Based on Time-Domain Aliasing Cancellation
    5. Quadrature Mirror Filterbanks
    6. Frequency vs. Time Resolution Considerations
  6. Bit Allocation and the Basic Building Blocks of Perceptual Audio Coders
    1. Data Rates
    2. Bit Allocation
    3. Perceptual Coding Building Blocks
  7. Quality Evaluation of Perceptual Audio Coding
    1. Measurement of Impairments in Perceptual Audio Coding
    2. Measurement Scale
    3. How to Carry out Listening Tests
    4. Perform Listening Test
  8. State-of-the-Art Coders
    1. Applications of Perceptual Audio Coding: Part I, Single and Dual Channel Audio Coding - MPEG-1, MPEG-2 Low Sampling Frequencies (LSF)
  9. State-of-the-Art Coders (Cont.)
    1. Applications of Perceptual Audio Coding: Part II, Multichannel Audio Coding - MPEG-2, AC-3
    2. Applications of Perceptual Audio Coding: Part III, Current and Future Developments - MPEG-2 AAC, MPEG-4
  10. Presentation of final Projects