Mike Mandel (Audience) - Reverberation is all around us, and yet little is understood about its effect

Date:

Fri, 05/11/2012 - 1:15pm - 2:30pm

Location:

CCRMA Seminar Room

Event Type:

Hearing Seminar

Echo, echo, echo, echo, and it's all reverberation. It adds a lot of richness to our lives.. **and** it makes it really hard for machines to understand speech.

I'm really happy to introduce Mike Mandel, who will be talking about his work to understand reverberation and its effect on source separation algorithms. Reverberation is a tough problem, one that humans with normal hearing pretty much solve without thinking about it. The same can't be said for the elderly or our machines. Mike studied this problem at Columbia, and he's now at Audience (who make the sound chips for the new iPhones).

    Who:    Mike Mandel
    Why:    Reverberation is all around us, and yet little is understood about its effect
    What:    Evaluating Reverberant Source Separation
    When:    Friday May 11 at 1:15PM
    Where:    CCRMA Seminar Room, Top Floor of the Knoll at Stanford

Not much reverberation in the Hearing Seminar, but we'll talk about it this week. Should be a good introduction to a hard problem.

- Malcolm

"Evaluating reverberant source separation"

Abstract:

While a number of recent algorithms can separate sources from reverberant binaural mixtures, the evaluation of these algorithms is still based on ideas from anechoic source separation. In this talk I will discuss some of the issues that arise when evaluating source separation algorithms with reverberant mixtures. These include a measure of the attenuation of Direct path, Early echoes, and Reverberation of Target and Masker speech (DERTM) as well as the definition of oracle masks based on a similar decomposition of impulse responses. I will also describe a new source separation algorithm, Model-based Expectation Maximization Source Separation and Localization (MESSL), that forms the core of a general probabilistic framework for source separation. An experiment evaluated using DERTM shows that MESSL and another state of the art source separator (Sawada et al., 2007) successfully suppress an interfering speaker's direct-path speech, but are much less successful in suppressing its reverberation. Such suppression slightly improves automatic speech recognition rates, but fails to improve intelligibility for human listeners.

Bio:

Michael I Mandel uses signal processing and machine learning to model sound perception and understanding. He is currently an Algorithm Developer at Audience, Inc, applying machine learning to the problem of noise suppression in mobile phones. In 2009-10 he was a postdoctoral researcher in the Machine Learning laboratory at the Université de Montréal with Yoshua Bengio and Douglas Eck. He earned his PhD in Electrical Engineering from Columbia University in 2010 in Daniel P W Ellis' Laboratory for the Recognition and Organization of Speech and Audio (LabROSA) and his BSc in Computer Science from the Massachusetts Institute of Technology in 2004.

FREE

Open to the Public

Search this site:

Spring Quarter 2024

Music 101 Introduction to Creating Electronic Sounds
Music 128 Stanford Laptop Orchestra (SLOrk)
Music 155/255 (ARTSTUDI 239) Intermedia Workshop
Music 220C Research Seminar in Computer-Generated Music
Music 222A Quantum Computer Music
Music 228 SVOrk (Stanford Virtual Reality Orchestra)
Music 250A Physical Interaction Design for Music
Music 254 Computational Music Analysis
Music 257 Neuroplasticity and Musical Gaming
Music 319 Research Seminar on Computational Models of Sound Perception
Music 320C Audio DSP Projects in Faust and C++
Music 423 Graduate Research in Music Technology

Main menu

Secondary menu

Mike Mandel (Audience) - Reverberation is all around us, and yet little is understood about its effect

Search this site:

Spring Quarter 2024