Matt Hoffman on a Learned Source-Filter Model of Speech

Date:

Fri, 10/17/2014 - 11:00am - 12:30pm

Location:

CCRMA Seminar Rooom

Event Type:

Hearing Seminar

We propose the product-of-filters (PoF) model, a generative model that decomposes audio spectra as sparse linear combinations of "filters" in the log-spectral domain. PoF makes similar assumptions to those used in the classic homomorphic filtering approach to signal processing, but replaces hand-designed decompositions built of basic signal processing operations with a learned decomposition based on statistical inference. When applied to speech, PoF discovers a source-filter representation of speech, despite its lack of any explicit prior knowledge about the mechanisms of vocalization. The PoF model can be used as a prior in more complicated models, permitting applications to problems such as dereverberation and bandwidth expansion.

Bio:
Matt Hoffman is a research scientist in the Creative Technologies Laboratory in Adobe Research. Before that, he was a postdoc working with Prof. Andrew Gelman in the Statistics Department at Columbia University. He did his Ph.D. at Princeton University in Computer Science working in the Sound Lab with Prof. Perry Cook and Prof. David Blei. His research interests include developing efficient Bayesian (and pseudo-Bayesian) inference algorithms; hierarchical probabilistic modeling of audio, text, and marketing data; audio feature extraction, music information retrieval, and the application of music information retrieval and modeling techniques to musical synthesis.

FREE

Open to the Public

Search this site:

Spring Quarter 2024

Music 101 Introduction to Creating Electronic Sounds
Music 128 Stanford Laptop Orchestra (SLOrk)
Music 155/255 (ARTSTUDI 239) Intermedia Workshop
Music 220C Research Seminar in Computer-Generated Music
Music 222A Quantum Computer Music
Music 228 SVOrk (Stanford Virtual Reality Orchestra)
Music 250A Physical Interaction Design for Music
Music 254 Computational Music Analysis
Music 257 Neuroplasticity and Musical Gaming
Music 319 Research Seminar on Computational Models of Sound Perception
Music 320C Audio DSP Projects in Faust and C++
Music 423 Graduate Research in Music Technology

Main menu

Secondary menu

Matt Hoffman on a Learned Source-Filter Model of Speech

Search this site:

Spring Quarter 2024