Non-negative Hidden Markov Modeling of Audio with Application to Source Separation

Submitted by gautham on Sat, 10/02/2010 - 11:03am

Title	Non-negative Hidden Markov Modeling of Audio with Application to Source Separation
Publication Type	Conference Paper
Year of Publication	2010
Authors	Mysore, G. J., P. Smaragdis, and B. Raj
Conference Name	International Conference on Latent Variable Analysis and Signal Separation (LVA / ICA)
Date Published	09/2010
Conference Location	St. Malo, France
Abstract	In recent years, there has been a great deal of work in modeling audio using non-negative matrix factorization and its probabilistic counterparts as they yield rich models that are very useful for source separation and automatic music transcription. Given a sound source, these algorithms learn a dictionary of spectral vectors to best explain it. This dictionary is however learned in a manner that disregards a very important aspect of sound, its temporal structure. We propose a novel algorithm, the non-negative hidden Markov model (N-HMM), that extends the aforementioned models by jointly learning several small spectral dictionaries as well as a Markov chain that describes the structure of changes between these dictionaries. We also extend this algorithm to the non-negative factorial hidden Markov model (N-FHMM) to model sound mixtures, and demonstrate that it yields superior performance in single channel source separation tasks.
URL	https://ccrma.stanford.edu/~gautham/Site/NFHMM.html

Search this site:

Spring Quarter 2024

Music 101 Introduction to Creating Electronic Sounds
Music 128 Stanford Laptop Orchestra (SLOrk)
Music 155/255 (ARTSTUDI 239) Intermedia Workshop
Music 220C Research Seminar in Computer-Generated Music
Music 222A Quantum Computer Music
Music 228 SVOrk (Stanford Virtual Reality Orchestra)
Music 250A Physical Interaction Design for Music
Music 254 Computational Music Analysis
Music 257 Neuroplasticity and Musical Gaming
Music 319 Research Seminar on Computational Models of Sound Perception
Music 320C Audio DSP Projects in Faust and C++
Music 423 Graduate Research in Music Technology

Main menu

Secondary menu

Non-negative Hidden Markov Modeling of Audio with Application to Source Separation

Search this site:

Spring Quarter 2024