Speech Denoising with Deep Feature Losses

Submitted by francois on Sat, 07/07/2018 - 11:02am

Title	Speech Denoising with Deep Feature Losses
Publication Type	Journal Article
Year of Publication	2018
Authors	Germain, F. G., Q. Chen, and V. Koltun
Journal	arXiv:1806.10522
Date Published	06/2018
Type of Article	arXiv eprint
Abstract	We present an end-to-end deep learning approach to denoising speech signals by processing the raw waveform directly. Given input audio containing speech corrupted by an additive background signal, the system aims to produce a processed signal that contains only the speech content. Recent approaches have shown promising results using various deep network architectures. In this paper, we propose to train a fully-convolutional context aggregation network using a deep feature loss. That loss is based on comparing the internal feature activations in a different network, trained for acoustic environment detection and domestic audio tagging. Our approach outperforms the state-of-the-art in objective speech quality metrics and in large-scale perceptual experiments with human listeners. It also outperforms an identical network trained using traditional regression losses. The advantage of the new approach is particularly pronounced for the hardest data with the most intrusive background noise, for which denoising is most needed and most challenging. Code Audio examples
URL	https://arxiv.org/abs/1806.10522
Refereed Designation	Non-Refereed
Full Text	https://arxiv.org/pdf/1806.10522

Search this site:

Spring Quarter 2024

Music 101 Introduction to Creating Electronic Sounds
Music 128 Stanford Laptop Orchestra (SLOrk)
Music 155/255 (ARTSTUDI 239) Intermedia Workshop
Music 220C Research Seminar in Computer-Generated Music
Music 222A Quantum Computer Music
Music 228 SVOrk (Stanford Virtual Reality Orchestra)
Music 250A Physical Interaction Design for Music
Music 254 Computational Music Analysis
Music 257 Neuroplasticity and Musical Gaming
Music 319 Research Seminar on Computational Models of Sound Perception
Music 320C Audio DSP Projects in Faust and C++
Music 423 Graduate Research in Music Technology

Main menu

Secondary menu

Speech Denoising with Deep Feature Losses

Search this site:

Spring Quarter 2024