Jump to Navigation

Main menu

  • Login
Home

Secondary menu

  • [Room Booking]
  • [Wiki]
  • [Webmail]

DDSP: Differentiable Digital Signal Processing

Date: 
Tue, 02/18/2020 - 5:30pm - 7:00pm
Location: 
CCRMA Classroom [Knoll 217]
Event Type: 
DSP Seminar
Abstract: Classical DSP techniques have recently been overshadowed by deep learning for applications such as image recognition and audio generation. End-2-end learning has been key to this shift, enabling the optimization of high-dimensional nonlinear functions. However DSP techniques provide interpretability, modularity, and efficiency lacking from black-box deep networks. In this talk, I'll review the Differentiable Digital Signal Processing (DDSP) library, which enables direct integration of classic signal processing elements with deep learning methods. Focusing on audio synthesis, we achieve high-fidelity generation without the need for large autoregressive models or adversarial losses, demonstrating that DDSP enables utilizing strong inductive biases without losing the expressive power of neural networks. Further, we show that combining interpretable modules permits manipulation of each separate model component, with applications such as independent control of pitch and loudness, realistic extrapolation to pitches not seen during training, blind dereverberation of room acoustics, transfer of extracted room acoustics to new environments, and transformation of timbre between disparate sources. In short, DDSP enables an interpretable and modular approach to generative modeling, without sacrificing the benefits of deep learning.

Bio: Jesse Engel is lead research scientist on Magenta, a research team within Google Brain exploring the role of machine learning in creative applications. He did his Bachelors, and Ph.D., at UC Berkeley, studying the martian atmosphere and quantum dot nanoelectronics respectively, and a joint postdoc at Berkeley and Stanford on neuromorphic computing. Afterward, he worked with Andrew Ng to help found the Baidu Silicon Valley AI Lab and was a key contributor to DeepSpeech 2, a speech recognition system named one of the ‘Top 10 Breakthroughs of 2016’ by MIT Technology Review. He joined Google Brain in 2016, where he his research on Magenta includes creating new generative models for audio (DDSP, NSynth), symbolic music (MusicVAE, GrooVAE), adapting to user preferences (Latent Constraints, MIDI-Me), and work to close the gap between research and musical applications (NSynth Super, Magenta Studio). Outside of work, he is also a professional-level jazz guitarist, and likes to include in his bio that he once played an opening set for the Dalai Lama at the Greek Theatre.
FREE
Open to the Public
  • Calendar
  • Home
  • News and Events
    • All Events
      • CCRMA Concerts
      • Colloquium Series
      • DSP Seminars
      • Hearing Seminars
      • Guest Lectures
    • Event Calendar
    • Events Mailing List
    • Recent News
  • Academics
    • Courses
    • Current Year Course Schedule
    • Undergraduate
    • Masters
    • PhD Program
    • Visiting Scholar
    • Visiting Student Researcher
    • Workshops 2022
  • Research
    • Publications
      • Authors
      • Keywords
      • STAN-M
      • Max Mathews Portrait
    • Research Groups
    • Software
  • People
    • Faculty and Staff
    • Students
    • Alumni
    • All Users
  • User Guides
    • New Documentation
    • Booking Events
    • Common Areas
    • Rooms
    • System
  • Resources
    • Planet CCRMA
    • MARL
  • Blogs
  • Opportunities
    • CFPs
  • About
    • The Knoll
      • Renovation
    • Directions
    • Contact

Search this site:

Spring Quarter 2022

Music 101 Introduction to Creating Electronic Sounds
Music 123F Wild Sound Explorers
Music 128 Stanford Laptop Orchestra (SLOrk)
Music 220C Research Seminar in Computer-Generated Music
Music 251 Psychophysics and Music Cognition
Music 254 Computational Music Analysis
Music 257 Neuroplasticity and Musical Gaming
Music 264 Musical Engagement
Music 285 Intermedia Lab
Music 320C Audio DSP Projects in Faust and C++

 

 

 

   

CCRMA
Department of Music
Stanford University
Stanford, CA 94305-8180 USA
tel: (650) 723-4971
fax: (650) 723-8468
info@ccrma.stanford.edu

 
Web Issues: webteam@ccrma

site copyright © 2009 
Stanford University

site design: 
Linnea A. Williams