Jump to Navigation

Main menu

  • Login
Home

Secondary menu

  • [Room Booking]
  • [Wiki]
  • [Webmail]

Deep Waveform Synthesis

Date: 
Thu, 05/30/2019 - 5:30pm - 7:00pm
Location: 
CCRMA Class Room [Knoll 217]
Event Type: 
DSP Seminar
Abstract: Conventional audio synthesis (TTS, voice conversion, enhancement, etc) often relies on acoustic feature representations (spectrogram, MFCC, F0, etc.) and a signal processing procedure that infers the waveform from these features. However, such procedures often introduce artifacts caused by insufficient information in the feature representation (e.g. iSTFT without the correct phase info) and/or an oversimplified synthesis process (e.g. a source-filter model). Recent advancements battle this problem using deep learning: WaveNet, for example, generates the waveform sample-by-sample based on acoustic features and previously generated samples using a dilated convolutional net. This new way of synthesis opens a gate to end-to-end and high-quality audio synthesis that sounds almost real. In this talk, I will introduce some of the most notable deep waveform synthesis methods from the past three years and discuss the intuition behind them as well as future directions.

Bio: Zeyu is a research scientist at Adobe Research in San Francisco. His research interests are in speech and music synthesis, deep learning, and human-computer interaction. He received a Ph.D. degree in computer science from Princeton University advised by Adam Finkelstein and M.S degree in music technology from Carnegie Mellon University. Between 2015 and 2017, he interned at Adobe three times and presented his branding research project – VoCo – at Adobe MAX Sneaks (link to video) in 2016.

FREE
Open to the Public
  • Calendar
  • Home
  • News and Events
    • All Events
      • CCRMA Concerts
      • Colloquium Series
      • DSP Seminars
      • Hearing Seminars
      • Guest Lectures
    • Event Calendar
    • Events Mailing List
    • Recent News
  • Academics
    • Courses
    • Current Year Course Schedule
    • Undergraduate
    • Masters
    • PhD Program
    • Visiting Scholar
    • Visiting Student Researcher
    • Workshops 2022
  • Research
    • Publications
      • Authors
      • Keywords
      • STAN-M
      • Max Mathews Portrait
    • Research Groups
    • Software
  • People
    • Faculty and Staff
    • Students
    • Alumni
    • All Users
  • User Guides
    • New Documentation
    • Booking Events
    • Common Areas
    • Rooms
    • System
  • Resources
    • Planet CCRMA
    • MARL
  • Blogs
  • Opportunities
    • CFPs
  • About
    • The Knoll
      • Renovation
    • Directions
    • Contact

Search this site:

Spring Quarter 2022

Music 101 Introduction to Creating Electronic Sounds
Music 123F Wild Sound Explorers
Music 128 Stanford Laptop Orchestra (SLOrk)
Music 220C Research Seminar in Computer-Generated Music
Music 251 Psychophysics and Music Cognition
Music 254 Computational Music Analysis
Music 257 Neuroplasticity and Musical Gaming
Music 264 Musical Engagement
Music 285 Intermedia Lab
Music 320C Audio DSP Projects in Faust and C++

 

 

 

   

CCRMA
Department of Music
Stanford University
Stanford, CA 94305-8180 USA
tel: (650) 723-4971
fax: (650) 723-8468
info@ccrma.stanford.edu

 
Web Issues: webteam@ccrma

site copyright © 2009 
Stanford University

site design: 
Linnea A. Williams