Jump to Navigation

Main menu

  • Login
Home

Secondary menu

  • [Room Booking]
  • [Wiki]
  • [Webmail]

Sound localization using a deep graph signal-processing model for acoustic imaging

Date: 
Wed, 08/23/2023 - 3:30pm - 4:30pm
Event Type: 
Guest Lecture
Adrian S. Roman (USC) joins us to discuss his ongoing project.

ABSTRACT:
Our research explores ways to leverage the architecture of DeepWave, originally used as an acoustic camera, to enable precise localization of sound sources. While DeepWave inherently generates spherical maps in the form of sound intensity fields, it has not been utilized for determining precise localization coordinates of sound sources.

We adopted two approaches to unfold DeepWave’s potential as a localization model. The first approach shows how DeepWave’s generated sound intensity fields can be processed for localization. We performed k-means clustering on DeepWave's output fields to identify centroids of relevant sound intensity areas. As a result of a fine-tuned k-means algorithm, DeepWave can compete with the baseline SELDNet model from the DCASE challenge using the LOCATA dataset. The second approach shows how DeepWave can also be integrated into larger networks for sound localization. We employ transfer learning using DeepWave as a feature extractor by taking its sound intensity fields as inputs to a Gated
Recurrent Unit (GRU) network. The results demonstrate how stacking DeepWave with a GRU network, like the one in the SELDNet model, outperforms the SELDNet model itself when trained and evaluated with LOCATA.

The proposed architecture that combines DeepWave with a GRU network presented in this project achieves robust localization accuracy compared to existing models. In conclusion, DeepWave can easily be used as a core building block in sound localization networks. The full implementation of DeepWave written in PyTorch used on this research can be found at
https://github.com/adrianSRoman/DeepWaveTorch.git.

NOTE: this event is part of the DL4MIR workshop series (ccrma-mir.github.io); guest speaker talks are open to the broader CCRMA community.
FREE
Open to the Public
  • Calendar
  • Home
  • News and Events
    • All Events
      • CCRMA Concerts
      • Colloquium Series
      • DSP Seminars
      • Hearing Seminars
      • Guest Lectures
    • Event Calendar
    • Events Mailing List
    • Recent News
  • Academics
    • Courses
    • Current Year Course Schedule
    • Undergraduate
    • Masters
    • PhD Program
    • Visiting Scholar
    • Visiting Student Researcher
    • Workshops 2023
  • Research
    • Publications
      • Authors
      • Keywords
      • STAN-M
      • Max Mathews Portrait
    • Research Groups
    • Software
  • People
    • Faculty and Staff
    • Students
    • Alumni
    • All Users
  • User Guides
    • New Documentation
    • Booking Events
    • Common Areas
    • Rooms
    • System
  • Resources
    • Planet CCRMA
    • MARL
  • Blogs
  • Opportunities
    • CFPs
  • About
    • The Knoll
      • Renovation
    • Directions
    • Contact

Search this site:

Fall Courses at CCRMA

Music 101 Introduction to Creating Electronic Sounds
Music 192A Foundations in Sound Recording Technology
Music 201 CCRMA Colloquium
Music 220A Foundations of Computer-Generated Sound
Music 223A Composing Electronic Sound Poetry
Music 256A Music, Computing, and Design I: Software Paradigms for Computer Music
Music 319 Research Seminar on Computational Models of Sound Perception
Music 320 Introduction to Audio Signal Processing
Music 351A Research Seminar in Music Perception and Cognition I
Music 423 Graduate Research in Music Technology
Music 451A Auditory EEG Research I

 

 

 

   

CCRMA
Department of Music
Stanford University
Stanford, CA 94305-8180 USA
tel: (650) 723-4971
fax: (650) 723-8468
info@ccrma.stanford.edu

 
Stanford Digital Accessibility
Web Issues: webteam@ccrma
site copyright © 2009-2023
Stanford University

site design: 
Linnea A. Williams