Total variation in popular rap vocals from 2009-2023
![](https://ccrma.stanford.edu/sites/default/files/imagecache/thumbnail/user/iran/screen_shot_2024-06-13_at_9.30.33_am.png)
Dataset distillation for Audio-visual tasks
![](https://ccrma.stanford.edu/sites/default/files/imagecache/thumbnail/user/iran/screen_shot_2024-06-13_at_9.26.50_am.png)
Intermedia Workshop Final Projects
![](https://ccrma.stanford.edu/sites/default/files/imagecache/thumbnail/user/azaday/intermedia_workshop_poster_1.jpg)
Doors open at 6:00pm.
Alex Han - Master's Capstone Concert
![](https://ccrma.stanford.edu/sites/default/files/imagecache/thumbnail/user/alexhan/studio_portrait_1.jpg)
Josh Mitchell: Master's Recital
![](https://ccrma.stanford.edu/sites/default/files/imagecache/thumbnail/user/cobasica/poster.png)
FREE and Open to the Public | In Person + Livestream
NeuralNote: An Audio-to-MIDI Plugin Using Machine Learning
![](https://ccrma.stanford.edu/sites/default/files/imagecache/thumbnail/user/jos/photo_damien_2021_2.jpg)
Abstract: NeuralNote is an open-source audio-to-MIDI VST/AU plugin that uses machine learning for accurate audio-to-MIDI transcription. This talk will begin with an in-depth look at BasicPitch, the machine learning model from Spotify that powers NeuralNote. We will explore its internal workings and how it processes audio to generate MIDI data. Next, we will cover the integration of BasicPitch into the NeuralNote plugin, implemented in C++ using the JUCE framework. We will discuss the challenges of incorporating neural network inference in audio plugins, focusing on real-time processing, thread safety, and performance. A comparison of the ONNXRuntime and RTNeural libraries will highlight the options for neural network integration in this domain.