The Feedback Delay Network (FDN) is an efficient structure for generating real-time artificial reverberation. We study the modal behaviour of FDNs and the coupling between modes that arises as mixing among the delay lines is increased. We investigate the effect of the mixing matrix on the echo density profile and propose an empirical method for determining the mixing matrix that achieves a desired mixing time.
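As a minimal sketch of the structure: delay lines feed back through a mixing matrix and a decay gain. The Householder mixing matrix, delay lengths, and scalar gain below are illustrative assumptions, not the designs studied in the paper.

```python
import numpy as np

def fdn_impulse_response(delays, g=0.97, n_samples=8000):
    """Minimal FDN sketch: an orthogonal Householder matrix mixes the
    delay-line outputs, and a scalar gain g < 1 controls the decay."""
    N = len(delays)
    # Householder matrix: orthogonal (lossless) and fully mixing
    v = np.ones((N, 1)) / np.sqrt(N)
    A = np.eye(N) - 2.0 * (v @ v.T)
    buffers = [np.zeros(d) for d in delays]   # circular delay buffers
    idx = [0] * N
    b = c = np.ones(N)                        # input/output tap gains
    out = np.zeros(n_samples)
    for n in range(n_samples):
        x = 1.0 if n == 0 else 0.0            # unit impulse input
        s = np.array([buffers[i][idx[i]] for i in range(N)])  # line outputs
        out[n] = c @ s
        fb = g * (A @ s)                      # mix and attenuate
        for i in range(N):
            buffers[i][idx[i]] = b[i] * x + fb[i]
            idx[i] = (idx[i] + 1) % delays[i]
    return out

h = fdn_impulse_response([149, 211, 263, 293])
```

Denser mixing matrices (more, and more evenly weighted, off-diagonal entries) increase the rate at which echoes build up, which is what connects the choice of matrix to the mixing time.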
In an extended work, we proposed the Grouped Feedback Delay Network (GFDN), which has different attenuation filters in different groups of delay lines. We used the GFDN to model reverberation in coupled rooms. The presentation accompanying the DAFx paper is available here. We have also explored the design of frequency-dependent, non-lossless coupling matrices in the GFDN to model wave effects such as diffraction.
I presented my research on GFDNs at the Jean Le Rond d'Alembert Institute at Sorbonne University in December 2021.
Similar to the FDN, the Scattering Delay Network (SDN) is a delay-network reverberator. Unlike the FDN, the SDN has parameters based on the room geometry and the source and listener positions, thereby rendering early reflections accurately and higher-order reflections with a coarser approximation. The standard SDN only renders first-order reflections exactly. In the SCReAM project, we have extended SDNs to render higher-order reflections correctly by proposing various topologies and directional scattering matrices. The higher-order SDNs were rated higher in naturalness and texture than the standard SDN and the image method.
At Facebook Reality Labs, my research on Room Impulse Response Interpolation for augmented reality applications was published in ICASSP 2021. We detect and interpolate low-frequency room modes from sparse microphone measurements to a continuous spatial mapping by solving the homogeneous Helmholtz equation with non-linear optimization. With offline estimation of the model parameters, the room response can be interpolated in real time with parallel biquad filters as the subject moves around the room.
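A simplified sketch of the underlying idea: any solution of the homogeneous Helmholtz equation at a single frequency is a superposition of plane waves whose wavenumber is fixed by that frequency, so sparse measurements can be fit and the field evaluated anywhere. The 2-D geometry, fixed direction grid, and linear least-squares fit below are illustrative stand-ins for the paper's non-linear optimization.

```python
import numpy as np

rng = np.random.default_rng(1)
c, f = 343.0, 150.0              # speed of sound, a low room resonance (assumed)
k = 2 * np.pi * f / c            # wavenumber fixed by the Helmholtz equation

# Dictionary of plane-wave directions on the unit circle (2-D sketch)
M = 16
th = np.linspace(0, 2 * np.pi, M, endpoint=False)
K = k * np.stack([np.cos(th), np.sin(th)], axis=1)   # (M, 2) wavevectors

def field(r, coeffs):
    """Homogeneous-Helmholtz solution: sum of plane waves exp(j k.r)."""
    return np.exp(1j * r @ K.T) @ coeffs

true_coeffs = rng.standard_normal(M) + 1j * rng.standard_normal(M)

# Sparse 'microphone' positions and measurements in a 4 m x 4 m region
r_mic = rng.uniform(0, 4, size=(40, 2))
p_mic = field(r_mic, true_coeffs)

# Least-squares fit of plane-wave amplitudes to the sparse measurements
Phi = np.exp(1j * r_mic @ K.T)
coeffs_hat, *_ = np.linalg.lstsq(Phi, p_mic, rcond=None)

# Interpolate the sound field at an unmeasured position
r_new = np.array([[1.3, 2.7]])
p_hat = field(r_new, coeffs_hat)
p_true = field(r_new, true_coeffs)
```

Once the mode parameters are fixed offline, evaluating the interpolated response at a new listener position reduces to cheap per-mode filtering, which is what makes the real-time stage fast.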
The wave-effects of diffraction and occlusion are key to reproducing audio realism in VR. Whilst many mathematical models of sound diffraction exist, our study is the first to compare them perceptually. Developed during a final year technical project, the paper associated with the study won the best paper award at AES Conference on Audio for Virtual and Augmented Reality, 2022.
The Image Method (IM) is widely used for rendering the acoustics of shoebox rooms. However, it cannot model wave scattering, and in highly symmetric rooms it leads to the phenomenon of "sweeping echoes" due to the time alignment of multiple image sources. We address this problem by replacing the plane-wave reflection coefficient used in the IM with the spherical-wave reflection function, which takes directional scattering into account. The resulting Complex Image Method significantly reduces sweeping echoes in cuboid rooms.
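For reference, the standard IM construction that our method modifies looks roughly like this: first-order image sources of a shoebox room, each arrival scaled by distance spreading and a frequency-independent plane-wave reflection coefficient. The room dimensions, positions, and coefficient are made-up illustration values.

```python
import numpy as np

def image_sources(src, room):
    """First-order image sources of a shoebox room [Lx, Ly, Lz]:
    reflect the source across each of the six walls."""
    images = []
    for axis in range(3):
        for wall in (0.0, room[axis]):
            img = np.array(src, dtype=float)
            img[axis] = 2 * wall - img[axis]
            images.append(img)
    return images

def early_ir(src, rcv, room, beta=0.8, fs=16000, dur=0.05, c=343.0):
    """Direct path plus first-order reflections, each scaled by 1/r
    spreading and a frequency-independent plane-wave reflection
    coefficient beta (the simplification that causes sweeping echoes)."""
    h = np.zeros(int(dur * fs))
    arrivals = [(1.0, np.array(src, float))]
    arrivals += [(beta, im) for im in image_sources(src, room)]
    for amp_scale, pos in arrivals:
        r = np.linalg.norm(pos - rcv)
        n = int(round(fs * r / c))
        if n < len(h):
            h[n] += amp_scale / r
    return h

room = np.array([5.0, 4.0, 3.0])
h = early_ir([1.0, 2.0, 1.5], np.array([3.5, 1.0, 1.2]), room)
```

In a symmetric room, many higher-order images become equidistant and their arrivals align in time; replacing beta with the angle-dependent spherical-wave reflection function breaks this alignment.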
In May 2023, I gave an overview of modeling room acoustics in AR/VR applications in a talk organised by the UK Acoustics Network (UKAN). The video is available to watch below.
My dissertation research was on microphone "bleed" (cross-talk) cancellation in ensemble recordings. While recording an ensemble of musicians, it is often desirable to isolate the instruments to avoid interference from other sources. Close-miking and acoustic isolation booths are some techniques for mitigating microphone bleed. I proposed statistical signal processing methods for reducing bleed in the post-processing stage, using Maximum Likelihood and Maximum A Posteriori estimation. The proposed methods showed impressive results compared with the state-of-the-art Multichannel Wiener Filter on simulated and real recordings.
The public part of my dissertation defense (with all of its technical glitches) is available to watch online. I am a victim of pandemic-affected online vivas, but most of my friends, academic and otherwise, could attend the Zoom viva.
Sounds emanating from resonant objects such as rooms, plates and string instruments are composed of modes (standing waves) vibrating at different frequencies, each with its unique decay rate. Modal synthesis aims to reconstruct sounds by estimating these mode parameters and efficiently synthesizing modes using parallel biquad filters.
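A minimal sketch of this idea, with made-up mode parameters: each mode maps to a two-pole resonator whose pole radius encodes the decay rate, and the output is the sum of the parallel filters run on an impulse.

```python
import numpy as np

def mode_ir(f, tau, amp, phase, fs, n):
    """Impulse response of one mode's biquad, i.e. the decaying sinusoid
    amp * exp(-k/(tau*fs)) * cos(2*pi*f*k/fs + phase), generated by
    running the two-pole difference equation directly."""
    r = np.exp(-1.0 / (tau * fs))            # pole radius from decay time tau
    w = 2 * np.pi * f / fs                   # pole angle from mode frequency
    b0 = amp * np.cos(phase)
    b1 = -amp * r * np.cos(w - phase)
    a1, a2 = -2 * r * np.cos(w), r * r       # denominator 1 + a1 z^-1 + a2 z^-2
    y = np.zeros(n)
    y[0] = b0
    y[1] = b1 - a1 * y[0]
    for k in range(2, n):                    # feedback-only after the numerator taps
        y[k] = -a1 * y[k - 1] - a2 * y[k - 2]
    return y

def modal_synth(modes, fs, n):
    """Sum of parallel biquads, one per (freq, decay, amp, phase) mode."""
    return sum(mode_ir(f, tau, amp, ph, fs, n) for f, tau, amp, ph in modes)

fs = 16000
y = modal_synth([(440.0, 0.3, 1.0, 0.0), (660.0, 0.2, 0.5, 0.0)], fs, fs)
```

Because each mode is an independent second-order filter, the synthesis cost scales linearly with the number of modes and parallelizes trivially.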
We have measured and modeled carillon bells at Stanford's Hoover Tower using modal synthesis. Our 'computer carillon' can ring at different dynamic levels using a parameterized clapper-bell interaction function. Sound examples are available here.
Modal parameters can be estimated on a warped frequency axis to resolve beating partials. The proposed method, Frequency-Warped ESPRIT, is used to model coupled piano strings, which exhibit two-stage decay and beating modes in doublets and triplets, as well as room impulse responses. An additional optimization step fine-tunes the mode estimates. Sound examples are available here.
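For context, a plain (unwarped) least-squares ESPRIT on a noiseless synthetic doublet can be sketched as follows; the frequency warping and the optimization step of the proposed method are omitted, and the signal parameters are invented for illustration.

```python
import numpy as np

def esprit_freqs(x, n_modes, fs, m):
    """Plain least-squares ESPRIT: estimate the frequencies of a sum of
    (damped) real sinusoids from a single signal snapshot."""
    # Hankel data matrix: overlapping length-m windows of the signal
    H = np.lib.stride_tricks.sliding_window_view(x, m)
    U, s, Vt = np.linalg.svd(H, full_matrices=False)
    # Each real sinusoid contributes a conjugate pole pair -> rank 2 per mode
    Es = Vt[:2 * n_modes].T
    # Rotational invariance: shifting the subspace one sample multiplies it by Phi
    Phi = np.linalg.lstsq(Es[:-1], Es[1:], rcond=None)[0]
    z = np.linalg.eigvals(Phi)               # signal poles z = exp(-d + j w)
    f = np.angle(z) * fs / (2 * np.pi)
    return np.sort(f[f > 0])                 # keep positive-frequency poles

# A beating doublet: two decaying partials only 12 Hz apart
fs = 8000
n = np.arange(2000)
x = (np.sin(2 * np.pi * 440 * n / fs) * np.exp(-n / 1500.0)
     + 0.6 * np.sin(2 * np.pi * 452 * n / fs) * np.exp(-n / 1200.0))
f_hat = esprit_freqs(x, n_modes=2, fs=fs, m=200)
```

The pole magnitudes `abs(z)` give the per-sample decay, so the same eigenvalues yield both frequency and decay-rate estimates.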
We have proposed a more efficient variant of the MUSIC (MUltiple SIgnal Classification) algorithm, FAST MUSIC, which is numerically more stable and suited to detecting closely spaced beating partials in approximately periodic signals. Possible applications of these techniques in music research include modeling instruments such as pianos and bells, where close-frequency beating is often observed.
The Extended Kalman Filter (EKF) is used to track the fundamental frequency, amplitude, and instantaneous phase of monophonic audio signals. It has certain advantages: it yields a pitch estimate for every sample, unlike block-based methods such as CREPE or YIN, and it is robust to large amounts of observation noise. However, it also has drawbacks, such as poor transient performance and slow detection of rapid pitch changes, which we address in an extended journal paper. Performance on vocal singing excerpts can be found here.
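A toy illustration of the approach: a three-state EKF (frequency, amplitude, phase) tracking a single noisy sinusoid. Real pitch trackers model several harmonics of the voice, and the tuning constants here are assumptions chosen for the synthetic example.

```python
import numpy as np

def ekf_pitch_track(y, fs, f0_init, r=0.05):
    """Per-sample EKF for y[n] = a*sin(phi) + noise.
    State x = [omega, a, phi]; the phase advances by omega each sample."""
    x = np.array([2 * np.pi * f0_init / fs, 1.0, 0.0])
    P = np.diag([1e-4, 1e-2, 1e-1])
    Q = np.diag([1e-6, 1e-6, 1e-4])                  # process noise (assumed)
    F = np.array([[1.0, 0, 0], [0, 1.0, 0], [1.0, 0, 1.0]])  # phi += omega
    f_track = np.zeros(len(y))
    for n, yn in enumerate(y):
        x = F @ x                                    # predict
        P = F @ P @ F.T + Q
        w, a, phi = x                                # update: h(x) = a*sin(phi)
        Hj = np.array([0.0, np.sin(phi), a * np.cos(phi)])  # Jacobian of h
        S = Hj @ P @ Hj + r
        K = (P @ Hj) / S
        x = x + K * (yn - a * np.sin(phi))
        P = P - np.outer(K, Hj) @ P
        f_track[n] = x[0] * fs / (2 * np.pi)         # per-sample pitch estimate
    return f_track

fs = 8000
n = np.arange(4000)
rng = np.random.default_rng(0)
y = np.sin(2 * np.pi * 220 * n / fs) + 0.05 * rng.standard_normal(4000)
f_track = ekf_pitch_track(y, fs, f0_init=218.0)
```

The slow-update behavior visible here, governed by the small process-noise entries, is exactly the transient limitation discussed above: fast pitch changes require larger process noise, at the cost of noisier estimates.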
The Ranchlands' Hum is a low-frequency noise around 40 Hz that has been plaguing residents of Calgary, Canada for years. As an intern in the department of Electrical and Computer Engineering at the University of Calgary, I assisted Dr. Mike Smith in developing an Android application that could capture, store, and analyze low-frequency noise. I added features that integrated the existing application with an SQLite database and that calculated and plotted signal metrics. The project received some media attention.
The Kalman Filter is an MMSE estimator that can be used to remove background noise from speech. The filter equations are formulated from the linear autoregressive (AR) model of speech production. We implement a novel algorithm that tunes the Kalman Filter by accurately determining its parameters: the measurement and process noise covariances. We also study the effect of the AR model order on speech corrupted with various types of noise at various SNRs and summarize the results in an undergraduate thesis.
The tabla is a membranophone percussion instrument (similar to bongos) often used in Hindustani classical music. It consists of a pair of hand drums of contrasting sizes and timbres. The rhythmic pattern of a composition in Indian music is described by the term tala, which is composed of cycles of matra-s; tala roughly corresponds to metre in Western music. Our aim is to determine the number of beats that constitute tala-s in different tabla solos. We develop a heuristic algorithm that extracts peaks corresponding to single or composite strokes from the tabla signal, and devise statistical methods to remove spurious noisy peaks and to account for missed peaks. We obtain excellent results on solo tabla recordings played by human artists.
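An illustrative version of the peak-extraction step (not the exact algorithm used in the work): a frame-energy envelope, an adaptive median-plus-MAD threshold to reject spurious noise peaks, and a minimum inter-onset gap to suppress double triggers. The frame size, threshold factor, and synthetic strokes are all assumptions.

```python
import numpy as np

def detect_strokes(x, fs, frame=256, thresh_k=8.0, min_gap=0.1):
    """Heuristic stroke picker: mark a stroke at each rising edge of the
    frame-energy envelope above a robust (median + k*MAD) threshold."""
    n_frames = len(x) // frame
    env = np.array([np.sum(x[i*frame:(i+1)*frame]**2) for i in range(n_frames)])
    med = np.median(env)
    mad = np.median(np.abs(env - med)) + 1e-12   # robust to the stroke frames
    above = env > med + thresh_k * mad
    onsets, last = [], -np.inf
    for i in range(1, n_frames):
        t = i * frame / fs
        if above[i] and not above[i-1] and t - last >= min_gap:
            onsets.append(t)
            last = t                             # enforce minimum inter-onset gap
    return onsets

# Synthetic 'strokes': decaying noise bursts on a quiet noise floor
fs = 16000
rng = np.random.default_rng(3)
x = 0.01 * rng.standard_normal(4 * fs)
for t0 in (0.5, 1.0, 1.5, 2.25, 3.0):
    n0 = int(t0 * fs)
    x[n0:n0+800] += np.exp(-np.arange(800) / 200.0) * rng.standard_normal(800)
onsets = detect_strokes(x, fs)
```

Once the stroke times are in hand, the inter-onset intervals can be clustered to infer the beat count of the tala cycle.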