I enjoy making things. Here is a selection of projects that I have worked on over the years.
Developed an SSM architecture that predicts the audio signal of a masked channel at a desired location in a simulated room, given the source and microphone configuration. The model learns a spatial audio representation through self-supervised learning.
Developed a Deep SSM for real-time audio denoising for edge AI applications.
Built a basic HMM-based ASR system that uses MFCC and delta features to recognize six phrases: “Odessa” (a keyword phrase), “Play music,” “Start music,” “Turn off the lights,” “Turn on the lights,” and “What time is it.”
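For readers unfamiliar with delta features, here is a minimal sketch of how they are commonly derived from a matrix of MFCC frames using the standard regression formula. It is an illustration only, not the project's actual code: the function name `delta`, the window size `N=2`, and the edge padding are all assumptions.

```python
import numpy as np

def delta(features, N=2):
    """Regression-based delta features (hypothetical helper).

    features: array of shape (T, D), one row per frame.
    N: frames on each side used in the regression (assumed value).
    Returns an array of shape (T, D).
    """
    features = np.asarray(features, dtype=float)
    T = features.shape[0]
    # Repeat the first and last frames so every frame has N neighbours.
    padded = np.pad(features, ((N, N), (0, 0)), mode="edge")
    denom = 2 * sum(n * n for n in range(1, N + 1))
    out = np.zeros_like(features)
    for n in range(1, N + 1):
        ahead = padded[N + n : N + n + T]    # frame t + n
        behind = padded[N - n : N - n + T]   # frame t - n
        out += n * (ahead - behind)
    return out / denom

# Usage: a random array stands in for real MFCCs (13 coefficients, 50 frames).
mfcc = np.random.default_rng(0).normal(size=(50, 13))
observations = np.hstack([mfcc, delta(mfcc)])  # shape (50, 26)
```

Stacking the static coefficients with their deltas gives each frame a rough sense of how the spectrum is changing over time, which is the kind of observation vector an HMM recognizer is typically trained on.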