As I prepare to apply for PhD programs in Fall 2026, I’m on the hunt for open positions in audio ML, specifically where music, speech, and multimodal learning intersect. With a background in speech processing and a first-author publication on a speech dataset at ACL, I’m now passionate about transitioning into music and audio-focused research.
My hands-on experience in building tools for lyrics alignment, note detection for guitar, and guitar pedal settings identification has sparked my interest in music generation, source separation, and audio-visual learning. However, I’m wondering if my speech-focused publication record might make me less competitive for music/audio PhD roles.
If you’re a professor or lab hiring in these areas, I’d love to hear from you. And if you have any advice on how to make my application stand out, I’m all ears!