Pursuing a PhD in Audio ML: From Speech to Music | Ranjan Kumar

As I prepare to apply for PhD programs in Fall 2026, I’m on the hunt for open positions in audio ML, specifically where music, speech, and multimodal learning intersect. With a background in speech processing and a first-author publication on a speech dataset at ACL, I’m now passionate about transitioning into music and audio-focused research.

My hands-on experience in building tools for lyrics alignment, note detection for guitar, and guitar pedal settings identification has sparked my interest in music generation, source separation, and audio-visual learning. However, I’m wondering if my speech-focused publication record might make me less competitive for music/audio PhD roles.

If you’re a professor or lab hiring in these areas, I’d love to hear from you. And if you have any advice on how to make my application stand out, I’m all ears!

Leave a Comment Cancel Reply