Machine Learning Researcher (Music & Audio)
New York City, New York, United States
Background
At Udio, our mission is to enable the next generation of music creators with state-of-the-art AI tools. Our founding team is composed of world-leading AI researchers, formerly of Google DeepMind. Between the founders, they have 4 PhDs, degrees from Harvard, Oxford, Mila, and Edinburgh, and industry experience at DeepMind, Google, Nvidia, Dropbox, Yext, and more.
Our company isn’t just about world-class AI; our product and design team bring deep expertise and design experience from leading tech companies such as Spotify, Instagram, and Airbnb. They are innovators in their own right, constantly pushing the boundaries of what’s possible in music and technology.
We’re proud to be backed by Andreesen Horowitz and an all-star lineup of visionaries from the tech, music, and creative worlds—including will.i.am, Common, Tay Keith, United Masters, Kevin Wall, Mike Krieger (Instagram co-founder), and Oriol Vinyals (head of Gemini).
Check out this video from our Co Founder, David Ding that gives you much more insights into what we are doing here at Udio.
Role Description
We are a forward-thinking company at the forefront of innovation in AI and audio technology, dedicated to creating next-generation audio experiences. At Udio, you will have the chance to do cutting edge research on generative AI for the audio domain, with hundreds of GPUs at your disposal. You will be working with a talented team of researchers and engineers with extensive experience with large scale deep learning, and one of the most advanced diffusion modeling stacks in existence.
As a Machine Learning Researcher, you will advance the frontiers of generative music. You will join a creative and ambitious team building the future of the creative economy through AI, working directly with world-class machine learning experts who have an exceptional track record in generative AI.
What we're looking for
* PhD or Master's degree in Computer Science, Music Technology, Audio Signal Processing, or equivalent industry experience, ideally with 1+ years of postgraduate research experience
* Strong research track record in music generation and audio ML, including training and advancing foundation models beyond black-box use
* First-author publications at top-tier AI and audio/music conferences (e.g., NeurIPS, ICML, ICLR, ISMIR, ICASSP, Interspeech)
* Deep expertise in modern ML frameworks (e.g., JAX, PyTorch) for music generation and audio processing
* Proven ability to communicate complex technical research and present at leading conferences
* Strong programming background with experience optimizing complex ML systems
* Nice to have:
* Experience implementing custom neural audio processing architectures
* Expertise in low-level ML optimization e.g. profiling, latency, and kernel implementation
Our Culture
We love doing things exceptionally well, and we have fun doing so! Some cultural traits we value are:
* Passion for creating products people love
* Commitment to do what is right, no matter how hard the task
* Willingness to work hard with others who work similarly hard, to achieve revolutionary technology.
* Deep interest in generative AI and pioneering technology
* Highly competitive salary and equity
* Flexible time off (~25 days PTO/year)
* Fantastic office location in Manhattan
* Productivity package, including ChatGPT Plus and Copilot
* Top notch private health, dental, and vision insurance for you and your dependents
* 401(k) plan options with 5% employer matching
* Concierge medical/primary care through One Medical and Rightway
* Mental health support from Spring Health
* Personalized life insurance, travel assistance, and many other perks
Udio's success hinges on hiring great people and creating an environment where we can be happy, feel challenged, and do our best work.
Apply for this job
* indicates a required field
First Name *
Last Name *
Email *
Phone *
Resume/CV *
LinkedIn Profile *
Website
Describe what you feel is your most relevant research experience in machine learning for music or audio *
Describe your experience implementing models from scratch using frameworks like JAX or PyTorch (NOT using pre-trained models) *
Give a brief overview of the research topics you're most excited about in generative music and audio *
#J-18808-Ljbffr