Summary

Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other's ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It's the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you'll do more than join something; you'll add something.

In the Siri Attention and Invocation team, we act as the front door to our users' interactions with Siri on almost every shipping Apple device. We work hard to make sure that Siri responds only when intended, in an efficient and privacy-preserving manner.

Description

We are looking for an intern to explore speech synthesis and audio generation techniques. The ideal candidate will be very familiar with audio generation or text-to-speech synthesis.

Key responsibilities:
- Develop audio generation and speech synthesis methods
- Build automated evaluation pipelines to assess the quality of the synthetic data
- Optimize developed models for efficient inference

Minimum Qualifications

- Bachelor's degree in Computer Science or equivalent
- Demonstrable experience in training deep learning systems on multiple GPUs in PyTorch
- Demonstrable experience in audio, text-to-speech, and speech-to-text technologies
- Knowledge of the state of the art in audio generation, e.g. autoregressive vs. non-autoregressive systems

Preferred Qualifications

- Demonstrable experience with diffusion and/or autoregressive audio generation models
- Publications in audio generation at well-known conferences