Large Machine Learning Model Optimization Engineer, SIML
We’re looking for strong Machine Learning software engineers/leaders to drive the development of the on-device Apple Intelligence LLM and diffusion model developments. This includes defining and leading the execution of model compression, distillation, and integrating to the full Apple Intelligence user experiences. We expect you to have strong, efficient ML model development experiences and a passion for shipping machine learning models on device. We also encourage publishing novel research at top ML conferences.
Minimum Qualifications
Software engineering skills in Python
Experience in developing large computer vision and machine learning models, particularly on the hardware-aware model optimizations
BS and a minimum of 3 years relevant industry experience
Preferred Qualifications
Familiar with model compression algorithms including quantization, pruning, distillations, and experience on optimizing large diffusion models or language models
MS or PhD degree in Computer Science, or equivalent industry research experience
Experience with hardware architecture, software & hardware co-design
Leadership experience in driving large-scale projects in the industry
Strong communication skills; phenomenal work ethic and collaboration
ML compiler
High performance kernel implementation
Distributed inference