AIML - Machine Learning Researcher - Multimodal Agent

As a member of our fast-paced group, you’ll have the unique and rewarding opportunity to shape upcoming products from Apple. We are looking for people with excellent applied machine learning, computer vision, multimodal LLM, and agent training experience, as well as solid engineering skills.

This role will have the following responsibilities:
- Developing state-of-the-art multimodal foundation models for Apple Intelligence.
- Developing various agent capabilities for multimodal LLMs, including computer use agents, visual tool use, thinking with images, and multimodal web search.
- Developing, fine-tuning, and evaluating domain-specific foundation models for various tasks and applications in Apple’s AI-powered products.
- Conducting applied research to transfer pioneering research in generative AI into production-ready technologies.
- Understanding product requirements and translating them into modeling and engineering tasks.

Minimum Qualifications
- PhD, MS, or equivalent experience
- Experience in machine learning, deep learning, computer vision, or natural language processing
- Proficiency in one of the following languages: Python, Go, Java, C++

Preferred Qualifications
- Excellent data analytical skills
- Good interpersonal skills and a team player
- PhD preferred
