Site Reliability Engineering (SRE) Manager, Private Cloud Compute

We're looking for a hardworking and passionate person to join this amazing team. You will be an accomplished builder and leader of teams looking to tackle your next challenge. You know SRE and you know what it will take to run services at Apple scale with a high degree of operational perfection. This role will position you to help shape the future of how we build and run our services on a global scale. You will have the technical skills to go deep and retain the ability to focus on higher-level business and product goals. We hire high quality leaders and engineers with a diverse set of experiences and skill sets for positions on Apple. Our customers count on us to provide extraordinary availability, scalability, and security for services. If you’d like to positively influence millions of customers’ experience of Apple this is the job for you. Minimum Qualifications Experience with large scale distributed systems, especially ML infrastructure and services including LLMs, Generative AI, and transformers 3+ Years demonstrable success leading engineering teams - ideally SRE, Production Engineering, or DevOps Knowledge of core operating system principles, networking fundamentals, and systems management Understanding of SRE principals, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts Experience with hiring and leading engineers Bachelors or Masters degree in computer science or equivalent field Preferred Qualifications 5+ years professional experience in an engineering leadership position

Similar jobs