Senior Engineering Manager, Platform Engineering - iCloud
iCloud serves billions of customers globally, and the Platform Engineering organization is foundational to how that scale is achieved and maintained. The orchestration layer your teams will own is central to how iCloud provisions, manages, and secures infrastructure by coordinating deployment of platform applications, enforcing policies, and enabling internal engineering teams to build and operate products without managing underlying infrastructure complexity.
Your teams' work creates compounding leverage across iCloud: every improvement to the platform makes every team building on top of it faster, safer, and more efficient. In your first year, you will be expected to stabilize and grow a high-performing team, establish a clear technical roadmap, and measurably improve platform reliability and developer experience for your internal customers.
Minimum Qualifications
Extensive experience in software engineering, with a demonstrated track record of leading engineering teams, including managing both individual contributors and managers
Experience leading teams that build and operate large-scale distributed systems or cloud infrastructure
Strong understanding of control plane architectures, service-oriented design, and API-driven systems
Track record of shipping highly reliable, production systems in high-availability environments
Experience leading incident management, on-call operations, and reliability engineering practices
Ability to operate across multiple teams or domains, driving alignment and execution in complex, matrixed environments
Strong technical depth with the ability to engage meaningfully in architecture and design discussions with senior engineers
Preferred Qualifications
Experience with container orchestration systems such as Kubernetes
Familiarity with identity, authentication, and authorization systems at scale
Experience building internal developer platforms that improve productivity and abstract infrastructure complexity
Knowledge of infrastructure automation, policy as code, and cost optimization practices
Experience designing and maturing observability practices, including SLO/SLI frameworks, distributed tracing, and on-call culture
Demonstrated ability to build psychologically safe, high-trust team cultures that attract and retain strong talent
Experience operating in large, complex organizations with high standards for quality and reliability