Compute SRE
AS AN SRE AT APPLE YOU WILL:
- Build, operate, and scale Apple’s Cloud Platform that powers mission-critical services across the globe.
- Accelerate delivery of core services with automation and visibility into release cadences.
- Collaborate with developers to build and release reliable software that manages the lifecycle of customer VMs.
- Drive reliability and excellence of service through CI/CD, production readiness reviews, and incident response.
- Instrument, analyze, and iterate on performance bottlenecks across distributed systems.
- Actively participate in on-call rotations, capacity planning, scale testing, and disaster recovery exercises.
- Ensure uptime SLOs with well-architected systems and rigorous observability.
Minimum Qualifications
Bachelor's Degree in Computer Science, an engineering-related field, or equivalent related experience.
1+ years in an infrastructure-focused Site Reliability Engineering role.
Proficiency in Go, Python, or Java.
Proficiency with Infrastructure as Code (IaC) tools like Puppet, Chef, Ansible, or Terraform.
Experience with cloud infrastructure and experience running businesses.
Experience in architecting, building, and running large-scale distributed systems.
Experience providing 24/7 on-call support and incident management for critical production infrastructure.
Ability to troubleshoot issues across the entire infrastructure stack (profiling, tracing, etc.).
Preferred Qualifications
Interpersonal and written communication skills, targeted to both technical and non-technical audiences.
Experience operating large-scale, multi-tenant infrastructure as a managed service.