Sr Site Reliability Engineer, Customer Systems
The Customer Systems Team is looking for an experienced Site Reliability Engineer. In this role you will design, build and deliver highly scalable, reliable, secure cloud infrastructure which powers the applications and services used by Apple’s customers every day. You will work closely with cross functional teams, business leaders and other partners across Apple to implement new solutions. If infrastructure as code, automation and intelligent monitoring excites you then this is the job for you.
In this highly visible position, you will:
Minimum Qualifications
5+ years of experience in designing and building resilient, large-scale, low latency, cloud and on-prem Infrastructure including Compute, Storage, and Network
3+ years of experience with deploying/managing Kubernetes using Helm
Experience with Shell Scripting, Python, or Ansible
Experience in monitoring using Splunk, Grafana, Prometheus, Alertmanager
Deep understanding of networking protocols: DNS, TCP, HTTP/HTTPS
Experience in setting up and managing CI/CD pipelines
Bachelor's or Master's in Computer Science or equivalent experience
Preferred Qualifications
Excellent problem solving, critical thinking, and interpersonal skills
Good communication skills to collaborate with distributed teams
Experience with Cassandra, MongoDB, Couchbase databases, AWS S3 or similar storage technologies
Experience in deploying, monitoring and supporting java applications
Experience with ArgoCD and GitOps model
Experience in defining, monitoring and achieving key operational metrics like MTTR and SLO
Experience with GenAI tools in workflow automation for infrastructure management
Ability to learn new technologies in a short time