Service Reliability Engineer, G&A Solutions Engineering (GSE)
As a Service Reliability Engineer, you'll be at the forefront of maintaining the health, stability, and efficiency of our services, working with a diverse range of technologies and platforms. You will collaborate with Engineers, Data Engineers, DBAs, and network specialists to proactively identify and resolve potential issues, automate repetitive tasks, and drive continuous improvement initiatives. Your expertise will directly impact the reliability of our systems, enabling Apple to deliver innovative products and services to our customers.
Minimum Qualifications
3+ years of experience in a Site Reliability Engineering, DevOps, or related role, supporting large-scale, enterprise-level services.
Strong proficiency in at least one programming language (e.g., Python, Java, Go) and one scripting language (e.g., Bash, PowerShell).
Experience with cloud platforms (e.g., AWS, Azure, GCP) and cloud-native technologies (e.g., Kubernetes, Docker).
Hands-on experience with monitoring and alerting tools (e.g., Prometheus, Grafana, Splunk, Datadog).
Experience performing root cause analysis (RCA) of technical issues.
Bachelor's degree in Computer Science or equivalent work experience.
Preferred Qualifications
Proven ability to troubleshoot complex issues in distributed systems
Familiarity with CI/CD pipelines and DevOps practices
Experience with database technologies (e.g., MySQL, PostgreSQL, NoSQL databases)
Knowledge of ITIL frameworks and incident management processes
Understanding of Linux/Unix system administration
Experience with configuration management tools (e.g., Ansible, Chef, Puppet)