Software Engineering Manager, Batch Compute
In this role, as part of the ASE Compute Organization, you will:
Manage and grow the Dublin team, aligning objectives and deliverables with teams and partners across EMEA and the US.
Enjoy working on scalability, resilience, security, efficiency, and performance limits of infrastructure services.
Be curious about how systems work and, more importantly, how they fail.
Expect to own and develop significant software features from concept to production, involving cross-functional teams.
Approach complex issues with creativity, ready to take risks and course-correct effectively to impact the success of the platform.
Participate in the on-call rotation and partner with the SRE team to triage, investigate, and resolve production issues.
Work with teams across Apple to develop features and functionality that meet infrastructure and scalability requirements.
Minimum Qualifications
Demonstrated knowledge and breadth of experience leading or managing teams that resolve complex engineering problems across multi-disciplinary domains, including distributed systems, operating systems, storage, and networking, to build stable, performant, and secure virtualized compute infrastructure
Familiarity with all aspects of software development, from architecture to deployment and maintenance
Quick learner and contributor to new code bases
Customer-focused thinking and strong problem-solving skills with attention to detail
Excellent verbal and written communication skills to effectively collaborate with cross-functional partners across different time zones
Enthusiasm about upholding Apple’s standards in product quality, design, and user experience
2+ years of experience as a technical lead manager
5+ years of experience in related software development (or comparable academic experience)
Preferred Qualifications
Experience working with network virtualization stacks (VXLAN, Virtual Private Cloud) across various hardware, operating systems, container runtimes (LXC, Apple Containerization Framework), and distributed storage systems (such as Ceph)
Operational knowledge of Kubernetes clusters
Strong knowledge of networking protocols and security best practices
Experience measuring, analyzing, and optimizing performance and efficiency for large-scale compute platforms
Ability to effectively communicate within a team and with project stakeholders