Engineering Lab Infrastructure Lead

At Graphcore, we’re building the future of AI compute.

We’re a team of semiconductor, software and AI experts, with deep experience in creating the complete AI compute stack - from silicon and software to infrastructure at datacenter scale.

As part of the SoftBank Group, backed by significant long-term investment, we are delivering key technology into the fast-growing SoftBank AI ecosystem.To meet the vast and exciting AI opportunity, Graphcore is expanding its teams around the world.We are bringing together the brightest minds to solve the toughest problems, in a place where everyone has the opportunity to make an impact on the company, our products and the future of artificial intelligence.

Job Summary

Our Engineering Labs are where new silicon, systems, and platforms are brought to life, tested, and scaled. We’re looking for an experienced Engineering Lab Infrastructure Lead to help us build and operate the technical foundations that support this work.

You will lead the Engineering Lab Support function, ensuring our labs, infrastructure, and services remain reliable, scalable, and effective for the engineers developing Graphcore’s next generation technologies.

Initially, this is a highly hands-on role. You’ll work directly with engineering teams, supporting lab environments, managing infrastructure, automating workflows, and solving complex technical problems. As the function grows, you’ll help build and lead a small team while defining the processes, standards, and service model that will support the organisation long term.

You’ll work closely with silicon, hardware, systems, and software engineering teams, making a direct impact on the speed and effectiveness of product development.

The Team

You’ll be joining a multidisciplinary team with strong technical skills and a very encouraging culture. We work closely together and regularly share knowledge, and your skills will make a direct impact on our business. It’s an exciting and pivotal moment for us right now, with plenty of new projects ahead. If you're looking to solve interesting problems and see your work deliver real-world results, this is the team for you.

Responsibilities and Duties

  • Leading the development of Engineering Lab infrastructure and support services
  • Acting as the technical escalation point for complex infrastructure and lab issues
  • Managing Linux-based servers and engineering environments
  • Supporting hardware bring-up, validation, and testing activities
  • Designing and improving operational processes, tooling, and automation
  • Maintaining and improving network, storage, and compute infrastructure within engineering labs
  • Managing infrastructure through configuration management and Infrastructure-as-Code practices
  • Building strong relationships with engineering teams and understanding their evolving requirements
  • Developing knowledge bases, documentation, and operational standards
  • Recruiting, mentoring, and leading a small team of Lab Infrastructure Engineers as the function grows

Essential

  • Strong Linux systems administration experience across Debian and/or Red Hat environments
  • Experience supporting engineering, research, laboratory, HPC, or data-centre environments
  • Solid networking knowledge including routing, VLANs, VPNs, and troubleshooting complex connectivity issues
  • Experience managing physical infrastructure including servers, rack-mounted equipment, BMCs, firmware, and out-of-band management
  • Experience with configuration management and automation tools such as Ansible, Puppet, or similar
  • Familiarity with authentication and access-management systems such as LDAP, RADIUS, or Active Directory integrations
  • Strong troubleshooting skills with a structured and methodical approach to problem solving
  • Excellent communication skills and a customer-focused mindset

Desirable

  • Container technologies such as Docker, containerd, or Kubernetes
  • Monitoring and observability platforms such as Prometheus, Grafana, Zabbix, OpenTelemetry, or similar
  • Python scripting and automation
  • CI/CD tooling including GitLab or GitHub Actions
  • Experience supporting hardware development, silicon validation, embedded systems, or electronics laboratories
  • Performance analysis and troubleshooting across compute, storage, and network infrastructure
  • Web infrastructure technologies including NGINX, HAProxy, or load balancing platforms

We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments.