Lead DevOps Engineer

Project description

The project is a strategic initiative to deliver a new FX Swaps clearing capability, enabling atomic processing of swap packages within a highly regulated, low-latency clearing environment. The platform operates under strict resilience, latency (≤10 seconds SLA), and regulatory (EMIR) constraints. The DevOps function will play a critical role in ensuring high availability, automation, continuous delivery, and operational resilience, while supporting a transition towards cloud-enabled infrastructure. This is a mission-critical system requiring Cat 5 resilience, robust CI/CD pipelines, strong monitoring, and disaster recovery capabilities. The solution will initially be deployed on-prem, with future migration to AWS (multi-AZ, multi-region).

Responsibilities

  • Design, implement, and maintain CI/CD pipelines using Jenkins, GitLab, and related tools Automate infrastructure provisioning and configuration using tools such as Ansible Extend and replicate existing CI/CD pipelines for new services and repositories rather than building from scratch Manage and optimise deployment processes across on-prem Red Hat VM environments and evolving cloud platforms Implement and maintain monitoring, alerting, and observability solutions to ensure system health and SLA adherence Support high-availability and disaster recovery strategies, ensuring compliance with LSEG resilience standards Collaborate with development and QA teams to enable continuous testing, integration, and delivery Ensure secure, compliant deployments, including cyber security and regulatory requirements Support performance, capacity, and non-functional testing environments Drive infrastructure and platform automation, including: secrets management, artifact/package repositories, environment standardisation Support integration environments involving MQ-based messaging (XML-based APIs) Participate in agile ceremonies (sprint planning, retrospectives, stand-ups) as part of a Scrum team Provide production support, release coordination, and go-live support (including weekend/night coverage) Participate in hypercare support (up to 3 days post go-live with on-call escalation model) Contribute to platform readiness for cloud migration and hybrid architecture models Expected to operate in an environment where AI-assisted development is the norm

SKILLS

Must have

  • Strong experience in DevOps engineering (10+ years preferred) Proven expertise with CI/CD tools (Jenkins, GitLab CI/CD) Hands-on experience with configuration management / IaC tools (Ansible or equivalent) Hands-on experience in using AI tools (e.g. GitHub Copilot) for engineering tasks, pipeline automation enhancements, quality and compliance checks Experience managing Linux/Red Hat-based environments Solid understanding of deployment automation, release management, and rollback strategies Experience with monitoring, logging, and alerting frameworks Knowledge of high-availability, disaster recovery, and resilience engineering Familiarity with database environments (Oracle) and integration pipelines Experience working in Agile/Scrum environments within regulated SDLC frameworks Strong understanding of security best practices and compliance in financial systems Strong experience in DevOps automation beyond CI/CD, including: infrastructure automation, secrets management, environment provisioning

Nice to have

Exposure to cloud platforms (AWS, Azure, or GCP) and cloud migration strategies Experience in low-latency, high-throughput systems Familiarity with financial services / clearing / trading platforms Experience implementing observability platforms (e.g., Prometheus, ELK, Datadog) Knowledge of Java/Spring Boot application deployment and tuning Experience with caching technologies (HazelCast) Understanding of risk and regulatory requirements (EMIR, systemic systems) Experience working with messaging systems (MQ), distributed backend systems.