IT Ops Engineer
You will operate and improve the production platform: monitor systems, manage security and incidents, maintain CI/CD pipelines, and coordinate releases across nodes. You will write and maintain system documentation, run tests and recovery exercises, engage with technology partners, and provide on‑call production support when needed.
Responsibilities
- Participate in daily IT operations such as monitoring, security management, change management and incident management
- Design and maintain a highly reliable and secure production environment including logging, monitoring and analytics
- Write and maintain documentation for the platform and supporting systems
- Maintain and enhance the Continuous Integration environment
- Coordinate rollout of new releases across all nodes of the Corda Business Network
- Develop and fix key components of the collateral exchange platform
- Investigate and resolve incidents
- Engage and collaborate with technology partners to improve the technology architecture
- Participate in threat modelling, audits, due diligence and vendor risk assessments
- Coordinate and participate in penetration tests and disaster recovery exercises
- Cover part of an on-call rota to provide production support
Requirements
- Master's degree in computer science, Information Technology, Engineering or related field
- Up-to-date knowledge in networking, relational databases and security
- Solid grasp of modern DevSecOps practices including version control, continuous integration/delivery, GitOps and acceptance testing
- Significant hands-on experience in software development and operations
- Hands-on multiyear experience with Unix/Linux administration
- Experience with containerization including building, optimizing, managing and securing container images
- Experience with container orchestration and tooling such as Helm and Kubernetes; OpenShift and ArgoCD a plus
- Networking knowledge: TCP/IP, firewalls, load balancers, NAT, DNS and proxy configuration
- Security knowledge: certificate and key management, HSMs, PKI, TLS, VPNs, OIDC, vulnerability management and penetration testing
- PostgreSQL database administration experience including backup/restore and HA setups
- Automated management of environments using Ansible and Terraform; Azure experience a plus
- Testing experience: creating test plans and executing acceptance, performance, load and security tests
- Experience configuring and operating message brokers and streaming platforms (RabbitMQ, ActiveMQ, Kafka, RedPanda)
- Monitoring experience with stacks such as Prometheus/Grafana or Datadog
- Scripting skills in Python and shell
- Experience with JVM-based application stacks: building, packaging, deploying, tuning and monitoring
- Experience with git and incident management and incident response
Benefits
- An attractive and competitive benefits package
- Flexible work-from-home arrangements