Senior Infrastructure Engineer

About the Role:

We're looking for a Senior Infrastructure Engineer to build and run Somnia's key backend
services: the L1 and node fleet, RPC and indexing layers, product backends, and
developer-facing services teams depend on. An SRE-minded role: you make reliability
measurable, rollouts safe, and infrastructure repeatable, so every team can move fast without
breaking things.

You treat infrastructure as a product: automate relentlessly, measure everything, and leverage
AI to accelerate development, operations, and incident response.

Key Responsibilities:

Define and maintain SLOs, SLIs, and error budgets, plus the observability—metrics,
logs, traces and alerts—that catches regressions before users do.
Build repeatable, self-service infrastructure through infrastructure-as-code, CI/CD and
golden paths so teams can provision, deploy and recover without reinventing the wheel.
Own rollouts end-to-end—progressive delivery, canaries, safe migrations and clean
rollbacks.
Operate the systems behind Somnia's nodes, validators, RPC and indexing, tuning for
performance and cost across regions.
Lead incident response and on-call, run blameless postmortems, and continuously
harden the platform.
Partner with product and protocol teams to design and operate production-ready
services. You'll rotate between embedding with engineering teams and building the
shared platform, tooling and operational standards that underpin the wider organisation.

Requirements:

Must Have

  • Strong experience operating production infrastructure at scale (cloud and/or bare metal), with deep Linux fundamentals.

  • Experience with infrastructure-as-code such as Terraform or Pulumi, alongside configuration management.

  • Experience running containers and orchestration platforms (Docker, Kubernetes) in production.

  • Strong programming skills, ideally in Go and/or TypeScript, for building automation and internal tooling.

  • Experience with observability stacks (Prometheus, Grafana, OpenTelemetry or equivalents).

  • Experience operating and monitoring distributed systems, including capacity planning and performance tuning.

  • Comfortable operating in high-stakes production environments and responding to incidents.

  • Genuine interest in crypto and on-chain systems.

Nice to Have

  • Experience operating blockchain node infrastructure (validators, RPC, archive nodes) for an L1/L2.

  • Experience with high-performance networking, low-latency systems or load balancing at
    scale.

  • Multi-region and geo-distributed deployments with failover strategies.

  • Security and key management (HSMs, secrets management, hardening).

  • EVM tooling and the wider Web3 infrastructure ecosystem.

What Success Looks Like

  • Engineers can deploy safely and frequently with confidence.

  • Platform reliability is measurable, with well-defined SLOs and continuously improving service health.

  • Infrastructure is automated, repeatable and increasingly self-service.

  • Incidents become less frequent, easier to diagnose and faster to resolve.

  • Product teams spend more time shipping features and less time managing infrastructure.

Why Join Somnia?

Design The Future: Join Somnia to work remotely with a global team, earn competitive compensation with token incentives, and help build the future of Web3 at a company where your impact truly matters.

First of its Kind: Make Somnia famous as the only hyperspeed L1 with native AI inference.

High Stakes: Influence a brand targeting a $1M+ launch budget in the 2026 peak-fragility market.

Incentives: Competitive salary + tokens.

Role Location

The role is remote, but we are looking for someone based in Europe or Asia (preferably).