Senior Infrastructure Engineer

About the Role:

We're looking for a Senior Infrastructure Engineer to build and run Somnia's key backend
services: the L1 and node fleet, RPC and indexing layers, product backends, and
developer-facing services teams depend on. An SRE-minded role: you make reliability
measurable, rollouts safe, and infrastructure repeatable, so every team can move fast without
breaking things.

You treat infrastructure as a product: automate relentlessly, measure everything, and leverage
AI to accelerate development, operations, and incident response.

Key Responsibilities:

Define and maintain SLOs, SLIs, and error budgets, plus the observability—metrics,
logs, traces and alerts—that catches regressions before users do.
Build repeatable, self-service infrastructure through infrastructure-as-code, CI/CD and
golden paths so teams can provision, deploy and recover without reinventing the wheel.
Own rollouts end-to-end—progressive delivery, canaries, safe migrations and clean
rollbacks.
Operate the systems behind Somnia's nodes, validators, RPC and indexing, tuning for
performance and cost across regions.
Lead incident response and on-call, run blameless postmortems, and continuously
harden the platform.
Partner with product and protocol teams to design and operate production-ready
services. You'll rotate between embedding with engineering teams and building the
shared platform, tooling and operational standards that underpin the wider organisation.

Requirements:

Must Have

Strong experience operating production infrastructure at scale (cloud and/or bare metal), with deep Linux fundamentals.
Experience with infrastructure-as-code such as Terraform or Pulumi, alongside configuration management.
Experience running containers and orchestration platforms (Docker, Kubernetes) in production.
Strong programming skills, ideally in Go and/or TypeScript, for building automation and internal tooling.
Experience with observability stacks (Prometheus, Grafana, OpenTelemetry or equivalents).
Experience operating and monitoring distributed systems, including capacity planning and performance tuning.
Comfortable operating in high-stakes production environments and responding to incidents.
Genuine interest in crypto and on-chain systems.

Nice to Have

Experience operating blockchain node infrastructure (validators, RPC, archive nodes) for an L1/L2.
Experience with high-performance networking, low-latency systems or load balancing at
scale.
Multi-region and geo-distributed deployments with failover strategies.
Security and key management (HSMs, secrets management, hardening).
EVM tooling and the wider Web3 infrastructure ecosystem.

What Success Looks Like

Engineers can deploy safely and frequently with confidence.
Platform reliability is measurable, with well-defined SLOs and continuously improving service health.
Infrastructure is automated, repeatable and increasingly self-service.
Incidents become less frequent, easier to diagnose and faster to resolve.
Product teams spend more time shipping features and less time managing infrastructure.

Why Join Somnia?

Design The Future: Join Somnia to work remotely with a global team, earn competitive compensation with token incentives, and help build the future of Web3 at a company where your impact truly matters.

First of its Kind: Make Somnia famous as the only hyperspeed L1 with native AI inference.

High Stakes: Influence a brand targeting a $1M+ launch budget in the 2026 peak-fragility market.

Incentives: Competitive salary + tokens.

Role Location

The role is remote, but we are looking for someone based in Europe or Asia (preferably).