Senior Full-Stack Engineer - Web Platforms for ML Inference
As a Senior Full-Stack Engineer on the Annotation AI Services team, you will design and build web platforms that sit at the front door of ML inference and pipeline systems.
You will create modern, high-quality React and TypeScript applications while also developing cloud-native backend services deployed on AWS and Kubernetes. The role values engineers who appreciate working across the stack and who enjoy collaborating closely with infrastructure, ML systems, and platform teams to deliver cohesive, end-to-end solutions.
Minimum Qualifications
5+ years of professional software engineering experience.
Strong experience building and shipping production web applications with React and TypeScript (including testing and performance).
Working backend experience with Python and/or Node.js building APIs and integrations used by frontend applications.
Experience shipping and debugging services in a cloud environment (AWS preferred) and a containerized runtime (Kubernetes/EKS or equivalent preferred).
Experience designing and consuming APIs for frontend-backend integration (REST and/or gRPC), including auth, pagination, and error handling.
Solid fundamentals in web architecture and security (authentication/authorization and secure API design).
Comfortable working across the stack and collaborating with product/design/backend/infrastructure partners.
BS in Computer Science or equivalent practical experience.
Preferred Qualifications
Experience building internal platforms or developer-facing tools.
Familiarity with ML platforms, inference systems, or data pipelines.
Experience with gRPC and/or Protobuf-based APIs.
Exposure to observability systems (metrics, logging, tracing).
Experience operating services in production Kubernetes environments.
Experience with safe rollout strategies (e.g., canary or blue/green), autoscaling, and Kubernetes resource tuning (requests/limits).
Experience with performance and cost optimization: load/perf testing, caching strategies, capacity planning, and rate limiting/quotas.
Experience designing/operating persistence and messaging patterns (e.g., DB + queues + object storage), including schema migrations and reliability patterns (retries/DLQs).
MS in Computer Science or related field.