Senior Cloud & DevOps Engineer — AI Platform Operations

StratLytics is hiring a Senior Cloud & DevOps Engineer (AI Platform Operations) for our Bhubaneswar office to support a high-impact AI underwriting initiative for a North American fintech client.

This role is ideal for a senior AWS cloud, DevOps, SRE, or platform engineering professional who can design, deploy, monitor, and support production-grade AI and data applications. The selected candidate will own cloud infrastructure, deployment automation, monitoring, incident response, and operational readiness for AI applications built using AWS, Claude/Bedrock, FastAPI, PostgreSQL, S3, and related technologies.

This is not a traditional systems administration role. The role requires hands-on cloud engineering, DevOps discipline, production support maturity, and the ability to manage junior systems engineers in an 18x7 support model.

Key responsibilities include:

- Design, deploy, and manage AWS infrastructure for AI, data, and application workloads.
- Set up and manage services such as EC2/ECS/App Runner, RDS PostgreSQL/Aurora PostgreSQL, S3, IAM, VPC, Secrets Manager, KMS, CloudWatch, and related AWS services.
- Support deployment of FastAPI, Streamlit, background workers, and AI application services.
- Configure secure access to Amazon Bedrock / Claude and related AI platform services.
- Build and maintain CI/CD pipelines for reliable application deployment.
- Implement monitoring, logging, alerting, backup, recovery, and incident-response processes.
- Create and maintain runbooks, SOPs, deployment checklists, and support documentation.
- Lead L1/L2 operational support and coordinate L3 escalation to AI engineers, data engineers, and AI scientists.
- Manage, guide, and review the work of junior systems engineers.
- Ensure infrastructure is secure, auditable, cost-aware, and suitable for financial-services workloads.
- Participate in client-facing technical discussions, status reviews, and incident reviews as required.

Requirements

Required:

- 7+ years of experience in cloud engineering, DevOps, SRE, platform engineering, or production systems operations.
- Strong hands-on experience with AWS.
- Experience with core AWS services such as EC2, ECS/Fargate, App Runner, RDS PostgreSQL/Aurora PostgreSQL, S3, IAM, VPC, Security Groups, Secrets Manager, KMS, CloudWatch, and CloudTrail.
- Experience deploying and supporting Python-based applications, APIs, containers, and web services.
- Strong understanding of Docker, containerized deployments, CI/CD pipelines, and release management.
- Experience with Linux administration, shell scripting, networking basics, log analysis, and troubleshooting.
- Experience supporting production systems with monitoring, alerting, incident management, backup, recovery, and root-cause analysis.
- Working knowledge of PostgreSQL operations, connectivity, backup/restore, and performance monitoring.
- Ability to lead junior engineers and operate in an 18x7 support model.
- Strong documentation skills for runbooks, SOPs, incident reports, and deployment notes.
- Good communication skills and the ability to work with engineering, data, AI, and client teams.
- Willingness to work from the StratLytics Bhubaneswar office.

Preferred:

- Experience with Amazon Bedrock, Claude, OpenAI, or other enterprise LLM platforms.
- Experience supporting AI/ML, data science, analytics, or SaaS applications.
- Experience with FastAPI, Streamlit, Celery, Redis, or Python application stacks.
- Experience with Terraform, CloudFormation, or infrastructure-as-code tools.
- Experience with GitHub Actions, GitLab CI/CD, Jenkins, or similar tools.
- Exposure to LangGraph, LangChain, LLMOps, model-serving, or AI application observability.
- Experience with CloudWatch dashboards, centralized logging, alerting, and cost monitoring.
- Experience in fintech, banking, lending, regulated industries, or client-facing managed services.
- AWS certifications such as AWS Solutions Architect, AWS SysOps Administrator, AWS DevOps Engineer, or equivalent.

Benefits

- Competitive compensation aligned with market standards, based on experience and capability.
- Opportunity to lead cloud and DevOps operations for a high-impact AI underwriting programme.
- Exposure to AWS, Claude/Bedrock, AI platform operations, financial-services technology, and production AI systems.
- Opportunity to manage and mentor junior systems engineers.
- Work on practical, governed AI systems rather than generic proof-of-concept demos.
- Collaborate with experienced AI, data science, data engineering, and platform professionals.
- Gain experience in 18x7 support operations, client-facing delivery, and AI platform reliability.
- Candidates from other cities are welcome to apply if they are open to relocating to Bhubaneswar.