DevOps Engineer - CI/CD & Monitoring (Remote, China)

BJAK’s automation systems power end-to-end insurance journeys across quote generation, policy issuance, renewals, endorsements, claims, payments and insurer integrations. These systems are business-critical: deployment stability, monitoring and fast recovery directly affect customers and operations.

We're looking for a DevOps Engineer based in China to strengthen CI/CD systems, monitoring infrastructure and production visibility across BJAK’s AI automation platform, ensuring engineers can ship safely and systems remain highly observable and reliable.

This is a fully remote position where you'll collaborate closely with our Malaysia-based engineering, product and operations teams to improve deployment safety and system observability at scale.

The Mission

Build and maintain reliable CI/CD pipelines and monitoring systems that enable fast, safe and observable deployments across BJAK’s AI automation platform, reducing production risk while improving system visibility and operational confidence.

What You’ll Own

Design and maintain CI/CD pipelines for multiple services across the platform.
Improve deployment automation, release strategies and rollback mechanisms.
Build and enhance monitoring, alerting and observability systems across production services.
Ensure system health visibility through metrics, logs, traces and dashboards.
Work with engineers to reduce deployment risk and improve release confidence.
Implement safe deployment strategies such as canary, blue-green or phased rollouts.
Improve incident detection speed and reduce mean time to recovery (MTTR).
Support infrastructure reliability for business-critical insurance workflows.
Standardize deployment and monitoring practices across engineering teams.
Continuously improve CI/CD performance, stability and maintainability.

What We're Looking For

Experience in DevOps, SRE, platform engineering or infrastructure roles.
Strong understanding of CI/CD pipelines, deployment automation and release engineering.
Experience with monitoring, logging and observability systems in production environments.
Ability to troubleshoot deployment and production issues in a structured and calm manner.
Strong understanding of system reliability, uptime and operational risk.
Experience supporting production systems with high availability requirements.
Hands-on ownership mindset during incidents and deployment failures.
Practical judgment on release safety, performance and system stability.
Strong collaboration with engineering teams in fast-paced environments.
Low ego and a disciplined approach to production operations.

Bonus Points

Experience with Jenkins, GitHub Actions, GitLab CI or similar CI/CD tools.
Experience with Kubernetes, Docker or container-based deployments.
Experience with observability stacks (Prometheus, Grafana, ELK, Datadog, etc.).
Experience with infrastructure-as-code tools (Terraform, Ansible, etc.).
Experience with zero-downtime deployments and progressive delivery strategies.
Experience with cloud platforms (AWS, GCP, Azure).
Experience in fintech, insurance or other high-availability industries.
Experience improving deployment velocity and reliability at scale.
Contributions to CI/CD or monitoring system improvements.

The Kind of Builder We Want

Thinks in terms of deployment safety, system visibility and operational reliability.
Hands-on engineer who understands both pipelines and production systems deeply.
Calm and structured when handling deployment failures or production incidents.
Strong focus on observability, automation and release confidence.
Proactive in preventing issues rather than reacting to them.
Careful and deliberate when making production changes.
Builds systems engineers trust to deploy frequently and safely.

This Role Is Not For

Engineers who only react to deployment failures instead of preventing them.
People who are careless with production pipelines or release processes.
Individuals who ignore monitoring, alerting or system visibility.
Engineers who make risky deployment changes without proper safeguards.
Candidates who cannot stay calm during incidents or deployment failures.

Success in This Role

You'll be successful if you can:

Improve deployment safety, speed and reliability across all services.
Strengthen monitoring, alerting and system observability coverage.
Reduce production incidents caused by releases or configuration changes.
Improve MTTR through better visibility and incident tooling.
Enable engineers to ship with confidence and minimal operational risk.

Why Join BJAK

Build Reliable Delivery Systems – Own CI/CD and monitoring for BJAK’s AI automation platform.
High-Impact Engineering – Solve real-world release engineering and observability challenges.
Global Engineering Team – Work with experienced engineers across multiple countries.
Fully Remote – Work remotely from China while collaborating with our Malaysia-based teams.
International Exposure – Build systems used across Southeast Asian markets.
Learning & Development Budget – Funding for continuous technical growth and deeper DevOps expertise.
High Ownership Environment – Strong autonomy over deployment and monitoring architecture.
Modern Engineering Culture – Focus on reliability, speed and engineering excellence.
Competitive Compensation – Attractive salary package based on experience and impact.

Interview Process

We assess DevOps depth, CI/CD design thinking and production reliability experience. The process usually includes an application review, two interviews and a technical scenario or systems discussion.