Lead SageMaker Platform Engineer

Job Overview:

Our team is undergoing a large data + ML migration onto AWS SageMaker Pipelines. We deploy via Terraform and GitHub Actions across multiple AWS accounts aligned to our SDLC, sync model artifacts to a shared-services account, and validate models in dedicated testing accounts. Data is sourced primarily from Redshift, including trusted identity propagation.

We’re standing these pipelines up for the first time, and we need an expert who can help us debug and ship them to production quickly and reliably.

  • Pair with our data scientists in live debugging sessions to diagnose and fix broken SageMaker pipelines and get them through the SDLC to prod.

  • Rapidly triage failures using AWS logs and telemetry (CloudWatch, CloudTrail, SageMaker pipeline/execution logs, etc.) and pinpoint root causes.

  • Untangle permissions issues across pipeline execution roles, cross-account access, and CI/CD identity (GitHub Actions OIDC, Terraform-managed IAM).

  • Help debug cross-account model artifact syncing (shared services) and the testing-account validation flow.

  • Level up the team’s mental model for how the platform works and where to look when things break.

  • Expert-level AWS operational experience, especially debugging via logs and telemetry (CloudWatch Logs/Metrics, CloudTrail, X-Ray or equivalent) — can move from a vague failure to a root cause fast.

  • Deep IAM / permissions expertise in a multi-account setup: execution roles, assume-role/cross-account access, resource policies, KMS/encryption permissions, and reasoning about “who is allowed to do what, as which principal.”

  • Hands-on SageMaker experience, including SageMaker Studio and SageMaker Pipelines — knows how pipelines are defined, deployed, and executed, and where to look when a step fails. (Operating/debugging, not modeling.)

  • Multi-account AWS experience aligned to an SDLC (dev/test/prod), including cross-account resource sharing and promotion patterns.

  • Comfortable working embedded and hands-on: live pairing, screen-sharing, and debugging under time pressure.

  • Strong communicator who can explain why something broke and how to avoid it next time.

Nice to Haves:

  • Terraform experience, especially managing IAM and SageMaker/data infrastructure as code.

  • GitHub Actions CI/CD experience, particularly OIDC-based authentication to AWS (no long-lived keys) and the IAM trust policies behind it.

  • Experience with Amazon Redshift, and ideally trusted identity propagation / IAM Identity Center integration.

  • Some ML/MLOps background — enough to speak the language of model training, artifacts, and deployment (helpful, not required).

  • AWS certifications (e.g., Solutions Architect Pro, DevOps Engineer Pro, ML Specialty) as a signal, though hands-on evidence matters more.

WHAT WE BELIEVE

At Perficient, we promise to challenge, champion, and celebrate our people. You will experience a unique and collaborative culture that values every voice. Join our team, and you’ll become part of something truly special. We believe in developing a workforce that is as diverse and inclusive as the clients we work with. We’re committed to actively listening, learning, and acting to further advance our organization, our communities, and our future leaders… and we’re not done yet. Perficient, Inc. proudly provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, national origin, age, disability, genetic information, marital status, amnesty, or status as a protected veteran in accordance with applicable federal, state and local laws. Perficient, Inc. complies with applicable state and local laws governing non-discrimination in employment in every location in which the company has facilities. This policy applies to all terms and conditions of employment, including, but not limited to, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training. Perficient, Inc. expressly prohibits any form of unlawful employee harassment based on race, color, religion, gender, sexual orientation, national origin, age, genetic information, disability, or covered veterans. Improper interference with the ability of Perficient, Inc. employees to perform their expected job duties is absolutely not tolerated. Disability Accommodations: Perficient is committed to providing a barrier-free employment process with reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or accommodation due to a disability, please contact us.

The salary range for this position takes into consideration a variety of factors, including but not limited to skill sets, level of experience, applicable office location, training, licensure and certifications, and other business and organizational needs. The new hire salary range displays the minimum and maximum salary targets for this position across all US locations, and the range has not been adjusted for any specific state differentials. It is not typical for a candidate to be hired at or near the top of the range for their role, and compensation decisions are dependent on the unique facts and circumstances regarding each candidate. A reasonable estimate of the current salary range for this position is $73,008 to $170,640. Please note that the salary range posted reflects the base salary only and does not include benefits or any potential variable compensation programs. Information regarding the benefits available for this position are in our benefits overview.

Disclaimer: The above statements are not intended to be a complete statement of job content, rather to act as a guide to the essential functions performed by the employee assigned to this classification. Management retains the discretion to add or change the duties of the position at any time.

#LI-RS1