Clinical AI Data Specialist

Datavant is the data collaboration platform trusted for healthcare. Guided by our mission to make the world’s health data secure, accessible and actionable, we provide critical data solutions for organizations across the healthcare ecosystem - including providers, health plans, researchers, and life sciences companies. From fulfilling a single patient’s request for their medical records to powering the AI revolution in healthcare, Datavanters are building the future of how data is connected and used to improve health.

By joining Datavant today, you’re stepping onto a driven and highly collaborative team that is passionate about creating transformative change in healthcare.

What We’re Looking For:

The Data Science / Clinical AI function is seeking a Clinical AI Data Specialist to ensure the clinical accuracy of the training data, model output labels, and clinical logic — prompts and coding rules — that shape how our AI-powered risk adjustment products behave. This is a clinical coding domain-expert role first: it requires active coding credentials and the ability to independently read, interpret, and annotate clinical medical record documentation, and that expertise translates directly into measurable model performance. Errors introduced at this layer propagate into training and produce systematic clinical inaccuracies at production scale, so the quality of your judgment is the product. The technical work — annotation at scale, prompt and rule iteration, and label-quality analysis — is carried out using AI-assisted development tools; we will train the right clinical coding expert on the tooling, and a software engineering background is not required.

What You Will Do:

  • Annotate medical records for AI training data
  • Validate annotated data to ensure quality
  • Refine the clinical logic behind AI outputs
  • Provide clinical coding & HIM subject-matter expertise to data science

What a Typical Day Looks Like

In this role, you can expect to:

  • Read and interpret clinical documentation — physician notes, assessment and plan sections, problem lists, medication records — to identify codeable diagnoses, conditions, and other clinical entities (document boundaries, type, author, section), applying ICD-10-CM and risk adjustment coding standards and mapping to clinical ontologies (ICD-10-CM/PCS, CPT, RxNorm) when required by project scope
  • Distinguish conditions that meet documentation standards for coding from those that do not, exercising clinical judgment independently, and flag ambiguous or edge-case documentation with written rationale}
  • Review AI model output labels against clinical documentation to identify false positives, false negatives, and specificity errors; clean and correct label datasets and categorize error patterns for the data science team
  • Apply coding knowledge to evaluate whether model-generated code assignments are clinically and regulatorily supportable, and escalate systematic quality issues that may indicate model behavior problems
  • Translate ICD-10-CM and coding guideline requirements into explicit, testable instructions — LLM prompt language and computable coding rules — using AI-assisted tools testing revisions against curated ground-truth datasets and iterating on observed failures
  • Document the clinical rationale and precision/recall impact of each prompt or rule change for senior review

What You Need to Succeed:

  • Domain expertise with a minimum 5 years of coding and/or CDI experience with demonstrated proficiency in ICD-10-CM code assignment from clinical documentation
  • Active credential in at least one of: CCS, CPC, CRC, CDIP, CCDS, or equivalent AHIMA/AAPC certification
  • Ability to apply clinical coding standards consistently and independently to produce high-quality, reproducible labels across large document sets, catching subtle distinctions that affect code assignment
  • Ability to articulate the clinical rationale behind a labeling decision in writing for QA and audit, and to express coding requirements as explicit, unambiguous instructions — the discipline behind a well-constructed coding query
  • Works independently within established guidelines without case-by-case direction on routine annotation, and escalates systematic issues — repeated error patterns, guideline gaps, documentation quality trends — rather than resolving them in isolation

What Helps You Stand Out:

  • Coding Audit and/or Compliance Experience
  • Clinical annotation or AI/ML data labeling experience in a health-tech or healthcare AI environment
  • Familiarity with HCC reimbursement models
  • Exposure to NLP or ML model outputs in a clinical context — how model-generated codes differ from human-assigned codes

What We Offer:

  • Comprehensive health, dental, and vision insurance
  • Paid time off (PTO) plan, offering X days per year, plus holidays
  • Retirement savings plan
  • Flexible work arrangements
  • Opportunities for career growth and development
  • Employee wellness programs

We are committed to building a diverse team of Datavanters who are all responsible for stewarding a high-performance culture in which all Datavanters belong and thrive. We are proud to be an Equal Employment Opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity, religion, national origin, disability, veteran status, or other legally protected status.

At Datavant our total rewards strategy powers a high-growth, high-performance, health technology company that rewards our employees for transforming health care through creating industry-defining data logistics products and services.

The range posted is for a given job title, which can include multiple levels. Individual rates for the same job title may differ based on their level, responsibilities, skills, and experience for a specific job.

The estimated total cash compensation range for this role is:
$120,000$145,000 USD

To ensure the safety of patients and staff, many of our clients require post-offer health screenings and proof and/or completion of various vaccinations such as the flu shot, Tdap, COVID-19, etc. Any requests to be exempted from these requirements will be reviewed by Datavant Human Resources and determined on a case-by-case basis. Depending on the state in which you will be working, exemptions may be available on the basis of disability, medical contraindications to the vaccine or any of its components, pregnancy or pregnancy-related medical conditions, and/or religion.

This job is not eligible for employment sponsorship.

Datavant is committed to a work environment free from job discrimination. We are proud to be an Equal Employment Opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity, religion, national origin, disability, veteran status, or other legally protected status. To learn more about our commitment, please review our EEO Commitment Statement here. Know Your Rights, explore the resources available through the EEOC for more information regarding your legal rights and protections. In addition, Datavant does not and will not discharge or in any other manner discriminate against employees or applicants because they have inquired about, discussed, or disclosed their own pay.

At the end of this application, you will find a set of voluntary demographic questions. If you choose to respond, your answers will be anonymous and will help us identify areas for improvement in our recruitment process. (We can only see aggregate responses, not individual ones. In fact, we aren’t even able to see whether you’ve responded.) Responding is entirely optional and will not affect your application or hiring process in any way.

Datavant is committed to working with and providing reasonable accommodations to individuals with physical and mental disabilities. If you need an accommodation while seeking employment, please request it here, by selecting the ‘Interview Accommodation Request’ category. You will need your requisition ID when submitting your request, you can find instructions for locating it here. Requests for reasonable accommodations will be reviewed on a case-by-case basis.

For more information about how we collect and use your data, please review our Privacy Policy.