Research Scientist in evals
First to test new models, check them for scheming, review transcripts for unusual behavior, and automate these processes. Required: strong SWE experience (ideally Python), analytics, communication skills, and a deep understanding of AI models. Offers: $135k-270k + equity, relocation, meals, unlimited vacation, and development budget.