Agentic AI Engineer - RL

\u200b<\/span>
<\/div>

ABOUT XENONSTACK<\/b>
<\/h3>
XenonStack is the fastest\-growing <\/span>Data and AI Foundry for Agentic Systems<\/b>, enabling people and organizations to gain <\/span>real\-time and intelligent business insights<\/b>.
<\/p>
We deliver innovation through:
<\/p>
Akira AI<\/span><\/a><\/b> <\/span>\u2013 Building Agentic Systems for AI Agents<\/p><\/li>
XenonStack Vision AI<\/span><\/a><\/b> <\/span>\u2013 Vision AI Platform<\/p><\/li>
NexaStack AI<\/span><\/a><\/b> <\/span>\u2013 Inference AI Infrastructure for Agentic Systems<\/p><\/li><\/ul>
Our mission is to accelerate the world\u2019s transition to <\/span>AI + Human Intelligence<\/b>, combining reasoning, perception, and action to create <\/span>enterprise\-ready AI agents<\/b>.
<\/p>
THE OPPORTUNITY<\/b>
<\/h3>
We are seeking an <\/span>Agentic AI Engineer (Specialized in Reinforcement Learning)<\/b> <\/span>with <\/span>2\u20135 years of experience<\/b> <\/span>in applying RL to enterprise\-grade systems. This role involves designing and deploying <\/span>adaptive AI agents<\/b> <\/span>that continuously learn, optimize decisions, and evolve in dynamic environments.
<\/p>
You\u2019ll work at the intersection of <\/span>RL research, agentic orchestration, and real\-world enterprise workflows<\/b> <\/span>\u2014 building agents that do more than automate, but truly <\/span>reason, adapt, and improve over time<\/b>.
<\/p>
JOB ROLES AND RESPONSIBILITIES<\/b>
<\/h3>
Reinforcement Learning Development<\/b>
<\/p>
Design, implement, and train <\/span>RL algorithms<\/b> <\/span>(PPO, A3C, DQN, SAC) for enterprise decision\-making tasks.
<\/p><\/li>
Develop <\/span>custom simulation environments<\/b> <\/span>to model business processes and operational workflows.
<\/p><\/li>
Experiment with <\/span>reward function design<\/b> <\/span>to balance efficiency, accuracy, and long\-term value creation.
<\/p><\/li><\/ul>
Agentic AI System Design<\/b>
<\/p>
Build <\/span>production\-ready RL\-driven agents<\/b> <\/span>capable of dynamic decision\-making and task orchestration.
<\/p><\/li>
Integrate RL models with <\/span>LLMs, knowledge bases, and external tools<\/b> <\/span>for agentic workflows.
<\/p><\/li>
Implement <\/span>multi\-agent systems<\/b> <\/span>to simulate collaboration, negotiation, and coordination.
<\/p><\/li><\/ul>
Deployment & Optimization<\/b>
<\/p>
Deploy RL agents on <\/span>cloud and hybrid infrastructures<\/b> <\/span>(AWS, GCP, Azure).
<\/p><\/li>
Optimize training and inference pipelines using <\/span>distributed computing frameworks<\/b> <\/span>(Ray RLlib, Horovod).
<\/p><\/li>
Apply <\/span>model optimization techniques<\/b> <\/span>(quantization, ONNX, TensorRT) for scalable deployment.
<\/p><\/li><\/ul>
Evaluation & Monitoring<\/b>
<\/p>
Develop pipelines for <\/span>evaluating agent performance<\/b> <\/span>(robustness, reliability, interpretability).
<\/p><\/li>
Implement <\/span>fail\-safes, guardrails, and observability<\/b> <\/span>for safe enterprise deployment.
<\/p><\/li>
Document processes, experiments, and lessons learned for continuous improvement.
<\/p><\/li><\/ul>
SKILLS REQUIREMENTS<\/b>
<\/h3>
Technical Skills<\/b>
<\/p>
2\u20135 years of hands\-on experience with <\/span>Reinforcement Learning frameworks<\/b> <\/span>(Ray RLlib, Stable Baselines, PyTorch RL, TensorFlow Agents).
<\/p><\/li>
Strong programming skills in <\/span>Python<\/b>; proficiency with <\/span>PyTorch / TensorFlow<\/b>.
<\/p><\/li>
Experience designing and training <\/span>RL algorithms<\/b> <\/span>(PPO, DQN, A3C, Actor\-Critic methods).
<\/p><\/li>
Familiarity with <\/span>simulation environments<\/b> <\/span>(Gymnasium, Isaac Gym, Unity ML\-Agents, custom simulators).
<\/p><\/li>
Experience in <\/span>reward modeling and optimization<\/b> <\/span>for real\-world decision\-making tasks.
<\/p><\/li>
Knowledge of <\/span>multi\-agent systems<\/b> <\/span>and collaborative RL is a strong plus.
<\/p><\/li>
Familiarity with <\/span>LLMs + RLHF (Reinforcement Learning with Human Feedback)<\/b> <\/span>is desirable.
<\/p><\/li>
Exposure to <\/span>cloud platforms (AWS/GCP/Azure)<\/b>, containers (Docker, Kubernetes), and CI/CD for ML.
<\/p><\/li><\/ul>
Professional Attributes<\/b>
<\/p>
Strong analytical and problem\-solving mindset.
<\/p><\/li>
Ability to balance <\/span>research depth<\/b> <\/span>with <\/span>practical engineering<\/b> <\/span>for production\-ready systems.
<\/p><\/li>
Collaborative approach, working across AI, data, and platform teams.
<\/p><\/li>
Commitment to <\/span>Responsible AI<\/b> <\/span>(bias mitigation, fairness, transparency).
<\/p><\/li><\/ul>
XENONSTACK CULTURE \u2013 JOIN US & MAKE AN IMPACT!<\/b>
<\/h3>
At XenonStack, we believe in <\/span>shaping the future of intelligent systems<\/b>. We foster a <\/span>culture of cultivation<\/b> <\/span>built on bold, human\-centric leadership principles, where <\/span>deep work, simplicity, and adoption<\/b> <\/span>define everything we do.
<\/p>
Our Cultural Values<\/b>
<\/p>
Agency<\/b> <\/span>\u2013 Be self\-directed and proactive.
<\/p><\/li>
Taste<\/b> <\/span>\u2013 Sweat the details and build with precision.
<\/p><\/li>
Ownership<\/b> <\/span>\u2013 Take responsibility for outcomes.
<\/p><\/li>
Mastery<\/b> <\/span>\u2013 Commit to continuous learning and growth.
<\/p><\/li>
Impatience<\/b> <\/span>\u2013 Move fast and embrace progress.
<\/p><\/li>
Customer Obsession<\/b> <\/span>\u2013 Always put the customer first.
<\/p><\/li><\/ul>
Our Product Philosophy<\/b>
<\/p>
Obsessed with Adoption<\/b> <\/span>\u2013 Making AI agents accessible and enterprise\-ready.
<\/p><\/li>
Obsessed with Simplicity<\/b> <\/span>\u2013 Turning complex RL + agentic challenges into intuitive, reliable systems.
<\/p><\/li><\/ul>
Be part of our mission to <\/span>reimagine adaptive, enterprise\-grade AI agents<\/b> <\/span>with Reinforcement Learning and accelerate the world\u2019s transition to <\/span>AI + Human Intelligence<\/b>.
<\/p>
WHY SHOULD YOU JOIN US?<\/b>
<\/h3>
<\/p>
1. Agentic AI Product Company<\/b>
<\/div>
Build <\/span>enterprise\-grade AI platforms<\/b> <\/span>powered by Machine Learning, Generative AI, and Agentic Systems. From Vision AI to Inference Infrastructure, you\u2019ll shape products that redefine enterprise AI adoption.
<\/div>
<\/p>
<\/p>
2. A Fast\-Growing Category Leader<\/b>
<\/div>
XenonStack is one of the <\/span>fastest\-growing Data and AI Foundries<\/b>, setting benchmarks in how businesses deploy and scale AI agents with platforms like <\/span>Akira AI, NexaStack, and Vision AI<\/b>.
<\/div>
<\/p>
<\/p>
3. Career Mobility & Growth<\/b>
<\/div>
Move between roles and functions \u2014 from <\/span>AI Engineering to Product Marketing or AgentOps<\/b> <\/span>\u2014 and craft a career that grows with your aspirations.
<\/div>
<\/p>
<\/p>
4. Global Exposure<\/b>
<\/div>
Work with <\/span>Fortune 500 enterprises, BFSI leaders, and global innovators<\/b>, delivering real\-world impact across industries and geographies.
<\/div>
<\/p>
<\/p>
5. Create Real Impact<\/b>
<\/div>
Contribute from day one. Even junior team members work on <\/span>mission\-critical product features<\/b> <\/span>that go into production.
<\/div>
<\/p>
<\/p>
6. Culture of Excellence<\/b>
<\/div>
Our values \u2014 <\/span>Agency, Taste, Ownership, Mastery, Impatience, and Customer Obsession<\/b> <\/span>\u2014 empower you to push boundaries and innovate fearlessly.
<\/div>
<\/p>
<\/p>
7. Responsible AI First<\/b>
<\/div>
Join a company that prioritizes <\/span>trustworthy, explainable, and compliant AI<\/b>. You\u2019ll contribute to <\/span>Responsible AI frameworks<\/b>, ensuring our agentic systems are not just powerful, but also ethical and reliable.\u200b<\/span>
<\/div>
<\/p>
\u200b<\/span>
<\/div><\/span>

Free, open-source IT job aggregator.

CLI API Ask a question GitHub