Agentic AI Engineer - RL

\u200b<\/span>
<\/div>

ABOUT XENONSTACK<\/b>
<\/h3>

XenonStack is the fastest\-growing <\/span>Data and AI Foundry for Agentic Systems<\/b>, enabling people and organizations to gain <\/span>real\-time and intelligent business insights<\/b>.
<\/p>

We deliver innovation through:
<\/p>

  • Akira AI<\/span><\/a><\/b> <\/span>\u2013 Building Agentic Systems for AI Agents<\/p><\/li>

  • XenonStack Vision AI<\/span><\/a><\/b> <\/span>\u2013 Vision AI Platform<\/p><\/li>

  • NexaStack AI<\/span><\/a><\/b> <\/span>\u2013 Inference AI Infrastructure for Agentic Systems<\/p><\/li><\/ul>

    Our mission is to accelerate the world\u2019s transition to <\/span>AI + Human Intelligence<\/b>, combining reasoning, perception, and action to create <\/span>enterprise\-ready AI agents<\/b>.
    <\/p>


    THE OPPORTUNITY<\/b>
    <\/h3>

    We are seeking an <\/span>Agentic AI Engineer (Specialized in Reinforcement Learning)<\/b> <\/span>with <\/span>2\u20135 years of experience<\/b> <\/span>in applying RL to enterprise\-grade systems. This role involves designing and deploying <\/span>adaptive AI agents<\/b> <\/span>that continuously learn, optimize decisions, and evolve in dynamic environments.
    <\/p>

    You\u2019ll work at the intersection of <\/span>RL research, agentic orchestration, and real\-world enterprise workflows<\/b> <\/span>\u2014 building agents that do more than automate, but truly <\/span>reason, adapt, and improve over time<\/b>.
    <\/p>


    JOB ROLES AND RESPONSIBILITIES<\/b>
    <\/h3>

    Reinforcement Learning Development<\/b>
    <\/p>

    • Design, implement, and train <\/span>RL algorithms<\/b> <\/span>(PPO, A3C, DQN, SAC) for enterprise decision\-making tasks.
      <\/p><\/li>

    • Develop <\/span>custom simulation environments<\/b> <\/span>to model business processes and operational workflows.
      <\/p><\/li>

    • Experiment with <\/span>reward function design<\/b> <\/span>to balance efficiency, accuracy, and long\-term value creation.
      <\/p><\/li><\/ul>

      Agentic AI System Design<\/b>
      <\/p>

      • Build <\/span>production\-ready RL\-driven agents<\/b> <\/span>capable of dynamic decision\-making and task orchestration.
        <\/p><\/li>

      • Integrate RL models with <\/span>LLMs, knowledge bases, and external tools<\/b> <\/span>for agentic workflows.
        <\/p><\/li>

      • Implement <\/span>multi\-agent systems<\/b> <\/span>to simulate collaboration, negotiation, and coordination.
        <\/p><\/li><\/ul>

        Deployment & Optimization<\/b>
        <\/p>

        • Deploy RL agents on <\/span>cloud and hybrid infrastructures<\/b> <\/span>(AWS, GCP, Azure).
          <\/p><\/li>

        • Optimize training and inference pipelines using <\/span>distributed computing frameworks<\/b> <\/span>(Ray RLlib, Horovod).
          <\/p><\/li>

        • Apply <\/span>model optimization techniques<\/b> <\/span>(quantization, ONNX, TensorRT) for scalable deployment.
          <\/p><\/li><\/ul>

          Evaluation & Monitoring<\/b>
          <\/p>

          • Develop pipelines for <\/span>evaluating agent performance<\/b> <\/span>(robustness, reliability, interpretability).
            <\/p><\/li>

          • Implement <\/span>fail\-safes, guardrails, and observability<\/b> <\/span>for safe enterprise deployment.
            <\/p><\/li>

          • Document processes, experiments, and lessons learned for continuous improvement.
            <\/p><\/li><\/ul>


            SKILLS REQUIREMENTS<\/b>
            <\/h3>

            Technical Skills<\/b>
            <\/p>

            • 2\u20135 years of hands\-on experience with <\/span>Reinforcement Learning frameworks<\/b> <\/span>(Ray RLlib, Stable Baselines, PyTorch RL, TensorFlow Agents).
              <\/p><\/li>

            • Strong programming skills in <\/span>Python<\/b>; proficiency with <\/span>PyTorch / TensorFlow<\/b>.
              <\/p><\/li>

            • Experience designing and training <\/span>RL algorithms<\/b> <\/span>(PPO, DQN, A3C, Actor\-Critic methods).
              <\/p><\/li>

            • Familiarity with <\/span>simulation environments<\/b> <\/span>(Gymnasium, Isaac Gym, Unity ML\-Agents, custom simulators).
              <\/p><\/li>

            • Experience in <\/span>reward modeling and optimization<\/b> <\/span>for real\-world decision\-making tasks.
              <\/p><\/li>

            • Knowledge of <\/span>multi\-agent systems<\/b> <\/span>and collaborative RL is a strong plus.
              <\/p><\/li>

            • Familiarity with <\/span>LLMs + RLHF (Reinforcement Learning with Human Feedback)<\/b> <\/span>is desirable.
              <\/p><\/li>

            • Exposure to <\/span>cloud platforms (AWS/GCP/Azure)<\/b>, containers (Docker, Kubernetes), and CI/CD for ML.
              <\/p><\/li><\/ul>

              Professional Attributes<\/b>
              <\/p>

              • Strong analytical and problem\-solving mindset.
                <\/p><\/li>

              • Ability to balance <\/span>research depth<\/b> <\/span>with <\/span>practical engineering<\/b> <\/span>for production\-ready systems.
                <\/p><\/li>

              • Collaborative approach, working across AI, data, and platform teams.
                <\/p><\/li>

              • Commitment to <\/span>Responsible AI<\/b> <\/span>(bias mitigation, fairness, transparency).
                <\/p><\/li><\/ul>


                XENONSTACK CULTURE \u2013 JOIN US & MAKE AN IMPACT!<\/b>
                <\/h3>

                At XenonStack, we believe in <\/span>shaping the future of intelligent systems<\/b>. We foster a <\/span>culture of cultivation<\/b> <\/span>built on bold, human\-centric leadership principles, where <\/span>deep work, simplicity, and adoption<\/b> <\/span>define everything we do.
                <\/p>

                Our Cultural Values<\/b>
                <\/p>

                • Agency<\/b> <\/span>\u2013 Be self\-directed and proactive.
                  <\/p><\/li>

                • Taste<\/b> <\/span>\u2013 Sweat the details and build with precision.
                  <\/p><\/li>

                • Ownership<\/b> <\/span>\u2013 Take responsibility for outcomes.
                  <\/p><\/li>

                • Mastery<\/b> <\/span>\u2013 Commit to continuous learning and growth.
                  <\/p><\/li>

                • Impatience<\/b> <\/span>\u2013 Move fast and embrace progress.
                  <\/p><\/li>

                • Customer Obsession<\/b> <\/span>\u2013 Always put the customer first.
                  <\/p><\/li><\/ul>

                  Our Product Philosophy<\/b>
                  <\/p>

                  • Obsessed with Adoption<\/b> <\/span>\u2013 Making AI agents accessible and enterprise\-ready.
                    <\/p><\/li>

                  • Obsessed with Simplicity<\/b> <\/span>\u2013 Turning complex RL + agentic challenges into intuitive, reliable systems.
                    <\/p><\/li><\/ul>

                    Be part of our mission to <\/span>reimagine adaptive, enterprise\-grade AI agents<\/b> <\/span>with Reinforcement Learning and accelerate the world\u2019s transition to <\/span>AI + Human Intelligence<\/b>.
                    <\/p>


                    WHY SHOULD YOU JOIN US?<\/b>
                    <\/h3>

                    <\/p>

                    1. Agentic AI Product Company<\/b>
                    <\/div>
                    Build <\/span>enterprise\-grade AI platforms<\/b> <\/span>powered by Machine Learning, Generative AI, and Agentic Systems. From Vision AI to Inference Infrastructure, you\u2019ll shape products that redefine enterprise AI adoption.
                    <\/div>

                    <\/p>

                    <\/p>

                    2. A Fast\-Growing Category Leader<\/b>
                    <\/div>
                    XenonStack is one of the <\/span>fastest\-growing Data and AI Foundries<\/b>, setting benchmarks in how businesses deploy and scale AI agents with platforms like <\/span>Akira AI, NexaStack, and Vision AI<\/b>.
                    <\/div>

                    <\/p>

                    <\/p>

                    3. Career Mobility & Growth<\/b>
                    <\/div>
                    Move between roles and functions \u2014 from <\/span>AI Engineering to Product Marketing or AgentOps<\/b> <\/span>\u2014 and craft a career that grows with your aspirations.
                    <\/div>

                    <\/p>

                    <\/p>

                    4. Global Exposure<\/b>
                    <\/div>
                    Work with <\/span>Fortune 500 enterprises, BFSI leaders, and global innovators<\/b>, delivering real\-world impact across industries and geographies.
                    <\/div>

                    <\/p>

                    <\/p>

                    5. Create Real Impact<\/b>
                    <\/div>
                    Contribute from day one. Even junior team members work on <\/span>mission\-critical product features<\/b> <\/span>that go into production.
                    <\/div>

                    <\/p>

                    <\/p>

                    6. Culture of Excellence<\/b>
                    <\/div>
                    Our values \u2014 <\/span>Agency, Taste, Ownership, Mastery, Impatience, and Customer Obsession<\/b> <\/span>\u2014 empower you to push boundaries and innovate fearlessly.
                    <\/div>

                    <\/p>

                    <\/p>

                    7. Responsible AI First<\/b>
                    <\/div>
                    Join a company that prioritizes <\/span>trustworthy, explainable, and compliant AI<\/b>. You\u2019ll contribute to <\/span>Responsible AI frameworks<\/b>, ensuring our agentic systems are not just powerful, but also ethical and reliable.\u200b<\/span>
                    <\/div>

                    <\/p>

                    \u200b<\/span>
                    <\/div><\/span>