AI/ML Engineer - Generative AI

We are looking for an AI/ML Engineer with hands-on experience in Generative AI, Large Language Models (LLMs), NLP, and Retrieval-Augmented Generation (RAG) to design, build, and deploy enterprise-grade AI solutions. The ideal candidate will have strong foundations in Python-based ML development, vector databases, prompt engineering, LLM orchestration, and the Azure AI ecosystem.

1. Generative AI & LLM Engineering

Build and fine-tune LLM-based applications using models such as GPT, LLaMA, Mistral, Phi, etc.
Develop custom pipelines for RAG, grounding LLMs on enterprise documents (PDFs, Word, PPT, HTML, images).
Design robust prompt engineering strategies—system prompts, few-shot examples, evaluation prompts.
Implement guardrails using safety filters, grounding validation, and hallucination detection.

2. NLP Model Development

Develop and optimize NLP models for text classification, entity extraction, summarization, Q&A, topic modelling, and semantic search.
Apply transformer-based architectures (BERT, RoBERTa, T5, LLaMA-based models).
Build multilingual NLP pipelines where required.

3. RAG Pipelines & Knowledge Engineering

Create scalable RAG architectures using:
- Vector databases (Azure AI Search, Pinecone, Weaviate, FAISS).
- Chunking, metadata tagging, hybrid search, and document embeddings.
Build connectors for ingestion pipelines (ETL) for structured/unstructured sources.
Evaluate and optimize retrieval quality (R@K, MRR, context relevancy scores).

4. Azure AI & MLOps

Use Azure AI Search, Azure OpenAI, Azure Machine Learning, Azure Functions, Data Lake, and DevOps pipelines to build end-to-end solutions.
Deploy LLM-based APIs, microservices, and inference endpoints using AKS/ACI.
Implement model evaluation frameworks, monitoring (logging, latency, cost), and lifecycle management.

5. Software Engineering & API Development

Build Python-based backend services for LLM inference, embeddings, and RAG operations.
Develop REST APIs and integrate with downstream applications.
(Nice-to-Have) Build simple UI front-ends using React for demos and internal tools.

6. Collaboration & Documentation

Work with domain SMEs, product teams, and engineering to convert business needs into AI use cases.
Document model architectures, design decisions, and best practices.

Required Skills & Qualifications

3–5 years of experience in AI/ML, with at least 1–2 years in Generative AI/LLMs/NLP.
Strong proficiency in Python, PyTorch, Hugging Face Transformers, LangChain/LlamaIndex.
Hands-on experience with RAG architectures, vector stores, and embedding models.
Strong understanding of transformer architecture, tokenization, embeddings, and LLM evaluation metrics.
Experience with Azure AI stack—Azure OpenAI, Cognitive Search/AI Search, Azure ML, Data Lake.
Familiarity with MLOps tools for tracking, deployment, CI/CD.
Experience working with unstructured data at scale—documents, text, OCR extractions.

Nice-to-Have Skills

Experience with React.js for building AI dashboards and internal apps.
Exposure to knowledge graphs, ontology/metadata design, and structured knowledge retrieval.
Experience deploying LLMs on edge or private environments (ONNX Runtime, quantized models).
Familiarity with evaluation frameworks such as Ragas, DeepEval, or custom LLM eval pipelines.

Education

Bachelor’s/Master’s degree in Computer Science, Engineering, Data Science, AI/ML, or equivalent fields.

BGV:

Employment with WSP India is subject to the successful completion of a background verification (“BGV”) check conducted by a third-party agency appointed by WSP India.
Candidates are advised to ensure that all information provided during the recruitment process — including documents uploaded — is accurate and complete, both to WSP India and its BGV partner”.

AI/ML Engineer - Generative AI

Similar jobs

AI/ML developer - R01564464

Senior AI Engineer

AI Engineer

AI Engineer

AI engineer

Senior AI engineer