Data Engineer

Data Engineer/Developer<\/span><\/span><\/b>
<\/p>

Role Summary<\/span><\/span><\/b>
Design, build, and optimize Azure data pipelines and lakehouse solutions.\nDeliver secure, reliable datasets with strong governance, automation, and\ndocumentation. Collaborate across teams and contribute to standards in an Agile\nsetting.<\/span><\/span><\/p>

Must\-Have (Day 1)<\/span><\/span><\/b>
<\/p>

  • Experience:\n 4\u20136 years in data engineering<\/span>
    <\/li>
  • Core\n Platform: Databricks with Python, Spark, Pandas (notebooks and modular\n code)<\/span>
    <\/li>
  • Orchestration:\n Azure Data Factory (pipelines, integration runtimes); ingest from diverse\n sources<\/span>
    <\/li>
  • Lakehouse:\n Delta Lake fundamentals; Medallion architecture (bronze/silver/gold) in\n production<\/span>
    <\/li>
  • Storage/SQL/Performance:\n Azure Data Lake Storage (ADLS); strong SQL; performance\-aware design<\/span>
    <\/li>
  • Data\n Patterns: ETL/ELT; data modeling (e.g., dimensional/star schema)<\/span>
    <\/li>
  • DevOps\n & Security: CI/CD for data projects (Azure DevOps or GitHub\n Enterprise); familiarity with Azure Entra ID for SSO/RBAC; secure\n workspace/data access<\/span>
    <\/li>
  • Quality\n & Observability: Data validation/testing, code reviews, and basic\n monitoring/alerting for jobs/pipelines<\/span>
    <\/li>
  • Ways of\n Working: Agile/Scrum (Jira/Confluence); clear pipeline and data contract\n documentation<\/span>
    <\/li>
  • Collaboration:\n Effective stakeholder engagement; support/mentor junior team members;\n clear communication<\/span>
    <\/li>
  • Generative\n AI (Day 1):<\/span>
    <\/li>
    • Prompt\n design for data tasks (ingestion, transformations, documentation) with\n clear objectives and constraints<\/span>
      <\/li>
    • Use of\n Copilot/ChatGPT to scaffold notebooks/jobs, generate tests, and optimize\n SQL/Spark\u2014validates outputs before merging<\/span>
      <\/li><\/ul><\/ul>

      Nice\-to\-Have (Train within 60\u201390 days)<\/span><\/span><\/b>
      <\/p>

      • Unity\n Catalog migration (Hive to Unity) and permissions/governance<\/span>
        <\/li>
      • Databricks\n DevOps (cluster configuration, secret management, workspace automation)<\/span>
        <\/li>
      • Azure\n Functions (C# or Python) for orchestration/integration<\/span>
        <\/li>
      • Synapse\n dedicated SQL pools or dbt; Delta Live Tables<\/span>
        <\/li>
      • Financial\n services domain exposure<\/span>
        <\/li><\/ul>

        <\/span><\/span>
        <\/div>

        Shared Expectations<\/span><\/span><\/b>
        <\/p>

        • Work\n independently with minimal supervision while contributing to team outcomes<\/span>
          <\/li>
        • Commitment\n to secure practices and production\-grade reliability<\/span>
          <\/li>
        • Continuous\n improvement mindset and willingness to learn new tools/technologies<\/span>
          <\/li>
        • Willingness\n to work within regulated environment controls and policies<\/span>
          <\/li>
        • Use\n Generative AI responsibly to improve velocity and quality (simple,\n structured prompts; guardrails; validate AI\-assisted outputs before\n adoption)<\/span>
          <\/li><\/ul>

          <\/span><\/span>
          <\/div><\/span>