We are seeking an experienced Databricks developer with\nstrong expertise in Azure Databricks, Apache Spark, and modern data engineering\npractices. The ideal candidate will be responsible for designing, developing,\nand optimising scalable data pipelines, data lakehouse solutions, and\ncloud\-based data platforms. The role requires hands\-on experience with Spark\nprocessing, Delta Lake, Unity Catalogue, data modelling, and enterprise\-grade data\nintegration solutions.<\/span>
<\/p><\/div>
Key\nResponsibilities<\/span><\/b><\/span>
<\/p>Data\nEngineering & Databricks Development<\/span><\/b>
<\/p>- Design, develop, and maintain scalable ETL/ELT pipelines using Databricks,\n Apache Spark, and SQL<\/b>.<\/span>
<\/li>- Build and optimize batch and streaming data pipelines using PySpark,\n Spark Structured Streaming, and Auto Loader<\/b>.<\/span>
<\/li>- Develop and support enterprise data lakehouse solutions using Delta\n Lake<\/b> and Databricks technologies.<\/span>
<\/li>- Implement data ingestion, transformation, cleansing, and\n aggregation processes for large\-scale datasets.<\/span>
<\/li>- Develop reusable frameworks and best practices for data engineering\n solutions.<\/span>
<\/li><\/ul>Data\nModeling & Performance Optimization<\/span><\/b>
<\/p>- Design and implement data models to support reporting, analytics,\n and business requirements.<\/span>
<\/li>- Build and maintain Slowly Changing Dimensions (SCD Type 1 &\n Type 2)<\/b> for data warehousing solutions.<\/span>
<\/li>- Develop and optimize Change Data Capture (CDC)<\/b> pipelines.<\/span>
<\/li>- Optimize Spark workloads through partitioning, clustering, caching,\n and performance tuning techniques.<\/span>
<\/li>- Ensure efficient query performance and scalability across large\n datasets.<\/span>
<\/li><\/ul>Unity\nCatalog & Data Governance<\/span><\/b>
<\/p>- Configure and manage Databricks Unity Catalog<\/b> environments.<\/span>
<\/li>- Create and manage catalogs, schemas, tables, materialized views,\n functions, and volumes.<\/span>
<\/li>- Implement enterprise data governance, security, access control, and\n compliance standards.<\/span>
<\/li>- Support metadata management and data lineage initiatives across the\n data platform.<\/span>
<\/li><\/ul>Cloud &\nIntegration<\/span><\/b>
<\/p>- Develop cloud\-native data solutions on Microsoft Azure<\/b> and\n related cloud services.<\/span>
<\/li>- Integrate data from multiple internal and external data sources.<\/span>
<\/li>- Implement Lakehouse Federation<\/b> and foreign catalogs to\n access external data platforms.<\/span>
<\/li>- Collaborate with architects and stakeholders to design scalable\n cloud data solutions.<\/span>
<\/li><\/ul>DevOps\n& Operational Excellence<\/span><\/b>
<\/p>- Support CI/CD implementation and automated deployment processes.<\/span>
<\/li>- Participate in code reviews, testing, and release activities.<\/span>
<\/li>- Monitor and troubleshoot data pipeline failures and performance\n issues.<\/span>
<\/li>- Ensure adherence to development standards, security policies, and\n operational best practices.<\/span>
<\/li><\/ul><\/div>
Required\nQualifications<\/span><\/b>
<\/p>- Strong hands\-on experience with Databricks<\/b> and Apache\n Spark (PySpark and/or Scala)<\/b>.<\/span>
<\/li>- Extensive experience with SQL<\/b> and complex data\n transformation techniques.<\/span>
<\/li>- Experience in ETL/ELT development and enterprise data pipeline\n implementation.<\/span>
<\/li>- Strong experience with Microsoft Azure<\/b> and cloud\-based data\n platforms.<\/span>
<\/li>- Hands\-on experience with Azure Databricks<\/b> and Delta Lake<\/b>.<\/span>
<\/li>- Experience building batch processing pipelines using Auto Loader<\/b> and real\-time pipelines using Spark Structured Streaming<\/b>.<\/span>
<\/li>- Strong understanding of data warehousing concepts and dimensional\n modeling.<\/span>
<\/li>- Experience implementing SCD Type 1<\/b>, SCD Type 2<\/b>, and CDC<\/b> processes.<\/span>
<\/li>- Strong knowledge of Spark performance tuning, partitioning, and\n optimization techniques.<\/span>
<\/li>- Experience with CI/CD pipelines and DevOps practices.<\/span>
<\/li>- Strong analytical, troubleshooting, and problem\-solving skills.<\/span>
<\/li><\/ul>
<\/div><\/span>