Databricks Developer
<\/div>
We are seeking a skilled Databricks Developer<\/b> to design, develop, and optimize scalable data pipelines and analytics solutions<\/b> using Apache Spark, Python (PySpark), and SQL<\/b> within modern cloud environments. The ideal candidate will have hands\-on experience working with Databricks, Delta Lake, and cloud data platforms<\/b>, and will be responsible for building high\-performance data processing workflows that support data\-driven decision\-making.
<\/p>Key Responsibilities
<\/h3>
Pipeline Development<\/b> Design, develop, and maintain ETL/ELT data pipelines<\/b> using PySpark and SQL<\/b> in Databricks notebooks. Process large\-scale datasets and ensure reliable and efficient data transformations. Architecture Design<\/b> Implement data lake and data warehouse architectures<\/b> using Databricks Delta Lake<\/b> and Delta Live Tables<\/b>. Build and manage Medallion Architecture (Bronze, Silver, Gold layers)<\/b> for structured data processing. Performance Optimization<\/b> Optimize Spark jobs and queries<\/b> for performance, scalability, and cost\-efficiency. Manage cluster configurations, partitioning strategies, and caching mechanisms<\/b>. Cloud Integration<\/b> Integrate Databricks solutions with cloud services<\/b> such as: Azure Data Factory (ADF) Azure Data Lake Storage (ADLS) Gen2 AWS S3 or other cloud storage platforms Data Governance & Quality<\/b> Implement data quality checks, validation frameworks, and monitoring processes<\/b>. Ensure data security, encryption, masking, and lineage tracking<\/b>. Workflow Automation<\/b> Build and manage automated workflows and scheduling<\/b> using Databricks Jobs, Airflow, or CI/CD pipelines. Integrate with DevOps tools such as Azure DevOps or Jenkins<\/b> for continuous integration and deployment. Collaboration<\/b> Work closely with data engineers, analysts, and business stakeholders<\/b> to translate business requirements into scalable data solutions. Participate in Agile development processes<\/b> including sprint planning and technical discussions. Core Technologies<\/b> 3\u20135+ years of experience<\/b> with Databricks, Apache Spark, and Python (PySpark)<\/b>. Strong experience building scalable ETL/ELT pipelines<\/b>. SQL Expertise<\/b> Advanced knowledge of Spark SQL or Databricks SQL<\/b> for data transformation and analysis. Cloud Platforms<\/b> Hands\-on experience with Azure Databricks, AWS, or Google Cloud Platform (GCP)<\/b>. Data Modeling<\/b> Experience with Delta Lake<\/b> and Medallion Architecture (Bronze/Silver/Gold layers)<\/b>. Strong understanding of data modeling and data warehouse concepts<\/b>. Version Control & CI/CD<\/b> Proficiency with Git<\/b> and CI/CD tools such as Azure DevOps or Jenkins<\/b>. Experience with Data Build Tool (dbt)<\/b>. Knowledge of workflow orchestration tools such as Apache Airflow<\/b>. Experience with Machine Learning workflows using MLflow<\/b>. Familiarity with data governance and enterprise data platforms<\/b>. Bachelor\u2019s or Master\u2019s degree in Computer Science, Information Technology, Data Engineering, or a related field<\/b>. Strong problem\-solving and analytical skills<\/b> Ability to handle large\-scale data processing environments<\/b> Good communication and collaboration skills<\/b> in Agile teams
<\/p>
<\/p><\/li>
<\/p><\/li><\/ul>
<\/p>
<\/p><\/li>
<\/p><\/li><\/ul>
<\/p>
<\/p><\/li>
<\/p><\/li><\/ul>
<\/p>
<\/p>
<\/p><\/li>
<\/p><\/li>
<\/p><\/li><\/ul><\/li><\/ul>
<\/p>
<\/p><\/li>
<\/p><\/li><\/ul>
<\/p>
<\/p><\/li>
<\/p><\/li><\/ul>
<\/p>
<\/p><\/li>
<\/p><\/li><\/ul>Required Skills & Qualifications
<\/h3>
<\/p>
<\/p><\/li>
<\/p><\/li><\/ul>
<\/p>
<\/p><\/li><\/ul>
<\/p>
<\/p><\/li><\/ul>
<\/p>
<\/p><\/li>
<\/p><\/li><\/ul>
<\/p>
<\/p><\/li><\/ul>Preferred Qualifications
<\/h3>
<\/p><\/li>
<\/p><\/li>
<\/p><\/li>
<\/p><\/li><\/ul>Education
<\/h3>
<\/p><\/li><\/ul>Key Competencies
<\/h3>
<\/p><\/li>
<\/p><\/li>
<\/p><\/li><\/ul>
<\/div><\/span>