Data Engineer
Job Title: Junior Data Engineer
Experience: 2–4 Years
Location: Remote / Hybrid
Notice Period: Immediate Joiner
Job Summary
We are looking for a motivated Junior Data Engineer with 2–4 years of experience to join our growing data engineering team. The ideal candidate has hands-on experience building and maintaining data pipelines, working with cloud-based data platforms, and developing scalable ETL/ELT solutions. You will collaborate with senior data engineers, data analysts, and business stakeholders to deliver reliable and efficient data solutions.
Key Responsibilities
- Develop, maintain, and optimize ETL/ELT pipelines for data integration and processing.
- Build data processing applications using Python and PySpark.
- Ingest data from multiple sources, including APIs, databases, flat files, and streaming platforms.
- Write efficient, scalable SQL queries for data transformation and reporting.
- Monitor, troubleshoot, and enhance the performance of existing data pipelines.
- Perform data quality validation and support data governance initiatives, including data lineage.
- Work with Azure Data Factory, Azure Data Lake, and Azure Databricks to develop cloud-native data solutions.
- Participate in Agile ceremonies such as sprint planning, as well as code reviews and continuous improvement activities.
- Collaborate with cross-functional teams to understand business requirements and deliver data solutions.
Required Qualifications
- Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field.
- 2–4 years of hands-on experience in data engineering.
- Experience working with cloud platforms, preferably Microsoft Azure.
- Strong understanding of data integration, transformation, and pipeline development.
- Excellent SQL and analytical problem-solving skills.
- Good communication and collaboration skills.
- Ability to work both independently and as part of a team in an Agile environment.
Preferred Skills
- Exposure to streaming data processing frameworks.
- Understanding of data warehousing concepts.
- Knowledge of data governance and data quality practices.
- Familiarity with DevOps practices and deployment automation.
Requirements
Required Technical Skills
- Strong programming skills in Python
- Proficiency in SQL for querying and data transformation
- Hands-on experience with PySpark
- Experience with Azure Data Factory (ADF)
- Knowledge of Azure Data Lake
- Basic working knowledge of Azure Databricks
- Experience with Git version control
- Basic understanding of Apache Kafka
- Basic knowledge of CI/CD pipelines
- Good understanding of ETL/ELT architecture and best practices