Senior Data Engineer
Job Description \- Senior Data Engineer <\/span><\/span><\/span><\/b> CivicDataLab is seeking a Senior Data Engineer to support our current and upcoming interventions through open data platforms and open\-source tools, focusing on building robust data pipelines for automated data mining, cleaning, standardisation and transformation for AI readiness. We are looking for people who are strongly aligned with our values and have an innate sense of problem\-solving, automating processes and adapting well to dynamic environments. They will work alongside data strategists, public policy researchers and other stakeholders to develop automated ETL (Extract, Transform and Load) data pipelines and know how to model, store and manage bulk datasets. This will help us co\-create comprehensive data analytics tools and dashboards for our diverse stakeholders.<\/span><\/span><\/span><\/span><\/span> At CivicDataLab, we believe in using data, tech, design, and social science to strengthen civic engagement and drive evidence\-based decision\-making. Our projects centre on building data strategy, platforms, and applications to foster data\-driven governance. <\/span><\/span><\/span><\/span><\/span> Take ownership over developing scalable data infrastructure for our existing platforms and products from data orchestration pipelines using Prefect and Airflow, to Data Sharing Protocols for our open data collaboratives.<\/span><\/span><\/span><\/span><\/span> Create and oversee data APIs responsible for collecting, managing, and analysing data from diverse public data sources.<\/span><\/span><\/span><\/span><\/span> Maintain and monitor our existing open data platforms like Open Budgets India, Justice Hub, Open Contracting India, and oversee dataset migration into new platforms.<\/span><\/span><\/span><\/span><\/span> Thoroughly document code, processes, and all activities performed by the data team, ensuring clarity and comprehensiveness. This includes documenting algorithms, methodologies, data transformations, and the overall workflow. <\/span><\/span><\/span><\/span><\/span> Lead strategic planning and scoping for data engineering projects, aligning them with product roadmaps and stakeholder needs.<\/span><\/span><\/span><\/span><\/span> Mentor and provide technical guidance to a team of data engineers, conducting code reviews and capacity\-building sessions.<\/span><\/span><\/span><\/span><\/span> Quality & compliance: Run structured checklists (licensing/ownership validation, PII checks, metadata completeness) and coordinate fixes with publishers. \u200b<\/span><\/span><\/span><\/span><\/span> Any graduate, however, candidates with a <\/span><\/span><\/span><\/span>B.Sc./M.Sc. (or higher) in Computer Science, Data Science, or a related field<\/span><\/span><\/span><\/span><\/b> will be preferred.<\/span><\/span><\/span><\/span><\/span><\/span> 5+ Years of experience as a Data engineer.<\/span><\/span><\/span><\/span><\/span><\/span> Proficiency in Python, SQL, and Excel.<\/span><\/span><\/span><\/span><\/span><\/span> Experience building dashboards (Superset, Tableau, or Python frameworks).<\/span><\/span><\/span><\/span><\/span><\/span> Excellent written/verbal communication and documentation skills.<\/span><\/span><\/span><\/span><\/span><\/span> Public\-sector experience: Demonstrated work with government departments/PMUs or allied institutions; comfortable with consultations and public\-sector processes. <\/span><\/span><\/span><\/span><\/span><\/span> Onboarding literacy: Familiarity with dataset discovery, metadata, licensing/ownership, and basic PII/privacy checks; rigorous documentation and follow\-through. <\/span><\/span><\/span><\/span><\/span><\/span> Operations & collaboration: Excellent coordination and communication skills; readiness for periodic intra\-state travel.<\/span><\/span><\/span><\/span><\/span><\/span> Extensive working knowledge of API or Stream\-based data extraction processes.<\/span><\/span><\/span><\/span><\/span><\/span> Ability to gauge requirements for the project, make decisions and coordinate with other team members.<\/span><\/span><\/span><\/span><\/span><\/span> Prior experience in actively working and contributing to FOSS (Free and Open Source Software) communities.<\/span><\/span><\/span><\/span><\/span><\/span> Familiarity with working with Agile methodologies and Scrum processes.<\/span><\/span><\/span><\/span><\/span><\/span> Experience with scalable infrastructure practices such as microservice architecture, infrastructure as code, distributed systems, scaling methods, load balancing and more.<\/span><\/span><\/span><\/span><\/span><\/span>
─
<\/span><\/span><\/span>What We're Looking For:<\/span><\/span><\/span><\/b><\/span><\/span><\/p>
<\/span><\/p>
<\/p>
<\/div>
<\/p><\/li>
<\/p><\/li>
<\/p><\/li>
<\/p><\/li>
<\/p><\/li>
<\/p><\/li>
<\/p><\/li><\/ul>
<\/div><\/span>Requirements<\/h3>
<\/span><\/span><\/p>Recommended Skills & Requirements<\/span><\/span><\/span><\/span><\/b><\/span><\/span>:<\/span><\/span><\/span><\/span><\/b><\/span><\/span>
<\/span><\/span><\/h3>
<\/p><\/li>
<\/p><\/li>
<\/span><\/span><\/p><\/li>
<\/span><\/span><\/p><\/li>
<\/span><\/span><\/p><\/li>
<\/span><\/span><\/p><\/li>
<\/span><\/span><\/p><\/li>
<\/span><\/span><\/p><\/li>
<\/span><\/span><\/p><\/li>
<\/span><\/span><\/p><\/li>
<\/span><\/span><\/p><\/li>
<\/span><\/span><\/p><\/li>
<\/span><\/span><\/p><\/li><\/ul>
<\/div><\/span>Benefits<\/h3>