Data Engineer
Our team leverages modern Data Engineering, Generative AI and Machine Learning technologies to deliver actionable insights. You will be:
• Collaborating with data scientists across functional teams to define and enhance performance metrics that provide valuable insights for stakeholders
• Building and maintaining:
- Ingestion pipelines for real-time data processing
- Real-time applications driving operational monitoring
- Batch ETL/ELT applications populating our data warehouse
• Applying Generative AI and Retrieval Augmented Generation (RAG) techniques to enhance data analytics capabilities
• Applying Machine Learning technologies for anomaly detection
Minimum Qualifications
Bachelor's degree in Computer Science or equivalent professional experience
Experience in building large scale distributed systems in Java/Python or similar languages
Proficient in SQL
Experience with data warehouse architectures and dimensional modeling
Demonstrated ability to conduct performance analysis and troubleshoot large scale distributed systems
Strong collaboration skills with ability to understand complex architectures and work effectively across teams
Hands-on experience with Docker and Kubernetes
Preferred Qualifications
Production experience with Apache Kafka, Spark, or Flink
Working knowledge of Trino or similar distributed query engines
Experience building multi-agent AI systems or agentic workflows
Familiarity with Retrieval Augmented Generation (RAG) techniques working in conjunction with LLMs
Experience with creating and consuming Model Context Protocol (MCP) services