Big Data Engineer - Vertoz

What we want:

We are looking for a skilled Big Data Engineer to design, develop, and maintain scalable big data solutions. The role involves working with Hadoop ecosystems, real-time and batch data processing frameworks, and cloud-based platforms. The ideal candidate will contribute to end-to-end data architecture, ensure efficient data processing, and collaborate with cross-functional teams to deliver reliable and high-performance data solutions.

Who we are:

Vertoz (NSEI: VERTOZ) is an AI-powered MadTech and CloudTech platform offering Digital Advertising, Marketing & Monetization (MadTech) and Digital Identity and Cloud Infrastructure (CloudTech) solutions. We cater to Businesses, Digital Marketers, Advertising Agencies, Digital Publishers, Cloud Providers, and Technology companies.

What you will do:

• Design, develop, and maintain scalable Hadoop-based applications and data pipelines.
• Work on documentation, system design, development, and architecture of big data solutions.
• Implement and manage batch and real-time data processing using Spark, Spark Streaming, Kafka, and related technologies.
• Develop efficient data workflows using Hadoop ecosystem tools such as Hive, Impala, and HDFS.
• Work with stream-processing frameworks including Spark Streaming, Storm, and Flume.
• Integrate and manage data across relational SQL and NoSQL databases, including Vertica.
• Support deployment and operations in cloud-based environments.
• Perform cluster management and monitoring using Cloudera Hadoop Distribution and related tools.
• Write and maintain shell scripts to automate operational tasks.
• Collaborate with teams to ensure data reliability, performance optimization, and scalability.
• Support data visualization and analytics using tools such as Superset.

Requirements

• 1+ year of hands-on experience working with Big Data technologies.
• Strong knowledge of Hadoop ecosystem tools including Hadoop, Hive, Impala, Spark, Spark Streaming, and Kafka.
• Experience with batch and real-time data processing frameworks.
• Proficiency in at least one programming language: Java, Python, or Scala.
• Experience with stream-processing systems such as Spark Streaming, Storm, or Flume.
• Good understanding of relational SQL and NoSQL databases, including Vertica.
• Exposure to cloud services and distributed systems.
• Hands-on experience with Cloudera Hadoop Distribution and cluster management.
• Basic to intermediate shell scripting skills.
• Strong problem-solving skills and ability to work in a fast-paced environment.

Benefits

• No dress code
• Flexible working hours
• 5-day work week
• 24 days of annual leave
• International presence
• Celebrations
• Team outings