Data Engineer ETL
What you’ll be doing:
Data rules at ZipRecruiter. The ETL Data Engineer will work with the rest of the Data Services team to create and maintain the pipelines that ingest and transform data and make it available to both internal and external-facing clients. You will focus on our key data objectives of Data Freshness/Latency, Quality, Security, and Availability. You will be a key member of the team responsible for moving our legacy ETL processes to a unified ETL pipeline built on current best practices and tooling. ZipRecruiter values autonomy and ownership. The person who fills this role is expected to be self-driven, reliable, and a strong communicator.
What we’re looking for:
- 5+ years of engineering experience
- 3+ years of experience as a data-focused engineer (ETL, Big Data)
- Experience in the Hadoop ecosystem, particularly Apache Spark, Apache Hive, and Presto
- Strong experience writing and troubleshooting SQL and SQL-like dialects
- Experience in a Linux environment
- Strong desire and ability to automate tasks and approach projects with a test-driven mindset
- Experience with Scala on Spark
- Experience troubleshooting and performance tuning Spark/Hive/Hadoop jobs
- Experience with streaming technologies such as Spark Streaming and Kafka/Kinesis
Bonus Points:
- Experience with Elasticsearch and Amazon Redshift
- Ability to read and understand Perl, Python, and Bash scripts
- Experience with AWS products such as EMR, DynamoDB, S3, EC2, and Kinesis
Technologies we use:
Spark, Hive, Presto, MySQL, Redshift, Scala, Python, Perl, Airflow, Elasticsearch, Kinesis
Category: Engineering