Data Engineer

Description

We are looking for a talented data engineer to join our Consumer Data Science team. The group is part of a larger DS team and focuses on customer analytics and modelling, informing all product decisions and creating models to improve efficiency, growth, and security. To do this, we use data from various sources and of varying quality. Our ETL processes serve both the wider company (in the form of dashboards and clean, simplified tables of aggregated statistics) and the Data Science team itself (cleaning and processing data for analysis and modelling purposes, ensuring reproducibility).

 

We are looking for someone with experience in designing, building, and maintaining data pipelines and data lakes. As a data engineer, you will be involved in all aspects of data collection, cleaning, and processing, ensuring the quality and availability of data. You will collaborate closely with data scientists, platform engineers, and front-end engineers, defining requirements and designing new data processes, as well as maintaining and improving existing ones. We are looking for someone who is passionate about high-quality data and understands the impact it has in solving real-life problems. Being proactive in identifying issues, digging deep into their source, and developing solutions is at the heart of this role.



What you will do

  • Maintain the current data lake infrastructure and evolve it to meet new requirements
  • Maintain and extend our core data infrastructure and existing data pipelines and ETLs 
  • Provide best practices and frameworks for data testing and validation, and ensure the reliability and accuracy of data
  • Complement our data scientists by providing a reliable, secure and maintainable modelling framework that can be used to easily deploy models to production 
  • Design, develop and implement data visualization and analytics tools and data products

 


What you will need

  • Bachelor’s degree in Computer Science, Applied Mathematics, Engineering or any other technology-related field
  • Previous experience working in a data engineering role
  • Fluency in Python
  • Previous experience with ETL pipelines
  • Experience working with Google Cloud Platform
  • In-depth knowledge of SQL and NoSQL databases
  • Experience with Git

 


Nice to have

  • Experience with Airflow or Google Cloud Composer
  • Experience with other programming languages, such as Java, Kotlin or Scala
  • Experience with Spark or other Big Data frameworks
  • Experience with distributed and real-time technologies (Kafka, etc.)

 


Compensation and perks

  • Unlimited vacation policy: work hard and take time off when you need it
  • Apple equipment
  • Full-time salary based on experience and meaningful equity in an industry-leading company
  • Benefits dependent on employee location
  • Flexible hours and smart working options

 


Application

  • CV/Resume or LinkedIn profile
  • Link to GitHub, Stack Overflow, personal website and/or blog (if applicable)

Locations

Fully Remote

Apply now!

Published 11 months ago