
Senior Data Engineer: Data Lake
- Portugal
- Permanente
- Horário completo
- Data platform support (PySpark, Databricks, EMR, Luigi, Airflow)
- Development, optimization and maintenance for data pipelines framework to run 10 000+ of pipelines on a daily basis
- Data modeling (bronze, silver, gold)
- Development and maintenance of a Data Quality framework built on top of DBT
- Development and maintenance of user facing service for the behavioral data ingestion (FastAPI, Docker, AWS ECS)
- Own and evolve our pipeline framework that orchestrates 10 000+ jobs daily
- Shift workloads from batch to streaming, shrinking model-to-production latency from days to hours
- Design and develop the Data Quality framework and wire it into every Constructor core service
- Enable Spark on Kubernetes, giving teams elastic, cost-efficient compute
- Develop tooling for delivering backfills throughout the data platform
- 🏝️ Unlimited vacation time - we strongly encourage all of our employees take at least 3 weeks per year
- 🌎 Fully remote team - choose where you live
- 🛋️ Work from home stipend! We want you to have the resources you need to set up your home office
- 💻 Apple laptops provided for new employees
- 🧑🎓 Training and development budget for every employee, refreshed each year
- 👪 Maternity & Paternity leave for qualified employees
- 🧠 Work with smart people who will help you grow and make a meaningful impact
- 💵 This position has a base salary range between $80k and $120k USD. The offer varies on many factors including job related knowledge, skills, experience, and interview results.
- 🎉 Regular team offsites to connect and collaborate