logo inner

Senior Data Engineer

CompanyDefined AI
LocationLisbon, Portugal
TypeOnsite
Sub
Software Engineer

Description


Defined.ai is a leading provider of high-quality, ethically sourced data for Artificial Intelligence (AI) and Machine Learning (ML) model training. We host the world's largest AI marketplace and offer end-to-end services to help companies accelerate their AI solutions. Backed by significant funding and recognized globally for our commitment to ethical AI, we operate in a fast-paced, innovative environment with offices in Seattle and Lisbon. This is a hybrid or remote position.

What will you do?


  • Pipeline Orchestration
  • Design and maintain end-to-end data workflows using Dagster, handling complex dependencies, retries, backfills, and observability.
  • Build asset-based pipelines with clear ownership, lineage, and SLAs.
  • Data Transformation
  • Develop modular dbt models (staging, intermediate, marts) to transform raw data into clean, production-grade datasets.
  • Apply best practices in testing, documentation, and versioning.
  • Data Ingestion & Python Development
  • Write robust Python-based ingestion pipelines for APIs, event streams, files, and third-party systems.
  • Build reusable frameworks for validation, enrichment, and complex preprocessing (using Pandas, PySpark, or similar).
  • Data Modeling & Analytics Enablement
  • Implement dimensional models (Star Schema) and Medallion Architecture (Bronze/Silver/Gold).
  • Deliver datasets optimized for BI tools, analytics, and downstream consumers.
  • Quality, Observability & Reliability
  • Enforce data quality using dbt tests, freshness checks, and schema validation.
  • Implement monitoring and alerting through Dagster and cloud-native tooling.
  • Collaboration & Product Thinking
  • Work closely with Product Managers, Analytics, and Engineering teams to translate business needs into scalable data models.
  • Contribute to roadmap planning, technical trade-offs, and prioritization.

Who are we looking for?


We’re looking for a proactive, product-minded data engineer who enjoys building reliable systems, improving developer experience, and turning raw data into trusted insights.

  • BSc or MSc in Computer Science, Engineering, or a related field
  • 5+ years of experience in data engineering or analytics engineering roles
  • Strong Python skills (OOP, data processing, APIs)
  • Advanced SQL skills, including query optimization and performance tuning
  • Hands-on experience with:
  • Dagster (assets, schedules, sensors) or similar orchestration tools
  • dbt for transformations, testing, and documentation
  • Solid understanding of modern data architectures:
  • Data Lake, Data Warehouse, Lakehouse
  • Medallion Architecture, ELT/ETL patterns
  • Experience with at least one major cloud platform:
  • AWS (S3, Glue, Athena), GCP, or Azure
  • Familiarity with CI/CD pipelines (GitHub Actions, Azure DevOps, etc.)
  • Experience consuming and building REST APIs (e.g. FastAPI)
  • Strong problem-solving skills and a pragmatic engineering mindset
  • Professional proficiency in English (spoken and written)

Nice to have


  • Experience with cloud data warehouses (Snowflake, BigQuery, Redshift)
  • Exposure to data observability tools (e.g. Monte Carlo, OpenLineage, OpenTelemetry)
  • Experience supporting analytics or BI tools (Power BI, Looker, Tableau.

Benefits


You spend a lot of your time at work, so it should be challenging, fun and interesting. At Defined.ai it will be all of those things and more. Here’s what we offer:

  • Flexible working schedule and hybrid model. We know comfort can boost creativity and performance, so you can manage your schedule and work both from one of our modern office spaces or home.
  • Excellent career development opportunities in a high growth company. With us, you can accomplish your career goals and follow a well-described career path with the support of your supervisor.
  • Culture of feedback and continuous improvement. AI is a fast-paced area, so we keep track of tech trends, and we always ask for feedback.
  • An international and diverse team. We have more than 30 nationalities at our 2 locations, and we provide language classes.
  • Continuous training opportunities. You can choose from many options: leveraging hand-on workshops, unlimited access to Coursera and formal development opportunities.
  • We love to have fun together. We joke a lot, and we can't imagine work without fun activities – we already surfed, raced carts and played soccer together.

Privacy Notice:


defined.ai/candidate-privacy-statement

Your tracker settings

We use cookies and similar methods to recognize visitors and remember their preferences. We also use them to measure ad campaign effectiveness, target ads and analyze site traffic. To learn more about these methods, including how to disable them, view our Cookie Policy or Privacy Policy.

By tapping `Accept`, you consent to the use of these methods by us and third parties. You can always change your tracker preferences by visiting our Cookie Policy.

logo innerThatStartupJob
Discover the best startup and their job positions, all in one place.
Copyright © 2025