Lead Data Engineer

Description

About Gapstars

Gapstars is a Netherlands-based software development services provider that builds remote, agile teams in Sri Lanka and Portugal for innovative tech companies. Today, we are home to 275+ TechStars and innovative minds, turning scalable software into performance products that shape the future. Our partners are highly ambitious tech companies that are looking to conquer their respective markets.

The Role

As a Lead Data Engineer, you will be responsible for designing and implementing scalable data pipelines to support our AI-driven applications. You will work closely with Data Scientists, Engineers, and Product teams to develop robust ETL/ELT processes, ensure data integrity, and optimize cloud-based data architectures. Your work will directly impact the quality of insights we deliver to healthcare professionals.

What’s in it for you

We’re pioneers. As our new team member, you will have the chance to influence the design, architecture, and refinement of the data-driven evidence-generation approaches in healthcare. You will also have a key role in driving the development of tools to expose processed information, taking advantage of the latest technologies available.

Responsibilities

  • Develop and maintain scalable ETL/ELT pipelines using Airflow.

  • Ingest and transform data from multiple sources into a structured and accessible format.

  • Write optimized SQL queries for efficient data retrieval and transformation.

  • Leverage AWS services such as S3 and Lambda for cloud-based data processing.

  • Automate data workflows using Airflow.

  • Monitor data pipeline performance and troubleshoot issues proactively.

  • Collaborate with cross-functional teams to understand data needs and optimize workflows.

  • Build CRM connectors for data pipelines (experience with Veeva CRM or Salesforce CRM is a bonus).

Requirements

  • 5+ years of experience in data engineering or a related field.

  • Proficiency in Python, Bash and SQL for data processing and analysis.

  • Experience with cloud-based data solutions, preferably AWS (S3, Lambda).

  • Hands-on experience with ETL/ELT frameworks such as Apache Airflow or comparable tools (Luigi, Kedro, etc.).

  • Strong understanding of data modeling, indexing, and query optimization.

  • Familiarity with CI/CD pipelines, Git workflows, and Docker.

  • Excellent problem-solving and communication skills.

  • Willingness to learn and adapt in a fast-paced environment.

Nice to Have

  • Knowledge of Databricks or Snowflake.

  • Exposure to machine learning or AI-driven data applications.

  • Experience with prompt engineering and LLMs.

Gapstars is committed to a diverse and inclusive workplace. We are an equal opportunity employer and do not discriminate based on race, national origin, gender, disability, or age. Your personal information collected during the application process is handled in accordance with our privacy policy and used exclusively for recruitment and hiring purposes.


*You may unsubscribe from these communications at any time. For our full Privacy Policy, Click here.

