Data Engineer

Description

About Gapstars

About Gapstars

Gapstars is a Netherlands-based software development services provider that builds remote, agile teams in Sri Lanka and Portugal for innovative tech companies. Today, we are home to 300+ TechStars and innovative minds, turning scalable software into performance products that shape the future. Our partners are highly ambitious tech companies that are looking to conquer their respective markets.

About the Role

The Data Operations Engineer will join a collaborative team working on a Python-based data engineering platform, including the data lake and processing pipelines. The role involves developing and maintaining data pipelines, ensuring solutions are built on a strong architectural foundation and follow best practices for efficiency, security, reliability, performance, and cost optimization.

The ideal candidate has 1-2 years of data engineering experience with a focus on data operations, including ETL/ELT, data management, and delivery. Strong communication, analytical skills, and excellent documentation abilities are essential, along with experience engaging with diverse business stakeholders.

Responsibilities

  • BSc in Computer Science, Engineering, or a related quantitative field.

  • Proficiency in modern Python programming Python 3.9 or later

  • Familiarity with version control systems, such as Git.

  • Experience with processing pipeline framework Airflow, or similar frameworks such as Kedro, Luigi or Argo.

  • Proficiency in SQL and experience with both relational and NoSQL databases.

  • Familiarity with cloud data processing and compute services e.g. EC2, S3 in AWS or equivalent in Azure or GCP

  • Experience with data processing libraries like Pandas, NumPy or Dask.

  • Exceptional communication skills and an ability to connect people with different points of view and varying levels of experience.

  • Willingness to learn and develop yourself

The Role

Data Engineer

Requirements

  • Manage multiple data load pipelines to ensure they are operational.

  • Modify existing ETL processes to improve where possible

  • Monitor and troubleshoot data pipeline issues, ensuring timely resolution and minimal disruption to data workflows.

  • Identify, design, and implement internal process improvements: automating manual processes and optimizing data delivery,

  • Run frequent audits, operationalize data quality monitoring, and resolve issues.

  • Help operationalize and automate future data loads, reporting, and analytics jobs.

  • Research new data sources and perform data source analysis.

  • Plan resources necessary for data operations in collaboration with the data Architect and our DevOps team.

  • Collaborate with cross-functional teams to understand business needs and provide data engineering support for various projects.

"Gapstars is committed to a diverse and inclusive workplace. We are an equal-opportunity employer and do not discriminate based on race, national origin, gender, disability, or age. Your personal information collected during the application process is handled following our privacy policy and used exclusively for recruitment and hiring purposes only"


*You may unsubscribe from these communications at any time. For our full Privacy Policy, Click here.

*You may unsubscribe from these communications at any time. For our full Privacy Policy, Click here.

Here to help

Reach out to us, and let’s explore how we can build your dreams with the right people, expertise, and solutions.