Antoine Gélin

Hello, I'm Antoine Gélin


Data Engineer with 5 years of experience building ingestion pipelines, data warehouses, and BI layers. Java/Spark background on high-volume systems, now focused on the Modern Data Stack (dbt, Redshift, AWS).

/ Expertise

The domains I operate in

Data Pipelines

Design and production deployment of ingestion and transformation pipelines. Modern ELT approach with built-in orchestration, testing, and monitoring.

Python, Spark, dbt, SQL

Data Warehouse Architecture

Modeling and structuring analytics-optimized data warehouses: star schemas, staging/mart layers, naming conventions.

Redshift, BigQuery, dbt, PostgreSQL

Cloud Infrastructure

Deployment and management of data infrastructure on AWS and GCP: storage, compute, orchestration — all versioned and reproducible.

AWS, GCP, Terraform, Docker, CI/CD

BI & Visualization

Self-service BI layers: business dashboards, automated KPIs, autonomous data access for non-technical teams.

QuickSight, SQL, dbt metrics

Data Quality & Observability

Data tests, alerts, and monitoring integrated into pipelines to ensure reliability and catch anomalies before they impact downstream use.

dbt tests, Alerting, Data contracts

Enablement & Documentation

Knowledge transfer, technical documentation, and training for business or engineering teams to build autonomy on data tooling.

Documentation, Training, Handover

/ Featured Projects

A selection of professional work, published apps, and experimental prototypes.

Personal project

Velib Data Platform

Serverless ELT platform on Google Cloud for Vélib' Métropole bike-sharing data: high-frequency ingestion (~200M observations/year), BigQuery columnar storage, dbt analytical transformations, Terraform-managed infrastructure.

Python, dbt, BigQuery, Terraform, Cloud Run, Docker
Mission

Modern Data Stack — Fintech

Business teams autonomous on their data, faster time-to-insight, and a platform the next team can pick up without handholding.

Python, dbt, Redshift, S3, QuickSight, Fargate
Mission

Critical Pipelines — Banking

Stable processing on high volumes, fewer production incidents, and engineering practices that hold up over time.

Java, Spark, Hive, SQL Server, Jenkins

/ Tech Stack

Breakdown of my technical skills, categorized by domain and proficiency

Proficiency levels: Expert, Proficient, Beginner

Data Engineering

dbt
Apache Spark
Hadoop/Hive

Databases

BigQuery
Redshift
PostgreSQL
SQL Server

Cloud & DevOps

AWS
GCP
S3
GCS
Docker
Terraform
Git
Linux
Jenkins

Languages

Python
Java
SQL
Bash

/ Contact Me

Feel free to reach out. I'm always open to discussing new projects and opportunities.