"Rebuilding the Roman empire, pipeline by pipeline."
I'm a Lead Data Analyst and Data Engineer. With a background in Actuarial Science and Finance, I bring a uniquely quantitative lens to data β blending statistical rigour with engineering depth.
I work across the full data stack: from designing ETL pipelines that process 10M+ records monthly, to building real-time dashboards, to integrating LLMs and RAG pipelines into production reporting workflows.
- ποΈ Lead Data Analyst & Data Engineer β dual-hatted across analytics and engineering
- βοΈ Building scalable pipelines with Airflow, dbt, Kafka, Spark & Docker
- π€ Deploying LLMs in production β RAG, LangChain, Google MedGemma, Ollama
- βοΈ Cloud-native on GCP (BigQuery, Cloud SQL, Cloud Storage) and Databricks
- π Background in Actuarial Science & Finance β strong quantitative foundation
- π¬ martin@martinkilombe.dev
| Project | What It Does | Highlights | Stack |
|---|---|---|---|
| πΉ Financial Data Pipeline | Production-grade stock market ingestion system with dual-source data fusion (Polygon.io + Yahoo Finance) | 50K+ daily data points Β· sub-minute latency Β· 99.9% uptime Β· NYSE-aware scheduling Β· JSONB metadata Β· Alembic migrations | Python 3.12, PostgreSQL, SQLAlchemy 2.0, Loguru, Alembic, GCP Cloud SQL |
| Project | What It Does | Highlights | Stack |
|---|---|---|---|
| πΊ Netflix Viewership Dashboard | Interactive global viewership analysis with content trend breakdowns | Published to Tableau Public Β· drill-down filters | Tableau, SQL |
| πΌ UK Job Market Analysis | Labour market demographics study covering 2011β2014 workforce distribution | Regional & demographic segmentation | Tableau, SQL |
| π Yahoo Finance Stock Analysis | 12-month OHLC analysis of FAANG + Microsoft β moving averages, volatility, and correlation | MA10/MA20 crossover signals Β· ATR volatility Β· correlation heatmaps | Python, Pandas, yfinance, Matplotlib |
| π¦ Consumer Complaints Analysis | Full SQL data cleaning and analytics on 100K+ CFPB financial complaints dataset | Schema alteration Β· advanced window functions Β· data quality checks | PostgreSQL, SQL |
| π Forbes Global 2022 Analysis | Cleaning and analytical deep-dive on Forbes Global 2000 company rankings | Revenue/profit segmentation Β· sector analysis | SQL, PostgreSQL |
| π Metabase Dashboards | Self-serve BI dashboards designed for non-technical business stakeholders | Embedded analytics Β· custom metrics | Metabase, SQL |
| Project | What It Does | Highlights | Stack |
|---|---|---|---|
| π¦ Loan Predictor ML App | End-to-end ML web app predicting loan eligibility β fully containerised and deployable | Random Forest Β· 80%+ accuracy Β· Django web UI Β· Dockerised with docker-compose Β· PostgreSQL backend | Python, Scikit-learn, Django, Docker, PostgreSQL |
Analytics & Visualisation
Data Engineering & Pipelines
Cloud & Databases
AI & Emerging Tech


