Skip to content
View martinkilombe's full-sized avatar
πŸ’­
Rebuilding the Roman empire, pipeline by pipeline
πŸ’­
Rebuilding the Roman empire, pipeline by pipeline

Block or report martinkilombe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
martinkilombe/README.md

Martin Muti Kilombe

Lead Data Analyst & Data Engineer

"Rebuilding the Roman empire, pipeline by pipeline."

Profile Views Portfolio LinkedIn Twitter Email


πŸ‘‹ About Me

I'm a Lead Data Analyst and Data Engineer. With a background in Actuarial Science and Finance, I bring a uniquely quantitative lens to data β€” blending statistical rigour with engineering depth.

I work across the full data stack: from designing ETL pipelines that process 10M+ records monthly, to building real-time dashboards, to integrating LLMs and RAG pipelines into production reporting workflows.

  • πŸ—οΈ Lead Data Analyst & Data Engineer β€” dual-hatted across analytics and engineering
  • βš™οΈ Building scalable pipelines with Airflow, dbt, Kafka, Spark & Docker
  • πŸ€– Deploying LLMs in production β€” RAG, LangChain, Google MedGemma, Ollama
  • ☁️ Cloud-native on GCP (BigQuery, Cloud SQL, Cloud Storage) and Databricks
  • πŸ“ Background in Actuarial Science & Finance β€” strong quantitative foundation
  • πŸ“¬ martin@martinkilombe.dev

βš™οΈ Data Engineering Projects

Project What It Does Highlights Stack
πŸ’Ή Financial Data Pipeline Production-grade stock market ingestion system with dual-source data fusion (Polygon.io + Yahoo Finance) 50K+ daily data points Β· sub-minute latency Β· 99.9% uptime Β· NYSE-aware scheduling Β· JSONB metadata Β· Alembic migrations Python 3.12, PostgreSQL, SQLAlchemy 2.0, Loguru, Alembic, GCP Cloud SQL

πŸ“Š Analytics & BI Projects

Project What It Does Highlights Stack
πŸ“Ί Netflix Viewership Dashboard Interactive global viewership analysis with content trend breakdowns Published to Tableau Public Β· drill-down filters Tableau, SQL
πŸ’Ό UK Job Market Analysis Labour market demographics study covering 2011–2014 workforce distribution Regional & demographic segmentation Tableau, SQL
πŸ“ˆ Yahoo Finance Stock Analysis 12-month OHLC analysis of FAANG + Microsoft β€” moving averages, volatility, and correlation MA10/MA20 crossover signals Β· ATR volatility Β· correlation heatmaps Python, Pandas, yfinance, Matplotlib
🏦 Consumer Complaints Analysis Full SQL data cleaning and analytics on 100K+ CFPB financial complaints dataset Schema alteration · advanced window functions · data quality checks PostgreSQL, SQL
πŸ“Š Forbes Global 2022 Analysis Cleaning and analytical deep-dive on Forbes Global 2000 company rankings Revenue/profit segmentation Β· sector analysis SQL, PostgreSQL
πŸ“‰ Metabase Dashboards Self-serve BI dashboards designed for non-technical business stakeholders Embedded analytics Β· custom metrics Metabase, SQL

πŸ€– ML & Applied AI Projects

Project What It Does Highlights Stack
🏦 Loan Predictor ML App End-to-end ML web app predicting loan eligibility β€” fully containerised and deployable Random Forest Β· 80%+ accuracy Β· Django web UI Β· Dockerised with docker-compose Β· PostgreSQL backend Python, Scikit-learn, Django, Docker, PostgreSQL

πŸ› οΈ Tech Stack

Analytics & Visualisation

Python SQL Tableau Power BI Metabase Excel

Data Engineering & Pipelines

Apache Airflow Apache Kafka Apache Spark dbt Docker

Cloud & Databases

GCP BigQuery Databricks PostgreSQL Azure SQL MongoDB

AI & Emerging Tech

LangChain Ollama Git Bash


πŸŽ“ Certifications

Google Microsoft


πŸ“Š GitHub Stats


Open to data engineering roles, freelance collaborations, and interesting data problems.

Let's connect and build something meaningful with data.

Portfolio

Pinned Loading

  1. Loan-Predictor-ML Loan-Predictor-ML Public

    Jupyter Notebook 1

  2. Data-Analyst-Project Data-Analyst-Project Public

    This repo contains a list of Projects for my portfolio

    PLpgSQL 1

  3. Tableau_Projects Tableau_Projects Public

    This Repo contains various Tableau Projects

    1

  4. MetaBase-Projects MetaBase-Projects Public

    This repo contains a list of projects created using the Metabase Visualization Tool

    1

  5. Python-Data-Analysis-Projects Python-Data-Analysis-Projects Public

    This repo contains a list of various data analysis projects carried out in python

    Jupyter Notebook 1