Skip to content
View frostbitepy's full-sized avatar

Block or report frostbitepy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
frostbitepy/README.md

Hi, I'm Francisco Ruiz 👋

🚀 Data Engineer | Python Developer | SQL Specialist | Cloud & Big Data Enthusiast

I’m a Data Engineer with experience building end‑to‑end data pipelines, automation tools, and data validation systems. I love solving problems with clean architecture, scalable pipelines, and efficient SQL/Python code.


🧩 About Me

  • 🇵🇾 Based in Paraguay
  • 👨‍💻 Data Engineer at Núcleo S.A. (Telecom & Telefonía Internacional)
  • 🔄 Part of the Ingestas Team, responsible for migrating on‑premise PostgreSQL databases to BigQuery
  • 🛠️ Familiar with GCP ecosystem, especially BigQuery, Airflow, Data Fusion
  • 🤖 Strong background in Python automation, SQL optimization, and ETL/ELT processes
  • 📊 Built full validation frameworks, dashboards, and ingestion workflows
  • 🔍 Passionate about data modeling, pipeline reliability, and analytics engineering

🏗️ Featured Work at Núcleo S.A.

🔎 Automated Data Validation System (Owner / Developer)

A completely automated validation pipeline integrated into Airflow’s daily ingestion DAGS. Once each table is migrated from on‑prem PostgreSQL to BigQuery, this validator runs four validation stages:

  1. Row Counts Validation
  2. Numeric Sum Validation
  3. Date Range Validation
  4. Null Checks

📌 Highlights:

  • Parameterized design using metadata tables in BigQuery
  • Fully modeled and implemented from scratch (tables, logic, pipeline)
  • CI/CD workflow with GitLab (branches, dev → prod deployments)
  • Validation results stored in structured BigQuery tables
  • Results visualized in Looker Studio, with dashboards I designed

🔧 Tech Stack & Skills

Languages

  • Python
  • SQL (PostgreSQL, BigQuery SQL)
  • Bash

Cloud & Data

  • Google Cloud Platform (BigQuery, Cloud Composer/Airflow, Data Fusion)
  • CI/CD with GitLab
  • ETL/ELT pipelines
  • Data Modeling
  • Airflow DAG development

Tools

  • Git & GitLab
  • Looker Studio
  • Docker
  • Linux (Arch-based distros, Ubuntu)
  • Neovim / VS Code

🌱 Currently Learning

  • More advanced Data Engineering patterns
  • Spark & distributed processing
  • Cloud architecture practices
  • AWS fundamentals
  • Rust

📫 Contact


Thanks for visiting my profile! Feel free to explore my repositories or reach out if you want to collaborate 🤝

Pinned Loading

  1. cuentas-tecnicas-reaseguros cuentas-tecnicas-reaseguros Public

    Python

  2. Deploy_Datapath_project Deploy_Datapath_project Public

    Python

  3. Project1-ComputerVision Project1-ComputerVision Public

    Jupyter Notebook

  4. RobotControlCumulos RobotControlCumulos Public

    Python

  5. RobotNotasDeExclusion RobotNotasDeExclusion Public

    Python

  6. flask_fm flask_fm Public

    A proyect to work with flask and databases.

    Python 2