Skip to content
View FroCode's full-sized avatar
🦾
🦾
  • Capgemini
  • Poland

Block or report FroCode

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
FroCode/README.md

+7 years of experiance

Data Engineer focused on scalable pipelines, cloud infrastructure, and reliable analytics delivery.

I design and ship production-style data platforms — both batch and streaming — on AWS, with Python, SQL, Airflow, Kafka, and Spark. From raw ingestion to curated marts, I care deeply about modular ETL, orchestration, data quality, and ops-ready deployments (Docker, CI/CD, observability).

focus:       [ELT, Streaming, Cloud Data Platforms]
stack:       [Python, SQL, AWS, Airflow, Kafka, Spark, dbt, Docker]

What I bring

  • ETL & Orchestration — Airflow DAGs, idempotent staged loads
  • AWS Data Stack — S3, Glue, Lambda, Athena, Redshift Spectrum
  • Real-Time Systems — Kafka, Spark Streaming, end-to-end pipelines
  • Engineering Discipline — Docker, Terraform, CI/CD, Prometheus + Grafana

How I work

  • Modular by default — small, testable, composable units
  • Observable — metrics, structured logs, alerts, lineage
  • Cost-aware — partitioning, file formats, right-sized compute
  • Reproducible — IaC, versioned configs, containerized runs

Tech Stack

Languages

Python SQL Bash JavaScript C++

Data Engineering

Apache Spark Apache Airflow Kafka dbt Snowflake Delta Lake Pandas

Cloud & DevOps

AWS Azure GCP Docker Kubernetes Terraform Jenkins GitHub Actions

Databases & Storage

PostgreSQL MySQL Redis MongoDB Amazon S3

ML & Monitoring

MLflow TensorFlow PyTorch Prometheus Grafana


Featured Projects

Cloud-native ETL pipelines on AWS — ingestion, transformation, and curated outputs.

Python AWS S3 Glue

Orchestrated news ingestion → enrichment → analytics with Airflow on AWS.

Airflow Python AWS

Real-time event streaming with Kafka — producers, consumers, and downstream sinks.

Kafka Python Streaming

End-to-end city data platform: ingest → stream → process → visualize.

Python Kafka Real-time E2E

API → warehouse pipeline pulling Reddit data into structured analytics layers.

Python API Warehouse

Notebook-driven analysis & data engineering technical task.

Jupyter Python Analysis

See more


How I Build

flowchart LR
    A[Sources<br/>APIs · Files · DBs · Streams] -->|Ingest| B[Raw / Bronze<br/>S3 · Kafka]
    B -->|Clean & Conform| C[Staged / Silver<br/>Spark · dbt]
    C -->|Model & Curate| D[Marts / Gold<br/>Snowflake · Redshift]
    D --> E[Consumers<br/>BI · ML · Apps]

    F[Airflow] -.orchestrates.-> B
    F -.orchestrates.-> C
    F -.orchestrates.-> D

    G[Observability<br/>Prometheus · Grafana · Logs] -.monitors.-> B
    G -.monitors.-> C
    G -.monitors.-> D

    style A fill:#1f6feb,stroke:#58a6ff,color:#fff
    style B fill:#cd7f32,stroke:#58a6ff,color:#fff
    style C fill:#c0c0c0,stroke:#58a6ff,color:#000
    style D fill:#ffd700,stroke:#58a6ff,color:#000
    style E fill:#238636,stroke:#58a6ff,color:#fff
    style F fill:#017CEE,stroke:#58a6ff,color:#fff
    style G fill:#E6522C,stroke:#58a6ff,color:#fff
Loading

GitHub Analytics


Let's Connect

310+ contributions last year · actively shipping pipeline & platform work


Email LinkedIn GitHub


Open to Data Engineering, Streaming, and Cloud Pipeline opportunities.


footer

Pinned Loading

  1. TECHNICAL-TASK-YASA-1-LLC TECHNICAL-TASK-YASA-1-LLC Public

    Jupyter Notebook 1

  2. AWS-ETL AWS-ETL Public

    Python 1

  3. news-Data-Pipeline_Airflow_AWS news-Data-Pipeline_Airflow_AWS Public

    Python 1

  4. City_End_to_End_RealTime City_End_to_End_RealTime Public

    Python 2

  5. Real_Streaming_Kafka Real_Streaming_Kafka Public

    Python 1

  6. Reddit_ETL Reddit_ETL Public

    Python 2