MAAMAR HADDOUCHE FroCode

About ・ Stack ・ Projects ・ Architecture ・ Stats ・ Contact

+7 years of experiance

Data Engineer focused on scalable pipelines, cloud infrastructure, and reliable analytics delivery.

I design and ship production-style data platforms — both batch and streaming — on AWS, with Python, SQL, Airflow, Kafka, and Spark. From raw ingestion to curated marts, I care deeply about modular ETL, orchestration, data quality, and ops-ready deployments (Docker, CI/CD, observability).

focus:       [ELT, Streaming, Cloud Data Platforms]
stack:       [Python, SQL, AWS, Airflow, Kafka, Spark, dbt, Docker]

What I bring

ETL & Orchestration — Airflow DAGs, idempotent staged loads
AWS Data Stack — S3, Glue, Lambda, Athena, Redshift Spectrum
Real-Time Systems — Kafka, Spark Streaming, end-to-end pipelines
Engineering Discipline — Docker, Terraform, CI/CD, Prometheus + Grafana

How I work

Modular by default — small, testable, composable units
Observable — metrics, structured logs, alerts, lineage
Cost-aware — partitioning, file formats, right-sized compute
Reproducible — IaC, versioned configs, containerized runs

Tech Stack

Languages

Data Engineering

Cloud & DevOps

Databases & Storage

ML & Monitoring

Featured Projects

AWS-ETL

Cloud-native ETL pipelines on AWS — ingestion, transformation, and curated outputs.

Python AWS S3 Glue

news-Data-Pipeline_Airflow_AWS

Orchestrated news ingestion → enrichment → analytics with Airflow on AWS.

Airflow Python AWS

Real_Streaming_Kafka

Real-time event streaming with Kafka — producers, consumers, and downstream sinks.

Kafka Python Streaming

City_End_to_End_RealTime

End-to-end city data platform: ingest → stream → process → visualize.

Python Kafka Real-time E2E

Reddit_ETL

API → warehouse pipeline pulling Reddit data into structured analytics layers.

Python API Warehouse

TECHNICAL-TASK-YASA-1-LLC

Notebook-driven analysis & data engineering technical task.

Jupyter Python Analysis

How I Build

flowchart LR
    A[Sources<br/>APIs · Files · DBs · Streams] -->|Ingest| B[Raw / Bronze<br/>S3 · Kafka]
    B -->|Clean & Conform| C[Staged / Silver<br/>Spark · dbt]
    C -->|Model & Curate| D[Marts / Gold<br/>Snowflake · Redshift]
    D --> E[Consumers<br/>BI · ML · Apps]

    F[Airflow] -.orchestrates.-> B
    F -.orchestrates.-> C
    F -.orchestrates.-> D

    G[Observability<br/>Prometheus · Grafana · Logs] -.monitors.-> B
    G -.monitors.-> C
    G -.monitors.-> D

    style A fill:#1f6feb,stroke:#58a6ff,color:#fff
    style B fill:#cd7f32,stroke:#58a6ff,color:#fff
    style C fill:#c0c0c0,stroke:#58a6ff,color:#000
    style D fill:#ffd700,stroke:#58a6ff,color:#000
    style E fill:#238636,stroke:#58a6ff,color:#fff
    style F fill:#017CEE,stroke:#58a6ff,color:#fff
    style G fill:#E6522C,stroke:#58a6ff,color:#fff

GitHub Analytics

Let's Connect

310+ contributions last year · actively shipping pipeline & platform work

Open to Data Engineering, Streaming, and Cloud Pipeline opportunities.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly