Skip to content
View QuantumDrizzy's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report QuantumDrizzy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
QuantumDrizzy/README.md

Antonio Zambudio

Performance is a function of what you control. So I control all of it.

I build bare-metal, high-performance systems — the fast, low-level code under hard problems. CUDA, Rust and C/C++ where latency and control decide the outcome; Python where it pays. Solo, end to end, on my own hardware (RTX 5060 Ti / Blackwell sm_120, Arch Linux) — with benchmarks anyone can re-run.

I'm a performance/systems engineer first. The domains below — quantum, neuroscience, materials, energy — are just where I point the same skill. The through-line: physics computes by minimizing energy; I write the systems that exploit it, at the metal.

Stack: CUDA (hand-written kernels, sm_120 roofline) · Rust (control loops, systems) · C / C++17 (compile-time guarantees) · Python (ML, glue, analysis). No cloud, by choice.


Flagship — SUBSTRATE: can I make the metal go fast — and prove it?

A multi-physics / quantum-biology simulation engine. The physics is the hard problem; the point is the engine underneath: hand-written CUDA (sm_120) with an honest, kernel-only roofline — 3×→154× vs JAX-CPU, end-to-end break-even stated, not hidden — plus tensor-network solvers for many-body systems. Start here if you want to know whether I can write fast kernels and back the numbers.


Selected systems — each proves one thing

BLACKWALL · ICEPICK · FLATLINE — I reverse-engineer the silicon I run on. A three-part Blackwell (sm_120) teardown, hand-written CUDA: the compute roofline (BLACKWALL), the microarchitecture beneath it — instruction latencies, caches, the SASS the compiler actually emits (ICEPICK), and the energy/thermal wall (FLATLINE). The metal, measured directly — compute · communication · energy.

AETHER — I implement hard physics correctly. Computational-materials lab: electronic structure, the full topological set (SSH, Haldane, Kane–Mele), metamaterials, GPU-accelerated solvers, inverse design. ~90 tests; every claim checked against a closed form. Correctness isn't optional.

KHAOS — I build real-time systems where safety is enforced, not hoped for. Closed-loop BCI kernel: a CUDA DSP hot-path, stimulation limits guaranteed by the C++ compiler (static_assert), three independent safety layers, post-quantum audit ledger. Sub-100 µs is the design target — marked unverified until benchmarked end-to-end.

HELIOS — I build control loops that can't go down. 24/7 predictive DC-microgrid controller. Rust MPPT loop (100 ms tick), CNN-LSTM forecasting, post-quantum trust anchors. Where the lights actually have to stay on.

DRIFT — I see the structure under the problem. Optimization, self-assembly and neural memory (Hopfield) read as ground states of one Ising Hamiltonian — the unification thesis, made measurable and benchmarked against the Landauer floor of computation.

Blaze — I compress massive scientific & quantum data on the GPU. Tensor-Train / MPS compression, GPU SVD, MPS↔circuit bridge. The specialist tool the rest of the ecosystem leans on.


More in the repos — same spine, other proving grounds. Everything is mine, on my own hardware, reproducible.

Pinned Loading

  1. SUBSTRATE SUBSTRATE Public

    Multi-scale electromagnetic computational framework — lattice gauge theory, quantum biology and geomagnetic risk on one GPU. Hand-written CUDA (sm_120) + tensor networks, no cloud.

    Python 1

  2. BLACKWALL BLACKWALL Public

    Honest precision-spectrum GEMM roofline on NVIDIA Blackwell (sm_120) — FP32 → FP4, measured. FP4 (nvfp4) at 20× FP32 via cuBLASLt, anchored to the computed peak. No inflated numbers.

    Cuda

  3. ICEPICK ICEPICK Public

    Reverse-engineering the Blackwell (sm_120) microarchitecture — instruction latencies, memory hierarchy and tensor cores, by microbenchmarking the silicon and reading the SASS the compiler emits.

    Cuda

  4. TESSERA TESSERA Public

    Neural-guided real quantum annealing via tensor networks — a GNN learns the schedule, an MPS engine simulates the transverse-field Ising adiabatic process on your own GPU. Local, sovereign, honest …

    C++

  5. HELIOS HELIOS Public

    24/7 predictive DC-microgrid controller — Rust MPPT loop (100ms tick) + CNN-LSTM irradiance predictor, post-quantum trust anchors, egui dashboard. Bare-metal, no cloud.

    Rust 1

  6. KHAOS KHAOS Public

    Closed-loop BCI kernel with neuroethics enforced at compile time, a sub-100µs CUDA DSP pipeline, and post-quantum sovereignty. Bare-metal, zero-cloud.

    Python