Machine Learning Visualized

Machine Learning Visualized is an interactive curriculum for machine learning, deep learning, language models, retrieval, diffusion, reinforcement learning, and the math behind them.

The project started as a collection of standalone animations. It is now centered on a unified React app with guided paths, lesson metadata, quizzes, labs, glossary links, and local progress tracking.

Open the live site

What is inside

A unified lesson browser with searchable topics and curriculum tracks.
Guided paths for fundamentals, experimentation and causal ML, LLMs, frontier LLMs and agentic systems, RAG, model reliability, vision and diffusion, and reinforcement learning.
Core ML lessons for splitting data, cross-validation, leakage, scaling, metrics, calibration, PCA, clustering, tree ensembles, and classical classifiers.
Model reliability lessons for debugging, interpretability, monitoring, fairness, and uncertainty estimation.
Experimentation and causal ML lessons for A/B testing foundations and power analysis, with planned modules for sequential testing, CUPED, confounding, DAGs, treatment effects, and propensity scores.
Transformer lessons for attention, masks, architecture families, training objectives, token generation, sampling, KV cache, Flash Attention, and fine-tuning.
Frontier LLM lessons for MoE at scale, MLA, RLVR/GRPO, test-time compute, long-context systems, omni multimodal models, diffusion language models, efficient serving, frontier evaluation/safety, tool-using reasoners, and agentic coding systems.
RAG lessons for chunking, vector indexing, reranking, grounding, retrieval evaluation, and failure modes.
Neural-network lessons for backpropagation, initialization, optimizers, dropout, batch normalization, and training-loop dynamics.
Diffusion lessons from beginner denoising intuition through sampling, classifier-free guidance, U-Net vs DiT, SD3, DiT, VAE, CLIP, T5, and flow matching.
Small from-scratch implementations in Rust, Go, Java, and Python for neural networks, diffusion, and Markov chains.

Current App

The unified app is in unified-app/.

cd unified-app
npm install
npm run dev

Build and test:

cd unified-app
npm test
npm run audit:quality
npm run test:smoke
npm run build

The app uses React, Vite, Tailwind CSS, Three.js, GSAP, and Recharts.

Screenshots

Core ML Lesson

LLM Generation Lesson

Frontier LLM Architecture

Reasoning RLVR / GRPO

Efficient LLM Serving

Frontier Evaluation and Safety

Diffusion Basics Lesson

Curriculum Areas

Foundations

The foundations track covers linear algebra, probability, statistics, optimization, and the core supervised-learning workflow. Lessons include matrix multiplication, linear regression, train/validation/test splits, gradient descent, PCA, k-means, overfitting, regularization, calibration, ROC and precision-recall curves, and bias-variance tradeoffs.

Natural Language Processing and Transformers

The NLP and transformer track starts with bag-of-words, tokenization, and embeddings, then moves into attention, self-attention, masks, positional encoding, RoPE, transformer architectures, LLM training objectives, token generation, sampling, KV cache, Flash Attention, and fine-tuning.

Frontier LLMs and Agentic Systems

The frontier path covers modern architecture and systems topics: dense vs MoE models, MLA and attention compression, reasoning models, RLVR/GRPO, test-time compute, tool-using reasoning, agentic coding, long-context systems, omni multimodal models, diffusion language models, efficient LLM serving, and frontier evaluation/safety.

RAG

The retrieval track covers the RAG pipeline as a system: chunking, embedding search, vector indexing, reranking, context packing, grounding, retrieval metrics, and failure modes.

Model Reliability

The model reliability track covers post-training and deployed-system concerns: debugging failures, interpreting model behavior, estimating uncertainty, monitoring drift and regressions, and evaluating fairness tradeoffs across slices and groups.

Experimentation and Causal ML

The experimentation track connects hypothesis testing, confidence intervals, metrics, calibration, leakage, fairness, monitoring, and uncertainty to causal decision-making. Active lessons now cover A/B testing foundations, power and sample size, sequential testing and peeking, CUPED variance reduction, confounding and Simpson's paradox, causal graphs and DAGs, treatment effects, and propensity scores.

The next-priority applied ML pillars are also active as overview lessons: time series and forecasting, recommender systems and ranking, ML security and robustness, efficient inference and compression, and data engineering for ML.

Vision and Diffusion

The diffusion track starts with basic denoising and sampling before moving into classifier-free guidance, U-Net vs DiT, latent VAEs, CLIP, T5, SD3, DiT, joint attention, and flow matching.

Reinforcement Learning

The RL track covers agents, rewards, discounted returns, MDPs, value iteration, policy iteration, Q-learning, exploration, policy gradients, actor-critic methods, and reward shaping.

Standalone Implementations

The repository also includes compact implementations meant for reading and experimentation:

mini-nn/, mini-nn-go/, mini-nn-java/, mini-nn-python/
mini-diffusion/, mini-diffusion-go/, mini-diffusion-java/, mini-diffusion-python/
mini-markov/, mini-markov-go/, mini-markov-java/, mini-markov-python/

Each directory has its own README with setup notes and examples.

Publishing

GitHub Pages is published manually from this machine. The deploy script builds the unified app and pushes the generated site to the gh-pages branch.

node scripts/deploy-github-pages.mjs

The script also publishes static *-animation/index.html entry pages with route-specific metadata, so older animation URLs and crawlers land on the current unified lessons.

Repository Layout

unified-app/                 Unified React app
screenshots/readme/          Current README screenshots
scripts/                     Local maintenance and deploy scripts
*-animation/                 Static lesson entry pages and legacy standalone lessons
mini-nn*/                    Small neural-network implementations
mini-diffusion*/             Small diffusion implementations
mini-markov*/                Small Markov-chain implementations

License

MIT. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 208 Commits
.agents/skills		.agents/skills
.history/tokenization-animation		.history/tokenization-animation
.vite/deps		.vite/deps
ab-testing-foundations-animation		ab-testing-foundations-animation
actor-critic-animation		actor-critic-animation
agentic-coding-systems-animation		agentic-coding-systems-animation
attention-masks-animation		attention-masks-animation
attention-mechanism-animation		attention-mechanism-animation
bag-of-words-animation		bag-of-words-animation
bayes-rule-ml-animation		bayes-rule-ml-animation
bert-animation		bert-animation
bias-variance-tradeoff-animation		bias-variance-tradeoff-animation
blog		blog
bloom-filter-animation		bloom-filter-animation
calibration-animation		calibration-animation
causal-graphs-dags-animation		causal-graphs-dags-animation
change-of-basis-animation		change-of-basis-animation
classification-metrics-animation		classification-metrics-animation
classifier-free-guidance-animation		classifier-free-guidance-animation
clip-encoder-animation		clip-encoder-animation
clip-text-encoder-animation		clip-text-encoder-animation
computation-graph-backprop-animation		computation-graph-backprop-animation
condition-number-animation		condition-number-animation
conditional-probability-animation		conditional-probability-animation
confounding-simpsons-paradox-animation		confounding-simpsons-paradox-animation
conv-relu-animation		conv-relu-animation
conv2d-animation		conv2d-animation
cosine-similarity-animation		cosine-similarity-animation
cross-entropy-animation		cross-entropy-animation
cross-validation-animation		cross-validation-animation
cuped-variance-reduction-animation		cuped-variance-reduction-animation
data-engineering-for-ml-track-animation		data-engineering-for-ml-track-animation
data-leakage-deep-dive-animation		data-leakage-deep-dive-animation
determinant-volume-animation		determinant-volume-animation
diffusion-basics-animation		diffusion-basics-animation
diffusion-language-models-animation		diffusion-language-models-animation
diffusion-sampling-animation		diffusion-sampling-animation
diffusion-tokenizer-animation		diffusion-tokenizer-animation
diffusion-vae-animation		diffusion-vae-animation
dit-animation		dit-animation
dit-transformer-animation		dit-transformer-animation
dropout-batchnorm-animation		dropout-batchnorm-animation
efficient-inference-compression-track-animation		efficient-inference-compression-track-animation
efficient-llm-serving-animation		efficient-llm-serving-animation
eigenvalue-animation		eigenvalue-animation
embeddings-animation		embeddings-animation
entropy-animation		entropy-animation
expected-value-variance-animation		expected-value-variance-animation
fasttext-animation		fasttext-animation
feature-scaling-preprocessing-animation		feature-scaling-preprocessing-animation
fine-tuning-animation		fine-tuning-animation
flash-attention-animation		flash-attention-animation
flow-matching-animation		flow-matching-animation
frontier-evaluation-safety-animation		frontier-evaluation-safety-animation
frontier-llm-architecture-overview-animation		frontier-llm-architecture-overview-animation
frontier-moe-systems-animation		frontier-moe-systems-animation
fundamental-subspaces-animation		fundamental-subspaces-animation
glove-animation		glove-animation
gpt2-comprehensive-animation		gpt2-comprehensive-animation
gradient-descent-animation		gradient-descent-animation
gradient-problems-animation		gradient-problems-animation
grouped-query-attention-animation		grouped-query-attention-animation
hypothesis-testing-intuition-animation		hypothesis-testing-intuition-animation
initialization-animation		initialization-animation
joint-attention-animation		joint-attention-animation
k-means-animation		k-means-animation
knn-naive-bayes-svm-animation		knn-naive-bayes-svm-animation
kv-cache-animation		kv-cache-animation
layer-normalization-animation		layer-normalization-animation
leaky-relu-animation		leaky-relu-animation
least-squares-projection-animation		least-squares-projection-animation
linear-regression-animation		linear-regression-animation
llm-training-objectives-animation		llm-training-objectives-animation
logistic-regression-animation		logistic-regression-animation
long-context-frontier-models-animation		long-context-frontier-models-animation
loss-functions-likelihoods-animation		loss-functions-likelihoods-animation
low-rank-approximation-animation		low-rank-approximation-animation
lstm-animation		lstm-animation
markov-chains-animation		markov-chains-animation
matrix-decompositions-animation		matrix-decompositions-animation
matrix-multiplication-animation		matrix-multiplication-animation
max-pooling-animation		max-pooling-animation
maximum-likelihood-estimation-animation		maximum-likelihood-estimation-animation
mdp-formalism-animation		mdp-formalism-animation
mini-diffusion-go		mini-diffusion-go
mini-diffusion-java		mini-diffusion-java
mini-diffusion-python		mini-diffusion-python
mini-diffusion		mini-diffusion
mini-markov-go		mini-markov-go
mini-markov-java		mini-markov-java
mini-markov-python		mini-markov-python
mini-markov		mini-markov
mini-nn-go		mini-nn-go
mini-nn-java		mini-nn-java
mini-nn-python		mini-nn-python
mini-nn		mini-nn
ml-security-robustness-track-animation		ml-security-robustness-track-animation
model-debugging-animation		model-debugging-animation
model-fairness-animation		model-fairness-animation
model-interpretability-animation		model-interpretability-animation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine Learning Visualized

What is inside

Current App

Screenshots

Core ML Lesson

LLM Generation Lesson

Frontier LLM Architecture

Reasoning RLVR / GRPO

Efficient LLM Serving

Frontier Evaluation and Safety

Diffusion Basics Lesson

Curriculum Areas

Foundations

Natural Language Processing and Transformers

Frontier LLMs and Agentic Systems

RAG

Model Reliability

Experimentation and Causal ML

Vision and Diffusion

Reinforcement Learning

Standalone Implementations

Publishing

Repository Layout

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Machine Learning Visualized

What is inside

Current App

Screenshots

Core ML Lesson

LLM Generation Lesson

Frontier LLM Architecture

Reasoning RLVR / GRPO

Efficient LLM Serving

Frontier Evaluation and Safety

Diffusion Basics Lesson

Curriculum Areas

Foundations

Natural Language Processing and Transformers

Frontier LLMs and Agentic Systems

RAG

Model Reliability

Experimentation and Causal ML

Vision and Diffusion

Reinforcement Learning

Standalone Implementations

Publishing

Repository Layout

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages