[codex] Add reproducible minimal PPO WSL workflow by HC-Seaple · Pull Request #484 · Emerge-Lab/PufferDrive

HC-Seaple · 2026-06-14T02:55:40Z

What changed

Add a self-contained continuous-action PPO trainer with vectorized rollouts, GAE, clipped updates, checkpointing, and deterministic evaluation.
Add Windows/WSL setup and launch scripts for the Linux-native Raylib build.
Add generic WOMD JSON-to-map preparation without committing datasets or generated binaries.
Add native third-person checkpoint visualization and JSON metrics.
Ensure complete renderer frames are written to ffmpeg.
Document clone, setup, map preparation, training, visualization, and handoff.

Validation

Python scripts pass python -m py_compile.
WSL launchers pass bash -n.
The staged change set passes git diff --check.
The end-to-end workflow was previously exercised in WSL with a 10,112-step checkpoint and 92-frame native render.

Current limitation

This is a smoke-test training architecture. Reward shaping still needs route-progress reward, reverse-motion penalties, and stronger collision/off-road costs before scaling.

Add reproducible minimal PPO WSL workflow

96b306a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[codex] Add reproducible minimal PPO WSL workflow#484

[codex] Add reproducible minimal PPO WSL workflow#484
HC-Seaple wants to merge 1 commit into
Emerge-Lab:2.0from
HC-Seaple:codex/minimal-ppo-wsl

HC-Seaple commented Jun 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

HC-Seaple commented Jun 14, 2026

What changed

Validation

Current limitation

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants