XY-932: Add live operator-debug benchmark scoring #186

Merged
yvette-carlisle merged 1 commit into main from y/elf-xy-932 on Jun 11, 2026

@yvette-carlisle

Member

Summary

  • Add Docker-scoped live operator-debug adapter runner for ELF and qmd.
  • Add selected-but-not-narrated operator-debug fixture and scored manifest/report rows.
  • Update benchmark reports to distinguish the narrow ELF/qmd operator-debug slice from broad viewer-product claims.

Validation

  • bash -n scripts/real-world-operator-debug-live-adapters.sh
  • jq empty on updated manifest/report JSON and new fixture
  • git diff --check
  • cargo test -p elf-eval --test real_world_job_benchmark operator_debug_live_adapter_task_is_docker_scoped -- --exact
  • cargo test -p elf-eval --test real_world_job_benchmark --all-features
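The validation steps above could be chained into a single pre-merge helper. This is a minimal sketch, not part of the PR: the function names `check_shell_syntax`, `check_json`, and `run_validation` are hypothetical, and the JSON file paths are taken as arguments because the PR does not name them.

```shell
#!/usr/bin/env bash
# Hypothetical helper functions; not part of this PR.

# Parse a shell script without executing it (reports syntax errors only).
check_shell_syntax() {
  bash -n "$1"
}

# `jq empty` prints nothing and exits non-zero when a file fails to parse.
check_json() {
  local f
  for f in "$@"; do
    jq empty "$f" || return 1
  done
}

# Run the validation steps listed in the PR, stopping at the first failure.
# Arguments: the JSON files to validate (manifest, report, fixture).
run_validation() {
  check_shell_syntax scripts/real-world-operator-debug-live-adapters.sh &&
    check_json "$@" &&
    git diff --check &&
    cargo test -p elf-eval --test real_world_job_benchmark \
      operator_debug_live_adapter_task_is_docker_scoped -- --exact &&
    cargo test -p elf-eval --test real_world_job_benchmark --all-features
}
```

After sourcing the file from the repository root, `run_validation` would be called with the paths of the updated manifest, report, and fixture JSON files. The block defines functions only, so sourcing it runs nothing on its own.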

Decodex Recovery

This PR resumes the partial progress retained from XY-932 attempt 1, which ended when Decodex stopped during review handoff with app_server_idle_timeout. Runtime artifacts were not committed.

yvette-carlisle merged commit a0888d1 into main on Jun 11, 2026
13 checks passed
yvette-carlisle deleted the y/elf-xy-932 branch on June 11, 2026 at 14:04