Skip to content

XY-840: [ELF benchmark vNext] Define real-world agent memory benchmark contract and scoring#134

Merged
yvette-carlisle merged 1 commit into
mainfrom
y/elf-xy-840
Jun 9, 2026
Merged

XY-840: [ELF benchmark vNext] Define real-world agent memory benchmark contract and scoring#134
yvette-carlisle merged 1 commit into
mainfrom
y/elf-xy-840

Conversation

@yvette-carlisle

Copy link
Copy Markdown
Member

Summary:

  • Adds the normative real_world_job benchmark contract with schema fields, suites, scoring dimensions, typed report states, and claim rules.
  • Adds the operator-facing benchmark overview and links the new future-work contract from the benchmark indexes, README, live baseline guide, and adoption gate report without changing existing verdicts.

Validation:

  • cargo make fmt
  • cargo make lint-fix
  • cargo make checks

@yvette-carlisle yvette-carlisle merged commit ac0aef5 into main Jun 9, 2026
5 checks passed
@yvette-carlisle yvette-carlisle deleted the y/elf-xy-840 branch June 9, 2026 13:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant