XY-846: [ELF benchmark suite] Add memory evolution and temporal staleness cases#143

Merged
yvette-carlisle merged 1 commit into main from y/elf-xy-846 on Jun 9, 2026

yvette-carlisle (Member) commented Jun 9, 2026

Summary

  • Adds checked-in real-world memory evolution fixtures covering changed preferences, blocked-to-done issue state, superseded deployment guidance, overturned benchmark verdicts, and temporal relation limitations.
  • Extends the real_world_job runner and report with memory evolution counters, job-level encoding status, follow-up rows, and not_encoded reporting for temporal validity.
  • Documents the new cargo make real-world-memory-evolution benchmark task and aligns suite not_encoded semantics.

Verification

  • cargo make real-world-memory-evolution
  • cargo nextest run -p elf-eval --test real_world_job_benchmark --all-features
  • Semantic drift helper plus manual evidence mapping
  • cargo make fmt
  • cargo make lint-fix
  • cargo make checks

yvette-carlisle merged commit 7df50be into main Jun 9, 2026
10 checks passed
yvette-carlisle deleted the y/elf-xy-846 branch June 9, 2026 15:41