Summary
Carry source, decision, and output provenance through the main workflow so downstream agents can audit and cite it.
This issue was generated from an org-wide EvalOps mining pass on 2026-05-10 07:57 UTC. It combines live GitHub repo signals with a per-repo arXiv search. Treat the research links as grounding for a concrete implementation, not as a request for a literature review.
Repo Evidence
- Repository description: A composable code review engine for automated diff analysis
- Tree signals: 6 docs files, 7 workflows, 0 proto files, 8 test-like files.
CLAUDE.md:21 includes latent-spec language: - migrations/ — PostgreSQL migrations (sqlx) - eval/ — Evaluation and benchmarking - examples/ — Usage examples
CLAUDE.md:33 includes latent-spec language: - Wide events for observability (OpenTelemetry-compatible) - Self-hosted first: Ollama/vLLM/LM Studio should be first-class providers
README.md:89 includes latent-spec language: # Evaluate reviewer quality against fixtures diffscope eval --fixtures eval/fixtures --output eval-report.json ```
README.md:125 includes latent-spec language: ### Evaluation Fixtures ```yaml
README.md:145 includes latent-spec language: diffscope eval now reports per-rule precision/recall/F1 (micro and macro), and includes top rule-level TP/FP/FN counts in CLI and JSON output. Starter fixtures live in eval/fixtures/repo_regressions.
README.md:146 includes latent-spec language: diffscope eval now reports per-rule precision/recall/F1 (micro and macro), and includes top rule-level TP/FP/FN counts in CLI and JSON output. Starter fixtures live in eval/fixtures/repo_regressions. Markdown and smart-review reports now include rule-level issue breakdown tables when rule ids are available.
Research Grounding
Repo axes: memory, governance, evaluation, tooling
Search keywords: diffscope, review, model, fixtures, diff, eval, git, ollama, bash, docker, pull, github
- arXiv:2504.08893v1 Knowledge Graph-extended Retrieval Augmented Generation for Question Answering (Jasper Linders, Jakub M. Tomczak), 2025.
- arXiv:2504.05163v2 Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness (Dongzhuoran Zhou, Yuqicheng Zhu, Xiaxia Wang, Yuan He, Jiaoyan Chen, Steffen Staab), 2025.
- arXiv:2511.11017v1 AI Agent-Driven Framework for Automated Product Knowledge Graph Construction in E-Commerce (Dimitar Peshevski, Riste Stojanov, Dimitar Trajanov), 2025.
- arXiv:2502.01113v3 GFM-RAG: Graph Foundation Model for Retrieval Augmented Generation (Linhao Luo, Zicheng Zhao, Gholamreza Haffari, Dinh Phung, Chen Gong, Shirui Pan), 2025.
- arXiv:2502.06864v1 Knowledge Graph-Guided Retrieval Augmented Generation (Xiangrong Zhu, Yuexiang Xie, Yi Liu, Yaliang Li, Wei Hu), 2025.
- arXiv:2506.21556v3 VAT-KG: Knowledge-Intensive Multimodal Knowledge Graph Dataset for Retrieval-Augmented Generation (Hyeongcheol Park, Jiyoung Seo, MinHyuk Jang, Hogun Park, Ha Dam Baek, Gyusam Chang), 2025.
- arXiv:2507.16826v1 A Query-Aware Multi-Path Knowledge Graph Fusion Approach for Enhancing Retrieval-Augmented Generation in Large Language Models (Qikai Wei, Huansheng Ning, Chunlong Han, Jianguo Ding), 2025.
- arXiv:2508.09460v1 Towards Self-cognitive Exploration: Metacognitive Knowledge Graph Retrieval Augmented Generation (Xujie Yuan, Shimin Di, Jielong Tang, Libin Zheng, Jian Yin), 2025.
- arXiv:2512.20626v2 MegaRAG: Multimodal Knowledge Graph-Based Retrieval Augmented Generation (Chi-Hsiang Hsiao, Yi-Cheng Wang, Tzung-Sheng Lin, Yi-Ren Yeh, Chu-Song Chen), 2025.
- arXiv:2405.15436v1 Hybrid Context Retrieval Augmented Generation Pipeline: LLM-Augmented Knowledge Graphs and Vector Database for Accreditation Reporting Assistance (Candace Edwards), 2024.
What To Build
- Add stable identifiers for source records, derived decisions, and emitted outputs.
- Thread those identifiers through logs/events/API responses without leaking secrets.
- Provide a query or debug surface that reconstructs the chain for one completed workflow.
Acceptance Criteria
Notes
- Generated issue 2/5 for
evalops/diffscope by evalops_org_miner.py.
- Before implementation, confirm the sampled latent-spec snippets still match
main; this issue intentionally cites exact file paths/lines where the mining pass saw them.
Summary
Carry source, decision, and output provenance through the main workflow so downstream agents can audit and cite it.
This issue was generated from an org-wide EvalOps mining pass on 2026-05-10 07:57 UTC. It combines live GitHub repo signals with a per-repo arXiv search. Treat the research links as grounding for a concrete implementation, not as a request for a literature review.
Repo Evidence
CLAUDE.md:21includes latent-spec language: -migrations/— PostgreSQL migrations (sqlx) -eval/— Evaluation and benchmarking -examples/— Usage examplesCLAUDE.md:33includes latent-spec language: - Wide events for observability (OpenTelemetry-compatible) - Self-hosted first: Ollama/vLLM/LM Studio should be first-class providersREADME.md:89includes latent-spec language: # Evaluate reviewer quality against fixtures diffscope eval --fixtures eval/fixtures --output eval-report.json ```README.md:125includes latent-spec language: ### Evaluation Fixtures ```yamlREADME.md:145includes latent-spec language:diffscope evalnow reports per-rule precision/recall/F1 (micro and macro), and includes top rule-level TP/FP/FN counts in CLI and JSON output. Starter fixtures live ineval/fixtures/repo_regressions.README.md:146includes latent-spec language:diffscope evalnow reports per-rule precision/recall/F1 (micro and macro), and includes top rule-level TP/FP/FN counts in CLI and JSON output. Starter fixtures live ineval/fixtures/repo_regressions. Markdown and smart-review reports now include rule-level issue breakdown tables when rule ids are available.Research Grounding
Repo axes: memory, governance, evaluation, tooling
Search keywords: diffscope, review, model, fixtures, diff, eval, git, ollama, bash, docker, pull, github
What To Build
Acceptance Criteria
Notes
evalops/diffscopebyevalops_org_miner.py.main; this issue intentionally cites exact file paths/lines where the mining pass saw them.