You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Analysis Period: Last 30 days (May 2026) Total PRs: 1,000 | Merged: 804 (80.6%) | Closed: 193 (19.4%) | Open: 3
Overall merge success rate is strong at 80.6%, consistent with the prior week (81.3%). The most significant differentiator between successful and closed PRs is the WIP label: PRs without [WIP] in the title merge at 82.4%, while WIP-labeled PRs merge at only 46.9%.
Prompt Categories and Success Rates
Category
Total
Merged
Closed
Success Rate
Bug Fix
815
665
148
81.8%
Feature Addition
161
121
39
75.6%
Refactoring
13
10
3
76.9%
Documentation
2
1
1
50.0%
Testing
4
4
0
100.0%
Other
5
3
2
60.0%
Note: Categories are not mutually exclusive — PRs are assigned to the first matching category by priority.
Prompt Analysis
✅ Successful Prompt Patterns
Common characteristics in merged PRs:
Average prompt length: 703 words
Optimal range: 150–500 words (82.8% merge rate — highest of all buckets)
Most common title words: fix, add, workflow, agent, model, safe
Prompts describe the problem clearly and include what changed and why
Example merged prompts:
PR fix: skip unlock job when activation was skipped #34124 (101 words): fix: skip unlock job when activation was skipped — describes the incorrect behavior, root cause, and specific files changed in pkg/workflow/compiler.
PR <empty> #31203: (empty body) — No prompt provided; essentially a blank submission.
Key Insights
WIP label is the strongest failure signal: [WIP] PRs have a 53.1% closure rate vs 17.6% for non-WIP. Agents appear to submit exploratory or incomplete work under WIP, which then gets rejected. If a task isn't ready, hold the PR rather than submitting WIP.
Sweet spot is 150–500 words: This range achieves the highest merge rate (82.8%). Both very short (<50 words, 73.7%) and long-but-not-comprehensive (500–1000 words, 41.7%) perform worse. Concise, well-scoped prompts outperform verbose ones.
Testing prompts have a 100% success rate: All 4 PRs categorized as test-related were merged. Prompts focused on adding test coverage or fixing tests are highly actionable and well-received.
Recommendations
Based on today's analysis:
DO keep prompts in the 150–500 word range — precise descriptions of the problem, change, and rationale without padding.
DO use specific action verbs (fix, add) paired with a technical component name — mirrors the pattern of 82%+ merged PR titles.
DO describe both what changed and why it was wrong — the highest-confidence merged PRs include root cause context.
AVOID submitting PRs with [WIP] in the title — they close at 53% vs 18% for non-WIP. Use draft PRs instead if you need a staging area.
AVOID prompts in the 500–1000 word range without clear structure — this is the worst-performing bucket (41.7%). If a prompt is long, use headers to structure it clearly.
Historical Trends
Date
Total PRs
Merged
Closed
Success Rate
2026-05-15
1,000
797
200
79.9%
2026-05-17
1,000
809
191
80.9%
2026-05-18
1,000
811
185
81.4%
2026-05-19
993
811
182
81.7%
2026-05-24
1,000
810
186
81.3%
2026-05-25
1,000
804
193
80.6%
Trend: Success rate has been stable in the 80–82% range for the past 10 days. Today shows a slight dip (80.6%) compared to the 81.3% peak on May 24, largely due to a small uptick in WIP closures. No concerning downward trend.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
Summary
Analysis Period: Last 30 days (May 2026)
Total PRs: 1,000 | Merged: 804 (80.6%) | Closed: 193 (19.4%) | Open: 3
Overall merge success rate is strong at 80.6%, consistent with the prior week (81.3%). The most significant differentiator between successful and closed PRs is the WIP label: PRs without
[WIP]in the title merge at 82.4%, while WIP-labeled PRs merge at only 46.9%.Prompt Categories and Success Rates
Prompt Analysis
✅ Successful Prompt Patterns
Common characteristics in merged PRs:
fix,add,workflow,agent,model,safeExample merged prompts:
pkg/workflow/compiler.forecastoutput on effective-token predictions by removing yield and episode metrics #34750: Refocusforecastoutput on effective-token predictions — narrows scope clearly, explains what was removed and why.❌ Unsuccessful Prompt Patterns
Common characteristics in closed PRs:
Example closed prompts:
Key Insights
[WIP]PRs have a 53.1% closure rate vs 17.6% for non-WIP. Agents appear to submit exploratory or incomplete work under WIP, which then gets rejected. If a task isn't ready, hold the PR rather than submitting WIP.Recommendations
Based on today's analysis:
fix,add) paired with a technical component name — mirrors the pattern of 82%+ merged PR titles.[WIP]in the title — they close at 53% vs 18% for non-WIP. Use draft PRs instead if you need a staging area.Historical Trends
Trend: Success rate has been stable in the 80–82% range for the past 10 days. Today shows a slight dip (80.6%) compared to the 81.3% peak on May 24, largely due to a small uptick in WIP closures. No concerning downward trend.
References: §26419502431
Beta Was this translation helpful? Give feedback.
All reactions