gradient-methods

Star

Here are 2 public repositories matching this topic...

heyiamjj / when-better-gradients-hurt

Star

Completing the 2x2 factorial: HRM's hierarchy with full BPTT reveals gradient-architecture interaction

deep-learning sudoku hierarchical-architecture bptt recursive-reasoning gradient-methods

Updated Jun 16, 2026
Jupyter Notebook

heyiamjj / disentangling-gradients-recursive-reasoning

Star

Disentangling gradient quality from architecture in recursive reasoning. Controlled experiment: 1-step gradient approximation is the sole bottleneck in HRM vs TRM performance gap.

machine-learning pytorch sudoku research-paper recursive-reasoning gradient-methods

Updated Jun 16, 2026
Jupyter Notebook

Improve this page

Add a description, image, and links to the gradient-methods topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gradient-methods topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gradient-methods

Here are 2 public repositories matching this topic...

heyiamjj / when-better-gradients-hurt

heyiamjj / disentangling-gradients-recursive-reasoning

Improve this page

Add this topic to your repo