Pull requests: InfiniTensor/InfiniCore
Pull requests list
issue/1180 refactor(nn): decouple RoPE scaling logic with polymorphic interfaces
#1181
opened May 28, 2026 by
rubik-hua
Issue/1177: Add awq_marlin_gemm and gptq_marlin_gemm operators on NVIDIA machines
#1178
opened May 27, 2026 by
xgqdut2016
Collaborator
performance: remove conv algo choosing
#1176
opened May 27, 2026 by
PanZezhong1725
Collaborator
feat: implement mha op with flash_attn::mha_fwd
#1174
opened May 26, 2026 by
PanZezhong1725
Collaborator
issue/1167 - feat: add flash-attn via MooreThreads/mate for moore gpu
#1168
opened May 21, 2026 by
spike-zhu
Contributor
issue/1148: Add KV cache contiguity guard to PagedAttentionPrefill
#1149
opened Apr 30, 2026 by
JoeZhang-0x000
issue/1113: Add awq_marlin_gemm operator on NVIDIA machines
#1137
opened Apr 21, 2026 by
xgqdut2016
Collaborator
issue/1083: Add gptq_marlin_gemm operator on NVIDIA machines
#1110
opened Mar 31, 2026 by
xgqdut2016
Collaborator
issue/1100 Add round and cosh operator integration
#1101
opened Mar 20, 2026 by
GordonYang1
Add native NVIDIA backend for flash attention
#1060
opened Mar 6, 2026 by
gongchensu
Collaborator