Pull requests: NVIDIA/TensorRT-LLM

Pull requests list

[None][chore] update model list
#11364 opened Feb 8, 2026 by tcherckez-nvidia
1 task done
[TRTC-264][doc] Add CLAUDE.md and AGENTS.md
#11358 opened Feb 7, 2026 by venkywonka
1 task done
[Draft] AutoDeploy GLM4.7 flash bundle
#11356 opened Feb 6, 2026 by bmarimuthu-nv
1 task done
Remove mem est 2
#11350 opened Feb 6, 2026 by HuiGao-NV Draft
1 task
[TRTLLM-10030][perf] avoid syncs in beam search + other improvements
#11349 opened Feb 6, 2026 by ixlmar
1 task done
[TRTLLM-9904][feat] KVCache V2 MTP support
#11346 opened Feb 6, 2026 by liji-nv
1 task done
[None][feat] Optimize mamba2 _chunk_scan_fwd_kernel
#11345 opened Feb 6, 2026 by JadoTu
1 task done
[None][feat] Optimize the q3n decode kernel with IO read
#11344 opened Feb 6, 2026 by JadoTu
1 task done
Sg/ll/glm4 7 flash rebased
#11338 opened Feb 6, 2026 by suyoggupta Draft
1 task
[TRTLLM-10866][feat] implement disaggregated harmony chat
#11336 opened Feb 6, 2026 by reasonsolo
1 task done
[None][chore] Reduce attention module repeated warnings.
#11335 opened Feb 6, 2026 by yuxianq
1 task done