Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[DON'T MERGE] NGram V2 test
#5401 opened Jun 23, 2025 by wili-65535 Draft
test: [CI] remove closed bugs
#5400 opened Jun 22, 2025 by xinhe-nv Loading…
remove libnuma conan dependency
#5399 opened Jun 22, 2025 by dongxuy04 Loading…
Add sleep function for disagg gen-only benchmarking
#5398 opened Jun 22, 2025 by qiaoxj07 Loading…
feat: Remove not used padding_idx in models
#5385 opened Jun 20, 2025 by HuiGao-NV Loading…
Detokenize option in /v1/completions request Community Engagement help/insights needed from community Community want to contribute PRs initiated from Community
#5382 opened Jun 20, 2025 by Wokzy Loading…
Feat/unify checkpoints loading
#5372 opened Jun 19, 2025 by shaharmor98 Draft
fix: fix bug of qwen3 + eagle3 + finalize_moe_fusion
#5369 opened Jun 19, 2025 by byshiue Loading…
Mxfp4 moe
#5367 opened Jun 19, 2025 by Tracin Loading…
tests: update benchmark test lists
#5365 opened Jun 19, 2025 by xinhe-nv Loading…
Make moe permute and final as custom op
#5358 opened Jun 19, 2025 by limin2021 Loading…
feat(openai protocol):support logitbias
#5354 opened Jun 19, 2025 by xq25478 Loading…
feat(eagle):support qwen in eagle1/2
#5352 opened Jun 19, 2025 by xq25478 Loading…
feat(model):support qwen3 dense in trt flow
#5350 opened Jun 19, 2025 by xq25478 Loading…
ProTip! Adding no:label will show everything without a label.