Pull requests: NVIDIA/TensorRT-LLM
test: add support for DGX B200 with 8 GPUs in L0 tests
#5397
opened Jun 21, 2025 by
yizhang-nv
[1/N][TRTLLM-5195][feat] Share PyTorch tensor between processes
#5396
opened Jun 21, 2025 by
chang-l
[TRTLLM-6019] feat: Remove cutlass min latency code from AutoTuner.
#5394
opened Jun 21, 2025 by
hyukn
Detokenize option in /v1/completions request
Labels: Community Engagement (help/insights needed from community); Community want to contribute (PRs initiated from Community)
#5382
opened Jun 20, 2025 by
Wokzy
Fix: missing clientId when serialize and deserialize response (cherry-pick #5231)
#5378
opened Jun 20, 2025 by
kaiyux
[TRTLLM-5831][feat] Add LoRA support for pytorch backend in trtllm-serve
#5376
opened Jun 19, 2025 by
talorabr
Fix permission for local user issues in NGC docker container.
#5373
opened Jun 19, 2025 by
MartinMarciniszyn
[TRTLLM-5838][fix] fix max batch size and max tokens in kv cache estimations for Nemotron-H
#5371
opened Jun 19, 2025 by
tomeras91