-
Notifications
You must be signed in to change notification settings - Fork 14k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
vulkan: Fix data race/hang in scalar/cm1 flash attention
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17887
opened Dec 9, 2025 by
jeffbolznv
Loading…
docs: clarify that CPU support should be first
#17886
opened Dec 9, 2025 by
JohannesGaessler
Loading…
ggml-alloc : fix reuse-parent logic for misaligned sizes
ggml
changes relating to the ggml tensor library for machine learning
#17884
opened Dec 9, 2025 by
ggerganov
Loading…
HIP: enable mmf for RDNA3
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17879
opened Dec 9, 2025 by
zhang-hui-yulo
Loading…
metal: SSM kernel improvements
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#17876
opened Dec 8, 2025 by
gabe-l-hart
Loading…
Vulkan: Improve mul_mat_vec_iq1_s speed
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17874
opened Dec 8, 2025 by
lovedheart
Loading…
Add DIAG for CUDA
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#17873
opened Dec 8, 2025 by
pwilkin
Loading…
vulkan: Allow non-pow2 n_experts in topk_moe
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#17872
opened Dec 8, 2025 by
jeffbolznv
Loading…
metal: use shared buffers on eGPU
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#17866
opened Dec 8, 2025 by
jdemeule
Loading…
Server: router per model config
examples
server
#17859
opened Dec 8, 2025 by
ServeurpersoCom
Loading…
examples: fix memory leak for simple example
examples
#17854
opened Dec 8, 2025 by
lizhenneng
Loading…
Webui: copy prompt and attachments
examples
server
#17841
opened Dec 7, 2025 by
ServeurpersoCom
Loading…
[SYCL] fix softmax for iGPU
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17838
opened Dec 7, 2025 by
NeoZhangJianyu
Loading…
debug:Adding CPU-side visual trace for hexagon
ggml
changes relating to the ggml tensor library for machine learning
script
Script related
#17837
opened Dec 7, 2025 by
Ethan-a2
Loading…
[SYCL] Support gpt-oss by OPs add-id, mul_mat for mxfp4, swiglu_oai
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17826
opened Dec 6, 2025 by
NeoZhangJianyu
Loading…
cann : fix ops broken by circular padding guard
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#17825
opened Dec 6, 2025 by
CISC
Loading…
CANN: support gated linear attn
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#17814
opened Dec 6, 2025 by
YushengZhao
Loading…
vulkan: faster q6_k matmul
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17813
opened Dec 6, 2025 by
netrunnereve
Loading…
webui: Fix parsing non-LaTeX occurrencies of
\( or \)
examples
server
#17810
opened Dec 6, 2025 by
allozaur
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.