Pull requests: ggml-org/llama.cpp
Fix Bad Substitution Error
examples
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#22737
opened May 6, 2026 by
dogunbound
webui : [ChatFormActionAdd][a11y] fix accessibility issues in attach menu trigger and items
examples
server/webui
server
#22736
opened May 6, 2026 by
vignesh191
opencl: add q4_0 MoE GEMM for Adreno
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#22731
opened May 5, 2026 by
shawngu-quic
Contributor
Write a readme on Multi-GPU usage in llama.cpp
documentation
Improvements or additions to documentation
#22729
opened May 5, 2026 by
gaugarg-nv
Contributor
server, webui: support continue generation on reasoning models
examples
server/webui
server
#22727
opened May 5, 2026 by
ServeurpersoCom
Contributor
webui: Remove Google Favicons & Improve MCP Information logic & UI
examples
server/webui
server
#22719
opened May 5, 2026 by
allozaur
Contributor
Adding support for the granite multilingual embeddings R2 (ibm-granite/granite-embedding-{97,311}m-multilingual-r2 models)
model
Model specific
python
python script changes
#22716
opened May 5, 2026 by
hansolosan
metal : promote mul_mv/mul_mm batch divisors to function constants
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#22711
opened May 5, 2026 by
guyfischman
Fuse rms_norm, mul, quantize_q8_1
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#22710
opened May 5, 2026 by
lnigam
Contributor
Feat/qlora training
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
Vulkan
Issues specific to the Vulkan backend
#22705
opened May 5, 2026 by
srossitto79
Draft
Feat/backward mul mat
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
Vulkan
Issues specific to the Vulkan backend
#22704
opened May 5, 2026 by
srossitto79
Draft
webui: [ChatFormPickerPopover][a11y] add tabindex and aria-hidden to popover trigger
examples
server/webui
server
#22699
opened May 5, 2026 by
vignesh191
vulkan: Check shared memory size for mmq shaders
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#22693
opened May 4, 2026 by
jeffbolznv
Contributor
llama: add pshard runtime for plan switching and streamed weights
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#22692
opened May 4, 2026 by
aukarande
tools: add llama-pshard-plan-params for token-tiered placement planning
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#22691
opened May 4, 2026 by
aukarande
tests: add BF16 non-contig coverage for MUL_MAT permutations
testing
Everything test related
#22689
opened May 4, 2026 by
ServeurpersoCom
Contributor
ggml : use CL_DEVICE_GLOBAL_MEM_SIZE as estimate for OpenCL --fit
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#22688
opened May 4, 2026 by
fl0rianr
vulkan: optimize operations in the IM2COL shader
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#22685
opened May 4, 2026 by
daniandtheweb
Contributor
server: (router) expose child model info from router's /v1/models
examples
server
#22683
opened May 4, 2026 by
ngxson
Contributor
ggml-zendnn : adaptive fallback to CPU backend for small batch sizes
AMD ZenDNN
Issues related to the AMD ZenDNN backend
ggml
changes relating to the ggml tensor library for machine learning
#22681
opened May 4, 2026 by
z-sachin
ci: validate model naming convention
devops
improvements to build systems and github actions
#22680
opened May 4, 2026 by
ngxson
Contributor