Pull requests: ggml-org/llama.cpp
Pull requests list
Apple NPU acceleration integrated into llama.cpp, using MiniCPM-V 4.0 as an example.
examples
python
python script changes
#15262
opened Aug 12, 2025 by
tc-mb
WIP: ggml-cuda: Add bf16 cuda support to fattn (Flash Attention)
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
#15261
opened Aug 12, 2025 by
eous
musa: fix build warnings
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#15258
opened Aug 12, 2025 by
yeahdongcn
vulkan: fuse adds
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#15252
opened Aug 11, 2025 by
jeffbolznv
ci : Enable pre-built cuda releases on ubuntu (#5106)
devops
improvements to build systems and github actions
#15249
opened Aug 11, 2025 by
michaelgiba
Fixes #15247 | Update chat.cpp to support (at least) qwen3 reasoning + tool_choice = required
#15248
opened Aug 11, 2025 by
ExtReMLapin
vulkan: perf_logger improvements
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#15246
opened Aug 11, 2025 by
jeffbolznv
Fix HIP warp synchronization function conflicts for ROCm 7.0+
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#15241
opened Aug 11, 2025 by
slojosic-amd
ci : Fix -Werror=return-type in clip.cpp so ci/run.sh can run without issue
examples
#15221
opened Aug 11, 2025 by
michaelgiba
Adding Resume for curl downloads
testing
Everything test related
#15217
opened Aug 10, 2025 by
taf2
ci : add copilot-setup-steps.yml
devops
improvements to build systems and github actions
#15214
opened Aug 10, 2025 by
CISC
introduce how to build with Vulkan on Raspbian OS
documentation
Improvements or additions to documentation
#15206
opened Aug 10, 2025 by
MaoJianwei
webui: prettify styling
examples
server
#15201
opened Aug 9, 2025 by
olegshulyakov
11 tasks done
server: implementation of v1/completions echo logprobs support
examples
server
#15189
opened Aug 9, 2025 by
fo40225
ggml-rpc: chunk send()/recv() to avoid EINVAL for very large tensors over RPC (macOS & others)
ggml
changes relating to the ggml tensor library for machine learning
#15188
opened Aug 9, 2025 by
Tak-RS
ggml: add conv3d op
testing
Everything test related
ggml
changes relating to the ggml tensor library for machine learning
#15182
opened Aug 8, 2025 by
rmatif
server : implement /api/version endpoint for ollama compatibility (#15167)
examples
server
#15177
opened Aug 8, 2025 by
albert-polak
MoE Expert manipulation args
demo
Demonstrate some concept or idea, not intended to be merged
#15165
opened Aug 8, 2025 by
kooshi
GPT-OSS: parse commentary tool calls; handle glued 'json'; add unit tests (#15102)
testing
Everything test related
#15158
opened Aug 7, 2025 by
Nerexis