Tags: ggml-org/llama.cpp

b5590

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
ggml-vulkan: adds support for op CONV_TRANSPOSE_1D (#13813)

* ggml-vulkan: adds op CONV_TRANSPOSE_1D

* test-backend-ops: adds more sophisticated tests for CONV_TRANSPOSE_1D

* Missing barrier added to shader.
Number of additional tests reduced to 108.

* Fixes typo in variable name.

* Removes extra whitespaces.

* Adds int64->int32 casts to prevent possible warnings.

* Problem size reduced in tests to pass tests with llvmpipe.

* supports_op condition moved from unintended position
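For context on what the new Vulkan op computes: a transposed 1D convolution scatters each input element across the kernel footprint, producing an output of length `(len(x) - 1) * stride + len(w)`. The following is a minimal single-channel numpy sketch of those semantics only — it is not ggml's Vulkan shader, and the function name is ours:

```python
import numpy as np

def conv_transpose_1d(x, w, stride=1):
    # Naive single-channel transposed convolution (illustrative only).
    # Each input sample x[i] is scaled by the kernel w and accumulated
    # into the output starting at position i * stride.
    out = np.zeros((len(x) - 1) * stride + len(w))
    for i, xi in enumerate(x):
        out[i * stride : i * stride + len(w)] += xi * w
    return out

# x = [1, 2], w = [1, 1, 1], stride 1 -> overlapping contributions sum:
# [1, 1+2, 1+2, 2] = [1, 3, 3, 2]
print(conv_transpose_1d(np.array([1.0, 2.0]), np.array([1.0, 1.0, 1.0])))
```

The multi-channel/batched cases that the new `test-backend-ops` tests cover follow the same scatter-accumulate pattern per input/output channel pair.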

b5589

kv-cache : refactor the update/defrag mechanism (#13988)

* kv-cache : refactor update mechanism

ggml-ci

* memory : improve status handling

* defrag : reset head + add comments

ggml-ci

* cont : minor fixes

ggml-ci

b5588

ci : remove cuda 11.7 releases, switch runner to windows 2022 (#13997)

b5587

releases : use dl backend for linux release, remove arm64 linux release (#13996)

b5586

llama-graph : use ggml_repeat_4d (#13998)
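`ggml_repeat_4d` lets the graph request a repeat directly to explicit target dimensions `ne0..ne3` rather than matching another tensor's shape. A hedged numpy sketch of the assumed repeat-to-shape semantics (note that ggml orders dimensions fastest-first, `ne0` innermost, so the numpy shape tuple is reversed; the function name and dim handling here are our illustration, not ggml's code):

```python
import numpy as np

def repeat_4d(a, ne0, ne1, ne2, ne3):
    # Sketch: tile `a` up to the target shape. Each target dim is assumed
    # to be an integer multiple of the corresponding source dim.
    # numpy shape order is (d3, d2, d1, d0), the reverse of ggml's ne[].
    d3, d2, d1, d0 = a.shape
    reps = (ne3 // d3, ne2 // d2, ne1 // d1, ne0 // d0)
    return np.tile(a, reps)

# Repeat a (1,1,1,2) tensor to ne0=4, ne1=1, ne2=1, ne3=2,
# i.e. numpy shape (2, 1, 1, 4).
r = repeat_4d(np.ones((1, 1, 1, 2)), 4, 1, 1, 2)
print(r.shape)
```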

b5585

CUDA: fix FTZ in FA for Gemma 3 (#13991)
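FTZ here refers to flush-to-zero: hardware (or a compiled kernel) treating subnormal floating-point values as exactly zero, which can change results in numerically sensitive paths like flash attention. A small stdlib-only illustration of the behavior itself — this models the concept, not the CUDA kernel that was fixed:

```python
import sys

def ftz(x):
    # Flush-to-zero: any nonzero value smaller in magnitude than the
    # smallest *normal* float is replaced by 0.0. (Illustration only;
    # on GPUs this is a hardware/compiler mode, not a branch like this.)
    return 0.0 if 0 < abs(x) < sys.float_info.min else x

print(ftz(1e-320))  # subnormal for float64 -> flushed to 0.0
print(ftz(1.0))     # normal value -> unchanged
```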

b5584

kv-cache : fix unified::seq_rm to work with seq_id < 0 (#13985)

ggml-ci

b5581

opencl: add `backend_synchronize` (#13939)

* This is not needed in the normal use case, where the result is read
  via `tensor_get`, but it allows the perf mode of `test-backend-ops`
  to properly measure performance.
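The point of the bullet above: with an asynchronous backend queue, timing only the enqueue call measures submission cost, not execution. A `synchronize` hook lets the benchmark wait for completion before stopping the clock. A toy stand-in using a thread as the "device" (the class and names are ours, not the OpenCL backend's API):

```python
import threading
import time

class AsyncBackend:
    # Toy async device queue: enqueue returns immediately,
    # synchronize blocks until all submitted work has finished.
    def __init__(self):
        self._threads = []

    def enqueue(self, fn):
        t = threading.Thread(target=fn)
        t.start()
        self._threads.append(t)

    def synchronize(self):
        for t in self._threads:
            t.join()
        self._threads.clear()

backend = AsyncBackend()
t0 = time.perf_counter()
backend.enqueue(lambda: time.sleep(0.05))   # "kernel" runs asynchronously
enqueue_time = time.perf_counter() - t0     # tiny: only submission cost
backend.synchronize()                        # wait for actual completion
total_time = time.perf_counter() - t0        # includes the kernel's 0.05 s
```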

b5580

OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (#13840)

* add concat, pad, repeat, tsembd, tanh, upscale

* small fixes

b5579

server : disable speculative decoding for SWA models (#13970)

* server : use swa-full for draft context

ggml-ci

* server : disable speculative decoding for SWA models