Insights: ollama/ollama
Overview
1 Release published by 1 person
- v0.6.5, published Apr 6, 2025
15 Pull requests merged by 13 people
- types: include the 'items' and '$defs' fields to properly handle "array" types (#10091, merged Apr 10, 2025)
- Fix nondeterministic model unload order (#10185, merged Apr 9, 2025)
- Fix Dockerfile (#9855, merged Apr 9, 2025)
- fix(integration): move waitgroup Add(1) outside goroutine to avoid a potential race (#10070, merged Apr 8, 2025; see the first sketch after this list)
- kvcache: stub out test structs (#10120, merged Apr 8, 2025)
- types: add any type and validation for ToolFunction enum (#10166, merged Apr 8, 2025)
- cleanup: remove OLLAMA_TMPDIR and references to temporary executables (#10182, merged Apr 8, 2025)
- ollamarunner: preallocate worst-case graph at startup (#10171, merged Apr 8, 2025)
- Update README.md (#10173, merged Apr 8, 2025)
- Update README.md (#10156, merged Apr 7, 2025)
- Update README.md (#10168, merged Apr 7, 2025)
- types: allow tool function parameters with either a single type or an array of types (#9434, merged Apr 7, 2025)
- CONTRIBUTING: fix code block formatting (#10169, merged Apr 7, 2025)
- digest files in parallel (#10134, merged Apr 7, 2025; see the second sketch after this list)
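The waitgroup fix in #10070 follows a standard Go rule: `sync.WaitGroup.Add` must run before the goroutine it accounts for is launched, otherwise `Wait` can observe a zero counter and return early. A minimal sketch of the pattern (illustrative only, not the code from the PR):

```go
package main

import (
	"fmt"
	"sync"
)

func main() {
	var wg sync.WaitGroup
	results := make([]int, 5)

	for i := 0; i < 5; i++ {
		// Add(1) must happen here, before the goroutine starts.
		// If it ran inside the goroutine, Wait() below could see a
		// zero counter and return before any worker had registered.
		wg.Add(1)
		go func(i int) {
			defer wg.Done()
			results[i] = i * i // stand-in for real work
		}(i)
	}

	wg.Wait()
	fmt.Println(results) // [0 1 4 9 16]
}
```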
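Digesting files in parallel (#10134) is, in outline, a fan-out: one goroutine and one hash per file, joined with the same WaitGroup discipline as above. A minimal sketch; the command-line file list and error handling are assumptions for illustration, not the PR's actual implementation:

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
	"io"
	"os"
	"sync"
)

// digestFile returns the hex-encoded SHA-256 digest of one file.
func digestFile(path string) (string, error) {
	f, err := os.Open(path)
	if err != nil {
		return "", err
	}
	defer f.Close()

	h := sha256.New()
	if _, err := io.Copy(h, f); err != nil {
		return "", err
	}
	return hex.EncodeToString(h.Sum(nil)), nil
}

func main() {
	paths := os.Args[1:] // e.g. go run . layer1.bin layer2.bin
	digests := make([]string, len(paths))

	var wg sync.WaitGroup
	for i, p := range paths {
		wg.Add(1) // before the goroutine, as in #10070
		go func(i int, p string) {
			defer wg.Done()
			d, err := digestFile(p)
			if err != nil {
				d = "error: " + err.Error()
			}
			digests[i] = d // each goroutine writes only its own slot
		}(i, p)
	}
	wg.Wait()

	for i, p := range paths {
		fmt.Printf("%s  %s\n", digests[i], p)
	}
}
```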
23 Pull requests opened by 18 people
- Added ollama4j-ui (#10129, opened Apr 4, 2025)
- server: improve spacing for JSON grammar (#10131, opened Apr 4, 2025)
- create blobs in parallel (#10135, opened Apr 5, 2025)
- wip: llama4 multimodal (#10141, opened Apr 5, 2025)
- chore: add missing error check (#10144, opened Apr 6, 2025)
- CONTRIBUTING: fix rendering of the commit message title format in browsers (#10145, opened Apr 6, 2025)
- Fix OpenAI model retrieval for models with slashes (#10147, opened Apr 6, 2025)
- discover: make unique_id check optional for AMD GPU detection (#10150, opened Apr 6, 2025)
- create: check architecture rather than vision.block_count when importing GGUF (#10162, opened Apr 7, 2025)
- server: enhance api/tags with capability information (#10174, opened Apr 8, 2025)
- chore: add missing Close() call (#10179, opened Apr 8, 2025)
- feat(installer): add GPU support options (#10186, opened Apr 9, 2025)
- fix: ensure log file is properly closed after logging completes (#10187, opened Apr 9, 2025; see the sketch after this list)
- llama: update to commit 7538246e (#10192, opened Apr 9, 2025)
- scripts/install.sh: make curl progress bar optional (#10196, opened Apr 9, 2025)
- server: do not attempt to parse offset file as gguf (#10201, opened Apr 9, 2025)
- Update README.md (#10202, opened Apr 9, 2025)
- feat: capitalise ollama in ollama help description (#10203, opened Apr 9, 2025)
- Update README.md (#10220, opened Apr 10, 2025)
- ggml: Log filesystem errors (#10221, opened Apr 10, 2025)
- clarify quantization behavior in docs (#10224, opened Apr 10, 2025)
- ggml: fix crash when head counts are arrays (#10225, opened Apr 10, 2025)
- ggml: Fix memory leak on input tensors (#10226, opened Apr 10, 2025)
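Two of these PRs (#10179, #10187) address the same Go hygiene point: every opened file handle needs a matching Close, typically deferred, or a long-running process leaks descriptors and can lose buffered writes. A minimal sketch of the pattern; the file name and logger setup are assumptions for illustration, not taken from either PR:

```go
package main

import (
	"log"
	"os"
)

func main() {
	f, err := os.OpenFile("app.log", os.O_APPEND|os.O_CREATE|os.O_WRONLY, 0o644)
	if err != nil {
		log.Fatal(err)
	}
	// Without this deferred Close, the descriptor stays open for the
	// lifetime of the process and buffered data may never be flushed.
	defer f.Close()

	logger := log.New(f, "ollama: ", log.LstdFlags)
	logger.Println("logging completed")
}
```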
41 Issues closed by 21 people
- Quantization Uses System Drive (#10223, closed Apr 10, 2025)
- hf pull is broken (#10195, closed Apr 10, 2025)
- Ollama 'API' at localhost:11434 returns a string of numbers, not a response (#10191, closed Apr 10, 2025)
- Fails to build on macOS with "fatal error: {'string','cstdint'} file not found" (#7392, closed Apr 9, 2025)
- How to force Ollama to use different CPU runners / how to compile the Windows AVX512 runner? (#6312, closed Apr 9, 2025)
- CUSTOM_CPU_FLAGS="" / non-AVX2 build (#8058, closed Apr 9, 2025)
- go run . server error (#9012, closed Apr 9, 2025)
- mistral-small v3.1 (#9827, closed Apr 9, 2025)
- Mistral-small3.1 crashes on prompt (#10175, closed Apr 9, 2025)
- llama vs ollama (#10190, closed Apr 9, 2025)
- mistral-small3.1 using too much VRAM (#10177, closed Apr 8, 2025)
- Tool call: Ollama enforces usage of string in enums for JSON Schema (#10164, closed Apr 8, 2025)
- Out of memory errors when running `gemma3` (#9791, closed Apr 8, 2025)
- Very strange RAM behavior with v0.6.4 (memory leak?) (#10132, closed Apr 8, 2025)
- Exceeding GPU memory even though I have 2 GPUs (#10089, closed Apr 8, 2025)
- Quantized Mistral Small 3.1 doesn't utilize NVIDIA GPUs (#10167, closed Apr 8, 2025)
- NewLlamaServer failed: model requires more system memory for gemma3:12b (#10181, closed Apr 8, 2025)
- Switched to `nomic-embed-text` model but still get `8192` dimension (#10176, closed Apr 8, 2025)
- options (temp, etc.) print values list in Python (#9284, closed Apr 8, 2025)
- `OLLAMA_CONTEXT_LENGTH=4096` but `OllamaEmbeddings` still shows `8192` (#10149, closed Apr 8, 2025)
- Tools and properties.type not supporting arrays (#5990, closed Apr 7, 2025)
- 404 not found (#9675, closed Apr 7, 2025)
- llama.cpp server API compatibility (#8579, closed Apr 7, 2025)
- How to cancel a generate task via the Ollama REST API (#9372, closed Apr 7, 2025)
- Unable to run current version on macOS (#9736, closed Apr 7, 2025)
- GPU not being used despite CUDA installation and GPU detection (Ollama 0.6.3 on Arch Linux) (#10075, closed Apr 7, 2025)
- Models listed aren't in order (#10153, closed Apr 7, 2025)
- Add full support for omni models (#10004, closed Apr 7, 2025)
- dial tcp: lookup dd20bb891979d25aebc8bec07b2b3bbc.r2.cloudflarestorage.com: no such host (#10151, closed Apr 7, 2025)
- Llama4ForConditionalGeneration unsupported issue (#10158, closed Apr 7, 2025)
- Failed to load `mistral-small:24b-3.1-instruct-2503-q4_K_M` (#10154, closed Apr 6, 2025)
- ollama version 5.11.0 is too slow in the generate process (#10140, closed Apr 6, 2025)
- Performance is terrible (#10137, closed Apr 5, 2025)
- Running Ollama as a k8s STS with an external script as entrypoint to load models (#10122, closed Apr 5, 2025)
- Ollama 0.6.0 with gemma3 can't load models from a mounted Cloud Storage bucket on Cloud Run (#9691, closed Apr 5, 2025)
- llama3-gradient:1048k stuck at loading model (#10111, closed Apr 4, 2025)
- How to allocate more to the GPU? (#10124, closed Apr 4, 2025)
47 Issues opened by 41 people
- Support Jinja chat templates (#10222, opened Apr 10, 2025)
- System crashes when attempting to load a model that exceeds RAM capacity (#10219, opened Apr 10, 2025)
- Image recognition doesn't work with models downloaded from another site (#10218, opened Apr 10, 2025)
- mistral-small3.1 is not fully loaded onto the GPU on RX 7900 XTX (#10217, opened Apr 10, 2025)
- 8*H100 server didn't use the GPU to run the model (#10216, opened Apr 10, 2025)
- Concurrency does not scale when increasing GPUs from 2x to 4x RTX 4090 serving the `qwq` model (#10214, opened Apr 10, 2025)
- Using JSON-structured output seems to affect the model's output (#10213, opened Apr 10, 2025)
- request: run models directly in the browser (#10212, opened Apr 10, 2025)
- Error: could not connect to ollama app, is it running? (macOS) (#10211, opened Apr 10, 2025)
- ollama.com: profile links section incomplete (#10210, opened Apr 10, 2025)
- CLI: environment variable to disable streaming (#10209, opened Apr 10, 2025)
- ollama show --verbose reporting wrong information (#10208, opened Apr 10, 2025)
- API default model (#10207, opened Apr 10, 2025)
- Are there ads in the official website's model listings? (#10206, opened Apr 10, 2025)
- Ollama ignores CUDA after reboot, falls back to CPU only (#10204, opened Apr 9, 2025)
- [Documentation] Automate adding the latest models to the README through GitHub Actions (#10200, opened Apr 9, 2025)
- Error: vocabulary is larger than expected '262145' instead of '262144' (#10199, opened Apr 9, 2025)
- On Windows, the installer defaults to the C drive; can the install location be made configurable? (#10198, opened Apr 9, 2025)
- Add Human Feedback (#10194, opened Apr 9, 2025)
- Supporting private Hugging Face repos hosted on JFrog Artifactory (#10193, opened Apr 9, 2025)
- Sharing GPUs across two servers (#10189, opened Apr 9, 2025)
- Support Dream 7b (#10188, opened Apr 9, 2025)
- Some kernel names are not shown in NVIDIA Nsight Systems (#10184, opened Apr 8, 2025)
- Understanding context length (#10183, opened Apr 8, 2025)
- ollama run phi4-mini error (#10180, opened Apr 8, 2025)
- Option to disable CPU fallback for SoCs with unified memory (#10178, opened Apr 8, 2025)
- tensor-split problem (#10172, opened Apr 8, 2025)
- Multimodal broken in 0.6.5? (#10170, opened Apr 7, 2025)
- Capitalize Ollama in `ollama` help description (#10165, opened Apr 7, 2025)
- qwen2.5:72b and llama3:70b not using GPU: extremely slow and consuming 40GB+ RAM (#10163, opened Apr 7, 2025)
- Will --ctx-size 24576 override the environment variable OLLAMA_CONTEXT_LENGTH? (#10160, opened Apr 7, 2025)
- gemma3:27b gets stuck generating the same token and producing useless gibberish output (#10159, opened Apr 7, 2025)
- Q6 quant (with vision support) for mistral-small:24b-3.1-instruct-2503? (#10157, opened Apr 7, 2025)
- Feature request: always use the GPU (#10155, opened Apr 7, 2025)
- RX580 (#10152, opened Apr 6, 2025)
- Llama 4 support (#10143, opened Apr 5, 2025)
- ollama run: /clear not clearing chat context (#10142, opened Apr 5, 2025)
- OpenAI API: models with slashes not retrievable (#10139, opened Apr 5, 2025)
- Installer corrupted: cublasLt64_12.dll and other files corrupted (#10138, opened Apr 5, 2025)
- Where is the log file, and how can I configure its location? (#10136, opened Apr 5, 2025)
- AI vision: wildly different results between Gemma3 34b via OpenAI vs Ollama endpoint (#10130, opened Apr 4, 2025)
- Incorrect VRAM estimation (#10128, opened Apr 4, 2025)
- panic: failed to decode batch: could not find a kv cache slot (length: 6656) (#10127, opened Apr 4, 2025)
- qwen2.5-coder-cline:14b not using NVIDIA GPU (#10125, opened Apr 4, 2025)
- Structured output: "allOf" (#10123, opened Apr 4, 2025)
- Can't use official QAT GGUF of Gemma-3-27b-it (#10121, opened Apr 4, 2025)
81 Unresolved conversations
Sometimes conversations happen on old items that aren't yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- llama: remove model loading for grammar (#10096, commented Apr 10, 2025 • 19 new comments)
- cmd: default to client2 and simplify pull progress display (#10069, commented Apr 9, 2025 • 6 new comments)
- add ollama stop all (#10043, commented Apr 7, 2025 • 2 new comments)
- Granite new engine (#9966, commented Apr 4, 2025 • 1 new comment)
- support no caching when kvCacheType is "nocache" for deterministic completion (#10064, commented Apr 6, 2025 • 0 new comments)
- NVIDIA GPU drivers not loaded on Jetson Orin Nano (#9503, commented Apr 10, 2025 • 0 new comments)
- SPAM in your model database (#9134, commented Apr 10, 2025 • 0 new comments)
- System memory leak with Gemma3 (#10040, commented Apr 10, 2025 • 0 new comments)
- SIGSEGV: segmentation violation (#9665, commented Apr 10, 2025 • 0 new comments)
- AMD Ryzen NPU support (#5186, commented Apr 10, 2025 • 0 new comments)
- AMD RX9070/9070XT support (#9812, commented Apr 10, 2025 • 0 new comments)
- VRAM usage does not go back down after model unloads: stuck in "Stopping..." (#7606, commented Apr 9, 2025 • 0 new comments)
- [ROCm error: out of memory] Runner terminated: num_ctx within model/hardware limits reliably crashes (#9957, commented Apr 9, 2025 • 0 new comments)
- Models can't be stopped correctly when using WebUI combined with Ollama (#8969, commented Apr 9, 2025 • 0 new comments)
- qwen 2.5 coder stuck "Stopping" (#8178, commented Apr 9, 2025 • 0 new comments)
- Stopping a misbehaving model after some amount of time (#9617, commented Apr 9, 2025 • 0 new comments)
- Provide logits or logprobs in the API (#2415, commented Apr 9, 2025 • 0 new comments)
- Ollama always chooses iGPU for computations in hybrid discrete+iGPU ROCm setups (#9588, commented Apr 9, 2025 • 0 new comments)
- Speed ten times slower than llamafile (#8305, commented Apr 9, 2025 • 0 new comments)
- Batch embeddings get progressively worse with larger batches (#6262, commented Apr 9, 2025 • 0 new comments)
- `pulling manifest Error: EOF` when pulling after disk is full (#1731, commented Apr 9, 2025 • 0 new comments)
- Ollama errors on older versions of Linux/GLIBC on 0.5.13 (#9506, commented Apr 9, 2025 • 0 new comments)
- Ollama refuses to use GFX803 even when detected (#9807, commented Apr 9, 2025 • 0 new comments)
- Adding a search command (#10046, commented Apr 6, 2025 • 0 new comments)
- server: enable content streaming with tools (#10028, commented Apr 4, 2025 • 0 new comments)
- server: prevent model thrashing from unset API fields (#10003, commented Apr 7, 2025 • 0 new comments)
- server: support streaming near tool usage (#9973, commented Apr 7, 2025 • 0 new comments)
- Integration test improvements (#9654, commented Apr 9, 2025 • 0 new comments)
- feat: add debug logging in chat/generate functions (#8957, commented Apr 6, 2025 • 0 new comments)
- feat: Support Moore Threads GPU (#7554, commented Apr 10, 2025 • 0 new comments)
- fix: consider any status code as redirect (#7231, commented Apr 10, 2025 • 0 new comments)
- FEAT: add rerank support (#7219, commented Apr 8, 2025 • 0 new comments)
- AMD integrated graphics on Linux kernel 6.9.9+: GTT memory, loading freeze fix (#6282, commented Apr 5, 2025 • 0 new comments)
- Enable AMD iGPU 780M in Linux; create amd-igpu-780m.md (#5426, commented Apr 8, 2025 • 0 new comments)
- llm/server.go: fix ollama ps showing 100% GPU even when using CPU as runner (#4906, commented Apr 9, 2025 • 0 new comments)
- cobra shell completions (#4690, commented Apr 9, 2025 • 0 new comments)
- Ollama not freeing and eventually running out of memory [all models] (#10114, commented Apr 10, 2025 • 0 new comments)
- Ollama hangs while generating a response (#10119, commented Apr 10, 2025 • 0 new comments)
- Inference with OpenVINO on Intel (#2169, commented Apr 10, 2025 • 0 new comments)
- CUDA error: an illegal memory access was encountered (#9018, commented Apr 10, 2025 • 0 new comments)
- Add support for older AMD GPUs gfx803, gfx802, gfx805 (e.g. Radeon RX 580, FirePro W7100) (#2453, commented Apr 10, 2025 • 0 new comments)
- Add an easy way to list all models and their capabilities (#10097, commented Apr 6, 2025 • 0 new comments)
- Unable to push: max retries exceeded on slower connections (#2155, commented Apr 5, 2025 • 0 new comments)
- Possibility to remove "max retries exceeded" when downloading models over a slow connection (#3162, commented Apr 5, 2025 • 0 new comments)
- Llama.cpp now supports distributed inference across multiple machines (#4643, commented Apr 5, 2025 • 0 new comments)
- Available memory calculation on AMD APU no longer takes GTT into account (#5471, commented Apr 5, 2025 • 0 new comments)
- Allow importing multi-file GGUF models (#5245, commented Apr 5, 2025 • 0 new comments)
- DeepSeek R1 671b is faster than 70b (#10030, commented Apr 5, 2025 • 0 new comments)
- support deepseek 671b fp4 (#9419, commented Apr 4, 2025 • 0 new comments)
- gemma EOF error on image input due to improper memory management (#10041, commented Apr 4, 2025 • 0 new comments)
- ollama does not utilize HBM3 memory on MI300A (#8735, commented Apr 4, 2025 • 0 new comments)
- EOF with Gemma3:27b | POST predict: Post "http://127.0.0.1:35737/completion": EOF (status code: 500) (#9699, commented Apr 4, 2025 • 0 new comments)
- Add generate embedding for sparse vectors (#6230, commented Apr 4, 2025 • 0 new comments)
- add /metrics endpoint (#3144, commented Apr 4, 2025 • 0 new comments)
- Llama3: generated outputs inconsistent despite seed and temperature (#5321, commented Apr 4, 2025 • 0 new comments)
- Update DeepSeek V3 to improved version (#9980, commented Apr 4, 2025 • 0 new comments)
- Using Qwen as an agent in VS Code (#10038, commented Apr 4, 2025 • 0 new comments)
- Pull a model on start or without requiring serve (#3369, commented Apr 4, 2025 • 0 new comments)
- Unsupported value NaN in Ollama log (#9639, commented Apr 4, 2025 • 0 new comments)
- Provide a single command for "serve + pull model", to be used in CI/CD (#5385, commented Apr 4, 2025 • 0 new comments)
- Support for jinaai/jina-embeddings-v3 embedding model (#6922, commented Apr 4, 2025 • 0 new comments)
- Ollama not detecting adapter_config.json file (#9505, commented Apr 9, 2025 • 0 new comments)
- Tool call support in Qwen 2.5 hallucinates with Maybe pattern (#7051, commented Apr 9, 2025 • 0 new comments)
- Support for AMD 9000 GPUs (#9633, commented Apr 8, 2025 • 0 new comments)
- Using split memory (RAM+VRAM) should never happen (#10092, commented Apr 8, 2025 • 0 new comments)
- Provide an updated OpenAPI Specification file (a/k/a "swagger file") with each release (#3383, commented Apr 8, 2025 • 0 new comments)
- Why does gemma3 not work properly on Mac? (#9939, commented Apr 8, 2025 • 0 new comments)
- Isn't it time to move on to omni models? (#6786, commented Apr 8, 2025 • 0 new comments)
- Compute Capability 3.7 still needed (#9620, commented Apr 8, 2025 • 0 new comments)
- Support logit_bias (#3795, commented Apr 7, 2025 • 0 new comments)
- Add support for array head count GGUF KV (#9984, commented Apr 7, 2025 • 0 new comments)
- Ollama ps says 22 GB, but nvidia-smi says 16 GB with flash attention enabled (#6160, commented Apr 7, 2025 • 0 new comments)
- Ollama bug report: application launch issue (#9832, commented Apr 7, 2025 • 0 new comments)
- Error: POST predict: Post "http://127.0.0.1:62622/completion": read tcp 127.0.0.1:62627->127.0.0.1:62622: wsarecv: The remote host has closed a connection (#9674, commented Apr 7, 2025 • 0 new comments)
- add Qwen2-VL/Qwen2.5-VL (#6564, commented Apr 7, 2025 • 0 new comments)
- llama_model_load_from_file_impl: failed to load model (#9541, commented Apr 7, 2025 • 0 new comments)
- Feature request: support for OpenCL (#4373, commented Apr 7, 2025 • 0 new comments)
- Update broken on Linux (#10101, commented Apr 7, 2025 • 0 new comments)
- macOS Ollama not binding to 0.0.0.0 (#3581, commented Apr 6, 2025 • 0 new comments)
- Support AMD GPUs on Intel Macs (#1016, commented Apr 6, 2025 • 0 new comments)
- [ENHANCE] Add Ubuntu support for AMD Ryzen AI 9 HX 370 w/ Radeon 890M (gfx1150) (#9999, commented Apr 6, 2025 • 0 new comments)