ollama

mirror of https://github.com/jmorganca/ollama synced 2025-10-05 16:22:53 +02:00

Files

Jesse Gross 19e6796eac llm: Support KV cache quantization with gpt-oss

With the new version of GGML in #12245, KV cache quantization
no longer causes a fallback to CPU.

2025-10-03 16:31:58 -07:00

2025-10-03 16:31:58 -07:00

2025-06-20 11:11:40 -07:00

next ollama runner (#7913 )

2025-02-13 16:31:21 -08:00

config.go

2025-06-25 21:47:09 -07:00