Skip to content

Conversation

dhiltgen
Copy link
Collaborator

@dhiltgen dhiltgen commented Jul 2, 2025

This adds some extra logs to make the new engine a bit more consistent with the llama engine.

I opted not to change the 2 existing lines showing buffer sizes since they're close enough.

Example from llama engine:

load_tensors: offloading 28 repeating layers to GPU
load_tensors: offloading output layer to GPU
load_tensors: offloaded 29/29 layers to GPU
load_tensors:   CPU_Mapped model buffer size =   308.23 MiB
load_tensors: Metal_Mapped model buffer size =  1918.36 MiB

With this change, same model on the new engine

time=2025-07-02T16:16:54.727-07:00 level=INFO source=ggml.go:362 msg="offloading 28 repeating layers to GPU"
time=2025-07-02T16:16:54.727-07:00 level=INFO source=ggml.go:368 msg="offloading output layer to GPU"
time=2025-07-02T16:16:54.727-07:00 level=INFO source=ggml.go:378 msg="offloaded 29/29 layers to GPU"
time=2025-07-02T16:16:54.727-07:00 level=INFO source=ggml.go:380 msg="model weights" buffer=Metal size="1.9 GiB"
time=2025-07-02T16:16:54.727-07:00 level=INFO source=ggml.go:380 msg="model weights" buffer=CPU size="308.2 MiB"

This adds some extra logs to make the new engine a bit more consistent
with the llama engine.
@dhiltgen dhiltgen merged commit 2c4ce40 into ollama:main Jul 2, 2025
8 checks passed
@dhiltgen dhiltgen deleted the logs branch July 2, 2025 23:38
gabe-l-hart added a commit to gabe-l-hart/ollama that referenced this pull request Jul 11, 2025
* origin/main:
ggml: Report ordinal IDs for AMD GPUs on Windows
doc: add MacOS docs (ollama#11334)
Reduce default parallelism to 1 (ollama#11330)
API/CLI context enhancements (ollama#11331)
add `tool_name` to api.md (ollama#11326)
template: add tool result compatibility (ollama#11294)
ci: modularization (ollama#11324)
Revert "ggml: Temporarily disable reporting UUIDs"
readme: update Ollama icon size
int: add performance integration tests (ollama#11173)
doc: add NVIDIA blackwell to supported list (ollama#11307)
Update base image to Ubuntu 24.04 LTS (ollama#9681)
doc: Update link for mac install (ollama#11288)
mimic logs for layers on new engine (ollama#11278)
readme: add NativeMind to community integrations (ollama#11242)
tools: fix parsing tool calls with empty arguments, missing required fields (ollama#11233)
readme: add ollama-bash-toolshed to community integrations (ollama#11224)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants