Is anyone running the Command-R model via an external Ollama server with this integration?
Whenever I set a larger context size and send a request to Ollama from the llama conversation integration, the model appears to be loaded onto just one GPU. I never get a response, and the model is unloaded after some time.
If I then use Open WebUI with the same model and the same context size, I can see all GPUs being used and the model works fine there.
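For comparison, this is roughly how one could send the same request directly to the Ollama HTTP API, bypassing the integration. It's a sketch only: the host, model tag, prompt, and num_ctx value are placeholders, assuming the default Ollama port (11434).

```python
import requests

# Sketch: hit the external Ollama server directly with a large context size,
# mirroring what the integration is expected to send. Host/model/num_ctx are
# placeholder values for illustration.
OLLAMA_URL = "http://ollama-host:11434/api/generate"

resp = requests.post(
    OLLAMA_URL,
    json={
        "model": "command-r",
        "prompt": "Hello, can you respond?",
        "stream": False,
        "options": {"num_ctx": 16384},  # the larger context size being tested
    },
    timeout=600,
)
resp.raise_for_status()
print(resp.json()["response"])
```

If a direct request like this spreads the model across all GPUs (e.g. visible in nvidia-smi) while requests from the integration do not, the difference presumably lies in the options the integration passes along with the request.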