Skip to content

Deepseek-R1 671B - Segmentation Fault Bug #8602

@Notbici

Description

@Notbici

What is the issue?

Hi,

I've been using the Deepseek-R1 671B model from Ollama on my 8x H100 machine and keep running into a segmentation fault, I've noticed that the frequency of the segfault happens the larger the context becomes.

I'm using the latest Ollama release.
Hardware Specs:

  • 8x H100 - 80GB SXM
  • Xeon Platinum 8468 (160c)
  • Micron 7450 ssd
  • 1548gb of ram
  • OS is ubuntu 22.04
  • CUDA: 12.6
  • NVIDIA driver: 560.35.05

Happy to test params or gather more data, I'm having a hard time working around this. The distilled models like the deepseek llama 70B work just fine.

server.err.log

Any advice is appreciated.

OS

Linux

GPU

Nvidia

CPU

Intel

Ollama version

0.5.7

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions