Better error message when trying to run fp16 weights on CPU #96292

@patrickvonplaten

Description

🚀 The feature, motivation and pitch

Hey 👋 from the Hugging Face Open-Source team,

We're seeing the following issue come up repeatedly across libraries:

RuntimeError: "LayerNormKernelImpl" not implemented for 'Half'

or:

RuntimeError: "addmm_impl_cpu_" not implemented for 'Half'

E.g.: https://github.com/runwayml/stable-diffusion/issues/23

The problem here is that a PyTorch model has been converted to fp16 and the user then tries to run it on CPU. For example:

from torch import nn
import torch

# A half-precision layer and input on CPU
linear = nn.Linear(2, 2, dtype=torch.float16)
tensor = torch.ones((2,), dtype=torch.float16)

linear(tensor)  # fails: most CPU kernels are not implemented for fp16

yields:

"addmm_impl_cpu_" not implemented for 'Half'

Could we maybe catch such errors in the forward of torch.nn.Module (https://pytorch.org/docs/stable/_modules/torch/nn/modules/module.html#Module) and raise a simpler error message that just says "Float16 cannot be run on CPU"?
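One possible shape for this, as a sketch only (the hook name `_check_half_on_cpu` and the exact error wording are assumptions, not an actual PyTorch API proposal), is a forward pre-hook that checks parameter dtype and device before the low-level kernel runs:

```python
import torch
from torch import nn

def _check_half_on_cpu(module, inputs):
    # Hypothetical pre-forward check: detect fp16 parameters on CPU and
    # raise a clearer error before the low-level kernel fails.
    for p in module.parameters(recurse=False):
        if p.dtype == torch.float16 and p.device.type == "cpu":
            raise RuntimeError(
                "Float16 cannot be run on CPU: cast the model to float32 "
                "with model.float() first."
            )

linear = nn.Linear(2, 2, dtype=torch.float16)
linear.register_forward_pre_hook(_check_half_on_cpu)

try:
    linear(torch.ones((2,), dtype=torch.float16))
except RuntimeError as e:
    msg = str(e)
```

With the hook registered, the user sees the clear message instead of the cryptic "addmm_impl_cpu_" one.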

Alternatives

No response

Additional context

No response

cc @malfet

Metadata

Assignees

No one assigned

    Labels

    good first issue · module: error checking (Bugs related to incorrect/lacking error checking) · triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

    Projects

    Status

    In Progress

    Milestone

    No milestone
