Loading Parlor at 16-bit #303

@LaaZa

Is your feature request related to a problem? Please describe.
With Parlor-large being, well... rather large, it would be nice to have the option to load it in (b)float16 instead of float32.

Describe the solution you'd like
Add an option in Parlor settings to choose the model dtype: float32, float16, or bfloat16.

Describe alternatives you've considered
Actual quantization, for even greater memory savings, though I don't know whether any of the quantization methods work with Parlor right now.
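
If quantization does turn out to work, here is a hypothetical sketch of what 8-bit loading could look like, assuming Parlor uses a transformers-compatible model class; the `AutoModel` class and the `"parlor-large"` checkpoint name are placeholders, and this is untested with Parlor itself:

```python
# Hypothetical sketch: 8-bit quantized loading via transformers + bitsandbytes.
# Whether this actually works with Parlor is exactly the open question.
from transformers import AutoModel, BitsAndBytesConfig

quant_config = BitsAndBytesConfig(load_in_8bit=True)
model = AutoModel.from_pretrained("parlor-large", quantization_config=quant_config)
```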

Additional context
The effect on quality is difficult to determine due to the model's inherent variability, but I did not notice anything significant. I think the option to save 50% of the memory outweighs any minor quality loss. This should be easy to implement; only the dtype at model-loading time needs to change.
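
For illustration, a minimal sketch of the change, assuming Parlor is loaded through a Hugging Face-style `from_pretrained()` call; the `load_parlor` helper, the `AutoModel` class, and the `"parlor-large"` checkpoint name are placeholders, not the actual Parlor API:

```python
# Rough sketch, not the actual Parlor loading code.
import torch
from transformers import AutoModel

# Map the proposed settings values to torch dtypes.
DTYPES = {
    "float32": torch.float32,
    "float16": torch.float16,
    "bfloat16": torch.bfloat16,
}

def load_parlor(checkpoint: str, dtype_name: str = "float32"):
    # torch_dtype controls the precision the weights are loaded in;
    # float16/bfloat16 roughly halve memory use versus float32.
    return AutoModel.from_pretrained(checkpoint, torch_dtype=DTYPES[dtype_name])

model = load_parlor("parlor-large", dtype_name="bfloat16")
```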
