
NF4 quantized Flux models with LoRAs #10496

@hamzaakyildiz

Description


Is there any update here? With NF4 quantized Flux models, I could not use any LoRA.

Update: NF4 serialization and loading are working fine. @DN6, let's brainstorm how we can support it more easily. This would help us unlock LoRAs on the quantized weights, too (cc: @BenjaminBossan for PEFT). I think this will become increasingly critical for larger models.
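
For concreteness, here is a minimal sketch of the workflow in question, assuming diffusers' bitsandbytes integration (NF4 via `BitsAndBytesConfig`); the LoRA repo id below is a placeholder, not a real checkpoint:

```python
import torch
from diffusers import BitsAndBytesConfig, FluxPipeline, FluxTransformer2DModel

# Quantize the Flux transformer to NF4 at load time.
quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
transformer = FluxTransformer2DModel.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    subfolder="transformer",
    quantization_config=quant_config,
    torch_dtype=torch.bfloat16,
)

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    transformer=transformer,
    torch_dtype=torch.bfloat16,
)
pipe.enable_model_cpu_offload()

# Loading a LoRA on top of the NF4-quantized transformer is the step
# this issue asks to support.
pipe.load_lora_weights("some-user/flux-lora", adapter_name="example")  # placeholder repo id
```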

transformers has a nice reference for us to follow. accelerate also has https://huggingface.co/docs/accelerate/en/usage_guides/quantization, but it doesn't support NF4 serialization yet.
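
The transformers-side pattern referenced above looks roughly like the sketch below; the checkpoint id is illustrative, and serializing 4-bit weights there requires a recent bitsandbytes:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # illustrative checkpoint
    quantization_config=quant_config,
)

# transformers can serialize the quantized weights back out; this is the
# behavior the issue wants mirrored in diffusers.
model.save_pretrained("llama-2-7b-nf4")
```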

Cc: @SunMarc for jamming on this together.

Originally posted by @sayakpaul in #9165 (comment)
