Are the non-RmPad and RmPad model versions interchangeable? #20

@yanggthomas

Hi, thanks for your great work!

We are attempting to deploy this framework on Volta GPUs, which Flash-Attn does not support. I noticed there are Llama models without RmPad that don't require flash-attn. Is it possible to use those non-RmPad models instead? Are any other modifications related to RmPad behavior required?

Thanks a lot.
