Labels: bug (Something isn't working)
Description
To support gemma-3 it looks like we need a few changes:
- bump transformers to latest (PR #176, "feat: streaming each dtensor in refit", bumps from 4.49.0 to 4.51.3, which should be enough)
- use eager attention / make it configurable (related):

  ```
  (FSDP1PolicyWorker[rank=0] pid=1810886) It is strongly recommended to train Gemma3 models with the eager attention implementation instead of sdpa. Use eager with AutoModelForCausalLM.from_pretrained('<path-to-checkpoint>', attn_implementation='eager').
  ```
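A minimal sketch of how the "make it configurable" part could look: select the `attn_implementation` kwarg per model family so Gemma 3 gets eager attention as the warning recommends. The helper name and the substring check on the model name are assumptions for illustration, not the project's actual config mechanism.

```python
# Sketch: choose the attention implementation based on the model name.
# The "gemma-3" substring check is an illustrative assumption.
def attn_kwargs(model_name: str) -> dict:
    if "gemma-3" in model_name.lower():
        # Gemma 3: transformers warns against sdpa, recommends eager.
        return {"attn_implementation": "eager"}
    # Default for other models.
    return {"attn_implementation": "sdpa"}
```

This could then be spliced into the existing load call, e.g. `AutoModelForCausalLM.from_pretrained(path, **attn_kwargs(path))`.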