generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Closed
Labels
Description
Reproduction
Entropies is a bfloat16 tensor when training with bf16, but quantile() input tensor must be either float or double dtype.
trl/trl/trainer/grpo_trainer.py
Line 1395 in 6a6d434
entropy_threshold = torch.quantile(entropies.flatten(), self.token_entropy_percentile_threshold) |
System Info
RuntimeError: quantile() input tensor must be either float or double dtype
Checklist
- I have checked that my issue isn't already filed (see open issues)
- I have included my system information
- Any code provided is minimal, complete, and reproducible (more on MREs)
- Any code provided is properly formatted in code blocks, (no screenshot, more on code blocks)
- Any traceback provided is complete