### Reproduction Run a small experiment w/ log completions enabled to reproduce ``` per_device_train_batch_size: 1 max_steps: 1 num_generations: 2 use_vllm: true vllm_mode: colocate vllm_tensor_parallel_size: 2 log_completions: true ``` ### System Info  ### Checklist - [x] I have checked that my issue isn't already filed (see [open issues](https://github.com/huggingface/trl/issues?q=is%3Aissue)) - [x] I have included my system information - [x] Any code provided is minimal, complete, and reproducible ([more on MREs](https://docs.github.com/en/get-started/writing-on-github/working-with-advanced-formatting/creating-and-highlighting-code-blocks)) - [x] Any code provided is properly formatted in code blocks, (no screenshot, [more on code blocks](https://docs.github.com/en/get-started/writing-on-github/working-with-advanced-formatting/creating-and-highlighting-code-blocks)) - [x] Any traceback provided is complete