Disable integer dtype for rotated bounding boxes #9133

AntoineSimoulin · 2025-06-30T16:31:10Z

Context

This PR disables integer dtype for rotated bounding boxes. Indeed rotated bounding boxes typically have floating coordinates for vertices. Functions to generate boxes or to transform boxes will typically involves trigonometric sinus and cosinus functions, which does not guarantees the coordinates will be integer. Rounding to the nearest integer can lead to degenerated boxes. Chaining transformation can further exacerbate the degeneration as rounding error will compound. For these reason, we choose to raise an error in the BoundingBoxes constructor if a integer dtype tensor is passed together with a rotated bounding box format. This check is done in the _check_format static function.

Testing

We adapt the tests with the following condition to verify that tests combining integer dtype and rotated bounding box formats are indeed failing.

if not dtype.is_floating_point and (
            tv_tensors.is_rotated_bounding_format(old_format) or tv_tensors.is_rotated_bounding_format(new_format)
        ):
            pytest.xfail("Rotated bounding boxes should be floating point tensors")

We finally remove artifacts in the testing and transforms added to make sure the transforms were compatible with integer dtype for rotated boxes. This includes all the rounding operations as well as epsilon offsets in the testing.

Please run tests with:

pytest test/test_transforms_v2.py -k box -v
...
3230 passed, 5686 deselected, 678 xfailed in 129.91s (0:02:09)

pytorch-bot · 2025-06-30T16:31:14Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/vision/9133

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit 6fd0d51 with merge base 5d6e039 ():

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Build M1 Wheels / pytorch/vision / build-wheel-py3_9-cpu (gh) (trunk failure)
Process completed with exit code 1.
Build M1 Wheels / pytorch/vision / upload / upload-wheel-py3_9-cpu (gh) (trunk failure)
Unable to download artifact(s): Artifact not found for name: pytorch_vision__3.9_cpu_

This comment was automatically generated by Dr. CI and updates every 15 minutes.

NicolasHug

Thanks @AntoineSimoulin ! Made non-blocking comments below, approving assuming the CI is happy

NicolasHug · 2025-06-30T16:51:58Z

torchvision/tv_tensors/_bounding_boxes.py

@@ -111,6 +116,7 @@ def __new__(
        requires_grad: bool | None = None,
    ) -> BoundingBoxes:
        tensor = cls._to_tensor(data, dtype=dtype, device=device, requires_grad=requires_grad)
+        cls._check_format(tensor, format=format)


Since we call it only once and it's only 2 lines, we can inline it instead of making it a method. Also it might be best to do the validation before as the very first step, before calling cls._to_tensor()?

@NicolasHug, I included it within the function. However, it possible to pass list as input so it easier to run this test after the input has been converted to a tensor.

NicolasHug · 2025-06-30T16:55:57Z

torchvision/transforms/v2/functional/_meta.py

-    if clamping_mode == "hard":
-        bounding_boxes[..., 0].clamp_(0)  # Clamp x1 to 0


It's not immediately obvious that this relates to dtypes, so just flagging to make sure this change is intended?

@NicolasHug good catch. Yeah this was related to dtype and introduction of epsilon. I have remove it in a later commit attached to this PR.

NicolasHug · 2025-06-30T17:00:42Z

Forgot to add: before merging, let's add a small test in test_tv_tensors.py ensuring the error is raised on the rotated formats

AntoineSimoulin · 2025-07-01T02:06:30Z

Added a small test in test_tv_tensors.py ensuring the error is raised on the rotated formats

pytest test/test_tv_tensors.py -v -k "test_bbox_format_dtype"

…_dtype

NicolasHug · 2025-07-01T09:23:05Z

test/test_transforms_v2.py

+            # TODO there is a 1e-6 difference between GPU and CPU outputs
+            # due to clamping. To avoid failing this test, we do clamp before hand.


Thanks for writing this TODO, for GPU vs CPU it's typically OK to have differences of up to 1e-4. We should be able to pass atol and rtol to the check_kernel call, but we can address that later

github-actions · 2025-07-01T11:30:28Z

Hey @NicolasHug!

You merged this PR, but no labels were added.
The list of valid labels is available at https://github.com/pytorch/vision/blob/main/.github/process_commit.py

Co-authored-by: Nicolas Hug <contact@nicolas-hug.com> Co-authored-by: Nicolas Hug <nh.nicolas.hug@gmail.com>

Reviewed By: AntoineSimoulin Differential Revision: D79175049 fbshipit-source-id: 28e287b1dd7d874f060c220014b978a00803ba90 Co-authored-by: Nicolas Hug <contact@nicolas-hug.com> Co-authored-by: Nicolas Hug <nh.nicolas.hug@gmail.com>

disable int rotated boxes

e417da1

facebook-github-bot added the cla signed label Jun 30, 2025

remove eps in _clamp_along_y_axis

1e4d8ae

NicolasHug approved these changes Jun 30, 2025

View reviewed changes

AntoineSimoulin added 4 commits June 30, 2025 18:48

Disable torch.float64 precision in tests and transforms

ed95753

add code inline

49962b5

added test

69f1eef

lint

a4b2534

NicolasHug added 5 commits July 1, 2025 09:20

Merge branch 'main' of github.com:pytorch/vision into disable-int-rbbox

2e6a52b

Fix is_rotated_bounding_format to accept str and fix test_bbox_format…

d94e031

…_dtype

Fix affine test?

15b0d78

lint

468f55b

types

a7e8b79

NicolasHug reviewed Jul 1, 2025

View reviewed changes

NicolasHug added 2 commits July 1, 2025 10:27

Fix minor test

4416135

lint bruuuvvvv

8a26141

NicolasHug mentioned this pull request Jul 1, 2025

Enforce PIL < 11.3 #9134

Merged

Merge branch 'main' into disable-int-rbbox

6fd0d51

NicolasHug merged commit fb3926e into pytorch:main Jul 1, 2025
57 of 61 checks passed

AntoineSimoulin added a commit to AntoineSimoulin/vision that referenced this pull request Jul 1, 2025

Disable integer dtype for rotated bounding boxes (pytorch#9133)

7b58ff4

Co-authored-by: Nicolas Hug <contact@nicolas-hug.com> Co-authored-by: Nicolas Hug <nh.nicolas.hug@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Disable integer dtype for rotated bounding boxes #9133

Disable integer dtype for rotated bounding boxes #9133

Uh oh!

AntoineSimoulin commented Jun 30, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Jun 30, 2025 •

edited

Loading

Uh oh!

NicolasHug left a comment •

edited

Loading

Uh oh!

NicolasHug Jun 30, 2025

Uh oh!

AntoineSimoulin Jul 1, 2025

Uh oh!

NicolasHug Jun 30, 2025

Uh oh!

AntoineSimoulin Jul 1, 2025

Uh oh!

NicolasHug commented Jun 30, 2025

Uh oh!

AntoineSimoulin commented Jul 1, 2025 •

edited

Loading

Uh oh!

NicolasHug Jul 1, 2025

Uh oh!

Uh oh!

github-actions bot commented Jul 1, 2025

Uh oh!

Uh oh!

		if clamping_mode == "hard":
		bounding_boxes[..., 0].clamp_(0) # Clamp x1 to 0

		# TODO there is a 1e-6 difference between GPU and CPU outputs
		# due to clamping. To avoid failing this test, we do clamp before hand.

Disable integer dtype for rotated bounding boxes #9133

Disable integer dtype for rotated bounding boxes #9133

Uh oh!

Conversation

AntoineSimoulin commented Jun 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Context

Testing

Uh oh!

pytorch-bot bot commented Jun 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/vision/9133

✅ You can merge normally! (2 Unrelated Failures)

Uh oh!

NicolasHug left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

NicolasHug Jun 30, 2025

Choose a reason for hiding this comment

Uh oh!

AntoineSimoulin Jul 1, 2025

Choose a reason for hiding this comment

Uh oh!

NicolasHug Jun 30, 2025

Choose a reason for hiding this comment

Uh oh!

AntoineSimoulin Jul 1, 2025

Choose a reason for hiding this comment

Uh oh!

NicolasHug commented Jun 30, 2025

Uh oh!

AntoineSimoulin commented Jul 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

NicolasHug Jul 1, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions bot commented Jul 1, 2025

Uh oh!

Uh oh!

AntoineSimoulin commented Jun 30, 2025 •

edited

Loading

pytorch-bot bot commented Jun 30, 2025 •

edited

Loading

NicolasHug left a comment •

edited

Loading

AntoineSimoulin commented Jul 1, 2025 •

edited

Loading