Add FLOAT8E8M0 data type #7030

Conversation
Could you update https://github.com/onnx/ir-py/blob/main/src/onnx_ir/_enums.py and the tensor representations, e.g. https://github.com/onnx/ir-py/blob/fdee1e28e199f67ced802d785565ff6ebba6f63c/src/onnx_ir/_core.py#L258, as well, after consensus is reached? Thanks!
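For reference, a minimal sketch of the enum addition being requested; the value 24 mirrors the proto enum per this PR (FLOAT4E2M1 is 23), though the final name and placement in ir-py are up to its maintainers:

```python
import enum

# Hypothetical sketch of the requested onnx_ir._enums change; 24 is the
# next available value after FLOAT4E2M1 = 23 in the proto enum.
class DataType(enum.IntEnum):
    FLOAT4E2M1 = 23
    FLOAT8E8M0 = 24
```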
Force-pushed from b14d50e to cbfe6c8.
Out of curiosity: what are the benefits of each rounding mode? Did implementations differ because of the lack of a spec, or due to platform characteristics / performance considerations?
Does the proposed rounding mode attribute for Cast affect any other data types?
Given the difference in native behavior, a given backend is unlikely to implement all rounding modes, I assume. Wondering if this has implications for model portability.
@justinchuby CUDA has done extensive experiments showing that round-up gives the best accuracy and has standardized it in the CUDA spec, so round-up should essentially be the only mode that matters for MX applications. I'm OK with adding just round-up in the ONNX spec. Unfortunately OCP didn't define it this way, and efforts to correct that have not seen much progress. Other libraries have mostly chosen RNE for consistency with the other float types, but it's unlikely people will use that for MX use cases. I included the other modes as well. As the doc says, the "round_mode" attribute only applies to e8m0, so this won't interact with the existing types.
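To make the difference between the modes concrete, here is a toy illustration (mine, not from the spec): e8m0 can represent only powers of two, so the rounding mode decides which neighbor a value lands on.

```python
import numpy as np

# Toy illustration: 2.5 sits between the representable e8m0 values 2 and 4.
x = 2.5
down = 2.0 ** np.floor(np.log2(x))   # 2.0, the power of two below x
up = 2.0 ** np.ceil(np.log2(x))      # 4.0, the power of two above x
nearest = down if (x - down) <= (up - x) else up
print(nearest)  # 2.0 -> round-to-nearest keeps the smaller power of two
print(up)       # 4.0 -> round-up always takes the larger one
```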
The reference evaluator is likely going to be implemented with ml_dtypes (proposed). Is there a way to simulate the rounding mode in an efficient manner? I assume we can create a mask of everything that needs to be rounded up and manipulate those elements as a post-processing step?
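That mask-and-fix-up idea can be sketched directly. A minimal sketch, assuming ml_dtypes' default cast rounds to nearest; `cast_e8m0_roundup` is a hypothetical helper, not part of the reference evaluator, and saturation/NaN at the top of the range (biased exponent 0xFF) is not handled:

```python
import numpy as np
import ml_dtypes

def cast_e8m0_roundup(x: np.ndarray) -> np.ndarray:
    # Cast with the default (nearest) rounding first.
    x = np.asarray(x, dtype=np.float32)
    rne = x.astype(ml_dtypes.float8_e8m0fnu)
    # Mask of elements whose rounded value landed below the input.
    needs_up = rne.astype(np.float32) < x
    # e8m0 is a bare biased exponent, so incrementing the underlying byte
    # moves to the next-larger representable power of two.
    bumped = (rne.view(np.uint8) + 1).view(ml_dtypes.float8_e8m0fnu)
    return np.where(needs_up, bumped, rne)

print(cast_e8m0_roundup(np.array([2.5, 2.0])).astype(np.float32))  # [4. 2.]
```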
Codecov Report. Attention: Patch coverage is … Additional details and impacted files:

```
@@            Coverage Diff             @@
##             main    #7030      +/-   ##
==========================================
- Coverage   56.40%   56.37%   -0.04%
==========================================
  Files         510      510
  Lines       32721    32806      +85
  Branches     3093     3115      +22
==========================================
+ Hits        18457    18493      +36
- Misses      13410    13456      +46
- Partials      854      857       +3
```

☔ View full report in Codecov by Sentry.
Could you also update https://github.com/onnx/onnx/blob/main/docs/docsgen/source/technical/float8.md
Force-pushed from 092ab78 to d31890b.
Do you plan to update CastLike as well?
Review thread on onnx/backend/test/data/node/test_cast_BFLOAT16_to_FLOAT/model.onnx (outdated; resolved).
Looks good to me. Would be helpful to get another review.
Be sure to regenerate the test data. There must be some issue in CI that's not catching the discrepancies.
@gramalingam @xadupre @onnx/sig-operators
LGTM ... just had a minor comment as above.
### Description

This is a follow-up PR to #7030, which added the float8e8m0 dtype and updated the Cast op. It enables float8e8m0 for the following ops in opset 24 (a usage sketch follows below):
- QuantizeLinear, DequantizeLinear, CastLike
- Constant, ConstantOfShape, Identity, Reshape, Shape, Size, If, Loop, Scan, Flatten, Pad, Squeeze, Unsqueeze, Transpose

Signed-off-by: Yuan Yao <yuanyao@nvidia.com>
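As a hedged sketch of the MX pattern these ops enable, the following builds a blocked DequantizeLinear whose scale input is a FLOAT8E8M0 tensor; it assumes an onnx build that already defines `TensorProto.FLOAT8E8M0` (opset 24), and the names, shapes, and `block_size` are illustrative:

```python
from onnx import helper, TensorProto

# Quantized data in one of the MX element types, e8m0 block scales:
# 128 elements along axis 1, one scale per block of 32.
x = helper.make_tensor_value_info("x", TensorProto.FLOAT8E4M3FN, [1, 128])
scale = helper.make_tensor_value_info("scale", TensorProto.FLOAT8E8M0, [1, 4])
node = helper.make_node(
    "DequantizeLinear", ["x", "scale"], ["y"], axis=1, block_size=32
)
```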
This PR adds comprehensive support for the FLOAT8E8M0 data type that was added to ONNX in onnx/onnx#7030.

## Changes Made

- **Added FLOAT8E8M0 enum value**: set to 24 (the next available value after FLOAT4E2M1 = 23)
- **Updated numpy type mapping**: added support for `ml_dtypes.float8_e8m0fnu`
- **Added type properties**: configured as an 8-bit floating-point, signed type
- **Added short name**: "f8e8m0" for compact representation
- **Updated serialization**: added FLOAT8E8M0 to the appropriate sets in `serde.py` for proper tensor serialization/deserialization
- **Added tests**: included a parameterized test case and a conditional ONNX compatibility check

## Testing

The implementation includes comprehensive testing:

```python
import numpy as np
import ml_dtypes
import onnx_ir._core as ir_core
import onnx_ir._enums as enums
import onnx_ir.serde as serde

# Create a tensor with the FLOAT8E8M0 type
data = np.array([1.0, 2.0, 3.0], dtype=ml_dtypes.float8_e8m0fnu)
tensor = ir_core.Tensor(data)
assert tensor.dtype == enums.DataType.FLOAT8E8M0

# Check the new type's properties
assert enums.DataType.FLOAT8E8M0.is_floating_point()
assert enums.DataType.FLOAT8E8M0.bitwidth == 8
assert enums.DataType.FLOAT8E8M0.short_name() == "f8e8m0"

# Serialization round-trip
tensor_proto = serde.serialize_tensor(tensor)
assert tensor_proto.data_type == 24
```

All existing tests continue to pass, ensuring no regression in functionality.

Fixes #127.

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: justinchuby <11205048+justinchuby@users.noreply.github.com>
Co-authored-by: Justin Chu <justinchuby@users.noreply.github.com>
Description
Add new data type FLOAT8E8M0 and related helper functions.

Update Cast op for this new type.
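A hedged sketch of what the updated Cast could look like in practice; the PR adds a "round_mode" attribute that applies only to e8m0, and the value `"up"` here is an assumption about the final spelling of the round-up mode:

```python
from onnx import helper, TensorProto

# Assumes an onnx build where TensorProto.FLOAT8E8M0 exists and Cast
# exposes the proposed round_mode attribute.
node = helper.make_node(
    "Cast", ["x"], ["y"], to=TensorProto.FLOAT8E8M0, round_mode="up"
)
```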
Paper on CUDA's choice of round-up: https://arxiv.org/abs/2506.08027
A follow-up PR will update Q/DQ and other non-compute operators.
Motivation and Context
E8M0 serves as the common scale type for microscaling (MX) formats: https://www.opencompute.org/documents/ocp-microscaling-formats-mx-v1-0-spec-final-pdf
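For readers new to the format, a minimal sketch of E8M0 decoding as described by the OCP MX spec: the byte is a biased exponent (bias 127), 0xFF encodes NaN, and there is no sign bit, zero, or infinity, so every other byte is a power of two.

```python
import math

def decode_e8m0(byte: int) -> float:
    # 0xFF is the single NaN encoding; everything else is 2**(byte - 127).
    if byte == 0xFF:
        return float("nan")
    return math.ldexp(1.0, byte - 127)

print(decode_e8m0(127))  # 1.0
print(decode_e8m0(130))  # 8.0
```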