
Conversation

xadupre
Contributor

@xadupre xadupre commented Sep 13, 2022

Signed-off-by: xadupre <xadupre@microsoft.com>

Description

An array of uint8 can store its content in the int32_data attribute, which uses more space than necessary; that storage form should not be needed. With this PR, any type other than string, float, double, int32, or int64 is always stored as raw data. There is then no mismatch between the element type and the attribute used to hold the data, and the tensor does not use more memory than it should.
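The rule the PR applies could be sketched as follows (illustrative pure Python, not the actual onnx.helper code; the field names follow TensorProto, but the function name is hypothetical):

```python
# Types with a dedicated TensorProto repeated field keep using it;
# everything else (uint8, int16, float16, bool, ...) goes to raw_data.
TYPED_FIELDS = {
    "string": "string_data",
    "float": "float_data",
    "double": "double_data",
    "int32": "int32_data",
    "int64": "int64_data",
}

def storage_field(elem_type: str) -> str:
    """Return the TensorProto field used to hold values of elem_type."""
    return TYPED_FIELDS.get(elem_type, "raw_data")

print(storage_field("uint8"))  # raw_data
print(storage_field("int32"))  # int32_data
```

With this rule a uint8 element occupies one byte of raw_data instead of being widened into an int32_data entry.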

Motivation and Context

Better code consistency.

Signed-off-by: xadupre <xadupre@microsoft.com>
@xadupre xadupre requested a review from a team as a code owner September 13, 2022 17:40
Signed-off-by: xadupre <xadupre@microsoft.com>
Signed-off-by: xadupre <xadupre@microsoft.com>
Signed-off-by: xadupre <xadupre@microsoft.com>
Signed-off-by: xadupre <xadupre@microsoft.com>
@xadupre xadupre changed the title [WIP] Improve consistency of tensors produced by make_tensor Improve consistency of tensors produced by make_tensor Sep 14, 2022
Member

@jcwchen jcwchen left a comment


So after this PR, there is no way to store INT16, INT8, UINT16, UINT8, BOOL, or FLOAT16 tensors in int32_data? Will it potentially break something? For instance, certain backends do not support loading ONNX tensors from raw data. To prevent this, perhaps we can still preserve a way to do that but warn users that it is not recommended.

I don't have the historical context for why a small data type is stored in a larger data type (int32_data), which wastes memory. If this PR is necessary, we need to update the proto file as well:

// INT32, INT16, INT8, UINT16, UINT8, BOOL, or FLOAT16

@xadupre
Contributor Author

xadupre commented Sep 20, 2022

This PR is not necessary, but I think it is better this way and more consistent. I'm curious to know which runtime cannot read the raw data. It should be able to cast the raw_data pointer into the proper type to read it. If a runtime cannot read the int16 format, then the converting library should modify the type of the initializer. I don't think we should keep this kind of weird behavior just to accommodate a runtime.

@gramalingam
Contributor

I suggest something in-between: we generalize the helper function make_tensor to generate the more efficient encoding, but the inefficient encoding should still be supported.

Specifically, I think we need to support backward-compatibility for pre-existing models, so we should not make proto changes that break backward-compatibility.

As long as there is an efficient encoding, and we help users create the efficient encoding, that should serve the main purpose. If someone wants to explicitly create the less efficient encoding, we should not prevent it.

@gramalingam
Contributor

Note that protobuf uses variable-length encoding for integers. So a boolean value encoded as int32 takes a single byte in the proto format. In fact, I believe values 1 to 127 are encoded using 1 byte (one bit is used as the continuation bit). Similarly, values encodable in 14 bits take 2 bytes, and values encodable in 21 bits take 3 bytes. Or, at least, that's my understanding based on a quick look at this description
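The byte counts above can be checked with a small pure-Python sketch of protobuf's unsigned-varint length (illustrative, not the protobuf library itself):

```python
def varint_size(value: int) -> int:
    """Number of bytes protobuf needs for an unsigned varint:
    7 payload bits per byte, the high bit is the continuation flag."""
    size = 1
    while value > 0x7F:
        value >>= 7
        size += 1
    return size

print(varint_size(1))      # 1
print(varint_size(127))    # 1  (largest 1-byte value)
print(varint_size(16383))  # 2  (largest 14-bit value)
print(varint_size(16384))  # 3
```

So a True stored in int32_data costs one byte on the wire, matching the comment above.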

@gramalingam
Contributor

In contrast, the raw format uses a fixed-length encoding.

@lgtm-com

lgtm-com bot commented Sep 21, 2022

This pull request introduces 1 alert when merging c78c905 into 895593a - view on LGTM.com

new alerts:

  • 1 for Unused local variable

@xadupre
Contributor Author

xadupre commented Sep 21, 2022

I updated the code to keep the former make_tensor behavior. I read the documentation in onnx.proto, and onnx implements what it says it should do, so maybe this PR is not needed. However, it still looks counterintuitive to me. We should probably run a benchmark comparing the loading/writing time when users call make_tensor(..., raw=True) versus raw=False. That would help make the right decision.
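A tiny pure-Python sketch of the space side of such a comparison (illustrative only; this packs plain lists rather than going through the actual onnx serialization path, and the function names are made up):

```python
import struct

values = list(range(256)) * 64  # 16384 uint8 values

def pack_raw() -> bytes:
    # raw_data style: fixed-width, 1 byte per uint8 element
    return bytes(values)

def pack_int32() -> bytes:
    # int32_data style with a fixed 4-byte int32 per element
    # (protobuf varints would shrink small values, per the comment above)
    return struct.pack(f"<{len(values)}i", *values)

print(len(pack_raw()), len(pack_int32()))  # 16384 65536
```

Wrapping both calls in timeit would give the loading/writing numbers the comment asks for.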

@xadupre xadupre closed this Oct 4, 2022