TF-TRT CombinedNMS: fix top_k parameter max value #47698

tfeher · 2021-03-10T14:00:04Z

The TRT plugin has a limitation on the max value of the top_k input parameter. This PR modifies the converter to refuse conversion if top_k > 4096.

In some cases it might be desirable to do the conversion, even if the top_k<=4096 restriction would lead to loss of accuracy. The user can set TF_TRT_ALLOW_NMS_TOPK_OVERRIDE=1 environment variable can be set to opt-in for conversion in this case.

tfeher · 2021-03-10T14:01:49Z

Tagging @bixia1 for review and @DEKHTIARJonathan for visibility.

bixia1 · 2021-03-10T16:41:12Z

tensorflow/compiler/tf2tensorrt/convert/convert_nodes.cc

+  // TRT has a limitation: top_k <=4096.
+  if (top_k > 4096) {
+    if (AllowNmsTopkOverride()) {
+      top_k = 4096;
+      keep_top_k = std::min(top_k, keep_top_k);
+    } else {
+      return errors::InvalidArgument(
+          "TRT NMS plugin allow top_k<=4096, where top_k = max(num_boxes, "
+          "max_total_size). You can override this by setting "
+          "TF_TRT_ALLOW_NMS_TOPK_OVERRIDE=1 environment variable, but this can "
+          "result in a loss of accuracy.");
+    }
+  }


Not sure if you agree, but I think making the env for user to over the top_k value is more flexible. If that is what we will do, this code will look like this:

std::optional<int64> user_request_top_k = read from env if (user_request_top_k) { if (top_k <= user_request_top_k.value()) { issue a VLOG message saying the user request is not used because it is bigger than the natural top_k. } else { issue a VLOG to tell the "natural top_k" and user requested top_k values, and the use requested one is use top_k = user_requested_top_k.value(); keep_top_k = min(top_k, keep_top_k) } } if (top_k > 4096) return errors::InvalidaArgument(...)

I believe what we need here is a way for the user to opt-in for a conversion in case the TRT converted op is not strictly equivalent with the TF op. While your suggestion can be used to reach the same goal, I do not see the need for flexibly overriding top_k, and therefore I would choose the current simpler solution.

Why would the user want to override top_k with a concrete value? More importantly, how would he/she decide what value to use? The TF-TRT converter does this job: it sets a value for top_k based on the input parameters. If that value is not compatible with TRT, then we either skip conversion, or not (this case only with the users consent). If we do not skip conversion, then the next best thing (in terms of correctness of the results) is to set the max value allowed by the plugin.

See this comment.
Can this value affect performance? That is the only reason I can think of to allow users change the value.

TF-TRT CombinedNMS: fix top_k parameter max value

398ecd5

google-ml-butler bot added the size:M CL Change Size: Medium label Mar 10, 2021

google-ml-butler bot requested a review from joker-eph March 10, 2021 14:00

google-cla bot added the cla: yes label Mar 10, 2021

gbaned self-assigned this Mar 10, 2021

gbaned added comp:gpu:tensorrt Issues specific to TensorRT prtype:bugfix PR to fix a bug labels Mar 10, 2021

gbaned requested a review from bixia1 March 10, 2021 14:30

bixia1 reviewed Mar 10, 2021

View reviewed changes

bixia1 approved these changes Mar 11, 2021

View reviewed changes

google-ml-butler bot added kokoro:force-run Tests on submitted change ready to pull PR ready for merge process labels Mar 11, 2021

kokoro-team removed the kokoro:force-run Tests on submitted change label Mar 11, 2021

copybara-service bot merged commit c183208 into tensorflow:master Mar 12, 2021

bixia1 mentioned this pull request May 7, 2021

[TF-TRT] CombinedNms Deactivated due to bug #48950

Closed

tfeher deleted the trt_nms_topk_4096 branch May 16, 2022 08:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

TF-TRT CombinedNMS: fix top_k parameter max value #47698

TF-TRT CombinedNMS: fix top_k parameter max value #47698

Uh oh!

tfeher commented Mar 10, 2021

Uh oh!

tfeher commented Mar 10, 2021

Uh oh!

bixia1 Mar 10, 2021

Uh oh!

tfeher Mar 10, 2021

Uh oh!

bixia1 Mar 10, 2021

Uh oh!

Uh oh!

TF-TRT CombinedNMS: fix top_k parameter max value #47698

TF-TRT CombinedNMS: fix top_k parameter max value #47698

Uh oh!

Conversation

tfeher commented Mar 10, 2021

Uh oh!

tfeher commented Mar 10, 2021

Uh oh!

bixia1 Mar 10, 2021

Choose a reason for hiding this comment

Uh oh!

tfeher Mar 10, 2021

Choose a reason for hiding this comment

Uh oh!

bixia1 Mar 10, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!