Use sets to improve performance of dfs search #6213

kylesayrs · 2024-07-04T20:47:49Z

Description

This change modifies the DFS used in model extraction to track nodes in sets rather than iterables.
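For readers who want a picture of the idea, here is a minimal sketch of a set-based reachability search. It is not the code in onnx/utils.py: the `Node` dataclass, the `dfs_reachable` name, and the iterative stack are all assumptions made for illustration, and the real implementation may differ.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical stand-in for an ONNX node: only input/output names."""
    input: list[str] = field(default_factory=list)
    output: list[str] = field(default_factory=list)


def dfs_reachable(nodes, output_name, graph_input_names, reachable, unreachable):
    """Collect the indices of nodes needed to compute `output_name`.

    `reachable` and `unreachable` are sets of node indices, updated in place.
    """
    stack = [output_name]
    while stack:
        name = stack.pop()
        if name in graph_input_names:
            continue  # stop at the requested graph inputs
        # Only nodes that have not been reached yet are candidates.
        found = [i for i in unreachable if name in nodes[i].output]
        for i in found:
            unreachable.discard(i)
            reachable.add(i)
            stack.extend(nodes[i].input)


nodes = [
    Node(input=["x"], output=["a"]),
    Node(input=["a"], output=["b"]),
    Node(input=["a"], output=["c"]),  # not needed to compute "b"
]
reachable, unreachable = set(), set(range(len(nodes)))
dfs_reachable(nodes, "b", {"x"}, reachable, unreachable)
print(sorted(reachable))    # [0, 1]
print(sorted(unreachable))  # [2]
```

Membership tests and removals on a set take constant time on average, so the bookkeeping per visited node no longer grows with the number of nodes already collected.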

Motivation and Context

While shape inference is the most significant bottleneck, these changes are a step in the direction of being able to support model extraction for very large graphs.

Test Script

import onnx
from onnx.utils import Extractor

model = onnx.load("obertquant.onnx")
extractor = Extractor(model)
extracted_model = extractor.extract_model(
    input_names=["input_ids", "attention_mask", "token_type_ids"], output_names=["2058"]
)
onnx.save(extracted_model, "truncated.onnx")

Benchmarks were produced with pyinstrument by profiling the Extractor.extract_model function.

Model Name	Num Nodes	Previous	New
obertquant.onnx	1271	0.158s	0.110s
ai-town-3B.onnx	3515	8.002s	3.725s

codecov · 2024-07-04T20:56:03Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 57.11%. Comparing base (83194ed) to head (01d56d4).
Report is 61 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #6213      +/-   ##
==========================================
+ Coverage   56.95%   57.11%   +0.15%     
==========================================
  Files         506      506              
  Lines       30467    31087     +620     
  Branches     4592     4624      +32     
==========================================
+ Hits        17353    17754     +401     
- Misses      12285    12505     +220     
+ Partials      829      828       -1


onnx/utils.py

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

xadupre · 2024-07-04T22:00:23Z

It seems reachable + unreachable = set of nodes. Do you think it is doable to remove unreachable and simplify the function signature, assuming performance stays the same?

kylesayrs · 2024-07-05T03:27:24Z

@xadupre The reachable set must be maintained, since producing it is the purpose of the function, which leaves the unreachable set as the one to remove. Unfortunately, without it the function must iterate through all node indices in order to visit the complement of reachable:

nodes_to_search = [
    index
    for index in range(len(nodes))
    if node_output_name in nodes[index].output and index not in reachable
]

This leads to a longer runtime for larger models:

Model Name	Num Nodes	Previous	New	One Set
obertquant.onnx	1271	0.158s	0.110s	0.118s
ai-town-3B.onnx	3515	8.002s	3.725s	6.094s
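To illustrate why the one-set variant scans more, here is a small sketch comparing the two candidate scans. The function names and the toy chain graph are hypothetical, and the sketch only shows how many indices each scan walks; the timings in the table above are the measured results.

```python
def candidates_two_sets(outputs, name, unreachable):
    """Scan only the nodes that have not been reached yet."""
    return [i for i in unreachable if name in outputs[i]]


def candidates_one_set(outputs, name, reachable):
    """Scan every node index, filtering out those already reached."""
    return [
        i
        for i in range(len(outputs))
        if name in outputs[i] and i not in reachable
    ]


# Hypothetical chain graph: node i produces the tensor named "t{i}".
outputs = [[f"t{i}"] for i in range(1000)]
reachable = set(range(900))          # pretend 900 nodes are already reached
unreachable = set(range(900, 1000))  # so only 100 remain to be scanned

print(candidates_two_sets(outputs, "t950", unreachable))  # [950]
print(candidates_one_set(outputs, "t950", reachable))     # [950]
```

Both scans return the same candidates, but the one-set version walks all 1000 indices on every call, while the two-set version walks only the 100 that remain.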

kylesayrs requested a review from a team as a code owner July 4, 2024 20:47

kylesayrs force-pushed the extractor-dfs-optimization branch from 5714577 to 61e7fb1 Compare July 4, 2024 20:50

github-advanced-security bot found potential problems Jul 4, 2024

kylesayrs force-pushed the extractor-dfs-optimization branch from c1ca823 to e75c67a Compare July 4, 2024 21:11

github-advanced-security bot found potential problems Jul 4, 2024

dfs performance improvements

01d56d4

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs force-pushed the extractor-dfs-optimization branch from e75c67a to 01d56d4 Compare July 4, 2024 21:22

kylesayrs changed the title ~~Use sets to increase performance of dfs search~~ Use sets to improve performance of dfs search Jul 5, 2024

xadupre approved these changes Jul 5, 2024

xadupre added this pull request to the merge queue Jul 5, 2024

Merged via the queue into onnx:main with commit c1dffb6 Jul 5, 2024
38 checks passed

kylesayrs mentioned this pull request Jul 5, 2024

Extractor dfs performance neuralmagic/deepsparse#1655

Closed
