Skip to content

[inductor][cpu] Perf regression #108324

@yudongsi

Description

@yudongsi

perf regression found - compare with 2023_08_22 nightly

Repro

bash inductor_single_test.sh
multiple inference performance suite model float32 first dynamic cpp 0

new_perf_regression

name batch_size_new speed_up_new inductor_new eager_new compilation_latency_new batch_size_old speed_up_old inductor_old eager_old compilation_latency_old Ratio Speedup(New/old) Eager Ratio(old/new) Inductor Ratio(old/new) Compilation_latency_Ratio(old/new)
doctr_det_predictor 1 1.069458 0.148391054 0.158697999828732 33.396333 1 1.503562 0.106578389 0.160247215721618 37.416409 0.71 1.01 0.72 1.12
pytorch_unet 1 0.862569 0.310560677 0.267880012599213 18.169774 1 1.057315 0.24839536899999998 0.262632149574235 27.68669 0.82 0.98 0.8 1.52
* * * * * * * * * * * * * * *
doctr_det_predictor 1 0.652977 3.336484332 2.178647529656364 38.657023 1 1.20895 1.828299074 2.2103221655123 36.253952 0.54 1.01 0.55 0.94
pytorch_unet 1 0.915661 5.48157092 5.01926071017812 20.518048 1 0.998196 4.898655984 4.889818808604864 29.142998 0.92 0.97 0.89 1.42
* * * * * * * * * * * * * * *

bash
inductor_single_test.sh
multiple inference performance torchbench llama float32 first static default 0

new_perf_regression

name batch_size_new speed_up_new inductor_new eager_new compilation_latency_new batch_size_old speed_up_old inductor_old eager_old compilation_latency_old Ratio Speedup(New/old) Eager Ratio(old/new) Inductor Ratio(old/new) Compilation_latency_Ratio(old/new)
llama 32 0.578757 0.053369041000000006 0.030887706062037 35.195648 32 1.143321 0.027263855 0.031171337962455 40.613965 0.51 1.01 0.51 1.15
* * * * * * * * * * * * * * *

SW info

SW Nightly commit Main commit
Pytorch f54acf0 bad3f2d
Torchbench / 770d5cf7
torchaudio dc83b38 66f661d
torchtext c11d758 60bea66
torchvision 58366ab a6dea86
torchdata 1d231d1 757c032
dynamo_benchmarks f228c8b /

cc @ezyang @msaroufim @wconstab @bdhirsh @zou3519

Metadata

Metadata

Assignees

Labels

oncall: cpu inductorCPU Inductor issues for Intel team to triageoncall: pt2triagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions