-
Notifications
You must be signed in to change notification settings - Fork 25.2k
Closed
Labels
oncall: cpu inductorCPU Inductor issues for Intel team to triageCPU Inductor issues for Intel team to triageoncall: pt2triagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate moduleThis issue has been looked at a team member, and triaged and prioritized into an appropriate module
Description
perf regression found - compare with 2023_08_22 nightly
Repro
bash inductor_single_test.sh
multiple inference performance suite model float32 first dynamic cpp 0
new_perf_regression
name | batch_size_new | speed_up_new | inductor_new | eager_new | compilation_latency_new | batch_size_old | speed_up_old | inductor_old | eager_old | compilation_latency_old | Ratio Speedup(New/old) | Eager Ratio(old/new) | Inductor Ratio(old/new) | Compilation_latency_Ratio(old/new) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
doctr_det_predictor | 1 | 1.069458 | 0.148391054 | 0.158697999828732 | 33.396333 | 1 | 1.503562 | 0.106578389 | 0.160247215721618 | 37.416409 | 0.71 | 1.01 | 0.72 | 1.12 |
pytorch_unet | 1 | 0.862569 | 0.310560677 | 0.267880012599213 | 18.169774 | 1 | 1.057315 | 0.24839536899999998 | 0.262632149574235 | 27.68669 | 0.82 | 0.98 | 0.8 | 1.52 |
* | * | * | * | * | * | * | * | * | * | * | * | * | * | * |
doctr_det_predictor | 1 | 0.652977 | 3.336484332 | 2.178647529656364 | 38.657023 | 1 | 1.20895 | 1.828299074 | 2.2103221655123 | 36.253952 | 0.54 | 1.01 | 0.55 | 0.94 |
pytorch_unet | 1 | 0.915661 | 5.48157092 | 5.01926071017812 | 20.518048 | 1 | 0.998196 | 4.898655984 | 4.889818808604864 | 29.142998 | 0.92 | 0.97 | 0.89 | 1.42 |
* | * | * | * | * | * | * | * | * | * | * | * | * | * | * |
bash
inductor_single_test.sh
multiple inference performance torchbench llama float32 first static default 0
new_perf_regression
name | batch_size_new | speed_up_new | inductor_new | eager_new | compilation_latency_new | batch_size_old | speed_up_old | inductor_old | eager_old | compilation_latency_old | Ratio Speedup(New/old) | Eager Ratio(old/new) | Inductor Ratio(old/new) | Compilation_latency_Ratio(old/new) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
llama | 32 | 0.578757 | 0.053369041000000006 | 0.030887706062037 | 35.195648 | 32 | 1.143321 | 0.027263855 | 0.031171337962455 | 40.613965 | 0.51 | 1.01 | 0.51 | 1.15 |
* | * | * | * | * | * | * | * | * | * | * | * | * | * | * |
SW info
SW | Nightly commit | Main commit |
---|---|---|
Pytorch | f54acf0 | bad3f2d |
Torchbench | / | 770d5cf7 |
torchaudio | dc83b38 | 66f661d |
torchtext | c11d758 | 60bea66 |
torchvision | 58366ab | a6dea86 |
torchdata | 1d231d1 | 757c032 |
dynamo_benchmarks | f228c8b | / |
Metadata
Metadata
Assignees
Labels
oncall: cpu inductorCPU Inductor issues for Intel team to triageCPU Inductor issues for Intel team to triageoncall: pt2triagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate moduleThis issue has been looked at a team member, and triaged and prioritized into an appropriate module