Hetero neighbor sampler multithreading #215

kgajdamo · 2023-03-16T22:26:50Z

The main roadblocks in the context of sampler parallelization is the mapper and the sampler. However, in the case of hetero sampling, there are separate mappers for each dst node type and separate samplers for each edge type. Therefore, to avoid race condition we can use parallelization per dst node type, i.e. each thread is assigned an edge types with unique dst node type that it is working on.

Example sampling times obtained using hetero_neighbor.py benchmark:

Example end-to-end times obtained using inference_benchmark.py:

codecov-commenter · 2023-03-16T22:34:43Z

Codecov Report

Merging #215 (819af72) into master (72d2647) will not change coverage.
The diff coverage is n/a.

❗ Current head 819af72 differs from pull request most recent head b6f7ae9. Consider uploading reports for the commit b6f7ae9 to get more accurate results

@@           Coverage Diff           @@
##           master     #215   +/-   ##
=======================================
  Coverage   83.49%   83.49%           
=======================================
  Files          26       26           
  Lines         848      848           
=======================================
  Hits          708      708           
  Misses        140      140

Impacted Files	Coverage Δ
pyg_lib/csrc/sampler/cpu/neighbor_kernel.cpp	`88.72% <ø> (ø)`

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

mingfeima · 2023-03-27T14:42:59Z

The speedup does not see to be very much, do we have analysis on what is the bottleneck ?

kgajdamo · 2023-04-02T08:14:33Z

The speedup does not see to be very much, do we have analysis on what is the bottleneck ?

I think the reason is that the number of threads is limited to the number of dst nodes. As many different dst nodes there are as many threads. For the ogbn-mag dataset, this will be 4.

mingfeima · 2023-04-11T08:06:48Z

The speedup does not see to be very much, do we have analysis on what is the bottleneck ?

I think the reason is that the number of threads is limited to the number of dst nodes. As many different dst nodes there are as many threads. For the ogbn-mag dataset, this will be 4.

The code generally LGTM, but i am still curious on the limited speedup. You may try from two aspects:

can we shift to another dataset with more dst nodes.
check VTune log on ogbn-mag dataset.

kgajdamo · 2023-04-11T10:16:00Z

The speedup does not see to be very much, do we have analysis on what is the bottleneck ?

I think the reason is that the number of threads is limited to the number of dst nodes. As many different dst nodes there are as many threads. For the ogbn-mag dataset, this will be 4.

The code generally LGTM, but i am still curious on the limited speedup. You may try from two aspects:

can we shift to another dataset with more dst nodes.

check VTune log on ogbn-mag dataset.

Sure, I'll check that in VTune.

rusty1s

Looks good. A few minor comments.

pyg_lib/csrc/sampler/cpu/neighbor_kernel.cpp

rusty1s · 2023-04-28T12:00:29Z

pyg_lib/csrc/sampler/cpu/neighbor_kernel.cpp

      }
+      at::parallel_for(0, node_types.size(), 1, [&](size_t _s, size_t _e) {


Does this bring any gain? We are not doing heavy work here, so the overhead of threading might not be worth it?

To check this, I made two measurements. The first in red are measurements made with the current code. And in green is after removing at::parallel_for (what was before). The results show that the introduction of parallelization at this point gives maybe not a big but still speedup (I took the measurements on a different machine than the one mentioned in the PR description, which is why the results are different):

But maybe to be sure, I'll take 10 measurements and average the results

The average of 10 measurements is as follows:

Looks like the results are almost the same. Should I remove it then? WDYT?

I would be in favor of removing it, as it doesn't look that useful.

This reverts commit f1b10bb.

This reverts commit b58ee31.

kgajdamo requested review from rusty1s, mingfeima, mszarma and DamianSzwichtenberg March 16, 2023 22:26

kgajdamo force-pushed the hetero-sampler-mt branch from 78b60f8 to 8dd411b Compare March 18, 2023 12:05

kgajdamo force-pushed the hetero-sampler-mt branch from 8dd411b to 930dc1a Compare April 7, 2023 10:37

rusty1s reviewed Apr 28, 2023

View reviewed changes

parallel hetero neighbor sampler

6449aca

kgajdamo force-pushed the hetero-sampler-mt branch from 930dc1a to 6449aca Compare May 3, 2023 14:33

mingfeima approved these changes May 4, 2023

View reviewed changes

kgajdamo and others added 3 commits May 4, 2023 09:20

remove at::parallel

819af72

Update CHANGELOG.md

35d275c

update

b6f7ae9

rusty1s approved these changes May 4, 2023

View reviewed changes

rusty1s enabled auto-merge (squash) May 4, 2023 08:55

rusty1s merged commit f1b10bb into pyg-team:master May 4, 2023

OlhaBabicheva added a commit to OlhaBabicheva/pyg-lib that referenced this pull request May 5, 2023

Revert "Hetero neighbor sampler multithreading (pyg-team#215)"

b58ee31

This reverts commit f1b10bb.

OlhaBabicheva added a commit to OlhaBabicheva/pyg-lib that referenced this pull request May 5, 2023

Revert "Revert "Hetero neighbor sampler multithreading (pyg-team#215)""

6cd20de

This reverts commit b58ee31.

rusty1s mentioned this pull request May 10, 2023

[Roadmap] Advanced Graph Sampling Routines 🚀 pyg-team/pytorch_geometric#7331

Open

17 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Hetero neighbor sampler multithreading #215

Hetero neighbor sampler multithreading #215

kgajdamo commented Mar 16, 2023

Uh oh!

codecov-commenter commented Mar 16, 2023 •

edited

Loading

Uh oh!

mingfeima commented Mar 27, 2023

Uh oh!

kgajdamo commented Apr 2, 2023

Uh oh!

mingfeima commented Apr 11, 2023

Uh oh!

kgajdamo commented Apr 11, 2023

Uh oh!

rusty1s left a comment

Uh oh!

Uh oh!

rusty1s Apr 28, 2023

Uh oh!

kgajdamo May 2, 2023

Uh oh!

kgajdamo May 2, 2023

Uh oh!

kgajdamo May 3, 2023

Uh oh!

rusty1s May 3, 2023

Uh oh!

kgajdamo May 4, 2023

Uh oh!

Uh oh!

		}
		at::parallel_for(0, node_types.size(), 1, [&](size_t _s, size_t _e) {

Hetero neighbor sampler multithreading #215

Hetero neighbor sampler multithreading #215

Conversation

kgajdamo commented Mar 16, 2023

Uh oh!

codecov-commenter commented Mar 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

mingfeima commented Mar 27, 2023

Uh oh!

kgajdamo commented Apr 2, 2023

Uh oh!

mingfeima commented Apr 11, 2023

Uh oh!

kgajdamo commented Apr 11, 2023

Uh oh!

rusty1s left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rusty1s Apr 28, 2023

Choose a reason for hiding this comment

Uh oh!

kgajdamo May 2, 2023

Choose a reason for hiding this comment

Uh oh!

kgajdamo May 2, 2023

Choose a reason for hiding this comment

Uh oh!

kgajdamo May 3, 2023

Choose a reason for hiding this comment

Uh oh!

rusty1s May 3, 2023

Choose a reason for hiding this comment

Uh oh!

kgajdamo May 4, 2023

Choose a reason for hiding this comment

Uh oh!

Uh oh!

codecov-commenter commented Mar 16, 2023 •

edited

Loading