Excessive heap memory usage in raylet / owner process when shuffling many objects #14322

@ericl

To reproduce, run: python -m ray.experimental.shuffle --num-partitions=1000 --partition-size=1e6

In top, you'll see the raylet and the owner process end up using more than 10GB of heap memory. This is very unexpected, because (1) these processes should only be storing metadata, and (2) the amount of "real data" in the benchmark above is only 1GB.
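For a rough sense of scale, here is a back-of-envelope sketch of the numbers above. It assumes that --partition-size is in bytes (which matches the 1GB figure), that the shuffle is all-to-all so each map task emits one block per reducer, and that the >10GB observation can be treated as a 10GB lower bound on total metadata heap. None of these assumptions is confirmed here, and the variable names are just for illustration.

```python
# Back-of-envelope estimate. Inputs come from the repro command and the
# observation above; everything else is a labeled assumption.
num_partitions = 1000        # --num-partitions
partition_size = 10**6       # --partition-size, ASSUMED to be bytes
observed_heap = 10 * 10**9   # ">10GB", treated as a lower bound (decimal GB)

# Total "real data" moved by the benchmark.
total_data = num_partitions * partition_size

# ASSUMPTION: all-to-all shuffle, so every map task produces one
# intermediate object per reduce task.
num_intermediate_objects = num_partitions * num_partitions

# Implied heap usage per intermediate object if the heap were all metadata.
heap_per_object = observed_heap / num_intermediate_objects

print(f"total data: {total_data / 10**9:.1f} GB")
print(f"intermediate objects: {num_intermediate_objects:,}")
print(f"implied heap per object: {heap_per_object:,.0f} bytes")
```

Under those assumptions the heap usage works out to roughly 10KB per intermediate object, about ten times the size of the data being moved, which is much more than plain metadata should need.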

There might be some memory leak or other unexpected issue here.

Labels

P1: Issue that should be fixed within a few weeks
bug: Something that is supposed to be working; but isn't