[Pubsub] Generalize the pubsub interface and adapt it for ref counting protocol #15446

rkooo567 · 2021-04-21T23:24:43Z

Why are these changes needed?

This is fully review-able

Recommended review workflow.

First, look at core_worker.cc and reference_count.cc to see how the new interface looks like.
Check subscriber.h and publisher.h.
And then review other parts.

The current interface is not ideal, but it is working pretty well with the current status. Let's iterate on the interface!

I verified this reduced the number of WaitForRefRemoved requests.

Related issue number

Close #14322
#14762

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

rkooo567 · 2021-04-23T03:10:32Z

Current test failure should be temporary,.

clarkzinzow

Did a usage first pass, I have a few high-level pubsub design questions before I dig deeper.

It looks like we're using a channel abstraction in order to be able to piggyback off of shared publisher <--> subscriber long-polling connections, where the publisher and subscriber interfaces expose the channel concept to the application code via the channel type and deal only with a generic PubMessage message type that oneofs across the messages for the different channels. This then entails pass-through methods to the subinterfaces for each channel, maintaining channel --> data structure maps internally, and leaking what is essentially a transport detail (the long-polling connection) to the application level by requiring publishers/subscribers to specify the channel type. I'm wondering if there's a better way to structure this such that the application code doesn't have to worry about channels and more of this logic is encoded in the type system so we have compile-time guarantees that things are 👌 .

What if we adopted a pattern similar to gRPC's shared transport channel, generalized from managing a 1-to-1 connection to managing 1-to-n connections, where we have a transport broker concept that encapsulates the publisher <--> subscriber long-polling connections. The subscriber interface for each pubsub instance takes a shared pointer to the broker on construction, and can therefore share the broker across different pubsub instances (e.g. separate subscriber objects for WAIT_FOR_OBJECT_EVICTION and WAIT_FOR_REF_REMOVED), where new publisher <--> subscriber connections can be registered by any of the pubsub instances at subscription time.

subscribers <--> broker <--transport--> broker <--> publishers

I think there could be a few advantages here:

We could then expose a subscriber interface per pubsub instance, templated on the concrete message type (no need for a proto oneof message to be exposed to the application code, that could be isolated to the broker), which will give better compile-time checking of the application-level pubsub messages and will obviate the need for those passthrough methods in a top-level subscriber interface that requires a channel type to be specified.
I think that you'd still need a proto-level channel type enum, but I think that you could get away with specializing the subscriber with the enum value itself. E.g. Subscriber<typename MessageID, typename Message, rpc::ChannelType channel_type> could be specialized as Subscriber<ObjectID, rpc::WaitForObjectEvictionMessage, rpc::WAIT_FOR_OBJECT_EVICTION>, where the channel_type is included when interacting with the broker (e.g. subscribe, unsubscribe). Dispatch of messages received in the long-poll response by the broker will still have to use a channel --> subscriber table, but that can be populated dynamically when the subscriber is constructed: broker_.RegisterSubscriber(channel_type, this).
The transport-level optimization of sharing long-polling connections is pushed down as an implementation detail of the transport broker concept, and is a natural place to evolve other transport-level details, like switching to a bidi streaming connection or switching to a centralized message broker.
Other transport-level things, like the core worker client pool and the subscriber address/port, can be moved to the broker. Publisher addresses will probably continue to be required by subscriber methods, unless we want the broker to always map worker or node IDs to concrete addresses (probably less efficient in some cases?).
The current process of registering a new channel will be replaced by adding a new channel type to the proto enum, adding the channel's proto to the broker's pubsub message oneof, templating a subscriber with a new message type and the channel type, and reusing the existing broker. I don't think there should be much more required than that.

I think that you would have an analogous broker abstraction on the publisher side as well (encapsulating the cached long-polling connection, controlling the message batching), although I haven't thought it through as much since I think it should be simpler than the subscriber-side. My guess is that the publisher would be similarly templated, would similarly register itself with the publish-side broker on construction, and each publisher instance (one per channel type) would have its own subscription index. I think that the publisher's Subscriber would be internal to the publish-side broker.

The cons with this broker approach, as I see them:

Broker concept is exposed to the application level, although what's needed to configure that concept (worker client pool, subscriber address/port, etc.) is already exposed to the application level, so I'm not sure if this is any worse?
A shared transport-level broker will obviously have to be thread-safe, although that shouldn't be very difficult.
Hidden dragons that I haven't thought of.

Lmk what you think about this approach and whether I'm missing anything! This is also something that could wait until a future PR, e.g. when I port the OBOD object location subscriptions or the GCS node/actor tables to this pubsub I'm sure that I'll have to refactor some stuff anyway.

src/ray/core_worker/core_worker.cc

src/ray/core_worker/reference_count.h

src/ray/pubsub/publisher.cc

src/ray/pubsub/subscriber.cc

rkooo567 · 2021-04-23T22:31:38Z

Thanks for the suggestion! Did I understand correctly the below API examples are what you are describing?

# either this
# manages the connection
broker_ 
# manages the tracking
wait_for_object_channel = Channel<ObjectID, MessageType>(broker)
wait_for_object_channel->interface();

# or this

subscriber_broker->Channel(WaitForObjectEviction)->interface()

rkooo567 · 2021-04-26T18:56:24Z

@jovany-wang

BTW, I prefer name the channels to OBJECT_EVICTION and REF_REMOVED(Remove the WAIT_FOR_ prefix.).

Yeah it makes sense. But we probably want to have a prefix to distinguish worker-based channel and gcs-based channels? What about WORKER_OBJECT_EVICTION?

clarkzinzow · 2021-04-26T20:21:52Z

But one subscriber can still subscribe one channel right? Are you saying not quite channel because subscriber "subscribes" channels (that says, due to semantics) or is there other reasons?

I'm saying that the different pubsub "channels" are presented to the application-level code as independent subscriber objects, with APIs that are channel-agnostic. The broker should hide all of the details around managing subscriber <--> publisher connections, and the multiplexing of multiple different pubsub channels over a single subscriber <--> publisher connection should be hidden from the application code.

So, I suggest to move forward with the current impl as you mentioned here (This is also something that could wait until a future PR), and we compile a design doc that includes general interface of this and perform refactor in a separate PR if necessary. What do you think?

Definitely, that sounds like the best route to me! I should be able to get to another review pass of this PR today.

jovany-wang · 2021-04-27T03:14:38Z

@jovany-wang

BTW, I prefer name the channels to OBJECT_EVICTION and REF_REMOVED(Remove the WAIT_FOR_ prefix.).

Yeah it makes sense. But we probably want to have a prefix to distinguish worker-based channel and gcs-based channels? What about WORKER_OBJECT_EVICTION?

I totally agree that adding a prefix to distinguish worker-based channels and gcs-based channels. But I don't have an idea on what the best prefix format is~

clarkzinzow

LGTM! I mostly have nits, although we should definitely fix the more-than-one-move when queueing messages before merging, but I trust that you'll fix that. I'm also still of the opinion that we should try to register subscriptions before adding ref deletion callbacks (which include unregistering subscriptions) to prevent surprise leaks of subscriptions in the future (thread here), lmk what you think about that.

I also didn't get a chance to say this on the initial PR, but great work on the tests! 🙌

release/data_processing_tests/multi_node.yaml

src/ray/core_worker/reference_count.cc

src/ray/core_worker/reference_count_test.cc

src/ray/pubsub/publisher.cc

src/ray/pubsub/publisher.h

src/ray/pubsub/subscriber.cc

clarkzinzow · 2021-05-03T22:37:13Z

Btw, it looks like there's an ASAN failure for a reference count test that might need attention.

rkooo567 · 2021-05-03T22:49:00Z

@clarkzinzow Yeah I am aware of that one. It is super weird because it is not a memory-related error + never been reproducible from my laptop... I will make sure to fix this issue (probably there are some funny mistakes).

fishbone

This PR is too big. We should avoid this as much as possible in the future. I haven't reviewed subscriber.h/cc yet. Except for these, all the rest has been reviewed.

src/ray/common/ray_config_def.h

src/ray/core_worker/core_worker.cc

src/ray/protobuf/pubsub.proto

src/ray/pubsub/mock_pubsub.h

src/ray/core_worker/reference_count.cc

src/ray/pubsub/publisher.h

src/ray/pubsub/publisher.cc

fishbone · 2021-05-04T05:42:54Z

src/ray/pubsub/subscriber.cc

+/// Subscriber
+///////////////////////////////////////////////////////////////////////////////
+
+inline std::shared_ptr<SubscribeChannelInterface> Subscriber::Channel(


why inline here?

because this function is frequently called, but pretty tiny.

Looks like inline doesn't mean the function is actually inlined. I will move the definition to the header instead to get true inline functions.

This reverts commit 9a6a521.

jovany-wang

Looks pretty good now. I left some minor comments.

src/ray/core_worker/core_worker.cc

src/ray/core_worker/reference_count.cc

src/ray/pubsub/publisher.cc

src/ray/pubsub/subscriber.cc

rkooo567 · 2021-05-13T16:28:01Z

Docker / Mac wheel build times out frequently. The flaky test seems to pass, but it failed for some weird reasons. I will just merge it.

There are already 2 approvals, and I didn't receive new reviews for a while, so I will just merge it.

rkooo567 added 13 commits April 8, 2021 09:03

Add mock code first

bffdc63

Merge branch 'master' into generalize-pubsub-3

663bd30

In the initial progress.

59145fd

Merge branch 'master' into generalize-pubsub-3

7169c4f

Fix the number error

6c13cce

In progress.

d17b8ac

in more pgoress.

0c125a5

in progress.

de2fca2

lint.

43be41f

Prototype done.

0d8b412

Fix compilation bug.

2b5fdc9

Now it is working with reference counting.

0dad91b

Merge branch 'master' into generalize-pubsub-4

db69af9

rkooo567 changed the title ~~Generalize pubsub 4~~ [Pubsub] Generalize the pubsub interface so that reference counting can use it Apr 21, 2021

rkooo567 changed the title ~~[Pubsub] Generalize the pubsub interface so that reference counting can use it~~ [Pubsub] Generalize the pubsub interface and adapt it for ref counting protocol Apr 21, 2021

rkooo567 assigned wuisawesome, clarkzinzow and fishbone Apr 22, 2021

rkooo567 added 3 commits April 22, 2021 15:54

Remove template.

ce38a32

lint.

e45f406

Fixed issues.

dac1369

rkooo567 added 4 commits April 22, 2021 20:45

Fix reference count test.

08cb415

Reference count test passes now.

a9a6f83

Fixed the test array problem

b5b1831

Merge branch 'master' into generalize-pubsub-4

312c5da

clarkzinzow reviewed Apr 23, 2021

View reviewed changes

rkooo567 mentioned this pull request Apr 24, 2021

Excessive heap memory usage in raylet / owner process when shuffling many objects #14322

Closed

Addressed code review.

7d387ad

Merge branch 'master' into generalize-pubsub-4

5c59b72

rkooo567 unassigned wuisawesome May 3, 2021

clarkzinzow approved these changes May 3, 2021

View reviewed changes

clarkzinzow added the @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. label May 4, 2021

fishbone previously requested changes May 4, 2021

View reviewed changes

rkooo567 added 10 commits May 5, 2021 21:55

Merge branch 'master' into generalize-pubsub-4

d9cdc61

Merge branch 'master' into generalize-pubsub-4

393f1ef

Addressed half of code review.

e9cbaed

Fix tests.

b10eed5

Addressed the most critical issue.

78c51a0

Make subscriber thread-safe.

9a6a521

Revert "Make subscriber thread-safe."

6942828

This reverts commit 9a6a521.

Fixed test failures. The only failure now is the asan failure.

a070afc

Merge branch 'master' into generalize-pubsub-4

0e8e38b

Reset test suites and see if it fixes the issue.

620d726

rkooo567 requested review from jovany-wang and fishbone May 12, 2021 16:47

rkooo567 removed the @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. label May 12, 2021

rkooo567 mentioned this pull request May 12, 2021

Ensure output params are initialized before calling IsPlasmaObjectPinnedOrSpilled() #15758

Merged

6 tasks

Fix a flaky test

456b6f9

jovany-wang approved these changes May 13, 2021

View reviewed changes

Addressed code review.

9994516

rkooo567 merged commit 259fcbd into ray-project:master May 13, 2021

[Pubsub] Generalize the pubsub interface and adapt it for ref counting protocol #15446

[Pubsub] Generalize the pubsub interface and adapt it for ref counting protocol #15446

Uh oh!

Conversation

rkooo567 commented Apr 21, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why are these changes needed?

Related issue number

Checks

Uh oh!

rkooo567 commented Apr 23, 2021

Uh oh!

clarkzinzow left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rkooo567 commented Apr 23, 2021

Uh oh!

rkooo567 commented Apr 26, 2021

Uh oh!

clarkzinzow commented Apr 26, 2021

Uh oh!

jovany-wang commented Apr 27, 2021

Uh oh!

clarkzinzow left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

clarkzinzow commented May 3, 2021

Uh oh!

rkooo567 commented May 3, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fishbone left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fishbone May 4, 2021

Choose a reason for hiding this comment

Uh oh!

rkooo567 May 11, 2021

Choose a reason for hiding this comment

Uh oh!

rkooo567 May 13, 2021

Choose a reason for hiding this comment

Uh oh!

jovany-wang left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rkooo567 commented Apr 21, 2021 •

edited

Loading

clarkzinzow left a comment •

edited

Loading

clarkzinzow left a comment •

edited

Loading

rkooo567 commented May 3, 2021 •

edited

Loading