Skip to content

Conversation

alogfans
Copy link
Collaborator

To enable fully auto detection of topology in Transfer Engine, just leave the device name empty in the Python interface.

@@ -14,21 +14,46 @@

#include "memory_location.h"

#ifdef USE_CUDA
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

By default, USE_CUDA is off. So we need to set USE_CUDA as ON?

@doujiang24
Copy link
Collaborator

almost LGTM, need main merge.

@stmatengss
Copy link
Collaborator

@alogfans It appears there are some conflicts in TransferEngine.init() introduced by the p2phandshade PR.

Copy link
Collaborator

@doujiang24 doujiang24 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM~

@doujiang24 doujiang24 merged commit 0f61e46 into kvcache-ai:main Apr 16, 2025
7 checks passed
doujiang24 added a commit to doujiang24/Mooncake that referenced this pull request Apr 16, 2025
Signed-off-by: doujiang24 <doujiang24@gmail.com>
ShangmingCai pushed a commit that referenced this pull request Apr 16, 2025
Signed-off-by: doujiang24 <doujiang24@gmail.com>
@whybeyoung
Copy link

whybeyoung commented Apr 16, 2025

emmm.... what about this case.....
image

prefill.log

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants