Rework default Docker images #2799
…-efa-version-of-the-default-docker-image
@@ -1,3 +1,3 @@
__version__ = "0.0.0"
__is_release__ = False
base_image = "0.9"
You should change the image version on master only after the images are built and published, so it's better to change it in a separate PR.
Because we also change the format of the Docker image name, it's not possible to change it in a separate PR.
Moreover, we can build the new VM images from the PR branch before the merge, so it's OK.
Ok, let's build the images from the PR then and merge when published.
src/dstack/_internal/server/background/tasks/process_running_jobs.py
Outdated
@@ -1,15 +1,15 @@
ARG BASE_IMAGE=dstackai/base:py3.12-0.7-cuda-12.1
# syntax = edrevo/dockerfile-plus
Can we live without it? It's an unfamiliar dependency that is no longer maintained.
Without this dependency, we would need to duplicate the code.
src/tests/_internal/server/background/tasks/test_process_running_jobs.py
Outdated
Rolled back the removal of the PythonVersion enum; minor fixes
PR review comments
…and `NCCL_SOCKET_IFNAME` to the base images
…lude`, and `NCCL_SOCKET_IFNAME` to base images. - [x] Updated the list of EFA-enabled AWS EC2 instances
…and NCCL_SOCKET_IFNAME to base images. (bugfix)
This PR does:
- … `mpirun` and `/opt/nvcc-tests/build` to the `base` image
- … `devel-efa` image that comes with `/opt/nvcc-tests/build` and `uv` (fixes [UX] Pre-build a EFA version of the default Docker image #2793)
- … `python` (not only minor versions)
- … `devel-efa` on EFA-enabled AWS instances
- … `OMPI_MCA_pml`, `OMPI_MCA_btl`, `OMPI_MCA_btl_tcp_if_exclude`, `NCCL_SOCKET_IFNAME`. See an example.

To be done separately:
Staged Docker images: https://hub.docker.com/repository/docker/dstackai/base-stgn/tags?name=ubuntu22.04&page=1
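
For readers unfamiliar with the environment variables named in the PR description (`OMPI_MCA_pml`, `OMPI_MCA_btl`, `OMPI_MCA_btl_tcp_if_exclude`, `NCCL_SOCKET_IFNAME`), here is a minimal sketch of how image-level defaults for them could be layered under user-provided values. The values shown are illustrative placeholders, not the ones this PR sets, and `apply_defaults` is a hypothetical helper that does not exist in the codebase.

```python
import os

# Placeholder values for illustration only; the defaults actually baked
# into the base images by this PR are not shown in the conversation.
ILLUSTRATIVE_DEFAULTS = {
    "OMPI_MCA_pml": "ob1",
    "OMPI_MCA_btl": "tcp,self",
    "OMPI_MCA_btl_tcp_if_exclude": "lo,docker0",
    "NCCL_SOCKET_IFNAME": "^lo,docker0",
}


def apply_defaults(env, defaults=ILLUSTRATIVE_DEFAULTS):
    """Return a copy of env with defaults added only for unset variables."""
    merged = dict(env)
    for name, value in defaults.items():
        # setdefault keeps any value the user has already provided.
        merged.setdefault(name, value)
    return merged


if __name__ == "__main__":
    effective = apply_defaults(os.environ)
    for name in ILLUSTRATIVE_DEFAULTS:
        print(f"{name}={effective[name]}")
```

The point of the sketch is only the precedence: a value set explicitly in the run configuration wins over whatever the image provides.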