fix(cmake): building dynamic library for specified GPU architectures and support multi threads compile #164
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Added a CMake variable
USER_CUDA_ARCH_LIST
to allow users to specify CUDA architectures manually.If this variable is not set, CMake will automatically detect the CUDA architecture of the underlying machine and build the dynamic library accordingly.
Set the
TORCH_CUDA_ARCH_LIST
environment variable to the desired architecture. CMake will automatically read this value from the environment.for example:
dynamic library is built for sm_75 and sm_80:
dynamic library is built for local GPU: