CUDA 9 #49
Conversation
New fp16 datatype, *_70 builds, fix include ordering in a test
Just a question
gloo/types.h
@@ -56,17 +56,27 @@ struct __attribute__((__aligned__(2))) float16 {
     return x == res.x;
   }
 #ifdef __CUDA_ARCH__
-  float16(half h) {
+  float16(__half h) {
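For context: CUDA 9 made __half an opaque struct, so a wrapper like this can no longer read its internals directly and must go through __half_raw. Below is a minimal sketch of such a conversion, assuming a float16 wrapper whose x member holds the raw bits (as in the diff above); it is illustrative, not the actual gloo code.

#include <cuda_fp16.h>
#include <cstdint>

struct float16 {
  uint16_t x;  // raw IEEE-754 binary16 bit pattern
#ifdef __CUDA_ARCH__
  // CUDA 9: __half is opaque, so convert through __half_raw,
  // which exposes the underlying 16-bit pattern as .x.
  __device__ float16(__half h) {
    x = __half_raw(h).x;
  }
  // Rebuild a __half from the stored bits for the reverse direction.
  __device__ operator __half() const {
    __half_raw r;
    r.x = x;
    return __half(r);
  }
#endif
};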
Is it OK that the constructor for half is gone now? (I don't know where that type is defined, TBH.)
I'm not 100% sure either :) I've always known the type as __half, and that's what is defined in cuda_fp16.h.
I think half and __half are interchangeable. cuda_fp16.h has the following code:
#ifndef CUDA_NO_HALF
typedef __half half;
typedef __half2 half2;
#endif /*CUDA_NO_HALF*/
Should we stick to one for consistency? I vote for half. :)
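If it helps, a compile-time check confirms they are literally the same type. This is just a sketch, assuming nvcc with C++11 and CUDA_NO_HALF not defined:

#include <cuda_fp16.h>
#include <type_traits>

// Both asserts hold because half and half2 are plain typedefs
// of __half and __half2 in cuda_fp16.h.
static_assert(std::is_same<half, __half>::value, "half is an alias for __half");
static_assert(std::is_same<half2, __half2>::value, "half2 is an alias for __half2");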
Yeah, some more directed grepping found that for me too. I don't care which one is used.
set(gloo_known_gpu_archs "20 21(20) 30 35 50 52 60 61") | ||
set(gloo_known_gpu_archs7 "20 21(20) 30 35 50 52") | ||
set(gloo_known_gpu_archs "30 35 50 52 60 61 70") | ||
set(gloo_known_gpu_archs7 "30 35 50 52") |
Could you add a gloo_known_gpu_archs8 so that we don't break the CUDA 8 builds? AFAICT this is what's tripping up the Travis build.
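Hypothetically, the requested split might look like the sketch below. This is not the actual patch: CUDA_VERSION comes from CMake's FindCUDA, and the exact CUDA 8 arch list here is an assumption.

# Sketch only: per-toolkit arch lists so an older nvcc never sees
# compute capabilities it cannot compile.
set(gloo_known_gpu_archs  "30 35 50 52 60 61 70")  # CUDA 9
set(gloo_known_gpu_archs7 "30 35 50 52")           # CUDA 7.x
set(gloo_known_gpu_archs8 "30 35 50 52 60 61")     # CUDA 8.x (assumed)
# Pick the list the detected toolkit can actually compile.
if(CUDA_VERSION VERSION_LESS "8.0")
  set(gloo_known_gpu_archs ${gloo_known_gpu_archs7})
elseif(CUDA_VERSION VERSION_LESS "9.0")
  set(gloo_known_gpu_archs ${gloo_known_gpu_archs8})
endif()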
Done and pushed
@pietern has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Thanks for adding this @slayton58!
Summary: Adds basic CUDA 9 support, including adding the Volta arch and making appropriate modifications for half-precision datatype changes.
Closes pytorch/gloo#49
Differential Revision: D5315336
Pulled By: pietern
fbshipit-source-id: 6468b0f357206d604bdcfec69ba82509a2c91407