> Hi, why do you believe that bfloat16 is not supported?
Apologies, I appear to be talking nonsense. I conflated bfloat16 with nvidia's other wacky floating point formats. This is probably my cue to stop answering reddit/HN comments and go to bed. :D
So, ahem: bfloat16 support is basically just missing the (fairly boring) header.
> Regarding cublasLt, what is your plan for support there? Pass everything through to hipblasLt (hipify style) or something else?
Pretty much that, yes. Not much point reimplementing all the math libraries when AMD is doing that part of the legwork already.
OK, so in the case of llm.c, if you're just including the HIP headers, using hipblasLt, etc, what would be the benefit of using scale instead of hipify?