Identify and list features of existing extended floating point wrapper types #1451

jrhemstad · 2024-02-28T17:26:05Z

CUDA does not have standardized classes/operators available on both host and device to support these operations, so we define it here.

As it happens, there appear to be a lot of projects and people who feel the same:

cutlass::bfloat16_t
cutlass::half_t
cute::bfloat (alias to cutlass::bfloat16_t)
cute::half (alias to cutlass::half_t)
matx::matxFp16
matx::matxBf16
cub::half_t
cub::bfloat16_t
c10::BFloat16
c10::Half
Eigen::BFloat16
Eigen::Half
cuSPARSE made some internal wrappers too

Things to look into:

RAPIDS?


weinbe2 · 2024-02-29T05:02:06Z

Kokkos has Kokkos::Experimental::half_t and Kokkos::Experimental::bhalf_t, where it'll properly wrap the data types if a given target supports it... and otherwise it's a thin wrapper around float.

It has a few levels of abstraction (as Kokkos does), but the core abstract implementation is here: https://github.com/kokkos/kokkos/blob/master/core/src/impl/Kokkos_Half_FloatingPointWrapper.hpp. As an example, the way it picks a proper type for CUDA if a given version/arch supports it is here: https://github.com/kokkos/kokkos/blob/master/core/src/Cuda/Kokkos_Cuda_Half_Impl_Type.hpp
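For readers unfamiliar with the pattern, here is a minimal sketch of "wrap the native type if the target has one, otherwise fall back to float". It is not the Kokkos implementation: the macro `SKETCH_HAS_NATIVE_HALF`, the alias `half_impl_t`, and the class `half_sketch` are hypothetical names. The native branch uses the CUDA names `__half`, `__float2half`, and `__half2float` from `cuda_fp16.h`, but only the float fallback is exercised here.

```cpp
// Hypothetical feature switch; a real library would derive it from the
// compiler, the CUDA version, and the target architecture.
#ifndef SKETCH_HAS_NATIVE_HALF
#define SKETCH_HAS_NATIVE_HALF 0
#endif

#if SKETCH_HAS_NATIVE_HALF
#include <cuda_fp16.h>
using half_impl_t = __half;  // native 16-bit storage
#else
using half_impl_t = float;   // fallback: thin wrapper around float
#endif

// The public interface is the same whichever storage type was selected.
class half_sketch {
public:
  half_sketch() = default;
  explicit half_sketch(float v) : impl_(to_impl(v)) {}
  explicit operator float() const { return to_float(impl_); }

  // Arithmetic goes through float in both configurations.
  friend half_sketch operator+(half_sketch a, half_sketch b) {
    return half_sketch(static_cast<float>(a) + static_cast<float>(b));
  }

private:
#if SKETCH_HAS_NATIVE_HALF
  static half_impl_t to_impl(float v) { return __float2half(v); }
  static float to_float(half_impl_t h) { return __half2float(h); }
#else
  static half_impl_t to_impl(float v) { return v; }
  static float to_float(half_impl_t h) { return h; }
#endif
  half_impl_t impl_{};
};
```

With the fallback active, `half_sketch(1.5f) + half_sketch(2.25f)` converts back to `3.75f`. With native storage, results would be rounded to half precision instead.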

achirkin · 2024-03-01T14:41:21Z

In rapidsai/raft ANN benchmarks, we don't have a wrapper; instead, we have a uint16_t stub for CPU-only builds. Also across raft, the lack of <type_traits> support (i.e. something like std::is_floating_point) has been an annoying issue, which still prevents parts of raft from supporting half.
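To make the `<type_traits>` problem concrete: `std::is_floating_point` is `false` for any class type, and user code is not allowed to specialize it, so generic code guarded by that trait rejects a half wrapper. One common workaround is a library-owned trait. The sketch below assumes C++17, and `half_stub`, `is_extended_floating_point`, and `is_any_floating_point_v` are hypothetical names, not raft or CCCL APIs.

```cpp
#include <cstdint>
#include <type_traits>

// Hypothetical stand-in for a 16-bit floating point wrapper.
struct half_stub {
  std::uint16_t bits;
};

// The standard trait only recognizes float, double, and long double.
static_assert(!std::is_floating_point_v<half_stub>);

// Library-owned trait that wrapper types opt into via specialization.
template <class T>
struct is_extended_floating_point : std::false_type {};

template <>
struct is_extended_floating_point<half_stub> : std::true_type {};

// Combined trait for generic code: standard or extended floating point.
template <class T>
inline constexpr bool is_any_floating_point_v =
    std::is_floating_point_v<std::remove_cv_t<T>> ||
    is_extended_floating_point<std::remove_cv_t<T>>::value;

static_assert(is_any_floating_point_v<half_stub>);
static_assert(is_any_floating_point_v<const float>);
static_assert(!is_any_floating_point_v<int>);
```

The drawback is that every library ends up with its own trait, so types from one project are still not recognized by another.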

jrhemstad · 2024-03-01T16:34:01Z

> In rapidsai/raft ANN benchmarks, we don't have a wrapper; instead, we have a uint16_t stub for CPU-only builds. Also across raft, the lack of <type_traits> support (i.e. something like std::is_floating_point) has been an annoying issue, which still prevents parts of raft from supporting half.

Thanks @achirkin, this is really helpful feedback. We've heard a lot of similar things, and so we're working on figuring out how to address all of these problems and more :)

miscco · 2024-03-05T16:55:21Z

I bricked my workstation for a few hours so here is a comparison. I tried to be diligent, but it is hard to find every function:

half

API	Kokkos	MatX	CUB	pytorch	Eigen	cutlass
cast_to_half	✔️	❌	✔️	✔️	✔️	✔️
cast_from_half	✔️	✔️	✔️	✔️	✔️	✔️
conversion to float	✔️	✔️	✔️	✔️	✔️	✔️
int constructor	✔️	✔️	✔️	✔️	✔️	✔️
converting constructor	✔️	✔️	✔️	✔️	✔️	✔️
converting assignment	✔️	✔️	❌	❌	❌	❌
---	---	---	---	---	---	---
operator+= half	✔️	✔️	✔️	✔️	✔️	✔️
operator+= float	✔️	✔️	❌	✔️	❌	❌
operator-= half	✔️	✔️	✔️	✔️	✔️	✔️
operator-= float	✔️	✔️	❌	✔️	❌	❌
operator*= half	✔️	✔️	✔️	✔️	✔️	✔️
operator*= float	✔️	✔️	❌	✔️	❌	❌
operator/= half	✔️	✔️	✔️	✔️	✔️	✔️
operator/= float	✔️	✔️	❌	✔️	❌	❌
operator+ half, half	✔️	✔️	✔️	✔️	✔️	✔️
operator+ T, half	✔️	✔️	❌	✔️	❌	❌
operator- half, half	✔️	✔️	✔️	✔️	✔️	✔️
operator- T, half	✔️	✔️	❌	✔️	❌	❌
operator* half, half	✔️	✔️	✔️	✔️	✔️	✔️
operator* T, half	✔️	✔️	❌	✔️	❌	❌
operator/ half, half	✔️	✔️	✔️	✔️	✔️	✔️
operator/ T, half	✔️	✔️	❌	✔️	❌	❌
unary+	✔️	❌	❌	❌	❌	❌
unary-	✔️	❌	❌	❌	❌	✔️
---	---	---	---	---	---	---
operator== half	✔️	✔️	✔️	❌	✔️	✔️
operator== float	❌	✔️	❌	❌	❌	❌
operator!= half	✔️	✔️	✔️	❌	✔️	✔️
operator!= float	❌	✔️	❌	❌	❌	❌
operator< half	✔️	✔️	✔️	❌	✔️	✔️
operator< float	✔️	❌	❌	❌	❌	❌
operator> half	✔️	✔️	✔️	❌	✔️	✔️
operator> float	✔️	❌	❌	❌	❌	❌
operator<= half	✔️	✔️	✔️	❌	✔️	✔️
operator<= float	✔️	❌	❌	❌	❌	❌
operator>= half	✔️	✔️	✔️	❌	✔️	✔️
operator>= float	✔️	❌	❌	❌	❌	❌
---	---	---	---	---	---	---
operator||	✔️	❌	❌	❌	❌	❌
operator&&	✔️	❌	❌	❌	❌	❌
operator++	✔️	✔️	✔️	❌	✔️	✔️
operator++(int)	✔️	✔️	✔️	❌	✔️	✔️
operator--	✔️	❌	❌	❌	✔️	✔️
operator--(int)	✔️	❌	❌	❌	✔️	✔️
---	---	---	---	---	---	---
abs	✔️	✔️	❌	❌	✔️	❌
fmod	✔️	❌	❌	❌	✔️	❌
fmax	✔️	❌	❌	❌	❌	❌
fmin	✔️	❌	❌	❌	❌	❌
fdim	✔️	❌	❌	❌	❌	❌
exp	✔️	✔️	❌	❌	✔️	❌
exp2	✔️	❌	❌	❌	❌	❌
expm1	✔️	❌	❌	❌	✔️	❌
log	✔️	✔️	❌	❌	✔️	❌
log1p	✔️	❌	❌	❌	✔️	❌
log2	✔️	✔️	❌	❌	✔️	❌
log10	✔️	✔️	❌	❌	✔️	❌
pow	✔️	✔️	❌	❌	✔️	❌
sqrt	✔️	✔️	❌	❌	✔️	✔️
cbrt	✔️	❌	❌	❌	❌	❌
hypot	✔️	❌	❌	❌	❌	❌
erf	✔️	❌	❌	❌	❌	❌
erfc	✔️	❌	❌	❌	❌	❌
tgamma	✔️	❌	❌	❌	❌	❌
lgamma	✔️	❌	❌	❌	❌	❌
signbit	❌	❌	❌	❌	❌	✔️
fpclassify	❌	❌	❌	❌	❌	✔️
copysign	❌	❌	❌	❌	❌	✔️
isinf	✔️	✔️	❌	❌	✔️	✔️
isnan	✔️	❌	❌	❌	✔️	✔️
isfinite	✔️	❌	❌	❌	✔️	✔️
isnormal	❌	❌	❌	❌	❌	✔️
ceil	✔️	✔️	❌	❌	✔️	❌
floor	✔️	✔️	❌	❌	✔️	❌
round	✔️	✔️	❌	❌	✔️	❌
trunc	✔️	❌	❌	❌	❌	❌
rint	❌	❌	❌	❌	✔️	❌
sin	✔️	✔️	❌	❌	✔️	❌
cos	✔️	✔️	❌	❌	✔️	❌
tan	✔️	✔️	❌	❌	✔️	❌
asin	✔️	✔️	❌	❌	✔️	❌
acos	✔️	✔️	❌	❌	✔️	❌
atan	✔️	✔️	❌	❌	✔️	❌
atan2	✔️	✔️	❌	❌	✔️	❌
sinh	✔️	✔️	❌	❌	❌	❌
cosh	✔️	✔️	❌	❌	❌	❌
tanh	✔️	✔️	❌	❌	✔️	❌
asinh	✔️	✔️	❌	❌	❌	❌
acosh	✔️	✔️	❌	❌	❌	❌
atanh	✔️	✔️	❌	❌	✔️	❌
numeric_limits	✔️	❌	✔️	✔️	✔️	✔️
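As a reading aid for the row names, here is a sketch of what a few of them correspond to in code. `half_like` is a hypothetical type that stores a `float` to keep the example host-only, so it illustrates the overload set and says nothing about rounding or about how any of the listed libraries implement these functions. It assumes C++17.

```cpp
#include <type_traits>

struct half_like {
  float v = 0.0f;

  half_like() = default;

  // "converting constructor" / "int constructor": any arithmetic type.
  template <class T, std::enable_if_t<std::is_arithmetic_v<T>, int> = 0>
  constexpr explicit half_like(T x) : v(static_cast<float>(x)) {}

  // "conversion to float"
  constexpr explicit operator float() const { return v; }

  // "operator+= half" and "operator+= float" are separate overloads.
  constexpr half_like& operator+=(half_like o) { v += o.v; return *this; }
  constexpr half_like& operator+=(float o) { v += o; return *this; }

  // "unary-"
  constexpr half_like operator-() const { return half_like(-v); }
};

// "operator+ half, half"
constexpr half_like operator+(half_like a, half_like b) {
  return half_like(a.v + b.v);
}

// "operator+ T, half": needed because the constructor above is explicit.
template <class T, std::enable_if_t<std::is_arithmetic_v<T>, int> = 0>
constexpr half_like operator+(T a, half_like b) {
  return half_like(static_cast<float>(a) + b.v);
}

// "operator== half" and "operator< float"
constexpr bool operator==(half_like a, half_like b) { return a.v == b.v; }
constexpr bool operator<(half_like a, float b) { return a.v < b; }
```

A ❌ in a mixed-type row means an expression such as `2 + h` or `h += 1.0f` either fails to compile or only works through an implicit conversion, depending on how the wrapper declares its constructors.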

jrhemstad · 2024-03-05T17:42:47Z

Awesome work @miscco. I think the only missing piece is to understand what each of these solutions does for atomic operations. Do they overload things like atomicAdd? Something else?

@cliffburdick was there anything you recall having to add for the matx types that you don't see reflected in the table above?
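For context on the atomics question, one generic way to get an atomic add on a 16-bit floating point value is a compare-and-swap loop on the raw bits. The host-side sketch below does this for a bfloat16 encoding using `std::atomic<std::uint16_t>`. All function names are hypothetical, the float conversion truncates rather than rounds to nearest, and nothing here describes what the libraries in the table actually do.

```cpp
#include <atomic>
#include <cstdint>
#include <cstring>

static_assert(sizeof(float) == 4, "sketch assumes 32-bit IEEE-754 float");

// bfloat16 is the upper 16 bits of a binary32 value.
inline float bf16_bits_to_float(std::uint16_t bits) {
  std::uint32_t wide = static_cast<std::uint32_t>(bits) << 16;
  float out;
  std::memcpy(&out, &wide, sizeof(out));
  return out;
}

// Truncating conversion; production code usually rounds to nearest even.
inline std::uint16_t float_to_bf16_bits(float value) {
  std::uint32_t wide;
  std::memcpy(&wide, &value, sizeof(wide));
  return static_cast<std::uint16_t>(wide >> 16);
}

// Emulated atomic add: retry until no other thread changed the bits between
// our read and our write. Returns the value observed before the add.
inline float atomic_add_bf16(std::atomic<std::uint16_t>& target, float addend) {
  std::uint16_t expected = target.load();
  for (;;) {
    const float old_value = bf16_bits_to_float(expected);
    const std::uint16_t desired = float_to_bf16_bits(old_value + addend);
    if (target.compare_exchange_weak(expected, desired)) {
      return old_value;
    }
    // On failure, expected has been refreshed with the current bits.
  }
}
```

Adding `1.0f` four times to a zero-initialized accumulator leaves the bits for `4.0f`, since small integers are exactly representable in bfloat16.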

cliffburdick · 2024-06-13T16:08:24Z

> Awesome work @miscco. I think the only missing piece is to understand what each of these solutions does for atomic operations. Do they overload things like atomicAdd? Something else?
>
> @cliffburdick was there anything you recall having to add for the matx types that you don't see reflected in the table above?

Not sure why I didn't see this. I think this is comprehensive, but one of the main issues was also having types that are compatible with their complex counterparts.

jrhemstad mentioned this issue Feb 28, 2024

Add standardized extended floating point types with complete implementations #1011

Closed

jrhemstad self-assigned this Feb 28, 2024

jrhemstad changed the title ~~Identify existing extended floating point wrapper types~~ Identify and list features of existing extended floating point wrapper types Mar 5, 2024

jrhemstad mentioned this issue Mar 5, 2024

Write up a table of features for each existing extended floating point type #1452

Closed

jrhemstad mentioned this issue Aug 2, 2024

[EPIC] Extended Floating-Point Support #31

Open

jrhemstad closed this as completed Aug 2, 2024
