nearbyhint: Fix for ffast-math #550

RafaGago · 2021-08-31T19:17:26Z

See commit text.

This fixes:
#548

Note that I wasn't able to verify it on Visual Studio, only on g++ and Clang.

The fix in 45cad81 did not work on Clang. Under -ffast-math the compiler is free to assume that "x + v - v = x". 45cad81 worked around this by storing "x + v" in a volatile variable. For Clang this was not enough to stop the optimization, as it correctly detected that the variable has local scope, so no one can take a reference to it. This commit reworks the fix by defining a function that performs the operation and disabling optimizations on that function for all supported compilers (and those using the same frontend). For unsupported compilers an #error is emitted, because the workaround was not safe enough: it could even break between compiler versions. This avoids potentially weird behaviour in the future.
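As a rough scalar illustration of the approach described above (not the code from this commit), the operation can be moved into its own function whose optimization is disabled only when fast-math is active. The macro name `SKETCH_NO_FAST_MATH_OPT`, the function name `nearbyint_sketch`, the constant 2^23 and the choice of `optnone` / `optimize("O0")` are all assumptions made for this sketch; whether these attributes fully suppress the rewrite has to be checked per compiler and version.

```cpp
#include <cassert>

// Hypothetical macro: only active when the compiler reports fast-math.
#if defined(__FAST_MATH__)
#  if defined(__clang__)
#    define SKETCH_NO_FAST_MATH_OPT __attribute__((optnone))
#  elif defined(__GNUC__)
#    define SKETCH_NO_FAST_MATH_OPT __attribute__((optimize("O0")))
#  else
#    error "No known way to protect the rounding trick on this compiler"
#  endif
#else
#  define SKETCH_NO_FAST_MATH_OPT
#endif

// Rounds v to the nearest integer by adding and subtracting 2^23.
// Adding 2^23 pushes the value into a range where float has no
// fractional bits, so the addition itself performs the rounding.
// Under fast-math the compiler may fold (v + t2n) - t2n back to v,
// which is what the attribute is meant to prevent.
SKETCH_NO_FAST_MATH_OPT float nearbyint_sketch(float v)
{
    // Assumed input range for this sketch: small non-negative values.
    assert(v >= 0.0f && v < 4194304.0f);
    const float t2n = 8388608.0f; // 2^23 (assumed constant for float)
    float d0 = v + t2n;
    return d0 - t2n;
}
```

With the default round-to-nearest-even mode, `nearbyint_sketch(2.5f)` yields `2.0f` and `nearbyint_sketch(3.5f)` yields `4.0f`. Keeping the `#error` branch means an unknown compiler fails loudly instead of silently returning the unrounded input.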

serge-sans-paille · 2021-08-31T19:48:19Z

include/xsimd/arch/generic/xsimd_generic_math.hpp

@@ -1707,6 +1707,40 @@ namespace xsimd {
    }


+#if !defined(__FAST_MATH__)


Interesting approach! I wonder if the following works as expected and is faster or not:

namespace details {
    template <class T>
#if defined(__FAST_MATH__) && (defined(__clang__) || defined(__GNUC__))
    __attribute__((noinline))
#endif
    T force_add(T x, T y) {
        return x + y;
    }
}

Then call it where appropriate.

RafaGago · 2021-09-01

I'm at $DAILY_JOB right now. Without trying it, I'd say that a competent compiler will still optimize it away.

But note that the optimization we want to break is -ffast-math related. Inlining should make no difference.

serge-sans-paille · 2021-08-31T20:04:48Z

Can you try the following patch instead? If it works, it's much less intrusive.

diff --git a/include/xsimd/arch/generic/xsimd_generic_math.hpp b/include/xsimd/arch/generic/xsimd_generic_math.hpp
index 56e4d98..2ef41de 100644
--- a/include/xsimd/arch/generic/xsimd_generic_math.hpp
+++ b/include/xsimd/arch/generic/xsimd_generic_math.hpp
@@ -1722,8 +1722,8 @@ namespace xsimd {
         // to v. That's not what we want, so prevent compiler optimization here.
         // FIXME: it may be better to emit a memory barrier here (?).
 #ifdef __FAST_MATH__
-        volatile batch_type d0 = v + t2n;
-        batch_type d = *(batch_type*)(void*)(&d0) - t2n;
+        volatile auto d0 = (v + t2n).data;
+        batch_type d = batch_type(d0) - t2n;
 #else
         batch_type d0 = v + t2n;
         batch_type d = d0 - t2n;
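A scalar analogue of the patch above may make the idea easier to follow. This is only a sketch: `nearbyint_volatile_sketch` is a hypothetical name, plain `float` stands in for `batch_type`, and 2^23 is an assumed value for `t2n`. The intermediate sum is written to a volatile object and read back through that same volatile object, with no cast that strips the qualifier.

```cpp
#include <cassert>

// Hypothetical scalar stand-in for the patched code path.
inline float nearbyint_volatile_sketch(float v)
{
    // Assumed input range for this sketch: small non-negative values.
    assert(v >= 0.0f && v < 4194304.0f);
    const float t2n = 8388608.0f; // 2^23 (assumed constant for float)
    volatile float d0 = v + t2n;  // volatile store of the rounded sum
    float d = d0 - t2n;           // volatile load, then the subtraction
    return d;
}
```

The sketch relies on the compiler treating the volatile store and load as observable accesses, so the subtraction operates on the reloaded value rather than on the original expression.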

RafaGago · 2021-09-01T06:13:47Z

I'm at $DAILY_JOB now and don't have the codebase at hand, but I'd say that local-scope volatile variables are not enough to stop Clang. It seems to see through the intrinsic types; otherwise it wouldn't have correctly applied the optimization.

The problem with the volatile approach is that, even if it works now, we don't know whether a compiler update would silently break it.

Note that to remove the volatile, instead of doing "*(batch_type*)(void*)(&d0)" you can do "const_cast<batch_type const&>(d0)".

serge-sans-paille · 2021-10-12T19:42:34Z

#551 achieves the same result on my laptop, while being less intrusive on the codebase. Can you confirm it works for you too?

serge-sans-paille reviewed Aug 31, 2021

serge-sans-paille mentioned this pull request Oct 13, 2021

Minimalistic fast math support #551

Open

JohanMabille force-pushed the master branch 3 times, most recently from 6c6dc1f to 52984ef Compare October 14, 2021 12:16
