remove valgrind call causing GPU problems #4239

multitalentloes · 2024-09-30T14:48:30Z

Avoid having Valgrind calls/throw inside GPU kernels

multitalentloes · 2024-09-30T14:48:36Z

Jenkins build this please

blattms · 2024-10-01T06:33:38Z

I think it would be cleaner to introduce a CMake check to turn those checks on and off. Off by default

multitalentloes · 2024-10-01T06:38:43Z

Having many preprocessor statements is far from neat, so a cleaner solution that also avoids the compilation errors caused by calling host code in a device function is very welcome. How would a CMake check instead look and possibly avoid this issue in a more clean way?

akva2 · 2024-10-01T06:46:55Z

cmake would not avoid the preprocessor stuff, it would just give an option to flip it on/off.

this option sorta already exists - -DCMAKE_DISABLE_FIND_PACKAGE_Valgrind=1 make it into noops.

multitalentloes · 2024-10-01T06:58:51Z

Here I would think we want it to compile all the time. If the valgrind call is allowed in a device function then we cannot compile in debug mode for gpus, so having a macro that removes in undonditionally only on the gpu seems like the right move? If the option would allow this code on gpu we would introduce a sort of invalid build configuration that wont compile

atgeirr · 2024-10-01T07:13:16Z

Could we move the preprocessor logic into Valgrind::CheckDefined() instead and make it a no-op on devices?

multitalentloes · 2024-10-01T07:17:53Z

Oh I thought that function would be a part of the valgrind implementation but we have one layer inbetween where I can place this macro instead, that would allow me to clean up some code where this solution is already in place...

multitalentloes · 2024-10-01T10:50:03Z

Jenkins build this please

kjetilly · 2024-10-01T11:52:17Z

opm/material/common/Valgrind.hpp

@@ -105,7 +105,8 @@ inline bool CheckDefined([[maybe_unused]] const T& value)
 template <class T>
 inline bool CheckAddressable([[maybe_unused]] const T& value)
 {
-#if !defined NDEBUG && HAVE_VALGRIND
+// if we run in debug mode AND we have valgrind AND we are NOT in a gpu function
+#if !defined NDEBUG && HAVE_VALGRIND && !OPM_IS_INSIDE_DEVICE_FUNCTION


OPM_IS_INSIDE_DEVICE_FUNCTION will always be false if the function does not have a device decorator?

Yeah if the preprocessor statement is evaluated before the function is inlined then this seems to have no impact

I do not understand this statement, apart from "Yeah"...?

The code looks fine to me.

The very high level idea is that you don't want to call any Valgrind-function from within a device kernel.

This is accomplished by wrapping said statement within a preprocessor #if !OPM_IS_INSIDE_DEVICE_FUNCTION, where
OPM_IS_INSIDE_DEVICE_FUNCTION is true if and only if the function is being invoked from a device kernel.

However, a GPU kernel is not allowed to call a function which is not decorated with __device__ (with the exception of constexpr-functions with certain flags enabled), and the function in question, CheckAddressable does not have any such decorator, hence it will never be called from a device kernel, in other words, the current preprocessor test is not doing anything before the __device__ decorator (through the OPM wrapper macro) is added.

So the current calls to this function would call compilation errors on GPU since they are not decorated, which means that either we should decorate it, or we go with the original solution of avoiding the calls by lots of #ifs. I prefer the first (add decorator).

Yes, that is kind of the gist of my comment. I'm a bit perplexed if this even solved the compilation in Debug problems, @multitalentloes ? Or is the function in question not called from any device kernel at the moment?

I will look more into this early next week.

remove valgrind call causing GPU problems

4d9c5d2

move macro usage to valgrind.hpp

b8fbfa3

kjetilly reviewed Oct 1, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

remove valgrind call causing GPU problems #4239

remove valgrind call causing GPU problems #4239

multitalentloes commented Sep 30, 2024

multitalentloes commented Sep 30, 2024

blattms commented Oct 1, 2024 •

edited

Loading

multitalentloes commented Oct 1, 2024

akva2 commented Oct 1, 2024 •

edited

Loading

multitalentloes commented Oct 1, 2024

atgeirr commented Oct 1, 2024

multitalentloes commented Oct 1, 2024

multitalentloes commented Oct 1, 2024

kjetilly Oct 1, 2024

multitalentloes Oct 1, 2024

atgeirr Oct 1, 2024

kjetilly Oct 1, 2024 •

edited

Loading

atgeirr Oct 1, 2024

kjetilly Oct 1, 2024

multitalentloes Oct 2, 2024

remove valgrind call causing GPU problems #4239

Are you sure you want to change the base?

remove valgrind call causing GPU problems #4239

Conversation

multitalentloes commented Sep 30, 2024

multitalentloes commented Sep 30, 2024

blattms commented Oct 1, 2024 • edited Loading

multitalentloes commented Oct 1, 2024

akva2 commented Oct 1, 2024 • edited Loading

multitalentloes commented Oct 1, 2024

atgeirr commented Oct 1, 2024

multitalentloes commented Oct 1, 2024

multitalentloes commented Oct 1, 2024

kjetilly Oct 1, 2024

Choose a reason for hiding this comment

multitalentloes Oct 1, 2024

Choose a reason for hiding this comment

atgeirr Oct 1, 2024

Choose a reason for hiding this comment

kjetilly Oct 1, 2024 • edited Loading

Choose a reason for hiding this comment

atgeirr Oct 1, 2024

Choose a reason for hiding this comment

kjetilly Oct 1, 2024

Choose a reason for hiding this comment

multitalentloes Oct 2, 2024

Choose a reason for hiding this comment

blattms commented Oct 1, 2024 •

edited

Loading

akva2 commented Oct 1, 2024 •

edited

Loading

kjetilly Oct 1, 2024 •

edited

Loading