fix thread unsafety in occasional logging when using std::atomic #808

Nimrod0901 · 2022-03-17T15:36:18Z

This PR addresses #804.

Note:

Declare LOG_OCCURRENCES_MOD_N as a local, non-atomic variable and use fetch_add % n to get the exact modulo remainder.
Change the initial value in SOME_KIND_OF_LOG_EVERY_N from 0 to 1, since fetch_add returns the previous value.
Add an extra condition when n == 1.
Introduce a new variable LOG_OCCURRENCES_MOD_N in the macro SOME_KIND_OF_LOG_FIRST_N. Although the name seems unrelated, I prefer not to introduce another macro to name it.
Remove the use of AnnotateBenignRaceSized for variables that are unlikely to be accessed from multiple threads.
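
One way to read the n == 1 item above: if the counter starts at 1 and the check is `previous % n == 1`, the remainder for n == 1 is always 0, so the check would never fire without an extra condition. The following is a minimal sketch of that reading only. `should_log_every_n` is a hypothetical helper, not a glog function, and the macro in the PR may be structured differently.

```cpp
#include <atomic>
#include <cassert>

// Hypothetical helper (not part of glog): decides whether the current call
// should log, given a shared counter that starts at 1.
inline bool should_log_every_n(std::atomic<unsigned>& occurrences, unsigned n) {
  assert(n > 0);
  // fetch_add returns the value held *before* the increment: 1, 2, 3, ...
  const unsigned previous = occurrences.fetch_add(1, std::memory_order_relaxed);
  // previous % 1 is always 0, so n == 1 needs its own condition.
  return n == 1 || previous % n == 1;
}
```

With n == 3 and a counter starting at 1, this logs on the first call, skips the next two, and logs again on the fourth.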

I only fixed the atomic version; I don't know whether the other versions have a similar problem.
Would you mind taking a look? @sergiud @drigz Thanks.

Aside question:
I doubt that AnnotateBenignRaceSized has any effect on std::atomic, since no data race can happen there.

sergiud · 2022-03-17T16:13:26Z

src/glog/logging.h.in

-  ++LOG_OCCURRENCES; \
-  if (++LOG_OCCURRENCES_MOD_N > n) LOG_OCCURRENCES_MOD_N -= n; \
-  if (LOG_OCCURRENCES_MOD_N == 1) \
+  int LOG_OCCURRENCES_MOD_N = LOG_OCCURRENCES.fetch_add(1, std::memory_order_relaxed) % n; \


It is not clear to me why changing to fetch_add provides a solution here. op++ also calls fetch_add albeit with different memory ordering. Consequently, you cannot avoid the problem you are trying to solve. To be able to, you would need to perform all the involved operations atomically by protecting concurrent access in a critical section.

The problem was "a series of atomic operations on LOG_OCCURRENCES_MOD_N that, taken as a whole, are not atomic, which is unexpected". Here only LOG_OCCURRENCES is static, so it is the only place we need to protect. Specifically, we perform just one atomic operation, fetch_add, on LOG_OCCURRENCES.
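
To make the quoted problem concrete, here is a sketch that replays the removed lines from the diff (`++LOG_OCCURRENCES_MOD_N > n`, `-= n`, `== 1`) for two calls, A and B, interleaved by hand on a single thread so the result is deterministic. The function names and the chosen interleaving are illustrative assumptions; real threads may interleave differently.

```cpp
#include <atomic>
#include <cassert>

// Replays the pre-PR sequence for two calls, A and B, under one possible
// interleaving. Returns how many of the two calls would log.
inline int logs_when_interleaved(int n) {
  assert(n >= 2);
  std::atomic<int> mod_n{n};         // state after n earlier calls

  const bool a_over = ++mod_n > n;   // A increments: n + 1
  const bool b_over = ++mod_n > n;   // B increments: n + 2
  if (a_over) mod_n -= n;            // A subtracts: 2
  if (b_over) mod_n -= n;            // B subtracts: 2 - n
  const bool a_logs = (mod_n == 1);  // A reads 2 - n, never 1 for n >= 2
  const bool b_logs = (mod_n == 1);  // B reads 2 - n as well
  return int(a_logs) + int(b_logs);
}

// The same two calls run back to back, with no interleaving.
inline int logs_when_sequential(int n) {
  assert(n >= 2);
  std::atomic<int> mod_n{n};
  int logs = 0;
  for (int call = 0; call < 2; ++call) {
    if (++mod_n > n) mod_n -= n;
    if (mod_n == 1) ++logs;
  }
  return logs;
}
```

Every individual operation is atomic, yet under this interleaving neither call logs, whereas the sequential run logs once.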

My question is: in what way do the changes fix the problem? The operations are still not atomic as a whole.

> The operations are still not atomic as a whole.

Yes. But they don't need to be. There is only one operation that needs to be atomic, access to the static variable LOG_OCCURRENCES.

> in what way do the changes fix the problem?

For every use of this macro (under different threads), a unique LOG_OCCURRENCES value is assigned, incremented by 1. We use a local variable LOG_OCCURRENCES_MOD_N to hold the remainder of % n. From that point on, there is no need for protection or atomicity. It can be written like

int LOG_OCCURRENCES_MOD_N = LOG_OCCURRENCES.fetch_add(1, std::memory_order_relaxed);
if (LOG_OCCURRENCES_MOD_N % n == 1)

They are the same.
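
A multi-threaded sketch of the idea described above: each call takes a unique value from the shared counter with a single fetch_add and then works only on a local copy. `claim_and_check` and `count_logs` are hypothetical names, and the counter is assumed to start at 1 to match the `% n == 1` check; this is not the macro from the PR.

```cpp
#include <atomic>
#include <cassert>
#include <thread>
#include <vector>

// Hypothetical stand-in for the macro body: one atomic read-modify-write on
// the shared counter, followed by purely local arithmetic.
inline bool claim_and_check(std::atomic<unsigned>& occurrences, unsigned n) {
  assert(n > 1);
  const unsigned ticket = occurrences.fetch_add(1, std::memory_order_relaxed);
  return ticket % n == 1;
}

// Runs `threads` workers that each make `calls_per_thread` calls and returns
// the total number of calls that decided to log.
inline unsigned count_logs(unsigned threads, unsigned calls_per_thread,
                           unsigned n) {
  std::atomic<unsigned> occurrences{1};  // assumed initial value
  std::atomic<unsigned> logged{0};
  std::vector<std::thread> pool;
  for (unsigned t = 0; t < threads; ++t) {
    pool.emplace_back([&] {
      for (unsigned i = 0; i < calls_per_thread; ++i) {
        if (claim_and_check(occurrences, n)) {
          logged.fetch_add(1, std::memory_order_relaxed);
        }
      }
    });
  }
  for (auto& worker : pool) worker.join();
  return logged.load();
}
```

Because every call receives a distinct ticket, the total number of logging calls depends only on the number of calls, not on how the threads interleave. Which thread ends up logging is still unspecified.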

You cannot do that. Once LOG_OCCURRENCES overflows you will hit undefined behavior.

Also, the change from op++ to fetch_add is not clear to me. Why is this necessary? If you need the previous value, you can use the post increment.

The main idea is to guarantee that every call sees its own exact value in the if condition that decides whether it should log. Once the result of fetch_add on the static atomic variable is stored in a local variable, that state is captured. No two calls will ever get the same value.

> Once LOG_OCCURRENCES overflows you will hit undefined behavior.

You're right. This may need more consideration.

> the change from op++ to fetch_add is not clear to me. Why is this necessary?

int val = op++;           // not atomic, there are two operations: self increment and assignment. ++op is the same
int val = op.fetch_add(1) // atomic

That's not correct. With post increment you will get the previous value since it essentially calls fetch_add as well.

Please refer to the documentation of std::atomic::op++ before we continue the discussion. Thanks.
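
For reference, a small sketch of the behavior under discussion: on std::atomic integral types, post-increment is specified as equivalent to fetch_add(1) and returns the previous value, while pre-increment returns fetch_add(1) + 1. The struct and function names are illustrative only.

```cpp
#include <atomic>
#include <cassert>

struct IncrementResults {
  int from_post_increment;
  int from_fetch_add;
  int from_pre_increment;
  int final_value;
};

// Applies the three increment forms in turn to one atomic counter and
// records what each expression returned.
inline IncrementResults compare_increments(int start) {
  std::atomic<int> counter{start};
  IncrementResults r{};
  r.from_post_increment = counter++;        // returns start
  r.from_fetch_add = counter.fetch_add(1);  // returns start + 1
  r.from_pre_increment = ++counter;         // returns start + 3
  r.final_value = counter.load();           // start + 3
  assert(r.final_value == start + 3);
  return r;
}
```

Each of the three forms is a single atomic read-modify-write. The only differences are the returned value and, for fetch_add, the option to pass a memory order.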

Sorry, my bad. You're right. I agree there is no need to use fetch_add. ++ is enough.

> Once LOG_OCCURRENCES overflows you will hit undefined behavior.

Use atomic_uint instead. Unsigned integer overflow is not UB. Will this help?
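
A sketch of what wraparound looks like with an unsigned counter, assuming a 32-bit counter type (std::uint32_t is used here to pin the width; atomic_uint may differ by platform). Unsigned arithmetic wraps modulo 2^32, so the wrap itself is well defined, but the sequence of remainders can still jump at the wrap when n does not divide 2^32. The names below are hypothetical.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>
#include <limits>

struct WrapRemainders {
  std::uint32_t before_wrap;
  std::uint32_t after_wrap;
};

// Returns the remainders seen by two consecutive calls that straddle the
// wraparound of a 32-bit unsigned counter.
inline WrapRemainders remainders_across_wrap(std::uint32_t n) {
  assert(n > 0);
  std::atomic<std::uint32_t> occurrences{
      std::numeric_limits<std::uint32_t>::max()};
  // Returns 4294967295; the stored value wraps to 0.
  const std::uint32_t last = occurrences.fetch_add(1);
  // Returns 0; the stored value becomes 1.
  const std::uint32_t first = occurrences.fetch_add(1);
  return {last % n, first % n};
}
```

For n == 4 the remainders continue as 3, 0. For n == 3 they come out as 0, 0, so the every-n cadence is off by one around the wrap.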

sergiud · 2022-03-18T10:06:41Z

Since a fix is not straightforward to get right, I suggest that you add unit tests.

Nimrod0901 · 2022-03-18T12:18:15Z

I made some updates. Please take a look when you have time.
The tests I added will most likely fail on the master branch.

sergiud · 2022-04-03T14:33:50Z

I'm very sorry for the delay. I will look at the PR right after the 0.6 release.

Nimrod0901 · 2022-04-04T03:10:42Z

> I'm very sorry for the delay. I will look at the PR right after the 0.6 release.

No problem.

sergiud · 2024-06-11T19:16:56Z

I'll close the PR since it has heavily diverged from master. @Nimrod0901 if you want to continue working on this, please rebase.

fix thread unsafety in occasional logging when using std::atomic

ba4ad45

sergiud reviewed Mar 17, 2022

Nimrod0901 added 3 commits March 18, 2022 19:19

replace fetch_add with ++

51d5348

use unsigned int to avoid UB

30f8a84

add unittest for multiple thread occasional logging for C++11 and later

821a5b6

sergiud added this to the next milestone Apr 3, 2022

replace implicit conversion with explicit one

eeb85a2

sergiud removed this from the 0.7 milestone Oct 14, 2023

sergiud linked an issue Jan 3, 2024 that may be closed by this pull request

LOG_EVERY_N seems not thread safe in C++11 and newer #804

Open

sergiud closed this Jun 11, 2024
