Chris M. Thomasson wrote:
Interesting! I wrote about so-called "tagged" memory order a while back
on this group. Just shooting the breeze, so to speak. Having some fun.

On 12/19/2024 10:33 AM, MitchAlsup1 wrote:
I had an idea a few weeks back of a different way to do membars

On Thu, 5 Dec 2024 7:44:19 +0000, Chris M. Thomasson wrote:
On 12/4/2024 8:13 AM, jseigh wrote:
On 12/3/24 18:37, Stefan Monnier wrote:
If there are places in the code it doesn't know this can't happen,
it won't optimize across it, more or less.
The problem is HOW to TELL the COMPILER that these memory references
are "more special" than normal--when languages give few mechanisms.
We could start with something like

    critical_region {
        ...
    }

such that the compiler must refrain from any code motion within those
sections but is free to move things outside of those sections as if
execution was single-threaded.
C/C++11 already defines what lock acquire/release semantics are.
Roughly, you can move stuff from outside a critical section into it,
but not vice versa.

Java uses synchronized blocks to denote the critical section. C++ (the
society for using RAII for everything) has scoped_lock if you want to
use RAII for your critical section. It's not always obvious what the
actual critical section is, so I usually put it inside its own brace
block to make the extent obvious:

    {
        std::scoped_lock m(mutex);
        // ... critical section
    }
I'm not a big fan of C/C++ using acquire and release memory order
directives on everything, since apart from a few situations it's not
intuitively obvious what they do in all cases. You can look at compiler
assembler output, but you have to be real careful generalizing from
what you see.
The release on the unlock can allow some following stores and things to
sort of "bubble up" before it?

Acquire and release confine things to the "critical section"; the
release can allow some following things to go above it, so to speak.
This is making me think of Alex over on c.p.t.!
This sounds dangerous: if the thing allowed to go above it is
unCacheable while the lock release is cacheable, the cacheable lock can
arrive at another core before the unCacheable store arrives at its
destination.

Humm... Need to ponder on that. Wrt SPARC:

    membar #LoadStore | #StoreStore

can allow following stores to bubble up before it. If we want to block
that, then we would use a #StoreLoad. However, a #StoreLoad is not
required for unlocking a mutex.
It should be more flexible and controllable (if that's a good thing),
so I thought I'd toss it out there for comments.
This hypothetical ISA has normal LD and ST instructions, to which I
would add an LW Load-for-Write instruction to optimize moving shared
lines between caches. There are also the atomic fetch-and-op
instructions AFADD, AFAND, AFOR, AFXOR, plus ASWAP and ACAS, and
LL Load-Locked / SC Store-Conditional, for various sizes of naturally
aligned data and with various address modes.
Here is the new part:
To the above instructions is added a 3-bit Coherence Group (CG) field.
This allows one to specify different groups that various above data
accesses belong to.
The ISA has a membar instruction: MBG Memory Barrier for Group
MBG has three fields:
- one 4-bit field where each bit enables which operations this barrier
applies to, in older-younger order: Load-Load, Load-Store, Store-Load,
and Store-Store.
- two 8-bit fields where each bit selects which sets of Coherence Group(s)
this barrier applies to, one field for the older (before the membar) sets,
one for the younger (after the membar) sets.
Also the Load Store Queue is assumed to be self coherent - that loads
and stores to the same address by a single core are performed in order,
and that nothing can bypass a load or store with an unresolved address.
The CG numbers are assigned by convention, probably by the OS designers
when they define the ABI for this ISA.
Here I assigned CG:0 to be thread normal access, CG:1 to be atomic items,
CG:2 to be shared memory sections. The remaining 5 CG's can be used to
indicate different shared memory sections if their locks can overlap.
E.g., an MBG with op bits for Load-Load and Load-Store, a before-CG set
of {1}, and an after-CG set of {3,4} would block all younger loads and
stores in groups 3 and 4 from starting execution until all older loads
in group 1 have completed.
Loads and stores in all other groups are free to reorder, within the
LSQ self coherence rules.
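That example might look like the following in an invented assembly
syntax (the mnemonic spellings, the .gN group suffix, and the operand
order are my assumptions, not part of the proposal):

```
    ; Older: atomic accesses in coherence group 1
    LD.g1   r1, [lock]        ; e.g. read a lock word, CG:1

    ; Block younger CG:3/CG:4 loads and stores until all older CG:1
    ; loads complete: op bits = LL|LS, before = {1}, after = {3,4}
    MBG     LL|LS, before={1}, after={3,4}

    ; Younger: accesses to two shared sections, groups 3 and 4
    LD.g3   r2, [shared_a]    ; may not start until the barrier clears
    ST.g4   [shared_b], r3    ; likewise

    ; CG:0 (thread-normal) traffic is unaffected and may reorder
    ; freely, within the LSQ self-coherence rules
    LD.g0   r4, [private]
```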
An MBG with all op bits and all CG bits set is a full membar.
Also, if one is, say, juggling multiple shared sections with multiple
spinlocks or mutexes, then one can use multiple membars applied to
different groups to achieve specific bypass-blocking effects.
An MBG instruction completes and retires when no older groups of
selected loads or stores are incomplete.