Newsportal USENET - Re: Memory ordering

Re: Memory ordering

Sujet : Re: Memory ordering
De : mitchalsup (at) *nospam* aol.com (MitchAlsup1)
Groupes : comp.arch
Date : 16. Nov 2024, 01:51:36

Autres entêtes

Organisation : Rocksolid Light
Message-ID : <f39cc5fb9e74d80d385797f0a5e1c3a0@www.novabbs.org>
References : 1 2 3 4 5 6 7 8 9 10 11 12 13 14
User-Agent : Rocksolid Light

On Fri, 15 Nov 2024 23:35:22 +0000, BGB wrote:

On 11/15/2024 4:05 PM, Chris M. Thomasson wrote:
On 11/15/2024 12:53 PM, BGB wrote:
On 11/15/2024 11:27 AM, Anton Ertl wrote:
jseigh <jseigh_es00@xemaps.com> writes:
Anybody doing that sort of programming, i.e. lock-free or distributed
algorithms, who can't handle weakly consistent memory models, shouldn't
be doing that sort of programming in the first place.
>
Do you have any argument that supports this claim.
>
Strongly consistent memory won't help incompetence.
>
Strong words to hide lack of arguments?
>
>
In my case, as I see it:
   The tradeoff is more about implementation cost, performance, etc.
>
Weak model:
   Cheaper (and simpler) to implement;
   Performs better when there is no need to synchronize memory;
   Performs worse when there is need to synchronize memory;
   ...
[...]
>
A TSO from a weak memory model is as it is. It should not necessarily
perform "worse" than other systems that have TSO as a default. The
weaker models give us flexibility. Any weak memory model should be able
to give sequential consistency via using the right membars in the right
places.
>
>
The speed difference is mostly that, in a weak model, the L1 cache
merely needs to fetch memory from the L2 or similar, may write to it
whenever, and need not proactively store back results.
>
As I understand it, a typical TSO like model will require, say:
Any L1 cache that wants to write to a cache line, needs to explicitly
request write ownership over that cache line;

The cache line may have been fetched from a core which modified the
data, and handed this line directly to this requesting core on a
typical read. So, it is possible for the line to show up with
write permission even if the requesting core did not ask for write
permission. So, not all lines being written have to request owner-
ship.

Any attempt by other cores to access this line,

You are being rather loose with your time analysis in this question::
Access this line before write permission has been requested,
or
Access this line after write permission has been requested but
before it has arrived,
or
Access this line after write permission has arrived.

may require the L2 cache
to send a message to the core currently holding the cache line for
writing to write back its contents, with the request unable to be
handled until after the second core has written back the dirty cache
line.

L2 has to know something about how L1 has the line, and likely which
core cache the data is in.

This would create potential for significantly more latency in cases
where multiple cores touch the same part of memory; albeit the cores
will see each others' memory stores.

One can ARGUE that this is a good thing as it makes latency part
of the memory access model. More interfering accesses=higher
latency.

>
So, initially, weak model can be faster due to not needing any
additional handling.
>
>
But... Any synchronization points, such as a barrier or locking or
releasing a mutex, will require manually flushing the cache with a weak
model.

Not necessarily:: My 66000 uses causal memory consistency, yet when
an ATOMIC event begins it reverts to sequential consistency until
the end of the event where it reverts back to causal. Use of MMI/O
space reverts to sequential consistency, while access to config
space reverts all the way back to strongly ordered.

And, locking/releasing the mutex itself will require a mechanism
that is consistent between cores (such as volatile atomic swaps or
similar, which may still be weak as a volatile-atomic-swap would still
not be atomic from the POV of the L2 cache; and an MMIO interface could
be stronger here).
>
>
Seems like there could possibly be some way to skip some of the cache
flushing if one could verify that a mutex is only being locked and
unlocked on a single core.
>
Issue then is how to deal with trying to lock a mutex which has thus far
been exclusive to a single core. One would need some way for the core
that last held the mutex to know that it needs to perform an L1 cache
flush.

This seems to be a job for Cache Consistency.

Though, one possibility could be to leave this part to the OS
scheduler/syscall/...

The OS wants nothing to do with this.

mechanism; so the core that wants to lock the
mutex signals its intention to do so via the OS, and the next time the
core that last held the mutex does a syscall (or tries to lock the mutex
again), the handler sees this, then performs the L1 flush and flags the
mutex as multi-core safe (at which point, the parties will flush L1s at
each mutex lock, though possibly with a timeout count so that, if the
mutex has been single-core for N locks, it reverts to single-core
behavior).
>
This could reduce the overhead of "frivolous mutex locking" in programs
that are otherwise single-threaded or single processor (leaving the
cache flushes for the ones that are in-fact being used for
synchronization purposes).
>
....

Les messages affichés proviennent d'usenet.

Date	Sujet	#	Auteur
28 Oct 24	Arm ldaxr / stxr loop question	135	jseigh
31 Oct 24	Re: Arm ldaxr / stxr loop question	1	MitchAlsup1
31 Oct 24	Re: Arm ldaxr / stxr loop question	1	Chris M. Thomasson
1 Nov 24	Re: Arm ldaxr / stxr loop question	123	aph
2 Nov 24	Re: Arm ldaxr / stxr loop question	122	Chris M. Thomasson
8 Nov 24	Re: Arm ldaxr / stxr loop question	121	Chris M. Thomasson
9 Nov 24	Re: Arm ldaxr / stxr loop question	120	Chris M. Thomasson
9 Nov 24	Re: Arm ldaxr / stxr loop question	117	Chris M. Thomasson
9 Nov 24	Re: Arm ldaxr / stxr loop question	1	Chris M. Thomasson
11 Nov 24	Re: Arm ldaxr / stxr loop question	5	MitchAlsup1
11 Nov 24	Re: Arm ldaxr / stxr loop question	1	Michael S
11 Nov 24	Re: Arm ldaxr / stxr loop question	3	jseigh
11 Nov 24	Re: Arm ldaxr / stxr loop question	2	Chris M. Thomasson
13 Nov 24	Re: Arm ldaxr / stxr loop question	1	Chris M. Thomasson
11 Nov 24	Re: Arm ldaxr / stxr loop question	1	Michael S
12 Nov 24	Re: Arm ldaxr / stxr loop question	1	Chris M. Thomasson
12 Nov 24	Re: Arm ldaxr / stxr loop question	22	aph
13 Nov 24	Re: Arm ldaxr / stxr loop question	18	Chris M. Thomasson
13 Nov 24	Re: Arm ldaxr / stxr loop question	17	aph
13 Nov 24	Re: Arm ldaxr / stxr loop question	3	jseigh
13 Nov 24	Re: Arm ldaxr / stxr loop question	2	aph
13 Nov 24	Re: Arm ldaxr / stxr loop question	1	Chris M. Thomasson
13 Nov 24	Re: Arm ldaxr / stxr loop question	1	MitchAlsup1
13 Nov 24	Re: Arm ldaxr / stxr loop question	2	Chris M. Thomasson
13 Nov 24	Re: Arm ldaxr / stxr loop question	1	Chris M. Thomasson
13 Nov 24	Re: Arm ldaxr / stxr loop question	2	Chris M. Thomasson
13 Nov 24	Re: Arm ldaxr / stxr loop question	1	Chris M. Thomasson
13 Nov 24	Re: Arm ldaxr / stxr loop question	8	Terje Mathisen
13 Nov 24	Brilliance (was: Arm ldaxr / stxr loop question)	4	Anton Ertl
13 Nov 24	Re: Brilliance	1	BGB
14 Nov 24	Re: Brilliance	2	Terje Mathisen
17 Nov 24	Re: Brilliance	1	Thomas Koenig
13 Nov 24	Re: Arm ldaxr / stxr loop question	3	aph
14 Nov 24	Re: Arm ldaxr / stxr loop question	2	Terje Mathisen
15 Nov 24	Re: Arm ldaxr / stxr loop question	1	Chris M. Thomasson
13 Nov 24	Re: Arm ldaxr / stxr loop question	3	BGB
13 Nov 24	Re: Arm ldaxr / stxr loop question	2	Chris M. Thomasson
13 Nov 24	Re: Arm ldaxr / stxr loop question	1	Robert Finch
14 Nov 24	Re: Arm ldaxr / stxr loop question	86	Kent Dickey
14 Nov 24	Re: Arm ldaxr / stxr loop question	85	aph
15 Nov 24	Re: Arm ldaxr / stxr loop question	81	Chris M. Thomasson
15 Nov 24	Re: Arm ldaxr / stxr loop question	80	aph
15 Nov 24	Re: Arm ldaxr / stxr loop question	1	Chris M. Thomasson
15 Nov 24	Memory ordering (was: Arm ldaxr / stxr loop question)	78	Anton Ertl
15 Nov 24	Re: Memory ordering	44	Chris M. Thomasson
15 Nov 24	Re: Memory ordering	43	Michael S
15 Nov 24	Re: Memory ordering	42	Chris M. Thomasson
16 Nov 24	Re: Memory ordering	41	Chris M. Thomasson
16 Nov 24	Re: Memory ordering	1	Chris M. Thomasson
17 Nov 24	Re: Memory ordering	39	jseigh
17 Nov 24	Re: Memory ordering	33	Anton Ertl
19 Nov 24	Re: Memory ordering	32	Chris M. Thomasson
3 Dec 24	Re: Memory ordering	31	Anton Ertl
3 Dec 24	Re: Memory ordering	30	jseigh
3 Dec 24	Re: Memory ordering	29	MitchAlsup1
4 Dec 24	Re: Memory ordering	22	Stefan Monnier
4 Dec 24	Re: Memory ordering	3	MitchAlsup1
4 Dec 24	Re: Memory ordering	2	Stefan Monnier
4 Dec 24	Re: Memory ordering	1	MitchAlsup1
4 Dec 24	Re: Memory ordering	18	jseigh
5 Dec 24	Re: Memory ordering	17	Chris M. Thomasson
5 Dec 24	Re: Memory ordering	8	jseigh
16 Dec22:48	Re: Memory ordering	7	Chris M. Thomasson
17 Dec13:33	Re: Memory ordering	6	jseigh
17 Dec21:38	Re: Memory ordering	1	aph
17 Dec21:41	Re: Memory ordering	4	Chris M. Thomasson
17 Dec22:45	Re: Memory ordering	1	MitchAlsup1
18 Dec12:43	Re: Memory ordering	2	jseigh
19 Dec03:48	Re: Memory ordering	1	Chris M. Thomasson
19 Dec19:33	Re: Memory ordering	8	MitchAlsup1
19 Dec22:19	Re: Memory ordering	7	Chris M. Thomasson
20 Dec00:59	Re: Memory ordering	5	MitchAlsup1
20 Dec01:21	Re: Memory ordering	2	Chris M. Thomasson
20 Dec01:25	Re: Memory ordering	1	Chris M. Thomasson
20 Dec01:48	Re: Memory ordering	2	Chris M. Thomasson
20 Dec01:58	Re: Memory ordering	1	Chris M. Thomasson
20 Dec21:17	Re: Memory ordering	1	Chris M. Thomasson
4 Dec 24	Re: Memory ordering	1	Chris M. Thomasson
4 Dec 24	Re: Memory ordering	1	MitchAlsup1
5 Dec 24	Re: Memory ordering	4	Tim Rentsch
6 Dec 24	Re: Memory ordering	2	Terje Mathisen
6 Dec 24	Re: Memory ordering	1	Tim Rentsch
20 Dec06:08	Re: Memory ordering	1	Chris M. Thomasson
17 Nov 24	Re: Memory ordering	2	Chris M. Thomasson
19 Nov 24	Re: Memory ordering	1	Chris M. Thomasson
18 Nov 24	Re: Memory ordering	1	aph
21 Nov 24	Re: Memory ordering	1	Chris M. Thomasson
21 Nov 24	Re: Memory ordering	1	Chris M. Thomasson
15 Nov 24	Re: Memory ordering (was: Arm ldaxr / stxr loop question)	2	Michael S
15 Nov 24	Re: Memory ordering (was: Arm ldaxr / stxr loop question)	1	Anton Ertl
15 Nov 24	Re: Memory ordering	28	jseigh
15 Nov 24	Re: Memory ordering	27	Anton Ertl
15 Nov 24	Re: Memory ordering	18	Chris M. Thomasson
16 Nov 24	Re: Memory ordering	17	Anton Ertl
17 Nov 24	Re: Memory ordering	16	Chris M. Thomasson
17 Nov 24	Re: Memory ordering	15	Anton Ertl
18 Nov 24	Re: Memory ordering	14	Chris M. Thomasson
18 Nov 24	Re: Memory ordering	13	Anton Ertl
19 Nov 24	Re: Memory ordering	12	Chris M. Thomasson
19 Nov 24	Re: Memory ordering	11	Chris M. Thomasson
26 Nov 24	Re: Memory ordering	4	Chris M. Thomasson
3 Dec 24	Re: Memory ordering	6	Anton Ertl
15 Nov 24	Re: Memory ordering	7	BGB
17 Nov 24	Re: Memory ordering	1	Tim Rentsch
16 Nov 24	Re: Memory ordering (was: Arm ldaxr / stxr loop question)	1	Anton Ertl
16 Nov 24	Re: Memory ordering (was: Arm ldaxr / stxr loop question)	1	Lawrence D'Oliveiro
18 Nov 24	Re: Memory ordering	1	aph
21 Nov 24	Re: Arm ldaxr / stxr loop question	3	Kent Dickey
9 Nov 24	Re: Arm ldaxr / stxr loop question	2	jseigh
8 Nov 24	Re: Arm ldaxr / stxr loop question	8	Lawrence D'Oliveiro
20 Dec10:11	Re: Arm ldaxr / stxr loop question	1	Chris M. Thomasson