BGB <cr88192@gmail.com> writes:
> The tradeoff is more about implementation cost, performance, etc.
Yes. And the "etc." includes "ease of programming".
> Weak model:
>   Cheaper (and simpler) to implement;
Yes.
>   Performs better when there is no need to synchronize memory;
Not in general. For a cheap multiprocessor implementation, yes. A
sophisticated implementation of sequential consistency can just storm
ahead in that case and achieve the same performance. It just has to
keep checkpoints around in case there is a need to synchronize
memory.
>   Performs worse when there is need to synchronize memory;
With a cheap multiprocessor implementation, yes. In general, no: Any
sequentially consistent implementation is also an implementation of
every weaker memory model, and the memory barriers become nops in that
kind of implementation. Ok, nops still have a cost, but it's very
close to 0 on a modern CPU.
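To make the nop point concrete, here is a hedged C11 sketch (the names
publish, payload, and flag are just for illustration): the fence is
required by the weak memory model, but on hardware whose ordering is
already at least as strong as what the fence asks for, it costs
essentially nothing.

#include <stdatomic.h>

extern _Atomic int flag;
extern int payload;

void publish(int v)
{
    payload = v;
    /* On a machine that already orders stores at least as strongly as
       a release fence requires -- x86/TSO is the usual example, and a
       sequentially consistent machine is the extreme case -- compilers
       emit no instruction for this fence; on weakly ordered hardware
       (e.g. ARM, POWER) it becomes a real barrier instruction. */
    atomic_thread_fence(memory_order_release);
    atomic_store_explicit(&flag, 1, memory_order_relaxed);
}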
Another potential performance disadvantage of sequential consistency
even with a sophisticated implementation:
If you have some algorithm that actually works correctly even when it
gets stale data from a load (with some limits on the staleness), the
sophisticated SC implementation will incur the latency of making the
load non-stale, while that latency does not occur, or is smaller, in a
similarly sophisticated implementation of an appropriate weak
consistency model.
However, given that the access to actually-shared memory is slow even
on weakly-consistent hardware, software usually takes measures to
avoid having a lot of such accesses, so that cost will usually be
minuscule.
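As a sketch of the kind of staleness-tolerant algorithm meant above
(C11, hypothetical names, not taken from any real program): per-worker
statistics counters that a monitor thread only needs to see
approximately.

#include <stdatomic.h>

/* Workers increment their own slot; a monitor thread occasionally sums
   them.  The monitor tolerates stale values: it only needs a rough,
   eventually-accurate total. */
#define NWORKERS 8
static _Atomic unsigned long events[NWORKERS];

void worker_count_event(int id)
{
    /* relaxed: no ordering needed, only atomicity of the increment */
    atomic_fetch_add_explicit(&events[id], 1, memory_order_relaxed);
}

unsigned long monitor_total(void)
{
    unsigned long sum = 0;
    for (int i = 0; i < NWORKERS; i++)
        /* relaxed: a slightly stale value is acceptable here, so
           weakly consistent hardware may answer from a not-yet-current
           copy, whereas an SC machine pays whatever latency it takes
           to return the globally agreed-on value. */
        sum += atomic_load_explicit(&events[i], memory_order_relaxed);
    return sum;
}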
What you missed: the big cost of weak memory models and cheap hardware
implementations of them is in the software:
* For correctness, the safe way is to insert a memory barrier between
any two memory operations.
* For performance (on cheap implementations of weak memory models) you
want to execute as few memory barriers as possible.
* You cannot use testing to find out whether you have enough (and the
right) memory barriers. That's not only because the involved
threads may not reach the state that exposes the incorrectness during
testing, but also because the hardware used for testing
may actually have stronger consistency than the memory model, so
some kinds of bugs will never show up in testing on that hardware,
even when the threads reach the right state. And testing is still
the go-to solution for software people to find errors (nowadays even
glorified by continuous integration and modern fuzz testing
approaches).
The result is that a lot of software dealing with shared memory is
incorrect because it does not have a memory barrier that it should
have, or inefficient on cheap hardware with expensive memory barriers
because it uses more memory barriers than necessary for the memory
model. A program may even be incorrect in one place and have
superfluous memory barriers in another.
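A hedged C11 sketch of that trap (hypothetical names): the first
producer below lacks the release ordering on the flag store.  Testing
on x86/TSO, which never reorders two stores, typically passes anyway,
while weakly ordered hardware (ARM, POWER) can let the consumer see
flag == 1 and still read a stale msg.

#include <stdatomic.h>

static int msg;                 /* plain data */
static _Atomic int flag;        /* "msg is ready" */

void producer_buggy(void)
{
    msg = 42;
    /* BUG: relaxed store; nothing orders the msg store before it.
       Testing on x86/TSO usually does not catch this, because that
       hardware happens not to reorder the two stores anyway (and
       common compilers usually leave the order alone, too). */
    atomic_store_explicit(&flag, 1, memory_order_relaxed);
}

void producer_fixed(void)
{
    msg = 42;
    /* release: the msg store becomes visible before flag == 1 does */
    atomic_store_explicit(&flag, 1, memory_order_release);
}

int consumer(void)
{
    while (atomic_load_explicit(&flag, memory_order_acquire) == 0)
        ;   /* spin until the producer signals */
    return msg; /* 42 with the fixed producer; may be stale with the buggy one */
}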
Or programmers just don't do this stuff at all (as advocated by
jseigh), and instead just write sequential programs, or use bottled
solutions that often are a lot more expensive than superfluous memory
barriers. E.g., in Gforth the primary inter-thread communication
mechanism is currently implemented with pipes, involving the system
calls read() and write(). And Bernd Paysan who implemented that is a
really good programmer; I am sure he would be able to wrap his head
around the whole memory model stuff and implement something much more
efficient, but that would take time that he obviously prefers to spend
on more productive things.
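For comparison, a minimal sketch of such a pipe-based scheme (plain C
with POSIX pipe(), write(), and read(); Gforth's actual implementation
surely differs): each message costs two system calls, which is correct
under any memory model but far more expensive than even a superfluous
memory barrier.

#include <unistd.h>
#include <pthread.h>
#include <stdio.h>

/* One pipe per direction; the kernel does all the synchronization. */
static int chan[2];     /* chan[0] = read end, chan[1] = write end */

static void *sender(void *arg)
{
    (void)arg;
    long msg = 42;
    /* write() is a system call: correct on any memory model,
       but orders of magnitude slower than a barrier instruction. */
    write(chan[1], &msg, sizeof msg);
    return NULL;
}

int main(void)
{
    pthread_t t;
    long msg;

    pipe(chan);
    pthread_create(&t, NULL, sender, NULL);
    read(chan[0], &msg, sizeof msg);    /* blocks until the sender writes */
    printf("got %ld\n", msg);
    pthread_join(t, NULL);
    return 0;
}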
- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>