On 12/3/24 04:01, Anton Ertl wrote:
"Chris M. Thomasson" <chris.m.thomasson.1@gmail.com> writes:
On 11/17/2024 7:17 AM, Anton Ertl wrote:
jseigh <jseigh_es00@xemaps.com> writes:
Or maybe disable reordering or optimization altogether
for those target architectures.
>
So you want to throw out the baby with the bathwater.
>
No, keep the weak order systems and not throw them out wrt a system that
is 100% seq_cst? Perhaps? What am I missing here?
Disabling optimization altogether costs a lot; e.g., look at
<http://www.complang.tuwien.ac.at/anton/bentley.pdf>: if you compare
the lines for clang-3.5 -O0 with clang-3.5 -O3, you see a factor >2.5
for the tsp9 program. For gcc-5.2.0 the difference is even bigger.
That's why jseigh and people like him (I have read that suggestion
several times before) love to suggest disabling optimization
altogether. It's a straw man that does not even need beating up. Of
course they usually don't show results for the supposed benefits of
the particular "optimization" they advocate (or the drawbacks of
disabling it), and jseigh follows this pattern nicely.
That wasn't a serious suggestion.
The compiler is allowed to reorder code as long as it knows the
reordering can't be observed or detected. If there are places in the
code where it can't prove that, it won't optimize across them, more
or less.
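Roughly, in C++ terms (a minimal sketch with made-up names, not from
any real code):

    #include <atomic>

    int data;
    int ready;                       // plain int: no constraint on the compiler

    void publish_plain(int v)
    {
        data = v;
        ready = 1;                   // compiler may move this store earlier, since
    }                                // a single thread can't tell the difference

    std::atomic<int> ready_flag{0};  // atomic: tells the compiler (and hardware)

    void publish_atomic(int v)
    {
        data = v;
        ready_flag.store(1, std::memory_order_release);  // the store to data
    }                                                    // can't be reordered past this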
If you are writing code with concurrent shared data access then you
need to let the compiler know. One way is with locks. Another way,
for lock-free data structures, is with memory barriers. Even if you
had seq_cst hardware you would still need to tell the compiler, so
seq_cst hardware doesn't buy you any less effort from a programming
point of view.
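Continuing the sketch above, the two ways of telling the compiler
look something like this (illustrative only; the locked variant
assumes the writer takes the same lock):

    #include <mutex>

    std::mutex m;

    int read_with_lock()
    {
        std::lock_guard<std::mutex> g(m);  // lock/unlock act as acquire/release,
        return data;                       // so the read stays in the critical section
    }

    int read_lock_free()
    {
        while (ready_flag.load(std::memory_order_acquire) == 0)
            ;                              // spin until the release store is visible
        return data;                       // acquire pairs with the writer's release
    }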
If you are arguing that lock-free programming with memory barriers is
hard, so let's use locks for everything (disregarding that locks have
acquire/release semantics that the compiler has to be aware of and
that programmers aren't always aware of), you might want to consider
the following performance timings on some stuff I've been playing
with.
unsafe 53.344 nsecs ( 0.000) 54.547 nsecs ( 0.000)*
smr 53.828 nsecs ( 0.484) 55.485 nsecs ( 0.939)
smrlite 53.094 nsecs ( 0.000) 54.329 nsecs ( 0.000)
arc 306.674 nsecs ( 253.330) 313.931 nsecs ( 259.384)
rwlock 730.012 nsecs ( 676.668) 830.340 nsecs ( 775.793)
mutex 2,881.690 nsecs ( 2,828.346) 3,305.382 nsecs ( 3,250.835)
smr is smrproxy, something like user space rcu. smrlite is smr w/o
the thread_local access, so I have an idea how much that adds to the
overhead. arc is arcproxy, lock-free reference count based deferred
reclamation. rwlock and mutex are what their names would suggest.
unsafe is no synchronization, to get a base timing on the reader
loop body.
2nd col is per loop read lock/unlock average cpu time
3rd col is with unsafe time subtracted out
4th col is average elapsed time
5th col is with unsafe time subtracted out.
cpu time doesn't measure lock wait time so elapsed time
gives some indication of that.
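The reader loop being timed is essentially of this shape (a rough
sketch, not the actual harness; proxy_read_lock/unlock stand in for
whichever mechanism is under test):

    #include <atomic>

    struct Node { std::atomic<Node*> next{nullptr}; int value{0}; };
    std::atomic<Node*> head{nullptr};

    void proxy_read_lock()   { /* smr/arc/rwlock/mutex enter, or nothing for unsafe */ }
    void proxy_read_unlock() { /* matching exit */ }

    long reader_loop(long iterations)
    {
        long sum = 0;
        for (long i = 0; i < iterations; i++) {
            proxy_read_lock();                        // the cost being measured
            for (Node* p = head.load(std::memory_order_acquire); p != nullptr;
                 p = p->next.load(std::memory_order_acquire))
                sum += p->value;                      // walk the shared structure
            proxy_read_unlock();
        }
        return sum;
    }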
smrproxy is the version that doesn't need the seq_cst memory barrier,
so it is pretty fast (you are welcome).
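To illustrate the difference (generic epoch-style sketch, not the
actual smrproxy code): a symmetric scheme pays for a full fence on
every read-side entry, while an asymmetric one keeps the reader cheap
and makes the writer pay instead:

    #include <atomic>

    std::atomic<unsigned>              writer_epoch{0};
    thread_local std::atomic<unsigned> my_epoch{0};   // per-reader slot

    void read_enter_classic()
    {
        my_epoch.store(writer_epoch.load(std::memory_order_relaxed),
                       std::memory_order_relaxed);
        std::atomic_thread_fence(std::memory_order_seq_cst);  // full fence every entry
    }

    void read_enter_asymmetric()
    {
        my_epoch.store(writer_epoch.load(std::memory_order_relaxed),
                       std::memory_order_relaxed);
        std::atomic_signal_fence(std::memory_order_seq_cst);   // compiler barrier only;
        // the writer compensates with its own heavyweight step
        // (e.g. membarrier() on Linux, or just waiting long enough)
    }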
arc, rwlock, and mutex use interlocked instructions which cause cache
thrashing. mutex will not scale well with the number of threads on
top of that. rwlock depends on how much write locking is going on.
With few write updates, it will look more like arc.
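In caricature, the read-side difference looks like this (hypothetical
names; the real proxies are more involved):

    #include <atomic>

    std::atomic<long> refcount{1};    // shared by all readers: one hot cache line

    void arc_reader_enter() { refcount.fetch_add(1, std::memory_order_acquire); } // interlocked RMW,
    void arc_reader_exit()  { refcount.fetch_sub(1, std::memory_order_release); } // line bounces between cores

    thread_local int reader_slot;     // an smr-style reader touches only
                                      // per-thread state on its fast path,
    void smr_like_reader_enter() { reader_slot = 1; }   // so there is nothing to thrash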
Timings are for 8 reader threads, 1 writer thread on a
4 core/8 hw thread machine.
There are going to be applications where that 2 to 3+ orders of
magnitude difference in overhead is going to matter a lot.
Joe Seigh