mitchalsup@aol.com (MitchAlsup1) writes:
> The point is that the cost of not getting allocated into a register
> is vastly lower--the count of instructions remains 1 while the
> latency increases. That increase in latency does not hurt those
> use once/seldom variables.
Latency is not the issue in modern high-performance AMD64 cores, which
have zero-cycle store-to-load forwarding
<http://www.complang.tuwien.ac.at/anton/memdep/>.
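A minimal sketch of how one can observe this (my own illustration, not
the benchmark behind that page): a loop whose recurrence runs through a
store and a dependent reload of the same address, so its speed is
governed by the store-to-load forwarding latency.

/* Sketch only: the loop-carried chain is store -> reload -> add.  With
   zero-cycle store-to-load forwarding, the memory round trip should add
   little or nothing over a pure register add chain. */
#include <stdio.h>

int main(void)
{
  volatile long mem = 0;   /* keeps the store and the reload in the loop */
  long x = 0;

  for (long i = 0; i < 100000000L; i++) {
    mem = x;               /* store x */
    x = mem + 1;           /* reload it and add: the dependence chain */
  }
  printf("%ld\n", x);      /* time the run, e.g. with "time ./a.out" */
  return 0;
}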
And yet, putting variables in registers gives a significant speedup:
On a Rocket Lake, numbers are times in seconds:
sieve  bubble matrix fib    fft
0.075  0.070  0.036  0.049  0.017  TOS in reg, RP in reg, IP in reg
0.100  0.149  0.054  0.106  0.037  TOS in mem, RP in mem, IP write-through to mem
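That is a slowdown by a factor of about 1.3 (sieve), 2.1 (bubble), 1.5
(matrix), 2.2 (fib), and 2.2 (fft) for the memory-based variant.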
In the first line, I used gforth-fast and tried to disable all
optimizations except those that keep certain variables in registers:
gforth-fast --ss-states=1 --ss-number=31 --opt-ip-updates=0 onebench.fs
I could not reduce the number of static superinstructions below 31 and
still get a result; I will have to investigate why, but it probably
does not make much of a difference for several of these benchmarks.
In the second line I used gforth, an engine that keeps the top of
stack in memory, keeps the return-stack pointer in memory, stores IP
to memory after every change, and does not use static
superinstructions, all in order to make it easier to identify where an
error happened.
> In the examples cited, the lack of register allocation triples
> the instruction count due to lack of LD-OP and LD-OP-ST. The
> register count I stated is how many registers a non-LD-OP
> machine would need to break even on the instruction count.
What makes you think that instruction count is particularly relevant?
Yes, you may save some decoding resources if you use LD-OP-ST on an
architecture that supports it, but you first had to invest in a more
complex decoder. And in the OoO engine the difference may be gone (at
least on Intel CPUs).
Consider the Forth program
: squared dup * ;
This results in the following code sequences for the two engines
mentioned above:
dup 1->1                  dup 0->0
                          mov $50[r13],r15
add rbx,$08               add r15,$08
mov $00[r13],r8           mov rax,[r14]
sub r13,$08               sub r14,$08
                          mov [r14],rax

* 1->1                    * 0->0
                          mov $50[r13],r15
add rbx,$08               add r15,$08
                          mov rax,$08[r14]
imul r8,$08[r13]          imul rax,[r14]
add r13,$08               add r14,$08
                          mov [r14],rax

;s 1->1                   ;s 0->0
                          mov $50[r13],r15
                          mov rax,$58[r13]
mov rbx,[r14]             mov r10,[rax]
add r14,$08               add rax,$08
                          mov $58[r13],rax
                          mov r15,r10
mov rax,[rbx]             mov rcx,[r15]
jmp rax                   jmp rcx

TOS=r8, RP=r14, IP=rbx    TOS=[r14], RP=$58[r13], IP=r15/$50[r13]
The registers are allocated differently in the two engines; for the
three things where the memory/register allocation differed, I have
shown the allocation.
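To make the difference concrete, here is a minimal C sketch of the two
ways of treating the top of stack (my own illustration with made-up
names, not gforth's source; the IP and RP handling is left out).  With
a cached TOS the value is a plain C variable that the compiler can keep
in a register; without it, every primitive loads it from and stores it
to the memory stack:

#include <stdio.h>

typedef long cell;

/* Cached TOS (like the 1->1 engine): the top of stack is passed around
   as a value, so the compiler can keep it in a register; dup is one
   store plus a stack-pointer adjustment. */
static cell *dup_tos_reg(cell tos, cell *sp)
{
  *--sp = tos;            /* push a copy of TOS onto the memory stack */
  return sp;              /* TOS itself stays in its register */
}

/* Uncached TOS (like the 0->0 engine): the top of stack lives at *sp,
   so dup loads it from memory and stores the copy back. */
static cell *dup_tos_mem(cell *sp)
{
  cell tos = sp[0];       /* load TOS from the memory stack */
  *--sp = tos;            /* store the copy */
  return sp;
}

int main(void)
{
  cell stack[16];
  cell *sp;

  cell tos = 5;                      /* cached style: tos is a value */
  sp = dup_tos_reg(tos, &stack[16]);
  printf("cached:   tos=%ld pushed=%ld\n", tos, sp[0]);

  stack[8] = 5;                      /* uncached style: TOS starts in memory */
  sp = dup_tos_mem(&stack[8]);
  printf("uncached: top=%ld second=%ld\n", sp[0], sp[1]);
  return 0;
}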
One interesting case is the sequence
7FA02A77133D: mov rax,$58[r13]
7FA02A771341: mov r10,[rax]
7FA02A771344: add rax,$08
7FA02A771348: mov $58[r13],rax
Sure, you could use a load-op-store instruction for adding 8 to
$58[r13], but the mov at 7FA02A771341 still needs the value of
$58[r13] in a register, so apparently gcc (which produced the code
snippets for the individual Forth words above) decided that it is
better to save execution resources than to reduce the number of
instructions (at a higher execution-resource cost) by writing the code
as
mov rax,$58[r13]
add $58[r13],$08
mov r10,[rax]
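For reference, the C-level pattern behind that four-instruction
sequence is roughly the following (a sketch with made-up names, not
gforth's actual source); the comments map each statement to the
instructions above:

#include <stdio.h>

typedef long cell;

/* The return-stack pointer lives in memory (here: a context field),
   and ;s both reads through its old value and increments it. */
struct ctx { cell **rp; };

static cell *semis(struct ctx *c)
{
  cell **rp = c->rp;   /* mov rax,$58[r13]   load RP into a register */
  cell *ip = *rp;      /* mov r10,[rax]      fetch the saved IP */
  rp++;                /* add rax,$08        pop the return stack */
  c->rp = rp;          /* mov $58[r13],rax   write RP back */
  return ip;           /* becomes the new IP */
}

int main(void)
{
  cell code[2];                      /* stand-in for threaded code */
  cell *rstack[4] = { &code[1] };    /* one saved IP on the return stack */
  struct ctx c = { rstack };

  cell *ip = semis(&c);
  printf("new ip = code+%d, rp advanced by %d entry\n",
         (int)(ip - code), (int)(c.rp - rstack));
  return 0;
}

Either way the old value of $58[r13] has to be in a register for the
load of the saved IP; presumably the extra execution-resource cost of
the three-instruction version is that the memory-destination add loads
$58[r13] a second time.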
- anton
--
'Anyone trying for "industrial quality" ISA should avoid undefined behavior.'
  Mitch Alsup, <c17fcd89-f024-40e7-a594-88a85ac10d20o@googlegroups.com>