Newsportal USENET - Re: Tonights Tradeoff

Re: Tonights Tradeoff

Sujet : Re: Tonights Tradeoff
De : robfi680 (at) *nospam* gmail.com (Robert Finch)
Groupes : comp.arch
Date : 10. Sep 2024, 04:59:05

Autres entêtes

Organisation : A noiseless patient Spider
Message-ID : <vbog6d$2p2rc$1@dont-email.me>
References : 1 2 3 4
User-Agent : Mozilla Thunderbird

On 2024-09-08 2:06 p.m., MitchAlsup1 wrote:

On Sun, 8 Sep 2024 3:22:55 +0000, Robert Finch wrote:

On 2024-09-07 10:41 a.m., MitchAlsup1 wrote:
On Sat, 7 Sep 2024 2:27:40 +0000, Robert Finch wrote:
>
Making the scalar register file a subset of the vector register file.
And renaming only vector elements.
>
There are eight elements in a vector register and each element is
128-bits wide. (Corresponding to the size of a GPR). Vector register
file elements are subject to register renaming to allow the full power
of the OoO machine to be used to process vectors. The issue is that with
both the vector and scalar registers present for renaming there are a
lot of registers to rename. It is desirable to keep the number of
renamed registers (including vector elements) <= 256 total. So, the 64
scalar registers are aliased with the first eight vector registers.
Leaving only 24 truly available vector registers. Hm. There are 1024
physical registers, so maybe going up to about 300 renamable register
would not hurt.
>
Why do you think a vector register file is the way to go ??
>
I think vector use is somewhat dubious, but they have some uses. In many
cases data can be processed just fine without vector registers. In the
current project vector instructions use the scalar functional units to
compute, making them no faster than scalar calcs. But vectors have a lot
of code density where parallel computation on multiple data items using
a single instruction is desirable. I do not know why people use vector
registers in general, but they are present in some modern architectures.
There is no doubt that much code can utilize vector arrangements, and
that a processor should be very efficient in performing these work
loads.
The problem I see is that CRAY-like vectors vectorize instructions
instead of vectorizing loops. Any kind of flow control within the
loop becomes tedious at best.
On the other hand, the Virtual Vector Method vectorizes loops and
can be implemented such that it performs as well as CRAY-like
vector machines without the overhead of a vector register file.
In actuality there are only 6-bits of HW flip-flops governing
VVM--compared to 4 KBytes for CRAY-1.

Qupls vector registers are 512 bits wide (8 64-bit elements). Bigfoot’s
vector registers are 1024 bits wide (8 128-bit elements).
When properly abstracted, one can dedicate as many or few HW
flip-flops as staging buffers for vector work loads to suit
the implementation at hand. A GBOoO may utilize that 4KB
file of CRAY-1 while the little low power core 3-cache lines.
Both run the same ASM code and both are efficient in their own
sense of "efficient".
So, instead of having ~500 vector instructions and ~1000 SIMD
instructions one has 2 instructions and a medium scale state
machine.

Still trying to grasp the virtual vector method. Been wondering if it can be implemented using renamed registers.
Qupls has RISC-V style vector / SIMD registers. For Q+ every instruction can be a vector instruction, as there are bits indicating which registers are vector registers in the instruction. All the scalar instructions become vector. This cuts down on some of the bloat in the ISA. There is only a handful of vector specific instructions (about eight I think). The drawback is that the ISA is 48-bits wide. However, the code bloat is less than 50% as some instructions have dual-operations. Branches can increment or decrement and loop. Bigfoot uses a postfix word to indicate to use the vector form of the instruction. Bigfoot’s code density is a lot better being variable length, but I suspect it will not run as fast. Bigfoot and Q+ share a lot of the same code. Trying to make the guts of the cores generic.

Les messages affichés proviennent d'usenet.

Date	Sujet	#	Auteur
7 Sep 24	Tonights Tradeoff	99	Robert Finch
7 Sep 24	Re: Tonights Tradeoff	98	MitchAlsup1
8 Sep 24	Re: Tonights Tradeoff	97	Robert Finch
8 Sep 24	Re: Tonights Tradeoff	96	MitchAlsup1
10 Sep 24	Re: Tonights Tradeoff	95	Robert Finch
10 Sep 24	Re: Tonights Tradeoff	17	BGB
10 Sep 24	Re: Tonights Tradeoff	12	Robert Finch
10 Sep 24	Re: Tonights Tradeoff	10	BGB
11 Sep 24	Re: Tonights Tradeoff	9	Robert Finch
11 Sep 24	Re: Tonights Tradeoff	7	Stephen Fuld
11 Sep 24	Re: Tonights Tradeoff	1	MitchAlsup1
12 Sep 24	Re: Tonights Tradeoff	5	Robert Finch
12 Sep 24	Re: Tonights Tradeoff	4	MitchAlsup1
12 Sep 24	Re: Tonights Tradeoff	3	Robert Finch
12 Sep 24	Re: Tonights Tradeoff	2	MitchAlsup1
13 Sep 24	Re: Tonights Tradeoff	1	MitchAlsup1
12 Sep 24	Re: Tonights Tradeoff	1	BGB
11 Sep 24	Re: Tonights Tradeoff	1	MitchAlsup1
11 Sep 24	Re: Tonights Tradeoff	4	MitchAlsup1
12 Sep 24	Re: Tonights Tradeoff	3	Thomas Koenig
12 Sep 24	Re: Tonights Tradeoff	2	BGB
12 Sep 24	Re: Tonights Tradeoff	1	Robert Finch
11 Sep 24	Re: Tonights Tradeoff	77	MitchAlsup1
15 Sep 24	Re: Tonights Tradeoff	76	Robert Finch
16 Sep 24	Re: Tonights Tradeoff	75	Robert Finch
24 Sep 24	Re: Tonights Tradeoff - Background Execution Buffers	74	Robert Finch
24 Sep 24	Re: Tonights Tradeoff - Background Execution Buffers	73	MitchAlsup1
26 Sep 24	Re: Tonights Tradeoff - Background Execution Buffers	72	Robert Finch
26 Sep 24	Re: Tonights Tradeoff - Background Execution Buffers	71	MitchAlsup1
27 Sep 24	Re: Tonights Tradeoff - Background Execution Buffers	70	Robert Finch
4 Oct 24	Re: Tonights Tradeoff - Background Execution Buffers	69	Robert Finch
4 Oct 24	Re: Tonights Tradeoff - Background Execution Buffers	66	Anton Ertl
4 Oct 24	Re: Tonights Tradeoff - Background Execution Buffers	65	Robert Finch
5 Oct 24	Re: Tonights Tradeoff - Background Execution Buffers	64	Anton Ertl
9 Oct 24	Re: Tonights Tradeoff - Background Execution Buffers	63	Robert Finch
9 Oct 24	Re: Tonights Tradeoff - Background Execution Buffers	3	MitchAlsup1
9 Oct 24	Re: Tonights Tradeoff - Background Execution Buffers	1	Robert Finch
12 Oct 24	Re: Tonights Tradeoff - Background Execution Buffers	1	BGB
12 Oct 24	Re: Tonights Tradeoff - Carry and Overflow	58	Robert Finch
12 Oct 24	Re: Tonights Tradeoff - Carry and Overflow	57	MitchAlsup1
12 Oct 24	Re: Tonights Tradeoff - Carry and Overflow	56	BGB
12 Oct 24	Re: Tonights Tradeoff - Carry and Overflow	55	Robert Finch
13 Oct 24	Re: Tonights Tradeoff - Carry and Overflow	3	MitchAlsup1
13 Oct 24	Re: Tonights Tradeoff - ATOM	2	Robert Finch
13 Oct 24	Re: Tonights Tradeoff - ATOM	1	MitchAlsup1
13 Oct 24	Re: Tonights Tradeoff - Carry and Overflow	1	BGB
31 Oct 24	Page fetching cache controller	50	Robert Finch
31 Oct 24	Re: Page fetching cache controller	1	MitchAlsup1
6 Nov 24	Re: Q+ Fibonacci	48	Robert Finch
17 Apr 25	Re: register sets	47	Robert Finch
17 Apr 25	Re: register sets	46	Stephen Fuld
17 Apr 25	Re: register sets	1	Robert Finch
17 Apr 25	Re: register sets	44	MitchAlsup1
18 Apr 25	Re: register sets	43	Robert Finch
18 Apr 25	Re: register sets	42	MitchAlsup1
20 Apr 25	Re: register sets	41	Robert Finch
21 Apr 25	Re: auto predicating branches	40	Robert Finch
21 Apr 25	Re: auto predicating branches	39	Anton Ertl
21 Apr 25	Is an instruction on the critical path? (was: auto predicating branches)	1	Anton Ertl
21 Apr 25	Re: auto predicating branches	37	MitchAlsup1
22 Apr 25	Re: auto predicating branches	36	Anton Ertl
22 Apr 25	Re: auto predicating branches	1	MitchAlsup1
22 Apr 25	Re: auto predicating branches	34	Anton Ertl
22 Apr 25	Re: auto predicating branches	33	MitchAlsup1
23 Apr 25	Re: auto predicating branches	3	Stefan Monnier
23 Apr 25	Re: auto predicating branches	2	Anton Ertl
25 Apr 25	Re: auto predicating branches	1	MitchAlsup1
23 Apr 25	Re: auto predicating branches	29	Anton Ertl
23 Apr 25	Re: auto predicating branches	28	MitchAlsup1
24 Apr 25	Re: asynch register rename	27	Robert Finch
27 Apr 25	Re: fractional PCs	26	Robert Finch
27 Apr 25	Re: fractional PCs	25	MitchAlsup1
28 Apr 25	Re: fractional PCs	24	Robert Finch
28 Apr 25	Re: fractional PCs	13	MitchAlsup1
29 Apr 25	Re: fractional PCs	12	Robert Finch
5 May 25	Re: control co-processor	11	Robert Finch
5 May 25	Re: control co-processor	10	Al Kossow
5 May 25	Re: control co-processor	9	Stefan Monnier
6 May 25	Re: control co-processor	2	MitchAlsup1
7 May 25	Re: control co-processor	1	MitchAlsup1
7 May 25	Scan chains (was: control co-processor)	6	Stefan Monnier
7 May 25	Re: Scan chains (was: control co-processor)	2	Al Kossow
7 May 25	Re: Scan chains	1	Stefan Monnier
7 May 25	Re: Scan chains	3	MitchAlsup1
7 May 25	Re: Scan chains	2	Stefan Monnier
8 May 25	Re: Scan chains	1	MitchAlsup1
29 Apr 25	Re: fractional PCs	10	Robert Finch
29 Apr 25	Re: fractional PCs	9	MitchAlsup1
30 Apr 25	Re: fractional PCs	8	Robert Finch
30 Apr 25	Re: fractional PCs	6	Thomas Koenig
1 May 25	Re: fractional PCs	1	Robert Finch
2 May 25	Re: fractional PCs	4	moi
2 May 25	Re: millicode, extracode, fractional PCs	2	John Levine
2 May 25	Re: millicode, extracode, fractional PCs	1	moi
2 May 25	Re: fractional PCs	1	moi
30 Apr 25	Re: fractional PCs	1	MitchAlsup1
13 Oct 24	Re: Tonights Tradeoff - Background Execution Buffers	1	Anton Ertl
4 Oct 24	Re: Tonights Tradeoff - Background Execution Buffers	1	BGB
6 Oct 24	Re: Tonights Tradeoff - Background Execution Buffers	1	MitchAlsup1