On Sun, 22 Sep 2024 20:43:38 +0000, Paul A. Clayton wrote:

> On 9/19/24 11:07 AM, EricP wrote:
> [snip]
>> If the multiplier is pipelined with a latency of 5 and throughput
>> of 1, then MULL takes 5 cycles and MULL,MULH takes 6.
>>
>> But those two multiplies still are tossing away 50% of their work.
>
> I do not remember how multipliers are actually implemented -- and
> am not motivated to refresh my memory at the moment -- but I
> thought a multiply low would not need to generate the upper bits,
> so I do not understand where your "50% of their work" is coming
> from. So are you saying the high bits come for free? This seems
> [...]

   +----------+    +----------+
    \ mplier /      \ mcand  /        Big input mux
     +-------+       +-------+
         |               |
         |      +--------+
         |     /        /
         |    /        /
         +---/        /
            /  Tree  /
           /        /--+
          /        /   |
         /        /    |
        +--------+-----+
            hi     low                Products

Two n-bit operands are multiplied into a 2×n-bit result.
{{All the rest is HOW not what}}
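
To make the hi/low outputs in the figure concrete, here is a minimal
C sketch (using the GCC/Clang unsigned __int128 extension; my
illustration, not anything from the posts above). One trip through
the tree yields the full 2*n-bit product, and MUL/MULH merely select
halves of it:

#include <stdint.h>

/* One pass over a 64x64 partial-product array produces the whole
   128-bit product; MUL returns the low half, MULHU the high half,
   of the same value. */
static void mul_full(uint64_t a, uint64_t b, uint64_t *hi, uint64_t *lo)
{
    unsigned __int128 p = (unsigned __int128)a * b;
    *lo = (uint64_t)p;          /* what RISC-V MUL   returns */
    *hi = (uint64_t)(p >> 64);  /* what RISC-V MULHU returns */
}

On x86-64 this compiles to a single widening MUL that leaves both
halves in registers; on RISC-V it becomes exactly the MUL/MULHU pair
under discussion.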

> The power consumption would seem to depend on how frequently both
> [halves are needed]. The high result needs the low result
> carry-out but not the rest of the result. (An approximate multiply
> high for multiply by reciprocal might be useful, avoiding the low
> result work. There might also be ways that a multiplier could be
> configured to also provide bit mixing similar to a middle result
> for generating a hash?)

You save 1/2 of the tree area, but ultimately consume more power.
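
On the "multiply by reciprocal" case mentioned above: that is the
standard compiler trick for dividing by a constant, where only the
high part of a product is kept and the low part is never used. A
minimal C example (the magic constant is the usual one for unsigned
division by 10; my illustration, not something from the thread):

#include <stdint.h>

/* x / 10 computed reciprocal-style: multiply by ceil(2^35 / 10) and
   keep only the upper bits of the 64-bit product.  The low bits are
   dead, which is what makes a multiply-high-only unit attractive. */
static uint32_t div10(uint32_t x)
{
    return (uint32_t)(((uint64_t)x * 0xCCCCCCCDu) >> 35);
}

Note that an exact quotient still needs the true high part, including
the carry out of the discarded low partial products; an approximate
multiply high would need a correction step or a use that tolerates
being off by one.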

> I seem to recall a PowerPC implementation did semi-pipelined
> 32-bit multiplication 16-bits at a time. This presumably saved
> area and power while also facilitating early out for small
> multiplicands, [...]

Interesting. Dadda showed that doubling the size of the tree only
adds one 4-2 compressor delay to the whole calculation.
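
To pin down what such a semi-pipelined, 16-bits-at-a-time scheme
amounts to, here is a rough C sketch of the idea (my illustration,
not the actual PowerPC design): a half-width array is used twice,
and the second pass can be skipped when the upper half of the sliced
operand is zero.

#include <stdint.h>

/* 32x32 -> 64 multiply run through a 16-bit-wide array twice, as a
   stand-in for a half-width, double-pumped multiplier.  If the upper
   16 bits of the sliced operand are zero, the second pass (and its
   power) is skipped. */
static uint64_t mul32_two_pass(uint32_t mcand, uint32_t mplier)
{
    uint64_t p  = (uint64_t)(mcand & 0xFFFFu) * mplier;   /* pass 1    */
    uint32_t hi = mcand >> 16;
    if (hi != 0)                                          /* early out */
        p += ((uint64_t)hi * mplier) << 16;               /* pass 2    */
    return p;
}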

>> <sound of soap box being dragged out>
>>
>> This idea that macro-op fusion is some magic solution is bullshit.
>
> The RISC-V published argument for fusion is not great, but fusion
> [...]

The argument is, at best, of Academic Quality, made by a student at
the time, as a way to justify RISC-V not having certain calculations
that are easy for HW to perform.

>> 1) It's not free.
>
> Neither is increasing the number of opcodes or providing extender
> prefixes. If one wants binary compatibility, non-fusing
> implementations would work. My 66000's CARRY and PRED are
> "extender prefixes", admittedly [...]

I did neither and avoided both.

> [...] be useful to avoid some of this work. (In theory, common
> dependency detection could also be more broadly useful; e.g.,
> operand availability detection and execution/operand routing.)

How so? My 66000 does not provide any explicit declaration what [...]

So useful that it is encoded directly in My 66000 ISA.

>> 5) Any fused instructions leave (multiple) bubbles that should be
>> compacted out or there wasn't much point to doing the fusion.
>
> Yes, but "computing" large immediates is obviously less efficient
> [...] Even with reduced operations per cycle, fusion could still
> provide a net energy benefit.

Here I disagree:: but for a different reason::

In order for RISC-V to use a 64-bit constant as an operand, it has
to execute either:: AUIPC-LD to an area of memory containing the
64-bit constant, or a 6-7 instruction stream to build the constant
inline; whereas an ISA that directly supports 64-bit constants does
not execute any of those.

Thus, while fusion may save power when seen at the "it's my ISA"
level, when seen from the perspective of "it is directly supported
in my ISA" it wastes power.
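
For concreteness, here is roughly what the "6-7 instruction stream"
amounts to, written as C arithmetic with the corresponding
RISC-V-style op noted per step. This is a sketch of the general
lui/addi/slli pattern only; the exact sequence a real assembler
emits depends on the constant and on sign tricks ignored here.

#include <stdint.h>

/* Materializing 0x123456789ABCDEF0 piecewise, the way an ISA without
   wide immediates must: every step below is a dependent ALU op. */
static uint64_t build_imm64(void)
{
    uint64_t x;
    x  = (uint64_t)0x12345 << 12;   /* lui-style: top 20 bits   */
    x += 0x678;                     /* addi-style: next 12 bits */
    x <<= 12;  x += 0x9AB;          /* slli + addi              */
    x <<= 12;  x += 0xCDE;          /* slli + addi              */
    x <<= 8;   x += 0xF0;           /* slli + addi: last 8 bits */
    return x;                       /* = 0x123456789ABCDEF0     */
}

Every step depends on the previous one, so the sequence is serial as
well as long; with the constant carried in the instruction stream it
is zero extra operations.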

>> - register specifier fields are either source or dest, never both
>
> Only with the Compressed extension, I think. The Compressed [...]
> This seems mostly a code density consideration. I think using a
> single name for both a source and a destination is not so
> horrible, but I am not a hardware guy.

All we HW guys want is that wherever a register is specified, it is
specified in exactly 1 field in the instruction. So, if field<a..b>
is used to specify Rd in one instruction, no other field<!a..!b>
specifies the Rd register. RISC-V blew this "requirement".
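
As a concrete illustration of the "one field, one meaning" point
(the bit positions below are the published RV32I/RV64I base-format
ones; the helper names are mine): in the 32-bit formats a decoder
can pull every register specifier from a fixed place before it even
knows the opcode.

#include <stdint.h>

/* Base 32-bit RISC-V formats: rd, rs1 and rs2, when present, always
   sit in the same bit positions, so specifier extraction needs no
   opcode decode at all. */
static inline uint32_t rd (uint32_t insn) { return (insn >>  7) & 0x1F; }
static inline uint32_t rs1(uint32_t insn) { return (insn >> 15) & 0x1F; }
static inline uint32_t rs2(uint32_t insn) { return (insn >> 20) & 0x1F; }

The 16-bit Compressed formats break this: C.ADD, for instance, reads
and writes the register named in bits 11:7, and other C formats put
3-bit register numbers in yet other positions, so the simple
extractors above no longer apply.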