Newsportal USENET - Re: Making Lemonade (Floating-point format changes)

Re: Making Lemonade (Floating-point format changes)

Sujet : Re: Making Lemonade (Floating-point format changes)
De : cr88192 (at) *nospam* gmail.com (BGB)
Groupes : comp.arch
Date : 20. May 2024, 19:33:42

Autres entêtes

Organisation : A noiseless patient Spider
Message-ID : <v2g529$4fn7$1@dont-email.me>
References : 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
User-Agent : Mozilla Thunderbird

On 5/20/2024 7:36 AM, Michael S wrote:

On Mon, 20 May 2024 14:22:00 +0200
Terje Mathisen <terje.mathisen@tmsw.no> wrote:

Michael S wrote:
On Mon, 20 May 2024 09:24:16 +0200
Terje Mathisen <terje.mathisen@tmsw.no> wrote:

Michael S wrote:
On Sun, 19 May 2024 18:37:51 +0200
Terje Mathisen <terje.mathisen@tmsw.no> wrote:

Thomas Koenig wrote:
So, I did some more measurements on the POWER9 machine, and it
came to around 18 cycles per FMA. Compared to the 13 cycles for
the FMA instruction, this actually sounds reasonable.
>
The big problem appears to be that, in this particular
implementation, multiplication is not pipelined, but done by
piecewise by addition. This can be explained by the fact that
this is mostly a decimal unit, with the 128-bit QP just added as
an afterthought, and decimal multiplication does not happen all
that often.
>
A fully pipelined FMA unit capable of 128-bit arithmetic would
be an entirely different beast, I would expect a throughput of
1 per cycle and a latency of (maybe) one cycle more than 64-bit
FMA.
The FMA normalizer has to handle a maximally bad cancellation, so
it needs to be around 350 bits wide. Mitch knows of course but
I'm guessing that this could at least be close to needing an
extra cycle on its own and/or heroic hardware?
>
Terje

>
Why so wide?
Assuming that subnormal multiplier inputs are normalized before
>
They are not, this is part of what you do to make subnormal numbers
exactly the same speed as normal inputs.
>
Terje

>
1. I am not sure that "the same speed" is a worthy goal even for
binary64 (for binary32 it is).
2. It's certainly does not sound like a worthy goal for binary128,
where probability of encountering sub-normal inputs in real user
code, rather than in test vector, is lower than DP by another order
of magnitude,
3. Even if, for reason unclear to me, it is considered the goal, it
can be achieved by introduction of one more pipeline stage
everywhere. Since we are discussing high-latency design akin to
POWER9, the relative cost of another stage would be lower. BTW,
according to POWER9 manual, even for SP/DP FMA the latency is not
constant. It varies from 5 to 7.
>
So, IMHO, what you do to handle sub-normal inputs should depend on
what ends up smaller or faster, not on some abstract principles.
For less important unit, like binary128, 'smaller' would likely take
relative precedence over 'faster'. It's possible that you'll end up
with not doing pre-normalization, but the reason for it would be
different from 'same speed'.
>
Besides, pre-normalization vs wider post-normalization are not the
only available choices. When multiplier is naturally segmented into
57-bit section, there exists, for example, an option of
pre-normalization by full section. It looks very simple on the
front and saves quite a lot of shifter's width on the back.
>
But the best option is probably described in above post by Mitch.
If I understood his post correctly, he suggests to have two
alignment stages: one after multiplication and another one after
add/sub. The shift count for a first stage is calculated from
inputs in parallel with multiplication. The first alignment stage
does not try to achieve a perfect normalizations, but it does
enough for cutting the width of following adder from 3N to 2N+eps.
>
I do agree with Mitch's suggestion: Allow subnormal inputs but do the
partial muls from the top and move the normalization starting point
down for each all-zero input block.
>
In an extreme case (subnormal x subnormal) this would allow you to
discard a lot of partial products.
>
Terje
>
For subnormal x subnormal you don't need result of multiplication at
all. All you need to know is if it's zero or not and what sign.
Even that is needed only in non-default rounding modes and for inexact
flag in default mode.

For most non-tiny formats, the seeming advantage of subnormal numbers seems small, in any case.
But, yeah, in any case I would almost prefer if there could be a separate/cheaper standard, probably mostly aimed at embedded/microcontroller style use-cases (rather than "general purpose"), and would likely relax the requirements a fair bit.
Say, likely target might be, say:
   FADD/FSUB/FMUL;
   Binary16 and Binary32 as high-priority formats;
   Binary64 as optional (but nice to have);
   Probably DAZ/FTZ;
   Potentially allow for truncate-only rounding.
Assumption being that larger or higher precision cases would fall back to software emulation.
Could optionally have some 8-bit FP formats, but 8-bit FP is a little bit too limited for general-purpose use.
Likely main candidates being:
   S.E4.F3 (Bias=7)
   S.E3.F4 (Bias=7|8, ~ Unit Range)
   More or less A-Law without the XOR.
   Though, A-Law can also be interpreted as a ~ 12 bit integer value.
   Annoyingly, exact bias depends on context for this one
   (eg: 8/7/3/0)...
I had also used:
   E4.F4
   E4.F3.S
But, this is wonky (and the possible merit of E4.F3.S is defeated once one also needs S.E4.F3 or S.E3.F4, as these are the "actually used in the wild" formats, so may have been a mistake).

Les messages affichés proviennent d'usenet.

Date	Sujet	#	Auteur
12 May 24	Making Lemonade (Floating-point format changes)	101	John Savard
12 May 24	Re: Making Lemonade (Floating-point format changes)	3	wolfgang kern
15 May 24	Re: Making Lemonade (Floating-point format changes)	2	Michael S
15 May 24	Re: Making Lemonade (Floating-point format changes)	1	BGB
12 May 24	Re: Making Lemonade (Floating-point format changes)	6	Thomas Koenig
12 May 24	Re: Making Lemonade (Floating-point format changes)	5	John Savard
12 May 24	Re: Making Lemonade (Floating-point format changes)	1	John Savard
12 May 24	Re: Making Lemonade (Floating-point format changes)	3	MitchAlsup1
13 May 24	Re: Making Lemonade (Floating-point format changes)	2	John Savard
13 May 24	Re: Making Lemonade (Floating-point format changes)	1	BGB
12 May 24	Re: Making Lemonade (Floating-point format changes)	91	John Dallman
12 May 24	Re: Making Lemonade (Floating-point format changes)	90	Thomas Koenig
13 May 24	Re: Making Lemonade (Floating-point format changes)	89	Michael S
13 May 24	Re: Making Lemonade (Floating-point format changes)	56	Thomas Koenig
14 May 24	Re: Making Lemonade (Floating-point format changes)	55	Michael S
15 May 24	Re: Making Lemonade (Floating-point format changes)	54	Thomas Koenig
15 May 24	Re: Making Lemonade (Floating-point format changes)	53	Michael S
19 May 24	Re: Making Lemonade (Floating-point format changes)	52	Thomas Koenig
19 May 24	Re: Making Lemonade (Floating-point format changes)	3	Michael S
19 May 24	Re: Making Lemonade (Floating-point format changes)	1	MitchAlsup1
20 May 24	Re: Making Lemonade (Floating-point format changes)	1	Thomas Koenig
19 May 24	Re: Making Lemonade (Floating-point format changes)	48	Terje Mathisen
19 May 24	Re: Making Lemonade (Floating-point format changes)	40	Michael S
19 May 24	Re: Making Lemonade (Floating-point format changes)	1	MitchAlsup1
20 May 24	Re: Making Lemonade (Floating-point format changes)	30	Terje Mathisen
20 May 24	Re: Making Lemonade (Floating-point format changes)	29	Michael S
20 May 24	Re: Making Lemonade (Floating-point format changes)	28	Terje Mathisen
20 May 24	Re: Making Lemonade (Floating-point format changes)	27	Michael S
20 May 24	Re: Making Lemonade (Floating-point format changes)	19	BGB
20 May 24	Re: Making Lemonade (Floating-point format changes)	18	MitchAlsup1
20 May 24	Re: Making Lemonade (Floating-point format changes)	1	Chris M. Thomasson
20 May 24	Re: Making Lemonade (Floating-point format changes)	1	Thomas Koenig
21 May 24	Re: Making Lemonade (Floating-point format changes)	15	BGB
21 May 24	Re: Making Lemonade (Floating-point format changes)	12	Thomas Koenig
21 May 24	Re: Making Lemonade (Floating-point format changes)	7	Michael S
21 May 24	Re: Making Lemonade (Floating-point format changes)	5	Terje Mathisen
21 May 24	Re: Making Lemonade (Floating-point format changes)	1	Michael S
21 May 24	Re: Making Lemonade (Floating-point format changes)	3	BGB
22 May 24	Re: Making Lemonade (Floating-point format changes)	2	MitchAlsup1
22 May 24	Re: Making Lemonade (Floating-point format changes)	1	BGB-Alt
21 May 24	Re: Making Lemonade (Floating-point format changes)	1	Thomas Koenig
21 May 24	Re: Making Lemonade (Floating-point format changes)	4	BGB
21 May 24	Re: Making Lemonade (Floating-point format changes)	3	MitchAlsup1
21 May 24	Re: Making Lemonade (Floating-point format changes)	1	BGB
22 May 24	Re: Making Lemonade (Floating-point format changes)	1	Terje Mathisen
21 May 24	Re: Making Lemonade (Floating-point format changes)	2	MitchAlsup1
21 May 24	Re: Making Lemonade (Floating-point format changes)	1	BGB
20 May 24	Re: Making Lemonade (Floating-point format changes)	7	Terje Mathisen
21 May 24	Re: Making Lemonade (Floating-point format changes)	6	Michael S
21 May 24	Re: Making Lemonade (Floating-point format changes)	5	MitchAlsup1
21 May 24	Re: Making Lemonade (Floating-point format changes)	2	Stefan Monnier
22 May 24	Re: Making Lemonade (Floating-point format changes)	1	MitchAlsup1
22 May 24	Re: Making Lemonade (Floating-point format changes)	1	Terje Mathisen
22 May 24	Re: Making Lemonade (Floating-point format changes)	1	MitchAlsup1
20 May 24	binary128 implementation (was: Making Lemonade (Floating-point format changes)	8	Anton Ertl
20 May 24	Re: binary128 implementation	7	Terje Mathisen
23 May 24	Re: binary128 implementation	6	BGB-Alt
23 May 24	Re: binary128 implementation	5	MitchAlsup1
24 May 24	Re: binary128 implementation	4	Terje Mathisen
24 May 24	Re: binary128 implementation	3	BGB-Alt
25 May 24	Re: binary128 implementation	2	MitchAlsup1
25 May 24	Re: binary128 implementation	1	BGB
19 May 24	Re: Making Lemonade (Floating-point format changes)	6	BGB
19 May 24	Re: Making Lemonade (Floating-point format changes)	5	MitchAlsup1
20 May 24	Re: Making Lemonade (Floating-point format changes)	4	BGB
20 May 24	Re: Making Lemonade (Floating-point format changes)	3	MitchAlsup1
20 May 24	Re: Making Lemonade (Floating-point format changes)	2	BGB
20 May 24	Re: Making Lemonade (Floating-point format changes)	1	MitchAlsup1
19 May 24	Re: Making Lemonade (Floating-point format changes)	1	MitchAlsup1
13 May 24	Re: Making Lemonade (Floating-point format changes)	32	BGB
13 May 24	Re: Making Lemonade (Floating-point format changes)	31	MitchAlsup1
14 May 24	Re: Making Lemonade (Floating-point format changes)	22	BGB
14 May 24	Re: Making Lemonade (Floating-point format changes)	21	MitchAlsup1
14 May 24	Re: Making Lemonade (Floating-point format changes)	20	BGB
14 May 24	Re: Making Lemonade (Floating-point format changes)	19	MitchAlsup1
14 May 24	Re: Making Lemonade (Floating-point format changes)	2	Michael S
15 May 24	Re: Making Lemonade (Floating-point format changes)	1	Michael S
14 May 24	Re: Making Lemonade (Floating-point format changes)	1	BGB
16 May 24	Re: Making Lemonade (Floating-point format changes)	15	MitchAlsup1
17 May 24	Re: Making Lemonade (Floating-point format changes)	14	MitchAlsup1
17 May 24	Re: Making Lemonade (Floating-point format changes)	2	MitchAlsup1
18 May 24	Re: Making Lemonade (Floating-point format changes)	1	MitchAlsup1
18 May 24	Re: Making Lemonade (Floating-point format changes)	11	Chris M. Thomasson
19 May 24	Re: Making Lemonade (Floating-point format changes)	10	Chris M. Thomasson
19 May 24	Re: Making Lemonade (Floating-point format changes)	9	Chris M. Thomasson
19 May 24	Re: Making Lemonade (Floating-point format changes)	8	Chris M. Thomasson
20 May 24	Re: Making Lemonade (Floating-point format changes)	7	Chris M. Thomasson
20 May 24	Re: Making Lemonade (Floating-point format changes)	6	Chris M. Thomasson
20 May 24	Re: Making Lemonade (Floating-point format changes)	5	Chris M. Thomasson
24 May 24	Re: Making Lemonade (Floating-point format changes)	4	Chris M. Thomasson
26 May 24	Re: Making Lemonade (Floating-point format changes)	3	George Neuner
27 May 24	Re: Making Lemonade (Floating-point format changes)	1	Chris M. Thomasson
1 Jun 24	Re: Making Lemonade (Floating-point format changes)	1	Chris M. Thomasson
14 May 24	Re: Making Lemonade (Floating-point format changes)	4	Anton Ertl
14 May 24	Re: Making Lemonade (Floating-point format changes)	3	MitchAlsup1
14 May 24	Re: Making Lemonade (Floating-point format changes)	1	MitchAlsup1
14 May 24	Re: Making Lemonade (Floating-point format changes)	1	BGB
10 Jun 24	Re: Making Lemonade (Floating-point format changes)	4	Lawrence D'Oliveiro
10 Jun 24	Re: Making Lemonade (Floating-point format changes)	3	Terje Mathisen
10 Jun 24	Re: Making Lemonade (Floating-point format changes)	2	Niklas Holsti
11 Jun 24	Re: Making Lemonade (Floating-point format changes)	1	Lawrence D'Oliveiro