Newsportal USENET - Re: Continuations

Re: Continuations

Sujet : Re: Continuations
De : mitchalsup (at) *nospam* aol.com (MitchAlsup1)
Groupes : comp.arch
Date : 18. Jul 2024, 00:17:37

Autres entêtes

Organisation : Rocksolid Light
Message-ID : <99f80e5c5452ec87cf6f5a70dcb33863@www.novabbs.org>
References : 1 2 3 4 5 6 7 8 9 10 11 12
User-Agent : Rocksolid Light

On Wed, 17 Jul 2024 20:56:06 +0000, Stephen Fuld wrote:

MitchAlsup1 wrote:
>
On Wed, 17 Jul 2024 18:30:47 +0000, Stephen Fuld wrote:
>
MitchAlsup1 wrote:
>
On Wed, 17 Jul 2024 16:50:27 +0000, Thomas Koenig wrote:
>
MitchAlsup1 <mitchalsup@aol.com> schrieb:
>
What I am talking about is to improve their performance until a
sin() takes about the same number of cycles of FDIV, not 10×
more.
>
Maybe time for a little story.
>
Some unspecified time ago, a colleague did CFD calculations
which included fluid flow (including turbulence modelling and
diffusion) and quite a few chemical reactions together. So, he
evaluated a huge number of Arrhenius equations,
>
k = A * exp(-E_a/(R*T))
>
and because some of the reactions he looked at were highly
exothermic or endothermic, he needed tiny relaxation factors
(aka small steps). His calculaiton spent most of the time
evaluating the Arrhenius equation above many, many, many, many
times.
>
A single calculation took months, and he didn't use weak
hardware.
>
A fully pipelined evaluation of, let's say, four parallel exp
and four parallel fdiv instructions would have reduced his
calculation time by orders of magnitude, and allowed him to
explore the design space instead of just scratching the surface.
>
(By the way, if I had found a reasonable way to incorporate the
Arrhenius equation into your ISA, I would have done so already
:-)
>
   FMUL    Rt,RR,RT
   FDIV    Rt,-RE,Rt
   EXP Rt,Rt
   FMUL    Rk,RA,Rt
>
Does not look "all that bad" to me.
>
So for your GbOoO CPU, how many of the various FP operations, and
the EXP instruction can be done in parallel?
>
FMUL is   4 cycles of latency fully pipelined
FDIV is ~20 cycles of latency not   pipelined
EXP is ~16 cycles of latency not   pipelined
>
They are all performed in the FMAC unit and here the instructions are
serially dependent.
>
So, 44 cycles of latency, a 1-wide machine and a 6-wide machine would
see the same latency; that is, GBOoO is not a differentiator.
>
>
Good, I get that. But Thomas' original discussion of the problem
indicated that it was very parallel, so the question is, in your
design, how many of those calculations can go in in parallel?

The FDIV and EXP instructions consume all the FMAC cycles, so even if
you completely unrolled the loop, you are not going to get more than
6-cycles less in performing the repeated calculations.
A really BIG implementation with 4 FMAC units per core could unroll
the loop (by reservation stations) such that each iteration would
still be 44-cycles, but you could run 4 in parallel and achieve 4
results every 44-cycles--which to most people smells like 1 result
every 11-cycles.
{Would be an interesting reservation station design, though}

>
>
>

Les messages affichés proviennent d'usenet.

Date	Sujet	#	Auteur
13 Jul 24	Continuations	138	Lawrence D'Oliveiro
13 Jul 24	Re: Continuations	4	BGB
14 Jul 24	Re: Continuations	2	aph
15 Jul 24	Re: Continuations	1	Lawrence D'Oliveiro
14 Jul 24	Re: Continuations	1	Anton Ertl
13 Jul 24	Re: Continuations	23	John Dallman
14 Jul 24	Re: Continuations	21	Lawrence D'Oliveiro
14 Jul 24	Re: Continuations	20	George Neuner
14 Jul 24	Re: Continuations	19	John Levine
14 Jul 24	Re: Continuations	18	Niklas Holsti
14 Jul 24	Re: Continuations	16	John Levine
15 Jul 24	Re: Continuations	1	Terje Mathisen
15 Jul 24	Re: Continuations	1	John Levine
15 Jul 24	Re: Continuations	9	Niklas Holsti
16 Jul 24	Re: Continuations	8	Lawrence D'Oliveiro
16 Jul 24	Re: Continuations	7	John Levine
16 Jul 24	Re: Continuations	1	Chris M. Thomasson
16 Jul 24	Re: Continuations	5	Lawrence D'Oliveiro
16 Jul 24	Re: Continuations	4	John Levine
16 Jul 24	Re: Continuations	3	Lawrence D'Oliveiro
16 Jul 24	Re: Continuations	2	MitchAlsup1
17 Jul 24	Re: Continuations	1	Lawrence D'Oliveiro
16 Jul 24	Re: Continuations	3	Lawrence D'Oliveiro
16 Jul 24	Re: Continuations	2	MitchAlsup1
16 Jul 24	Re: Continuations	1	Lawrence D'Oliveiro
16 Jul 24	Re: Continuations	1	MitchAlsup1
16 Jul 24	Re: Continuations	1	Lawrence D'Oliveiro
14 Jul 24	Re: Continuations	1	BGB
13 Jul 24	Re: Continuations	1	BGB
14 Jul 24	Re: Continuations	10	Lawrence D'Oliveiro
15 Jul 24	Re: Continuations	7	Thomas Koenig
15 Jul 24	Re: Continuations	6	Thomas Koenig
16 Jul 24	Re: Continuations	4	Thomas Koenig
16 Jul 24	Re: Continuations	2	MitchAlsup1
17 Jul 24	Re: Continuations	1	Lawrence D'Oliveiro
17 Jul 24	Re: Continuations	1	Lawrence D'Oliveiro
17 Jul 24	Re: Continuations	1	John Dallman
15 Jul 24	Re: Continuations	1	Lawrence D'Oliveiro
16 Jul 24	Re: Continuations	1	John Levine
14 Jul 24	Re: Continuations	1	George Neuner
14 Jul 24	Re: Continuations	92	John Savard
14 Jul 24	Re: Continuations	1	BGB
15 Jul 24	Re: Continuations	90	Lawrence D'Oliveiro
16 Jul 24	Re: Continuations	89	John Savard
16 Jul 24	Re: Continuations	2	MitchAlsup1
17 Jul 24	Re: Continuations	1	Lawrence D'Oliveiro
16 Jul 24	Re: Continuations	86	MitchAlsup1
17 Jul 24	Re: Continuations	69	John Savard
17 Jul 24	Re: Continuations	68	MitchAlsup1
17 Jul 24	Re: Continuations	67	Thomas Koenig
17 Jul 24	Re: Continuations	1	Thomas Koenig
17 Jul 24	Re: Continuations	1	Michael S
17 Jul 24	Re: Continuations	37	MitchAlsup1
17 Jul 24	Re: Continuations	36	Stephen Fuld
17 Jul 24	Re: Continuations	35	MitchAlsup1
17 Jul 24	Re: Continuations	22	Stephen Fuld
18 Jul 24	Re: Continuations	8	MitchAlsup1
18 Jul 24	Re: Continuations	1	Michael S
18 Jul 24	Re: Continuations	6	MitchAlsup1
19 Jul 24	Re: Continuations	1	Stephen Fuld
21 Jul 24	Re: Reservation stations [was Continuations]	2	Anton Ertl
21 Jul 24	Re: Reservation stations [was Continuations]	1	MitchAlsup1
21 Jul 24	Re: Reservation stations [was Continuations]	2	MitchAlsup1
22 Jul 24	IPC (was: Reservation stations)	1	Anton Ertl
18 Jul 24	Re: Continuations	11	Thomas Koenig
18 Jul 24	Re: Continuations	10	Michael S
18 Jul 24	Re: Continuations	9	Thomas Koenig
18 Jul 24	Re: Continuations	8	Michael S
18 Jul 24	Re: Continuations	6	Thomas Koenig
18 Jul 24	Re: Continuations	1	Michael S
18 Jul 24	Re: Continuations	4	Michael S
19 Jul 24	Re: Continuations	3	Thomas Koenig
19 Jul 24	Re: Continuations	2	Michael S
20 Jul 24	Re: Continuations	1	Thomas Koenig
18 Jul 24	Re: Continuations	1	MitchAlsup1
18 Jul 24	Re: Continuations	2	John Savard
18 Jul 24	Re: Continuations	1	Thomas Koenig
18 Jul 24	Re: Continuations	6	Thomas Koenig
18 Jul 24	Re: Continuations	5	Michael S
18 Jul 24	Re: Continuations	4	Michael S
18 Jul 24	Re: Continuations	3	Thomas Koenig
18 Jul 24	Re: Continuations	2	MitchAlsup1
20 Jul 24	Re: Continuations	1	Thomas Koenig
18 Jul 24	Non-pipelined FDIV/SQRT (was: Continuations)	3	Stefan Monnier
18 Jul 24	Re: Non-pipelined FDIV/SQRT	1	MitchAlsup1
28 Jul 24	Re: Non-pipelined FDIV/SQRT	1	Michael S
18 Jul 24	Re: Continuations	3	MitchAlsup1
28 Jul 24	Re: Continuations	2	Paul A. Clayton
28 Jul 24	Re: Continuations	1	Michael S
19 Jul 24	Re: Continuations	27	Terje Mathisen
19 Jul 24	Re: Continuations	5	Thomas Koenig
19 Jul 24	Re: Continuations	1	Chris M. Thomasson
19 Jul 24	Re: Continuations	3	MitchAlsup1
20 Jul 24	Re: Continuations	1	Terje Mathisen
20 Jul 24	Re: Continuations	1	Thomas Koenig
19 Jul 24	Re: Continuations	21	MitchAlsup1
19 Jul 24	Re: Continuations	8	Terje Mathisen
22 Jul 24	Re: Continuations	7	Michael S
22 Jul 24	Re: Continuations	3	MitchAlsup1
22 Jul 24	Re: Continuations	2	Michael S
23 Jul 24	Re: Continuations	1	MitchAlsup1
23 Jul 24	Re: Continuations	3	Terje Mathisen
19 Jul 24	Faster div or 1/sqrt approximations (was: Continuations)	12	Thomas Koenig
17 Jul 24	Re: Continuations	3	Lawrence D'Oliveiro
17 Jul 24	Re: Continuations	12	Stephen Fuld
17 Jul 24	Re: fancy instructions, Continuations	1	John Levine
15 Jul 24	Re: Continuations	1	wolfgang kern
15 Jul 24	Re: pessimal storage allocation, Continuations	3	John Levine
15 Jul 24	Re: Continuations	1	MitchAlsup1
15 Jul 24	Re: Continuations	1	Lynn Wheeler