Re: auto predicating branches

Liste des GroupesRevenir à c arch 
Sujet : Re: auto predicating branches
De : monnier (at) *nospam* iro.umontreal.ca (Stefan Monnier)
Groupes : comp.arch
Date : 23. Apr 2025, 03:59:10
Autres entêtes
Organisation : A noiseless patient Spider
Message-ID : <jwvcyd338k2.fsf-monnier+comp.arch@gnu.org>
References : 1 2 3 4 5 6 7 8 9 10 11 12 13 14
User-Agent : Gnus/5.13 (Gnus v5.13)
I do not see 2 LDDs being performed parallel unless the execution
width is at least 14-wide. In any event loop recurrence restricts the

IIUC you'll get multiple loads in parallel if the loads take a long time
because of cache misses.  Say each load takes 100 cycles, then there is
plenty of time during one load to predict many more iterations of the
loop and hence issue many more loads.

With a branching code, the addresses of those loads depend mostly on the
branch predictions, so the branch predictions end up performing a kind
of "value prediction" (where the value that's predicted is the address
of the next lookup).

With predication your load address will conceptually depend on 3 inputs:
the computation of `base + middle`, the computation of  `base + 0`, and
the computation of the previous `needle < *base[middle]` test to choose
between the first two.  If the LD of `*base[middle]` takes 100 cycle,
that means a delay of 100 cycles before the next LD can be issued.

Of course, nothing prevents a CPU from doing "predicate prediction":
instead of waiting for an answer to `needle < *base[middle]`, it could
try and guess whether it will be true or false and thus choose to send
one of the two addresses (or both) to the memory (and later check the
prediction and rollback, just like we do with normal branches).


        Stefan

Date Sujet#  Auteur
4 Oct 24 * Re: Tonights Tradeoff - Background Execution Buffers78Robert Finch
4 Oct 24 +* Re: Tonights Tradeoff - Background Execution Buffers75Anton Ertl
4 Oct 24 i`* Re: Tonights Tradeoff - Background Execution Buffers74Robert Finch
5 Oct 24 i `* Re: Tonights Tradeoff - Background Execution Buffers73Anton Ertl
9 Oct 24 i  `* Re: Tonights Tradeoff - Background Execution Buffers72Robert Finch
9 Oct 24 i   +* Re: Tonights Tradeoff - Background Execution Buffers3MitchAlsup1
9 Oct 24 i   i+- Re: Tonights Tradeoff - Background Execution Buffers1Robert Finch
12 Oct 24 i   i`- Re: Tonights Tradeoff - Background Execution Buffers1BGB
12 Oct 24 i   +* Re: Tonights Tradeoff - Carry and Overflow67Robert Finch
12 Oct 24 i   i`* Re: Tonights Tradeoff - Carry and Overflow66MitchAlsup1
12 Oct 24 i   i `* Re: Tonights Tradeoff - Carry and Overflow65BGB
12 Oct 24 i   i  `* Re: Tonights Tradeoff - Carry and Overflow64Robert Finch
13 Oct 24 i   i   +* Re: Tonights Tradeoff - Carry and Overflow3MitchAlsup1
13 Oct 24 i   i   i`* Re: Tonights Tradeoff - ATOM2Robert Finch
13 Oct 24 i   i   i `- Re: Tonights Tradeoff - ATOM1MitchAlsup1
13 Oct 24 i   i   +- Re: Tonights Tradeoff - Carry and Overflow1BGB
31 Oct 24 i   i   `* Page fetching cache controller59Robert Finch
31 Oct 24 i   i    +- Re: Page fetching cache controller1MitchAlsup1
6 Nov 24 i   i    `* Re: Q+ Fibonacci57Robert Finch
17 Apr 25 i   i     `* Re: register sets56Robert Finch
17 Apr 25 i   i      +* Re: register sets53Stephen Fuld
17 Apr 25 i   i      i+- Re: register sets1Robert Finch
17 Apr 25 i   i      i+* Re: register sets46MitchAlsup1
18 Apr 25 i   i      ii`* Re: register sets45Robert Finch
18 Apr 25 i   i      ii `* Re: register sets44MitchAlsup1
20 Apr 25 i   i      ii  `* Re: register sets43Robert Finch
21 Apr 25 i   i      ii   `* Re: auto predicating branches42Robert Finch
21 Apr 25 i   i      ii    `* Re: auto predicating branches41Anton Ertl
21 Apr 25 i   i      ii     +- Is an instruction on the critical path? (was: auto predicating branches)1Anton Ertl
21 Apr 25 i   i      ii     `* Re: auto predicating branches39MitchAlsup1
22 Apr 25 i   i      ii      `* Re: auto predicating branches38Anton Ertl
22 Apr 25 i   i      ii       +- Re: auto predicating branches1MitchAlsup1
22 Apr 25 i   i      ii       `* Re: auto predicating branches36Anton Ertl
22 Apr 25 i   i      ii        `* Re: auto predicating branches35MitchAlsup1
23 Apr 25 i   i      ii         +* Re: auto predicating branches3Stefan Monnier
23 Apr 25 i   i      ii         i`* Re: auto predicating branches2Anton Ertl
25 Apr 25 i   i      ii         i `- Re: auto predicating branches1MitchAlsup1
23 Apr 25 i   i      ii         `* Re: auto predicating branches31Anton Ertl
23 Apr 25 i   i      ii          `* Re: auto predicating branches30MitchAlsup1
24 Apr 25 i   i      ii           `* Re: asynch register rename29Robert Finch
27 Apr 25 i   i      ii            `* Re: fractional PCs28Robert Finch
27 Apr 25 i   i      ii             `* Re: fractional PCs27MitchAlsup1
28 Apr 25 i   i      ii              `* Re: fractional PCs26Robert Finch
28 Apr 25 i   i      ii               +* Re: fractional PCs15MitchAlsup1
29 Apr 25 i   i      ii               i`* Re: fractional PCs14Robert Finch
5 May 25 i   i      ii               i `* Re: control co-processor13Robert Finch
5 May 25 i   i      ii               i  `* Re: control co-processor12Al Kossow
5 May 25 i   i      ii               i   `* Re: control co-processor11Stefan Monnier
6 May 25 i   i      ii               i    +* Re: control co-processor3MitchAlsup1
7 May 25 i   i      ii               i    i+- Re: control co-processor1MitchAlsup1
15 Jul 25 i   i      ii               i    i`- Re: control co-processor1MitchAlsup1
7 May 25 i   i      ii               i    `* Scan chains (was: control co-processor)7Stefan Monnier
7 May 25 i   i      ii               i     +* Re: Scan chains (was: control co-processor)2Al Kossow
7 May 25 i   i      ii               i     i`- Re: Scan chains1Stefan Monnier
7 May 25 i   i      ii               i     +* Re: Scan chains3MitchAlsup1
7 May 25 i   i      ii               i     i`* Re: Scan chains2Stefan Monnier
8 May 25 i   i      ii               i     i `- Re: Scan chains1MitchAlsup1
15 Jul 25 i   i      ii               i     `- Re: Scan chains1MitchAlsup1
29 Apr 25 i   i      ii               `* Re: fractional PCs10Robert Finch
29 Apr 25 i   i      ii                `* Re: fractional PCs9MitchAlsup1
30 Apr 25 i   i      ii                 `* Re: fractional PCs8Robert Finch
30 Apr 25 i   i      ii                  +* Re: fractional PCs6Thomas Koenig
1 May 25 i   i      ii                  i+- Re: fractional PCs1Robert Finch
2 May 25 i   i      ii                  i`* Re: fractional PCs4moi
2 May 25 i   i      ii                  i +* Re: millicode, extracode, fractional PCs2John Levine
2 May 25 i   i      ii                  i i`- Re: millicode, extracode, fractional PCs1moi
2 May 25 i   i      ii                  i `- Re: fractional PCs1moi
30 Apr 25 i   i      ii                  `- Re: fractional PCs1MitchAlsup1
15 Jul 25 i   i      i`* Re: register sets5John Savard
15 Jul 25 i   i      i `* Re: register sets4MitchAlsup1
19 Jul 25 i   i      i  `* Re: register sets3Robert Finch
19 Jul 25 i   i      i   `* Re: register sets2Anton Ertl
19 Jul 25 i   i      i    `- Re: register sets1MitchAlsup1
15 Jul 25 i   i      `* Re: register sets2John Savard
15 Jul 25 i   i       `- Re: register sets1MitchAlsup1
13 Oct 24 i   `- Re: Tonights Tradeoff - Background Execution Buffers1Anton Ertl
4 Oct 24 +- Re: Tonights Tradeoff - Background Execution Buffers1BGB
6 Oct 24 `- Re: Tonights Tradeoff - Background Execution Buffers1MitchAlsup1

Haut de la page

Les messages affichés proviennent d'usenet.

NewsPortal