Newsportal USENET - Re: Decrement And Branch

Re: Decrement And Branch

Sujet : Re: Decrement And Branch
De : mitchalsup (at) *nospam* aol.com (MitchAlsup1)
Groupes : comp.arch
Date : 13. Aug 2024, 18:18:13

Autres entêtes

Organisation : Rocksolid Light
Message-ID : <6bb58806dc9aad223e36c238bebf6b78@www.novabbs.org>
References : 1 2
User-Agent : Rocksolid Light

On Tue, 13 Aug 2024 13:28:07 +0000, Anton Ertl wrote:

Lawrence D'Oliveiro <ldo@nz.invalid> writes:
The original designers of POWER clearly thought there was a point to
having such instructions; do you agree?
>
Sure. The question is what it was. Maybe they wanted to look good on
some kernels. In the same vein they also added loads and stores with
update (i.e., autoincrement/decrement addressing), and in one version
of the architecture reference manual I found the warning that these
may be as slow as a separate load and update.
>
AMD64 has LOOP. I looked at it here several times. Theoretically one
can branch-predict it perfectly, but when I measured that
<2016Jun16.103617@mips.complang.tuwien.ac.at>
<2017Mar14.183125@mips.complang.tuwien.ac.at>, I found that they just
use history-based branch prediction for these instructions like
everybody else.
>
I think that the major reason is that in an OoO CPU the OoO part would
need to move the count to the front end, and either let the front end
wait until that is done, or introduce some mechanism to let the front
end run ahead and, when the count finally becomes available to the
front end, update it to the right value where the front end is now.

Actually that is not necessary, but there are additional advantages.
Imagine a GBOoO machine with reservation stations and one runs into
a recognizable loop. Once the RSs are setup, one turns off the FETCH
stage, adds an increment to each station, and then each time the
loop instruction is encountered, you just fire off the RSs again.
This saves around 1/3 of the power being consumed at no loss in
perf.

>
Moreover, at least some AMD64 CPUs take more cycles for a LOOP than
for the equivalent "sub; jne" sequence
<2017Mar15.141411@mips.complang.tuwien.ac.at>
>
- anton

Les messages affichés proviennent d'usenet.

Date	Sujet	#	Auteur
13 Aug 24	Decrement And Branch	24	Lawrence D'Oliveiro
13 Aug 24	Re: Decrement And Branch	2	Anton Ertl
13 Aug 24	Re: Decrement And Branch	1	MitchAlsup1
13 Aug 24	Re: Decrement And Branch	1	MitchAlsup1
13 Aug 24	Re: Decrement And Branch	18	Lawrence D'Oliveiro
14 Aug 24	Re: Decrement And Branch	3	MitchAlsup1
14 Aug 24	Re: Decrement And Branch	2	Lawrence D'Oliveiro
15 Aug 24	Re: Decrement And Branch	1	MitchAlsup1
14 Aug 24	Re: Decrement And Branch	14	Anton Ertl
15 Aug 24	Re: Decrement And Branch	4	Lawrence D'Oliveiro
15 Aug 24	Re: Decrement And Branch	1	MitchAlsup1
15 Aug 24	Re: Decrement And Branch	2	Anton Ertl
16 Aug 24	Re: Decrement And Branch	1	Lawrence D'Oliveiro
15 Aug 24	Re: Decrement And Branch	9	MitchAlsup1
15 Aug 24	Re: Decrement And Branch	8	Anton Ertl
15 Aug 24	Re: Decrement And Branch	5	MitchAlsup1
16 Aug 24	Instruction counts (was: Decrement And Branch)	4	Anton Ertl
16 Aug 24	Re: Instruction counts (was: Decrement And Branch)	2	Lawrence D'Oliveiro
16 Aug 24	Re: Instruction counts (was: Decrement And Branch)	1	Anton Ertl
16 Aug 24	Re: Instruction counts	1	MitchAlsup1
15 Aug 24	Re: Decrement And Branch	1	MitchAlsup1
9 Sep 24	Re: Decrement And Branch	1	Kent Dickey
16 Aug 24	Re: Decrement And Branch	2	quadibloc
16 Aug 24	Re: Decrement And Branch	1	MitchAlsup1