Anton writes:
>David Brown <david.brown@hesbynett.no> writes:
>>OK.
>>
>>Anton writes code that seriously pushes the boundary of what can be
>>achieved. For at least some of the things he does (such as GForth)
>>he is trying to squeeze every last drop of speed out of the target.
>>And he is /really/ good at it. But that means he is forever relying
>>on nuances about code generation. His code, at least for efficiency
>>if not for correctness, is dependent on details far beyond what is
>>specified and documented for C and for the gcc compiler. He might
>>spend a long time working with his code and a version of gcc,
>>fine-tuning the details of his source code to get out exactly the
>>assembly he wants from the compiler.
>
>No. We distribute Gforth as source code. It works for a wide variety
>of architectures and compilers. So unlike what you suggest, and what
>some people have suggested earlier to avoid problems with new
>"optimizations" in newer releases of gcc, we don't concentrate on a
>specific version of gcc.
As long as you are sticking to defined behaviour (defined by the C
standards, or by the gcc documentation), and use specified C standard
versions in the build, then there should not be any incorrect
behaviour in different versions. Performance might regress, and of
course there's always the risk of bugs.

>>Of course it is frustrating for him when the next version of
>>gcc generates very different assembly from that same source, but he
>>is not really programming at the level of C, and he should not
>>expect consistency from C compilers like he does.
>
>It's normal and no problem when the next version of gcc generates
>different assembly language. There are some basic assumptions that
>our code relies on, and those mostly do not change between gcc
>versions.
>An essential assumption is that, when we have:
>
>A:
>  C code
>B:
>
>... that when we take &&A and &&B (which is documented in the GNU C
>manual), we get addresses pointing to the start and the end of the
>machine code corresponding to the C code.

I don't see anything in the gcc reference manual suggesting that &&B
is the end of the corresponding code. What you get - all you get - is
that "goto * &&A" gives the same effect as "goto A".
>In the days starting with gcc-3.0, we found that gcc started
>reordering the basic blocks within loops, so we turned the loops in
>the part of the code that needs such assumptions into separate
>functions. Around gcc-7, gcc started to compile
>
>A: C-code1
>B: C-code2
>C: goto *...
>
>to the same code as
>
>A: C-code1; C-code2; goto *...;
>B: C-code2; goto *...;
>C: goto *...;
>
>I found a workaround that avoids this kind of code generation.

This is all the kind of thing you can expect when you make assumptions
about code generation that are not supported by the documentation.
Compilers can, and do, move code around in various ways - duplicate
it, combine it, unroll it, compress it - whatever gives (or tries to
give; optimisation is not an exact science) better results while
giving the documented behaviour.
>Another problem, from gcc-3.1 to at least gcc-4.4 (intermittently),
>is that gcc compiled
>
>goto *ca;
>
>into the equivalent of
>
>goto gotoca;
>/* and elsewhere */
>gotoca: goto *ca;
>
>We reported that repeatedly. At one point a gcc maintainer gave us
>some bullshit about a possible performance advantage from this
>transformation, of course without presenting any empirical support,
>while we saw a big slowdown in our code. We developed workarounds for
>that, and they are in Gforth to this day; we have not encountered a
>gcc version with this problem for over a decade, but new Gforth
>should also work with old gcc.

Again, the compiler is not doing anything outside its specifications.
What you want here is a guarantee of behaviour that is not defined
anywhere. You are not seeing a bug in the compiler, or an
incompatibility with previous versions - you are seeing the need for a
feature (and a controlling compiler flag) that gcc does not currently
have. It is a potential feature that might be useful to other people
too, while being an anti-feature to others.
>Another assumption is that when we concatenate the code snippet
>between labels A and B (which contains C-code1) and the code snippet
>between labels X and Y (which contains C-code3), executing the result
>will behave like the concatenation of C-code1 and C-code3 in source
>code. This assumption has two aspects:
>
>1) Do the register assignments at the labels fit together? It turns
>   out that we never had a problem with that, and I think that the
>   reason is that the "goto *" can jump to any of those labels (all
>   their addresses are taken), and so the register assignment must be
>   the same right after each label.
>
>   What guarantees that the assignments are the same right before
>   each label? Probably that there is not much between a label and
>   the next "goto *", and that makes the registers at all potential
>   targets live.
>
>2) If we have two pieces of machine code produced in this fashion,
>   does the architecture guarantee that such a concatenation works?
>   It turns out that among general-purpose architectures, all but one
>   do. That includes IA-64. The exception is MIPS, with its
>   architectural load-delay slot (and there are also scheduling
>   restrictions having to do with the hi/lo registers that may be
>   problematic): the first code snippet may end in a load, and the
>   next code snippet may start with an instruction that reads the
>   result of the load. So we just disabled this concatenation on
>   MIPS.
>
>We do a number of things to achieve stability: we do sanity checks on
>the resulting machine-code snippets and fall back to plain threaded
>code if the snippets turn out not to be relocatable.
>
>Also, we enable all the flags for defining behaviour in gcc that we
>find (unfortunately, in the documentation they are intermixed with
>other options). For good measure, this includes
>-fno-delete-null-pointer-checks, although I doubt that it makes a
>difference for our code either way.

(-fno-delete-null-pointer-checks will make no difference to code that
doesn't accidentally use leap-before-you-look checking.)
>One thing that came up about a year ago was that gcc auto-vectorizes
>adjacent memory accesses on AMD64 (apparently the AMD64 port
>maintainers are unhappy because AMD64 does not have instructions like
>ARM A64's ldp and stp :-), which did not impact correctness, but led
>to worse performance (not just in Gforth; I have also seen it in the
>bubble benchmark from John Hennessy's Stanford small integer
>benchmarks; I'm sure there is some SPEC benchmark that benefits). A
>quick addition of -fno-tree-vectorize fixed that.

That happens sometimes. In my brief testing of clang, it often seems a
bit too keen on vectorising code that would be better kept short and
simple. I have no doubt gcc gets that wrong sometimes too.
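Pulling the flags mentioned in this exchange together, a build command might look like this. This is an illustrative sketch, not Gforth's actual build configuration: the file name `engine.c` is hypothetical, and `-fwrapv`/`-fno-strict-aliasing` are my examples of the classic behaviour-defining flags; only the last two flags are named in the discussion above.

```shell
# Illustrative only -- not Gforth's real configure output.
# -fwrapv:               define signed integer overflow as wrapping
# -fno-strict-aliasing:  define cross-type memory accesses
# the last two flags are the ones discussed in this thread
gcc -O2 -fwrapv -fno-strict-aliasing \
    -fno-delete-null-pointer-checks -fno-tree-vectorize \
    -o engine engine.c
```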
>We have been thinking about moving from C to a better-defined
>language, namely assembly language, but have not yet taken the
>plunge, and it has not been necessary yet. Gcc has not been as crazy
>in our experience as the UB rhetoric might make one think. Why is
>that? I think the reasons are:
>
>1) Gforth and a lot of other "irrelevant" (to the gcc maintainers)
>   projects sail in the slipstream of "relevant" code like SPEC and
>   the Linux kernel, which are full of undefined behaviour (Linux
>   defines many of the behaviours with flags, as Gforth does), so
>   gcc does not "optimize" as crazily as a UB fan might wish.
>
>2) The code snippets are very short, with many in-edges at the
>   preceding and following labels, which tends to destroy any
>   "knowledge" that the compiler might have derived from the
>   assumption that the program does not exercise undefined
>   behaviour. This severely limits the distance over which such
>   "optimizations" can be performed.
>
>Nevertheless, the last time I tried compiling without the
>behaviour-defining options, the result did not work; I did not
>investigate this further.

You are looking for more than C and the gcc documented extensions give
you. That is always going to be hard.
The messages shown here come from Usenet.