Newsportal USENET - Re: A Famous Security Bug

On 21/03/2024 18:41, Kaz Kylheku wrote:

On 2024-03-21, David Brown <david.brown@hesbynett.no> wrote:
On 20/03/2024 19:54, Kaz Kylheku wrote:
On 2024-03-20, Stefan Ram <ram@zedat.fu-berlin.de> wrote:
A "famous security bug":
>
void f( void )
{ char buffer[ MAX ];
/* . . . */
memset( buffer, 0, sizeof( buffer )); }
>
. Can you see what the bug is?
>
I don't know about "the bug", but conditions can be identified under
which that would have a problem executing, like MAX being in excess
of available automatic storage.
>
If the /*...*/ comment represents the elision of some security sensitive
code, where the memset is intended to obliterate secret information,
of course, that obliteration is not required to work.
>
After the memset, the buffer has no next use, so all the assignments
performed by memset to the bytes of buffer are dead assignments that can
be elided.
>
To securely clear memory, you have to use a function for that purpose
that is not susceptible to optimization.
>
If you're not doing anything stupid, like link time optimization, an
external function in another translation unit (a function that the
compiler doesn't recognize as being an alias or wrapper for memset)
ought to suffice.
>
Using LTO is not "stupid". Relying on people /not/ using LTO, or not
using other valid optimisations, is "stupid".
LTO is a nonconforming optimization.

Really? That is news to me, and I suspect to the folks at gcc and clang/llvm that developed LTO for these compilers. (I have worked with embedded compilers that have had LTO-type optimisations for decades, but these are not often concerned with the minutiae of the standards.)

It destroys the concept that
when a translation unit is translated, the semantic analysis is
complete, such that the only remaining activity is resolution of
external references (linkage), and that the semantic analysis of one
translation unit does not use information about another translation
unit.

Where is it described in the C standards that semantic information from one translation unit cannot be used (for optimisation, for static error checking, for other analysis or any other purposes) in another translation unit?
What makes you think that LTO, as implemented in compilers like gcc and clang/llvm, does not generate code according to the "as if" rules? (That is, they can generate code that is more optimal, but produces the same observable effects "as if" they were strict dumb translators of the functioning of the C abstract machine.)
I believe there is very little where the behaviour of a C program is different if parts of the code are in one translation unit, or if they are in several. There are utilities that merge multiple C files into single C files (for easier deployment, or for better optimisation). They have to take into account renaming static objects and functions to file-local names, and remove duplicate type definitions, but as long as certain reasonable rules are followed by the programmer, it all goes fine. (You could, I suppose, hit complications if you relied on compatibility of struct or union types across translation units where the identifiers were different and they are compatible across TU's but not within TU's, according to the 6.2.7p1 rules. But that would be unlikely, and I expect LTO compilers to handle those cases.)
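To illustrate the merging point, here is a minimal sketch of what such a utility might emit for two source files that each define their own file-scope static counter. The file names a.c and b.c, the functions, and the renamed identifiers are all hypothetical, chosen only for illustration.

```c
/* Hypothetical merged translation unit.
 *
 * a.c originally contained:
 *     static int counter;
 *     int next_a(void) { return ++counter; }
 * b.c originally contained:
 *     static int counter;
 *     int next_b(void) { return ++counter; }
 *
 * Both "counter" objects have internal linkage, so in one file they
 * would clash. The merge gives each a distinct file-local name. */

static int a_c_counter;   /* was "counter" in a.c */
static int b_c_counter;   /* was "counter" in b.c */

int next_a(void) { return ++a_c_counter; }
int next_b(void) { return ++b_c_counter; }
```

The two counters stay independent after the merge, just as they were when the files were translated separately.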

This has not yet changed in last April's N3096 draft, where
translation phases 7 and 8 are:
7. White-space characters separating tokens are no longer significant.
Each preprocessing token is converted into a token. The resulting
tokens are syntactically and semantically analyzed and translated
as a translation unit.
8. All external object and function references are resolved. Library
components are linked to satisfy external references to functions
and objects not defined in the current translation. All such
translator output is collected into a program image which contains
information needed for execution in its execution environment.
and before that, the Program Structure section says:
The separate translation units of a program communicate by (for
   example) calls to functions whose identifiers have external linkage,
   manipulation of objects whose identifiers have external linkage, or
   manipulation of data files. Translation units may be separately
   translated and then later linked to produce an executable program.

All of that is irrelevant. It says nothing against sharing other information.

LTO deviates from the model that translation units are separate,
and the conceptual steps of phases 7 and 8.

No, it does not. These paragraphs are requirements, not limitations.

The translation unit that is prepared for LTO is not fully cooked. You
have no idea what its code will turn into when the interrupted
compilation is resumed during linkage, under the influence of other
translation units it is combined with.

You have as much and as little idea of what the generated code will be as you always do during compilation. Compilers can do all kinds of manipulations of the source code you write - as long as the observable behaviour of the program is the same as a dumb translation. They can, and do, use all kinds of inter-procedural optimisations for inlining code, outlining it, breaking functions into pieces, cloning them, using constant propagation, and so on. LTO lets them do this across translation units.
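As a rough sketch of the kind of transformation meant here, consider a general helper called with a constant argument. All function names below are hypothetical. The third function shows what an optimiser might effectively produce after inlining and constant propagation; whether a given compiler actually does so depends on the compiler and its options.

```c
/* As written: a general helper and a caller that passes a constant. */
static unsigned scale(unsigned x, unsigned factor)
{
    return x * factor;
}

unsigned times_eight(unsigned x)
{
    return scale(x, 8u);
}

/* What the caller might effectively become once scale() is inlined
 * and the constant 8 is propagated. Unsigned arithmetic wraps, so the
 * shift gives the same result as the multiplication for every x. */
unsigned times_eight_optimised(unsigned x)
{
    return x << 3;
}
```

Both versions return the same value for every input, so no conforming program can tell them apart. With LTO the same reasoning applies even if scale() is defined in a different translation unit from its caller.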

So in fact, the language allows us to take it for granted that, given
my_memset(array, 0, sizeof(array)); }
at the end of a function, and my_memset is an external definition
provided by another translation unit, the call may not be elided.

No, the C language standards make no such guarantee.
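For comparison, one widely used approach that does not rely on translation unit boundaries is to perform the stores through a volatile-qualified lvalue. The sketch below is illustrative only, and secure_zero is a hypothetical name. Compilers in practice do not elide such stores, although whether the standard strictly requires that when the underlying object is not itself defined volatile is debated. Where available, a purpose-built function such as C23's memset_explicit, the optional C11 Annex K memset_s, or the explicit_bzero extension may be preferable.

```c
#include <stddef.h>

/* Hypothetical helper: zero n bytes starting at p.
 * Each store goes through a volatile-qualified lvalue, so the
 * compiler is expected to treat it as an access it must perform. */
void secure_zero(void *p, size_t n)
{
    volatile unsigned char *vp = p;

    while (n--)
        *vp++ = 0;
}
```

A call such as secure_zero(buffer, sizeof buffer) would replace the memset at the end of the function.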

The one who may be acting recklessly is he who turns on nonconforming
optimizations that are not documented as supported by the code base.
Another example would be something like gcc's -ffast-math.

That is /completely/ different. That option is clearly documented as potentially violating some of the rules of the ISO C standards. This is why it is not enabled by default or by any common optimisation levels (except "-Ofast", which is also documented as potentially violating standards).

You wouldn't unleash that on numerical code written by experts,
and expect the same correct results.

I would not expect identical results to floating point calculations, no.
Depending on the code in question, I would still expect correct results. I use "-ffast-math" in all my code in order to get correct results a good deal faster (for my targets, and my type of code) than I would get without it.
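A small sketch of why results need not be identical: floating point addition is not associative, and reassociation is one of the transformations this class of option permits. The function names are hypothetical, and the sketch assumes IEEE 754 double arithmetic compiled without such options.

```c
/* Sum in the order written: (a + b) + c. */
double sum_left(double a, double b, double c)
{
    return (a + b) + c;
}

/* The same three values, reassociated: a + (b + c). */
double sum_right(double a, double b, double c)
{
    return a + (b + c);
}
```

With a = 1e30, b = -1e30 and c = 1.0, sum_left gives 1.0, because a and b cancel exactly before c is added. sum_right gives 0.0, because c is lost to rounding when it is added to b. Whether either answer counts as "correct" depends on what the code needs, which is the distinction being drawn above.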
