Liste des Groupes | Revenir à cl c |
On 2024-03-21, David Brown <david.brown@hesbynett.no> wrote:Really? That is news to me, and I suspect to the folks at gcc and clang/llvm that developed LTO for these compilers. (I have worked with embedded compilers that have had LTO-type optimisations for decades, but these are not often concerned with the minutiae of the standards.)On 20/03/2024 19:54, Kaz Kylheku wrote:LTO is a nonconforming optimization.On 2024-03-20, Stefan Ram <ram@zedat.fu-berlin.de> wrote:>A "famous security bug":>
>
void f( void )
{ char buffer[ MAX ];
/* . . . */
memset( buffer, 0, sizeof( buffer )); }
>
. Can you see what the bug is?
I don't know about "the bug", but conditions can be identified under
which that would have a problem executing, like MAX being in excess
of available automatic storage.
>
If the /*...*/ comment represents the elision of some security sensitive
code, where the memset is intended to obliterate secret information,
of course, that obliteration is not required to work.
>
After the memset, the buffer has no next use, so the all the assignments
performed by memset to the bytes of buffer are dead assignments that can
be elided.
>
To securely clear memory, you have to use a function for that purpose
that is not susceptible to optimization.
>
If you're not doing anything stupid, like link time optimization, an
external function in another translation unit (a function that the
compiler doesn't recognize as being an alias or wrapper for memset)
ought to suffice.
Using LTO is not "stupid". Relying on people /not/ using LTO, or not
using other valid optimisations, is "stupid".
It destroys the concept thatWhere is it described in the C standards that semantic information from one translation unit cannot be used (for optimisation, for static error checking, for other analysis or any other purposes) in another translation unit?
when a translation unit is translated, the semantic analysis is
complete, such that the only remaining activity is resolution of
external references (linkage), and that the semantic analysis of one
translation unit deos not use information about another translation
unit.
This has not yet changed in last April's N3096 draft, whereAll of that is irrelevant. It says nothing against sharing other information.
translation phases 7 and 8 are:
7. White-space characters separating tokens are no longer significant.
Each preprocessing token is converted into a token. The resulting
tokens are syntactically and semantically analyzed and translated
as a translation unit.
8. All external object and function references are resolved. Library
components are linked to satisfy external references to functions
and objects not defined in the current translation. All such
translator output is collected into a program image which contains
information needed for execution in its execution environment.
and before that, the Program Structure section says:
The separate translation units of a program communicate by (for
example) calls to functions whose identifiers have external linkage,
manipulation of objects whose identifiers have external linkage, or
manipulation of data files. Translation units may be separately
translated and then later linked to produce an executable program.
LTO deviates from the the model that translation units are separate,No, it does not. These paragraphs are requirements, not limitations.
and the conceptual steps of phases 7 and 8.
The translation unit that is prepared for LTO is not fully cooked. YouYou have as much and as little idea of what the generated code will be as you always do during compilation. Compilers can do all kinds of manipulations of the source code you write - as long as the observable behaviour of the program is the same as a dumb translation. They can, and do, use all kinds of inter-procedural optimisations for inlining code, outlining it, breaking functions into pieces, cloning them, using constant propagation, and so on. LTO lets them do this across translation units.
have no idea what its code will turn into when the interrupted
compilation is resumed during linkage, under the influence of other
tranlation units it is combined with.
So in fact, the language allows us to take it for granted that, givenNo, the C language standards make no such guarantee.
my_memset(array, 0, sizeof(array)); }
at the end of a function, and my_memset is an external definition
provided by another translation unit, the call may not be elided.
The one who may be acting recklessly is he who turns on nonconformingThat is /completely/ different. That option is clearly documented as potentially violating some of the rules of the ISO C standards. This is why it is not enabled by default or by any common optimisation levels (except "-Ofast", which is also documented as potentially violating standards).
optimizations that are not documented as supported by the code base.
Another example would be something like gcc's -ffast-math.
You wouldn't unleash that on numerical code written by experts,I would not expect identical results to floating point calculations, no.
and expect the same correct results.
Les messages affichés proviennent d'usenet.