Newsportal USENET - Re: int a = a

David Brown <david.brown@hesbynett.no> writes:

On 20/03/2025 20:46, Keith Thompson wrote:
David Brown <david.brown@hesbynett.no> writes:
On 20/03/2025 11:20, Keith Thompson wrote:
Tim Rentsch <tr.17687@z991.linuxsc.com> writes:
Keith Thompson <Keith.S.Thompson+u@gmail.com> writes:
The "could have been declared with the register storage class"
seems quite odd. And in fact it is quite odd.
>
I don't have the same reaction. The point of this phrase is that
undefined behavior occurs only for variables that don't have
their address taken. The phrase used describes that nicely.
Any questions related to "registerness" can be ignored, because
'register' in C really has nothing to do with hardware registers,
despite the name.
DR 338 is explicitly motivated by an IA-64 feature that applies only to
CPU registers. An object whose address is taken can't be stored (only)
in a register, so it can't have a NaT representation.
The phrase used is "could have been declared with register storage class
(never had its address taken)". Surely "never had its address taken"
would have been clear enough if CPU registers weren't a big part of the
motivation.
>
I too think the phrasing is a bit odd.
>
Just because a variable's address is taken, does not mean it cannot be
put in a cpu register by the compiler. If the variable is not
accessed in a way that actually requires putting it in memory, then
the compiler can put it in a cpu register (or otherwise optimise it).
So simply taking the address of a variable on IA-64 does not mean it
cannot be in a register, and thus does not necessarily mean it cannot
be NaT. Taking the address of a variable means the variable cannot be
declared "register", but it does not mean it cannot be /in/ a
register.
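
To make the distinction concrete, here is a minimal sketch; the function names are made up for illustration and are not from the thread. The first object could carry the "register" specifier because its address is never taken. The second could not, since applying unary & to an object declared "register" violates a constraint. Neither declaration says anything about where an optimising compiler actually keeps the value.

```c
#include <assert.h>

/* r's address is never taken, so "register" is permitted here.
   This is the kind of object the standard's wording
   "could have been declared with the register storage class" describes. */
static int without_address(void)
{
    register int r = 1;
    return r + 1;
}

/* m has its address taken, so it could not be declared "register":
   "register int m; ... &m" would be a constraint violation.
   A compiler is still free to keep m in a CPU register if it can
   see through the pointer. */
static int with_address(void)
{
    int m = 1;
    int *p = &m;
    *p += 1;
    return m;
}
```

Both functions return 2; the only difference is which declarations the language allows.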
Sure, any variable that's stored in memory can be mirrored by holding
its value in a register.
    int n = 42; // Assume n is assigned a memory address
    printf("n+1=%d n+2=%d\n", n+1, n+2);
A compiler could plausibly store the value of n in a register before
computing n+1, and then reuse the register value to compute n+2.
>
Yes, of course. But there is also no necessity for variables to be in
memory at all, or that there is any consistency there. "Assume n is
assigned a memory address" is a completely unwarranted assumption for
almost all local variables.

I think you misunderstood what I meant by "assume". Certainly n could
be assigned a memory address. You can read it as "*IF* n is assigned a
memory address, then ...". I was asserting that it has a memory address
for purposes of the discussion, not presuming that it must actually have
one.

It is only if the address is taken, and
used in some way that is beyond the optimiser, that the variable
actually has to go in a fixed place in memory. Otherwise optimisers
can and do keep data in registers, or move them in and out of
registers and different stack slots according to convenience for
efficient code.
>
>
uint32_t float_to_uint(float f) {
    uint32_t u;
    memcpy(&u, &f, 4);
    return u;
}
>
gcc compiles that to :
>
float_to_uint:
        movd eax, xmm0
        ret
>
So even though the addresses of the variable "u" and the parameter "f"
are taken, and converted to char pointers, and passed to a function
with external linkage, nothing is actually put in memory at all.
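
For anyone who wants to try this, a self-contained variant of the function is sketched below. The headers, the size check, and the use of sizeof instead of the literal 4 are additions for illustration, not part of the quoted code. The expected bit pattern in the usage note assumes float is IEEE 754 binary32, which the C standard does not require.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Added assumption: float and uint32_t have the same size,
   so the memcpy below copies the whole object representation. */
static_assert(sizeof(float) == sizeof(uint32_t),
              "float must be 32 bits wide");

/* Copy the object representation of f into a uint32_t.
   Both &u and &f are taken, yet an optimising compiler is free
   to do the whole thing in registers. */
static uint32_t float_to_uint(float f)
{
    uint32_t u;
    memcpy(&u, &f, sizeof u);
    return u;
}
```

Where float is IEEE 754 binary32, float_to_uint(1.0f) yields 0x3f800000. Whether a given compiler emits a single register move depends on the target and the optimisation level.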
>
Thus the standard's wording as though the legality of using the
"register" storage-class specifier corresponds to cpu register usage
is, at best, wildly out of date.
>
(And there are some architectures where the cpu registers are directly
mapped to memory, and can be accessed as memory locations or
registers.)
>
My understanding is that IA-64 NaT (Not a Thing) representations
exist only for registers, and the NaT bit should be cleared when
a value is stored in the register.
The odd wording in the standard allows an IA-64 C compiler to
take advantage of NaT representations for their intended purpose.
It might impose some minor constraints on what machine code can be
generated, but *most* of the cases where a NaT could be accessed
are undefined behavior in C.
>
I see that, but I believe it would be much simpler and clearer if
attempting to read an uninitialised and unassigned local variable were
undefined behaviour in every case.

I probably agree (I haven't given it all that much thought), but the
committee made a specific decision between C90 and C99 to say that
reading an uninitialized automatic object is *not* undefined behavior.
I don't know why they did that (though, all else being equal, reducing
the number of instances of undefined behavior is a good thing), but
reversing that decision for this one issue is not something they decided
to do.

Alternatively, it could have said that the value is unspecified in
every case. Then on the IA-64, the compiler would have to ensure that
registers do not have their NaT bit set even if they are not
initialised - this would not be a difficult task. Enabling use of the
NaT bit for detection of bugs could then be a compiler option if
implementations wanted to provide that feature.

The whole point of the NaT bit is to detect accesses to uninitialized
values. Requiring the compiler to arbitrarily clear that bit
doesn't strike me as a good idea.

I dislike the way that wording was added to the standard specifically
to cater to one specific CPU (which happens to have been discontinued
later). I would have been happier with a more general solution.
I think that making accessing the value of an uninitialized automatic
object UB would have been much cleaner, and it would have allowed for
sensible use of NaT by IA-64 compilers. But without knowing *why*
the committee removed that UB between C90 and C99, I'm hesitant to
say it was a mistake.

Meanwhile, I will in effect assume that accessing uninitialized objects
is UB, i.e., I'll carefully avoid doing so.
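
A minimal sketch of that habit, using a hypothetical helper name: every automatic object gets a value before it is read, so the question of whether the read would be undefined behaviour or merely yield an indeterminate value never comes up.

```c
#include <assert.h>
#include <stddef.h>

/* Sum the first n elements of values.
   total is initialized at its declaration; writing
   "int total = total;" or leaving it uninitialized would read an
   object that has never been given a value. */
static int sum_first(const int *values, size_t n)
{
    int total = 0;
    for (size_t i = 0; i < n; i++)
        total += values[i];
    return total;
}
```

With n equal to 0 the loop body never runs and the function returns the initial 0, which is exactly the case an uninitialized total would get wrong.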

[...]

--
Keith Thompson (The_Other_Keith) Keith.S.Thompson+u@gmail.com
void Void(void) { Void(); } /* The recursive call of the void */
