Sujet : Re: The difference between strtol() and strtoul() ?
De : lew.pitcher (at) *nospam* digitalfreehold.ca (Lew Pitcher)
Groupes : comp.lang.cDate : 20. Jun 2024, 15:48:53
Autres entêtes
Organisation : A noiseless patient Spider
Message-ID : <v51fgl$2j1fo$1@dont-email.me>
References : 1
User-Agent : Pan/0.139 (Sexual Chocolate; GIT bf56508 git://git.gnome.org/pan2)
On Thu, 20 Jun 2024 14:06:45 +0000, Kenny McCormack wrote:
Interestingly, I note that strtoul() accepts strings that begin with a sign
(+ or -). This is odd, since you'd (*) think that a sign (particularly, a
minus) would be a syntax error in parsing for an unsigned value.
IIUC, the ISO C standard does not make a distinction between strings that
make sense for an unsigned long vs strings that make sense for a signed long.
The standard says (with regards to the strtol, strtoll, strtoul, and strtoull
functions):
"... the expected form of the subject sequence is a sequence of letters
and digits representing an integer with the radix specified by base,
optionally preceded by a plus or minus sign ... . If the value of base
is 16, the characters 0x or 0X may optionally precede the sequence of
letters and digits, following the sign if present."
so, it appears that the ISO C standard permits the input string to specify
a sign, even if the resulting conversion does not.
Further, although the (Linux) man page is more than a bit murky on the
subject, it seems that the result of parsing, say, "-1", with strtoul() is
the largest unsigned value (usually, 2**N-1 or a lot of F's (in hex)).
Whereas, I would expect it to be 1 (i.e., just take the absolute value).
Why would you expect that? Again, the ISO standard says:
"If the subject sequence has the expected form ... it is used as the base
for conversion, ascribing to each letter its value ... . If the subject
sequence begins with a minus sign, the value resulting from the conversion
is negated (in the return type)."
and
"If the correct value is outside the range of representable values, LONG_MIN,
LONG_MAX, LLONG_MIN, LLONG_MAX, ULONG_MAX, or ULLONG_MAX is returned
(according to the return type and sign of the value, if any) ... ."
Comments? I find this all very counterintuitive.
I can't comment on /your/ internalization of the standards and expected
behaviour. But, the standard makes sense (in an eccentric sort of way)
to me, in that the defining distinction of the various strto*l() functions
is not the format of the input, but the format of the output of the function.
(*) Or should I say, "one would" ?
P.S. Why isn't there a strtoi() or strtou() ? I know, of course, that
there is atoi(), but that doesn't have the error checking capability that
the strto* functions have.
-- Lew Pitcher"In Skills We Trust"