Subject : Re: Rationale for aligning data on even bytes in a Unix shell file?
From : david.brown (at) *nospam* hesbynett.no (David Brown)
Newsgroups : comp.lang.c
Date : 29. Apr 2025, 09:58:46
Organisation : A noiseless patient Spider
Message-ID : <vuq4c6$1ca4v$2@dont-email.me>
User-Agent : Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.11.0
On 29/04/2025 01:13, Janis Papanagnou wrote:
> On 28.04.2025 20:38, Bonita Montero wrote:
>> On 28.04.2025 at 20:05, Janis Papanagnou wrote:
>>> (I thought Windows would use "UCS2". Anyway; would 16 bit suffice to
>>> support full Unicode; I thought it wouldn't, or only old restricted
>>> versions of Unicode.)
>>
>> Windows is UTF-16 since Windows 2000, UCS2 before.
No, Windows has had /some/ UTF-16 support since W2K, with gradual improvements over time to APIs, filesystems, and applications. Later on, it started getting /some/ UTF-8 support, which is a much better choice for most uses.
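To make that concrete, here is a rough sketch (untested, and assuming a
Windows toolchain) of the boundary you constantly deal with there: the
-W APIs take UTF-16, so UTF-8 text has to be converted first, e.g. with
MultiByteToWideChar and the CP_UTF8 code page:

#include <windows.h>
#include <stdio.h>

int main(void)
{
    const char *utf8 = "caf\xC3\xA9";   /* "café" as UTF-8 bytes */

    /* Ask for the required buffer size first (cchWideChar = 0). */
    int len = MultiByteToWideChar(CP_UTF8, 0, utf8, -1, NULL, 0);
    if (len <= 0)
        return 1;

    wchar_t wide[16];   /* len is 5 here (4 chars + terminator) */
    MultiByteToWideChar(CP_UTF8, 0, utf8, -1, wide, len);

    wprintf(L"%ls\n", wide);   /* hand UTF-16 to the wide-char API */
    return 0;
}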
> Oh, so it's a multi-word encoding (like UTF-8 is a multi-byte encoding)
> and a character not necessarily encoded with only one 16 bit word... -
> ...but then I wonder even more where you see an advantage.
When Unicode started, they thought 16 bits would be enough. UCS2 made sense then, because it was a fixed-size encoding - though it had the huge disadvantages of being endian-dependent and totally incompatible with every existing character set. Early Unicode adopters, including Windows NT and NTFS, Java, Qt and Python, used UCS2.
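The endian problem is easy to demonstrate (a minimal sketch; the byte
values in the comment assume the host byte orders shown):

#include <stdio.h>
#include <stdint.h>

int main(void)
{
    uint16_t ucs2[] = { 0x0041, 0x20AC };   /* 'A', Euro sign U+20AC */
    const unsigned char *bytes = (const unsigned char *) ucs2;

    for (size_t i = 0; i < sizeof ucs2; i++)
        printf("%02X ", bytes[i]);
    printf("\n");
    /* Little-endian host: 41 00 AC 20
       Big-endian host:    00 41 20 AC
       Hence the BOM (U+FEFF) for 16-bit text - a UTF-8 byte stream
       has no such ambiguity. */
    return 0;
}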
Once Unicode was extended beyond 16 bits, three new encodings emerged - UCS4 (32-bit fixed size), UTF-8 and UTF-16. UCS4 has the advantage of being fixed size (that turns out to be a minor issue in practice, but was long thought to be important), but like UCS2 it is endian-dependent, and it is inefficient in size (though easily compressed, so that also matters less than many people think). UCS4 covers every Unicode code point in one code unit, but combining characters mean it still cannot represent every character in one code unit.
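For example (sketch only; the code points are the standard precomposed
and combining spellings of "é"):

#include <stdio.h>
#include <stdint.h>

int main(void)
{
    /* The same visible character, "é", in two valid UCS4 spellings: */
    uint32_t precomposed[] = { 0x00E9, 0 };          /* U+00E9 */
    uint32_t combining[]   = { 0x0065, 0x0301, 0 };  /* 'e' + U+0301 */

    size_t n1 = 0, n2 = 0;
    while (precomposed[n1]) n1++;
    while (combining[n2])   n2++;

    /* One "character" to the reader, but 1 vs. 2 code units: */
    printf("precomposed: %zu code unit(s), combining: %zu\n", n1, n2);
    return 0;
}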
UTF-16 is variable length, and can encode any Unicode code point. It has the advantage that UCS2 is a subset of it, making it a natural extension for existing UCS2 systems - but it keeps the same disadvantages: inefficiency for common ASCII text, incompatibility with ASCII, endian-dependence, and the need for dedicated functions for almost everything.
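Roughly, the encoding works like this (utf16_encode is a hypothetical
helper for illustration, not any particular library's API, and it
skips validation - real code must also reject U+D800..U+DFFF):

#include <stdio.h>
#include <stdint.h>

/* Encode one code point as UTF-16; returns the number of code units. */
static int utf16_encode(uint32_t cp, uint16_t out[2])
{
    if (cp < 0x10000) {
        out[0] = (uint16_t) cp;   /* BMP: identical to the UCS2 subset */
        return 1;
    }
    cp -= 0x10000;                /* 20 bits left, split 10/10 */
    out[0] = (uint16_t)(0xD800 | (cp >> 10));    /* high surrogate */
    out[1] = (uint16_t)(0xDC00 | (cp & 0x3FF));  /* low surrogate */
    return 2;
}

int main(void)
{
    uint16_t u[2];
    int n = utf16_encode(0x1F600, u);   /* U+1F600, outside the BMP */
    for (int i = 0; i < n; i++)
        printf("0x%04X ", u[i]);
    printf("\n");                       /* prints 0xD83D 0xDE00 */
    return 0;
}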
The biggest problem with UTF-16, IMHO, is that it delayed the adoption of UTF-8 in early Unicode software. Changing something like Qt or Windows from UCS2 to UTF-8 is not easy, but it would have been much better in the long run if that had been done directly, without switching to UTF-16 first.