Re: Rationale for aligning data on even bytes in a Unix shell file?

Liste des GroupesRevenir à cl c 
Sujet : Re: Rationale for aligning data on even bytes in a Unix shell file?
De : cr88192 (at) *nospam* gmail.com (BGB)
Groupes : comp.lang.c
Date : 06. May 2025, 19:01:41
Autres entêtes
Organisation : A noiseless patient Spider
Message-ID : <vvdj1o$3gijr$1@dont-email.me>
References : 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
User-Agent : Mozilla Thunderbird
On 5/1/2025 4:13 AM, David Brown wrote:
On 01/05/2025 01:56, Lawrence D'Oliveiro wrote:
On Wed, 30 Apr 2025 12:38:53 -0000 (UTC), Muttley wrote:
>
Its certainly not a scheme I'd use, but I've also seen Makefile and
makefile in the same package build directory in the past.
>
The GNU “make” command, specified without a filename, looks for
“GNUmakefile”, then “Makefile”, then “makefile”. The man page
<https://manpages.debian.org/make(1)> says:
>
     We recommend Makefile because it appears prominently near the
     beginning of a directory listing, right near other important files
     such as README.
>
But is this still true for most people? I think the default sort
settings these days no longer put all-caps names at the top.
 I can't speak for "most people", but since my project directories rarely have more than about a dozen files and directories (like "src" and "build") in the top directory, it could be called zzzz and still be near the top!
 
Wandering in a bit late, but I can note for my project (or, the makeshift OS part):
   Nominal filename format: UTF-8.
IIRC, my experimental (Unix style) filesystem could use one of several encodings:
   ASCII
   UTF8
   CP1252 (Latin-1 with extended control-codes replaced)
Merit of 1252 here is that it can potentially take fewer bytes, and statistically is most likely to cover any non-ASCII characters encountered (most are Latin-1), if everything fits into the character range, and using UTF-8 if it doesn't fit. It is possible to rely on disambiguation, and not use 1252 if it could be potentially confused for UTF-8. Most of the time, 1252 (if any non-ASCII chars are used) results in sequences that are invalid as UTF-8, thus no ambiguity would result (if not valid UTF-8, assume 1252). ASCII case can be ignored as it is equivalent between both encoding schemes.
The partial rationale here being that the directory entries in this case were fixed size (like FAT, albeit with longer names), and this could potentially make the difference between using a single directory entry or needing a more complex LFN style scheme. Though, in this case, the default name length is 48, and it is rare for a filename to not fit into 48 bytes.
No other codepages were supported here (so, anything not Latin-1 or similar will need to use UTF-8 regardless).
Another semi-filesystem is in use with similar rules, except with 32 byte filenames.
FAT32, as noted, is:
8.3, CP1252, with bits to encode upper or lower-case base and extension;
LFN's, with up to 256 characters of UCS-2.
...
At higher levels, API's generally assume normalization to UTF-8.
Though, with a few non-standard tweaks: 0080..009F are assumed to be the chars from 1252, and not the extended control chars;
In console settings, the Arabic alphabet was replaced with 2-digit hex numbers (00..FF), as:
   I felt a need for 2 dense hex numbers in the console;
   I ideally needed a spot low in the mapping;
   The Arabic characters don't map to 8x8 pixel character cells (1).
*1: Might reconsider if someone can make a case that this alphabet could in-fact be represented in a recognizable form in 8x8 pixel character cells.
This mostly doesn't apply outside the console. For application use, the standard character assignments would be assumed.
As for collating:
   Nominal order is raw unsigned bytes (based on the UTF-8 encoding);
   This will put uppercase before lowercase.
I debated some if/what style of normalization to use for UTF strings.
   Full Unicode normalization was too complicated;
   Fully non-normalized encoding could also pose issues.
If this context, if it takes much over a few hundred lines of code and around 1K of tables, it was too expensive.
Normalization rules ended up being a compromise:
   Only the Latin and Extended Latin combining characters are handled.
     Or, roughly, Latin-1 and Latin-2.
   Pretty much everything else is passed through as-is.
The combined characters are first broken up, and then any combining characters are combined. Filenames exist in combined form as this uses fewer bytes.
Say, filesystem layer does not normalize emojis, it has no reason to know what an emoji is.
There was some debate over representing non-BMP characters as UTF-8 coded surrogate pairs or as larger UTF-8 codepoints, off hand, I don't remember for certain. I think I may have chosen the latter due to fewer bytes, whereas I would usually have preferred UTF-8 coded surrogate pairs in other contexts. I do vaguely remember dealing with this issue in my normalization code though.
Though, in this case, the UTF-8 normalization was dealt with in the VFS level rather than in the FS drivers.
There was also a partial concern (that I remembered) that if a file in the filesystem were normalized in a way that differs from the VFS's normalization, it could potentially make the file effectively inaccessible. IIRC, there was no good solution to this possibility.
Most likely partial answer though is to assume that any filename normalization rules are to preferably be kept frozen.
...

Date Sujet#  Auteur
26 Apr 25 * Rationale for aligning data on even bytes in a Unix shell file?147Janis Papanagnou
26 Apr 25 +* Re: Rationale for aligning data on even bytes in a Unix shell file?2Keith Thompson
27 Apr 25 i`- Re: Rationale for aligning data on even bytes in a Unix shell file?1Janis Papanagnou
27 Apr 25 +* Re: Rationale for aligning data on even bytes in a Unix shell file?2Kaz Kylheku
27 Apr 25 i`- Re: Rationale for aligning data on even bytes in a Unix shell file?1Janis Papanagnou
27 Apr 25 +* Re: Rationale for aligning data on even bytes in a Unix shell file?140Bonita Montero
27 Apr 25 i+* Re: Rationale for aligning data on even bytes in a Unix shell file?127Janis Papanagnou
28 Apr 25 ii`* Re: Rationale for aligning data on even bytes in a Unix shell file?126Bonita Montero
28 Apr 25 ii +* Re: Rationale for aligning data on even bytes in a Unix shell file?124vallor
28 Apr 25 ii i+* Re: Rationale for aligning data on even bytes in a Unix shell file?122Bonita Montero
28 Apr 25 ii ii`* Re: Rationale for aligning data on even bytes in a Unix shell file?121vallor
28 Apr 25 ii ii `* Re: Rationale for aligning data on even bytes in a Unix shell file?120Bonita Montero
28 Apr 25 ii ii  `* Re: Rationale for aligning data on even bytes in a Unix shell file?119Janis Papanagnou
28 Apr 25 ii ii   `* Re: Rationale for aligning data on even bytes in a Unix shell file?118Bonita Montero
28 Apr 25 ii ii    +* Re: Rationale for aligning data on even bytes in a Unix shell file?54Janis Papanagnou
28 Apr 25 ii ii    i`* Re: Rationale for aligning data on even bytes in a Unix shell file?53Bonita Montero
28 Apr 25 ii ii    i +* Re: Rationale for aligning data on even bytes in a Unix shell file?44Bonita Montero
28 Apr 25 ii ii    i i`* Re: Rationale for aligning data on even bytes in a Unix shell file?43Bonita Montero
28 Apr 25 ii ii    i i `* Re: Rationale for aligning data on even bytes in a Unix shell file?42Richard Harnden
28 Apr 25 ii ii    i i  `* Re: Rationale for aligning data on even bytes in a Unix shell file?41Bonita Montero
29 Apr 25 ii ii    i i   +* Re: Rationale for aligning data on even bytes in a Unix shell file?36Richard Heathfield
29 Apr 25 ii ii    i i   i+* Re: Rationale for aligning data on even bytes in a Unix shell file?33Bonita Montero
29 Apr 25 ii ii    i i   ii+* Re: Rationale for aligning data on even bytes in a Unix shell file?31Richard Heathfield
6 May 25 ii ii    i i   iii`* Re: Rationale for aligning data on even bytes in a Unix shell file?30Bonita Montero
7 May 25 ii ii    i i   iii `* Re: Rationale for aligning data on even bytes in a Unix shell file?29BGB
7 May 25 ii ii    i i   iii  +* Re: Rationale for aligning data on even bytes in a Unix shell file?18Janis Papanagnou
7 May 25 ii ii    i i   iii  i+* Re: Rationale for aligning data on even bytes in a Unix shell file?13Michael S
8 May 25 ii ii    i i   iii  ii+* Re: Rationale for aligning data on even bytes in a Unix shell file?11BGB
8 May 25 ii ii    i i   iii  iii`* Re: Rationale for aligning data on even bytes in a Unix shell file?10Janis Papanagnou
8 May 25 ii ii    i i   iii  iii `* Re: Rationale for aligning data on even bytes in a Unix shell file?9BGB
8 May 25 ii ii    i i   iii  iii  `* Re: Rationale for aligning data on even bytes in a Unix shell file?8Keith Thompson
9 May 25 ii ii    i i   iii  iii   `* Re: Rationale for aligning data on even bytes in a Unix shell file?7BGB
9 May 25 ii ii    i i   iii  iii    +* Re: Rationale for aligning data on even bytes in a Unix shell file?2Keith Thompson
9 May 25 ii ii    i i   iii  iii    i`- Re: Rationale for aligning data on even bytes in a Unix shell file?1Janis Papanagnou
9 May 25 ii ii    i i   iii  iii    `* Re: Rationale for aligning data on even bytes in a Unix shell file?4Lawrence D'Oliveiro
9 May 25 ii ii    i i   iii  iii     `* Re: Rationale for aligning data on even bytes in a Unix shell file?3BGB
15 May 25 ii ii    i i   iii  iii      `* Re: Rationale for aligning data on even bytes in a Unix shell file?2Lawrence D'Oliveiro
15 May 25 ii ii    i i   iii  iii       `- Re: Rationale for aligning data on even bytes in a Unix shell file?1BGB
9 May 25 ii ii    i i   iii  ii`- Re: Rationale for aligning data on even bytes in a Unix shell file?1Lawrence D'Oliveiro
7 May 25 ii ii    i i   iii  i`* Re: Rationale for aligning data on even bytes in a Unix shell file?4BGB
7 May 25 ii ii    i i   iii  i +* Re: Rationale for aligning data on even bytes in a Unix shell file?2David Brown
8 May 25 ii ii    i i   iii  i i`- Re: Rationale for aligning data on even bytes in a Unix shell file?1Janis Papanagnou
8 May 25 ii ii    i i   iii  i `- Re: Rationale for aligning data on even bytes in a Unix shell file?1Lawrence D'Oliveiro
8 May 25 ii ii    i i   iii  +* Re: Rationale for aligning data on even bytes in a Unix shell file?5Lawrence D'Oliveiro
8 May 25 ii ii    i i   iii  i`* Re: Rationale for aligning data on even bytes in a Unix shell file?4BGB
9 May 25 ii ii    i i   iii  i `* Re: Rationale for aligning data on even bytes in a Unix shell file?3Lawrence D'Oliveiro
9 May 25 ii ii    i i   iii  i  +- Re: Rationale for aligning data on even bytes in a Unix shell file?1BGB
9 May 25 ii ii    i i   iii  i  `- Re: Rationale for aligning data on even bytes in a Unix shell file?1Janis Papanagnou
9 May 25 ii ii    i i   iii  `* Re: Rationale for aligning data on even bytes in a Unix shell file?5Bonita Montero
9 May 25 ii ii    i i   iii   +* Re: Rationale for aligning data on even bytes in a Unix shell file?3BGB
9 May 25 ii ii    i i   iii   i`* Re: Rationale for aligning data on even bytes in a Unix shell file?2Keith Thompson
9 May 25 ii ii    i i   iii   i `- Re: Rationale for aligning data on even bytes in a Unix shell file?1BGB
14 May 25 ii ii    i i   iii   `- Re: Rationale for aligning data on even bytes in a Unix shell file?1Lawrence D'Oliveiro
29 Apr 25 ii ii    i i   ii`- Locales [was: Re: Rationale for aligning data on even bytes in a Unix shell file?]1Alexis
29 Apr 25 ii ii    i i   i`* Re: Rationale for aligning data on even bytes in a Unix shell file?2David Brown
29 Apr 25 ii ii    i i   i `- Re: Rationale for aligning data on even bytes in a Unix shell file?1Richard Heathfield
29 Apr 25 ii ii    i i   +- Re: Rationale for aligning data on even bytes in a Unix shell file?1James Kuyper
29 Apr 25 ii ii    i i   +* Re: Rationale for aligning data on even bytes in a Unix shell file?2Bonita Montero
5 May 25 ii ii    i i   i`- Re: Rationale for aligning data on even bytes in a Unix shell file?1Tim Rentsch
29 Apr 25 ii ii    i i   `- Re: Rationale for aligning data on even bytes in a Unix shell file?1Michael S
28 Apr 25 ii ii    i `* Re: Rationale for aligning data on even bytes in a Unix shell file?8Janis Papanagnou
28 Apr 25 ii ii    i  +- Re: Rationale for aligning data on even bytes in a Unix shell file?1Kaz Kylheku
28 Apr 25 ii ii    i  `* Re: Rationale for aligning data on even bytes in a Unix shell file?6Bonita Montero
29 Apr 25 ii ii    i   `* Re: Rationale for aligning data on even bytes in a Unix shell file?5Janis Papanagnou
29 Apr 25 ii ii    i    `* Re: Rationale for aligning data on even bytes in a Unix shell file?4David Brown
29 Apr 25 ii ii    i     `* Re: Rationale for aligning data on even bytes in a Unix shell file?3Muttley
29 Apr 25 ii ii    i      +- Re: Rationale for aligning data on even bytes in a Unix shell file?1David Brown
30 Apr 25 ii ii    i      `- Re: Rationale for aligning data on even bytes in a Unix shell file?1Lawrence D'Oliveiro
28 Apr 25 ii ii    +* Re: Rationale for aligning data on even bytes in a Unix shell file?54Muttley
28 Apr 25 ii ii    i`* Re: Rationale for aligning data on even bytes in a Unix shell file?53Bonita Montero
28 Apr 25 ii ii    i +* Re: Rationale for aligning data on even bytes in a Unix shell file?41Bonita Montero
28 Apr 25 ii ii    i i`* Re: Rationale for aligning data on even bytes in a Unix shell file?40Bonita Montero
28 Apr 25 ii ii    i i +* Re: Rationale for aligning data on even bytes in a Unix shell file?37Michael S
28 Apr 25 ii ii    i i i+* Re: Rationale for aligning data on even bytes in a Unix shell file?11Kaz Kylheku
28 Apr 25 ii ii    i i ii`* Re: Rationale for aligning data on even bytes in a Unix shell file?10Michael S
28 Apr 25 ii ii    i i ii +- Re: Rationale for aligning data on even bytes in a Unix shell file?1Kaz Kylheku
29 Apr 25 ii ii    i i ii `* Re: Rationale for aligning data on even bytes in a Unix shell file?8Lawrence D'Oliveiro
29 Apr 25 ii ii    i i ii  `* Re: Rationale for aligning data on even bytes in a Unix shell file?7Janis Papanagnou
29 Apr 25 ii ii    i i ii   `* Re: Rationale for aligning data on even bytes in a Unix shell file?6Michael S
30 Apr 25 ii ii    i i ii    `* Re: Rationale for aligning data on even bytes in a Unix shell file?5Lawrence D'Oliveiro
30 Apr 25 ii ii    i i ii     `* Re: Rationale for aligning data on even bytes in a Unix shell file?4Janis Papanagnou
30 Apr 25 ii ii    i i ii      `* Re: Rationale for aligning data on even bytes in a Unix shell file?3Janis Papanagnou
1 May 25 ii ii    i i ii       `* Re: Rationale for aligning data on even bytes in a Unix shell file?2Lew Pitcher
2 May 25 ii ii    i i ii        `- Re: Rationale for aligning data on even bytes in a Unix shell file?1Janis Papanagnou
28 Apr 25 ii ii    i i i+- Re: Rationale for aligning data on even bytes in a Unix shell file?1Michael S
29 Apr 25 ii ii    i i i`* Re: Rationale for aligning data on even bytes in a Unix shell file?24Muttley
30 Apr 25 ii ii    i i i `* Re: Rationale for aligning data on even bytes in a Unix shell file?23Lawrence D'Oliveiro
30 Apr 25 ii ii    i i i  `* Re: Rationale for aligning data on even bytes in a Unix shell file?22Muttley
30 Apr 25 ii ii    i i i   +* Re: Rationale for aligning data on even bytes in a Unix shell file?20David Brown
30 Apr 25 ii ii    i i i   i+* Re: Rationale for aligning data on even bytes in a Unix shell file?18Muttley
30 Apr 25 ii ii    i i i   ii+* Re: Rationale for aligning data on even bytes in a Unix shell file?4Janis Papanagnou
30 Apr 25 ii ii    i i i   iii+* Re: Rationale for aligning data on even bytes in a Unix shell file?2David Brown
30 Apr 25 ii ii    i i i   iiii`- Re: Rationale for aligning data on even bytes in a Unix shell file?1Janis Papanagnou
30 Apr 25 ii ii    i i i   iii`- Re: Rationale for aligning data on even bytes in a Unix shell file?1Muttley
30 Apr 25 ii ii    i i i   ii`* Re: Rationale for aligning data on even bytes in a Unix shell file?13David Brown
30 Apr 25 ii ii    i i i   ii +- Re: Rationale for aligning data on even bytes in a Unix shell file?1Janis Papanagnou
30 Apr 25 ii ii    i i i   ii `* Re: Rationale for aligning data on even bytes in a Unix shell file?11Muttley
1 May 25 ii ii    i i i   ii  `* Re: Rationale for aligning data on even bytes in a Unix shell file?10Lawrence D'Oliveiro
1 May 25 ii ii    i i i   ii   +- Re: Rationale for aligning data on even bytes in a Unix shell file?1vallor
1 May 25 ii ii    i i i   ii   +* Re: Rationale for aligning data on even bytes in a Unix shell file?7David Brown
6 May 25 ii ii    i i i   ii   i`* Re: Rationale for aligning data on even bytes in a Unix shell file?6BGB
2 May 25 ii ii    i i i   ii   `- Re: Rationale for aligning data on even bytes in a Unix shell file?1Janis Papanagnou
30 Apr 25 ii ii    i i i   i`- Re: Rationale for aligning data on even bytes in a Unix shell file?1Janis Papanagnou
30 Apr 25 ii ii    i i i   `- Re: Rationale for aligning data on even bytes in a Unix shell file?1Lawrence D'Oliveiro
28 Apr 25 ii ii    i i `* Re: Rationale for aligning data on even bytes in a Unix shell file?2Bonita Montero
28 Apr 25 ii ii    i `* Re: Rationale for aligning data on even bytes in a Unix shell file?11Muttley
29 Apr 25 ii ii    `* Re: Rationale for aligning data on even bytes in a Unix shell file?9Lawrence D'Oliveiro
28 Apr 25 ii i`- Re: Rationale for aligning data on even bytes in a Unix shell file?1vallor
28 Apr 25 ii `- Re: Rationale for aligning data on even bytes in a Unix shell file?1Janis Papanagnou
27 Apr 25 i+* Re: Rationale for aligning data on even bytes in a Unix shell file?2Kaz Kylheku
28 Apr 25 i+- Re: Rationale for aligning data on even bytes in a Unix shell file?1Kenny McCormack
28 Apr 25 i+- Re: Rationale for aligning data on even bytes in a Unix shell file?1Bonita Montero
28 Apr 25 i`* Re: Rationale for aligning data on even bytes in a Unix shell file?8Lawrence D'Oliveiro
9 May 25 `* Re: Rationale for aligning data on even bytes in a Unix shell file?2Keith Thompson

Haut de la page

Les messages affichés proviennent d'usenet.

NewsPortal