On Mon, 03 Jun 2024 14:58:26 -0400 (EDT)
Grant Edwards <grant.b.edwards@gmail.com> wrote:
>On 2024-06-03, Edward Teach via Python-list <python-list@python.org>
>wrote:
>> The Gutenberg Project publishes "plain text". That's another
>> problem, because "plain text" means UTF-8....and that means
>> Unicode...and that means running some sort of Unicode-to-ASCII
>> conversion in order to get something like "words". A couple of
>> hours....a couple of hundred lines of C....problem solved!
>
>I'm curious. Why does it need to be converted from Unicode to ASCII?
>When you read it into Python, it gets converted right back to
>Unicode...
Well.....when using the file linux.words as a useful master list of
"words".....linux.words is strict ASCII........
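
For comparison, the same kind of Unicode-to-ASCII folding can be sketched in a few lines of Python. This is only a rough illustration, not the C program mentioned above: the function names, the small punctuation table, and the word-splitting regex are all assumptions made for the example, and the word list path is left as a parameter (on some distributions the file lives at /usr/share/dict/linux.words, but that is not guaranteed).

```python
import re
import unicodedata

# Hypothetical, deliberately partial table: typographic punctuation
# mapped to its nearest ASCII equivalent.
PUNCTUATION = str.maketrans({
    "\u2018": "'", "\u2019": "'",   # curly single quotes
    "\u201c": '"', "\u201d": '"',   # curly double quotes
    "\u2013": "-", "\u2014": "-",   # en dash, em dash
})

def to_ascii(text):
    """Fold Unicode text to ASCII, dropping whatever has no ASCII form."""
    text = text.translate(PUNCTUATION)
    # NFKD splits an accented letter into base letter + combining mark;
    # encoding with errors="ignore" then discards the combining mark.
    decomposed = unicodedata.normalize("NFKD", text)
    return decomposed.encode("ascii", "ignore").decode("ascii")

def extract_words(text):
    """Return runs of ASCII letters and apostrophes found in the folded text."""
    return re.findall(r"[A-Za-z']+", to_ascii(text))

def load_wordlist(path):
    """Read a one-word-per-line ASCII list (e.g. linux.words) into a set."""
    with open(path, encoding="ascii") as handle:
        return {line.strip() for line in handle if line.strip()}

def unknown_words(text, wordlist):
    """Return the extracted words that are absent from the word list."""
    return [w for w in extract_words(text) if w not in wordlist]
```

With a word list loaded via load_wordlist(), unknown_words(book_text, wordlist) would give the words from a Gutenberg text that the master list does not contain. Note that this folding is lossy: characters with no decomposition to an ASCII base letter (for example "ø" or "ß") are silently dropped rather than transliterated.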