Subject : Re: unicode text
From : mssr953 (at) *nospam* gmail.com (Michael Soyka)
Newsgroups : comp.lang.tcl
Date : 09. Nov 2024, 04:15:20
Organization : self
Message-ID : <vgmk48$3fpdg$1@dont-email.me>
References : 1
User-Agent : Mozilla Thunderbird
On 11/08/2024 9:28 PM, saito wrote:
> Is there a way to remove emojis, non-printable and other graphic characters from a string? I can use a regexp with a-zA-Z and such but this doesn't account for valid characters from non-ascii/non-Western languages, right?
I've found that this regular expression works for emojis:
[^[:print:][:cntrl:]]
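
To actually remove the matches, something along these lines should do it. This is only a minimal sketch: the proc name stripNonPrintable and the sample string are made up for illustration, and how emojis outside the Basic Multilingual Plane are classified may depend on your Tcl version and build, so test it against your own data.

```tcl
# Hypothetical helper (name is illustrative only): delete every character
# that the bracket expression above matches, i.e. anything that is neither
# in [:print:] nor in [:cntrl:].
proc stripNonPrintable {text} {
    # -all replaces every match, not just the first one; with no variable
    # name given, regsub returns the resulting string.
    return [regsub -all {[^[:print:][:cntrl:]]} $text {}]
}

# Sample input (an assumption): accented Latin text, Japanese text, one emoji.
set sample "caf\u00e9 \u65e5\u672c\u8a9e \U0001F600 done"
puts [stripNonPrintable $sample]
```

Keeping [:cntrl:] inside the negated set means tabs and newlines are left alone; drop it from the expression if you want control characters removed as well.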