Sujet : Re: Automating an atypical search & replace
De : the_stan_brown (at) *nospam* fastmail.fm (Stan Brown)
Groupes : comp.editorsDate : 14. Jul 2024, 07:13:55
Autres entêtes
Organisation : Oak Road Systems
Message-ID : <MPG.40fd0c80e0a741bc99030e@news.individual.net>
References : 1 2
User-Agent : MicroPlanet-Gravity/3.0.11 (GRC)
AOn Sat, 13 Jul 2024 23:39:14 -0000 (UTC), Lawrence D'Oliveiro wrote:
On Sat, 13 Jul 2024 11:08:48 -0500, Richard Owlett wrote:
These occurrences are consistently of the form
<span class='add'>arbitrary_text</span>
I wish to delete "<span class='add'>" and *ASSOCIATED* "</span>".
This is beyond the abilities of regular expressions. This is the point
where you need to use an actual HTML/XML-parsing library.
In general I'd agree with you. But the OP made a big deal -- in a
different thread, for some reason -- about wanting to use minimal
HTML, so I doubt very much there will be nested <span> ... </span>
sequences.
Also, the OP quite rightly wanted to confirm each change before it is
made, so presumably if there are any nested sequences he will say no
to that particular edit and fix it manually.
-- Stan Brown, Tehachapi, California, USA https://BrownMath.com/Shikata ga nai...