Sujet : Re: The new matcher (Was: Experiences with match() subexpressions?)
De : janis_papanagnou+ng (at) *nospam* hotmail.com (Janis Papanagnou)
Groupes : comp.lang.awkDate : 11. Apr 2025, 14:50:11
Autres entêtes
Organisation : A noiseless patient Spider
Message-ID : <vtb6mj$1o9he$1@dont-email.me>
References : 1 2 3 4 5
User-Agent : Mozilla/5.0 (X11; Linux x86_64; rv:45.0) Gecko/20100101 Thunderbird/45.8.0
On 11.04.2025 10:57, Kenny McCormack wrote:
In article <67f8b7af$0$705$14726298@news.sunsite.dk>,
Aharon Robbins <arnold@freefriends.org> wrote:
...
Mike Haertel is writing a new regexp matcher for gawk; it was announced
here some time agao: https://github.com/mikehaertel/minrx. The code is
in the feature/minrx branch of the gawk Git repository.
Just out of curiosity, does the new matcher address the issue raised by
Janis?
I read his post as if he put it under discussion ("I just opened an
issue, [...] about this question. We shall see what develops.") and
the provided link shows this as well.[*]
(I don't see the answers, though, since my browser obviously doesn't
support the web-page's (dynamic?) format. - So I cannot tell what the
state of that discussion is.)
It sounds like you are implying that it does, but do not say so explicitly.
[...]
Janis
[*] From
https://github.com/mikehaertel/minrx/issues/43:
So there are two questions.
Is it theoretically possible to capture all the instances of
subexpressions matched by the interval expression?
Can this be brought out into the code? I understand it would take an
extended API with a richer data structure in order to do this. gawk's
extended version of the match() function could then be (somehow)
extended to take advantage of this feature.