Sujet : Re: Preliminary version of new regex matcher for gawk now available
De : ben (at) *nospam* bsb.me.uk (Ben Bacarisse)
Groupes : comp.lang.awkDate : 26. Jul 2024, 10:57:43
Autres entêtes
Organisation : A noiseless patient Spider
Message-ID : <87h6cc5vc8.fsf@bsb.me.uk>
References : 1 2
User-Agent : Gnus/5.13 (Gnus v5.13)
Janis Papanagnou <janis_papanagnou+
ng@hotmail.com> writes:
Well, I skimmed through the txt file on Mike's git page to learn
about the algorithm; especially the algorithm and its complexity
is of interest to me. The document was not quite clear about that
(or at least made me doubt) beyond the general and typical O(N*M)
characteristics. One thing I was astonished about was why there's
a non-deterministic automaton model used (NFSM can be transformed
into Deterministic FSM); isn't the non-deterministic tree-search
(where every branch is traversed breadth-first) sub-optimal?
The non-deterministic to deterministic transformation yields (at worst)
an exponential number of states. Keeping track of a set of states is
usually the preferred method.
-- Ben.