|Fragments firstname.lastname@example.org (Andy) (2019-12-21)|
|Re: Fragments email@example.com (Kaz Kylheku) (2019-12-21)|
|Re: Fragments firstname.lastname@example.org (Ben Hanson) (2019-12-22)|
|Date:||Sat, 21 Dec 2019 01:52:26 -0800 (PST)|
|Injection-Info:||gal.iecc.com; posting-host="news.iecc.com:2001:470:1f07:1126:0:676f:7373:6970"; logging-data="81191"; mail-complaints-to="email@example.com"|
|Posted-Date:||21 Dec 2019 13:10:24 EST|
In examples is usually used very small alphabet: 3 to 5 letters but in
lexical analysing is not only Ascii but many thousands of Unicode.
Many chars are grouped by the same action: for example digits->a
We can use "fragments" [A-Za-z], [0-9] instead of alone letters.
Problem that fragments not always are disjoint: digits and all chars, letters and letter 'a', etc.
How to handle with not disjoint fragments? on input we get regular
expression in Posix standard and we want make DFA with a few
Return to the
Search the comp.compilers archives again.