Re: Compiler Books? Parsers?

Oliver Zeigermann <oliver@zeigermann.de>
2 Jan 2004 03:39:54 -0500

          From comp.compilers

Related articles
[8 earlier articles]
Re: Compiler Books? Parsers? rbates@southwind.net (Rodney M. Bates) (2003-12-08)
Re: Compiler Books? Parsers? nick.roberts@acm.org (Nick Roberts) (2003-12-08)
Re: Compiler Books? Parsers? marcov@stack.nl (Marco van de Voort) (2003-12-20)
Re: Compiler Books? Parsers? cfc@world.std.com (Chris F Clark) (2003-12-21)
Re: Compiler Books? Parsers? cdc@maxnet.co.nz (Carl Cerecke) (2003-12-23)
Re: Compiler Books? Parsers? cfc@shell01.TheWorld.com (Chris F Clark) (2003-12-27)
Re: Compiler Books? Parsers? oliver@zeigermann.de (Oliver Zeigermann) (2004-01-02)
Re: Compiler Books? Parsers? cyber_nerdz@hotmail.com (Mark Sayers) (2004-01-07)
Re: Compiler Books? Parsers? Jeffrey.Kenton@comcast.net (Jeff Kenton) (2004-01-09)
Re: Compiler Books? Parsers? oliver@zeigermann.de (Oliver Zeigermann) (2004-01-22)
| List of all articles for this month |

From: Oliver Zeigermann <oliver@zeigermann.de>
Newsgroups: comp.compilers
Date: 2 Jan 2004 03:39:54 -0500
Organization: T-Online
References: 03-10-113 03-10-145 03-11-010 03-11-083 03-12-017 03-12-116 03-12-125 03-12-132 03-12-141
Keywords: parse, practice
Posted-Date: 02 Jan 2004 03:39:54 EST

Chris F Clark wrote:
> In the lexical case, the parser assumes that the tokens the lexer has
> handed to it are correct. However, when one delimiter is missed, the
> tokens the lexer hands to the parser are not the correct ones--they
> are gibberish (lexically correct gibberish). And if one looked at the
> lexical problem in terms of finding a correctly parsable program with
> the *minimum editing distance*, one would immediately see that the
> correct program would have a different set of tokens. However, I've
> never seen a lexer/parser that attempted to find the program with the
> minimum editing distance from the original source.


This sounds much like robust *natural* language parsing where you have
malformed sentences all the time still trying to understand the meaning
of them. I have seen approachs using propagation of constraints
describing the structure of valid sentences. This incoporates weak
constraints that only claim certain properties *should* be satiesfied. I
can't remember how parsing actually was done and what variables were
constraint over what domain, but I could find it out if anyone is
interested.


Oliver


Post a followup to this message

Return to the comp.compilers page.
Search the comp.compilers archives again.