12

I recall I have read about a parser which you just have to feed some sample lines, for it to know how to parse some text.

It just determines the difference between two lines to know what the variable parts are. I thought it was written in python, but i'm not sure. Does anyone know what library that was?

4
  • While extremely vague, this question is, nevertheless, quite interesting. I am also curious as to whether there are such "self-learning" parsers (especially if they are written in python). Commented May 28, 2009 at 16:24
  • I know it's vague, but I haven't got a clue what to tell more about it. Commented May 28, 2009 at 16:30
  • @shylent There doesn't seem to be publicly available code for this problem, but some research has been done: See "An Efficient Learning of Context-Free Grammars" by Sakakibara, PDF at tinyurl.com/nrpmor. Commented May 28, 2009 at 22:56
  • Thanks for asking this question. I have learned a lot from the responses. Commented May 29, 2009 at 1:59

2 Answers 2

10

Probably you mean TemplateMaker, I haven't tried it yet, but it builds on well-researched longest-common-substring algorithms and thus should work reasonably... If you are interested in different (more complex) approaches, you can easily find a lot of material on Google Scholar using the query "wrapper induction" or "template induction".

Sign up to request clarification or add additional context in comments.

1 Comment

Yeah, this is what I had seen. Not really a parser, but it commes close ;)
2

Conceivably you might mean Reverend?

1 Comment

No, it's not really what is was looking for.

Your Answer

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.