Looking for an Embedded JavaScript Parser

Question

I have a web crawler built in C# (I know) and it has grown fairly sophisticated in handling many of the features normally handled by an actually web browser. That said, I have nothing that will parse the incoming HTML and process the embedded JavaScript commands on the page.

I have tried numerous approaches - from Noesis to Awesomium - but nothing appears to be working. I also made the mistake of using the WinForms embedded web browser control and the memory leaks under load (I am running Parallel Tasks) literally corrupted the CLR. That said, it was able to process the page as a normal browser and the resultant content was great - not viable, but the end result content was on point.

Is there nothing out there that will either take a target URL or, ideally, take in HTML content downloaded via an HttpWebRequest and process the embedded JavaScript commands?

htmlagilitypack.codeplex.com

themhz
– themhz

2011-12-23 17:30:30 +00:00
Commented Dec 23, 2011 at 17:30 — themhz
– themhz, Commented Dec 23, 2011 at 17:30

Shiplu Mokaddim · Accepted Answer · 2011-12-23 17:31:20Z

1

Here is a list of JavaScript Engines. Also check ECMAScript engines.

edited Dec 23, 2011 at 17:31

answered Dec 23, 2011 at 17:06

Shiplu Mokaddim

57.5k20 gold badges147 silver badges193 bronze badges

Sign up to request clarification or add additional context in comments.

Collectives™ on Stack Overflow

Looking for an Embedded JavaScript Parser

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related