4

Is it possible to get the html source code of a webpage with curl and the run a javascript interpreter over it, so i get the generated content?

The Page i need to get uses some encoded and genereated content in there so i want to first run the javascript to get the escaped and generated content... or do i need to regex the javascript and "compile" the javascript on my own? Like

curl <myurl> | perl -ne 'm/unescape\((.*)\)/; print "$1"' | <now to something with that>

i known there is no javascript engine in curl but can i just call another script / programm to do the job?

1

1 Answer 1

10

You can do it, but it's more involved than I think you realize. Neither curl nor wget have Javascript engines, so you'll need something that has one.

I would start by looking at PhantomJS and/or Chromium Headless.

Sign up to request clarification or add additional context in comments.

2 Comments

nice, i didnt knew that there where such things :) I will also update my screenshot script which currently goes over some edges then using phantomjs!
PhantomJS has been suspended as of March 2018. I'd love to know of a more up-to-date option. Depending on the use, there might be security issues with using an unmaintained package.

Your Answer

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.