I am trying to scrape dynamically filled webpages like this, on R.
I am trying to do that with RSelenium, but I am open to alternatives. For example, I would happy to do everything with rvest only.
The issue with RSelenium is that it does not start at all (even trying with Chrome). Just after loading the package, this is the output:
> rD <- rsDriver(browser = "firefox", port = 4545L, geckover = "latest")
checking Selenium Server versions:
BEGIN: PREDOWNLOAD
BEGIN: DOWNLOAD
BEGIN: POSTDOWNLOAD
checking chromedriver versions:
BEGIN: PREDOWNLOAD
BEGIN: DOWNLOAD
BEGIN: POSTDOWNLOAD
checking geckodriver versions:
BEGIN: PREDOWNLOAD
BEGIN: DOWNLOAD
BEGIN: POSTDOWNLOAD
checking phantomjs versions:
BEGIN: PREDOWNLOAD
BEGIN: DOWNLOAD
BEGIN: POSTDOWNLOAD
[1] "Connecting to remote server"
Could not open firefox browser.
Client error message:
Undefined error in httr call. httr output: Failed to connect to localhost port 4545 after 2259 ms: Couldn't connect to server
Check server log for further details.
Warning message:
In rsDriver(browser = "firefox", port = 4545L, geckover = "latest") :
Could not determine server status.
I have seen a similar issue in an question from another forum, but the only solution in that case simply seemed to be specifying the port.
With Chrome there appears to be the problem that Chrome is now at version 130, while ChromeDriver only gets to support up to the version 113, if I understand correctly.
rvestyou might want to check previous Q&As targetingrvest::read_html_live()- stackoverflow.com/…https://live.euronext.com/en/pd_es/data/stocks/download?mics=dm_all_stock&initialLetter=&fe_type=csv&fe_decimal_separator=.&fe_date_format=d%2Fm%2FYfor CSV export.read_html_live& login: stackoverflow.com/q/78948903/646761 .read_html_live()is basically an interface forchromote, so you might find this - github.com/rstudio/chromote/… - useful.