Python to Open html files in Excel

Question

I have a bunch of purchase orders in .html format that I need to extract data from and put into one simple Excel sheet. While I could use BeautifulSoup to do it, I would rather use Excel's built-in converter, which already does a much better job, and then work with the Excel files directly. Is there a way to use Python to open HTML documents and save them again as .xlsx? I tried using openpyxl, but it does not accept HTML files.

user20416 · Accepted Answer · 2018-06-05 23:46:44Z

You could use Python to automate an instance of the Excel application, opening each file and saving it as .xlsx:

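import win32com.client  # from the pywin32 package; needs Windows with Excel installed
excelApp = win32com.client.Dispatch('Excel.Application')
# Files are opened through the Workbooks collection; an absolute path is safest
book = excelApp.Workbooks.Open(path_to_html_file)
book.SaveAs(path_to_html_file + '.xlsx', 51)  # 51 = xlOpenXMLWorkbook (.xlsx)
book.Close()
excelApp.Quit()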
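
If you have a whole folder of purchase orders, the same idea can be wrapped in a loop. This is only a rough sketch, not part of the original answer: `xlsx_path_for`, `convert_folder`, and `XL_OPEN_XML_WORKBOOK` are made-up names, and it assumes pywin32 is installed on a Windows machine that has Excel.

```python
from pathlib import Path

# Excel's FileFormat code for .xlsx (xlOpenXMLWorkbook), as used in the answer above
XL_OPEN_XML_WORKBOOK = 51


def xlsx_path_for(html_path):
    """Return the absolute .xlsx path that sits next to the given HTML file."""
    return Path(html_path).resolve().with_suffix(".xlsx")


def convert_folder(folder):
    """Open every .html file in `folder` with Excel and save it as .xlsx."""
    # Imported here because pywin32 is only available on Windows
    import win32com.client

    excel_app = win32com.client.Dispatch("Excel.Application")
    excel_app.DisplayAlerts = False  # suppress Excel's dialogs, e.g. overwrite prompts
    try:
        for html_file in sorted(Path(folder).glob("*.html")):
            # Absolute paths, since Excel does not share Python's working directory
            book = excel_app.Workbooks.Open(str(html_file.resolve()))
            book.SaveAs(str(xlsx_path_for(html_file)), XL_OPEN_XML_WORKBOOK)
            book.Close(False)
    finally:
        # Always shut Excel down, even if one of the files fails to convert
        excel_app.Quit()
```

You would call it with the folder that holds the purchase orders, for example `convert_folder(r"C:\orders")` (a placeholder path). Swapping the suffix instead of appending `.xlsx` avoids names like `po_001.html.xlsx`.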
