Loop over URL links and save them as PDF files in Python

Question

Given a dataframe df as follows:

             projectCode                                                url
0  FCZZZZCQ2021020200921  https://www.cspea.com.cn/list/c01/FCZZZZCQ2021020200921
1        GR2021BJ1000351  https://www.cspea.com.cn/list/c01/GR2021BJ1000351
2        GR2021QD1000030  https://www.cspea.com.cn/list/c01/GR2021QD1000030
3        GR2021BJ1000186  https://www.cspea.com.cn/list/c01/GR2021BJ1000186
4    FCZZCQ2020123011487  https://www.cspea.com.cn/list/c01/FCZZCQ2020123011487

I want to use the pdfkit package to save each URL as a PDF file, using projectCode as the file name:

import pdfkit
import pandas as pd

data = []
urls = df.url.tolist()
for url_link in urls:
    # every page is written to the same file, so each one overwrites the last
    pdfkit.from_url(url_link, 'out.pdf')

How could I do that? Thanks.

Are you reading the data from a PDF file or a CSV file? Just want to confirm.
– M_x, Commented Apr 9, 2021 at 4:02
Also, please add the code you used to create the dataframe; that will help solve the issue faster.
– M_x, Commented Apr 9, 2021 at 4:03

ah bon · Accepted Answer · 2021-04-09 04:44:29Z

Zip the two columns so that each URL is paired with its projectCode, then use the code as the file name:

for a, url in zip(df['projectCode'], df['url']):
    pdfkit.from_url(url, f'{a}.pdf')
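
For a longer list of URLs it may help to keep going when one page fails and to collect the PDFs in a single folder. The following is only a sketch of that idea, not part of the original answer: the helper name save_urls_as_pdfs and its out_dir and convert parameters are hypothetical, and it assumes pdfkit (with the wkhtmltopdf binary) is installed when the default converter is used.

```python
from pathlib import Path


def save_urls_as_pdfs(pairs, out_dir=".", convert=None):
    """Save each (project_code, url) pair as <out_dir>/<project_code>.pdf.

    `convert` is any callable taking (url, output_path). It defaults to
    pdfkit.from_url, which needs the wkhtmltopdf binary to be installed.
    Returns a list of (project_code, exception) for the pages that failed.
    """
    if convert is None:
        import pdfkit  # imported lazily, only when the default is needed
        convert = pdfkit.from_url

    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)

    failures = []
    for code, url in pairs:
        target = out_path / f"{code}.pdf"
        try:
            convert(url, str(target))
        except Exception as exc:
            # Record the failure and continue with the remaining URLs.
            failures.append((code, exc))
    return failures
```

With the dataframe from the question, it could be called as save_urls_as_pdfs(zip(df['projectCode'], df['url']), out_dir='pdfs'); the returned list shows which project codes would need a retry.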

answered Apr 9, 2021 at 4:10 by anky
edited Apr 9, 2021 at 4:44 by ah bon
