How can I extract HTML table data using Perl?

Question

I need to retrieve some data from a web page. After analysing the HTML code of the page, I found the data I need is embeded in a table with a unique table id. I don't know whether it is an HTML rule or not, anyway it's very good for parsing I think.

The data in the table is arranged as below (various attributes and tags have been omitted in order to give you a clear "data structure")

<table .... id = "tablename" .... >
    <tr>
         <td .... >filed1</td>
             ....
         <td .... >filedn</td>
    </tr>
         #several "trs" here
    <tr>
         <td .... >filed1</td>
             ....
         <td .... >filedn</td>
    </tr>
</table>

So my question is how to use Perl's HTML parser utility to meet my needs in this case.

Thanks in advance.

Leon Timmermans · Accepted Answer · 2009-12-21 07:33:19Z

12

HTML::TableExtract sounds exactly like what you are looking for.

answered Dec 21, 2009 at 7:33

Leon Timmermans

30.3k2 gold badges65 silver badges111 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

brian d foy · Accepted Answer · 2009-12-23 02:09:52Z

2

Use HTML::Table.

edited Dec 23, 2009 at 2:09

brian d foy

134k31 gold badges214 silver badges613 bronze badges

answered Dec 21, 2009 at 11:30

Pradeep

3,14119 silver badges21 bronze badges

Comments

brian d foy · Accepted Answer · 2009-12-23 02:15:03Z

-1

Look at Ken MacFarlane's Parsing HTML with HTML::Parser in The Perl Journal. I'm not sure if that's the parser you're referring to, but it looks like it can do what you want, or at least point you in the right direction.

edited Dec 23, 2009 at 2:15

brian d foy

134k31 gold badges214 silver badges613 bronze badges

answered Dec 21, 2009 at 5:55

Chris Thompson

35.6k12 gold badges86 silver badges110 bronze badges

1 Comment

brian d foy Over a year ago

You shouldn't have to reach down into HTML::Parser for this. There are many tools built on top of it that should be able to handle the job.

Collectives™ on Stack Overflow

How can I extract HTML table data using Perl?

3 Answers 3

Comments

Comments

1 Comment

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

Comments

Comments

1 Comment

Your Answer

Sign up or log in

Post as a guest

Linked

Related