Get data from table using regex php

Question

I want to extract some data from a table using php preg_match_all(). I have the html as under, I want to get the values in td, say Product code: RC063154016. How can I do that? I don'y have any experience with regex,

  <table width="100%" border="0" cellspacing="0" cellpadding="0">
      <tbody>
        <tr>
          <td><span>Product code:</span> RC063154016</td>                   
          <td><span>Gender:</span> Female</td>
        </tr>
      </tbody>
    </table>

DomDocument might be better. Take a look at this.

machineaddict
– machineaddict

2014-01-31 12:02:47 +00:00
Commented Jan 31, 2014 at 12:02 — machineaddict
– machineaddict, Commented Jan 31, 2014 at 12:02
HTML and regex tags are not good friends.

Toto
– Toto

2014-01-31 12:03:22 +00:00
Commented Jan 31, 2014 at 12:03 — Toto
– Toto, Commented Jan 31, 2014 at 12:03

gwillie · Accepted Answer · 2014-01-31 12:02:18Z

3

Use DomDocument

$str = <<<STR
<table width="100%" border="0" cellspacing="0" cellpadding="0">
      <tbody>
        <tr>
          <td><span>Product code:</span> RC063154016</td>                   
          <td><span>Gender:</span> Female</td>
        </tr>
      </tbody>
    </table>
STR;

$dom = new DOMDocument();
@$dom->loadHTML($str);
$tds = $dom->getElementsByTagName('td');
foreach($tds as $td){
  echo $td->nodeValue . '<br>';
}

OUTPUT

Product code: RC063154016
Gender: Female

answered Jan 31, 2014 at 12:02

gwillie

1,8991 gold badge12 silver badges14 bronze badges

Sign up to request clarification or add additional context in comments.

5 Comments

MJQ Over a year ago

Yes that's nice. But there are a lot of td elements in a webpage, and i want the specific ones under that table with <table width="100%" border="0" cellspacing="0" cellpadding="0">! So what about that?

gwillie Over a year ago

Well how do you identify what you want...id/class attributes/content of certain elements, your choice. An excellent time to read into DomDocument and DOMXpath. With those 2 tools you can manipulate HTML with absolute guarantee. Regex is not the best for structured languages. I use regex to parse simple html, but lets see your full table html, then we can determine the best path to use

MJQ Over a year ago

I can just identify by table attributes like, width="100%" border="0" cellspacing="0"! SO, anything?

gwillie Over a year ago

Post your html table markup that you're trying to parse. Regex maybe better, maybe DomDoc is better, lets see the code your working with, a little nuance here and there adds up to mountains, if you understand what I mean :)

MJQ Over a year ago

Figured it out myself by using Query. Thanks for your answer! :)

Sabuj Hassan · Accepted Answer · 2014-01-31 12:04:18Z

0

This should do for you:

preg_match_all('|<td><span>Product code:</span>([^<]*)</td>|', $html, $match);

But if you think there can be random white spaces around tags, then this one:

preg_match_all('|<td>\s*<span>\s*Product code:\s*</span>([^<]*)</td>|', $html, $match);

answered Jan 31, 2014 at 12:04

Sabuj Hassan

39.7k14 gold badges83 silver badges88 bronze badges

Comments

Hett · Accepted Answer · 2014-01-31 12:04:21Z

0

$data = <<<HTML
  <table width="100%" border="0" cellspacing="0" cellpadding="0">
      <tbody>
        <tr>
          <td><span>Product code:</span> RC063154016</td>
          <td><span>Gender:</span> Female</td>
        </tr>
      </tbody>
    </table>
HTML;


if(preg_match_all('#<td>\s*<span>Product code:</span>\s*([^<]*)</td>#i', $data, $matches)) {
    print_r($matches);
}

answered Jan 31, 2014 at 12:04

Hett

3,8652 gold badges40 silver badges57 bronze badges

Comments

Community · Accepted Answer · 2017-05-23 12:11:44Z

0

Use any one parser and parse the HTML and use it. Don't use preg* functions here. Please read this answer How do you parse and process HTML/XML in PHP?

edited May 23, 2017 at 12:11

CommunityBot

11 silver badge

answered Jan 31, 2014 at 12:05

Mohan

16310 bronze badges

Collectives™ on Stack Overflow

Get data from table using regex php

4 Answers 4

OUTPUT

5 Comments

Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

OUTPUT

5 Comments

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related