rss feed xml: cannot access images after converting rss feed to xml object

Question

In the http://feeds.feedburner.com/rb286, there are many images. However, when i convert it into and xml object with simplXmlElement, i'm not able to see the images.My code:

if (function_exists("curl_init")){
$ch=curl_init();
curl_setopt($ch,CURLOPT_URL,"http://feeds.feedburner.com/rb286");
curl_setopt($ch,CURLOPT_RETURNTRANSFER,1);
$data=curl_exec($ch);
curl_close($ch);
//print_r($data);   //here i'm able to see the images
     $doc=new SimpleXmlElement($data);
     print_r($doc);   //here i'm not able to see the images
  }

Can someone tell me on how can I access the images after converting to xml object? thank you.

complex857 · Accepted Answer · 2014-02-26 15:32:15Z

2

You will have to iterate trough the <content:encoded> tags of the individual <items> in the <channel> main tag. I would use the xpath method to select the tags. Once you get the element you want you can grep the <img> out of them with string manipulation tools like preg_match_all:

Edit: added more refined image tag matching, that excludes ads from feedburner and other cdns.

$xml = simplexml_load_string(file_get_contents("http://feeds.feedburner.com/rb286"));

foreach ($xml->xpath('//item/content:encoded') as $desc) {
    preg_match_all('!(?<imgs><img.+?src=[\'"].*?http://feeds.feedburner.com.+?[\'"].+?>)!m', $desc, $>

    foreach ($m['imgs'] as $img) {
        print $img;
    }
}

The <content:encoded> tag is namespaced, so if you want to use simplexml's built in property mapping, you have to deal with it like this:

// obtain simplexml object of the feed as before
foreach ($xml->channel->item as $item) {
    $namespaces = $item->getNameSpaces(true);
    $content = $item->children($namespaces['content']);
    print $content->encoded; // use it howevery you want
}

You can read more from the xpath query language here.

edited Feb 26, 2014 at 15:32

answered Jul 23, 2012 at 6:57

complex857

20.9k6 gold badges54 silver badges56 bronze badges

Sign up to request clarification or add additional context in comments.

10 Comments

vaanipala Over a year ago

why ru using file_get_contents? can I just use $xml=simplexml_load_string($data)? YOur http addr is not the same as mine? is it a typo? mine is feeds.feedburner.com/rb286.

complex857 Over a year ago

Just to make the snippet shorter, doesn't matter how you obtain the feed, you could also use simplexml_load_file('http://.../'), too.

complex857 Over a year ago

Oh, the '?l' was a typo, however doesn't matter of in the result. The original feedburner url gives you different result for your browser then when you call it with curl/php. To get to the xml feed with the browser follow this url: feeds.feedburner.com/rb286?format=xml .

vaanipala Over a year ago

is it better not to use curl? now i tried using simpleXml_load_file("http//feeds.feedburner.com/rb286?format=xml"). I still don't see any <content:encoded> tag. Why i'm not able to see that tag? However, the images are displaying. Please help....i'm new to xml, xpath, rss feed. Thank you.

complex857 Over a year ago

Well, not by default because of namespacing, i've updated the example.

|

Collectives™ on Stack Overflow

rss feed xml: cannot access images after converting rss feed to xml object

1 Answer 1

10 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

10 Comments

Your Answer

Sign up or log in

Post as a guest

Related