Getting the text after an HTML tag with PHP and XPath

Question

I have the following HTML from a cURL scrape of a webpage:

<div id="box">
<br>
Your word(s):
<br>
<br>
functionally
<br>
<br>
<br>

I want the text that comes after the third <br> (/html/body/div[2]/div/br[3]), which is the word functionally:

@$itemCell = $xpath->query( "/html/body/div[2]/div/br[3]" );
$word = $itemCell->item( 0 );
return $word->nodeValue;

This does not return anything. If I back up to just /div, I of course get the entire contents of box. How do I extract the word after the third <br>? My word is always going to be after the third <br>.

Seems so simple, yet it escapes me.

Phil · Accepted Answer · 2012-08-29 00:41:29Z

4

Try something like this query:

$textNodes = $xpath->query('//div[@id="box"]/br[3]/following-sibling::text()[1]');

Working demo here - http://codepad.viper-7.com/00oeZh

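The key here is the following-sibling axis. A <br> is an empty element, so its own nodeValue is empty; the word is a separate text node that comes after it as a sibling.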
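For readers who cannot open the demo link, here is a minimal, self-contained sketch of how the query might be used end to end. The helper name textAfterBr, the closing </div>, and the surrounding <html><body> wrapper are assumptions added for illustration; they are not part of the original question or answer.

```php
<?php
// Hypothetical helper (not from the original answer): returns the trimmed value
// of the first text node that follows the $n-th <br> inside <div id="box">,
// or null if there is no such node.
function textAfterBr(string $html, int $n): ?string
{
    $doc = new DOMDocument();
    $previous = libxml_use_internal_errors(true); // silence warnings from sloppy HTML
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($doc);
    $query = sprintf('//div[@id="box"]/br[%d]/following-sibling::text()[1]', $n);
    $nodes = $xpath->query($query);

    if ($nodes === false || $nodes->length === 0) {
        return null;
    }
    return trim($nodes->item(0)->nodeValue);
}

// Markup from the question; the closing </div> and the wrapper are assumed.
$html = <<<'MARKUP'
<html><body>
<div id="box">
<br>
Your word(s):
<br>
<br>
functionally
<br>
<br>
<br>
</div>
</body></html>
MARKUP;

echo textAfterBr($html, 3), PHP_EOL; // functionally
```

Selecting by id rather than by the absolute path /html/body/div[2]/div keeps the query working if the page layout around the box changes. The trim() call removes the line breaks that surround the word in the source markup.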

edited Aug 29, 2012 at 0:41

answered Aug 29, 2012 at 0:34


1 Comment

KiloJKilo Over a year ago

Thanks for this. It does work, and you actually just allowed me to clear a big hurdle with xpath and following-sibling.

SteveLin · Answer · 2013-06-01 14:16:35Z

-1

<dl>
    <dt>info</dt>
    <dd>
        <a>a1</a>b2
        <a>a2</a>
    </dd>
</dl>

To get the b2 text after the first <a> tag, use an XPath like the following:

//dl/dd/a[1]/following-sibling::text()
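As a rough sketch of how this could look in PHP with the <dl> markup above: the [1] predicate at the end of the query is an addition not present in this answer, included so that only the first text node after the first <a> is returned rather than every following text sibling.

```php
<?php
// Markup from this answer, loaded as a fragment; libxml wraps it in <html><body>.
$html = <<<'MARKUP'
<dl>
    <dt>info</dt>
    <dd>
        <a>a1</a>b2
        <a>a2</a>
    </dd>
</dl>
MARKUP;

$doc = new DOMDocument();
libxml_use_internal_errors(true); // silence warnings from sloppy HTML
$doc->loadHTML($html);
libxml_clear_errors();

$xpath = new DOMXPath($doc);
// [1] (an assumption added here) keeps only the first following text node.
$nodes = $xpath->query('//dl/dd/a[1]/following-sibling::text()[1]');

// trim() strips the newline and indentation that trail "b2" in the source.
$text = $nodes->length > 0 ? trim($nodes->item(0)->nodeValue) : null;
echo $text, PHP_EOL; // b2
```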

answered Jun 1, 2013 at 14:16


1 Comment

SteveLin Over a year ago

KiloJKilo, why don't you select my answer? The key is following-sibling::text(). I just didn't give you the detailed answer for your problem.
