Get the integer following part of a string?

Question

I have a bunch of strings that may or may not have a substring similar to the following:

<a class="tag" href="http://www.yahoo.com/5"> blah blah ...</a>

I'm trying to retrieve the '5' at the end of the link (it isn't necessarily a one-digit number; it can be huge). The rest of the string will vary: the text before and after the link will always be different. The only parts that stay the same are <a class="tag" href="http://www.yahoo.com/ and the closing </a>.

Prasanth · Accepted Answer · 2012-08-07 22:21:28Z

1

You can do it using preg_match_all and the regular expression <a class="tag" href="http:\/\/(.*)\/(\d+)">. The number is then in the second capture group.
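
A minimal sketch of how such a pattern might be applied. The sample $subject string and the variable names are made up for illustration. The pattern uses ~ as the delimiter so the slashes need no escaping, and it swaps the greedy (.*) for [^/"]+ (an assumption on my part, not part of the answer) so that a single match cannot stretch across two links on the same line.

```php
<?php
// Hypothetical input: two tagged links surrounded by arbitrary text.
$subject = 'foo <a class="tag" href="http://www.yahoo.com/5"> blah</a> bar '
         . '<a class="tag" href="http://www.yahoo.com/12345"> more</a> baz';

// Group 1 is the host, group 2 the trailing number.
// [^/"]+ is used instead of (.*) so one match cannot span two links.
$pattern = '~<a class="tag" href="http://([^/"]+)/(\d+)">~';

// preg_match_all() returns the number of full matches found.
$count = preg_match_all($pattern, $subject, $matches);

// With the default PREG_PATTERN_ORDER, $matches[2] lists every captured number.
$numbers = $matches[2];
print_r($numbers);
```

The numbers come back as strings, which is probably what you want if they can be larger than PHP's integer range.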


nook · Accepted Answer · 2012-08-07 22:19:09Z

1

Give parse_url() a try. Should be easy from there.
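
A rough sketch of where parse_url() could take you, assuming the href value has already been pulled out of the markup by some other means. The variable names are illustrative, and the ctype_digit() check is an extra safeguard that is not part of the suggestion above.

```php
<?php
// Assumes the href value has already been extracted from the markup.
$href = 'http://www.yahoo.com/5';

// Ask parse_url() for single components instead of the full array.
$host = parse_url($href, PHP_URL_HOST);
$path = parse_url($href, PHP_URL_PATH);

// Take the last path segment and accept it only if it is all digits.
// It is kept as a string because the number may be too large for an int.
$last = basename((string) $path);
$id = ctype_digit($last) ? $last : null;

var_dump($host, $id);
```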


hakre · Accepted Answer · 2012-08-07 22:36:24Z

0

As you only need to retrieve the 5, it's pretty straightforward:

$r = preg_match_all('~\/(\d+)"~', $subject, $matches);

It's then in the first matching group.
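
To make the shape of $matches concrete, here is a small sketch with a made-up $subject that also contains an unrelated link. It suggests that this short pattern picks up any href ending in a slash, digits and a closing quote, so it is probably only suitable when the tagged link is the only one of that form in the string.

```php
<?php
// Hypothetical input containing the tagged link plus an unrelated link.
$subject = 'x <a class="tag" href="http://www.yahoo.com/5"> blah blah ...</a> y '
         . '<a href="http://example.org/77">other</a>';

$r = preg_match_all('~/(\d+)"~', $subject, $matches);

// $r is the number of matches; $matches[0] holds the full matches
// and $matches[1] the contents of the first capture group.
print_r($matches[1]);
```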

If you need more information, like the link text, I would suggest using an HTML parser for that:

require('Net/URL2.php');

$doc = new DOMDocument();
$doc->loadHTML('<a class="tag" href="http://www.yahoo.com/5"> blah blah ...</a>');
foreach ($doc->getElementsByTagName('a') as $link)
{
    $url = new Net_URL2($link->getAttribute('href'));
    if ($url->getHost() === 'www.yahoo.com') {
        $path = $url->getPath();
        printf("%s (from %s)\n", basename($path), $url);
    }
}

Example Output:

5 (from http://www.yahoo.com/5)

edited Aug 7, 2012 at 22:36

answered Aug 7, 2012 at 22:25


5 Comments

Jonah Katz Over a year ago

But I need to get the link out of the string

hakre Over a year ago

In your question you wrote that you need to get the 5, so I took you at your word. For the link I suggest an HTML parser: Robust, Mature HTML Parser for PHP

Jonah Katz Over a year ago

"But, this string will vary. The text before the link, and after, will always be different"

Jonah Katz Over a year ago

But I appreciate your answer, I'll try and take it from there

hakre Over a year ago

@JonahKatz: I added a working example with PHP's HTML parser and PEAR's Net_URL2 component.

insertusernamehere · Accepted Answer · 2012-08-07 22:37:03Z

0

I would go with "basename":

// prints passwd
print basename("/etc/passwd");

And to get the link you could use:

$xml  = simplexml_load_string( '<a class="tag" href="http://www.yahoo.com/5"> blah blah ...</a>' );
$attr = $xml->attributes();
print $attr['href'];

And finally: If you don't know the whole structure of the string, use this:

$dom = new DOMDocument;
$dom->loadHTML( '<a class="tag" href="http://www.yahoo.com/5"> blah blah ...</a>asasasa<a class="tag" href="http://www.yahoo.com/6"> blah blah ...</a>' );
$nodes = $dom->getElementsByTagName('a');
foreach ($nodes as $node) {
    print $node->getAttribute('href');
    print basename( $node->getAttribute('href') );
}

This also works when the HTML is invalid, since DOMDocument::loadHTML() tolerates malformed markup.

edited Aug 7, 2012 at 22:37

answered Aug 7, 2012 at 22:24


2 Comments

Jonah Katz Over a year ago

But I need to get the link out of the string

insertusernamehere Over a year ago

Hey, I've edited the post while you were commenting. :) I have added two different approaches. The last one should fit your problem.
