PHP: remove `http://` from link title

Question

I have a string that looks like:

$string = '<a href="http://google.com">http://google.com</a>';

How can I remove the http:// part from the link text, but leave it in the href attribute?

You might find s($str)->replaceLast('http://') helpful, as found in this standalone library. — caw
– caw, Commented Jul 27, 2016 at 3:40

alex · Accepted Answer · 2011-02-02 13:54:26Z

11

Without using a full blown parser, this may do the trick for most situations...

$str = '<a href="http://google.com">http://google.com</a>';

$regex = '/(?<!href=["\'])http:\/\//';

$str = preg_replace($regex, '', $str);

var_dump($str); // string(42) "<a href="http://google.com">google.com</a>"

It uses a negative lookbehind to make sure there is no href=" or href=' preceding it.

See it on IDEone.

It also takes into account people who delimit their attribute values with '.

edited Feb 2, 2011 at 13:54

answered Feb 2, 2011 at 13:47

alex

492k205 gold badges890 silver badges992 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

Alex Over a year ago

that works, tx. nice site this ideone, you can actually run php code on it :)

Neil Knight · Accepted Answer · 2011-02-02 14:00:16Z

9

$string = '<a href="http://google.com">http://google.com</a>'; 
$var = str_replace('>http://','>',$string);

Just tried this in IDEone.com and it has the desired effect.

edited Feb 2, 2011 at 14:00

answered Feb 2, 2011 at 13:41

Neil Knight

48.8k26 gold badges136 silver badges193 bronze badges

3 Comments

Robert Over a year ago

Just worth throwing out there, this won't catch > http://..., but if you trim out the spaces beforehand this should do it.

Robert Over a year ago

Nah, a space between the <a> tags, like <a href='...'> Text </a>

alex Over a year ago

@Robert Or newline if you indent your text nodes (I often do for readability.)

lonesomeday · Accepted Answer · 2011-02-02 13:52:47Z

4

In this simple case, the preg_replace function will probably work. For more stability, try using DOMDocument:

$string = '<a href="http://google.com">http://google.com</a>';
$dom = new DOMDocument;
$dom->loadXML($string);

$link = $dom->firstChild;
$link->nodeValue = str_replace('http://', '', $link->nodeValue);
$string = $dom->saveXML($link);

answered Feb 2, 2011 at 13:52

lonesomeday

239k54 gold badges330 silver badges329 bronze badges

1 Comment

alex Over a year ago

Just an edge case, you may want to use regex to make sure you strip it off from the beginning only, what about a link like http://example.com/send-to-friend?url=http://somewhere.com ? Also, +1 for using a parser.

Muhammad Tahir · Accepted Answer · 2014-07-02 18:32:39Z

4

$str = 'http://www.google.com';
$str = preg_replace('#^https?://#', '', $str);
echo $str; // www.google.com

that will work for both http:// and https://

running live code

answered Jul 2, 2014 at 18:32

Muhammad Tahir

2,50331 silver badges25 bronze badges

Comments

Justin Morgan · Accepted Answer · 2011-02-02 13:45:27Z

2

Any simple regular expression or string replacement code is probably going to fail in the general case. The only "correct" way to do it is to actually parse the chunk as an SGML/XML snippet and remove the http:// from the value.

For any other (reasonably short) string manipulation code, finding a counterexample that breaks it will be pretty easy.

answered Feb 2, 2011 at 13:45

Justin Morgan

2,4452 gold badges16 silver badges19 bronze badges

2 Comments

mario Over a year ago

Well, the incorrect way is still more appropriate. There's not enough edge case potential to warrant using the overkill solution (html parser) here. A regular expression is sufficient. (The no regex for html parsing meme is somewhat dated.)

Justin Morgan Over a year ago

One man's "meme" is another man's correctness. We don't know how critical it is for this to work all the time, or how trustworthy the input might be. Regex will probably work, but I don't want to give @Alexandra the impression that their problem solved for every possible input.

Jong Bor Lee · Accepted Answer · 2011-02-02 13:50:11Z

2

Assuming that "http://" always appears twice on $string, search the string for "http://" backwards using strripos. If the search succeeds, you'll know the start_index of the "http://" you want to remove (and you know the length of course). Now you can use substr to extract everything that goes before and after the chunk you want remove.

answered Feb 2, 2011 at 13:50

Jong Bor Lee

3,8651 gold badge28 silver badges27 bronze badges

Comments

devasia2112 · Accepted Answer · 2011-02-02 13:42:43Z

1

$string = '<a href="http://google.com">http://google.com</a>';
$var = explode('http://',$string);
echo $var[2];

answered Feb 2, 2011 at 13:42

devasia2112

6,0646 gold badges41 silver badges56 bronze badges

Collectives™ on Stack Overflow

PHP: remove `http://` from link title

7 Answers 7

1 Comment

3 Comments

1 Comment

Comments

2 Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

7 Answers 7

1 Comment

3 Comments

1 Comment

Comments

2 Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related