Getting substring of a string that exists between patterns

Question

I have a string like this:

$string = 'startTHISISTHESTRINGINEEDend'

I need the string between start and the end obviously. I tried regex but its way hard for a newbie like me so no succes on that side. The length of end part is changing so substr function is no go. Tried to combine strpos with strstring but no success on that too. How can I achieve that?

Giacomo1968 · Accepted Answer · 2014-05-26 23:11:14Z

2

Try this; will provide a detailed explanation so the whole concept of the regex is simplified doesn’t seem like magic:

$string = "startTHISISTHESTRINGINEEDend";

preg_match("/^start([a-z0-9]+)end$/is", $string, $matches, null, 0);

echo '<pre>';
print_r($matches);
echo '</pre>';

The regex is fairly simple. And here is the explanation.

Match start at the beginning of the string; that is what the ^ means.
Next the parentheses of ( and ) basically mean you are capturing what are in between the parentheses.
So in between the ( and ) is regex logic to capture only alphanumeric characters as represented by [a-z0-9]+.
Match end at the end of the string; that is what the $ means.
The is at the end of the the regex basically means make sure the match is case insensitive via the i (aka: PCRE_CASELESS) and then the s indicates a PCRE_DOTALL which as explained in the PHP manual on pattern modifiers:

If this modifier is set, a dot metacharacter in the pattern matches all characters, including newlines. Without it, newlines are excluded. This modifier is equivalent to Perl's /s modifier. A negative class such as [^a] always matches a newline character, independent of the setting of this modifier.

If you wish to include non alpha numeric characters you can just use this (.*?) instead of ([a-z0-9]+). But unclear from your request since you are only showing alphabet characters. Or if you wanted to capture specific non-alphanumeric characters like %, / & ^ then just do this: ([a-z0-9%\/^]+). Note how the / is set as \/. Adding the \ escapes the / which makes preg_match realize that it needs to explicitly match / & not interpret that as part of the regex logic.

And the output of $matches would be:

Array
(
    [0] => startTHISISTHESTRINGINEEDend
    [1] => THISISTHESTRINGINEED
)

So just access it by referring to $matches[1].

echo $matches[1];

Output would be:

THISISTHESTRINGINEED

edited May 26, 2014 at 23:11

answered May 26, 2014 at 22:31

Giacomo1968

25.3k11 gold badges78 silver badges106 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

BlackVegetable Over a year ago

I'll remove my answer in favor of yours as you have a PHP specific solution. Could you just jot a quick edit down explaining what the [a-z0-9]+ does for this poster's sake?

BlackVegetable Over a year ago

Holy cow. I think you just answered 50% of all regex questions on Stack Overflow.

CanNuhlar Over a year ago

That's really a good explanation. Thanks a lot. But I didn't get the [a-z0-9]+ part. So characters like % / ^ will be ignored then?

TeoTN · Accepted Answer · 2014-05-26 22:27:47Z

0

$string = str_replace("start","", $string);
$string = str_replace("end","", $string);
echo $string;

Not a wise solution but probably sufficient one

answered May 26, 2014 at 22:27

TeoTN

5211 gold badge7 silver badges23 bronze badges

Comments

Hans Schindler · Accepted Answer · 2014-05-26 22:34:08Z

0

start(.*?)end

then get Group#1. In demo look at Group#1.

answered May 26, 2014 at 22:34

Hans Schindler

1,8075 gold badges17 silver badges25 bronze badges

Comments

Markus · Accepted Answer · 2014-05-26 22:37:00Z

0

If your start and end occurs somewhere in a string (only once) and you don't want to use regular expressions, you can use the following solution:

$string = 'HIstartTHISISTHESTRINGINEEDendANDOTHERSTUFFHERE';
$start = strpos($string, 'start') + strlen('start');
$end = strrpos($string, 'end');
$result = substr($string, $start, $end-$start);
var_dump($result);

answered May 26, 2014 at 22:37

Markus

3334 silver badges11 bronze badges

Comments

low_rents · Accepted Answer · 2014-05-26 22:47:28Z

0

$string = 'xwerwstartTHISISTHESTRINGINEEDasdwerendwerq';

$start = 'start';
$stop = 'end';

echo substr($string, 
    strlen($start) + strpos($string, $start), 
    strpos($string, $stop) - (strpos($string, $start) + strlen($start))
);

answered May 26, 2014 at 22:47

low_rents

4,4813 gold badges30 silver badges55 bronze badges

Collectives™ on Stack Overflow

Getting substring of a string that exists between patterns

5 Answers 5

3 Comments

Comments

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

3 Comments

Comments

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related