PHP style question: "caching" object/array values in a variable?

Question

Please save me from myself (or reassure me that I'm not being completely misguided)

I've gotten into the habit of writing code something like the following:

function foo($aUserObject) {
    $theUserUID = $aUserObject->uid;
    $aDeepValue = $aUserObject->property[123][456];
    [more code, in which much use is made of $theUserUID and $aDeepValue]
}

My strategy is probably obvious: I'm taking the attitude that it's going to be easier for the PHP interpreter to handle a variable reference than to continually dig into the object to find the thing I'm interested in, so I should be getting some performance benefit. In addition, my code is perhaps a bit more bug-free and understandable (as long as I remember the meanings of the variable names), since I'm mostly writing simple variable names instead of longer and more complex object/array references where my fingers are more likely to slip. I understand that there's a price for doing this -- there are now two copies of $aUserObject->uid and $aUserObject->property[123][456] floating around, and if those values are large, the additional memory costs could add up. But I'm currently willing to pay that price in exchange for the (alleged) benefits.

Or, that's what I'm telling myself anyway, based on my naive theory of how PHP underpinnings work. But reality, especially when opcode caching tools like APC get introduced, may be a totally different matter. Any more informed opinions out there, that might push me one way or another?

Thanks!

You might want to do some reading here: php.net/manual/en/features.gc.php — datasage
– datasage, Commented Jun 10, 2011 at 23:18
I like the question. Normally I create local variables for readability (to avoid long lines), and when I need the value more than one time. But if I just need $aUserObject->uid once, It'd probably feel needless to write an extra line. — joakimdahlstrom
– joakimdahlstrom, Commented Jun 10, 2011 at 23:22
I agree with @joakimdahlstrom and do the same. In terms of some of the answers below, I would like to hear from someone who actually knows the answer to the OP question about the expense and memory issues to keep this focused. I think it used to be more of an issue and I used to do the same for REQUEST vars, but I don't think it helps much anymore (PHP >5.1), particularly if you are using opcode caching. — ldg
– ldg, Commented Jun 10, 2011 at 23:48

netcoder · Accepted Answer · 2011-06-11 01:07:17Z

4

I can reassure you, you're wrong when you say:

I understand that there's a price for doing this -- there are now two copies of $aUserObject->uid and $aUserObject->property[123][456] floating around, and if those values are large, the additional memory costs could add up.

Unless $theUserUID is modified or referenced, it points to the exact same memory location that the property you fetched it from.

You can even do:

$a = $b = $c = $d = $e = 'hello world!';

And it won't take any more memory than:

$a = 'hello world!';

A copy will be created in the following scenarios:

$a = 1;
$b = $a;  // $b references $a
$b = 2;   // $b is now a copy (no longer references $a)

$a = 1;
$b = $a;  // $b references $a
$c = &$b; // $b is now a copy (no longer references $a)

It's called copy-on-write.

Tip: Try debug_zval_dump and memory_get_usage and notice the refcount that increases, while the memory usage stays the same.

edited Jun 11, 2011 at 1:07

answered Jun 10, 2011 at 23:43

netcoder

68k19 gold badges129 silver badges142 bronze badges

Sign up to request clarification or add additional context in comments.

6 Comments

J.C. Inacio Over a year ago

What about functions with pass-by-reference? isn't there always a copy before the (first) function call?

netcoder Over a year ago

@jcinacio: What do you mean functions with pass-by-reference? A reference will never create a copy. As for passing variables as function arguments, the same copy-on-write mechanism applies there. Unless the variable is modified within the function, it will reference the original. If it is modified, a copy will be created locally (in the function) and destroyed when it goes out of scope (at the end of the function).

J.C. Inacio Over a year ago

@netcoder: a reference obviously doesn't create a copy, but if $b = $a, passing $b by reference means $b must have a copy of the contents from $a, and not the actual same data?

netcoder Over a year ago

@jcinacio: In that case, yes that's right. If $b = $a, passing $b by reference will create a copy of $a and $b will not reference $a anymore, at all (even outside the function), producing the same effect as copy-on-write. Another reason to be careful with references. ;)

J.C. Inacio Over a year ago

@netcoder - that's what i figured. so the wording should be "Unless $theUserUID is modified or referenced" ... :)

|

joakimdahlstrom · Accepted Answer · 2011-06-10 23:47:55Z

1

Correct me if I'm mistaken, but I believe you're wrong about something.

there are now two copies of $aUserObject->uid and $aUserObject->property[123][456] floating around, and if those values are large, the additional memory costs could add up.

There'll be another reference to the value, but it does NOT mean that it will occupy twice the amount of memory. It's like a relational database, you can add lots of references to the same element, but it's only the references themselves that will be stored more than once.

edited Jun 10, 2011 at 23:47

answered Jun 10, 2011 at 23:38

joakimdahlstrom

1,6051 gold badge12 silver badges26 bronze badges

8 Comments

ldg Over a year ago

If they were references, perhaps, but he's creating new variables the way he's doing it there.

netcoder Over a year ago

@ldg: Nope, @joakimdahlstrom is right on this one. It's called copy-on-write.

ldg Over a year ago

mmm true, if they are treated as read-only. tx for the clarification.

ldg Over a year ago

So, ok, if you don't change the reference the memory usage is the same, but is there any performance advantage to using a copy vs referencing a deep object property, like "$aDeepValue" vs "$aUserObject->property[123][456]" as in the OP question? I'm guessing not but would be interested in the details.

joakimdahlstrom Over a year ago

@Idg I don't think the difference is noticeable even if you're doing it all over your code. But I'd like to hear it from an expert as well.

|

J.C. Inacio · Accepted Answer · 2011-06-10 23:23:05Z

0

Code being more understandable is a big plus if you plan on maintaining it.

Of course, some routines might need more attention to memory/performance issues than others, but unless you are dealing with big amounts of data, the benefits are well worth the (possible) costs.

by the way, you can also use references:

$theUserUID = &$aUserObject->property[123][456];
$theUserUID = 'someValue'; // updates $aUserObject

answered Jun 10, 2011 at 23:23

J.C. Inacio

4,4722 gold badges25 silver badges25 bronze badges

3 Comments

tomfumb Over a year ago

agreed on understandability being a big plus, though apparently the subject of references is a little contentious, and doesn't guarantee better performance: schlueters.de/blog/archives/125-Do-not-use-PHP-references.html

J.C. Inacio Over a year ago

@user519575: references have big performance gains as the amount of data grows - such as very big arrays - both memory costs, and time spent copying

tomfumb Over a year ago

yes, but like I said it doesn't guarantee better performance. References in PHP are best used for large objects - as you point out - or when a function needs to update an incoming value. Both small objects and nested arrays can have performance costs when passed by reference.

Stuck · Accepted Answer · 2011-06-10 23:27:49Z

0

This seems good to me. But only as a second point.

I suggest:

Be sure that your code is readable and understandable (documentation, proper variable naming, codestyle guidlines, etc).
Search for modern standards or new one in the future in the languge (here php) and keep to it!
Reusablity: This is the benefit of point 1 and 2.
Performance issues should be discussed - but on an algorithmic/design-pattern layer and not on deep-code-basis. On the final implementation just be sure that you implement it correct.
Optimization: Only if some performance issues arise, than think of deep-code-optimization. But be sure that its understanable and readable. Otherwise the code gets useless in the future.

answered Jun 10, 2011 at 23:27

Stuck

12.5k13 gold badges76 silver badges124 bronze badges

1 Comment

J.C. Inacio Over a year ago

I hate to disagree that performance shouldn't be thought about on "deep code". there are many simple things that can have a drastic effect on performance (such as passing large arrays by value)

Collectives™ on Stack Overflow

PHP style question: "caching" object/array values in a variable?

4 Answers 4

6 Comments

8 Comments

3 Comments

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

6 Comments

8 Comments

3 Comments

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related