Most memory efficient way to grow an array in Java?

Question

I'm not too concerned about time efficiency (the operation will be rare), but rather about memory efficiency: Can I grow the array without temporarily having all the values twice?

Is there a more efficient way to grow a large array than creating a new one and copying over all the values? Like, concatenating it with a new one?

What about having fixed-size arrays stored in another array and reallocate / copy that top-level one? Would that leave the actual values in place?

I'm aware of ArrayList, but I need a lot of control about accessing the array and the access needs to be very fast. For instance, I think I prefer a[i] to al.get(i).

The main reason why I care about this is that the array in question (or a number of such arrays) might very well occupy a large enough portion of main memory that the usual strategy of creating a double sized copy before discarding the original might not work out. This may mean that I need to reconsider the overall strategy (or up my hardware recommendations).

Are you specifically disallowing use of ArrayList? If so, why? — Vinay Sajip
– Vinay Sajip, Commented Sep 15, 2009 at 13:41

jjnguy · Accepted Answer · 2009-10-02 02:36:14Z

29

The best way to have a dynamically resizing 'array' or list of items is to use an ArrayList.

Java has already built in very efficient resizing algorithms into that data structure.

But, if you must resize your own array, it is best to use System.arraycopy() or Arrays.copyOf().

Arrays.copyOf() can most simply be used like so:

int[] oldArr;
int newArr = Arrays.copyOf(oldArr, oldArr.length * 2);

This will give you a new array with the same elements as the old array, but now with room to spare.

The Arrays class in general has lots of great methods for dealing with arrays.

Also

It is important to make sure that you aren't just growing your array by one element each time an element is added. It is best to implement some strategy where you only have to resize the array every once in a while. Resizing arrays is a costly operation.

edited Oct 2, 2009 at 2:36

answered Sep 15, 2009 at 13:40

jjnguy

139k54 gold badges298 silver badges328 bronze badges

Sign up to request clarification or add additional context in comments.

4 Comments

Hanno Fietz Over a year ago

Well, the growing strategy would most likely be the classic "double each time".

Hanno Fietz Over a year ago

I amended my post to clarify that I'm not concerned about the time it takes to grow the array, but rather about the extra memory that's required by the operation. If this has to be 100% of the array size, I can't grow arrays but have to add arrays.

Hanno Fietz Over a year ago

+1 for a very good answer to the more sloppy first revision of my question!

jjnguy Over a year ago

I'll keep this one here though, it still has some good information.

mikyra · Accepted Answer · 2015-12-14 02:38:20Z

18

Is there a more efficient way to grow a large array than creating a new one and copying over all the values? Like, concatenating it with a new one?

No. And probably there is no language, that guarantees growing an array will always take place without copying. Once you allocate the space for the array and do something else, you most likely have other objects in memory right after the end of the array. At that point, it's fundamentally impossible to grow the array without copying it.

What about having fixed-size arrays stored in another array and reallocate / copy that top-level one? Would that leave the actual values in place?

You mean have an array of arrays and treat it as one large array consisting of a concatenation of the underlying arrays? Yes, that would work (the "faking it by doing indirection" approach), as in Java, Object[][] is simply an array of pointers to Object[] instances.

edited Dec 14, 2015 at 2:38

mikyra

10.4k1 gold badge43 silver badges41 bronze badges

answered Sep 15, 2009 at 13:46

Michael Borgwardt

347k81 gold badges491 silver badges726 bronze badges

2 Comments

Hanno Fietz Over a year ago

Thanks for commenting on my indirection idea, that's exactly what I was wondering.

Lisa Over a year ago

Really minor side note on the topic of "no language does this"... C will do exactly this--resize an array without copying the data--so long as the realloc'd memory does not fill the entire block and\or no blocks lie beyond the allocated block. With a large block allocator or fastest fit allocator this is not uncommon depending on the size and alignment of your array. With best fit allocators, naturally the memory will be moved to a different block most of the time, however it still eliminates the "double copy" as the memory may be moved a segment at a time if the implementation supports it.

KLE · Accepted Answer · 2009-09-15 13:46:09Z

Arrays are constant-size, so there is no way to grow them. You can only copy them, using System.arrayCopy to be efficient.

ArrayList does exactly what you need. It's optimized much better than any of us could do, unless you devote a considerable time to it. It uses internally System.arrayCopy.

Even more, if you have some huge phases where you need the list to grow/reduce, and others where it doesn't grow/reduce and you make thousands of read or write in it. Suppose also you have a huge performance need, that you prooved that ArrayList is too slow when read/writing. You could still use the ArrayList for one huge phase, and convert it to an array for the other. Note this would be effective only if your application phases are huge.

Community · Accepted Answer · 2017-02-08 14:15:18Z

4

How about a linked list coupled with an array that holds only references.

The linked list can grow without having to allocate new memory, the array would ensure you have easy access. And every time the array becomes to small, you can simply trash the entire array and build it up again from the linked list.

alt text

edited Feb 8, 2017 at 14:15

CommunityBot

11 silver badge

answered Sep 15, 2009 at 14:10

NomeN

17.7k7 gold badges35 silver badges33 bronze badges

14 Comments

Yannick Motton Over a year ago

Wouldn't that mean you have a permanent copy of all the data in memory? With the memory limitations he talks about, don't think this will help.

Hanno Fietz Over a year ago

For my case, Yannick is right, but it's still an interesting idea, might be an inspiration for similar problems.

Yannick Motton Over a year ago

Well if were to be used only as a backup to rebuild a fixed size array, you might aswell use an ArrayList :-)

NomeN Over a year ago

@Yannick If you'd use an arraylist, don't you have a full copy in memory again when you increase the size of the ArrayList.

NomeN Over a year ago

@Yannick (again, 1st comment now) I don't believe I'd have a copy of all data in memory, if I'm not parsing your comment correctly or miss some fundamentals here please set me straight.

|

Sam Barnum · Accepted Answer · 2009-09-15 14:40:43Z

3

"Can I grow the array without temporarily having all the values twice?"

Even if you copy the arrays, you're only going to have all the values once. Unless you call clone() on your values, they're passed by reference into the new array.

If you already have your values in memory, the only additional memory expense when copying into a new array is allocating the new Object[] array, which doesn't take much memory at all, as it's just a list of pointers to value objects.

answered Sep 15, 2009 at 14:40

Sam Barnum

10.8k4 gold badges58 silver badges63 bronze badges

1 Comment

Yannick Motton Over a year ago

It's a large array of primitives. No object references. Hence copying the array does need at least twice the size of the array allocated.

Yannick Motton · Accepted Answer · 2009-09-15 14:52:40Z

2

Is the array itself large, or are you referencing large ReferenceTypes?

There is a difference between an array of a PrimitiveType with billions of elements, and an array with thousands of elements, but they refer to large class instances.

int[] largeArrayWithSmallElements = new int[1000000000000];
myClass[] smallArrayWithLargeElements = new myClass[10000];

Edit:

If you have performance considerations using ArrayList, I can assure you it will perform more or less exactly as Array indexing.

And if the application has limited memory resources, you can try to play around with the initial size of the ArrayList (one of it's constructors).

For optimal memory efficiency, you could create a container class with an ArrayList of Arrays.

Something like:

class DynamicList
{
    public long BufferSize;
    public long CurrentIndex;

    ArrayList al = new ArrayList();

    public DynamicList(long bufferSize)
    {
        BufferSize = bufferSize;

        al.add(new long[BufferSize]);
    }

    public void add(long val)
    {
        long[] array;

        int arrayIndex = (int)(CurrentIndex / BufferSize);

        if (arrayIndex > al.size() - 1)
        {
            array = new long[BufferSize];
            al.add(array);
        }
        else
        {
            array = (long[])al.get(arrayIndex);
        }

        array[CurrentIndex % BufferSize] = val;
    }

    public void removeLast()
    {
        CurrentIndex--;
    }

    public long get(long index)
    {
        long[] array;

        int arrayIndex = (int)(index / BufferSize);

        if (arrayIndex < al.size())
        {
            array = (long[])al.get(arrayIndex);
        }
        else
        {
            // throw Exception
        }

        return array[index % BufferSize];
    }
}

(my java is rusty, so please bear with me...)

edited Sep 15, 2009 at 14:52

answered Sep 15, 2009 at 14:07

Yannick Motton

36.2k4 gold badges41 silver badges55 bronze badges

3 Comments

Hanno Fietz Over a year ago

The array itself is large (millions), the element type is long. Also, there are several thousand such arrays.

Hanno Fietz Over a year ago

The difference being that copying the small array with large elements only takes up the space for the object references, not the objects?

Yannick Motton Over a year ago

Exactly, hence my question, but since you have an array that takes a lot of memory, I would suggest a different approach when you have memory efficiency in mind.

kgiannakakis · Accepted Answer · 2009-09-15 13:36:02Z

1

Have a look at System.arraycopy.

answered Sep 15, 2009 at 13:36

kgiannakakis

104k28 gold badges163 silver badges197 bronze badges

2 Comments

bruno conde Over a year ago

This is not what the OP is asking ... A copy of the array is still made when growing.

Hanno Fietz Over a year ago

That might not have been clear in the first revision of my post, I made some clarifications later.

Carlos Tasada · Accepted Answer · 2009-09-15 13:38:32Z

1

AFAIK the only way of growing or reducing an array is doing a System.arraycopy

   /**
    * Removes the element at the specified position in this list.
    * Shifts any subsequent elements to the left (subtracts one from their
    * indices).
    *
    * @param index the index of the element to removed.
    * @return the element that was removed from the list.
    * @throws    IndexOutOfBoundsException if index out of range <tt>(index
    *     &lt; 0 || index &gt;= length)</tt>.
    */
    public static <T> T[] removeArrayIndex(T[] src, int index) {
        Object[] tmp = src.clone();

        int size = tmp.length;
        if ((index < 0) && (index >= size)) {
            throw new ArrayIndexOutOfBoundsException(index);
        }

        int numMoved = size - index - 1;
        if (numMoved > 0) {
            System.arraycopy(tmp, index + 1, tmp, index, numMoved);
        }
        tmp[--size] = null; // Let gc do its work

        return (T[]) Arrays.copyOf(tmp, size - 1);
    }

   /**
    * Inserts the element at the specified position in this list.
    * Shifts any subsequent elements to the rigth (adds one to their indices).
    *
    * @param index the index of the element to inserted.
    * @return the element that is inserted in the list.
    * @throws    IndexOutOfBoundsException if index out of range <tt>(index
    *     &lt; 0 || index &gt;= length)</tt>.
    */
    public static <T> T[] insertArrayIndex(T[] src, Object newData, int index) {
        Object[] tmp = null;
        if (src == null) {
            tmp = new Object[index+1];
        } else {
            tmp = new Object[src.length+1];

            int size = tmp.length;
            if ((index < 0) && (index >= size)) {
                throw new ArrayIndexOutOfBoundsException(index);
            }

            System.arraycopy(src, 0, tmp, 0, index);
            System.arraycopy(src, index, tmp, index+1, src.length-index);
        }

        tmp[index] = newData;

        return (T[]) Arrays.copyOf(tmp, tmp.length);
    }

answered Sep 15, 2009 at 13:38

Carlos Tasada

4,4441 gold badge25 silver badges26 bronze badges

3 Comments

Gazzonyx Over a year ago

I should look this up before I make myself look silly (but time is an issue ATM), but don't you need a throws clause in the method declaration for the ArrayIndexOutOfBoundsException, or is that unchecked?

Carlos Tasada Over a year ago

I would need to double check, but this code is copy&paste from an utility class that I've in production (but I'm not sure if this code is used anymore) ;)

hjhill Over a year ago

ArrayIndexOutOfBoundsException is definitely unchecked.

Mkkt Bkkt · Accepted Answer · 2009-09-15 13:40:44Z

1

Obviously, the important bit here is not if you concatenate the arrays or copy them over; what's more important is your array growing strategy. It's not hard to see that a very good way to grow an array is always doubling its size when it becomes full. This way, you will turn the cost of adding an element to O(1) as the actual growing stage will happen only relatively rarely.

answered Sep 15, 2009 at 13:40

Mkkt Bkkt

122k40 gold badges154 silver badges177 bronze badges

1 Comment

Hanno Fietz Over a year ago

Yes, that's one important bit, but I was aware of that one. My question arises because I have a really large array that may occupy a large enough portion of memory that I can not just create a temporary copy.

Malaxeur · Accepted Answer · 2009-09-15 14:22:28Z

One way of doing this is having a linked list of array nodes. This is somewhat complex but the premise is this:

You have a linked list and each node within the list references an array. This way, your array can grow without ever copying. To grow you only need to add additional nodes at the end. Therefore the 'expensive' grow operation only occurs every M operations where M is the size of each node. Granted, this assumes that you always append to the end and you don't remove.

Insertion and removal in this structure is quite complicated, but if you can avoid them then that's perfect.

The only loss with this structure (ignoring insertion and deletion) is with the gets. The gets will be slightly longer; accessing the correct node requires accessing the correct node within the linked list and then fetching there. If there are a lot of accesses around the middle, this can be slow however there are tricks to speeding linked lists up.

Robert · Accepted Answer · 2009-09-15 14:30:21Z

1

Have you looked at GNU Trove for highly efficient java collections? Their collections store primatives directly for much better memory usage.

answered Sep 15, 2009 at 14:30

Robert

8,64910 gold badges42 silver badges57 bronze badges

Comments

Narayan · Accepted Answer · 2009-09-15 13:46:10Z

0

Heres a benchmark of time taken to add and remove elements from a collection/arraylist/vector

answered Sep 15, 2009 at 13:46

Narayan

6,3113 gold badges43 silver badges45 bronze badges

1 Comment

Gazzonyx Over a year ago

That's Java 1.2 Those figures are probably very out of date.

Collectives™ on Stack Overflow

Most memory efficient way to grow an array in Java?

12 Answers 12

4 Comments

2 Comments

Comments

14 Comments

1 Comment

3 Comments

2 Comments

3 Comments

1 Comment

Comments

Comments

1 Comment

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

12 Answers 12

4 Comments

2 Comments

Comments

14 Comments

1 Comment

3 Comments

2 Comments

3 Comments

1 Comment

Comments

Comments

1 Comment

Your Answer

Sign up or log in

Post as a guest

Linked

Related