Most memory-efficient map in Java

Question

I am implementing a tree-like structure using the Map interface like the following declaration:

Map<String, Map<String, Map<Integer, Double>>>

Currently I am using the HashMap implementation. After loading a huge amount of data, I am seeing the program consume 4GB of RAM. On persisting the whole entity using the Serializable interface, the resulting file's size is just 1GB.

What is the most memory-efficient Map implementation that I could use here?

Is using maps the right solution? Shouldn't you use a List<FirstLevelNode>, with FirstLevelNode holding a List<SecondLevelNode>, and SecondLevelNode holding a List<ThirdLevelNode>?
– JB Nizet, Commented Nov 25, 2012 at 14:33
Won't using a list affect retrieval performance? I am fine with a longer load time, but retrieval time is what I am trying to save here.
– Vineeth Mohan, Commented Nov 25, 2012 at 15:01
It is strange to call this structure a tree. It is indeed tree-shaped, assuming that none of the values in the map are coupled with more than one key. Otherwise, you'd have a graph. In order to give you the best answer, you need to describe the access pattern for this structure. Do you usually have two strings and an integer in hand for which you want to find the corresponding double value? Or do you need to grab subtrees (say, given just the first string) and pass those around as well? Restated: Is this really a mapping from a composite key (a tuple of two strings and an integer) to a double?
– seh, Commented Nov 25, 2012 at 15:06
All I want is to map a (String,String,Integer) -> Float. As there is a large volume of such data, it's very important to find the most efficient method here.
– Vineeth Mohan, Commented Nov 25, 2012 at 16:02

JB Nizet · Accepted Answer · 2012-11-25 16:13:25Z

4

If you want to map a (String,String,Integer) to a Float, then the best thing to do is to use a Map<MyKey, Float>, where MyKey would be defined like this:

import java.util.Objects;

public final class MyKey {
    private final String a;
    private final String b;
    private final Integer c;

    public MyKey(String a, String b, Integer c) {
        this.a = a;
        this.b = b;
        this.c = c;
    }

    // getters, if needed

    @Override
    public int hashCode() {
        return Objects.hash(a, b, c);
    }

    @Override
    public boolean equals(Object o) {
        if (o == this) {
            return true;
        }
        if (!(o instanceof MyKey)) {
            return false;
        }
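        MyKey other = (MyKey) o;
        // Compare against the cast reference; java.util.Objects provides equals(), not equal()
        return Objects.equals(a, other.a)
               && Objects.equals(b, other.b)
               && Objects.equals(c, other.c);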
    }
}
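
A minimal usage sketch for the flattened map, assuming a key class like the one above. The class name FlatMapDemo, the build() helper, and the sample values are hypothetical; a compact copy of the key is nested inside so the example compiles on its own.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Objects;

class FlatMapDemo {
    // Compact copy of the composite key so this sketch is self-contained.
    static final class MyKey {
        private final String a;
        private final String b;
        private final Integer c;

        MyKey(String a, String b, Integer c) {
            this.a = a;
            this.b = b;
            this.c = c;
        }

        @Override
        public int hashCode() {
            return Objects.hash(a, b, c);
        }

        @Override
        public boolean equals(Object o) {
            if (o == this) {
                return true;
            }
            if (!(o instanceof MyKey)) {
                return false;
            }
            MyKey other = (MyKey) o;
            return Objects.equals(a, other.a)
                   && Objects.equals(b, other.b)
                   && Objects.equals(c, other.c);
        }
    }

    // One flat map replaces the three nested levels (sample data is made up).
    static Map<MyKey, Float> build() {
        Map<MyKey, Float> values = new HashMap<>();
        values.put(new MyKey("x", "y", 1), 1.5f);
        values.put(new MyKey("x", "y", 2), 2.5f);
        return values;
    }

    public static void main(String[] args) {
        Map<MyKey, Float> values = build();
        // A lookup builds a fresh key; equals() and hashCode() make it match the stored one.
        System.out.println(values.get(new MyKey("x", "y", 2)));
        // A key that was never stored yields null.
        System.out.println(values.get(new MyKey("x", "z", 2)));
    }
}
```

The trade-off is that a flat map cannot hand out a sub-map for a given first string without scanning every entry, so this fits only when all three key parts are known at lookup time.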

answered Nov 25, 2012 at 16:13

JB Nizet

seh Over a year ago

That is a correct way to do it, but it does not address the OP's question as to which way is most efficient in terms of memory consumed by the structure. Here we add the overhead of four object headers per key. There's another design that would use just two headers per key: a dummy type wrapped around a byte array.
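
A rough sketch of what the byte-array design mentioned in this comment might look like. The class name PackedKey and the byte layout are assumptions made for illustration, not something given in the thread. The wrapper object and its array are the only two objects per key.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical wrapper: one object header for the wrapper, one for the array.
final class PackedKey {
    private final byte[] bytes;

    PackedKey(String a, String b, int c) {
        byte[] ab = a.getBytes(StandardCharsets.UTF_8);
        byte[] bb = b.getBytes(StandardCharsets.UTF_8);
        // Assumed layout: [length of a][a][b][c].
        // The length prefix keeps ("ab", "c") distinct from ("a", "bc");
        // b needs no prefix because c always occupies the last four bytes.
        bytes = ByteBuffer.allocate(4 + ab.length + bb.length + 4)
                .putInt(ab.length)
                .put(ab)
                .put(bb)
                .putInt(c)
                .array();
    }

    @Override
    public int hashCode() {
        return Arrays.hashCode(bytes);
    }

    @Override
    public boolean equals(Object o) {
        return o instanceof PackedKey
               && Arrays.equals(bytes, ((PackedKey) o).bytes);
    }
}
```

Reading the original strings back out of such a key requires decoding, so this suits lookups more than iteration over keys.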

Louis Wasserman Over a year ago

+1 for this solution over @seh's. The real issue is avoiding the overhead incurred by the three nested maps; fancier approaches require significantly more work for minimal benefit.

JB Nizet Over a year ago

Agreed. But it would already be a lot more efficient than maps of maps of maps. I would go with a clear and simple solution first, and see if it needs additional optimization only after.

seh Over a year ago

Well, sure, but the OP didn't ask whether he should care about such things. He said that he does, and I take him at his word that he'd like to learn more about storage overhead through our answers (and, in this case, our probing questions too).

Vineeth Mohan Over a year ago

@seh - I can't see how this is a solution. Here you are mapping (String,String,Integer) to a unique hashcode of integer type. This might not be possible in my case, as there is a huge volume of such data and 2^32 integers won't be able to represent them all. My volume of data would be much higher than that.
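
On the concern raised in the comment above: hashCode() values do not have to be unique. HashMap uses the hash only to choose a bucket and then calls equals() to tell keys apart. The sketch below illustrates this, using a List as a stand-in for a composite key; the class name HashCollisionDemo and its helpers are hypothetical. The strings "Aa" and "BB" have the same String hash code, so the two keys collide on purpose.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class HashCollisionDemo {
    // Stand-in composite key: a List combines element hashes, much as Objects.hash does.
    static List<Object> key(String a, String b, int c) {
        return Arrays.<Object>asList(a, b, c);
    }

    // Stores two keys that share a hash code but are not equal.
    static Map<List<Object>, Float> build() {
        Map<List<Object>, Float> values = new HashMap<>();
        values.put(key("Aa", "x", 1), 1.0f);
        values.put(key("BB", "x", 1), 2.0f);
        return values;
    }

    public static void main(String[] args) {
        List<Object> k1 = key("Aa", "x", 1);
        List<Object> k2 = key("BB", "x", 1);
        Map<List<Object>, Float> values = build();

        System.out.println(k1.hashCode() == k2.hashCode()); // same hash code
        System.out.println(k1.equals(k2));                  // but not equal
        System.out.println(values.size());                  // both entries kept
        System.out.println(values.get(k1) + " " + values.get(k2));
    }
}
```

The practical ceiling on entries is therefore set by HashMap's int-sized capacity and the available heap, not by the number of distinct hash codes.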

Aleksander Blomskøld · Accepted Answer · 2012-11-25 14:32:38Z

3

You have two kinds of maps here. The first has String keys and Map values; for that I'd probably use Google Guava's ImmutableMap, if immutability is OK for you. It will probably not save you a lot of memory, but it might save you some, and perform a bit better than a normal HashMap.

For the other Map, with Integer keys and Double values, you should use a specialized Map implementation that stores primitives instead of objects. Take a look, for instance, at Trove4j's TIntDoubleHashMap. This will save you a lot of memory, as the keys and values are stored as primitives instead of boxed Objects.
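
To illustrate why primitive storage helps, here is a minimal sketch of an int-to-double map backed by parallel arrays. It is not Trove's implementation; the class name IntDoubleMapSketch and all of its methods are hypothetical, and it supports only put and get. A HashMap<Integer, Double> needs an entry object plus a boxed Integer and a boxed Double per mapping, whereas this layout allocates no per-entry objects at all.

```java
// Illustrative open-addressing map with primitive keys and values (no removal support).
class IntDoubleMapSketch {
    private int[] keys = new int[16];         // capacity is always a power of two
    private double[] values = new double[16];
    private boolean[] used = new boolean[16]; // marks occupied slots
    private int size;

    // Returns the slot holding the key, or the free slot where it would go.
    private int slot(int key) {
        int mask = keys.length - 1;
        int i = (key ^ (key >>> 16)) & mask;  // mix the high bits into the index
        while (used[i] && keys[i] != key) {
            i = (i + 1) & mask;               // linear probing
        }
        return i;
    }

    void put(int key, double value) {
        if ((size + 1) * 2 > keys.length) {
            grow();                           // keep the table at most half full
        }
        int i = slot(key);
        if (!used[i]) {
            used[i] = true;
            keys[i] = key;
            size++;
        }
        values[i] = value;
    }

    // Returns the stored value, or the caller-supplied default when the key is absent.
    double get(int key, double missing) {
        int i = slot(key);
        return used[i] ? values[i] : missing;
    }

    int size() {
        return size;
    }

    // Doubles the capacity and reinserts every existing entry.
    private void grow() {
        int[] oldKeys = keys;
        double[] oldValues = values;
        boolean[] oldUsed = used;
        keys = new int[oldKeys.length * 2];
        values = new double[keys.length];
        used = new boolean[keys.length];
        size = 0;
        for (int i = 0; i < oldKeys.length; i++) {
            if (oldUsed[i]) {
                put(oldKeys[i], oldValues[i]);
            }
        }
    }
}
```

In practice a maintained library such as the one named above is the better choice; the sketch only shows where the memory saving comes from.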

answered Nov 25, 2012 at 14:32

Aleksander Blomskøld
