I'm looking to make a specific graph layout from a specific format of Python data, but where the data may be text, string, or another Python object.
I can do this with XlsxWriter (see below), but the data overloads it (14 million lines...no joke. DNA data). Is it possible/how can I do this with Matplotlib or PAndas (or more stable with large data)?
DETAILS: It has to be able to do it on the fly, and make visual formatting decisions based on whether the data is "the same" (regardless of numerically, text or another Python object).
The coloring in the cells is based on whether the items within {1:"A", 2:"A"}are equivalent (not the same object). Green for True, red for False. For example: "A" == "A" (as in the preceding sentences), or (10/5) == (20/10), or ["A", 1, <test object at 0x1052c9b70>] == ["A", 1, <test object at 0x1052c9b70>] would all be green.
The text that appears in the cells is just the __str__ representation of the object.
EXAMPLE ...
{
1000:{
"Sample1":{1:"A", 2:"A"},
"Sample2":{1:"A", 2:"A"},
"Sample3":{1:"A", 2:"A"},
},
1001:{
"Sample1":{1:"A", 2:"A"},
"Sample2":{1:"A", 2:"A"},
"Sample3":{1:"A", 2:"A"}
},
1002:{
"Sample1":{1:"C", 2:"A"},
"Sample2":{1:"A", 2:"A"},
"Sample3":{1:"A", 2:"A"}
},
(...)
9999:{
"Sample1":{1:"A", 2:"T"},
"Sample2":{1:"A", 2:"A"},
"Sample3":{1:"A", 2:"G"}
},
}


Stack exchangeforum where this type of opinion question IS appropriate?