I have some specific questions about whether to use Pandas or alternative tools.
What is the reason to use Pandas rather than other tools or data structures?
When memory is a concern, how heavy is the cost of Pandas and what are the cheaper alternatives?
This is more of a qualitative question. What is the purpose of pandas? I find dictionaries and lists to fit my needs entirely. What's the big fuss with pandas?
For example I can store this table in a nested dictionary using much less memory, if there are lots of rows with identical values:
#key0 key1 value
A 1 a
A 1 b
A 2 a
A 2 b
B 1 a
B 1 b
B 2 a
B 2 b
d = {'A': {1: ['a', 'b'], 'A': {2: ['a', 'b'], 'B': {1: ['a', 'b'], 'B': {2: ['a', 'b']}}
Why would I want to use pandas, when there is a much more memory efficient way of holding my nested data? I just don't get it. Thanks!
I'm aware of the abilities of pandas to allow indexing by name, handle missing data, doing join, group by a value and so forth.
This is more of a qualitative question. Perhaps it belongs on Meta Stack Exchange instead.