3

I have a dictionary in the following form:

{
    'variable_1': [a1, b1, c1],
    'variable_2': [a2, b2, c2],
    'variable_3': [a3, b3, c3],
    ...
}

(There are n entries in this dictionary).

I have a problem where I want to obtain all the possible combinations of the different values, for instance:

[(a1, a2, a3), (b1, a2, a3), (c1, a2, a3), (a1, b2, a3), ...]

I know this can be done with a cartesian product with itertools. However, I have noticed my problem would be much easier if the result was in the following form:

[{'variable1': a1, 'variable2':a2, 'variable3': a3}, {'variable1': b1, 'variable2':a2, 'variable3': a3}, ...]

Is there an easy way to do that?

marc_s
  • 732,580
  • 175
  • 1,330
  • 1,459
Eldy
  • 1,027
  • 1
  • 13
  • 31

3 Answers3

2
import itertools
d = {
'variable_1': ['a1', 'b1', 'c1'],
'variable_2': ['a2', 'b2', 'c2'],
'variable_3': ['a3', 'b3', 'c3'],
}


keys = (list(d.keys())) # ['variable_1', 'variable_2', 'variable_3']
values = (list(itertools.product(*(list(d.values()))))) #[('a1', 'a2', 'a3'), ('a1', 'a2', 'b3'), ('a1', 'a2', 'c3')...]

dictionary = []
for val in values:
    dictionary.append(dict(zip(keys, val)))

output:

[{'variable_1': 'a1', 'variable_2': 'a2', 'variable_3': 'a3'}, {'variable_1': 'a1', 'variable_2': 'a2', 'variable_3': 'b3'}, {'variable_1': 'a1', 'variable_2': 'a2', 'variable_3': 'c3'}, {'variable_1': 'a1', 'variable_2': 'b2', 'variable_3': 'a3'},...]

exapmle:

d = {
'variable_1': ['a1', 'b1', 'c1','d1'],
'variable_2': ['a2', 'b2', 'c2','d2'],
'variable_3': ['a3', 'b3', 'c3','d3'],
'variable_4': ['a3', 'b3', 'c3','d4'],
}

output:

[{'variable_1': 'a1', 'variable_2': 'a2', 'variable_3': 'a3', 'variable_4': 'a3'}, {'variable_1': 'a1', 'variable_2': 'a2', 'variable_3': 'a3', 'variable_4': 'b3'}, {'variable_1': 'a1', 'variable_2': 'a2', 'variable_3': 'a3', 'variable_4': 'c3'}, {'variable_1': 'a1', 'variable_2': 'a2', 'variable_3': 'a3', 'variable_4': 'd4'}, ...]
ncica
  • 7,015
  • 1
  • 15
  • 37
1

You can also use a recursive function for a no-import solution:

def combo(data, c=[]):
  if not data:
    yield c
  else:
    for i in data[0]:
      yield from combo(data[1:], c+[i])

d = {'variable_1': ['a1', 'b1', 'c1'], 'variable_2': ['a2', 'b2', 'c2'], 'variable_3': ['a3', 'b3', 'c3']} 
keys, values = zip(*d.items())
result = [dict(zip(keys, i)) for i in combo(values)]

Output:

[{'variable_1': 'a1', 'variable_2': 'a2', 'variable_3': 'a3'}, {'variable_1': 'a1', 'variable_2': 'a2', 'variable_3': 'b3'}, {'variable_1': 'a1', 'variable_2': 'a2', 'variable_3': 'c3'}, {'variable_1': 'a1', 'variable_2': 'b2', 'variable_3': 'a3'}, {'variable_1': 'a1', 'variable_2': 'b2', 'variable_3': 'b3'}, {'variable_1': 'a1', 'variable_2': 'b2', 'variable_3': 'c3'}, {'variable_1': 'a1', 'variable_2': 'c2', 'variable_3': 'a3'}, {'variable_1': 'a1', 'variable_2': 'c2', 'variable_3': 'b3'}, {'variable_1': 'a1', 'variable_2': 'c2', 'variable_3': 'c3'}, {'variable_1': 'b1', 'variable_2': 'a2', 'variable_3': 'a3'}, {'variable_1': 'b1', 'variable_2': 'a2', 'variable_3': 'b3'}, {'variable_1': 'b1', 'variable_2': 'a2', 'variable_3': 'c3'}, {'variable_1': 'b1', 'variable_2': 'b2', 'variable_3': 'a3'}, {'variable_1': 'b1', 'variable_2': 'b2', 'variable_3': 'b3'}, {'variable_1': 'b1', 'variable_2': 'b2', 'variable_3': 'c3'}, {'variable_1': 'b1', 'variable_2': 'c2', 'variable_3': 'a3'}, {'variable_1': 'b1', 'variable_2': 'c2', 'variable_3': 'b3'}, {'variable_1': 'b1', 'variable_2': 'c2', 'variable_3': 'c3'}, {'variable_1': 'c1', 'variable_2': 'a2', 'variable_3': 'a3'}, {'variable_1': 'c1', 'variable_2': 'a2', 'variable_3': 'b3'}, {'variable_1': 'c1', 'variable_2': 'a2', 'variable_3': 'c3'}, {'variable_1': 'c1', 'variable_2': 'b2', 'variable_3': 'a3'}, {'variable_1': 'c1', 'variable_2': 'b2', 'variable_3': 'b3'}, {'variable_1': 'c1', 'variable_2': 'b2', 'variable_3': 'c3'}, {'variable_1': 'c1', 'variable_2': 'c2', 'variable_3': 'a3'}, {'variable_1': 'c1', 'variable_2': 'c2', 'variable_3': 'b3'}, {'variable_1': 'c1', 'variable_2': 'c2', 'variable_3': 'c3'}]
Ajax1234
  • 69,937
  • 8
  • 61
  • 102
0

You'd still need itertools.product for this:

a = {'v1': [1,2], 'v2': [3,4]}
gen = (dict(zip(a.keys(), x)) for x in itertools.product(*a.values()))
>>> list(gen)
[{'v1': 1, 'v2': 3}, {'v1': 1, 'v2': 4}, {'v1': 2, 'v2': 3}, {'v1': 2, 'v2': 4}]

You can read the generator as follows from right to left:

  • make a Cartesian product from all of the lists inside of the dictionary (the * is called the splat operator):
>>> list(itertools.product(*a.values()))
[(1, 3), (1, 4), (2, 3), (2, 4)]
  • then use zip to make pairs of the keys and the result of the product.
>>> zip(['v1', 'v2'], [1, 3])
[('v1', 1), ('v2', 3)]
  • finally build a dictionary again from that pair
>>> dict([('v1', 1), ('v2', 3)])
{'v1': 1, 'v2': 3}
Lanting
  • 3,060
  • 12
  • 28