How to create duplicate for each value in a python list given the number of dups I want?

Question

I have this list:

a=[7086, 4914, 1321, 1887, 7060]. Now, I want to create duplicate of each value n-times. Such as:

n=2

a=[7086,7086,4914,4914,1321,1321,7060,7060]

How would I do this best? I tried a loop but it was inefficient.

The question looks opinion-based. If you know several ways to accomplish this, compare them however you want. — StSav012, Sep 14 '22 at 17:02
How inefficient? And what are your real list length and `n`? (Assuming they're much bigger) — Kelly Bundy, Sep 14 '22 at 17:07
Show the loop, otherwise we can't know if our methods could be more efficient... — Ignatius Reilly, Sep 14 '22 at 17:09
@KellyBundy the max possible lenght for n=4. it cannot be larger than that. — titutubs, Sep 14 '22 at 17:09
*n* is max 4 but how big is the list? Does it have 1000s of elements? — DarkKnight, Sep 14 '22 at 17:15
So you don't actually care that much about speed? Otherwise you'd answer the questions and I'd show my solution that's like 25 times faster than the answer you accepted... — Kelly Bundy, Sep 14 '22 at 17:44

dawg · Answer 1 · 2022-09-14T17:33:50.903

The idiomatic way in Python is to use zip and then flatten the resulting list of tuples.

As stated in the docs:

The left-to-right evaluation order of the iterables is guaranteed. This makes possible an idiom for clustering a data series into n-length groups using zip(*[iter(s)]*n).

So in this case:

>>> a=[7086, 4914, 1321, 1887, 7060]
>>> n=2
>>> [e for t in zip(*[a]*n) for e in t]
[7086, 7086, 4914, 4914, 1321, 1321, 1887, 1887, 7060, 7060]

Alternatively, just flatten a constructed list of sublists made [list_item]*n like so:

>>> [e for sl in ([e]*n for e in a) for e in sl]
# same output

Which can then be constructed as a generator which is as efficient as possible here:

>>> it=(e for sl in ([e]*n for e in a) for e in sl)
>>> next(it)
7086
>>> next(it)
7086
>>> next(it)
4914
>>> ...

Barmar · Answer 2 · 2022-09-14T17:23:05.607

0

I suspewct the most efficient way is to pre-allocate the result list, then fill it in with a loop. This way you don't have to allocate lots of temporary lists and repeatedly grow the result list. These memory allocations are likely to be the most expensive part of the code.

result = [0] * (len(a) * n)
for i, x in enumerate(a):
    for j in range(i*n, i*n+n):
        result[j] = x

edited Sep 14 '22 at 17:23

answered Sep 14 '22 at 17:10

Barmar

741,623
53
500
612

Sorry, the initialization was wroing – Barmar Sep 14 '22 at 17:23

Ignatius Reilly · Accepted Answer · 2022-09-14T17:17:03.003

-1

%%timeit
b=[]
for x in a:
    b += [x]*n

with %%timeit, 994 ns ± 8.22 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each).

Thanks to @Barmar for the += recommentdation!

edited Sep 14 '22 at 17:17

answered Sep 14 '22 at 17:14

Ignatius Reilly

1,594
2
6
15

How to create duplicate for each value in a python list given the number of dups I want?

3 Answers3

Linked