
I'm trying to read the lines of a file into a list so that every N lines end up in the same tuple. Assuming the file is valid, so the number of lines is a multiple of N, how can I achieve it?

The way I read the lines into the list:

def readFileIntoAList(file, N):
    with open(file) as f:
        lines = [line.rstrip('\n') for line in f]
    return lines

What change do I have to make, using N, so that the result is a list of tuples, each of length N? For example, given the following file content:

ABC
abc xyz
123
XYZ
xyz abc
321

The output will be:

[("ABC","abc xyz","123"),("XYZ,"xyz abc",321")]
vesii
    Possible duplicate of [How do you split a list into evenly sized chunks?](https://stackoverflow.com/questions/312443/how-do-you-split-a-list-into-evenly-sized-chunks) – Mateen Ulhaq Sep 22 '19 at 12:19
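
For reference, the grouping idiom from that linked question maps directly onto this problem. A minimal sketch, assuming (as the question states) that the line count is an exact multiple of N:

def readFileIntoAList(file, N):
    with open(file) as f:
        lines = [line.rstrip('\n') for line in f]
    # zip pulls from the same iterator N times, so consecutive
    # lines land in the same tuple; leftover lines would be dropped
    return list(zip(*[iter(lines)] * N))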

4 Answers


You could try using a chunking function:

def readFileIntoAList(file, n):
    with open(file) as f:
        lines = [line.rstrip('\n') for line in f]
    # slice the list into consecutive chunks of n lines, as tuples
    return [tuple(lines[i:i + n]) for i in range(0, len(lines), n)]

This will split the list of lines in the file into evenly sized chunks.
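
For example, assuming the six-line sample from the question is saved as data.txt (a file name chosen here purely for illustration):

>>> readFileIntoAList("data.txt", 3)
[('ABC', 'abc xyz', '123'), ('XYZ', 'xyz abc', '321')]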

miike3459

One way would be:

>>> data = []
>>> N = 3
>>> with open('/tmp/data') as f:
...     while True:
...         chunk = []
...         for i in range(N):
...             chunk.append(f.readline().strip('\n'))
...         if any(not c for c in chunk):
...             break
...         data.append(tuple(chunk))
...
>>> print(data)
[('ABC', 'abc xyz', '123'), ('XYZ', 'xyz abc', '321')]

Note that this assumes the file has the right number of lines and no empty lines; otherwise the loop above breaks early and silently drops the remaining data. A solution without that risk is:

data = []
N = 3
with open('/tmp/data') as f:
    i = 0
    chunk = []
    for line in f:
        chunk.append(line.strip('\n'))
        i += 1
        if i % N == 0:
            data.append(tuple(chunk))
            chunk = []

Neither of these approaches reads the whole file into memory, which should be more efficient when you process large files.
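
If each chunk is only needed once, the same streaming behaviour can also be packaged as a generator. A minimal sketch, with readFileInChunks being a hypothetical helper name:

def readFileInChunks(file, N):
    # yields one tuple of N stripped lines at a time,
    # never holding more than N lines in memory
    with open(file) as f:
        chunk = []
        for line in f:
            chunk.append(line.rstrip('\n'))
            if len(chunk) == N:
                yield tuple(chunk)
                chunk = []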

urban
    Doesn't answer the question, since OP is looking to chunk by a variable number of lines. – miike3459 Sep 22 '19 at 12:21
  • True... missed that! Fixing - trying to find a way that does not require reading the whole file... – urban Sep 22 '19 at 12:23
    More "pythonic" will be to use [`enumerate()`](https://docs.python.org/3/library/functions.html#enumerate) for indexing instead of manual increment. – Olvin Roght Sep 22 '19 at 13:03

You can use itertools.islice():

from itertools import islice

N = 3  # chunk size
with open("filename") as f:
    lines = []
    chunk = tuple(s.strip() for s in islice(f, N))
    while chunk:
        lines.append(chunk)
        chunk = tuple(s.strip() for s in islice(f, N))

You can also use map() if you prefer a functional style:

chunk = tuple(map(str.strip, islice(f, N)))
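
The surrounding loop can also be collapsed with the two-argument form of iter(), which keeps calling the lambda until it returns the empty tuple used as the sentinel. A sketch of that variant:

from itertools import islice

N = 3
with open("filename") as f:
    lines = [tuple(map(str.strip, chunk))
             for chunk in iter(lambda: tuple(islice(f, N)), ())]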
Olvin Roght
import math

def readFileIntoAList(file, N):
    lines = list()
    with open(file) as f:
        lines1 = [lineNew.rstrip("\n") for lineNew in f]
        # math.ceil keeps the final, shorter chunk when the
        # line count is not an exact multiple of N
        for a in range(math.ceil(len(lines1) / N)):
            lines.append(tuple(lines1[a * N:(a + 1) * N]))
    return lines

I used a loop and tried to keep it simple.

Gökhan