Pure Pandas approach to converting data in a text file into a table

Question

I am looking to convert data in a textile into a table (data frame) using just methods from Pandas.

Textfile

Table/Dataframe format

    0  1  2  3  4
0   0  0  1  0  0
1   1  1  1  1  0
2   1  0  1  1  0
3   1  0  1  1  1
4   1  0  1  0  1
5   0  1  1  1  1
6   0  0  1  1  1
7   1  1  1  0  0
8   1  0  0  0  0
9   1  1  0  0  1
10  0  0  0  1  0
11  0  1  0  1  0

My approach

The only way I could think of doing it was to use some Python code to read the file into a 2D list of characters and then convert that to a data frame:

with open("data.txt") as f:
        # Removes newline character and splits binary string into individual character bits
        binary = [list(line.strip()) for line in f]

df = pd.DataFrame(binary, dtype="object")  # 2D list into pd dataframe

Although this works, I would like to know if this could have been done using Pandas with the read_csv() method

You can look at this topic. [Load data from .txt with Pandas](https://stackoverflow.com/questions/21546739/load-data-from-txt-with-pandas) I think it might answer your question. — , Jun 27 '22 at 15:18

René · Accepted Answer · 2022-06-28T09:11:12.833

3

This should work in your case:

df = pd.read_fwf('untitled.txt', widths=[1,1,1,1,1], header=None)
print(df)

Result:

    0  1  2  3  4
0   0  0  1  0  0
1   1  1  1  1  0
2   1  0  1  1  0
3   1  0  1  1  1
4   1  0  1  0  1
5   0  1  1  1  1
6   0  0  1  1  1
7   1  1  1  0  0
8   1  0  0  0  0
9   1  1  0  0  1
10  0  0  0  1  0
11  0  1  0  1  0

edited Jun 28 '22 at 09:11

answered Jun 27 '22 at 15:28

René

4,594
5
23
52

1

Thanks for pointing that out. Just fixed the code and added "header=None". – René Jun 28 '22 at 04:18
1

And if I wanted to generalise to n columns, I would just do `df = pd.read_fwf('untitled.txt', widths=[1]*num_cols, header=None)`. – Suraj Kothari Jun 28 '22 at 12:32
1

Thanks for showing a one-liner approach. Pandas genuinely seems to have methods that do nearly everything :) – Suraj Kothari Jun 28 '22 at 12:36

Abhishek · Answer 2 · 2022-06-27T15:34:10.313

1

Below code can help. Will also work with txt file

df = pd.read_csv('Book2.csv',header=None, dtype='str') #read file
df = df[0].astype('str').str.split('',expand=True) #split column
df[df.columns[1:-1]] #print df after removing first & last empty column

Output will look like this

edited Jun 27 '22 at 15:34

answered Jun 27 '22 at 15:28

Abhishek

1,585
2
12
15

Nice, `df[df.columns[1:-1]]` can be `df.iloc[:, 1:-1]` – creanion Jun 27 '22 at 15:31
Yeah, it will be more appropriate. Will use this syntax in future. Thank you. – Abhishek Jun 27 '22 at 15:32
Why are you dropping the first and last columns? Also, I noticed that starts the index from 1, whereas I wanted it from 0, which needs a couple more lines of code to get to the desired output. @rene showed that read_fwf does all this in one line :) – Suraj Kothari Jun 28 '22 at 12:35

Pure Pandas approach to converting data in a text file into a table

Textfile

Table/Dataframe format

My approach

2 Answers2