Replace column values in large Pandas dataframe

Question

I have a large input file of numerical data (22000 columns), and at the moment when I use
df = pd.read_csv(path_to_file), it uses the first line of numbers as the column names.

Is there any way to replace the column names with arbitrary values, or to load the data so that the first line is not used as the column names?

Does this answer your question? [Pandas read in table without headers](https://stackoverflow.com/questions/29287224/pandas-read-in-table-without-headers) — talatccan, Mar 07 '20 at 11:30

score 0 · Answer 1 · answered Mar 07 '20 at 11:32

Use pd.read_csv("path_to_file", header=None), so that the first line is read as data rather than as column names.

If you also want to assign names to the columns, you can pass a list to the names parameter of pd.read_csv.
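A minimal sketch of the names parameter, assuming a small in-memory CSV in place of the real file. The sample data and the column names "a", "b", "c" are made up for illustration; with 22000 columns you would generate the list instead of typing it.

```python
import io

import pandas as pd

# Hypothetical stand-in for the real file: three rows of numbers, no header line.
csv_text = "1,2,3\n4,5,6\n7,8,9\n"

# Hypothetical column names, one per column in the file.
column_names = ["a", "b", "c"]

# header=None tells pandas there is no header line; names supplies the labels.
df_named = pd.read_csv(io.StringIO(csv_text), header=None, names=column_names)

print(df_named.shape)          # all three lines are kept as data rows
print(list(df_named.columns))
```

The list passed to names should have one entry per column in the file.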

peterhunter

MarianD · Accepted Answer · 2020-03-07T12:03:56.103

Use the parameter header=None:

df = pd.read_csv(path_to_file, header=None)

Then the column names will be 0, 1, 2, ..., 21999, and all rows in your CSV file will be treated as data rows.
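To see the difference on a small scale, here is a sketch that reads the same text with and without header=None. The two-line sample is invented for illustration and stands in for the real file.

```python
import io

import pandas as pd

# Hypothetical stand-in for the real file: two lines of numbers, no header line.
csv_text = "1,2,3\n4,5,6\n"

# Default behavior: the first line is consumed as the column names.
df_default = pd.read_csv(io.StringIO(csv_text))

# With header=None every line is data and the columns are numbered from 0.
df_no_header = pd.read_csv(io.StringIO(csv_text), header=None)

print(df_default.shape)            # one data row left after the header is taken
print(df_no_header.shape)          # both lines kept as data rows
print(list(df_no_header.columns))  # integer labels starting at 0
```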

If you are not satisfied with those automatically assigned column names, you may change them as in this answer to the question “How to name Pandas Dataframe Columns automatically?”
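One possible way to do the renaming, which is not necessarily the approach taken in the linked answer, is to assign a generated list to df.columns. In this sketch the sample data and the "col_" prefix are assumptions chosen for illustration.

```python
import io

import pandas as pd

# Hypothetical stand-in for the real file: two lines of numbers, no header line.
csv_text = "1,2,3\n4,5,6\n"
df = pd.read_csv(io.StringIO(csv_text), header=None)

# Build one label per column; "col_" is an arbitrary, made-up prefix.
df.columns = [f"col_{i}" for i in range(df.shape[1])]

print(list(df.columns))
```

Generating the labels from the column count keeps this workable for 22000 columns.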

edited Mar 07 '20 at 12:03

answered Mar 07 '20 at 11:57

MarianD
