0

I am attempting to combine 3 .xls files - and want to do this in python3.

I need to delete the some rows first then combine them into one .xls , then remove duplicates.(i will later manipulate the data for graphs)

I seek guidance on which modules to use that work in python3 that can achieve this effect.

Any assistance would be greatly appreciated.

Kuzen
  • 950
  • 3
  • 10
  • 15

1 Answers1

2

I seek guidance on which modules to use that work in python3 that can achieve this effect.

Pandas is definitely your best bet.

03 - Lesson: - Creating functions - Reading from EXCEL - Exporting to EXCEL - slice and dice data

04 - Lesson: - Adding/deleting columns - Index operations

http://pandas.pydata.org/pandas-docs/stable/tutorials.html

Here's how to merge the separate files into one sheet: Import multiple excel files into python pandas and concatenate them into one dataframe

Here's how to delete rows: Deleting DataFrame row in Pandas based on column value

Here's how to delete duplicates: Remove rows with duplicate indices (Pandas DataFrame and TimeSeries)

Community
  • 1
  • 1
Charles Clayton
  • 17,005
  • 11
  • 87
  • 120
  • Thats ideal, after a few hours installing it (compile errors ) got exactly what i needed. Cheers for the swift response. – Kuzen Apr 23 '14 at 21:48