1

Hi I'm looking for some help here. I am new to Azure Data Factory and I need to convert some Excel files to csv, how can I do this?

rbrayb
  • 46,440
  • 34
  • 114
  • 174
db_noob
  • 119
  • 2
  • 11
  • 1
    There is no direct support of Excel in ADF. Please see below for more details.https://stackoverflow.com/questions/52514153/how-to-read-files-with-xlsx-and-xls-extension-in-azure-data-factory – AnshuBhola Apr 16 '20 at 07:48

2 Answers2

2

As i know,Excel file is not supported by ADF so far,only these formats are supported.

So,you could try some workarounds to process excel files in ADF:

1.Use Databricks Activity to load excel files and convert into csv files,please refer to this case:How to construct Dataframe from a Excel (xls,xlsx) file in Scala Spark?

2.Use Azure Function Activity to convert excel files into csv files,you could choose development language to implement that. Then output the csv files into specific path and use them in the next steps.

3.Try Custom Activity which is actually tasks(for example,execute a Powershell Script) running on the Azure Batch Node.This approach is more flexible and easy to operate.

Jay Gong
  • 23,163
  • 2
  • 27
  • 32
1

As of 2022 .xlsx and .xls files are supported as source but not as sink.

This means that we can use a copy activity in Azure Synapse / Azure Data Factory to extract the data from a specific sheet of the xlsx file (=source) and copy it to a csv file (=sink).

Cribber
  • 2,513
  • 2
  • 21
  • 60