I can create external tables with CETAS in Azure Blob Storage fairly easily (link). However, is there a simple way to automatically partition the output by CustomerID, so that the rows for each customer end up in their own folder?
Essentially I'm looking for a CETAS parameter that will cause my data to end up in folders like this:
Azure Blob Storage Container/
    CustomerTable/
        CustomerID=1/
            .parquet files for Customer 1
        CustomerID=2/
            .parquet files for Customer 2
        ...
        CustomerID=N/
            .parquet files for Customer N
(Note: I'm running around 10,000 CETAS statements on 100+ TB of data. I imagine I could change my script to look for tables that have a CustomerID column and manually partition the data by CustomerID, but I was hoping there was some CETAS syntax that would have the system partition the data for me.)
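For reference, the manual fallback I have in mind would look roughly like the sketch below: generate one CETAS statement per customer, each with a LOCATION that points at that customer's folder. This is only a sketch under assumptions. The helper name `build_partitioned_cetas`, the `dbo` schema, the external data source `MyBlobDataSource`, and the file format `MyParquetFormat` are hypothetical placeholders, and CustomerID is assumed to be an integer.

```python
def build_partitioned_cetas(table, customer_ids,
                            data_source="MyBlobDataSource",   # hypothetical name
                            file_format="MyParquetFormat"):   # hypothetical name
    """Return one CETAS statement per customer ID.

    Each statement writes that customer's rows under
    <table>/CustomerID=<id>/ in the external data source.
    """
    statements = []
    for cid in customer_ids:
        cid = int(cid)  # assumes integer IDs; also keeps the value safe to inline
        statements.append(
            f"CREATE EXTERNAL TABLE [dbo].[{table}_CustomerID_{cid}]\n"
            f"WITH (\n"
            f"    LOCATION = '{table}/CustomerID={cid}/',\n"
            f"    DATA_SOURCE = {data_source},\n"
            f"    FILE_FORMAT = {file_format}\n"
            f")\n"
            f"AS SELECT * FROM [dbo].[{table}] WHERE CustomerID = {cid};"
        )
    return statements


if __name__ == "__main__":
    # Print the generated statements for two example customers.
    for stmt in build_partitioned_cetas("CustomerTable", [1, 2]):
        print(stmt)
        print()
```

The obvious drawback is that this creates one external table and one scan of the source table per customer, which is exactly the overhead I'd like to avoid across 10,000 tables.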