0

I am new to Databricks. I am reading Microsoft documentation on data lakehouse. In the documentation they make reference to delta lake without explaining what the difference is or even if there is any. Can someone please help explain this to me. Any help would be greatly appreciated.

Jay2454643
  • 15
  • 4

1 Answers1

0

The Lakehouse is a paradigm coined by this paper, while Delta Lake is a technology that can be used to create a Lakehouse-like Data platform.

Alternatives to the Lakehouse are:

  • Basic Data Lakes
  • Data Warehouses

Alternatives to Delta Lake are:

  • Apache Iceberg
  • Apache Hudi

I hope this makes it a little bit more clear.

Robert Kossendey
  • 6,733
  • 2
  • 12
  • 42
  • Okay I think I get it. So data lakehouse is the concept and delta lake is the technology used by Databricks similar to how data lake is the concept and Azure Data Lake is the technology used by Azure or Data warehouse is the concept and Amazon redshift is the technology used. Would this be a correct way of looking at it?. – Jay2454643 Aug 25 '23 at 11:07
  • This is exactly it! :) – Robert Kossendey Aug 25 '23 at 11:40