Total 6 Posts

Databricks

An Introduction to Delta Lake: The Open-Source Storage Layer for Big Data

Delta Lake is an open-source storage framework that enables building a Lakehouse architecture with compute…


Jul 11, 2023 5 min read

Theo LEBRUN

Data

Streamline your Data Transformations by Running dbt Directly on Databricks using Jobs

Running dbt (data build tool) on Databricks is a great alternative to dbt Cloud if…


Apr 26, 2023 4 min read

Theo LEBRUN

Data

Boost the Performance of Your Databricks Jobs and Queries

Databricks applies many optimizations and caching by default to have jobs and…


Mar 10, 2023 4 min read

Theo LEBRUN

Data

Data+AI Summit 2022 - Top Announcements and Recap

Data+AI Summit 2022 [https://databricks.com/dataaisummit/] is the world’s largest gathering among…


Jul 07, 2022 3 min read

Theo LEBRUN

Data

Apache Spark 3.0

Databricks recently announced the release of Apache Spark 3.0 [https://databricks.com/blog/2020/…


Jun 23, 2020 3 min read

Theo LEBRUN

Apache Spark

A tour of Databricks Community Edition: a hosted Spark service

With the recent announcement [https://databricks.com/blog/2016/02/17/introducing-databricks-community-edition-apache-spark-for-all.html] of the…


Apr 13, 2016 6 min read

Raphael Brugier

Apache Spark