An Introduction to Delta Lake: The Open-Source Storage Layer for Big Data
Delta Lake is an open-source storage framework that enables building a Lakehouse architecture with compute…
Streamline your Data Transformations by Running dbt Directly on Databricks using Jobs
Running dbt (data build tool) on Databricks is a great alternative to dbt Cloud if…
Jumpstarting Your Data Engineering Career: A Beginner's Guide to the Data Stack - Part One: Fivetran
Summary: Do you find yourself lost in an endless sea of data tools? Are you…
Boost the Performance of Your Databricks Jobs and Queries
Databricks does a lot of optimization and caching by default to have jobs and…
Power your Data Pipeline with Matillion Variables
Matillion is a great ETL tool for bringing over data from disparate sources into one…
Starting with AWS Glue and Querying S3 from Athena
Part one of three in a deep dive into ETL in AWS Glue. Learn how to create powerful low-code/no-code ETL processes from S3 to many data sources in AWS.…
Exploring Snowsight: Snowflake's Replacement for SQL Worksheets
In June of 2020, Snowflake announced Snowsight: the upcoming replacement for SQL Worksheets and is…
Snowflake is a cloud-based data warehousing company. They specialize in provisioning on-demand compute and elastic…
Innovative Snowflake Features Part 2: Caching
In the previous blog in this series Innovative Snowflake Features Part 1: Architecture [https://test-ippon.…
Innovative Snowflake Features Part 1: Architecture
Earlier this year, Ippon published an Introduction to Snowflake [https://test-ippon.ghost.io/introduction-to-snowflake/] post…
Snowflake is a native Cloud Relational Database that is a Data Warehouse as a Service…
In our previous video on the basics of NiFi [https://test-ippon.ghost.io/basics-of-apache-nifi], we…
In our previous article on NiFi [https://test-ippon.ghost.io/why-nifi-2], we discussed the history,…