How AWS Glue Data Quality Helps You Achieve Compliance For Your Data Lake With Confidence
The "recent" creation of data lakes by thousands of Organizations also created a…
An Introduction to Delta Lake: The Open-Source Storage Layer for Big Data
Delta Lake is an open-source storage framework that enables building a Lakehouse architecture with compute…
Streamline your Data Transformations by Running dbt Directly on Databricks using Jobs
Running dbt (data build tool) on Databricks is a great alternative to dbt Cloud if…
Boost the Performance of Your Databricks Jobs and Queries
Databricks is doing a lot of optimization and caching by default to have jobs and…
Capture Data History With SCD2 Using Databricks Delta Live Tables
Delta Live Tables is a great way to build and manage reliable batch and streaming…
Data+AI Summit 2022 - Top Announcements and Recap
Data+AI Summit 2022 [https://databricks.com/dataaisummit/] is the world’s largest gathering among…
Transform Data in your Warehouse using dbt, Airflow, and Redshift
Data Build Tool [https://www.getdbt.com/] (better and simply known as "dbt"…
Sync Two S3 Buckets Using CDK and a Lambda Layer Containing the AWS CLI
The AWS Command Line Interface (CLI) [https://aws.amazon.com/cli/] is a great tool…
Process CSVs from Amazon S3 using Apache Flink, JHipster, and Kubernetes
Apache Flink [https://flink.apache.org/] is one of the latest distributed Big Data frameworks…
Use Stargate by DataStax to effortlessly store and query your data
Stargate [https://stargate.io/] is one of the latest shiny tools from DataStax [https://www.…
Saving and Analyzing Trending Topics on Twitter using AWS Athena, Lambda, and CDK
With more than 300 million active users, Twitter is still one of the more optimal…
Databricks recently announced the release of Apache Spark 3.0 [https://databricks.com/blog/2020/…
Build an event sourcing system on AWS using DynamoDB and CDK
Over the past few years, event sourcing has become a popular pattern used in modern…
AWS Cognito and JHipster for the LOVE of OAuth 2.0
OAuth 2.0 [https://oauth.net/2/] is a stateful security mechanism. OpenID Connect (OIDC)…
Deploying a JHipster app to AWS using Elastic Beanstalk
JHipster [https://www.jhipster.tech/] is a great development platform to help you bootstrap a…