Coalesce 2022 - The Analytics Engineering Conference hosted by dbt Labs (Recap)
Coalesce 2022 is dbt Labs' analytics engineering conference. For its third year, from October…
Data+AI Summit 2022 - Top Announcements and Recap
Data+AI Summit 2022 [https://databricks.com/dataaisummit/] is the world’s largest gathering among…
A Primer on Snowflake Stored Procedures
Snowflake is a data warehouse-as-a-service hosted completely in the cloud. For a Snowflake Primer, take…
Data Basics for Life-Long Software Engineers
Having recently made the switch from software to data engineering, I learned there are many…
Event-Driven Architecture: Getting Started with Kafka (Part 2)
Event-driven architecture is a paradigm increasingly used in modern microservices-based systems. It promises an architecture that is more flexible and more responsive to business events, with better technical decoupling. Let's see how we can build it with Kafka.…
Event-Driven Architecture: Getting Started with Kafka (Part 1)
Event-driven architecture is a paradigm increasingly used in modern microservices-based systems. It promises an architecture that is more flexible and more responsive to business events, with better technical decoupling. Let's see how we can build it with Kafka.…
A Beginner’s Guide to InfluxDB: A Time-Series Database
A time series database (TSDB) is specifically made for data that can be evaluated as…
Is the Holiday Spirit Contagious?
During Ippon's first Data Hackathon in December 2020,…
Process CSVs from Amazon S3 using Apache Flink, JHipster, and Kubernetes
Apache Flink [https://flink.apache.org/] is one of the latest distributed Big Data frameworks…
Use Stargate by DataStax to effortlessly store and query your data
Stargate [https://stargate.io/] is one of the latest shiny tools from DataStax [https://www.…
Tips and Tricks for Manually Scaling a Global DynamoDB Table from an AWS Lambda
Objective: Write an AWS Lambda that manually scales a global DynamoDB table. Why? DynamoDB tables…
This is the finale of a 3-part series introducing a Data Quality Management (DQM) framework…
This blog is a part of a series of posts on Data Quality Management. The…
Saving and Analyzing Trending Topics on Twitter using AWS Athena, Lambda, and CDK
With more than 300 million active users, Twitter is still one of the more optimal…
Starting with AWS Glue and Querying S3 from Athena
Part one of three in a deep dive into ETL in AWS Glue. Learn how to create powerful low-code/no-code ETL processes from S3 to many data sources in AWS.…