Total 7 Posts

Data Streaming

Process CSVs from Amazon S3 using Apache Flink, JHipster, and Kubernetes

Apache Flink [https://flink.apache.org/] is one of the latest distributed Big Data frameworks…

Read More


Feb 04, 2021 6 min read

Theo LEBRUN

Data Streaming

Confluent & Twitter4j Tutorial

Reading a Real-Time stream of Tweets into Kafka Kafka is an amazing tool for processing…

Read More


Jun 07, 2019 6 min read

Justin Risch

Data

Transient Cluster on AWS

This post demonstrates a cost-effective and automated solution for running Spark-Jobs on the EMR cluster on a daily basis using CloudWatch, Lambda, EMR, S3, and SNS.…

Read More


Jun 03, 2019 6 min read

Sripriya Rajanna

Apache Spark

Basics of Apache Nifi: 2

On our previous video on the basics of Nifi [https://test-ippon.ghost.io/basics-of-apache-nifi], we…

Read More


Nov 15, 2017 1 min read

Malcolm Thirus

Apache Nifi

Performance Tweaking Apache Spark

Apache Spark Streaming applications need to be monitored frequently to be certain that they are…

Read More


Jun 26, 2017 5 min read

Jeannine Stark

Data Streaming

Basics of Apache Nifi: 1

In our previous article on Nifi [https://test-ippon.ghost.io/why-nifi-2], we discussed the history,…

Read More


Apr 25, 2017 1 min read

Malcolm Thirus

Apache Nifi

Why NiFi?

In this day and age we are living in, it is not a luxury to…

Read More


Jan 26, 2017 4 min read

Doug Mengistu

Apache Nifi