Data

MongoDB and Apache Spark - Getting started tutorial

Data Science

MongoDB and Apache Spark are two popular Big Data technologies. In my previous post [https:…

May 03, 2017 6 min read

Raphael Brugier

Apache Spark

Basics of Apache Nifi: 1

In our previous article on Nifi [https://test-ippon.ghost.io/why-nifi-2], we discussed the history,…

Apr 25, 2017 1 min read

Malcolm Thirus

Apache Nifi

Why is an Address NOT a (series of) Strings?

In my article Why an Address is NOT a String [https://test-ippon.ghost.io/why-address-not-string/…

Apr 11, 2017 4 min read

Streaming With Scala: The Nuance of Real-Time Twitter Data

At Ippon Technologies USA, we're lucky enough to have "Coding Dojos"…

Mar 08, 2017 3 min read

Pokemon GO: A Big Data Learning Opportunity

LifeAtIppon

Nick Peterson and Justin Risch have begun to study Big Data, Spark, Hadoop, and the…

Feb 16, 2017 6 min read

Kafka Streams - Scaling up or down

Why NiFi?

In this day and age we are living in, it is not a luxury to…

Jan 26, 2017 4 min read

Doug Mengistu

Apache Nifi

Kafka Streams is a new component of the Kafka platform. It is a lightweight library…

Oct 06, 2016 6 min read

Spark - Calling Scala code from PySpark

In a previous post [https://test-ippon.ghost.io/spark-kafka-achieving-zero-data-loss/], I demonstrated how to consume a…

Sep 12, 2016 4 min read

Apache Spark Datasets

With a Spark 2.0 release imminent [https://databricks.com/blog/2016/05/11/spark-2-0-technical-preview-easier-faster-and-smarter.…

Jun 15, 2016 4 min read

Malcolm Thirus

Spark & Kafka - Achieving zero data-loss

Kafka and Spark Streaming are two technologies that fit well together. Both are distributed systems…

May 12, 2016 9 min read

A tour of Databricks Community Edition: a hosted Spark service

With the recent announcement [https://databricks.com/blog/2016/02/17/introducing-databricks-community-edition-apache-spark-for-all.html] of the…

Apr 13, 2016 6 min read

Raphael Brugier

Apache Spark

Kafka, Spark and Avro - Part 3 of 3, Producing and consuming Avro messages

This post is the third and last post in a series in which we learn…

Apr 06, 2016 6 min read

Testing strategy for Spark Streaming - Part 2 of 2

In a previous post [https://test-ippon.ghost.io/testing-strategy-apache-spark-jobs/], we’ve seen why it’s…

Mar 30, 2016 6 min read

Raphael Brugier

Apache Spark

Kafka, Spark and Avro - Part 2 of 3, Consuming Kafka messages with Spark

This post is the second post in a series in which we will learn how…

Mar 22, 2016 4 min read