Backend framework comparison

An overview of data processing technologies and ecosystems that might be interesting for us.

  • Seems to be the current favorite. Everyone seems to recommend it over hadoop.

  • Has model for both streaming and batch (map-reduce)

  • Supports explorative queries. Spark SQL. Designed to support ML algorithms.

  • Supported on Amazon straight off the box (Elastic mapreduce)

  • Very strong community

  • No ruby. Scala or Java, and they don't seem to have a plan for JRuby

  • Has beautiful supportarrow-up-right for elasticsearch

  • No concept of streams

  • Old. Familiar. Mature.

Tutorials

Last updated