Backend framework comparison

An overview of data processing technologies and ecosystems that might be interesting for us.

  • Seems to be the current favorite. Everyone seems to recommend it over hadoop.

  • Has model for both streaming and batch (map-reduce)

  • Supports explorative queries. Spark SQL. Designed to support ML algorithms.

  • Supported on Amazon straight off the box (Elastic mapreduce)

  • Very strong community

  • No ruby. Scala or Java, and they don't seem to have a plan for JRuby

  • Has beautiful support for elasticsearch

  • Data collection framework made for logs.

  • can split data into several endpoints, one being hdfs

  • In memory aggregations?

  • Complex event processing with JRuby, including SQL queries of streams

  • No concept of streams

  • Old. Familiar. Mature.

Tutorials

Last updated