HStreaming Community Edition is a real-time data analytics platform to analyze, process, store and archive streaming data on Hadoop. HStreaming Community Edition is compatible with all major Hadoop distributions including Apache Hadoop, Cloudera, MapR, Amazon EMR, Hortonworks, EMC, and IBM.
HStreaming Community Edition is a great way to explore and start working with real time data on Hadoop either using the high-level Pig language or native MapReduce. HStreaming Community Edition does not require any software installation on a Hadoop cluster and thus can be used from any development machine or desktop.
HStreaming Community Edition comes pre-packaged for Ubuntu/Debian and Redhat Linux for Cloudera CDH3 and MapR. HStreaming is also available as a downloadable tar archive.
HStreaming Community Edition is free.
HStreaming Community Edition allows to run real-time analytics processes using the native MapReduce API or HStreaming's stream-enhanced version of Apache Pig. Community Edition includes a visualization connector which allows to generate simple web-based visualization from within a Pig query, Jobtracker UI enhancements displaying streaming job parameters, HStreaming command line shell and a variety of stream connectors.
HStreaming Community Edition can run multiple real-time jobs concurrently. The number of attempts per map or reduce task is is limited to 1.
HStreaming Community Edition can be downloaded from our download section.