
Disco MapReduce is described as 'Disco is a lightweight, open-source framework for distributed computing based on the MapReduce paradigm and written in Python' and is an app in the development category. There are seven alternatives to Disco MapReduce for a variety of platforms, including Linux, Mac, Windows, web-based, and BSD. The best Disco MapReduce alternative is Apache Spark, which is both free and open source. Other great apps like Disco MapReduce are Apache Hadoop, Amazon Kinesis, Apache Flink, and S2.
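For readers unfamiliar with the MapReduce paradigm mentioned above, here is a minimal single-process sketch of its three phases (map, shuffle, reduce) applied to a word count. It is plain Python and does not use Disco's actual API; the function names `map_phase`, `shuffle_phase`, `reduce_phase`, and `word_count` are hypothetical and chosen only for illustration. A real framework would run the map and reduce steps on many machines in parallel.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Emit a (word, 1) pair for every whitespace-separated word in one line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Group emitted values by key, the step a framework performs between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values collected for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence (a cluster would distribute them)."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["the quick brown fox", "the lazy dog", "The end"]
    print(word_count(sample))
```

Because each call to `map_phase` depends only on its own line, and each call to `reduce_phase` only on its own key, both steps can be spread across workers without coordination, which is what makes the model suitable for distributed computing.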
Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.

Apache Hadoop is an open-source software framework, licensed under the Apache v2 license, that supports data-intensive distributed applications. It enables applications to work with thousands of computationally independent computers and petabytes of data.

Amazon Kinesis services make it easy to work with real-time streaming data in the AWS cloud.



Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams.


Object storage has been nothing short of revolutionary. S3 broke ground in 2006 with simple storage operations on named objects – and 18 years later, S3 Express One Zone even allows appends. But ultimately, object storage is all about blobs and byte ranges.

HPCC Systems offers an open-source cluster computing platform used to solve Big Data problems. Its unique architecture and simple yet powerful data programming language (ECL) make it a compelling solution for data-intensive computing needs.

dispy is a Python framework for parallel execution of computations by distributing them across multiple processors on a single machine (SMP) or among many machines in a cluster or grid. dispy is well suited for the data-parallel (SIMD) paradigm, where a computation is evaluated with...