Cloudera and Others Rally Behind Hadoop Challenger Spark

Folks in the Big Data and Hadoop communities are becoming increasingly interested in Apache Spark, an open source data analytics cluster computing framework originally developed in the AMPLab at UC Berkeley. We’ve covered Spark before, and some reports are characterizing it as a tool that could supplant Hadoop in many enterprises.

According to Apache, Spark can run programs up to 100 times faster than Hadoop MapReduce in memory, and ten times faster on disk. When crunching large data sets, those are big performance differences.

RELATED ARTICLESMORE FROM AUTHOR

Using OpenTelemetry and the OTel Collector for Logs, Metrics, and Traces

Xen 4.19 is released

Advancing Xen on RISC-V: key updates

AI Produces Data-driven OpenFOAM Speedup (HPC Wire)

Delivering Prime Training Deals – 2 DAYS ONLY

RELATED ARTICLES MORE FROM AUTHOR