Experiences of Using Alluxio with Spark

Introduction Alluxio┬árefers to┬áitself as an "Open Source Memory Speed Virtual Distributed Storage" platform. It sits between the storage and processing framework layers in the distributed computing ecosystem and claims to heavily improve performance when multiple jobs are reading/writing from/to the same data. This post will cover some of the basic features of Alluxio and will … Continue reading Experiences of Using Alluxio with Spark