Offline analysis of HDFS metadata

Introduction HDFS is part of the core Hadoop ecosystem and serves as a storage layer for the Hadoop computational frameworks like Spark, MapReduce. Like other distributed file systems, HDFS is based on an architecture where namespace is decoupled from the data. The namespace contains┬áthe file system metadata which is maintained by dedicated server called namenode … Continue reading Offline analysis of HDFS metadata


Using Tiered Storage in Alluxio

Introduction Alluxio is an open source memory speed virtual distributed storage system. An brief overview of Alluxio has been covered in a previous blog. This post will cover one of the most powerful features of Alluxio, which is its tiered storage capabilities. Tiered storage allows the Alluxio volume to be expended outside of just memory. … Continue reading Using Tiered Storage in Alluxio