ARCHIVED from builddistributedsystem.com on 2026-04-28 — URL: https://builddistributedsystem.com/tracks/filesystem

The Filesystem

Advanced
Storage | 10 tasks

GFS and HDFS showed the world how to store petabytes across thousands of cheap machines. Build a tiny distributed filesystem with chunk servers, replication, and master failover.

Subtracks & Tasks

Concepts Covered

GFS architecture, master node, chunk server, 64MB chunks, replication factor, namespace tree, directory hierarchy, chunk mapping, metadata, WAL-backed, chunk allocation, placement policy, rack awareness, primary assignment, chunk replication, pipeline writes, primary-secondary, write acknowledgement, data flow, lease, primary election, lease renewal, lease expiry, consistency window, heartbeat, chunk server monitoring, liveness detection, chunk inventory, re-replication, under-replicated chunks, failure recovery, load balancing, chunk migration, disk utilization, rebalancing threshold, master failover, shadow master, WAL replay, hot standby, failover time, checksum, data integrity, corruption detection, per-block checksum, silent corruption

Prerequisites

Complete the previous tracks before starting this one. Concepts build progressively throughout the curriculum.