I use beegfs at work for the redundancy and clustering aspect. 1.8PB of storage with 100% redundancy.
While it supports a lot and CAN be quite involved, a very basic setup is in fact pretty simple:
A filesystem on a machine is a storage target.
A machine with storage targets is a storage node. (beegfs-storage)
A management server (beegfs-mgmtd) connects these together into a filesystem.
Any machine runs beegfs-client to mount this filesystem.
One machine needs to run beegfs_meta for the Metadata. It doesn’t require a lot.






Circular dependency issue