Fail-Slow at Scale


Fail-Slow at Scale is a scholarly article co-authored by Riza O. Suminto and published in 2018 in ''ACM Transactions on Storage''. Its main subjects include computer hardware, root cause, computer security, cluster, mode, scale, fault tolerance, clustered file system, embedded system, operating system, and computer science. The authors present a study of 114 reports of fail-slow hardware incidents collected from large-scale cluster deployments at 14 institutions.

Related Works