这136家$10 亿公司(独角兽)里,在湾区的公司最多,有53家;中国也不少,有26家。 点击这个网页右下角 "Analysis & FAQ" 读作者的分析文章。
这篇文章基于 Etsy 的 Game Day 的概念阐述了如何在 production 环境中引入真实的故障,演习如何快速排查错误,进而改进整个系统,让整个系统变得更稳健。
At a high level, production fault injection should be considered one of many approaches used to gain confidence in the safety and resiliency of a system. Like unit testing, functional testing, and code review, this approach is limited as to which surprising events it can prevent, but it also has benefits, many of which are cultural.
真是艺高人胆大啊!类似的演习还有:Facebook关闭一个数据中心以检验系统稳定性、Stripe 的 "kill -9" 操练等。