Link - Fault-tolerant systems: hard to detect things that fail "only sometimes"

published Jul 18, 2008 03:53 by admin (last modified Jul 18, 2008 03:53)
It's not hard to have system B check that system A is on/off line, and step in if the latter is the case. But what happens when A is *mostly* or *sorta* online? Does system B check that ALL functionality done by A is being done appropriately? Almost never. And that's why, even in the best, most carefully designed, fully redundant high-availability systems, you never, ever see 100% uptime. It's just not possible to anticipate everything that can go wrong. So design a system that fails gracefully! That's what nature did.
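The gap between "A is online" and "A is doing its job" can be made concrete with a small sketch. This is only an illustration under stated assumptions, not how any real monitoring product works: `classify`, `FlakyServiceA` and `echo_round_trip` are hypothetical names invented here, and the "drops every third message" fault is a stand-in for the kind of partial failure the quote describes. System B first runs a shallow liveness probe, then a set of functional probes, and reports three states instead of two.

```python
from typing import Callable, Dict, List, Tuple

Probe = Callable[[], bool]


def _safe(probe: Probe) -> bool:
    """Run a probe; treat any exception as a failed probe."""
    try:
        return bool(probe())
    except Exception:
        return False


def classify(ping: Probe, functional_probes: Dict[str, Probe]) -> Tuple[str, List[str]]:
    """Return ("down" | "degraded" | "up", names of failed functional probes).

    "down":     the liveness probe failed (functional probes are not run).
    "degraded": A answers the ping but at least one functional probe failed.
    "up":       every probe we thought to write has passed.
    """
    if not _safe(ping):
        return "down", []
    failed = [name for name, probe in functional_probes.items() if not _safe(probe)]
    return ("degraded" if failed else "up"), failed


class FlakyServiceA:
    """Hypothetical system A: always answers pings, silently drops every third message."""

    def __init__(self) -> None:
        self._count = 0

    def ping(self) -> bool:
        return True

    def echo(self, msg: str):
        self._count += 1
        return None if self._count % 3 == 0 else msg


def echo_round_trip(service: FlakyServiceA, attempts: int = 10) -> bool:
    """Functional probe: every message sent must come back unchanged."""
    return all(service.echo(f"probe-{i}") == f"probe-{i}" for i in range(attempts))


a = FlakyServiceA()
print(classify(a.ping, {"echo": lambda: echo_round_trip(a)}))
# -> ('degraded', ['echo'])
```

A ping-only monitor would call this service healthy. Note that "up" here only means that the probes someone thought to write have passed, which is exactly the quote's point: the list of functional probes is never complete, so the "degraded" state is worth handling gracefully rather than trying to rule it out.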



Read more: Dublin Air Traffic Control Brought Down By Faulty NIC