Dependencies between failures in operational networks may have a huge impact on their reliability and availability. In this paper we analyze failure logs to identify simultaneous and potentially correlated failures in routers and links of an IP backbone network. We show that the actual behavior of failure processes does not support the independence assumption commonly used in theoretical studies. Scatter plots are presented to visualize the failure processes, and it is seen that geographical adjacency has a pronounced effect. The existence of high correlation coefficients and high autocorrelation in some failure processes was observed. A formal analysis confirms this. The consequences of these dependencies on the provisioning of guaranteed availability are briefly discussed.
|Title of host publication||Proceedings 2010 IEEE 16th Pacific Rim International Symposium on Dependable Computing (PRDC 2010)|
|Subtitle of host publication||Tokyo, Japan, 13-15 Dec. 2010|
|Place of Publication||Los Alamitos, CA, USA|
|Publisher||IEEE Institute of Electrical and Electronic Engineers|
|ISBN (Print)||978-1-4244-8975-6, 978-0-7695-4289-8|
|Publication status||Published - 2010|
|MoE publication type||A4 Article in a conference publication|