Incidents Rezopole Communication Rezopole

Network Operations Center : FranceIX Lyon


Boucle sur le vlan de peering - loop on peering vlan
Beginning 2012-09-14 14:26:38
Ending 2012-09-25 11:00:00
Status Terminé
Impacts ix / lyonix
French Description MAJ : Nous avons effectué le changement de la pièce défectueuse (compact flash) sans impact sur le service.

-----------------
MAJ : Le support Telindus nous demande d'effectuer des tests sur le 6500 de L2 afin d'identifier les pieces défectueuses.
Un bagotement d'une minute est donc a prévoir demain (Jeudi 20 Sept 2012) a 6h00.

-----------------

Il semblerait que le problème de cet après midi vienne finalement d'un problème de flash sur le 6500 de L2.

A chaque commande dir (pour lister la flash, via des outils de supervision), nous avions un freeze toutes les 30 min, respectivement à 25 et 55.

Le problème est que cette commande génère des erreurs de ce type:
Sep 14 18:26:24.188 CEST: %SYS-SP-3-CPUHOG: Task is running for
(2000)msecs, more than (2000)msecs (15/10),process = RFSS_server_action.
-Traceback= 404649C8 40466DD0 40466CDC 40486CB4 40486594 40486B78
404897D0 40491848 40491D0C 40492898 4048D6C8 40492AE8 40488F1C 402E66D4
40458CCC 4029C954
Sep 14 18:26:28.200 CEST: %SYS-SP-3-CPUHOG: Task is running for
(2000)msecs, more than (2000)msecs (7/0),process = RFSS_server_action.
-Traceback= 40465DE4 40466E14 40466CDC 40486CB4 40486594 40486B78
404897D0 40491848 40491D0C 40492898 4048D6C8 40492AE8 40488F1C 402E66D4
40458CCC 4029C954
Sep 14 18:26:32.204 CEST: %SYS-SP-3-CPUHOG: Task is running for
(2000)msecs, more than (2000)msecs (21/13),process = RFSS_server_action.
-Traceback= 404649C8 40466DD0 40466CDC 40486CB4 40486594 40486B78
404897D0 40491848 40491D0C 40492898 4048D6C8 40492AE8 40492E44 40489034
402E6908 40458DF8

Plusieurs forums proposent de formater la sup-bootdisk flash.
https://supportforums.cisco.com/thread/2034573

Nous avons ouvert un ticket chez Telindus pour avoir la marche à suivre.
English Description Update : We have change the defective part (compact flash) without any interruption of service.

-----------------
Update : The support of Telindus has asked us to make some test on the 6500 of L2 in order to identify the defectives parts. A short interruption of service of one minute is planned tomorrow morning (Thursday 20 Sept 2012) at 6:00AM CEST.

-----------------

It seems that the problem this afternoon finally came a flash problem on 6500 L2.

Each dir command (to list the flash via monitoring tools), we had a freeze every 30 min, respectively 25 and 55.

The problem is that this command generates errors like this:
Sep 14 18:26:24.188 CEST: %SYS-SP-3-CPUHOG: Task is running for
(2000)msecs, more than (2000)msecs (15/10),process = RFSS_server_action.
-Traceback= 404649C8 40466DD0 40466CDC 40486CB4 40486594 40486B78
404897D0 40491848 40491D0C 40492898 4048D6C8 40492AE8 40488F1C 402E66D4
40458CCC 4029C954
Sep 14 18:26:28.200 CEST: %SYS-SP-3-CPUHOG: Task is running for
(2000)msecs, more than (2000)msecs (7/0),process = RFSS_server_action.
-Traceback= 40465DE4 40466E14 40466CDC 40486CB4 40486594 40486B78
404897D0 40491848 40491D0C 40492898 4048D6C8 40492AE8 40488F1C 402E66D4
40458CCC 4029C954
Sep 14 18:26:32.204 CEST: %SYS-SP-3-CPUHOG: Task is running for
(2000)msecs, more than (2000)msecs (21/13),process = RFSS_server_action.
-Traceback= 404649C8 40466DD0 40466CDC 40486CB4 40486594 40486B78
404897D0 40491848 40491D0C 40492898 4048D6C8 40492AE8 40492E44 40489034
402E6908 40458DF8

Many forums propose to format the flash.
https://supportforums.cisco.com/thread/2034573

We opened a ticket at Telindus support.