News
From Molecular Modeling Wiki
(Difference between revisions)
Line 1: | Line 1: | ||
=== 2008-09-12 === | === 2008-09-12 === | ||
* after a server power source failure the cluster '''radon''' is up again; the power source has been replaced; | * after a server power source failure the cluster '''radon''' is up again; the power source has been replaced; | ||
+ | * some nodes in the '''lithium''' cluster are dead and some others have been repaired | ||
+ | ** '''l15''' - I/O error - does not reappear after fsck and restart - open | ||
+ | ** '''l22''' - blinking alert LED - solved, open | ||
+ | ** '''l23''' - blinking alert LED - solved, open | ||
+ | ** '''l24''' - blinking alert LED - solved, open | ||
+ | ** '''l25''' - failed memory module - in service for replacement - closed | ||
+ | ** '''l27''' - blinking alert LED - queue closed, waiting for jobs to finish | ||
+ | ** '''l37''' - blinking alert LED - queue closed, waiting for jobs to finish | ||
+ | ** '''l30''' - failed disk - closed, waiting for disk replacement | ||
+ | ** '''l05''' - dead - in service for repair | ||
+ | ** '''l06''' - dead - in service for repair | ||
+ | ** '''l07''' - failed - queue closed, witing for service repair | ||
* after an UPS failure the '''teogate''' server is up again; | * after an UPS failure the '''teogate''' server is up again; |
Revision as of 10:08, 12 September 2008
2008-09-12
- after a server power source failure the cluster radon is up again; the power source has been replaced;
- some nodes in the lithium cluster are dead and some others have been repaired
- l15 - I/O error - does not reappear after fsck and restart - open
- l22 - blinking alert LED - solved, open
- l23 - blinking alert LED - solved, open
- l24 - blinking alert LED - solved, open
- l25 - failed memory module - in service for replacement - closed
- l27 - blinking alert LED - queue closed, waiting for jobs to finish
- l37 - blinking alert LED - queue closed, waiting for jobs to finish
- l30 - failed disk - closed, waiting for disk replacement
- l05 - dead - in service for repair
- l06 - dead - in service for repair
- l07 - failed - queue closed, witing for service repair
- after an UPS failure the teogate server is up again;