Computing Systems

Informatics Computing Staff jottings

SAN Disk failure

Virtually all of Informatics' storage is provided via our redundant Storage Area Network (SAN). All our arrays are configured as RAID level 1, 5 or 10, meaning that if one of the physical hard disks fails, the data on the RAID array remains intact, allowing us to replace the failed disk without any interruption to service.
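For anyone curious how a RAID 5 array survives losing a disk, here is a minimal Python sketch of the XOR parity idea. It is only an illustration and not how our SAN is implemented: the function name `xor_blocks` and the block contents are made up for the example, and a real array works on much larger stripes and rotates the parity across its disks.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR equal-length byte blocks together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


# Three hypothetical data blocks making up one stripe (contents are made up).
data = [b"AAAA", b"BBBB", b"CCCC"]

# The parity block is the XOR of all the data blocks in the stripe.
parity = xor_blocks(data)

# Simulate losing the disk that held the second block: XORing the
# surviving data blocks with the parity block recovers the lost block.
survivors = [data[0], data[2], parity]
rebuilt = xor_blocks(survivors)
assert rebuilt == data[1]
```

The same trick only works for one missing block per stripe, which is why a second disk failing before the first has been replaced and rebuilt would lose data on a RAID 5 array.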

Most of the time users never notice a single hard disk failure. However, last Thursday night (21/2/2013) one of the physical disks in a RAID 5 array failed and, unusually, this caused the array to go offline briefly. Unfortunately one of our servers was writing to the array at the time, which caused the kernel to report an error and take the mounted device offline. This affected about five group file space areas stored on that array. These group areas remained offline until the computing staff were able to investigate, check for and repair any problems, and re-enable the group areas.

We’ve been in touch with the suppliers of this SAN unit, as this is not the expected behaviour. They’ve pointed out that the firmware on the SAN unit is out of date, and we are therefore assuming this was a bug in the old firmware which has since been rectified.

We will be looking to update the firmware to the recommended version. Although it should be safe to apply the update to the running hardware, we will schedule some downtime to avoid any risk to the data on the array. Unfortunately this will mean disruption for any users with data on the array. We will notify users once we have a date and time in mind.

Neil

SAN Disk failure / Computing Systems by is licensed under a
