Re: Advice on RAID crash

Tech-Archive recommends: Speed Up your PC by fixing your registry



How old are the drives and the RAID controller?

Suggestion: Budget for a new box, or at least a new RAID setup.

We had Event ID 15s on a 6 drive RAID 5 array that turned out to be some bad sectors on one of the drives in the array that the controller and the OS did not pickup on. We ended up figuring that out after we replaced the controller and drives and could test each drive individually.

But, we needed to go through some pain to get things back first.

Run chkdsk /f /r on the array. Note that it can take a huge chunk of time to complete.

If it is indeed bad sectors on one of the array members, the problem may not show itself for a while again. Part of the problem is that a chkdsk /f /r on the dismounted partition still may not fix things completely as the bad sector problem can keep growing beyond the bad areas marked by the OS after the chkdks is run.
--

Philip E.
MPECS Inc.
Microsoft Small Business Specialists
http://blog.mpecsinc.ca
"Al Williams" <donotreplydirect@xxxxxxxxxxxxxxxx> wrote in message news:eQeaBqkUIHA.4696@xxxxxxxxxxxxxxxxxxxxxxx
SBS2003 Premium SP2

Recently our Adaptec 2100 SCSI U160 RAID5 array went down and caused a bunch of errors like this in the system log:

Event Type: Error
Event Source: dpti2o
Event Category: None
Event ID: 15
Date: 08-01-07
Time: 8:18:55 AM
User: N/A
Computer: HISERVER
Description:
The device, \Device\Scsi\dpti2o1, is not ready for access yet.
----------------------
Event Type: Error
Event Source: Disk
Event Category: None
Event ID: 11
Date: 08-01-07
Time: 8:18:55 AM
User: N/A
Computer: HISERVER
Description:
The driver detected a controller error on \Device\Harddisk0.

This RAID array contains some data and the exchange and SQL databases. Other drives are on another separate RAID mirror, so the server continued to run as best it could but needed several reboots to get going Monday morning (lots of SQL and exchange errors in the logs due to the drive going offline). After rebooting a few times the server has been fine ever since (did a full backup as well).

The strange thing is I checked the Adaptec Storage Manager logs and it shows no errors. As far as it is concerned the RAID array is fine - none of the disks show any faults and the array did not degrade.

No changes have been made recently to the server except for a couple of of the usual monthly winupdates. I leaning towards this being a hardware problem as the server locked up tight a few times while attempting to recover from the issue Monday (gave a STOP 100000ea as well). Could the RAID card itself be having issues?

Any ideas on how I can determine what caused the RAID to go offline?

Thanks,

--
Allan Williams





.



Relevant Pages

  • Need help with RAID question. I think something is really screwed up!!!
    ... The I set up a RAID 5 array using an external enclosure and four ... Western Digital WD5000YS drives. ... I set up the array and it worked. ... controller, not the RAID controller). ...
    (comp.sys.ibm.pc.hardware.storage)
  • Re: Problems with software RAID on SATA
    ... Connected to this are two 320GB drives ... >>which I want to turn into a RAID1 array. ... >>I'm almost certain it's a problem with initting the RAID arrays at boot. ...
    (Debian-User)
  • Re: RAID newbie...can I have several partitions on a RAID 1 array?
    ... You haven't expounded upon why you think you need raid. ... better backup device rather than buy 2 cheap RAID HBAs. ... RAID array then I would have to replace the mobo with the same one or at ... Lets say, for example, you buy 2 identical model drives, from ...
    (comp.sys.ibm.pc.hardware.storage)
  • Re: [PATCH 000 of 5] md: Introduction
    ... "why linux raid isn't Raid really, why it can be worse than plain disk") ... After this, the array ... error is in the filesystem, due to the complex layout of raid5. ... hundreds or 1000s of drives, you've quite high probability that some of them will fail sometimes, or will develop a bad sector etc). ...
    (Linux-Kernel)
  • Re: best practice for hard drive upgrade
    ... It's annoying that after all these years this is not surprising, the simple action of replacing an existing array with a similar array on larger drives isn't exactly something I consider as 'pushing the envelope'. ... Apparently this RAID card - promise tx4310 - will not resize the array. ... The controller also may not support multiple volumes. ...
    (microsoft.public.windows.server.sbs)