Re: Physical Disk goes offline when cluster node reboots



Darrek,

Just to confirm that we have the symptom right

- All groups are online on Node 1 (therefore all disks are online on Node 1)
- you reboot Node 2
- All disks on Node 1 go offline on Node 1 during reboot/Post of Node 2

Please confirm that this is what you are experiencing

and two questions:
Q: are the disks who go offline on Node 2, do they fail or do they go
offline ? (please specify, as there is a difference)
Q: Do you see any "reservation lost" messages/events in the system event log
?

rgds,
Edwin.




"Darrek" <Darrek.Kay1@xxxxxxxx> wrote in message
news:1166032177.312098.234640@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
I have a 2 node Windows 2003 SP1 EE cluster connected to an MSA1000 SAN
via integrated FC hub. My SAN is single-path since this is our Dev/QA
environment.

When I reboot any node in the cluster all physical disk resources go
offline while the rebooted server goes through POST. I get Delayed
Write Failed errors in the event log of the node that is still running.
Once the rebooted node is up and running the cluster returns to
normal.

I'm worried that our production cluster may exhibit the same issues
when it goes live even though it is built in a more robust fashion.

I'm open for suggestions.

The servers are HP DL145's, using Emulex FC2243 cards. If I simply
failover a cluster group everything works great.

Thanks.
-DK



.



Relevant Pages

  • Re: Help adding third node.
    ... zone the HBA's of node 3 to your Clariion ... reboot node 3 or restart ... at that point they wil be able to see the same disk. ... Add node 3 and 4 to the cluster. ...
    (microsoft.public.windows.server.clustering)
  • Re: How to upgrade to SP2 on active/active
    ... I need a simple procedure to upgrade our SQL2005 active/active cluster. ... 2- Log on node 1 and install the SP2 on the two instances ... 4- Reboot node 1 ...
    (microsoft.public.sqlserver.clustering)
  • Re: Cannot join existing Cluster - Access is denied
    ... I recently had a problem and had to reboot node 1 of my 2 node cluster. ... I am getting access is denied errors. ... I have verified that all my NTLMv2 req., NTLM SSP are consistent between my Domain controller, node 1 and node 2. ...
    (microsoft.public.windows.server.clustering)
  • Cannot join existing Cluster - Access is denied
    ... I recently had a problem and had to reboot node 1 of my 2 node cluster. ... access is denied errors. ... I have verified that all my NTLMv2 req., NTLM SSP are ...
    (microsoft.public.windows.server.clustering)
  • Re: Clustering Service not starting right away on One Node
    ... Node 2 which I haven't seen the problem with having ownership of the cluster. ... When I reboot node 1 is when I see the problem of it taking 1 mintue to ... If the cluster service does come online quickly, ... > I support the Professional Association for SQL Server ...
    (microsoft.public.sqlserver.clustering)

Loading