RE: Cluster doesn't failover to other node properly

Tech Tip: Click here to run a free scan for Windows Errors and optimize PC performance



Hi Marlon,

Only really way to troubleshoot failover problems is by interpreting the
Cluster Log of what's going on between each of the node. Where it gets
hairy, is if you have more than 2 nodes in a Windows 2003 Cluster. Because
you would have to analyze all Cluster Logs simultaneously to interpret the
communication between the nodes, to know exactly where the failure is
occurring.

Title Recommendations:
Hard Copy: Microsoft Windows 2000 Server - Distributed Systems Guide
from the Windows 2000 Resource Kit
Chapter 20 (page 1121) Interpreting Cluster Log
ISBN: 1-57231-805-8

Online Copy:
http://www.microsoft.com/resources/documentation/windows/2000/server/reskit/
en-us/distsys/part3/dsgch20.mspx

Best way I've seen possible in doing this, would be:

1. Go to the Property of all the resources in that group having problems,
on the Advanced tab se them all to "Do not restart"
2. Do a Move Group from ExchServerA to ExchServerB
3. See which resource is failing.
4. Then open Cluster Log for both nodes side by side and Interpreting them
until you find the failure.

NOTE: Keep in mind that the log are in GMT time zone.

--------------------
Hope this helps,
Mike Rosado
Windows 2000 MCSE + MCDBA
Microsoft Enterprise Platform Support
Windows NT/2000/2003 Cluster Technologies

====================================================
When responding to posts, please "Reply to Group" via your newsreader so
that others may learn and benefit from your issue.
====================================================

This posting is provided "AS IS" with no warranties, and confers no rights.
<http://www.microsoft.com/info/cpyright.htm>

-----Original Message-----
> From: =?Utf-8?B?TWFybG9uIEJyb3du?= <MarlonBrown@xxxxxxxxxxxxxxxxxxxxxxxxx>
> Subject: Cluster doesn't failover to other node properly
> Date: Fri, 18 Nov 2005 22:29:02 -0800
>
> This is a recent problem:
> Win2003Sp1+Exch2003 SP1.
> I go to cluster administraton, click "Mycluster->Groups", I highlight
> "MyExchVirtualServer". Then I select "move group".
> I see all resources go from "Online to Offline", then it attempts to
> transfer resources to ExchserverB, but it doesn't go through.
> No error message appears, it just reverts back to "ExchServerA". All
other
> resources "Cluster Group" and "MSDTC Group" complete failover OK.
>
> I attempted to look at \Cluster\cluster.log but I don't see any relevant
info.
> Can you clarify again which log or how I can troubleshoot this and find
out
> what is causing this to fail ?
>

.



Relevant Pages

  • Re: Auto Failback issue?
    ... Check the event log and cluster log on the node where you're attempting to ... move the resources to. ... immediately failing on the other node. ...
    (microsoft.public.windows.server.clustering)
  • Re: Cluster log File
    ... The following is the recommended books and can ... Microsoft Windows 2000 Server - Distributed Systems Guide ... > Is there any document regarding how to read the cluster log file? ... > GUID_IO_VOLUME_MOUNT for L (Partition1) received. ...
    (microsoft.public.windows.server.clustering)