Lost Quorum
From: Malle (Malle_at_discussions.microsoft.com)
Date: 11/03/04
- Next message: TomD: "cluster administrator logon error"
- Previous message: Ramon Jiménez: "Re: Which clustering solution is appropriate"
- Next in thread: Mike Rosado [MSFT]: "Re: Lost Quorum"
- Reply: Mike Rosado [MSFT]: "Re: Lost Quorum"
- Messages sorted by: [ date ] [ thread ]
Date: Wed, 3 Nov 2004 04:39:04 -0800
We have a two-node Windows 2000 cluster attached via Fibre Channel to our SAN
storage. We use Veritas Volume Manager 3.1 on both nodes.
We noticed a failover, and our customer wants to know the reason :-/
We found event IDs 1000, 1015, and 1038 in the system log, and it seems to be
a problem with the quorum.
We checked the cluster log and found the following errors:
000006f4.000007a8::2004/11/01-06:34:00.807 [DM]DmpCheckpointTimerCb- taking a
checkpoint
000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogReset entry...
000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpReset entry...
000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpCreate : Entry
000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpMountLog : Entry
pLog=0x000c5c08
000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpMountLog::Quorumlog File
size=0x00000000
000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpInitLog : Entry
pLog=0x000c5c08
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpAppendPage : Writing
1024 bytes to disk at offset 0x00000000
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpInitLog :
NextLsn=0x00000408 FileAlloc=0x00000800 ActivePageOffset=0x00000400
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpCreate : Exit with success
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogGetLastChkPoint:: Entry
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogGetLastChkPoint exit,
returning 0x000013a8
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogReset:: no check point
found in the old log file
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogCheckPoint entry
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpReset:: Callback failed
to return a checkpoint, error=87
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogClose : Entry
LogFile=0x000c5c08
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogFlush : pLog=0x000c5c08
writing the 1024 bytes for active page at offset 0x00000400
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogClose : Exit returning
success
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpReset exit, returning
0x00000057
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogReset exit, returning
0x00000057
000006f4.000007a8::2004/11/01-06:34:00.823 [DM]DmpCheckpointTimerCb - Failed
to reset log, error=87
000006f4.000007a8::2004/11/01-06:34:00.839 Microsoft Clustering Service
suffered an unexpected fatal error
at line 2166 of source module D:\nt\private\cluster\service\dm\dmlog.c. The
error code was 87.
000007e8.000007cc::2004/11/01-06:34:01.542 [RM] Going away, Status = 1,
Shutdown = 0.
000007e8.000007cc::2004/11/01-06:34:01.542 [RM] RmpRundownResources,
terminate resource <Logistik>...
000007e8.000007cc::2004/11/01-06:34:01.542 File Share <Logistik>:
SmbShareDoTerminate: SmbpShareNotifyWorker Terminated... !!!
000007e8.0000091c::2004/11/01-06:34:01.542 Physical Disk <Disk Q:>:
[DiskArb] CompletionRoutine, status 0.
000007e8.0000091c::2004/11/01-06:34:01.542 Physical Disk <Disk Q:>:
[DiskArb] posting AsyncCheckReserve request.
000007e8.0000091c::2004/11/01-06:34:01.542 Physical Disk <Disk Q:>:
[DiskArb] error checking disk reservation thread, error 995.
000007e8.0000091c::2004/11/01-06:34:01.557 Physical Disk <Disk Q:>:
[DiskArb] CompletionRoutine: reservation lost!
000007e8.0000091c::2004/11/01-06:34:01.557 [RM] RmpLostQuorumResource,
cluster service terminated...
000007e8.000007cc::2004/11/01-06:34:01.557 [RM] RmpRundownResources, close
resource <Logistik>...
000007e8.000007cc::2004/11/01-06:34:01.557 [RM] RmpRundownResources,
terminate resource <PROBASTEST>...
000007e8.000007cc::2004/11/01-06:34:01.557 File Share <PROBASTEST>:
SmbShareDoTerminate: SmbpShareNotifyWorker Terminated... !!!
000007e8.000007cc::2004/11/01-06:34:01.557 [RM] RmpRundownResources, close
resource <PROBASTEST>...
000007e8.000007cc::2004/11/01-06:34:01.557 [RM] RmpRundownResources,
terminate resource <NetworkerRemoteExec>...
000007e8.000007cc::2004/11/01-06:34:01.557 Generic Service
<NetworkerRemoteExec>: Terminate request.
000007e8.000007cc::2004/11/01-06:34:01.557 Generic Service
<NetworkerRemoteExec>: GenSvcTerminate : calling SCM
000007e8.00000920::2004/11/01-06:34:01.573 Physical Disk: PnP Event
GUID_IO_VOLUME_DISMOUNT for 619968 received
000007e8.000007cc::2004/11/01-06:35:01.728 Generic Service
<NetworkerRemoteExec>: GenSvcTerminate: retrying...
000020a4.000018c8::2004/11/01-06:35:01.822
000020a4.000018c8::2004/11/01-06:35:01.822 [CS] Cluster Service started -
Cluster Node Version 3.2195
000020a4.000018c8::2004/11/01-06:35:01.822 OS
Version 5.0.2195 - Service Pack 4 (AS)
000020a4.0000207c::2004/11/01-06:35:01.822 [CS] Service Starting...
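To make the excerpt above easier to scan, here is a minimal Python sketch that rejoins the mail-wrapped lines and pulls out the error codes. The names parse_entries, error_codes, and WIN32_ERRORS are made up for this illustration and are not part of any cluster tool. The two codes in the table are the standard Win32 meanings (87 is ERROR_INVALID_PARAMETER, 995 is ERROR_OPERATION_ABORTED, and 0x57 is simply 87 in hex). Decoding them does not explain the root cause by itself.

```python
import re

# Entry header: <process id>.<thread id>::<date>-<time> <message>
ENTRY = re.compile(
    r"^(?P<pid>[0-9a-fA-F]+)\.(?P<tid>[0-9a-fA-F]+)::"
    r"(?P<ts>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}\.\d{3})\s*(?P<msg>.*)$"
)

# Standard Win32 error codes that appear in the excerpt.
WIN32_ERRORS = {
    87: "ERROR_INVALID_PARAMETER",
    995: "ERROR_OPERATION_ABORTED",
}

# Matches "error=87", "error 995", "error code was 87", "returning 0x00000057".
CODE = re.compile(r"(?:error(?: code was)?[= ]\s*(\d+)|returning (0x[0-9a-fA-F]+))")


def parse_entries(lines):
    """Rejoin wrapped lines and return one dict per log entry."""
    entries = []
    for line in lines:
        line = line.strip()
        m = ENTRY.match(line)
        if m:
            entries.append(m.groupdict())
        elif entries and line:
            # Continuation of the previous entry, wrapped by the mail client.
            entries[-1]["msg"] += " " + line
    return entries


def error_codes(msg):
    """Return the numeric codes mentioned in a message, as decimal integers."""
    return [int(dec) if dec else int(hexa, 16) for dec, hexa in CODE.findall(msg)]


# Three entries copied from the log above.
SAMPLE = """\
000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpReset exit, returning
0x00000057
000006f4.000007a8::2004/11/01-06:34:00.823 [DM]DmpCheckpointTimerCb - Failed
to reset log, error=87
000007e8.0000091c::2004/11/01-06:34:01.542 Physical Disk <Disk Q:>:
[DiskArb] error checking disk reservation thread, error 995.
""".splitlines()

for entry in parse_entries(SAMPLE):
    for code in error_codes(entry["msg"]):
        # Codes missing from the table are reported as such, not guessed.
        print(entry["ts"], entry["pid"], code, WIN32_ERRORS.get(code, "not in table"))
```

Note that a "returning 0x..." value is not always an error code (LogGetLastChkPoint returns 0x000013a8 above, which is not a failure), so anything the table does not know is left undecoded.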
Does anyone have an idea what happened?