Re: Lost Quorum
From: Mike Rosado [MSFT] (mikeros_at_online.microsoft.com)
Date: 11/03/04
- Next message: allan: "Upgrading winnt40 cluster to 2003 cluster on a new computer"
- Previous message: Mario Fuchs: "Postscript Driver Update"
- In reply to: Malle: "Lost Quorum"
- Next in thread: Malle: "Re: Lost Quorum"
- Reply: Malle: "Re: Lost Quorum"
- Messages sorted by: [ date ] [ thread ]
Date: Wed, 3 Nov 2004 17:55:32 -0600
Hi Malle,
I'm by no means an expert on Veritas Volume Manager, but I'll try to
assist you to the best of my ability.
Here's your problem: error 995, which caused the reservation on the
Quorum disk to be lost.
000007e8.0000091c::2004/11/01-06:34:01.542 Physical Disk <Disk Q:>:
[DiskArb] error checking disk reservation thread, error 995.
000007e8.0000091c::2004/11/01-06:34:01.557 Physical Disk <Disk Q:>:
[DiskArb] CompletionRoutine: reservation lost!
000007e8.0000091c::2004/11/01-06:34:01.557 [RM] RmpLostQuorumResource,
cluster service terminated...
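For anyone who wants to pull lines like these out of a larger cluster.log, here is a minimal Python sketch. It assumes each entry has been rejoined onto one line (the newsreader wrapped them above) and that the two leading fields are the process and thread IDs in hex; the names `parse_entry` and `find_error_codes` are made up for this illustration and are not part of any Microsoft tool.

```python
import re
from datetime import datetime

# Assumed entry layout, based only on the excerpt above:
#   <hex process id>.<hex thread id>::<YYYY/MM/DD-HH:MM:SS.mmm> <message>
ENTRY = re.compile(
    r"^(?P<pid>[0-9a-fA-F]+)\.(?P<tid>[0-9a-fA-F]+)::"
    r"(?P<ts>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}\.\d{3})\s+(?P<msg>.*)$"
)


def parse_entry(line):
    """Split one unwrapped log line into its fields, or return None."""
    m = ENTRY.match(line.strip())
    if m is None:
        return None
    return {
        "pid": int(m.group("pid"), 16),
        "tid": int(m.group("tid"), 16),
        "time": datetime.strptime(m.group("ts"), "%Y/%m/%d-%H:%M:%S.%f"),
        "msg": m.group("msg"),
    }


def find_error_codes(msg):
    """Return decimal codes written as 'error 995' or 'error=87'."""
    return [int(code) for code in re.findall(r"error[ =](\d+)", msg)]


line = ("000007e8.0000091c::2004/11/01-06:34:01.542 Physical Disk <Disk Q:>: "
        "[DiskArb] error checking disk reservation thread, error 995.")
entry = parse_entry(line)
print(entry["time"], find_error_codes(entry["msg"]))
```

Filtering a whole log on a non-empty `find_error_codes` result should surface the same three DiskArb/RM lines quoted above, along with any other entries that report a numeric error.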
Here's what the error 995 means:
# for decimal 995 / hex 0x3e3 :
ERROR_OPERATION_ABORTED
# The I/O operation has been aborted because of either a
# thread exit or an application request.
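If you want to decode such codes yourself, the following is a small Python sketch of the same lookup. The table is hand-written and covers only the two codes that show up in the log in this thread (995 and 87); `WIN32_ERRORS` and `describe` are names invented for this example, not an existing API.

```python
# Hand-maintained table: only the Win32 error codes seen in this thread.
WIN32_ERRORS = {
    87: "ERROR_INVALID_PARAMETER",
    995: "ERROR_OPERATION_ABORTED",
}


def describe(code):
    """Format a code as decimal and hex, plus its symbolic name if known."""
    name = WIN32_ERRORS.get(code, "unknown (not in this table)")
    return f"decimal {code} / hex 0x{code:x} : {name}"


print(describe(995))
print(describe(87))
```

On a Windows machine, `net helpmsg 995` prints the message text for a code without needing any table.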
Unfortunately, we don't support Dynamic Disks on a Cluster, which is what
you appear to be using with Veritas Volume Manager. As stated in the
excerpt below, Veritas should be your first point of contact when you
encounter problems. Have you already contacted Veritas?
237853 Dynamic Disk Configuration Unavailable for Server Cluster Disk
Resources
http://support.microsoft.com/?id=237853
When you install the Veritas Volume Manager product on a cluster and
configure Volume Manager Disk Group resources, Veritas is the first point of
support for cluster issues related to those resources.
--
Hope this helps,
Mike Rosado
Windows 2000 MCSE + MCDBA
Microsoft Enterprise Platform Support
Windows NT/2000/2003 Cluster Technologies
====================================================
When responding to posts, please "Reply to Group" via your newsreader so
that others may learn and benefit from your issue.
====================================================
This posting is provided "AS IS" with no warranties, and confers no rights.
<http://www.microsoft.com/info/cpyright.htm>
-----Original Message-----
"Malle" <Malle@discussions.microsoft.com> wrote in message
news:D1009367-B7AC-402C-929E-706C8D6D99DF@microsoft.com...
> We have a 2 node w2k cluster attached via fiberchannel to our san storage.
> We use Veritas Volume Manager 3.1 on both nodes.
> we have noticed a failover and our customer want to know the reason :-/
> we have found event ID 1000,1015,1038 in the systemlog and it seems to be a
> problem with the quorum.
> we have checked the clusterlog we found the following errors:
> 00006f4.000007a8::2004/11/01-06:34:00.807 [DM]DmpCheckpointTimerCb- taking a
> checkpoint
> 000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogReset entry...
> 000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpReset entry...
> 000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpCreate : Entry
> 000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpMountLog : Entry
> pLog=0x000c5c08
> 000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpMountLog::Quorumlog File
> size=0x00000000
> 000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpInitLog : Entry
> pLog=0x000c5c08
> 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpAppendPage : Writing
> 1024 bytes to disk at offset 0x00000000
> 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpInitLog :
> NextLsn=0x00000408 FileAlloc=0x00000800 ActivePageOffset=0x00000400
> 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpCreate : Exit with success
> 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogGetLastChkPoint:: Entry
> 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogGetLastChkPoint exit,
> returning 0x000013a8
> 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogReset:: no check point
> found in the old log file
> 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogCheckPoint entry
> 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpReset:: Callback failed
> to return a checkpoint, error=87
> 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogClose : Entry
> LogFile=0x000c5c08
> 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogFlush : pLog=0x000c5c08
> writing the 1024 bytes for active page at offset 0x00000400
> 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogClose : Exit returning
> success
> 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpReset exit, returning
> 0x00000057
> 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogReset exit, returning
> 0x00000057
> 000006f4.000007a8::2004/11/01-06:34:00.823 [DM]DmpCheckpointTimerCb - Failed
> to reset log, error=87
> 000006f4.000007a8::2004/11/01-06:34:00.839 Microsoft Clustering Service
> suffered an unexpected fatal error
> at line 2166 of source module D:\nt\private\cluster\service\dm\dmlog.c. The
> error code was 87.
> 000007e8.000007cc::2004/11/01-06:34:01.542 [RM] Going away, Status = 1,
> Shutdown = 0.
> 000007e8.000007cc::2004/11/01-06:34:01.542 [RM] RmpRundownResources,
> terminate resource <Logistik>...
> 000007e8.000007cc::2004/11/01-06:34:01.542 File Share <Logistik>:
> SmbShareDoTerminate: SmbpShareNotifyWorker Terminated... !!!
> 000007e8.0000091c::2004/11/01-06:34:01.542 Physical Disk <Disk Q:>:
> [DiskArb] CompletionRoutine, status 0.
> 000007e8.0000091c::2004/11/01-06:34:01.542 Physical Disk <Disk Q:>:
> [DiskArb] posting AsyncCheckReserve request.
> 000007e8.0000091c::2004/11/01-06:34:01.542 Physical Disk <Disk Q:>:
> [DiskArb] error checking disk reservation thread, error 995.
> 000007e8.0000091c::2004/11/01-06:34:01.557 Physical Disk <Disk Q:>:
> [DiskArb] CompletionRoutine: reservation lost!
> 000007e8.0000091c::2004/11/01-06:34:01.557 [RM] RmpLostQuorumResource,
> cluster service terminated...
> 000007e8.000007cc::2004/11/01-06:34:01.557 [RM] RmpRundownResources, close
> resource <Logistik>...
> 000007e8.000007cc::2004/11/01-06:34:01.557 [RM] RmpRundownResources,
> terminate resource <PROBASTEST>...
> 000007e8.000007cc::2004/11/01-06:34:01.557 File Share <PROBASTEST>:
> SmbShareDoTerminate: SmbpShareNotifyWorker Terminated... !!!
> 000007e8.000007cc::2004/11/01-06:34:01.557 [RM] RmpRundownResources, close
> resource <PROBASTEST>...
> 000007e8.000007cc::2004/11/01-06:34:01.557 [RM] RmpRundownResources,
> terminate resource <NetworkerRemoteExec>...
> 000007e8.000007cc::2004/11/01-06:34:01.557 Generic Service
> <NetworkerRemoteExec>: Terminate request.
> 000007e8.000007cc::2004/11/01-06:34:01.557 Generic Service
> <NetworkerRemoteExec>: GenSvcTerminate : calling SCM
> 000007e8.00000920::2004/11/01-06:34:01.573 Physical Disk: PnP Event
> GUID_IO_VOLUME_DISMOUNT for 619968 received
> 000007e8.000007cc::2004/11/01-06:35:01.728 Generic Service
> <NetworkerRemoteExec>: GenSvcTerminate: retrying...
> 000020a4.000018c8::2004/11/01-06:35:01.822
>
> 000020a4.000018c8::2004/11/01-06:35:01.822 [CS] Cluster Service started -
> Cluster Node Version 3.2195
> 000020a4.000018c8::2004/11/01-06:35:01.822 OS
> Version 5.0.2195 - Service Pack 4 (AS)
>
> 000020a4.0000207c::2004/11/01-06:35:01.822 [CS] Service Starting...
>
> Has anyone an idea what happens ?
>