Re: Lost Quorum
From: Malle (Malle_at_discussions.microsoft.com)
Date: 11/04/04
- Previous message: Chacko: "Re: MNS Print Server Clustering"
- In reply to: Mike Rosado [MSFT]: "Re: Lost Quorum"
- Messages sorted by: [ date ] [ thread ]
Date: Wed, 3 Nov 2004 23:39:01 -0800
Hi Mike,
thanks for your explanations.
I will contact the veritas support.
"Mike Rosado [MSFT]" wrote:
> Hi Malle,
>
> I'm by no means an expert in this subject matter of Veritas Volume Manager,
> but I'll try to assist you to the best of my ability.
>
> Here's your problem, the error 995 which caused the reservation on the
> Quorum disk to be lost.
>
> 000007e8.0000091c::2004/11/01-06:34:01.542 Physical Disk <Disk Q:>:
> [DiskArb] error checking disk reservation thread, error 995.
> 000007e8.0000091c::2004/11/01-06:34:01.557 Physical Disk <Disk Q:>:
> [DiskArb] CompletionRoutine: reservation lost!
> 000007e8.0000091c::2004/11/01-06:34:01.557 [RM] RmpLostQuorumResource,
> cluster service terminated...
>
> Here's what the error 995 means:
>
> # for decimal 995 / hex 0x3e3 :
> ERROR_OPERATION_ABORTED
> # The I/O operation has been aborted because of either a
> # thread exit or an application request.
>
> The unfortunate part, is that we don't support Dynamic Disk on a Cluster.
> Which looks like you are doing with Veritas Volume Manager and as stated in
> the paragraph excerpt below, they should be your first point of contact when
> encountering problems. Have you already contacted Veritas?
>
> 237853 Dynamic Disk Configuration Unavailable for Server Cluster Disk
> Resources
> http://support.microsoft.com/?id=237853
>
> When you install the Veritas Volume Manager product on a cluster and
> configure Volume Manager Disk Group resources, Veritas is the first point of
> support for cluster issues related to those resources.
>
> --
> Hope this helps,
> Mike Rosado
> Windows 2000 MCSE + MCDBA
> Microsoft Enterprise Platform Support
> Windows NT/2000/2003 Cluster Technologies
>
> ====================================================
> When responding to posts, please "Reply to Group" via your newsreader so
> that others may learn and benefit from your issue.
> ====================================================
>
> This posting is provided "AS IS" with no warranties, and confers no rights.
> <http://www.microsoft.com/info/cpyright.htm>
>
> -----Original Message-----
>
> "Malle" <Malle@discussions.microsoft.com> wrote in message
> news:D1009367-B7AC-402C-929E-706C8D6D99DF@microsoft.com...
> > We have a 2 node w2k cluster attached via fiberchannel to our san storage.
> > We use Veritas Volume Manager 3.1 on both nodes.
> > we have noticed a failover and our customer want to know the reason :-/
> > we have found event ID 1000,1015,1038 in the systemlog and it seems to be
> a
> > problem with the quorum.
> > we have checked the clusterlog we found the following errors:
> > 00006f4.000007a8::2004/11/01-06:34:00.807 [DM]DmpCheckpointTimerCb- taking
> a
> > checkpoint
> > 000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogReset entry...
> > 000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpReset entry...
> > 000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpCreate : Entry
> > 000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpMountLog : Entry
> > pLog=0x000c5c08
> > 000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpMountLog::Quorumlog
> File
> > size=0x00000000
> > 000006f4.000007a8::2004/11/01-06:34:00.807 [LM] LogpInitLog : Entry
> > pLog=0x000c5c08
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpAppendPage : Writing
> > 1024 bytes to disk at offset 0x00000000
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpInitLog :
> > NextLsn=0x00000408 FileAlloc=0x00000800 ActivePageOffset=0x00000400
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpCreate : Exit with
> success
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogGetLastChkPoint:: Entry
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogGetLastChkPoint exit,
> > returning 0x000013a8
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogReset:: no check point
> > found in the old log file
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogCheckPoint entry
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpReset:: Callback
> failed
> > to return a checkpoint, error=87
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogClose : Entry
> > LogFile=0x000c5c08
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogFlush : pLog=0x000c5c08
> > writing the 1024 bytes for active page at offset 0x00000400
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogClose : Exit returning
> > success
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogpReset exit, returning
> > 0x00000057
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [LM] LogReset exit, returning
> > 0x00000057
> > 000006f4.000007a8::2004/11/01-06:34:00.823 [DM]DmpCheckpointTimerCb -
> Failed
> > to reset log, error=87
> > 000006f4.000007a8::2004/11/01-06:34:00.839 Microsoft Clustering Service
> > suffered an unexpected fatal error
> > at line 2166 of source module D:\nt\private\cluster\service\dm\dmlog.c.
> The
> > error code was 87.
> > 000007e8.000007cc::2004/11/01-06:34:01.542 [RM] Going away, Status = 1,
> > Shutdown = 0.
> > 000007e8.000007cc::2004/11/01-06:34:01.542 [RM] RmpRundownResources,
> > terminate resource <Logistik>...
> > 000007e8.000007cc::2004/11/01-06:34:01.542 File Share <Logistik>:
> > SmbShareDoTerminate: SmbpShareNotifyWorker Terminated... !!!
> > 000007e8.0000091c::2004/11/01-06:34:01.542 Physical Disk <Disk Q:>:
> > [DiskArb] CompletionRoutine, status 0.
> > 000007e8.0000091c::2004/11/01-06:34:01.542 Physical Disk <Disk Q:>:
> > [DiskArb] posting AsyncCheckReserve request.
> > 000007e8.0000091c::2004/11/01-06:34:01.542 Physical Disk <Disk Q:>:
> > [DiskArb] error checking disk reservation thread, error 995.
> > 000007e8.0000091c::2004/11/01-06:34:01.557 Physical Disk <Disk Q:>:
> > [DiskArb] CompletionRoutine: reservation lost!
> > 000007e8.0000091c::2004/11/01-06:34:01.557 [RM] RmpLostQuorumResource,
> > cluster service terminated...
> > 000007e8.000007cc::2004/11/01-06:34:01.557 [RM] RmpRundownResources, close
> > resource <Logistik>...
> > 000007e8.000007cc::2004/11/01-06:34:01.557 [RM] RmpRundownResources,
> > terminate resource <PROBASTEST>...
> > 000007e8.000007cc::2004/11/01-06:34:01.557 File Share <PROBASTEST>:
> > SmbShareDoTerminate: SmbpShareNotifyWorker Terminated... !!!
> > 000007e8.000007cc::2004/11/01-06:34:01.557 [RM] RmpRundownResources, close
> > resource <PROBASTEST>...
> > 000007e8.000007cc::2004/11/01-06:34:01.557 [RM] RmpRundownResources,
> > terminate resource <NetworkerRemoteExec>...
> > 000007e8.000007cc::2004/11/01-06:34:01.557 Generic Service
> > <NetworkerRemoteExec>: Terminate request.
> > 000007e8.000007cc::2004/11/01-06:34:01.557 Generic Service
> > <NetworkerRemoteExec>: GenSvcTerminate : calling SCM
> > 000007e8.00000920::2004/11/01-06:34:01.573 Physical Disk: PnP Event
> > GUID_IO_VOLUME_DISMOUNT for 619968 received
> > 000007e8.000007cc::2004/11/01-06:35:01.728 Generic Service
> > <NetworkerRemoteExec>: GenSvcTerminate: retrying...
> > 000020a4.000018c8::2004/11/01-06:35:01.822
> >
> > 000020a4.000018c8::2004/11/01-06:35:01.822 [CS] Cluster Service started -
> > Cluster Node Version 3.2195
> > 000020a4.000018c8::2004/11/01-06:35:01.822
> OS
> > Version 5.0.2195 - Service Pack 4 (AS)
> >
> > 000020a4.0000207c::2004/11/01-06:35:01.822 [CS] Service Starting...
> >
> > Has anyone an idea what happens ?
> >
>
>
>
- Previous message: Chacko: "Re: MNS Print Server Clustering"
- In reply to: Mike Rosado [MSFT]: "Re: Lost Quorum"
- Messages sorted by: [ date ] [ thread ]
Relevant Pages
|