Re: Physical disk hangs at "offline pending"



Hi,

We have installed Oracle failsafe on this cluster and the drive in question
is part of the "Cluster Group" set of resources. The oracle database resides
on this SAN drive. I have stopped all oracle services on the server giving me
the problems and the disk still does not go offline to enable a failover
unless the server is shut down.
I suppose there must be something else preventing the failover and am trying
to determine what could be preventing this disk from being released. The
server in question does have exclusive rights to this physical disk when it
is the active member.
If anyone has any idea as to how I might determine if some process is
refusing to release it's resources please make a suggestion.
Is there a way to increase the logging level of the cluster and should that
give me a better indication of what may be the problem? (the logs are fairly
hard to decipher even at the default logging level).

Thanks in Advance,
--
Henry


"Chuck Timon [MSFT]" wrote:

Sounds like something has a handle to the drive that is preventing cluster
from completing the Offline process. What kind of group is this disk
resource in?

Chuck Timon, Jr.
Microsoft Corporation
Longhorn Readiness Team
This posting is provided "AS IS" with no warranties, and confers no rights.

"Henry" <Henry@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
news:4429C77B-C125-4677-8F00-C2D96D014716@xxxxxxxxxxxxxxxx
Hi,

I have a 2 node cluster that works correctly when the active server goes
down.
All resources are taken over by the passive member.

When I try to move resources from node 1 to node 2 everything works fine
as
well.
The problem is that when I try to move the the resources back to the
original node all resources move except for one physical disk. This
physical
disk status remains as "offline pending". The cluster log contains many
entries similar to what follows:
"FmpofflineResource: offline resource <drivex>returned pending"
until finally
"RmpTimerThread: Resource drivex pending timed out, CP 3 - seting state to
failed."

The only way for us to get the offline resource available for the other
cluster member is to reboot the server that failed to put the physical
drive
offline.

Any ideas would be appreciated.
--
Thanks in Advance,

Henry


.



Relevant Pages

  • Re: Cluster Freezes
    ... When testing the offline times I notice that the file shares go offline in a ... the Physical disk seem to take atleast 30-45 seconds to ... The failover time for all the resources is really good maybe 20 seconds tops ... the real problem is coming back online. ...
    (microsoft.public.windows.server.clustering)
  • Re: MSDTC Disk Problem
    ... As i told you it fails the DTC resources,instead i can move ip and network ... error that the resources can't open the log file that reside on the disk ... >> rejoin the crashed node in the cluster ...
    (microsoft.public.windows.server.clustering)
  • Re: Cluster disks show as healthy in Disk Management
    ... The cluster has two nodes with 4 SAN attached disks. ... active on one node at any one time, the resources are never split across ... 'disk manager' shows unallocated on that node). ... This cluster shows Healthy ...
    (microsoft.public.windows.server.clustering)
  • Re: 64 BIT Cluster fail over problem
    ... Cluster environment on Windows® Server 2003 running Data ONTAP® DSM ... the passive nodes may fail to take over the disk resources. ... When using the Data ONTAP DSM 3.2 for Windows MPIO on Windows Server 2003, ...
    (microsoft.public.windows.server.clustering)
  • Re: cluster scripting suggestions?
    ... cluster to house all of its data. ... are offline when data replication begins. ... Begin data replication. ... I've also determined how to take the desired resources offline via the ...
    (microsoft.public.windows.server.clustering)

Loading