Re: 99.999% Uptime For A Cluster - Real World Comments
- From: Simon <Simon@xxxxxxxxxxxxxxxxxxxxxxxxx>
- Date: Sat, 13 May 2006 01:03:01 -0700
John
But this is what is happening. One Node has over 40 resources, which I dont
know is good or bad. Finding any real good working documentation outside of
microsoft is extremely hard. Even a book is hard to come by here in the UK.
Is there a limit to the amount of resources a node can host. I do plan to
add another node, but was concerned I may be ADDING to the problem, rather
than relieving.
And 2 nodes have failed with BSOD - which I did post on here before, and
never really received any real satisfactory response as the message is
extremely vague.
Also John, when a failure occurs, not all resources come back on line. Worse
still, is the resources move onto another node, and then all go offline and
attempt to go back to its host node. Only to then go down and revert to the
node it initially attempted to move to! In other words, its playing ping-pong!
Simon
"John Toner [MVP]" wrote:
Simon,.
If you plan out your clusters correctly, you should never get into most of
the situations you describe. You should rarely experience a hardware failure
on both nodes at the same time, though it can surely happen. Cluster has the
solution for this, though...add another node :)
One node should never be in the situation where it dies because "it cannot
take on all the resources." This is clearly a planning issue rather than a
cluster issue.
Regards,
John
"Simon" <Simon@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
news:4EA4153E-16A9-4056-BF90-E4908D7C0400@xxxxxxxxxxxxxxxx
Thanks Rodneynode
Cluster in question is a File Server on 2 nodes. We dont have an SLA for
99.999 uptime - someone said that in a year, you could achieve 32mins
downtime.
I sort of disagree - because if you have a hardware problem or both nodes
fail and its a complete disaster your stuffed really. and if one of your
dies because it cannot take on all resources, your also stuffed.book
Again, all depends on setup as well I guess. I wish there was a better
out there!!!!a
But I wanted to get the feel from other people who work day in and day out
with clusters. Surely, you have come across sites where the cluster is in
mess, and you are able to sort things out?Per
I think that is part of my problem - fully understanding the cluster, the
hardware config, why there are SO many resources on each node!
Simon
"Rodney R. Fournier [MVP]" wrote:
Short answer - Yes
Medium answer - Depends. How do you define uptime? Application uptime?
alerts.Server? If you define it on a per server at the hardware layer, then
probably not. Patches, BIOS/Firmware updates, etc. will kill a few 9s.
Long answer - I sure hope so.
What does your SLA define for uptime? You have an SLA right?
Hopefully you monitor it with user application availability.
Hopefully you have a monitoring system in place that can send out
themBoth proactive and reactive.
Hopefully you have standard - well defined maintenance windows, patch
management, virus protection, firewalls, policies, etc.
Hopefully you have a fully trained staff on hand 24x7.
Hopefully you have vendor support and a good working relationship with
mind,already.
Hopefully you have hardware from the Clustering HCL.
Hopefully your organization from the top down understands and wants to
maintain an HA environment.
Hopefully you have configuration and change management.
Hopefully you have a complete and accurate documentation for every
component. Documentation is very important.
If everything you do throughout the entire organization is with HA in
(weyou can indeed achieve 5 9's. I know we do on most of our clusters here
possiblehave 30 clusters).
Cheers,
Rodney R. Fournier
MVP - Windows Server - Clustering
http://www.nw-america.com - Clustering Website
http://msmvps.com/clustering - Blog
http://www.clusterhelp.com - Cluster Training
ClusterHelp.com is a Microsoft Certified Gold Partner
"Simon" <Simon@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
news:382FFAA7-86D7-4B80-A2D6-08D4CCBDD2FA@xxxxxxxxxxxxxxxx
Hi guys
I just wanted to get an idea if anyone really believes that its
serversto
have a 99.999% uptime for a Win2k3 Cluster.
Our cluster has been quite unreliable and in fact our stand alone
:-)are well behaved compared to the cluster!
Any comments would be appreciated - its just to get an overall picture
Greetings
Simon
- Follow-Ups:
- Re: 99.999% Uptime For A Cluster - Real World Comments
- From: Don Wilwol
- Re: 99.999% Uptime For A Cluster - Real World Comments
- References:
- Re: 99.999% Uptime For A Cluster - Real World Comments
- From: Rodney R. Fournier [MVP]
- Re: 99.999% Uptime For A Cluster - Real World Comments
- From: Simon
- Re: 99.999% Uptime For A Cluster - Real World Comments
- From: John Toner [MVP]
- Re: 99.999% Uptime For A Cluster - Real World Comments
- Prev by Date: Re: File Server Clustering Question
- Next by Date: Re: 99.999% Uptime For A Cluster - Real World Comments
- Previous by thread: Re: 99.999% Uptime For A Cluster - Real World Comments
- Next by thread: Re: 99.999% Uptime For A Cluster - Real World Comments
- Index(es):
Relevant Pages
|