Re: Server Shuts Down - HELP
- From: "Russ Grover" <russ@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>
- Date: Sat, 4 Jun 2005 21:41:16 -0700
Now that I'm thinking..., Maybe take off the CPUs and put some new grease on
them.
(That would be easy/cheap thing to do.)
You can then see if they are DRY...
Russ
--
Russ Grover
Small Business IT Support
16086 SW Melinda St.
Beaverton OR 97007-6303 USA
Email: Sales at SmallBusinessITSupport.com
Website: www.SmallBusinessITSupport.com
"James B" <nospam@xxxxxxxx> wrote in message
news:uebK$cYaFHA.2664@xxxxxxxxxxxxxxxxxxxxxxx
> Unless you stick a household fan blowing directly into the case it is wise
> to never run most box maker systems without the case fully on as they
> design
> airflow with the case on, not off. Now Intel designed (or just signed off
> on) some great cases that when you took the case off you still had plastic
> air ducts sending the airflow right to where it was needed.
>
> As far as logging a heat event I would not take Compaq's word for it that
> if
> it overheated you would get a log event and graceful shutdown. I'm sure
> you've got some tools from them though to monitor the case temps and CPU's
> so I would start there and see if you are getting any spikes during the
> day
> that are way above normal. The CPU's should back themselves off if they
> start overheating but you may want to check that your CPU's support
> thermal
> management.
>
> In my experience random shutdowns come from:
>
> 1. Power loss or bad power supply (duh)
> 2. Overheating
> 3. Driver conflicts (set the OS not to reboot so you can see the blue
> screen)
> 4. Memory issues
> 5. Bus issues
>
> On the memory the best bet is to make sure your memory is on the certified
> list for the board or server. Drop it down to a identical pair and see if
> your stable, maybe it's a conflict between the DIMMS, timing, heat,
> manufacturer, etc.
>
> Bus issues can cause a shutdown but you can usually point to an event that
> causes it. Examples would be you crash during a backup where your driving
> a
> ton of data out the PCI or in one case we had dual NICS on a PCI bus that
> worked great until under load and then the bus would freak which then
> caused
> the OS to freak (technical term instead of using the word crash, customers
> hate hearing that word).
>
>
> "Sal Candela" <SalCandela@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
> news:495C06B2-EE49-49FB-B46C-FC04DB626AB0@xxxxxxxxxxxxxxxx
>> Russ,
>>
>> I thought the same thing, but the server never gets close to shut down
> temp.
>> If that was the case, it would be at night when the main A/C is off, and
> onlt
>> the small A/C for that room is running, but it has happen mid-day when
>> the
>> room is at 70'. Also, Compaq comfirms that if the server shut down that
> way,
>> I would get a "gracefully shutdown", and an event log of that.
>>
>> As for running it with the side case off, There is a BIG warning sticker
>> inside that says never run the server with the sides off. Is it a big
> deal?
>>
>> If you like you can call me - I love to hear any advice you might have.
>>
>> --
>> Sal Candela
>>
>>
>> "Russ Grover" wrote:
>>
>> > I know you said you tested the Memory, but Memory is Picky...
>> >
>> > Boy it does sound like an Over heating...
>> > Have you run it with the side case off as a test?
>> >
>> > Russ
>> >
>> > --
>> > Russ Grover
>> > Small Business IT Support
>> > 16086 SW Melinda St.
>> > Beaverton OR 97007-6303 USA
>> > Email: Sales at SmallBusinessITSupport.com
>> > Website: www.SmallBusinessITSupport.com
>> >
>> >
>> > "Sal Candela" <SalCandela@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
>> > news:AEEB8EC0-A895-4980-86C3-CF3F0977F2A6@xxxxxxxxxxxxxxxx
>> > > Hello all,
>> > >
>> > > I'm hoping that someone here might be able to give good advice.
>> > >
>> > > The problem:
>> > >
>> > > My server justs shuts down. No blue screens, no error messages,
> nothing in
>> > > the event logs other than telling me that the prevois shutdown was
>> > > unexcepted. I never get any error id. It is as if someone just pulls
> the
>> > > plug
>> > > out of the wall. I server maybe up for days (2 max so far) or only a
> few
>> > > hours. Today, it last almost 12 hours before going down.
>> > >
>> > > The server:
>> > >
>> > > I have a Compaq 6000 with 3.8 GB Ram, 2 Xeon 500 both with 2 MB. The
>> > > server
>> > > has three (3) power supplies that are each pluged into seperate UPS.
>> > >
>> > > What I have tried:
>> > >
>> > > I have removed the serial cable from the UPS. I have replaced all
> three
>> > > power supplies. I have replaced the power baclplane that the power
>> > > supplies
>> > > plug into. I have replaced the mother board. I have diag the memory
> (using
>> > > Compaq tools), and all checks ok (ran test for several hrs). I have
>> > > replaced
>> > > the VRM that are used with XEON processors in Compaq systems. I
>> > > server
> is
>> > > not
>> > > shuting down for temp reasons (that would be loged). Nobody can fool
> with
>> > > the
>> > > on/off switch, or "kick the power cord loose".
>> > >
>> > > HELP:
>> > >
>> > > I can replaced the processors, since I have two spares, but I think
> I'm
>> > > waing my money and time. Any ideas?
>> > >
>> > > ANY IDEA's ?? Any Help ?? Should I use the server as a very large
>> > > paperweight?
>> > > --
>> > > Sal Candela
>> >
>> >
>> >
>
>
.
- References:
- Server Shuts Down - HELP
- From: Sal Candela
- Re: Server Shuts Down - HELP
- From: Russ Grover
- Re: Server Shuts Down - HELP
- From: Sal Candela
- Re: Server Shuts Down - HELP
- From: James B
- Server Shuts Down - HELP
- Prev by Date: Re: Server Shuts Down - HELP
- Next by Date: Exchange endless loop
- Previous by thread: Re: Server Shuts Down - HELP
- Next by thread: Re: Server Shuts Down - HELP
- Index(es):
Relevant Pages
|
Loading