Re: Server Shuts Down - HELP

Tech-Archive recommends: Fix windows errors by optimizing your registry



As I said a huge number of post ago it sounded like a thermal issue. What
gets me is why the Compaq monitoring software did not log an event or at
least a temp spike prior to a full crash. It should never just crash because
of thermal protection, the whole idea behind thermal protection is to stop
it from crashing by backing off the CPU cycles so they can cool down.

I'd give that cage a once over and make sure all fans are working, no major
dust bunnies blocking airflow, no cables in the way of airflow and heatsinks
fully contacting CPU's. BIOS update the board and make sure you've got the
latest monitoring software from Compaq. Also see if you can monitor the
temps and see what the spikes are now that you can keep it up as I'm very
interested in knowing why it crashed and didn't just back itself off.

Also please post or repost the specs on the system as to type of CPU's and
server model.


"Sal Candela" <SalCandela@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
news:572D3A6A-7972-4C37-B8B3-913588D487A7@xxxxxxxxxxxxxxxx
> Russ, Just wanted to give yao and everybody who helped out a status
report.
>
> Still up and running! That just over two days now!
>
>
> --
> Sal Candela
>
>
> "Sal Candela" wrote:
>
> > Russ,
> >
> > Yup, up since Sunday at 7:00 am EST, and still going. I need to give it
a
> > few days, but recently the server wouldn't stay up for more than day to
day
> > and a half. If all is still well by this weekend, I will replace the
temp
> > sensors (if replacable).
> >
> >
> > --
> > Sal Candela
> >
> >
> > "Russ Grover" wrote:
> >
> > > Ok Status?
> > >
> > > --
> > > Russ Grover
> > > Small Business IT Support
> > > 16086 SW Melinda St.
> > > Beaverton OR 97007-6303 USA
> > > Email: Sales at SmallBusinessITSupport.com
> > > Website: www.SmallBusinessITSupport.com
> > >
> > >
> > > "Sal Candela" <SalCandela@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
> > > news:EC8F4773-3852-43EE-BD69-412D9D3ECD53@xxxxxxxxxxxxxxxx
> > > > Russ,
> > > >
> > > > First things first - again thanks for your help and support.
> > > >
> > > > Second - Damn thing is still going!!
> > > >
> > > > Third - I really didn't have to go into BIOS to disable the setting.
> > > > Compaq
> > > > has monitoring software thats runs and from there I just clink on
disable
> > > > and
> > > > the setting was changed. I truely didn't want to reboot that damn
thing
> > > > one
> > > > more time. Stupid but true.
> > > >
> > > > But the setting that were set are
> > > >
> > > > I/O boards:49' C
> > > > CPU: 39' C
> > > >
> > > > The current temp is 32'C
> > > >
> > > > If the server stays up for more than a week, I will replace the
sensors.
> > > > If
> > > > it goes down again I will check to see what (if any) level it is set
at.
> > > >
> > > > I will also try the leving it in DOS mode and running the CPU test
all
> > > > night
> > > > long to see what is happening.
> > > >
> > > > Again thanks for checking up again.
> > > >
> > > > Ciao
> > > >
> > > > --
> > > > Sal Candela
> > > >
> > > >
> > > > "Russ Grover" wrote:
> > > >
> > > >> There's Usually like Levels 3 different temps
> > > >> Do you remember what Level it was at?
> > > >> Russ
> > > >>
> > > >> --
> > > >> Russ Grover
> > > >> Small Business IT Support
> > > >> 16086 SW Melinda St.
> > > >> Beaverton OR 97007-6303 USA
> > > >> Email: Sales at SmallBusinessITSupport.com
> > > >> Website: www.SmallBusinessITSupport.com
> > > >>
> > > >>
> > > >> "Sal Candela" <SalCandela@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in
message
> > > >> news:BCA8A463-ED0D-49F2-A487-0A20A4158C29@xxxxxxxxxxxxxxxx
> > > >> > WOW! I want to thank everybody for the support and help.
> > > >> >
> > > >> > I diaabled the temp shutdown in BIOS. The server has been up
since 7:00
> > > >> > am
> > > >> > EST. Considering recent history, this is a marathon.
> > > >> >
> > > >> > I will keep you all updated in the next few days.
> > > >> >
> > > >> > Thanks you all
> > > >> > --
> > > >> > Sal Candela
> > > >> >
> > > >> >
> > > >> > "Gary Karasik" wrote:
> > > >> >
> > > >> >> Sal,
> > > >> >>
> > > >> >> After re-reading all the posts here, I don't have a clear idea
if the
> > > >> >> server
> > > >> >> is shutting down or shutting off, by which I'm asking, Is it
doing an
> > > >> >> orderly shutdown as though you told it to manually or is it just
> > > >> >> turning
> > > >> >> off
> > > >> >> instantly as though it had lost power? The lack of event log
> > > >> >> errors--usually
> > > >> >> when the shut down is sudden, there's an "unexpected shutdown"
> > > >> >> event-log
> > > >> >> entry--suggests that it is an orderly shutdown, in which case
I'd lean
> > > >> >> toward software or some hardware-software combination. Did you
load
> > > >> >> any
> > > >> >> UPS-specific shutdown software? If not, have you checked the
Control
> > > >> >> Panel's
> > > >> >> Power Options settings to see if a shutdown is called for? If
there's
> > > >> >> some
> > > >> >> UPS-specific software, uninstall it. If the Power Options are
> > > >> >> configured,
> > > >> >> disable that service.
> > > >> >>
> > > >> >> GaryK
> > > >> >>
> > > >> >> "Sal Candela" <SalCandela@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in
message
> > > >> >> news:AEEB8EC0-A895-4980-86C3-CF3F0977F2A6@xxxxxxxxxxxxxxxx
> > > >> >> > Hello all,
> > > >> >> >
> > > >> >> > I'm hoping that someone here might be able to give good
advice.
> > > >> >> >
> > > >> >> > The problem:
> > > >> >> >
> > > >> >> > My server justs shuts down. No blue screens, no error
messages,
> > > >> >> > nothing
> > > >> >> > in
> > > >> >> > the event logs other than telling me that the prevois shutdown
was
> > > >> >> > unexcepted. I never get any error id. It is as if someone just
pulls
> > > >> >> > the
> > > >> >> > plug
> > > >> >> > out of the wall. I server maybe up for days (2 max so far) or
only a
> > > >> >> > few
> > > >> >> > hours. Today, it last almost 12 hours before going down.
> > > >> >> >
> > > >> >> > The server:
> > > >> >> >
> > > >> >> > I have a Compaq 6000 with 3.8 GB Ram, 2 Xeon 500 both with 2
MB. The
> > > >> >> > server
> > > >> >> > has three (3) power supplies that are each pluged into
seperate UPS.
> > > >> >> >
> > > >> >> > What I have tried:
> > > >> >> >
> > > >> >> > I have removed the serial cable from the UPS. I have replaced
all
> > > >> >> > three
> > > >> >> > power supplies. I have replaced the power baclplane that the
power
> > > >> >> > supplies
> > > >> >> > plug into. I have replaced the mother board. I have diag the
memory
> > > >> >> > (using
> > > >> >> > Compaq tools), and all checks ok (ran test for several hrs). I
have
> > > >> >> > replaced
> > > >> >> > the VRM that are used with XEON processors in Compaq systems.
I
> > > >> >> > server
> > > >> >> > is
> > > >> >> > not
> > > >> >> > shuting down for temp reasons (that would be loged). Nobody
can fool
> > > >> >> > with
> > > >> >> > the
> > > >> >> > on/off switch, or "kick the power cord loose".
> > > >> >> >
> > > >> >> > HELP:
> > > >> >> >
> > > >> >> > I can replaced the processors, since I have two spares, but I
think
> > > >> >> > I'm
> > > >> >> > waing my money and time. Any ideas?
> > > >> >> >
> > > >> >> > ANY IDEA's ?? Any Help ?? Should I use the server as a very
large
> > > >> >> > paperweight?
> > > >> >> > --
> > > >> >> > Sal Candela
> > > >> >>
> > > >> >>
> > > >> >>
> > > >>
> > > >>
> > > >>
> > >
> > >
> > >


.