Server hang up



Hi

For Server hangs; we need to explore a little bit more:
- at the point of hang is the keyboard responsive(can we flip the number
lock)?
- can you ping the server or access shares?
- do the hangs occur during/around a specific time of the day?
- antivirus or backup applications you use; is the hang clashing with some
scheduled event?
- in the processes list do we have any instances of ctfmon.exe?

You wont be able to pick something off the event log just like that; you
need to explore for event id 2019's or 2020's indicating a memory leak,
3034's for MRXSMB errors, 10000 series for DCOM errors, so on so forth.

I would disagree with Vera in calling a server underdimensioned till the
point a user starts feeling the lag in the session. Also would advice
against the hotfixes till the point we know the real problem because hotfixes
are not generic they are made to address very specific problems though there
is nothing wrong in getting the latest and greatest files.

As indicated by Vera Perfmon would be a nice place to start; but i would
stress on Remote Perfmon as the box would feel undue stress because perfmon
itself might act as a burden. ;)

The following article would certainly make a good read:
http://www.microsoft.com/resources/documentation/windows/2000/professional/reskit/en-us/part6/proch27.mspx

The counters we might be interested are:
- Process
- Processor
- Memory
- Network interface
- Object
- Thread
- Disk

Make sure we make it all instances because once we have identified the
performance object where the spike is occurring; we need to go under the
object and identify the actual process.

For kernel mode leaks we might need tools like poolmon and if nothing works
out we might need to go to the extent of a complete memory dump. At this
point you might have to engage Microsoft PSS.

In addition to all this I would certainly recommend driver updates as they
are the most common culprits for memory leaks and this includes filter
drivers for your antivirus and backup applications.

If you are a consultant then Microsoft Product Support reporting tool would
come handy for data gathering:
http://www.microsoft.com/downloads/info.aspx?na=46&p=13&SrcDisplayLang=en&SrcCategoryId=&SrcFamilyId=cebf3c7c-7ca5-408f-88b7-f9c79b7306c0&genscs=&u=http%3a%2f%2fdownload.microsoft.com%2fdownload%2fb%2fb%2f1%2fbb139fcb-4aac-4fe5-a579-30b0bd915706%2fMPSRPT_SETUPPerf.EXE

What does the above tool do?
- http://support.microsoft.com/default.aspx?scid=kb;en-us;818742

Hope this helps. I dont get to frequest this place too often but feel free
to contact me at supersatya@xxxxxxxxx
.



Relevant Pages

  • Re: nfsrvstats.srvrpc_errs rapidly increasing
    ... > The srvrcp_errs are very likely unrelated to the hangs. ... > friends) from the NFS server routines in nfs_serv.c. ... other clients may behave strangely. ...
    (freebsd-net)
  • Re: SBS 2003 Hung on " Applying computer settings"
    ... > SBS 2003 system hung on "Applying computer settings" like a few other ... I shut down the server last night to avoid a lightning ... Workstations can't access server, can't ... On restart it still hangs the same only a little more ...
    (microsoft.public.windows.server.sbs)
  • Re: 6-stable locking problem
    ... There's a problem with a very busy server (ad server, ... The server hangs one or more times a day. ... Sleeping thread (tid 100006, pid 4) owns a non-sleepable lock ...
    (freebsd-stable)
  • socket_select() hangs sometimes; Bug?
    ... socket_selectfor accepting new connections and dealing with existing ... The select runs on all connected sockets for read and ... the server hangs after some time. ...
    (php.general)
  • Re: System hang due to high iowait
    ... The problem is that the server hangs. ... but authentication fails because the daemon died. ... but it happens with small or large lists. ...
    (RedHat)