Very slow SMB performance on one interface of a multi-homed server
- From: usenet@xxxxxxxxxxxxxxxxx
- Date: 6 Apr 2007 20:43:04 -0700
General problem:
Seven multi-homed hosts are connected to a fast ethernet office LAN on
one interface and a private gigabit network on the other. SMB
communication (tested with Windows Explorer) on the private network is
extremely slow for three of the machines.
Details:
Of the seven hosts, four are Windows 2000 server and three are XP.
The issue is present on two of the Win2k machines and one of the XP
machines. Each of the machines exhibiting the problem is identical in
terms of hardware, O/S build, and even installed programs to at least
one of the other machines which does not exhibit the problem.
Each machine has a static address on the office LAN (10.20.30.x) and a
static address on the private network (172.20.30.x). Four of the
machines have a third adapter with an address of 192.168.30.x which I
will discuss later.
The office LAN is configured with two domains in the DNS search order,
WINS servers, etc. As might be expected, the gateway is 10.20.30.1.
The private network has no name server and no gateway, nothing except
the 7 hosts attached. There is no attempt to route between the two
networks, and at this time no attempt to get name resolution working
on the private network.
The NIC on the private network is bound first on all hosts.
Some information I have found on the web indicates that setups similar
to the above just won't work. However, the cluster used to have only
four hosts and we had a parallel setup that worked flawlessly. The
original four had Qlogic fiber channel HBAs, and the Qlogic adapters
were configured as an IP interface. The private fiber channel network
was configured exactly as the private gigabit network is now and
worked perfectly with no adjustments to anything. Users could simply
point Windows Explorer to \\192.168.30.y\share and immediately get a
near-gigabit speed connection. The fiber channel "NIC" was bound
before the one connected to the office LAN (and is now bound in
between the gigabit network and the office LAN).
Buying another fiber channel switch and more HBAs is not a practical
solution to this problem, although everything I have seen so far
indicates that would actually solve my issue - at great cost. I
naiively assumed that it would be similarly easy to use a gigabit
ethernet network to accomplish what was possible and in fact quite
easy over the fiber channel equipment.
The fiber channel setup still exists on four of the hosts. If a host
is affected, connections to all of the other 6 are affected, but the
problem is not reciprocal. Host x may exhibit the slow link when
connecting to \\172.20.30.y, but host y will not exhibit the slow link
connecting to \\172.20.30.x. Connections using the office LAN and,
for four of the hosts, the fiber channel network are fine.
Troubleshooting steps tried so far:
1. Verified that the switch and all cables are 100% functional. Ran
all NICs through diagnostics (all passed).
2. Verified that there are no errors written to the event log and no
TCP/IP errors (there aren't).
3. Compared the network configurations between the hosts (all have
the same settings).
4. Profiled the problem connections with Network Monitor (results
described below).
5. Tried to compare the registry settings related to the fiber
channel NICs and the gigabit NICs (inconclusive).
6. Added entries to the HOSTS file (did not help).
7. Tried disabling the adapter connected to the office LAN on the
problem hosts. When I do this the gigabit network and the fiber
channel network are effectively crippled. Any attempt to connect to
another host results in (after 10 or 15 seconds) the error message:
"there are currently no logon servers available to service the logon
request". I know this should be a clue to me, but I do not know
enough to interpret it.
Network Monitor results:
Profiling a copy of a large file over the gigabit network one of the
problem hosts reveals that the following is happening: A session will
be negotiated as normal, then a few dozen packets will be exchanged.
After a 5-6 second pause, a few hundred more packets and acks will
travel over the network, then another 5-6 second pause will begin.
This packets-pause cycle repeats until the copy is finished. It
appears as though there is some internal timeout which keeps getting
invoked, and every time it does the pauses occur. When the connection
is not in a "paused" state, the packets are moving at the expected
speed and there are no errors. There is also no unusual activity on
the other adapters while all of this is taking place.
Any suggestions would be greatly appreciated. Thank you for your time
- PK
.
- Follow-Ups:
- Re: Very slow SMB performance on one interface of a multi-homed server
- From: Arkady Frenkel
- Re: Very slow SMB performance on one interface of a multi-homed server
- Prev by Date: Windows 2003 VB 6 COM Control
- Next by Date: plz help to create a windows server 2003 domain member user
- Previous by thread: Windows 2003 VB 6 COM Control
- Next by thread: Re: Very slow SMB performance on one interface of a multi-homed server
- Index(es):
Relevant Pages
|
|