Skip to Content.
Sympa Menu

ndt-users - Re: client timeout msg

Subject: ndt-users list created

List archive

Re: client timeout msg


Chronological Thread 
  • From: tate <>
  • To: Richard Carlson <>
  • Cc: <>
  • Subject: Re: client timeout msg
  • Date: Tue, 16 Jun 2009 08:32:24 -0400

Rich, thanks for the quick reply. I should have mentioned in my original
post that, after scanning the ndt-users archive, I tried the testoptions.c
modification recommended below. Unfortunately, the problem persists after
that change ....

Thanks,
Tate



On Mon, 15 Jun 2009 16:37:39 -0500, Richard Carlson
<>
wrote:
> Hi Tate;
>
> I think I introduced s bug in the 3.5.0 package. Go to the src/
> testoptions.c file and scroll down to line 681. Uncomment out the
> wait(NULL); line and rebuild/reinstall the package. Let me know if
> that fixes things.
>
> Rich
>
> On Jun 15, 2009, at 2:51 PM, tate wrote:
>
>> All,
>> We're getting odd errors at the end of testing that indicate a client
>> timeout.
>> Code being executed is specifically line 1265 of Tcpbw100.java.
>> Furthermore, the Stats and More Information fields occasionally have
>> scant
>> information after running tests.
>> Throughput tests do seem to work, however, which makes things a bit
>> perplexing.
>>
>> OS is Ubuntu 9.04 running kernel 2.6.28
>> web100-2.5.23
>> web100_userland-1.7
>> ndt-3.5.0
>>
>> We do run a firewall on the ndt server, but have ICMP, and TCP/
>> 3001-3003
>> open. I have tested with the firewall turned off with the same
>> results.
>> Any help would be greatly appreciated.
>> Thanks,
>> Tate
>>
>>
>>
>> --- debug log of tests with errors ---
>>
>> /usr/local/sbin/web100srv --record --interface eth7 -ddd
>> ANL/Internet2 NDT ver 3.5.0
>> Variables file = /usr/local/ndt/web100_variables
>> log file = /usr/local/ndt/web100srv.log
>> Debug level set to 3
>> server ready on port 3001
>> web100_init() read 69 variables from file
>> nothing in queue
>> Timer not running, waiting for new connection
>> New connection received, waiting for accept() to complete
>> Telling client 21802 testing will begin now
>> Queue pointer = 21802, testing = 1, waiting = 1
>> Waiting for new connection, timer running
>> Starting test suite:
>>> Middlebox test
>>> Simple firewall test
>>> C2S throughput test
>>> S2C throughput test
>> <-- Middlebox test -->
>> -- port: 3003
>> Middlebox test, Port 3003 waiting for incoming connection (fd=3)
>> Set MSS to 536, Receiving Window size set to 33554432KB
>> Sending Window size set to 33554432KB
>> Server: a.b.33.140;3003 Client: x.y.234.48;3680 1260;-1;-1;
>> Sending 1260 Byte packets over the network
>> Signal 17 received by process 21802
>> now = 1245094141 Process started at 1245094138, run time = 3
>> Timer expired while waiting for a new connection
>> Queue pointer = 21802, testing = 1, waiting = 1
>> Waiting for new connection, timer running
>> now = 1245094144 Process started at 1245094138, run time = 6
>> Timer expired while waiting for a new connection
>> Queue pointer = 21802, testing = 1, waiting = 1
>> Waiting for new connection, timer running
>> <-------------------->
>> <-- Simple firewall test -->
>> -- port: 35641
>> -- time: 1
>> -- oport: 3681
>> SIGALRM was caught
>> SIGALRM was caught
>> Unable to create connect socket.
>> <-------------------------->
>> <-- C2S throughput test -->
>> -- port: 3002
>> listening for Inet connection on testOptions->c2ssockfd, fd=3
>> Sending 'GO' signal, to tell client to head for the next test
>> C2S test calling init_pkttrace() with pd=0xbfb3c488
>> Opening network interface 'eth7' for packet-pair timing
>> pcap_open_live() returned pointer 0x9bb87d0
>> installing pkt filter for 'host x.y.234.48 and port 3684'
>> Initial pkt src data = 80694a4
>> now = 1245094147 Process started at 1245094138, run time = 9
>> Timer expired while waiting for a new connection
>> Queue pointer = 21802, testing = 1, waiting = 1
>> Waiting for new connection, timer running
>> New packet trace started -- initializing counters
>> now = 1245094150 Process started at 1245094138, run time = 12
>> Timer expired while waiting for a new connection
>> Queue pointer = 21802, testing = 1, waiting = 1
>> Waiting for new connection, timer running
>> now = 1245094153 Process started at 1245094138, run time = 15
>> Timer expired while waiting for a new connection
>> Queue pointer = 21802, testing = 1, waiting = 1
>> Waiting for new connection, timer running
>> now = 1245094156 Process started at 1245094138, run time = 18
>> Timer expired while waiting for a new connection
>> Queue pointer = 21802, testing = 1, waiting = 1
>> Waiting for new connection, timer running
>> 93779 kbps outbound
>> libweb100: warning: accessing depricated variable AckPktsIn
>> libweb100: warning: accessing depricated variable AckPktsOut
>> Variable 13 (CwndRestores) not found in KIS
>> Variable 22 (MaxCaCwnd) not found in KIS
>> Variable 30 (MaxSaCwnd) not found in KIS
>> Signal USR1(10) sent to child [21805]
>> Signal 10 received by process 21805
>> 13:29:09.279041 x.y.234.48:3684 --> a.b.33.140:3002 Collected pkt-
>> pair
>> data max = 47766
>> 13:29:09.279041 a.b.33.140:3002 --> x.y.234.48:3684 Collected pkt-
>> pair
>> data max = 35709
>> 128 bytes read ' 0 1 1000 264 55 47766 44797 146 92 52 178 1 158.69
>> 0 0 0
>> 1 0' from monitor pipe
>> 128 bytes read ' 1 0 1 1236 51 35709 11212 37 46 12 187 1 23.33 24
>> 0 48468
>> 0 187' from monitor pipe
>> Signal 11 received by process 21805
>> Signal 17 received by process 21802
>> <------------------------->
>> <-- S2C throughput test -->
>> -- port: 3003
>> waiting for data on testOptions->s2csockfd
>> S2C test calling init_pkttrace() with pd=0xbfb3c488
>> Opening network interface 'eth7' for packet-pair timing
>> pcap_open_live() returned pointer 0x9bb8ec0
>> installing pkt filter for 'host x.y.234.48 and port 3685'
>> Initial pkt src data = 80694a4
>> Signal 17 received by process 21802
>> New packet trace started -- initializing counters
>> now = 1245094159 Process started at 1245094138, run time = 21
>> Timer expired while waiting for a new connection
>> Queue pointer = 21802, testing = 1, waiting = 1
>> Waiting for new connection, timer running
>> now = 1245094162 Process started at 1245094138, run time = 24
>> Timer expired while waiting for a new connection
>> Queue pointer = 21802, testing = 1, waiting = 1
>> Waiting for new connection, timer running
>> now = 1245094165 Process started at 1245094138, run time = 27
>> Timer expired while waiting for a new connection
>> Queue pointer = 21802, testing = 1, waiting = 1
>> Waiting for new connection, timer running
>> now = 1245094168 Process started at 1245094138, run time = 30
>> Timer expired while waiting for a new connection
>> Queue pointer = 21802, testing = 1, waiting = 1
>> Waiting for new connection, timer running
>> sent 144801792 bytes to client in 10.68 seconds
>> Buffer control counters Total = 17676, new data = 0, Draining Queue
>> = 0
>> Signal USR2(12) sent to child [21807]
>> Signal 12 received by process 21807
>> 13:29:19.418817 a.b.33.140:3003 --> x.y.234.48:3685 Collected pkt-
>> pair
>> data max = 5406
>> 13:29:19.418817 x.y.234.48:3685 --> a.b.33.140:3003 Collected pkt-
>> pair
>> data max = 28243
>> Read ' 0 0 2 68 100 5406 2247 40 33 29 25 1 99.61 0 0 0 1 0' from
>> monitor
>> pipe
>> Read ' 0 0 5 8 332 28243 22574 162 477 34 264 0 93.35 108 568 51423
>> 0 262'
>> from monitor pipe
>> 108499 kbps inbound
>> Variable 13 (CwndRestores) not found in KIS
>> Variable 22 (MaxCaCwnd) not found in KIS
>> Variable 30 (MaxSaCwnd) not found in KIS
>> Variable 13 (CwndRestores) not found in KIS
>> Variable 22 (MaxCaCwnd) not found in KIS
>> Variable 30 (MaxSaCwnd) not found in KIS
>> Signal 11 received by process 21807
>> Signal 17 received by process 21802
>> Protocol error!
>> S2C throughput test FAILED!
>> Client --> Server data detects link = FastEthernet
>> Client <-- Server Ack's detect link = FastEthernet
>> Server --> Client data detects link = FastEthernet
>> Server <-- Client Ack's detect link = FastEthernet
>> CWND limited test = 13957.00 while unlimited = 0.00
>> Better throughput when CWND is limited, may be duplex mismatch
>> Successfully returned from run_test() routine
>> Signal 17 received by process 21801
>> now = 1245094170 Process started at 1245094138, run time = 32
>> Queue pointer = 21802, testing = 1, waiting = 1
>> Received SIGCHLD signal for active web100srv process [21801]
>> wait3() returned 0 for PID=21802
>> wexitstatus = '0'
>> Timer not running, waiting for new connection
>>
>>
>> --
>> This message has been scanned for viruses and
>> dangerous content by MailScanner, and is
>> believed to be clean.
>>
>
> Richard Carlson
>
> 1000 Oakbrook Dr
> Ann Arbor, MI 48104
>
> P: 734-352-7043
> C: 630-251-4572

--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.




Archive powered by MHonArc 2.6.16.

Top of Page