Skip to Content.
Sympa Menu

perfsonar-user - Re: [perfsonar-user] throughput test timeouts

Subject: perfSONAR User Q&A and Other Discussion

List archive

Re: [perfsonar-user] throughput test timeouts


Chronological Thread 
  • From: Michael Thomas <>
  • To: Mark Feit <>, "" <>
  • Subject: Re: [perfsonar-user] throughput test timeouts
  • Date: Tue, 8 Mar 2022 11:52:13 -0600
  • Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=caltech.edu; dmarc=pass action=none header.from=caltech.edu; dkim=pass header.d=caltech.edu; arc=none
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=DwBW7gQCJHOskO2+RxFB7xXuRLC9Z3bBZgKQXGog2FQ=; b=OMmgPuO4eOcI+w27eVFREYeGjiMyHqhUOIr7NzSiqblCeRjEKlNAt782EgGb/VO0UxSm8vbkxC0PN2KF2wbYdraKSogYi0Lr8Gbr4NetVQDb+XO5xdH/8/WwqQ9M8ZsXZtzLaUpqcFdBP848hyQBpwTM9f3iKLjZwCklNhoHjT96WZLSjA1hxePxJpB0ERclqxymI0GJAG1v4afNc2Jbmzx52T4gRjls+ZdAX+Vx5cOOsqGuSJM1/1fS5LkoMX+kWrkvPDM5s1tA6EsxKULB7Iy7orfDLXh/I42ZTQrYi/WCcu1Bo0SxmUpKKp9R3miK0mOOCVyajPZ5hvlrwx149w==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=LxYn/qN4EsokTj2fFyKO6bLpaojqxzdeGgJcYWhiusetx7m9vF2dnqC3ApGYf5xTTJmDzMy61EZyiI58ppBW9Te0MzdgU+wC/DA9nVnUasI8a8SHmKD+O8a2kTihfU/j1ADC/0X1u892cmVhP71hEy1VjKlwVcVKWjkfREZ3a/7N08bOjGJUnkg6Nn7dP1BEw+fPdTwzoJ9Et3BZFPOIg6rXLaDImvkG8sV8GKnmAns3sjILEBDxmqYaPadRjLzdZ4/Xoo5xgEgUswwH6Q8QLj5L6s5XixesGiAt9BxdYdv8Wk+ljx8YpE2i5ylMWHzC2BdArB0Ubp23xe9kXgQlag==

On 3/8/22 11:41, Mark Feit wrote:
Michael Thomas writes:

I recently set up a perfsonar server and configured it to run throughput
tests with two other sites in our group. The iperf3 tests from my site
to the remote locations works fine, but all incoming tests fail…

Error from perfsonar.ligo-wa.caltech.edu:
iperf3 output has neither 'sum_sent' nor 'sum' field, and one of them
is required

This part’s a bug. The plugin should have detected the error before looking
for real data in the output. I’ve opened a ticket on that and will fix it in
5.0: https://github.com/perfsonar/pscheduler/issues/1270.

Diagnostics from perfsonar.ligo-wa.caltech.edu:
/usr/bin/iperf3 -p 5201 -B 198.129.208.245 -c 208.69.128.92 -t 10
--json --rsa -public-key-path
/var/run/pscheduler-server/runner/tmp/tmp_j91baqu/tmpumiughjb/public-key
--username <username redacted>

Unrelated note: There’s no need to redact those; they’re only valid for one
measurement.

Good to know.

{
"error": "test authorization failed"
}

How can I further debug this authorization failure?

The fault is at the far end (208.69.128.92, ligo-la). I did a telnet to port
5201 on that host and got an answer, which is strange because there was no
throughput test scheduled when I did it. I followed it up with an attempt to
run iperf3 against it and got a similar authorization failure. Odds are
very, very good there’s a stray iperf3 process in server mode on that host.
If you’d like an additional set of eyes while you poke around on that system,
drop me a line off-list and we’ll set something up.

The production code will attempt to hunt down and terminate processes that
appear to be orphaned iperf3 servers, but it isn’t perfect. The latest
version of iperf3 has some switches that will let us prevent the server
process from running past the scheduled end time of the measurement and
there’s already code in the pipeline for 5.0 to take advantage of it.

Hi Mark,

This is also what I see. There is a stray iperf3 process that sits around on the ligo-la host after a scheduled throughput test completes. I can manually kill it, but the next throughput test (scheduled or manual) starts up another that does not exit when the test is complete.

--Mike



Archive powered by MHonArc 2.6.24.

Top of Page