Skip to Content.
Sympa Menu

perfsonar-user - Re: [perfsonar-user] (forgot) Perfsonar BW box q

Subject: perfSONAR User Q&A and Other Discussion

List archive

Re: [perfsonar-user] (forgot) Perfsonar BW box q


Chronological Thread 
  • From: Andrew Lake <>
  • To:
  • Cc: "" <>, Shawn McKee <>
  • Subject: Re: [perfsonar-user] (forgot) Perfsonar BW box q
  • Date: Wed, 20 May 2015 08:42:00 -0400

Hi Winnie,

CCing Shawn McKee, since I think you will need him to update the mesh
configuration file this host uses. I found a few things:

- Disabling the firewall got rid of the peer port issue. I think if you open
6001-6200 TCP and UDP you should be set to turn it back on.

- I was also able to reach your web page which allowed me to look at the mesh
config file you are using at
https://myosg.grid.iu.edu/pfmesh/mine/hostname/lcgnetmon02.phy.bris.ac.uk.
The way this is configured, you are only running BWCTL throughput tests to
hosts in the UK. You then have a very large mesh of traceroute tests you are
running. That means for a whole bunch of those hosts listed on the graph page
I wouldn’t expect any throughput data. My guess is that this is expected, but
just pointing it out in case it’s not.

- For the throughput tests to the UK, most of those are failing and I think i
see why (this is where we need Shawn). Your test configuration has the
omit_interval set to 5 seconds and the duration set to 30. This is actually a
35 second test which is above the time limit of 30 seconds BWCTL allows by
default. Thus the tests are getting rejected. This was actually a fix (I
believe made around the April 26th date you noted prior) as previously BWCTL
was only treating this as a 30 second test which was incorrect. The fix is to
have Shawn change the “duration” in the meshconfig to 25, which will make it
a true 30 second test within the BWCTL limits.

I think the summary is there are two problems 1) The firewall ports and 2)
the test duration + omit_interval exceeding the BWCTL limit after a recent
update. Hopefully resolving those will get things working.

Thanks,
Andy






> On May 20, 2015, at 6:55 AM, Winnie Lacesso
> <>
> wrote:
>
>
> Thank you Andy, this is very helpful!
>
> Earlier:
>> I can't seem to reach lcgnetmon02.phy.bris.ac.uk or run bwctl tests to it
>> from any host. My guess would be you have some firewall issues. The red
>
> For a time, the entire BW box firewall is shutdown. No firewall!
> (eeeek!)
>
> Can you try reaching lcgnetmon02.phy.bris.ac.uk or run bwctl tests to it
> again?
>
> While no firewall, still errors of the kind you named are being seen in
> /var/log/perfsonar/regular_testing.log:
>
> 2015/05/20 11:53:17 (3326) ERROR> MeasurementArchiveChild.pm:125
> perfSONAR_PS::RegularTesting::Master::MeasurementArchiveChild::__ANON__ -
> Problem handling test results: Problem storing results: Error writing
> metadata: Error running test from perfsonar02.datagrid.cea.fr to
> lcgnetmon02.phy.bris.ac.uk with output bwctl: start_endpoint:
> 3628747485.110260
> bwctl: run_endpoint: receiver: 137.222.171.39
> bwctl: run_endpoint: sender: 192.54.207.251
> bwctl: Unable to initiate peer handshake with [137.222.171.39]:6033 -
> canceling
> bwctl: stop_endpoint: 3628747487.112685
>
> So how can the firewall be the problem, if it's off?
>
> Can you advise how further to debug what the problem is, if it's not
> firewall?
>
> Very Grateful!!
>
>




Archive powered by MHonArc 2.6.16.

Top of Page