Skip to Content.
Sympa Menu

perfsonar-user - Re: [perfsonar-user] Still looking at pS-SB scheduled BWCTL tests

Subject: perfSONAR User Q&A and Other Discussion

List archive

Re: [perfsonar-user] Still looking at pS-SB scheduled BWCTL tests


Chronological Thread 
  • From: "Jeff W. Boote" <>
  • To: Richard Carlson <>
  • Cc: Aaron Brown <>, John F Bigrow <>, Shawn McKee <>,
  • Subject: Re: [perfsonar-user] Still looking at pS-SB scheduled BWCTL tests
  • Date: Wed, 25 Feb 2009 12:27:47 -0700

Then initially, I would suggest that the BNL host is not even contacting the chicago host. (At least not the one that corresponds to this log file.) Otherwise, there would be more connections logged.

jeff

On Feb 25, 2009, at 12:20 PM, Richard Carlson wrote:

Jeff;

These are the log files form the receiving side. The pS-SB is running on the BNL node.

John, would you please create a tarball of the /var/log dir on lhcmon and email/post it on a URL?
Thanks.

Rich

On Feb 25, 2009, at 12:32 PM, Jeff W. Boote wrote:

Rich - in looking at that log dir, I don't see any of the regularly scheduled tests... Only some on-demand ones. These logs don't seem to match the data you describe below... where are you seeing this data?

jeff

On Feb 25, 2009, at 10:21 AM, Richard Carlson wrote:

Hi Aaron;

Charles created a tarball with a complete dump from the /var/log directory on the UC machine. Unfortunately I can't tell what is causing the tests to fail. I can see requests coming in from the BNL node, but nothing jumps out at me as to why the test failed. What should we be looking for? (I can also point you at the tarball if that would help).

Rich

On Feb 25, 2009, at 9:02 AM, Aaron Brown wrote:


On Feb 25, 2009, at 9:57 AM, Richard Carlson wrote:

Hi John;

Thanks for the logs. I see from the log extract that your server is running 15 sec tests. However I see other 10 & 60 sec requests, so I'm assuming that the 10 sec tests are from people using the command line version (10 sec is the default test period) and some other servers are scheduling tests via the perfSONAR-Buoy interface on their machine.

Here's what I see right now. Testing to/from MSU is working, but we are loosing data. The current graphs show 7 successful tests from MSU --> BNL starting at 1:47 am and ending at 8:27. The 4:45 am test is missing. The graphs also show 4 tests in the opposite direction missing 1, 3, & 6 am.

I also see tests to/from OU with 4 tests from OU -> BNL at 1, 2, 7, & 9 am with 3 in the opposite direction at midnight, 1, & 7 am. The rest were unsuccessful.

I did notice that you are testing to the psum01 node, which I think Shawn setup as the delay server so that's why none of those work. I think you want to test to the psum02 node.

I'm still wondering if your server is just trying to run too many tests (19 peers in your config) and I don't know how many other servers are requesting tests. As I see it, we can try reducing the number of peers, or we can get the pS folks to help guide us on determining why some tests are failing and why some peers aren't responding at all.

Jeff/Aaron, what log files should we be looking at to determine what is going on?

I'd take a look in /var/log/messages, the output should be going there.

Cheers,
Aaron

Richard Carlson

1000 Oakbrook Dr
Ann Arbor, MI 48104

P: 734-352-7043
C: 630-251-4572



Richard Carlson

1000 Oakbrook Dr
Ann Arbor, MI 48104

P: 734-352-7043
C: 630-251-4572





Archive powered by MHonArc 2.6.16.

Top of Page