perfsonar-user - RE: [perfsonar-user] Regular testing stops and box loses global registration

Subject: perfSONAR User Q&A and Other Discussion

  • From: "Kern, Paul" <>
  • To: Trey Dockendorf <>
  • Cc: "" <>
  • Subject: RE: [perfsonar-user] Regular testing stops and box loses global registration
  • Date: Tue, 16 Dec 2014 20:41:43 +0000
  • Accept-language: en-US

I see the same thing here. Below is what I am seeing in my
/var/log/perfsonar/regular_testing.log, before the service failed:

2014/12/15 15:22:37 (3670) DEBUG> Master.pm:198 perfSONAR_PS::RegularTesting::Master::handle_child_exit - Received SIGCHLD for PID: 3676
2014/12/15 15:22:37 (3670) DEBUG> Master.pm:198 perfSONAR_PS::RegularTesting::Master::handle_child_exit - Received SIGCHLD for PID: 3675
2014/12/15 15:22:37 (3670) DEBUG> Master.pm:198 perfSONAR_PS::RegularTesting::Master::handle_child_exit - Received SIGCHLD for PID: 3679
2014/12/15 15:22:37 (3670) DEBUG> Master.pm:198 perfSONAR_PS::RegularTesting::Master::handle_child_exit - Received SIGCHLD for PID: 3674
2014/12/15 15:22:37 (3670) DEBUG> Master.pm:258 perfSONAR_PS::RegularTesting::Master::handle_exit - Process 'perfSONAR_PS Regular Testing' exiting
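
For anyone comparing notes on this, here is a rough Python sketch for pulling the SIGCHLD and exit entries out of a log in the format quoted above. It is not part of perfSONAR; the function names (parse_entries, summarize) are made up for illustration, and the regular expression assumes every entry looks like the ones shown here (timestamp, master PID in parentheses, level, source location, function name, message).

```python
import re

# Timestamp format as it appears in the quoted entries, e.g. 2014/12/15 15:22:37
STAMP = r"\d{4}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}"

# One entry: timestamp, (master PID), LEVEL>, file:line, function, " - ", message.
# The message runs until the next timestamp or the end of the text.
ENTRY = re.compile(
    rf"({STAMP}) \((\d+)\) (\w+)> (\S+) (\S+) - (.*?)(?= {STAMP} \(|$)"
)


def parse_entries(raw):
    """Split raw log text into entries, tolerating mail-style line wrapping."""
    text = " ".join(raw.split())  # collapse newlines and runs of spaces
    return [
        {
            "time": m[1],
            "pid": int(m[2]),
            "level": m[3],
            "where": m[4],
            "func": m[5],
            "msg": m[6],
        }
        for m in ENTRY.finditer(text)
    ]


def summarize(raw):
    """Return (child PIDs that raised SIGCHLD, timestamps of master exits)."""
    entries = parse_entries(raw)
    children = [
        int(e["msg"].rsplit(" ", 1)[1])
        for e in entries
        if e["func"].endswith("::handle_child_exit")
    ]
    exits = [e["time"] for e in entries if e["func"].endswith("::handle_exit")]
    return children, exits
```

Calling summarize() on the contents of regular_testing.log gives the child PIDs that exited and the times at which the master process logged that it was exiting, which makes it easier to line the exits up against other logs.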

I also see the 'duplicate checksum' entries in ls_registration_daemon.log.
Not sure if that is normal behavior or not. It looks like it has been going
on for quite some time.

Paul

From: Trey Dockendorf
Sent: Tuesday, December 16, 2014 2:32 PM
To: Kern, Paul
Subject: Re: [perfsonar-user] Regular testing stops and box loses global registration

Thought I'd add that I'm seeing this on two of my perfsonar boxes too.  So
far the regular testing seems to show as "Not Running" more frequently on the
host performing bandwidth tests than the host performing latency tests.

Below are last entries in /var/log/perfsonar/regular_testing.log.1 before I
manually restarted the regular testing daemon.

2014/12/11 00:29:14 (2299) DEBUG> Master.pm:198 perfSONAR_PS::RegularTesting::Master::handle_child_exit - Received SIGCHLD for PID: 2304
2014/12/11 00:29:14 (2299) DEBUG> Master.pm:198 perfSONAR_PS::RegularTesting::Master::handle_child_exit - Received SIGCHLD for PID: 2305
2014/12/11 00:29:14 (2299) DEBUG> Master.pm:198 perfSONAR_PS::RegularTesting::Master::handle_child_exit - Received SIGCHLD for PID: 2303
2014/12/11 00:29:14 (2299) DEBUG> Master.pm:258 perfSONAR_PS::RegularTesting::Master::handle_exit - Process 'perfSONAR_PS Regular Testing' exiting

The global registration service seems to get hung on both systems equally.  I
see no obvious errors in /var/log/perfsonar/ls_registration_daemon.log.  I do
notice that there are a few entries that mention "Duplicate checksum".
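
To see how often and since when those entries show up, something like the following Python sketch could be run against the registration log. It is only a plain case-insensitive text search; it assumes nothing about the format of ls_registration_daemon.log beyond the phrase "Duplicate checksum" appearing on the line, and the function name find_mentions is made up for illustration.

```python
import sys


def find_mentions(lines, needle="duplicate checksum"):
    """Return the lines that contain the needle, ignoring case."""
    needle = needle.lower()
    return [line.rstrip("\n") for line in lines if needle in line.lower()]


# Usage (the path is whatever log file you want to scan):
#   python find_mentions.py /var/log/perfsonar/ls_registration_daemon.log
if __name__ == "__main__" and len(sys.argv) > 1:
    with open(sys.argv[1], errors="replace") as fh:
        hits = find_mentions(fh)
    print(f"{len(hits)} matching lines")
    if hits:
        print("first:", hits[0])
        print("last: ", hits[-1])
```

Comparing the first and last matching lines against the times regular testing stopped would show whether the two are related or whether the messages are just background noise.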

- Trey


=============================

Trey Dockendorf 
Systems Analyst I 
Texas A&M University 
Academy for Advanced Telecommunications and Learning Technologies 
Phone: (979)458-2396 


On Tue, Dec 16, 2014 at 1:06 PM, Kern, Paul wrote:
Hello all,
 
I am dealing with an issue in which regular testing on our perfsonar box
(3.4.2) stops sporadically, and the box appears to lose its global
registration – global registration changes from “yes” to “no”.  I have not
seen anything in the logs that would indicate why this is happening, but I am
a novice with this tool and I may be missing something.  Restarting the
services seems to resolve the issue temporarily, but it always crops back up.
 
Regards,
 
Paul Kern
South Dakota Board of Regents
 



