Skip to Content.
Sympa Menu

perfsonar-user - Re: [perfsonar-user] Re: meshconfig-agent-tasks not scheduling tasks regularly

Subject: perfSONAR User Q&A and Other Discussion

List archive

Re: [perfsonar-user] Re: meshconfig-agent-tasks not scheduling tasks regularly


Chronological Thread 
  • From: Mark Feit <>
  • To: Casey Russell <>
  • Cc: Larry Blunk <>, "" <>
  • Subject: Re: [perfsonar-user] Re: meshconfig-agent-tasks not scheduling tasks regularly
  • Date: Fri, 20 Oct 2017 20:39:16 +0000
  • Accept-language: en-US
  • Authentication-results: kanren.net; dkim=none (message not signed) header.d=none;kanren.net; dmarc=none action=none header.from=internet2.edu;
  • Ironport-phdr: 9a23:SmpTWRE3buAFeOqIZaaUR51GYnF86YWxBRYc798ds5kLTJ7yo8ywAkXT6L1XgUPTWs2DsrQf2rqQ6/iocFdDyK7JiGoFfp1IWk1NouQttCtkPvS4D1bmJuXhdS0wEZcKflZk+3amLRodQ56mNBXdrXKo8DEdBAj0OxZrKeTpAI7SiNm82/yv95HJbQhFgDmwbaluIBmqsA7cqtQYjYx+J6gr1xDHuGFIe+NYxWNpIVKcgRPx7dqu8ZBg7ipdpesv+9ZPXqvmcas4S6dYDCk9PGAu+MLrrxjDQhCR6XYaT24bjwBHAwnB7BH9Q5fxri73vfdz1SWGIcH7S60/VC+85Kl3VhDnlCYHNyY48G7JjMxwkLlbqw+lqxBm3oLYfJ2ZOP94c6jAf90VWHBBU95RWSJfH428c4UBAekPPelaronyu1QBoACkCgWwAePi0CNEimP00KA8zu8vERvG3AslH98Wt3rbts/1NKQPWu610qbIzCnDZO5R1Df45ojHbBEhoe2XULJxd8rR1VcgFxnDjlqOtYzpISmZ2foQvGiG9udtU/+khWAgqwF0uDevx8Esh5HGhoIU1lDE9Th5z50vKdKkT057ZMaoEJhKuCGcLYt5XMUiT3tuuCkk1r0Lv4OwcisSyJk/2RLQceCLf5WN7x7+SeqdPDJ1hHxqdb6jmxq/9EqtxfPzW8Wp1VtHqzRJnsXQunwVyhDf9suKRuFj8kqiwzqDyQ/e5+VeLUwpiKbXNpgsyaMqmJUJq0TMBCr2lV32jKCIckUk/fCl5fz7b7vhupOQKpZ4hxzmPKkgg8C/Bv83PRYUU2ic5OS8yKbs/UrkQLVMk/I6iLHZsIrdJcQHuKG2HxNV0ock6xa5FTum18kYnWUDLFJCfxKHjJLlNE3JIPD9Ffu/glKsnyl3x/3eILHuGInBImXGnbv8YLpx9ktRyAQ8wNxD+55ZD7MML+z8V0PssdHVCwE1PxCoz+r/DdVyzIIeWWaBAq+DN6PStEeF5uchI+aSZY8VpC3wK/kj5/7yk3A5g1kdcre13ZcJcny3AOlpI1iBbXr2ntgBCXsKvhY5TOHykF2NTyRTZ3ipX6I74DE0EpimAZ7eRoC2nrOBxjy2HplXZmBdFlCMCmnke5+FW/cKdCKdPNVhkjoaWri9VYMtzw+huxLny+kvEu2B0SQDuIOr7sVu/ODXkVlm/iZpFN+Q12WlTGhyhG4OATk7wPYsj1Z6zwKm2LJ7y9JVFMAbs/ZHXwYmHZ/a0+FgDd3uAETMcsrfGwXuecmvHTxkFoF5+NQJeUsoXoz61h0=
  • Spamdiagnosticoutput: 1:0

Casey Russell writes:

 

    One of my hosts (ps-ku-bw) has failed to schedule tasks today.  This is one of my larger hosts and the MaxClients problem might have actually been the trigger that began the avalanche.  I've left the host broken in case Mark or one of the other developers wants information from it while it's in this failed state.

 

Sorry it took me so long to get to this; I had some catching up to do after TechEx.

 

As of right now, it looks like that host is fine.  I watched it for several minutes using the monitor (pscheduler monitor --host ps-ku-bw.perfsonar.kanren.net) and saw lots of streaming latency running plus a steady diet of trace and the occasional throughput.

 

 At 9:47am yesterday, the httpd error log showed the following:

 

[root@ps-ku-bw crussell]# tail -f /var/log/httpd/error_log

[Wed Oct 18 09:47:29 2017] [error] server reached MaxClients setting, consider raising the MaxClients setting

 

Since pScheduler depends on Apache to do scheduling, I could see this being the cause of runs not being scheduled.  We should focus on what’s causing the number of connections to get that high.  I wrote up a short shell script that will ping pScheduler periodically and produce a netstat if the ping fails.  Leave it to run for awhile on the offending perfSONAR node and it should catch the conditions when Apache runs out of connections.  You can download it here:  https://gist.github.com/mfeit-internet2/24ad3bb83a6dd3fdef87fb9469f92a4a

 

As long as the central MA isn’t having this problem, any run that completes will be archived correctly since the local server isn’t involved in that process.

 

--Mark

 




Archive powered by MHonArc 2.6.19.

Top of Page