perfsonar-user - Re: [perfsonar-user] more than 2000 threads
Subject: perfSONAR User Q&A and Other Discussion
List archive
- From: Marian Babik <>
- To: Pete Siemsen <>
- Cc: "" <>
- Subject: Re: [perfsonar-user] more than 2000 threads
- Date: Wed, 6 Dec 2017 09:26:03 +0000
- Accept-language: en-GB, en-US
- Authentication-results: spf=pass (sender IP is 188.184.36.46) smtp.mailfrom=cern.ch; internet2.edu; dkim=none (message not signed) header.d=none;internet2.edu; dmarc=bestguesspass action=none header.from=cern.ch;
- Ironport-phdr: 9a23:LunT5BRtjUE5N30/mZV8ky9iv9psv+yvbD5Q0YIujvd0So/mwa6zYBCN2/xhgRfzUJnB7Loc0qyN4vCmATRIyK3CmUhKSIZLWR4BhJdetC0bK+nBN3fGKuX3ZTcxBsVIWQwt1Xi6NU9IBJS2PAWK8TW94jEIBxrwKxd+KPjrFY7OlcS30P2594HObwlSijewZbB/IA+qoQnNq8IbnZZsJqEtxxXTv3BGYf5WxWRmJVKSmxbz+MK994N9/ipTpvws6ddOXb31cKokQ7NYCi8mM30u683wqRbDVwqP6WACXWgQjxFFHhLK7BD+Xpf2ryv6qu9w0zSUMMHqUbw5Xymp4rx1QxH0ligIKz858HnWisNuiqJbvAmhrAF7z4LNfY2ZKOZycqbbcNgHR2ROQ9xRWjRBDI2icoUPE+QPM+VWr4b/plsBsRSxCBKjBO/zzz9FnGP60bE43uknDArI3BYgH9ULsHnMq9v6Lr0SUeGvw6nO0D7OculZ1iz86IjLbxsspvaCUqhqccrQ00YvERnJg0iKpoP+PjOV1f8AvHSF4Op6U+KjkXIoqwForzWp28wiiZHJi5oLxl/e6Sl13YM4KcClREJmZNOkHpRduz2GO4ZzTMMtXXxkuCg/x7ADuJO3YjYGxIw6yxLBaPGLaZWE7x3+WOqLPDt1gHxodKiiixux/kWs0uP8Wde33VpWqydIl8fAumwD1xHR78WKSeZx80Wv1DuKyQzc9/tLLEMxmKfVL5MswaQ/m5wOukrZBCD2gl/5jKqOe0Uk5Oeo7+Pnb63pqJCSK4F4lhzyPr0hlcKwHOg0Kw8OUHOF9uim073j4FH5T65Njv0rlKnWrYrWJdwBpq6+Hw9azJos6wq+Dzeh1tQUh34HLE9ZeBKDiIjpPFLOLOrkAve4hlSgiDZrx/bYMb39GpjBMGLMnKv8cbt49kJQ1Rc/wNVR559bFr0NPPf+WkHvu9DFAB80Ngm5zuf5BNljzo8eXHiAAq6dMKPcq1+I4ecvLvGLaoAPojb9KuIq5/j0gXIkg1ASZqip3ZgMZX+kAPtmOUOZbWDwjdcBCWsKpBYxTPT2iF2eVj5ef26yULwm5jE1E4KmCoHDSZq3gLCYwSe7BYNZZnpdB1CIEHfobJmEW+wSZC6II89hlCAEWqa7S48nyx6uqBH2x6B5IeXJ5y1L/a7kgeB4++CbrhA/8Cd5CYzJyGCASnp5mEsVTDYsmq1zvBou5E2E1P1diuZZHNobzfpDUwRyYbvV1e1zDZbYUwjAff+CRUygBN6mV2JiBuktysMDNh4uU+6piQrOim/zW+cY
- Spamdiagnosticmetadata: NSPM
- Spamdiagnosticoutput: 1:99
Hi Pete,
my understanding was that Andy suggested to restart pscheduler runner just to
better understand where exactly to look for the root cause for the high
number of threads/processes (as stopping pscheduler runner leaves quite a
number of powstream processes around, it means there are indeed runaway
processes that are no longer managed). I suspect it’s a bug that will need to
be fixed, to be confirmed by Andy or Mark. As far as I can tell there is no
workaround for it at the moment.
Just to check if this is perhaps OS specific, are you machines on centOS7 or
still on SL6/or something else ?
Thanks,
Marian
> On Dec 5, 2017, at 8:03 PM, Pete Siemsen
> <>
> wrote:
>
> We monitor our perfSONAR machines with Nagios check-mk, which by default
> warns
> if there are more than 2000 threads on a machine. I'm getting that alarm. I
> found an email sequence where Andy suggested stopping/starting powstream. I
> did that but am still seeing "too many" threads. Should I raise the
> threshold
> above 2000?
>
> Here's what I did to no effect:
>
> perfsonar-1850# /etc/init.d/pscheduler-runner stop
> Stopping pScheduler runner: [ OK ]
> perfsonar-1850# /etc/init.d/pscheduler-runner status
> runner is stopped
> perfsonar-1850# ps -ef | grep "/usr/bin/powstream" | wc -l
> 357
> perfsonar-1850# ps -ef | grep "/usr/bin/powstream" | wc -l
> 354
> (waited 5 minutes)
> perfsonar-1850# ps -ef | grep "/usr/bin/powstream" | wc -l
> 354
> perfsonar-1850# ps -ef | grep "/usr/bin/powstream" | wc -l
> 353
> perfsonar-1850# ps -ef | grep "/usr/bin/powstream" | wc -l
> 352
> perfsonar-1850# pkill -9 -f powstream
> perfsonar-1850# ps -ef | grep "/usr/bin/powstream" | wc -l
> 9
> perfsonar-1850# ps -ef | grep "/usr/bin/powstream" | wc -l
> 9
> perfsonar-1850# /etc/init.d/pscheduler-runner start
> Starting pScheduler runner: [ OK ]
> perfsonar-1850# ps -ef | grep "/usr/bin/powstream" | wc -l
> 197
> perfsonar-1850# ps -ef | grep "/usr/bin/powstream" | wc -l
> 202
> perfsonar-1850# uptime
> 11:53:52 up 3 days, 21:50, 1 user, load average: 1.30, 1.02, 1.12
> (waited an hour)
> perfsonar-1850# ps -ef | grep "/usr/bin/powstream" | wc -l
> 203
Attachment:
smime.p7s
Description: S/MIME cryptographic signature
- [perfsonar-user] more than 2000 threads, Pete Siemsen, 12/05/2017
- Re: [perfsonar-user] more than 2000 threads, Marian Babik, 12/06/2017
- RE: [perfsonar-user] more than 2000 threads, Garnizov, Ivan (RRZE), 12/06/2017
- Re: [perfsonar-user] more than 2000 threads, Marian Babik, 12/06/2017
- Re: [perfsonar-user] more than 2000 threads, Andrew Lake, 12/06/2017
- Re: [perfsonar-user] more than 2000 threads, Mark Feit, 12/07/2017
- Re: [perfsonar-user] more than 2000 threads, Pete Siemsen, 12/07/2017
- Re: [perfsonar-user] more than 2000 threads, Marian Babik, 12/12/2017
- RE: [perfsonar-user] more than 2000 threads, Garnizov, Ivan (RRZE), 12/06/2017
- Re: [perfsonar-user] more than 2000 threads, Marian Babik, 12/06/2017
Archive powered by MHonArc 2.6.19.