Skip to Content.
Sympa Menu

perfsonar-user - Re: [perfsonar-user] Figuring out how to trace these errors: pool.py:152:__init__:ValueError: Number of processes must be at least 1

Subject: perfSONAR User Q&A and Other Discussion

List archive

Re: [perfsonar-user] Figuring out how to trace these errors: pool.py:152:__init__:ValueError: Number of processes must be at least 1


Chronological Thread 
  • From: Mark Feit <>
  • To: Casey Russell <>, "" <>
  • Subject: Re: [perfsonar-user] Figuring out how to trace these errors: pool.py:152:__init__:ValueError: Number of processes must be at least 1
  • Date: Wed, 21 Feb 2018 19:36:54 +0000
  • Accept-language: en-US
  • Authentication-results: kanren.net; dkim=none (message not signed) header.d=none;kanren.net; dmarc=none action=none header.from=internet2.edu;
  • Ironport-phdr: 9a23:HmOnZhNyRy+PdO4Gu3Al6mtUPXoX/o7sNwtQ0KIMzox0K/z5rsbcNUDSrc9gkEXOFd2Cra4c0KyO6+jJYi8p2d65qncMcZhBBVcuqP49uEgeOvODElDxN/XwbiY3T4xoXV5h+GynYwAOQJ6tL1LdrWev4jEMBx7xKRR6JvjvGo7Vks+7y/2+94fcbglUijexe69+IAmrpgjNq8cahpdvJLwswRXTuHtIfOpWxWJsJV2Nmhv3+9m98p1+/SlOovwt78FPX7n0cKQ+VrxYES8pM3sp683xtBnMVhWA630BWWgLiBVIAgzF7BbnXpfttybxq+Rw1DWGMcDwULs5Qiqp4bt1RxD0iScHLz85/3/Risxsl6JQvRatqwViz4LIfI2ZMfxzdb7fc9wHX2pMRsZfWTJcDIOgYYUBDOQBMuRZr4bhqFQDtgGxCRWpBO711jNEmn370Ksn2OohCwHG2wkgEsoJvnvOttX1N7oZX+Crw6nJ0TXIcvRb2TLz6IfWcxAhveqBXbN2ccrN10YvDQXFgUuMqYD7JT+ayPkCs3WC4udmSOmhhWknqwRrrTiuwMchkobJhoMJylDE6CV225w5KsG/SE5+Z9OvDZhetzmCOodoWMwuX3xktSMnxrEbpJK2eSYKxZs7yxLDbvGKcZaH7x3sWeuUJDp0mm5pdK+nixqv7UStyffwWte33VpQsCZInd3BumoQ2xDO98SLUvV98Vmi1DqT0g3e7+BJLloxmKrVNpIu3Lo9m58NvkjeAiD5g0D7g7KLekk6++Wl7fnsbK/8qZ+GLYB0jxnzMqQwlcy7BuQ1KhAAUnSc9+igyrHv5FD3Tq1XgvEvlanWq47VKd4cpq6kHw9ayYEj6wu5Dzi7ytgYhWMHLFVZeB2Zk4fpJ1DOIPf+DfulhFSsjStrx/TBPr3mAZXBNGTMkLDkfbpl6k5czhQ8zcxH6p5KEL4NPO//V0rsuNDFExM0NgK0zun7BNlh044TW3yAD6CWPa7XtFKE+P4jL/SUaI8Qojn9Kvwl5/D0jX8+nF8QZayp3ZUJZ3C5AvRrOESZYXzwgtgfC2cKuBQxTPD0hFGYTzFcemuyU7om5j4nEIKmEZvDRoe1jbyZwii7BJNWZntBClCKC3jobZyIW+oXaC2MOM9siTgEVbm6S489zhGiqhX2y7thLurI5CIYr5Tj28Zp5+HNjx096yF7D9nOm12KGkp9hW4ZDwUrx7typUg1nlyZzLNjjvhcPd9a4elEVEE8OIKKi6QwE93oVBnGeN6TDUu9T8+OADctQ8g3zsNUJUtxBp/q2gjOxSSxBLkcjfmWH5Eu2qPaw3XrIcthkTDL2LR33Hc8Rc4aEGS8heZR+gnPT9rMnUiYi46rc7gRxijA6D3FwGaT6hILGDVsWLnICChMLnDdqs70swabF+ej
  • Spamdiagnosticoutput: 1:0

Casey Russell writes:

 

     I've seen similar errors mentioned before, but I'm having a hard time tracing these back to their origins.  I've started getting dozens (and maybe more than a hundred) emails a day from my single Centos7 machine complaining of a problem running a traceroute test (if I'm interpreting it correctly).  The line from the pscheduler.log is as follows:

 

Feb 21 07:55:18 ps-fhsu-bw journal: runner INFO     12356773: Run failed 1: Traceback (most recent call last):#012  File "/usr/libexec/pscheduler/commands/../classes/tool/traceroute/run", line 389, in <module>#012    ases = pscheduler.as_bulk_resolve(ips, threads=len(ips))#012  File "/usr/lib/python2.7/site-packages/pscheduler/psas.py", line 76, in as_bulk_resolve#012    processes=min(len(candidates), threads))#012  File "/usr/lib64/python2.7/multiprocessing/pool.py", line 714, in __init__#012    Pool.__init__(self, processes, initializer, initargs)#012  File "/usr/lib64/python2.7/multiprocessing/pool.py", line 152, in __init__#012    raise ValueError("Number of processes must be at least 1")#012ValueError: Number of processes must be at least 1

 

This was sufficient information for me to hunt down the problem.  It’s a bug in a library routine used by the traceroute tool to determine the AS numbers of IP addresses.  I’ve opened a ticket on it:  https://github.com/perfsonar/pscheduler/issues/588.  It’s an easy fix and will show up in 4.1 or the next bugfix release if we do one of those.

 

For future reference, the large number before the colon is an internal ID used by the database.  (Yes, your system has done 12.3 million runs since the database was last reset.)  Anything else in the log related to it will have that number prepended, so a quick application of grep will pull out any additional information.

 

When I hand-restart pscheduler-scheduler (and occasionally when I restart the machine), I get an error about not having permissions to the pscheduler-archiver.pid file.  That has been a problem that has come and gone ever since I upgraded this machine to CentOS7 6 or more months ago.  So it may or may not be related to this problem.

 

That’s unrelated.  At some point since pScheduler was introduced, we did an overhaul to the scripts that start and stop the services that included not using root to create the PID files.  In some cases the old, root-owned files got left around and caused this problem.  The workaround for the time being is to stop the service, kill off any additional copies you see in the process table, remove /var/run/pscheduler-xxx.pid (where xxx is the service name) and then start the service.  After that, things should be fine.  I’ve opened a ticket on that and will get some clean-up code into the next release:   https://github.com/perfsonar/pscheduler/issues/589.

 

--Mark

 




Archive powered by MHonArc 2.6.19.

Top of Page