perfsonar-user - [perfsonar-user] pScheduler Internal Error on a mobile node
Subject: perfSONAR User Q&A and Other Discussion
List archive
- From: Elicia Heera <>
- To:
- Subject: [perfsonar-user] pScheduler Internal Error on a mobile node
- Date: Mon, 16 Oct 2017 13:58:42 +0200
- Ironport-phdr: 9a23:6B9wtxbqXQ3gi1/iC7Xqw/f/LSx+4OfEezUN459isYplN5qZoMS+bnLW6fgltlLVR4KTs6sC0LuG9fi4EUU7or+5+EgYd5JNUxJXwe43pCcHRPC/NEvgMfTxZDY7FskRHHVs/nW8LFQHUJ2mPw6arXK99yMdFQviPgRpOOv1BpTSj8Oq3Oyu5pHfeQtFiT6+bL9oMBm6sRjau9ULj4dlNqs/0AbCrGFSe+RRy2NoJFaTkAj568yt4pNt8Dletuw4+cJYXqr0Y6o3TbpDDDQ7KG81/9HktQPCTQSU+HQRVHgdnwdSDAjE6BH6WYrxsjf/u+Fg1iSWIdH6QLYpUjus9adrTALjhjkBOTA37WrbjtF8gL5erB+nuhdxwZPbYJuNOfR+cK3Tfs4US3RdUctKTSxNHpmxYpETA+YdP+tVqZT2qVsUrRu5AAmhHOzhyjtJhnDq3K01yfkqHwPY0wM+BdIBqmnfodLrO6cWUOC60KjIwi/YYvNNwzj97pLIfQ4nof2WR71/bdDdyEg1GA7ciFibtI/rPyuN2+gTsmWX8+htWOehi2MksA59vj2iy8gwhoXVmo0Yz0zL+Tl5zYswINC0VlB3bNqiHZBNrS+VLZF2TdknQ2xwuCY11LkGuZmjcSgP0psnxhrfZ+WZc4iL/h7vTemQLSlmiH9hYr6/iBGy8U+vyu34SMa4ykpFri1AktXUt3AN0QLc6tSfR/dj/0qtxTSC2gXd6uxHOk84ia/WJpE9zrIsipUetFjMEjP2lUjziaKaaFso9+yw5+TieLrmp5ucN4FuigH5N6QjgtKwAeA5MgcSXmiU4/+x1Kb58k3/WrVFkPs2nrPDv5/GP8gap7S2DxdP0ok/8xa/Eyum0NMAkHkfMl1FYhyHj5PuO1HIOv/4F+6zg0m3kDh13fDLJbnhApTWLnjfi7ftY6xx609ayAov099f/ZRUBa8dIP7tQEP+qsHXDgJqezCzlv7qEttm0YUXQyeSGaKDGKLUrVKS4O8zea+BaJJGliz6Lq0I4//ljHZxuVIQZ6DhiZYTaXu5F9x9KkODbHyqi9xHA25c7Vl2d/DjlFDXCW0bXH21Ra9pv2k2
Hi Everyone,
I need some help regarding the pScheduler server and some of the errors I am getting and how to resolve them. I recently did a fresh install of perfSONAR on a FIT-PC 3 device. When this device was deployed to site the pScheduler process has been giving numerous errors including Internal error on on local pScheduler server when I run pscheduler task idle --duration PT2S.
Under the pscheduler log:
Oct 16 13:33:38 localhost journal: safe_run/scheduler ERROR Restarting
Oct 16 13:33:38 localhost journal: safe_run/scheduler ERROR Program threw an exception after 0:00:00.001245
Oct 16 13:33:38 localhost journal: safe_run/scheduler ERROR Exception: OperationalError: could not connect to server: Connection refused#012#011Is the server running on host "127.0.0.1"$
Oct 16 13:33:38 localhost journal: safe_run/scheduler ERROR Waiting 19.75 seconds before restarting
Oct 16 13:33:38 localhost journal: safe_run/runner ERROR Restarting
Oct 16 13:33:38 localhost journal: safe_run/runner ERROR Program threw an exception after 0:00:00.001344
Oct 16 13:33:38 localhost journal: safe_run/runner ERROR Exception: OperationalError: could not connect to server: Connection refused#012#011Is the server running on host "127.0.0.1" an$
Oct 16 13:33:38 localhost journal: safe_run/runner ERROR Waiting 19.75 seconds before restarting
Oct 16 13:33:38 localhost journal: safe_run/archiver ERROR Restarting
Oct 16 13:33:38 localhost journal: safe_run/archiver ERROR Program threw an exception after 0:00:00.001308
Oct 16 13:33:38 localhost journal: safe_run/archiver ERROR Exception: OperationalError: could not connect to server: Connection refused#012#011Is the server running on host "127.0.0.1" $
Oct 16 13:33:38 localhost journal: safe_run/archiver ERROR Waiting 19.75 seconds before restarting
Oct 16 13:33:39 localhost journal: safe_run/ticker ERROR Restarting
Oct 16 13:33:39 localhost journal: safe_run/ticker ERROR Program threw an exception after 0:00:00.005828
Oct 16 13:33:39 localhost journal: safe_run/ticker ERROR Exception: OperationalError: could not connect to server: Connection refused#012#011Is the server running on host "127.0.0.1" an$
Oct 16 13:33:39 localhost journal: safe_run/ticker ERROR Waiting 19.75 seconds before restarting
Oct 16 13:33:58 localhost journal: safe_run/scheduler ERROR Restarting
Oct 16 13:33:58 localhost journal: safe_run/scheduler ERROR Program threw an exception after 0:00:00.001213
Oct 16 13:33:58 localhost journal: safe_run/scheduler ERROR Exception: OperationalError: could not connect to server: Connection refused#012#011Is the server running on host "127.0.0.1"$
Oct 16 13:33:58 localhost journal: safe_run/scheduler ERROR Waiting 20.0 seconds before restarting
Oct 16 13:33:58 localhost journal: safe_run/runner ERROR Restarting
Oct 16 13:33:58 localhost journal: safe_run/runner ERROR Program threw an exception after 0:00:00.001320
Oct 16 13:33:58 localhost journal: safe_run/runner ERROR Exception: OperationalError: could not connect to server: Connection refused#012#011Is the server running on host "127.0.0.1" an$
Oct 16 13:33:58 localhost journal: safe_run/runner ERROR Waiting 20.0 seconds before restarting
Oct 16 13:33:58 localhost journal: safe_run/archiver ERROR Restarting
Oct 16 13:33:58 localhost journal: safe_run/archiver ERROR Program threw an exception after 0:00:00.001256
Oct 16 13:33:58 localhost journal: safe_run/archiver ERROR Exception: OperationalError: could not connect to server: Connection refused#012#011Is the server running on host "127.0.0.1" $
Oct 16 13:33:58 localhost journal: safe_run/archiver ERROR Waiting 20.0 seconds before restarting
Oct 16 13:33:59 localhost journal: safe_run/ticker ERROR Restarting
Oct 16 13:33:59 localhost journal: safe_run/ticker ERROR Program threw an exception after 0:00:00.005256
Oct 16 13:33:59 localhost journal: safe_run/ticker ERROR Exception: OperationalError: could not connect to server: Connection refused#012#011Is the server running on host "127.0.0.1" an$
Oct 16 13:33:59 localhost journal: safe_run/ticker ERROR Waiting 20.0 seconds before restarting
I get these errors when checking the status of any of the pscheduler services. All show active (running):
pscheduler-runner.service - pScheduler server - runner
Loaded: loaded (/usr/lib/systemd/system/pscheduler-runner.service; enabled; vendor preset: disabled)
Active: active (running) since Mon 2017-10-16 13:20:41 SAST; 16min ago
Main PID: 1254 (runner)
CGroup: /system.slice/pscheduler-runner.service
└─1254 /usr/bin/python /usr/libexec/pscheduler/daemons/runner --daemon --pid-file /var/run/pscheduler-runner.pid --dsn @/etc/pscheduler/database/database-dsn
Oct 16 13:36:02 localhost.localdomain runner[1254]: safe_run/runner ERROR Exception: OperationalError: could not connect to server: Connection refused
Is the server running on host "127.0.0.1" and accepting
TCP/IP connections on port 5432?...
Oct 16 13:36:02 localhost.localdomain runner[1254]: safe_run/runner ERROR Waiting 21.5 seconds before restarting
Oct 16 13:36:23 localhost.localdomain runner[1254]: safe_run/runner ERROR Restarting
Oct 16 13:36:23 localhost.localdomain runner[1254]: safe_run/runner ERROR Program threw an exception after 0:00:00.001386
Oct 16 13:36:23 localhost.localdomain runner[1254]: safe_run/runner ERROR Exception: OperationalError: could not connect to server: Connection refused
Is the server running on host "127.0.0.1" and accepting
TCP/IP connections on port 5432?...
Oct 16 13:36:23 localhost.localdomain runner[1254]: safe_run/runner ERROR Waiting 21.75 seconds before restarting
Oct 16 13:36:45 localhost.localdomain runner[1254]: safe_run/runner ERROR Restarting
Oct 16 13:36:45 localhost.localdomain runner[1254]: safe_run/runner ERROR Program threw an exception after 0:00:00.000983
Oct 16 13:36:45 localhost.localdomain runner[1254]: safe_run/runner ERROR Exception: OperationalError: could not connect to server: Connection refused
Is the server running on host "127.0.0.1" and accepting
TCP/IP connections on port 5432?...
Oct 16 13:36:45 localhost.localdomain runner[1254]: safe_run/runner ERROR Waiting 22.0 seconds before restarting
I have tried reinstalling the pscheduler server and restarting all the processes with no luck.
Any help would be appreciated.
Sincerly
Elicia Heera
Network Engineer
- [perfsonar-user] pScheduler Internal Error on a mobile node, Elicia Heera, 10/16/2017
- Re: [perfsonar-user] pScheduler Internal Error on a mobile node, Antoine Delvaux, 10/16/2017
- Re: [perfsonar-user] pScheduler Internal Error on a mobile node, Elicia Heera, 10/16/2017
- Re: [perfsonar-user] pScheduler Internal Error on a mobile node, Mark Feit, 10/16/2017
- Re: [perfsonar-user] pScheduler Internal Error on a mobile node, Elicia Heera, 10/16/2017
- Re: [perfsonar-user] pScheduler Internal Error on a mobile node, Antoine Delvaux, 10/16/2017
Archive powered by MHonArc 2.6.19.