Skip to Content.
Sympa Menu

perfsonar-user - Re: [perfsonar-user] pScheduler Internal Error on a mobile node

Subject: perfSONAR User Q&A and Other Discussion

List archive

Re: [perfsonar-user] pScheduler Internal Error on a mobile node


Chronological Thread 
  • From: Antoine Delvaux <>
  • To: Elicia Heera <>
  • Cc:
  • Subject: Re: [perfsonar-user] pScheduler Internal Error on a mobile node
  • Date: Mon, 16 Oct 2017 12:43:04 +0000
  • Ironport-phdr: 9a23:qXIqhBfIeQIGHW7Q+R4S0cV1lGMj4u6mDksu8pMizoh2WeGdxcSzYx7h7PlgxGXEQZ/co6odzbGH4+a4ASQp2tWoiDg6aptCVhsI2409vjcLJ4q7M3D9N+PgdCcgHc5PBxdP9nC/NlVJSo6lPwWB6nK94iQPFRrhKAF7Ovr6GpLIj8Swyuu+54Dfbx9GiTe5Yr5+Ngm6oRnMvcQKnIVuLbo8xAHUqXVSYeRWwm1oJVOXnxni48q74YBu/SdNtf8/7sBMSar1cbg2QrxeFzQmLns65Nb3uhnZTAuA/WUTX2MLmRdVGQfF7RX6XpDssivms+d2xSeXMdHqQb0yRD+v9LlgRgP2hygbNj456GDXhdJ2jKJHuxKquhhzz5fJbI2JKPZye6XQds4YS2VcRMZcTzBODYyhYYUPDeUPM+lWoYrzp1UQqhWzHhWsBPrqyjNUhn/6wa833uI8Gg/GxgwgGNcOvWzIodXzKKcSVuG1zK/Wwj7eYf1ZxzP96JbSfRA8rvCHQLV9ftDXyUkzEAPFj1OQppL/MzyIyOsNt3ab4PB9VeOgkGMnrht+oj61ysc0jYnIh4QVxUrC9Slj2IY1IcS1RUhmatCqF5tQsjuVN4pwQs46TGFnpjw1yrsauZKheygK0psnyhjCYPKEa4iF+gzvWPqNLTtlgX9oe66zihW3/EWl1+HxVMe53VNXoiZYitXBuW0B2wbQ58SZUPdx40as1SyO2g3V9+pKO1o7lbDBJJ4k2rMwloQcsUDEHiLug0r5krWWdlgg+uey8ejnZ6/pppmGO49sjQH/M6Iulda5AegiKggOW3CX+eW61LL94U30WKtGguEqnqXEtZ3XJtgXq628DgJQz4ou6RiyAjK73NgFhXUHKUhKeBODj4jnIVHOJ/X4AO+ljFSqjDdrwPHGPrv/DZnXM3fMjrPhfahn5E5Bxws+1tVf6IhSCr0bOPLzXU7wtNrCAR8/KQC02+LnBM1n1owCQWKPHrOZMKTKvF+Q+O0vOeeMZJQSuDb7Mfcl/efijWIimVADZ6mkxp8XaHGjHvR6OEWVf2DggtYHEWcWoAU+Vurqh0OeUTJNfXq9Qb8z5ixoQL6hWKrHR4usifSh0SqhF9UCa2RHAVGFOWzhcZ+JXbEFY2SAK5kyvCYDUO2ITZMm1Bej/CP3z6RuM/qcriQeqZXi0tUz6+DXixgv5RRxBsLby2afSWhy2HsMEWxllJtjqFBwnw/QmZNzhOZVQJkKv6tE

Hello Elicia,

The error messages you see in the logs seem to indicate that PostgreSQL is
not running on your machine. Can you try to start/restart it? If you cannot
have it running, it would be good to check the logs in /var/lib/pgsql/ for
any error.

pscheduler requires postgresql-9.5 to run.

Can you confirm you are running on CentOS7? Did you installed from the ISO
image or from the bundle packages?

Thanks,

-- --
Antoine Delvaux Systems Engineer
Poznań Supercomputing & Network Center Skype: toninb
GÉANT project Tel: +221.703368313
http://www.geant.org XMPP:

PGP fingerprint: DC65 0D8B 6938 9229 33C3 18CA 4EB6 09D3 A333 3378

> Le 16 oct. 2017 à 11:58, Elicia Heera
> <>
> a écrit :
>
> Hi Everyone,
>
> I need some help regarding the pScheduler server and some of the errors I
> am getting and how to resolve them. I recently did a fresh install of
> perfSONAR on a FIT-PC 3 device. When this device was deployed to site the
> pScheduler process has been giving numerous errors including Internal error
> on on local pScheduler server when I run pscheduler task idle --duration
> PT2S.
>
> Under the pscheduler log:
> Oct 16 13:33:38 localhost journal: safe_run/scheduler ERROR Restarting
> Oct 16 13:33:38 localhost journal: safe_run/scheduler ERROR Program
> threw an exception after 0:00:00.001245
> Oct 16 13:33:38 localhost journal: safe_run/scheduler ERROR Exception:
> OperationalError: could not connect to server: Connection refused#012#011Is
> the server running on host "127.0.0.1"$
> Oct 16 13:33:38 localhost journal: safe_run/scheduler ERROR Waiting
> 19.75 seconds before restarting
> Oct 16 13:33:38 localhost journal: safe_run/runner ERROR Restarting
> Oct 16 13:33:38 localhost journal: safe_run/runner ERROR Program threw
> an exception after 0:00:00.001344
> Oct 16 13:33:38 localhost journal: safe_run/runner ERROR Exception:
> OperationalError: could not connect to server: Connection refused#012#011Is
> the server running on host "127.0.0.1" an$
> Oct 16 13:33:38 localhost journal: safe_run/runner ERROR Waiting 19.75
> seconds before restarting
> Oct 16 13:33:38 localhost journal: safe_run/archiver ERROR Restarting
> Oct 16 13:33:38 localhost journal: safe_run/archiver ERROR Program threw
> an exception after 0:00:00.001308
> Oct 16 13:33:38 localhost journal: safe_run/archiver ERROR Exception:
> OperationalError: could not connect to server: Connection refused#012#011Is
> the server running on host "127.0.0.1" $
> Oct 16 13:33:38 localhost journal: safe_run/archiver ERROR Waiting 19.75
> seconds before restarting
> Oct 16 13:33:39 localhost journal: safe_run/ticker ERROR Restarting
> Oct 16 13:33:39 localhost journal: safe_run/ticker ERROR Program threw
> an exception after 0:00:00.005828
> Oct 16 13:33:39 localhost journal: safe_run/ticker ERROR Exception:
> OperationalError: could not connect to server: Connection refused#012#011Is
> the server running on host "127.0.0.1" an$
> Oct 16 13:33:39 localhost journal: safe_run/ticker ERROR Waiting 19.75
> seconds before restarting
> Oct 16 13:33:58 localhost journal: safe_run/scheduler ERROR Restarting
> Oct 16 13:33:58 localhost journal: safe_run/scheduler ERROR Program
> threw an exception after 0:00:00.001213
> Oct 16 13:33:58 localhost journal: safe_run/scheduler ERROR Exception:
> OperationalError: could not connect to server: Connection refused#012#011Is
> the server running on host "127.0.0.1"$
> Oct 16 13:33:58 localhost journal: safe_run/scheduler ERROR Waiting 20.0
> seconds before restarting
> Oct 16 13:33:58 localhost journal: safe_run/runner ERROR Restarting
> Oct 16 13:33:58 localhost journal: safe_run/runner ERROR Program threw
> an exception after 0:00:00.001320
> Oct 16 13:33:58 localhost journal: safe_run/runner ERROR Exception:
> OperationalError: could not connect to server: Connection refused#012#011Is
> the server running on host "127.0.0.1" an$
> Oct 16 13:33:58 localhost journal: safe_run/runner ERROR Waiting 20.0
> seconds before restarting
> Oct 16 13:33:58 localhost journal: safe_run/archiver ERROR Restarting
> Oct 16 13:33:58 localhost journal: safe_run/archiver ERROR Program threw
> an exception after 0:00:00.001256
> Oct 16 13:33:58 localhost journal: safe_run/archiver ERROR Exception:
> OperationalError: could not connect to server: Connection refused#012#011Is
> the server running on host "127.0.0.1" $
> Oct 16 13:33:58 localhost journal: safe_run/archiver ERROR Waiting 20.0
> seconds before restarting
> Oct 16 13:33:59 localhost journal: safe_run/ticker ERROR Restarting
> Oct 16 13:33:59 localhost journal: safe_run/ticker ERROR Program threw
> an exception after 0:00:00.005256
> Oct 16 13:33:59 localhost journal: safe_run/ticker ERROR Exception:
> OperationalError: could not connect to server: Connection refused#012#011Is
> the server running on host "127.0.0.1" an$
> Oct 16 13:33:59 localhost journal: safe_run/ticker ERROR Waiting 20.0
> seconds before restarting
>
> I get these errors when checking the status of any of the pscheduler
> services. All show active (running):
> pscheduler-runner.service - pScheduler server - runner
> Loaded: loaded (/usr/lib/systemd/system/pscheduler-runner.service;
> enabled; vendor preset: disabled)
> Active: active (running) since Mon 2017-10-16 13:20:41 SAST; 16min ago
> Main PID: 1254 (runner)
> CGroup: /system.slice/pscheduler-runner.service
> └─1254 /usr/bin/python /usr/libexec/pscheduler/daemons/runner
> --daemon --pid-file /var/run/pscheduler-runner.pid --dsn
> @/etc/pscheduler/database/database-dsn
>
> Oct 16 13:36:02 localhost.localdomain runner[1254]: safe_run/runner ERROR
> Exception: OperationalError: could not connect to server: Connection
> refused
> Is the server
> running on host "127.0.0.1" and accepting
> TCP/IP
> connections on port 5432?...
> Oct 16 13:36:02 localhost.localdomain runner[1254]: safe_run/runner ERROR
> Waiting 21.5 seconds before restarting
> Oct 16 13:36:23 localhost.localdomain runner[1254]: safe_run/runner ERROR
> Restarting
> Oct 16 13:36:23 localhost.localdomain runner[1254]: safe_run/runner ERROR
> Program threw an exception after 0:00:00.001386
> Oct 16 13:36:23 localhost.localdomain runner[1254]: safe_run/runner ERROR
> Exception: OperationalError: could not connect to server: Connection
> refused
> Is the server
> running on host "127.0.0.1" and accepting
> TCP/IP
> connections on port 5432?...
> Oct 16 13:36:23 localhost.localdomain runner[1254]: safe_run/runner ERROR
> Waiting 21.75 seconds before restarting
> Oct 16 13:36:45 localhost.localdomain runner[1254]: safe_run/runner ERROR
> Restarting
> Oct 16 13:36:45 localhost.localdomain runner[1254]: safe_run/runner ERROR
> Program threw an exception after 0:00:00.000983
> Oct 16 13:36:45 localhost.localdomain runner[1254]: safe_run/runner ERROR
> Exception: OperationalError: could not connect to server: Connection
> refused
> Is the server
> running on host "127.0.0.1" and accepting
> TCP/IP
> connections on port 5432?...
> Oct 16 13:36:45 localhost.localdomain runner[1254]: safe_run/runner ERROR
> Waiting 22.0 seconds before restarting
>
>
> I have tried reinstalling the pscheduler server and restarting all the
> processes with no luck.
>
> Any help would be appreciated.
>
> Sincerly
> Elicia Heera
> Network Engineer
>
> <SANReN Logo.PNG>
>




Archive powered by MHonArc 2.6.19.

Top of Page