perfsonar-user - [perfsonar-user] Resolved - Re: Error on data restore after CentOS6->7 upgrade
Subject: perfSONAR User Q&A and Other Discussion
List archive
- From: Kathy Benninger <>
- To: perfsonar-user <>
- Cc: Kathy Benninger <>
- Subject: [perfsonar-user] Resolved - Re: Error on data restore after CentOS6->7 upgrade
- Date: Thu, 23 Aug 2018 16:44:53 -0400
- Dkim-filter: OpenDKIM Filter v2.11.0 mailer1.psc.edu w7NKiwQg002297
- Ironport-phdr: 9a23:DPprkRHwBI/90GulIEtq4J1GYnF86YWxBRYc798ds5kLTJ78oc+wAkXT6L1XgUPTWs2DsrQY07WQ6/iocFdDyK7JiGoFfp1IWk1NouQttCtkPvS4D1bmJuXhdS0wEZcKflZk+3amLRodQ56mNBXdrXKo8DEdBAj0OxZrKeTpAI7SiNm82/yv95HJbAhEmDiwbaluIBmqsA7cqtQYjYx+J6gr1xDHuGFIe+NYxWNpIVKcgRPx7dqu8ZBg7ipdpesv+9ZPXqvmcas4S6dYDCk9PGAu+MLrrxjDQhCR6XYaT24bjwBHAwnB7BH9Q5fxri73vfdz1SWGIcH7S60/VDK/5KlpVRDokj8KOSM5/m/JhMx+j6xVrxyuqBN934Hab5qYNOZ9c67HYd8WWWRMU8RXWidcAo28dYwPD+8ZMOhdtYb9u1sOrBujDgSxGuzvzDlIhn7w3a01zeshCxzN0Qs7EN0QtnTUqtT1NKEdUe+r1qnF1i3Db+hK1jvn9ofHbw0hrOiKULltcsTR0VEiGx7bgliTs4DpIjCY2voXv2Wb7OdsT/+jh3I7pw1pvzSj2MQhhpPUio8W1FzI7zh1zYcpKdGiVUJ3fNqpHZ1NvC+ALYR2WNktQ2RwtSY61LIGvZm7cTAPyJs9xh7fb+KIc5KV7h75TOaROit3iGh7d72imha+6UmgyuviWcmoyFtGszRJn9rWunwQ1BHf9NKLR/t+80u7xDqDyxzf5vlBIU8ulKrbL5AhwqQ3lpoWqUnMBTH5mEPwjK+XcEUk9fan6/78b7X+vZ+TK5V4hR/jPaQzgsC/G/g3MhASX2iH/uSxzKXj8lPjQLVElf02la/Zv4rAJcgCu665GBFa0pw45ha7Djemy8gYnWIZIF5feRKHiZTpNE/UIPD+E/i/n0qgnC11yP/bI72ySqnKe2DOirn6erB08QtB0wcphYRH6ohaEbYHKei2R1T8rvTZCAM0KQq537yhBdlggNAwQ2WKV4yQLKPD+XWF4OQuJeaBfsdBsjv7Lfkj6PP0pWcikhkQcbT/jshfU2yxAvkzexbRWnHrmNpUSWo=
I think I found the problem. A restore on another machine fail and give a more specific error:
Restoring pScheduler from backup:
Unpacking backup... Done.
Restoring configuration... Done.
Restoring database... Done.
Updating database... Done.
Resetting password... Done.
/usr/lib/perfsonar/scripts/ps-migrate-restore.sh: line 244: syntax error near unexpected token `fi'
/usr/lib/perfsonar/scripts/ps-migrate-restore.sh: line 244: `fi'
In /usr/lib/perfsonar/scripts/ps-migrate-restore.sh
I added "then" to lines 235 and 245, reran restore, and it seems to have
worked.
Kathy
On 8/23/2018 4:09 PM, Kathy Benninger wrote:
I'm seeking advice for a pscheduler that did not survive data restore following a Netinstall CentOS6->7 / pSTK4.1 Toolkit upgrade.
The host doesn't initiate tests or respond to pscheduled tests from other hosts.
Following are test output and log file snippets that will hopefully be sufficient to identify the problem.
Thanks!
Kathy Benninger
Pittsburgh Supercomputing Center
---------------------------------------------------------------------------
Some pscheduler checks:
[root@ps benninge]# pscheduler ping localhost
localhost: pScheduler is alive
[root@ps benninge]# pscheduler troubleshoot
Performing basic troubleshooting of localhost.
localhost:
Checking path MTU... 65520+
Checking for pScheduler... OK.
Checking clock... OK.
Idle test... Failed.
Unable to post task: Internal problem; see system logs.
Nothing is being scheduled.
---------------------------------------------------------------------------
pscheduler log messages:
Aug 23 14:32:14 ps journal: pscheduler-api ERROR Exception: ProgrammingError: function api_task_post(unknown, text[], unknown, unknown, integer, unknown, boolean, unknown) does not exist
Aug 23 14:32:14 ps journal: pscheduler-api ERROR LINE 1: SELECT * FROM api_task_post('{"reference": {"created-by": {"...
Aug 23 14:32:14 ps journal: pscheduler-api ERROR ^
Aug 23 14:32:14 ps journal: pscheduler-api ERROR HINT: No function matches the given name and argument types. You might need to add explicit type casts.
Aug 23 14:32:14 ps journal: pscheduler-api ERROR Traceback (most recent call last):
Aug 23 14:32:14 ps journal: pscheduler-api ERROR File "/usr/lib/python2.7/site-packages/flask/app.py", line 1475, in full_dispatch_request
Aug 23 14:32:14 ps journal: pscheduler-api ERROR rv = self.dispatch_request()
Aug 23 14:32:14 ps journal: pscheduler-api ERROR File "/usr/lib/python2.7/site-packages/flask/app.py", line 1461, in dispatch_request
Aug 23 14:32:14 ps journal: pscheduler-api ERROR return self.view_functions[rule.endpoint](**req.view_args)
Aug 23 14:32:14 ps journal: pscheduler-api ERROR File "./pschedulerapiserver/tasks.py", line 477, in tasks
Aug 23 14:32:14 ps journal: pscheduler-api ERROR File "./pschedulerapiserver/dbcursor.py", line 112, in dbcursor_query
Aug 23 14:32:14 ps journal: pscheduler-api ERROR ProgrammingError: function api_task_post(unknown, text[], unknown, unknown, integer, unknown, boolean, unknown) does not exist
Aug 23 14:32:14 ps journal: pscheduler-api ERROR LINE 1: SELECT * FROM api_task_post('{"reference": {"created-by": {"...
Aug 23 14:32:14 ps journal: pscheduler-api ERROR ^
Aug 23 14:32:14 ps journal: pscheduler-api ERROR HINT: No function matches the given name and argument types. You might need to add explicit type casts.
Aug 23 14:32:38 ps journal: pscheduler-api ERROR Exception: ProgrammingError: function api_task_post(unknown, text[], unknown, unknown, integer, unknown, boolean, unknown) does not exist
Aug 23 14:32:38 ps journal: pscheduler-api ERROR LINE 1: SELECT * FROM api_task_post('{"reference": {"psconfig": {"cr...
Aug 23 14:32:38 ps journal: pscheduler-api ERROR ^
Aug 23 14:32:38 ps journal: pscheduler-api ERROR HINT: No function matches the given name and argument types. You might need to add explicit type casts.
Aug 23 14:32:38 ps journal: pscheduler-api ERROR Traceback (most recent call last):
Aug 23 14:32:38 ps journal: pscheduler-api ERROR File "/usr/lib/python2.7/site-packages/flask/app.py", line 1475, in full_dispatch_request
Aug 23 14:32:38 ps journal: pscheduler-api ERROR rv = self.dispatch_request()
Aug 23 14:32:38 ps journal: pscheduler-api ERROR File "/usr/lib/python2.7/site-packages/flask/app.py", line 1461, in dispatch_request
Aug 23 14:32:38 ps journal: pscheduler-api ERROR return self.view_functions[rule.endpoint](**req.view_args)
Aug 23 14:32:38 ps journal: pscheduler-api ERROR File "./pschedulerapiserver/tasks.py", line 477, in tasks
Aug 23 14:32:38 ps journal: pscheduler-api ERROR File "./pschedulerapiserver/dbcursor.py", line 112, in dbcursor_query
Aug 23 14:32:38 ps journal: pscheduler-api ERROR ProgrammingError: function api_task_post(unknown, text[], unknown, unknown, integer, unknown, boolean, unknown) does not exist
Aug 23 14:32:38 ps journal: pscheduler-api ERROR LINE 1: SELECT * FROM api_task_post('{"reference": {"psconfig": {"cr...
Aug 23 14:32:38 ps journal: pscheduler-api ERROR ^
Aug 23 14:32:38 ps journal: pscheduler-api ERROR HINT: No function matches the given name and argument types. You might need to add explicit type casts.
Aug 23 14:32:53 ps journal: pscheduler-api ERROR Exception: ProgrammingError: function api_task_post(unknown, text[], unknown, unknown, integer, unknown, boolean, unknown) does not exist
Aug 23 14:32:53 ps journal: pscheduler-api ERROR LINE 1: SELECT * FROM api_task_post('{"reference": {"psconfig": {"cr...
Aug 23 14:32:53 ps journal: pscheduler-api ERROR ^
Aug 23 14:32:53 ps journal: pscheduler-api ERROR HINT: No function matches the given name and argument types. You might need to add explicit type casts.
Aug 23 14:32:53 ps journal: pscheduler-api ERROR Traceback (most recent call last):
Aug 23 14:32:53 ps journal: pscheduler-api ERROR File "/usr/lib/python2.7/site-packages/flask/app.py", line 1475, in full_dispatch_request
Aug 23 14:32:53 ps journal: pscheduler-api ERROR rv = self.dispatch_request()
Aug 23 14:32:53 ps journal: pscheduler-api ERROR File "/usr/lib/python2.7/site-packages/flask/app.py", line 1461, in dispatch_request
Aug 23 14:32:53 ps journal: pscheduler-api ERROR return self.view_functions[rule.endpoint](**req.view_args)
Aug 23 14:32:53 ps journal: pscheduler-api ERROR File "./pschedulerapiserver/tasks.py", line 678, in tasks_uuid
Aug 23 14:32:53 ps journal: pscheduler-api ERROR File "./pschedulerapiserver/dbcursor.py", line 112, in dbcursor_query
Aug 23 14:32:53 ps journal: pscheduler-api ERROR ProgrammingError: function api_task_post(unknown, text[], unknown, unknown, integer, unknown, boolean, unknown) does not exist
Aug 23 14:32:53 ps journal: pscheduler-api ERROR LINE 1: SELECT * FROM api_task_post('{"reference": {"psconfig": {"cr...
Aug 23 14:32:53 ps journal: pscheduler-api ERROR ^
Aug 23 14:32:53 ps journal: pscheduler-api ERROR HINT: No function matches the given name and argument types. You might need to add explicit type casts.
Aug 23 14:33:04 ps journal: pscheduler-api ERROR Exception: ProgrammingError: function api_task_post(unknown, text[], unknown, unknown, integer, unknown, boolean, unknown) does not exist
Aug 23 14:33:04 ps journal: pscheduler-api ERROR LINE 1: SELECT * FROM api_task_post('{"reference": {"psconfig": {"cr...
Aug 23 14:33:04 ps journal: pscheduler-api ERROR ^
Aug 23 14:33:04 ps journal: pscheduler-api ERROR HINT: No function matches the given name and argument types. You might need to add explicit type casts.
Aug 23 14:33:04 ps journal: pscheduler-api ERROR Traceback (most recent call last):
Aug 23 14:33:04 ps journal: pscheduler-api ERROR File "/usr/lib/python2.7/site-packages/flask/app.py", line 1475, in full_dispatch_request
Aug 23 14:33:04 ps journal: pscheduler-api ERROR rv = self.dispatch_request()
Aug 23 14:33:04 ps journal: pscheduler-api ERROR File "/usr/lib/python2.7/site-packages/flask/app.py", line 1461, in dispatch_request
Aug 23 14:33:04 ps journal: pscheduler-api ERROR return self.view_functions[rule.endpoint](**req.view_args)
Aug 23 14:33:04 ps journal: pscheduler-api ERROR File "./pschedulerapiserver/tasks.py", line 477, in tasks
Aug 23 14:33:04 ps journal: pscheduler-api ERROR File "./pschedulerapiserver/dbcursor.py", line 112, in dbcursor_query
Aug 23 14:33:04 ps journal: pscheduler-api ERROR ProgrammingError: function api_task_post(unknown, text[], unknown, unknown, integer, unknown, boolean, unknown) does not exist
Aug 23 14:33:04 ps journal: pscheduler-api ERROR LINE 1: SELECT * FROM api_task_post('{"reference": {"psconfig": {"cr...
Aug 23 14:33:04 ps journal: pscheduler-api ERROR ^
Aug 23 14:33:04 ps journal: pscheduler-api ERROR HINT: No function matches the given name and argument types. You might need to add explicit type casts.
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR Restarting: ['/usr/libexec/pscheduler/daemons/archiver', '--daemon', '--pid-file', '/run/pscheduler-archiver.pid', '--dsn', '@/etc/pscheduler/database/database-dsn']
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR Program threw an exception after 0:00:00.012042
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR Exception: ProgrammingError: column "task_detail" does not exist
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR LINE 4: ... attempts, last_attempt, transform, task_detai...
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR ^
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR Traceback (most recent call last):
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR File "/usr/lib/python2.7/site-packages/pscheduler/saferun.py", line 72, in safe_run
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR function()
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR File "/usr/libexec/pscheduler/daemons/archiver", line 879, in <lambda>
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR pscheduler.safe_run(lambda: main_program())
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR File "/usr/libexec/pscheduler/daemons/archiver", line 835, in main_program
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR [options.max_parallel])
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR File "/usr/lib/python2.7/site-packages/pscheduler/db.py", line 193, in query
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR cursor.execute(query, args)
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR ProgrammingError: column "task_detail" does not exist
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR LINE 4: ... attempts, last_attempt, transform, task_detai...
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR ^
Aug 23 14:33:12 ps journal: safe_run/archiver ERROR Waiting 60.0 seconds before restarting
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR Restarting: ['/usr/libexec/pscheduler/daemons/archiver', '--daemon', '--pid-file', '/run/pscheduler-archiver.pid', '--dsn', '@/etc/pscheduler/database/database-dsn']
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR Program threw an exception after 0:00:00.012356
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR Exception: ProgrammingError: column "task_detail" does not exist
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR LINE 4: ... attempts, last_attempt, transform, task_detai...
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR ^
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR Traceback (most recent call last):
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR File "/usr/lib/python2.7/site-packages/pscheduler/saferun.py", line 72, in safe_run
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR function()
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR File "/usr/libexec/pscheduler/daemons/archiver", line 879, in <lambda>
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR pscheduler.safe_run(lambda: main_program())
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR File "/usr/libexec/pscheduler/daemons/archiver", line 835, in main_program
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR [options.max_parallel])
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR File "/usr/lib/python2.7/site-packages/pscheduler/db.py", line 193, in query
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR cursor.execute(query, args)
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR ProgrammingError: column "task_detail" does not exist
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR LINE 4: ... attempts, last_attempt, transform, task_detai...
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR ^
Aug 23 14:34:12 ps journal: safe_run/archiver ERROR Waiting 60.0 seconds before restarting
---------------------------------------------------------------------------
The first signs of trouble from when I tried to restore the data:
[root@ps BACKUPS]# /usr/lib/perfsonar/scripts/ps-migrate-restore.sh --data ps-backup_purdue_21Aug18.tgz
WARN: Looks like non-root user accounts were created prior to running this script. Skipping user account restoration to avoid conflicts
Restoring perfsonar configuration...[SUCCESS]
Restoring owamp-server configuration...[SUCCESS]
Restoring NTP configuration...[SUCCESS]
Restoring cassandra data for esmond...[SUCCESS]
Restoring postgresql data for esmond...[SUCCESS]
WARNING
You are about to obliterate the contents of the pScheduler
configuration and database on this host. Please confirm this by
typing today's date in YYYY-MM-DD format at the prompt below.
Confirm (YYYY-MM-DD): 2018-08-22
Restoring pScheduler from backup:
Unpacking backup... Done.
Restoring configuration... Done.
Restoring database... Done.
Updating database...ERROR: deadlock detected
DETAIL: Process 20475 waits for AccessExclusiveLock on relation 36307 of database 35991; blocked by process 20520.
Process 20520 waits for AccessShareLock on relation 36229 of database 35991; blocked by process 20475.
HINT: See server log for query details.
CONTEXT: SQL statement "DROP FUNCTION task_next_run(timestamptz, timestamptz, interval) CASCADE;"
PL/pgSQL function drop_function_all(text) line 34 at EXECUTE
SQL statement "SELECT drop_function_all('task_next_run')"
PL/pgSQL function inline_code_block line 1 at PERFORM
WARNING: Because of errors, pScheduler may be in an inoperable state.
Unable to restore pScheduler configuration
[root@ps BACKUPS]#
[root@ps BACKUPS]# /usr/lib/perfsonar/scripts/ps-migrate-restore.sh --data ps-backup_purdue_21Aug18.tgz
WARN: Looks like non-root user accounts were created prior to running this script. Skipping user account restoration to avoid conflicts
Restoring perfsonar configuration...[SUCCESS]
Restoring owamp-server configuration...[SUCCESS]
Restoring NTP configuration...[SUCCESS]
Restoring cassandra data for esmond...[SUCCESS]
Restoring postgresql data for esmond...[SUCCESS]
WARNING
You are about to obliterate the contents of the pScheduler
configuration and database on this host. Please confirm this by
typing today's date in YYYY-MM-DD format at the prompt below.
Confirm (YYYY-MM-DD): 2018-08-22
Restoring pScheduler from backup:
Unpacking backup... Done.
Restoring configuration... Done.
Restoring database... Done.
Updating database...ERROR: deadlock detected
DETAIL: Process 9315 waits for AccessExclusiveLock on relation 60308 of database 60105; blocked by process 9340.
Process 9340 waits for AccessShareLock on relation 60316 of database 60105; blocked by process 9315.
HINT: See server log for query details.
WARNING: Because of errors, pScheduler may be in an inoperable state.
Unable to restore pScheduler configuration
- [perfsonar-user] Error on data restore after CentOS6->7 upgrade, Kathy Benninger, 08/23/2018
- [perfsonar-user] Resolved - Re: Error on data restore after CentOS6->7 upgrade, Kathy Benninger, 08/23/2018
- Re: [perfsonar-user] Resolved - Re: Error on data restore after CentOS6->7 upgrade, Mark Feit, 08/23/2018
- Re: [perfsonar-user] Resolved - Re: Error on data restore after CentOS6->7 upgrade, Kathy Benninger, 08/23/2018
- Re: [perfsonar-user] Resolved - Re: Error on data restore after CentOS6->7 upgrade, Mark Feit, 08/23/2018
- [perfsonar-user] Resolved - Re: Error on data restore after CentOS6->7 upgrade, Kathy Benninger, 08/23/2018
Archive powered by MHonArc 2.6.19.