perfsonar-user - RE: [perfsonar-user] Archiving to esmond failed after CentOS6 -> CentOS7 migration
Subject: perfSONAR User Q&A and Other Discussion
- From: Zhi-Wei Lu <>
- To: Zhi-Wei Lu <>, Andrew Lake <>
- Subject: RE: [perfsonar-user] Archiving to esmond failed after CentOS6 -> CentOS7 migration
- Date: Fri, 7 Sep 2018 16:33:38 +0000
Andy helped me find the latest problem. My mesh configuration file was missing write_url for a number of pinger tests, so the command

psconfig translate https://ucdhost/UCD-mesh.json

showed zero tests configured for the server. After I fixed the write_url entries in the mesh configuration file, my tests started to work again. I did need to drop all esmond tables in the postgresql database before running the restore script.

Thanks again Andy!

Zhi-Wei Lu
IET-CR-Network Operations Center
University of California, Davis
(530) 752-0155

From: <>
On Behalf Of Zhi-Wei Lu

I have dropped all tables in esmond (postgresql) and have run the migrate-restore script one more time. Now I see runs for traceroute, but not for ping or throughput, in the log file:

Sep 6 12:27:08 mammoth-owamp-v4 journal: runner INFO 1575231: Running https://mammoth-v4.noc.ucdavis.edu/pscheduler/tasks/9951d807-57ed-47f1-b5dc-924e88256601/runs/91fec1cf-4642-48db-be78-0f46243fa2a7
Sep 6 12:27:08 mammoth-owamp-v4 journal: runner INFO 1575231: With traceroute: trace --dest mach1-v4.noc.ucdavis.edu --length 40 --source mammoth-v4.noc.ucdavis.edu --hops 64
Sep 6 12:27:08 mammoth-owamp-v4 journal: runner INFO 1575231: Run succeeded.
Sep 6 12:29:22 mammoth-owamp-v4 journal: runner INFO 1575387: Running https://mammoth-v4.noc.ucdavis.edu/pscheduler/tasks/30d677d6-fc6c-4dfd-8edc-8be365a846f0/runs/642479c3-2f9e-4eeb-b654-fdc49e074023
Sep 6 12:29:22 mammoth-owamp-v4 journal: runner INFO 1575387: With traceroute: trace --dest fumee-v4.noc.ucdavis.edu --length 40 --source mammoth-v4.noc.ucdavis.edu --hops 64
Sep 6 12:29:22 mammoth-owamp-v4 journal: runner INFO 1575387: Run succeeded.

The system file /etc/perfsonar/psconfig/pscheduler-agent.json contains:

{
  "remotes" : [
    { "url" : "https://ucdhost/UCD-mesh.json" }
  ]
}

The UCD-mesh defines three kinds of runs: throughput, ping, and traceroute. What daemons do I need to restart again?
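To confirm which kinds of runs actually reach the runner, the "With ..." lines in the log can be tallied. The following is a minimal Python sketch, not part of perfSONAR. It assumes the line layout seen in the excerpt above, and it assumes that the word after "With" names the tool and the next word names the test type (here, tool "traceroute" and test "trace"). The function name count_test_types is hypothetical.

```python
import re
from collections import Counter

# Matches runner lines shaped like the ones quoted in this thread, e.g.
#   "... journal: runner INFO 1575231: With traceroute: trace --dest ..."
# Assumption: first word after "With" is the tool, the next is the test type.
WITH_LINE = re.compile(r"runner INFO \d+: With (?P<tool>\S+): (?P<test>\S+)")


def count_test_types(lines):
    """Count runs per test type across an iterable of pscheduler log lines."""
    counts = Counter()
    for line in lines:
        match = WITH_LINE.search(line)
        if match:
            counts[match.group("test")] += 1
    return counts


if __name__ == "__main__":
    # Log path taken from the thread; adjust if your system logs elsewhere.
    with open("/var/log/pscheduler/pscheduler.log", errors="replace") as log:
        for test_type, runs in count_test_types(log).most_common():
            print(f"{test_type}: {runs}")
```

If only one test type shows up in the tally, the other tests are probably not being scheduled at all, which points at the configuration rather than at the archiver.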
Thank you.

Zhi-Wei Lu
IET-CR-Network Operations Center
University of California, Davis
(530) 752-0155

From: <> on behalf of Zhi-Wei Lu <>

Thanks, Andy!

It turns out that "authtoken_token" in the esmond database had the "NEW" value rather than the migrated value. I looked at the backup and restore scripts and examined the postgresql database. I re-ran the restore of the database, but it didn't replace the "new" value. I had to drop the authtoken_token table (one table only) and re-run the restore script one more time. It then restored the "old" authtoken values. After that I ran the following script again and restarted a few daemons last night:

/usr/lib/perfsonar/scripts/system_environment/configure_esmond --force

In /var/log/pscheduler/pscheduler.log, we can see successful runs:

...
Sep 6 09:18:39 mammoth-owamp-v4 journal: runner INFO 1575526: Running https://mammoth-v4.noc.ucdavis.edu/pscheduler/tasks/dbc2f80b-d785-4366-94fd-bc64d2a042ba/runs/5e0f34cc-1f84-4499-93ce-1774a0df7f1c
Sep 6 09:18:39 mammoth-owamp-v4 journal: runner INFO 1575526: With traceroute: trace --dest melange-v4.noc.ucdavis.edu --length 40 --source mammoth-v4.noc.ucdavis.edu --hops 64
Sep 6 09:18:39 mammoth-owamp-v4 journal: runner INFO 1575526: Run succeeded.
...

However, I do not see any new data in the web GUI at http://mammoth.noc.ucdavis.edu/toolkit/

Maybe there are still inconsistencies between the postgresql esmond database and the cassandra data sources (mixed with new values which were not restored properly). I will try to drop all esmond tables and re-run the restore script.

Thanks again Andy!

Zhi-Wei Lu
IET-CR-Network Operations Center
University of California, Davis
(530) 752-0155

From: Andrew Lake <>

Hi,

I think maybe one of the steps you have done has gotten things working again; possibly it took some time to propagate. If I look at your newest tasks, they seem to be archiving successfully to localhost. For example see:

Notice the first JSON blob labelled "archivings" and how it indicates it succeeded in the "diags" section.

Andy

On Wed, Sep 5, 2018 at 1:46 PM, Zhi-Wei Lu <> wrote:
|
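The root cause reported in the most recent message, archive entries in the mesh file with no write_url, can be checked for before publishing the mesh. The following is a minimal Python sketch under stated assumptions, not a perfSONAR tool. The thread does not show the mesh file, so the key names "measurement_archives", "type" and "read_url" are assumptions about its layout; only "write_url" and "pinger" come from the thread. The function name and the example URL are hypothetical.

```python
import json


def find_archives_missing_write_url(node, path="$"):
    """Return (json_path, type) pairs for archive-like objects with no write_url.

    Assumption: an archive entry is any JSON object that has both a "type"
    and a "read_url" key. Adjust this test to match the real mesh schema.
    """
    missing = []
    if isinstance(node, dict):
        if "type" in node and "read_url" in node and not node.get("write_url"):
            missing.append((path, node["type"]))
        for key, value in node.items():
            missing.extend(find_archives_missing_write_url(value, f"{path}.{key}"))
    elif isinstance(node, list):
        for index, item in enumerate(node):
            missing.extend(find_archives_missing_write_url(item, f"{path}[{index}]"))
    return missing


if __name__ == "__main__":
    # Hypothetical fragment for illustration only; not the real UCD-mesh.json.
    mesh = json.loads("""
    {"measurement_archives": [
        {"type": "pinger", "read_url": "https://example.invalid/esmond/"}
    ]}
    """)
    for json_path, archive_type in find_archives_missing_write_url(mesh):
        print(f"{json_path}: archive of type {archive_type!r} has no write_url")
```

Walking the whole document rather than one fixed location keeps the check useful even if archive entries appear at several nesting levels, at the cost of relying on the assumed key names to recognize them.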