perfsonar-user - RE: [perfsonar-user] Archiving to esmond failed after CentOS6 -> CentOS7 migration

Subject: perfSONAR User Q&A and Other Discussion

  • From: Zhi-Wei Lu <>
  • To: Zhi-Wei Lu <>, Andrew Lake <>
  • Cc: "" <>
  • Subject: RE: [perfsonar-user] Archiving to esmond failed after CentOS6 -> CentOS7 migration
  • Date: Fri, 7 Sep 2018 16:33:38 +0000

Andy helped me find the latest problem: my mesh configuration file was missing write_url for a number of pinger tests.

The command

psconfig translate https://ucdhost/UCD-mesh.json

showed zero tests configured for the server.

After I fixed the write_url entries in the mesh configuration file, my tests started to work again.
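
A check for this kind of omission can be sketched in Python. This is only an illustration under stated assumptions: it treats any JSON object that has a read_url but no write_url as suspect, and the sample document, its "measurement_archives" key, and its URLs are hypothetical stand-ins, not the real UCD-mesh.json.

```python
import json


def find_archives_missing_write_url(node, path="$"):
    """Yield the JSON path of every object that has read_url but no write_url."""
    if isinstance(node, dict):
        if "read_url" in node and not node.get("write_url"):
            yield path
        for key, value in node.items():
            yield from find_archives_missing_write_url(value, f"{path}.{key}")
    elif isinstance(node, list):
        for index, item in enumerate(node):
            yield from find_archives_missing_write_url(item, f"{path}[{index}]")


# Hypothetical fragment for illustration only; not the real mesh file.
SAMPLE_MESH = json.loads("""
{
  "measurement_archives": [
    {"type": "pinger",
     "read_url": "https://ucdhost/esmond/perfsonar/archive"},
    {"type": "traceroute",
     "read_url": "https://ucdhost/esmond/perfsonar/archive",
     "write_url": "https://ucdhost/esmond/perfsonar/archive"}
  ]
}
""")

missing = list(find_archives_missing_write_url(SAMPLE_MESH))
print(missing)
```

Run against a locally saved copy of the mesh file (loaded with json.load), an empty list would suggest that every archive entry carrying a read_url also carries a write_url.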

 

I do need to drop all esmond tables in the postgresql database before running the restore script.

 

Thanks again Andy!

 

Zhi-Wei Lu

IET-CR-Network Operations Center

University of California, Davis

(530) 752-0155

 

From: <> On Behalf Of Zhi-Wei Lu
Sent: Thursday, September 06, 2018 12:35 PM
To: Andrew Lake <>
Cc:
Subject: Re: [perfsonar-user] Archiving to esmond failed after CentOS6 -> CentOS7 migration

 

I have dropped all tables in esmond (postgresql) and have run the migrate-restore script one more time.

 

Now I see runs for traceroute but not for ping or throughput in the log file:

 

Sep  6 12:27:08 mammoth-owamp-v4 journal: runner INFO     1575231: With traceroute: trace --dest mach1-v4.noc.ucdavis.edu --length 40 --source mammoth-v4.noc.ucdavis.edu --hops 64

Sep  6 12:27:08 mammoth-owamp-v4 journal: runner INFO     1575231: Run succeeded.

Sep  6 12:29:22 mammoth-owamp-v4 journal: runner INFO     1575387: With traceroute: trace --dest fumee-v4.noc.ucdavis.edu --length 40 --source mammoth-v4.noc.ucdavis.edu --hops 64

Sep  6 12:29:22 mammoth-owamp-v4 journal: runner INFO     1575387: Run succeeded.
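
One way to confirm which test types are actually running is to tally the runner lines in the log. The following Python sketch assumes the line layout seen in the excerpt above ("runner INFO <number>: With <tool>: <test> ..."); the set of expected test names ("throughput", "rtt", "trace") is an assumption about what the mesh defines.

```python
import re
from collections import Counter

# Matches runner lines such as:
#   "... journal: runner INFO     1575231: With traceroute: trace --dest ..."
RUN_LINE = re.compile(r"runner INFO\s+\d+: With (?P<tool>[^:]+): (?P<test>\S+)")


def count_runs_by_test(lines):
    """Count started runs per test type; lines that do not match are ignored."""
    counts = Counter()
    for line in lines:
        match = RUN_LINE.search(line)
        if match:
            counts[match.group("test")] += 1
    return counts


LOG_EXCERPT = [
    "Sep  6 12:27:08 mammoth-owamp-v4 journal: runner INFO     1575231: With traceroute: trace --dest mach1-v4.noc.ucdavis.edu --length 40 --source mammoth-v4.noc.ucdavis.edu --hops 64",
    "Sep  6 12:27:08 mammoth-owamp-v4 journal: runner INFO     1575231: Run succeeded.",
    "Sep  6 12:29:22 mammoth-owamp-v4 journal: runner INFO     1575387: With traceroute: trace --dest fumee-v4.noc.ucdavis.edu --length 40 --source mammoth-v4.noc.ucdavis.edu --hops 64",
    "Sep  6 12:29:22 mammoth-owamp-v4 journal: runner INFO     1575387: Run succeeded.",
]

# Assumed names for the three test types the mesh is expected to schedule.
EXPECTED_TESTS = {"throughput", "rtt", "trace"}

counts = count_runs_by_test(LOG_EXCERPT)
missing = sorted(EXPECTED_TESTS - set(counts))
print(dict(counts))
print("no runs seen for:", missing)
```

To use it on a live system, replace LOG_EXCERPT with the lines read from /var/log/pscheduler/pscheduler.log.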

 

The system's /etc/perfsonar/psconfig/pscheduler-agent.json contains:

 

{

   "remotes" : [

      {

         "url" : "https://ucdhost/UCD-mesh.json"

      }

   ]

}
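
As a quick sanity check, the agent file can be read and its remote URLs listed. This Python sketch relies only on the "remotes" and "url" keys visible in the file above; the rule that every remote must carry a non-empty string "url" is an assumption made for the example, not a statement about what pSConfig itself enforces.

```python
import json

# Same content as the agent file shown above.
AGENT_JSON = """
{
   "remotes" : [
      {
         "url" : "https://ucdhost/UCD-mesh.json"
      }
   ]
}
"""


def remote_urls(agent_config):
    """Return the url of each remote, raising if one is missing or empty."""
    urls = []
    for index, remote in enumerate(agent_config.get("remotes", [])):
        url = remote.get("url")
        if not isinstance(url, str) or not url:
            raise ValueError(f"remote #{index} has no usable 'url'")
        urls.append(url)
    return urls


urls = remote_urls(json.loads(AGENT_JSON))
print(urls)
```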

 

The UCD-mesh defines runs: throughput, ping, and traceroute.  What daemons do I need to restart again?

 

Thank you.

 

Zhi-Wei Lu

 

IET-CR-Network Operations Center

University of California, Davis

(530) 752-0155


From: <> on behalf of Zhi-Wei Lu <>
Sent: Thursday, September 6, 2018 9:26:36 AM
To: Andrew Lake
Cc:
Subject: Re: [perfsonar-user] Archiving to esmond failed after CentOS6 -> CentOS7 migration

 

Thanks, Andy!

 

It turns out that the "authtoken_token" in the esmond database has the "NEW" value rather than the migrated value.

I looked at the backup and restore scripts and examined the PostgreSQL database.  I re-ran the restore, but it didn't replace the "NEW" value.  I had to drop the authtoken_token table (that one table only) and re-run the restore script.  It then restored the "old" authtoken values.  After that, I ran the following script again and restarted a few daemons last night:

 

/usr/lib/perfsonar/scripts/system_environment/configure_esmond --force

 

In /var/log/pscheduler/pscheduler.log, we can see successful runs:

...

Sep  6 09:18:39 mammoth-owamp-v4 journal: runner INFO     1575526: With traceroute: trace --dest melange-v4.noc.ucdavis.edu --length 40 --source mammoth-v4.noc.ucdavis.edu --hops 64

Sep  6 09:18:39 mammoth-owamp-v4 journal: runner INFO     1575526: Run succeeded.

...

 

However, I do not see any new data in the web GUI:

 

http://mammoth.noc.ucdavis.edu/toolkit/

 

Maybe there are still inconsistencies between the PostgreSQL esmond database and the Cassandra data (new values mixed in that were not restored properly).  I will try to drop all esmond tables and re-run the restore script.

 

Thanks again Andy!

 

Zhi-Wei Lu

IET-CR-Network Operations Center

University of California, Davis

(530) 752-0155


From: Andrew Lake <>
Sent: Thursday, September 6, 2018 6:06:44 AM
To: Zhi-Wei Lu
Cc:
Subject: Re: [perfsonar-user] Archiving to esmond failed after CentOS6 -> CentOS7 migration

 

Hi,

 

I think one of the steps you have taken may have gotten things working again; possibly it took some time to propagate. If I look at your newest tasks, they seem to be archiving successfully to localhost. For example see:

 

 

Notice the first JSON blob labelled "archivings" and how it indicates it succeeded in the "diags" section. 

 

Andy

 

 

 

 

On Wed, Sep 5, 2018 at 1:46 PM, Zhi-Wei Lu <> wrote:

Hi all,

 

I have just upgraded and migrated a few of my CentOS 6 perfsonar servers to CentOS 7.  I don’t see any new tests recorded.  I then noticed error messages in /var/log/pscheduler/pscheduler.log:

 

Sep  5 10:28:37 mammoth-owamp-v4 journal: runner INFO     1575513: With ping: rtt --count 10 --dest grx-v4.noc.ucdavis.edu --interval PT1S --source mammoth-owamp-v4.noc.ucdavis.edu --length 1000

Sep  5 10:28:46 mammoth-owamp-v4 journal: runner INFO     1575513: Run succeeded.

Sep  5 10:28:47 mammoth-owamp-v4 journal: archiver WARNING  1473429: Failed to archive https://localhost/pscheduler/tasks/44d685eb-40c6-48eb-90c9-fc5dd427e3cc/runs/79c06354-dd20-48c1-afe4-e6bf4fdd0ae2 to esmond: 401: Invalid token.

Sep  5 10:29:02 mammoth-owamp-v4 journal: archiver WARNING  1472536: Failed to archive https://localhost/pscheduler/tasks/c2a13eb2-b022-443d-95b9-42fa62c1859c/runs/44381ff1-71c4-4875-922f-c9cc9b253c29 to esmond: 401: Invalid token.

Sep  5 10:29:47 mammoth-owamp-v4 journal: archiver WARNING  1473429: Failed to archive https://localhost/pscheduler/tasks/44d685eb-40c6-48eb-90c9-fc5dd427e3cc/runs/79c06354-dd20-48c1-afe4-e6bf4fdd0ae2 to esmond: 401: Invalid token.

Sep  5 10:34:47 mammoth-owamp-v4 journal: archiver WARNING  1473429: Failed to archive https://localhost/pscheduler/tasks/44d685eb-40c6-48eb-90c9-fc5dd427e3cc/runs/79c06354-dd20-48c1-afe4-e6bf4fdd0ae2 to esmond: Archiver permanently abandoned registering test after 3 attempt(s): 401: Invalid token.

Sep  5 10:34:47 mammoth-owamp-v4 journal: archiver WARNING  1473429: Gave up archiving https://localhost/pscheduler/tasks/44d685eb-40c6-48eb-90c9-fc5dd427e3cc/runs/79c06354-dd20-48c1-afe4-e6bf4fdd0ae2 to esmond
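
Warnings like these can be grouped per run to show how often each archiving attempt failed and whether it was finally abandoned. The Python sketch below assumes only the two message shapes seen in the excerpt above ("Failed to archive <url> to <archiver>: <reason>" and "Gave up archiving <url> to <archiver>"); the sample lines are rebuilt from that excerpt.

```python
import re

FAILED = re.compile(
    r"archiver WARNING\s+\d+: Failed to archive (?P<url>\S+) "
    r"to (?P<archiver>[^:\s]+): (?P<reason>.*)$"
)
GAVE_UP = re.compile(
    r"archiver WARNING\s+\d+: Gave up archiving (?P<url>\S+) to (?P<archiver>\S+)"
)


def summarize_archiver_warnings(lines):
    """Map each run URL to its failure count, last reason, and abandoned flag."""
    summary = {}
    for line in lines:
        failed = FAILED.search(line)
        hit = failed or GAVE_UP.search(line)
        if not hit:
            continue
        entry = summary.setdefault(
            hit.group("url"),
            {"failures": 0, "abandoned": False, "last_reason": None},
        )
        if failed:
            entry["failures"] += 1
            entry["last_reason"] = failed.group("reason")
        else:
            entry["abandoned"] = True
    return summary


RUN_A = "https://localhost/pscheduler/tasks/44d685eb-40c6-48eb-90c9-fc5dd427e3cc/runs/79c06354-dd20-48c1-afe4-e6bf4fdd0ae2"
RUN_B = "https://localhost/pscheduler/tasks/c2a13eb2-b022-443d-95b9-42fa62c1859c/runs/44381ff1-71c4-4875-922f-c9cc9b253c29"
HOST = "mammoth-owamp-v4 journal: archiver WARNING "

LOG_EXCERPT = [
    f"Sep  5 10:28:47 {HOST} 1473429: Failed to archive {RUN_A} to esmond: 401: Invalid token.",
    f"Sep  5 10:29:02 {HOST} 1472536: Failed to archive {RUN_B} to esmond: 401: Invalid token.",
    f"Sep  5 10:29:47 {HOST} 1473429: Failed to archive {RUN_A} to esmond: 401: Invalid token.",
    f"Sep  5 10:34:47 {HOST} 1473429: Failed to archive {RUN_A} to esmond: Archiver permanently abandoned registering test after 3 attempt(s): 401: Invalid token.",
    f"Sep  5 10:34:47 {HOST} 1473429: Gave up archiving {RUN_A} to esmond",
]

summary = summarize_archiver_warnings(LOG_EXCERPT)
for url, info in summary.items():
    print(url.rsplit("/", 1)[-1], info["failures"], info["abandoned"], info["last_reason"])
```

A reason containing "401" for every run, as here, points at the credentials the archiver presents to esmond rather than at individual tests.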

 

I guess that “localhost” probably should be the real host name, is that right? As I have a few hostnames for the server, which one should I use?  How do I configure perfsonar to use that name?

 

Thank you!

 

Zhi-Wei Lu

IET-CR-Network Operations Center

University of California, Davis

(530) 752-0155

 



 



