Skip to Content.
Sympa Menu

perfsonar-user - Re: [perfsonar-user] Archiving to esmond failed after CentOS6 -> CentOS7 migration

Subject: perfSONAR User Q&A and Other Discussion

List archive

Re: [perfsonar-user] Archiving to esmond failed after CentOS6 -> CentOS7 migration


Chronological Thread 
  • From: Zhi-Wei Lu <>
  • To: Andrew Lake <>
  • Cc: "" <>
  • Subject: Re: [perfsonar-user] Archiving to esmond failed after CentOS6 -> CentOS7 migration
  • Date: Thu, 6 Sep 2018 19:35:24 +0000
  • Accept-language: en-US
  • Authentication-results: spf=none (sender IP is ) ;
  • Ironport-phdr: 9a23:LGcYqxHpS1SNDP1JPScQ6Z1GYnF86YWxBRYc798ds5kLTJ76pM2ybnLW6fgltlLVR4KTs6sC17KJ9fi4EUU7or+5+EgYd5JNUxJXwe43pCcHRPC/NEvgMfTxZDY7FskRHHVs/nW8LFQHUJ2mPw6arXK99yMdFQviPgRpOOv1BpTSj8Oq3Oyu5pHfeQpFiCa/bL9oMBm6sRjau9ULj4dlNqs/0AbCrGFSe+RRy2NoJFaTkAj568yt4pNt8Dletuw4+cJYXqr0Y6o3TbpDDDQ7KG81/9HktQPCTQSU+HQRVHgdnwdSDAjE6BH6WYrxsjf/u+Fg1iSWIdH6QLYpUjmk8qxlSgLniD0fOjA57m/Zl9BwgqxYrhKvpRN/wpLbb46OOfVkYq/RYckXSXZdUstRUSFKH4Oyb5EID+oEJetXrIn9p1oXoRS+GwasBf7kxDlVhnTr2qA70uogEQXb3AE7AtIBrGnbo8/rNKcTSuC51rfHwijeb/5P3zr29YbGchckof6WXLJwd9LcyVQzGAzYlFqQt43lPyiP2usTrmeb8vJsVeOpi247tQ5xuD6vydkwioXXnI4a1E3L9ThhzIYyON24VEp7YdmjEJtTrS2VK4x2QsYkTmp1uyg60qULtYChcyQW1ZgqwgPTZ+Gaf4WN7BLuW/qdLSt9iXJrZr2yiBm//E2lx+D9SMW51E5GoytHn9XWq3wA1h3e5tKaRvZy/0qtwyuD2x7O5u1cL0A7i6nWJ4Ikz7M1lJcfrUvOEyHzlUX4jKKWeFgo9+225OnoYrjmqJqROoFphQ7jNKklh9axDv4iMgcUWmiW4eS826Pn/U3+WLhEluE7nK7FvJzEPMgVu7a3DxJM3oYk8BmwES2q0NMFnXkbN11Ffw+Hj470NF3UOPD4F/C/g0iynzh32/DGP7rhApPXInjEjbfhYbJ960lbyAow19xQ+5VUCrQZLPLyXE/+qsDYAwcnPwOu3+rrFdBw2p4DVW6SB6KZPqzfvUOU6u8qLemDeoAYtTn4JvUg5vPik3s0lUcYfaaz3JsXbH64Hu5hI0WceXftg9kAEWMXvgclUezllECNXiRNZ3azRKIw/C80B5+7DYvbXICinKSB3DunHp1Rfm1GEk6DEW3md4WfXPcMbjidItV4njwZTriuVZUh2AqqtA/70LpnMvHU9jMCuZLi0th1+/PclQs09TNqE8SRzXuBQH9pkWMVFHcL2rtiqxl91kubyvo/xPpRTYUV/PpHXwM7M53HifRgDMz/UR7AedHPDlG7XtGhBjJ3RdU0hIgTeUIoG8m+lguOhAPnWZFYzZKPBYco6eTd2WT8PcBw1yyA2aU8wgMIWdpKLWTr3IBk9gfVAIDH3h+3j6uwe78b2ifXsXqYwHDAsEhEWVsoC7GdBGQLahOHoI70uE2bHuH3Vel6bVRPxJXaeqcUM4Gw0AQfTf33MYSGPzy/xTvuWx3Qmr3UNNuyJzsWjCnUBhlenQ5Np3ySbRxmEnn6/T2HXGA+TQi2Oxi0rbEu8T2gQkYoxlSDVFNu5r2l8U4op9OGQN8j4rYUijcIoAdyG1yEz871RoCu6lBPX/VAfNkU7GlJ5C+Cm0JnM4etNb5vigtGIDlxpF7kghVrFp1bw49tqHIx0BE0KKSE3UlHei/Cm538J/rMO2zq9Ve0aqHQ3V/Ql86b4KkU7/IxsRD+pwyzRXYlplxh38NYzDO475bHRF4bV57geksssR53u+eJTDM64tbm1XwpDqS1+hvfk4Y7G+844gurc95BMbjCGQPvRZ5JT/OyIfAnzgD6JikPO/pfofZuZZGvaueG1ai3Pe1pgDOhiyFd7Ztg1l6XqnsuUfbGipACxfzQnhCKUTvxlh+Aioj2gsgFAFNaBW+j0W7hDY9VaLd1eNMZFmOzC9C8ytxghoWrVnJFpxau
  • Spamdiagnosticmetadata: NSPM
  • Spamdiagnosticoutput: 1:99

I have dropped all tables in esmond (postgresql) and have run the migrate-restore script one more time.


Now I see runs for traceroute but not for ping or throughput in the log file:


Sep  6 12:27:08 mammoth-owamp-v4 journal: runner INFO     1575231: Running https://mammoth-v4.noc.ucdavis.edu/pscheduler/tasks/9951d807-57ed-47f1-b5dc-924e88256601/runs/91fec1cf-4642-48db-be78-0f46243fa2a7
Sep  6 12:27:08 mammoth-owamp-v4 journal: runner INFO     1575231: With traceroute: trace --dest mach1-v4.noc.ucdavis.edu --length 40 --source mammoth-v4.noc.ucdavis.edu --hops 64
Sep  6 12:27:08 mammoth-owamp-v4 journal: runner INFO     1575231: Run succeeded.
Sep  6 12:29:22 mammoth-owamp-v4 journal: runner INFO     1575387: Running https://mammoth-v4.noc.ucdavis.edu/pscheduler/tasks/30d677d6-fc6c-4dfd-8edc-8be365a846f0/runs/642479c3-2f9e-4eeb-b654-fdc49e074023
Sep  6 12:29:22 mammoth-owamp-v4 journal: runner INFO     1575387: With traceroute: trace --dest fumee-v4.noc.ucdavis.edu --length 40 --source mammoth-v4.noc.ucdavis.edu --hops 64
Sep  6 12:29:22 mammoth-owamp-v4 journal: runner INFO     1575387: Run succeeded.

The system /etc/perfsonar/psconfig/pscheduler-agent.json

{
   "remotes" : [
      {
         "url" : "https://ucdhost/UCD-mesh.json"
      }
   ]
}

The UCD-mesh defines runs: throughtput, ping, and traceroute.  What daemons do I need to restart again?


Thank you.


Zhi-Wei Lu

IET-CR-Network Operations Center
University of California, Davis
(530) 752-0155

From: <> on behalf of Zhi-Wei Lu <>
Sent: Thursday, September 6, 2018 9:26:36 AM
To: Andrew Lake
Cc:
Subject: Re: [perfsonar-user] Archiving to esmond failed after CentOS6 -> CentOS7 migration
 

Thanks, Andy!


It turns out that the "authtoken_token" in the esmond database has the "NEW" value rather than the migrated value.

I looked at the backup and restore scripts and examined the postgresql database.  I re-"restore" the database, but it

didn't replace the "new" value.  I had to drop the authtoken_token table (one table only) and re-run the restore script one more time.

It then restored the "old" authtoken values. After that I ran the scirpt again and restarted a few daemon last night,


/usr/lib/perfsonar/scripts/system_environment/configure_esmond --force


In the /var/log/pscheduler/pscheduler.log, we can see successful runs,

...

Sep  6 09:18:39 mammoth-owamp-v4 journal: runner INFO     1575526: Running https://mammoth-v4.noc.ucdavis.edu/pscheduler/tasks/dbc2f80b-d785-4366-94fd-bc64d2a042ba/runs/5e0f34cc-1f84-4499-93ce-1774a0df7f1c
Sep  6 09:18:39 mammoth-owamp-v4 journal: runner INFO     1575526: With traceroute: trace --dest melange-v4.noc.ucdavis.edu --length 40 --source mammoth-v4.noc.ucdavis.edu --hops 64
Sep  6 09:18:39 mammoth-owamp-v4 journal: runner INFO     1575526: Run succeeded.
...

However, I do not see any new data in the web-gui


http://mammoth.noc.ucdavis.edu/toolkit/


Maybe, there are still inconsistency in the postgresql esmond database with cassandra data sources (mixing with new values which were not restore properly).  I will try to drop all esmond tables and re-run restore script.


Thanks again Andy!


Zhi-Wei Lu
IET-CR-Network Operations Center
University of California, Davis
(530) 752-0155

From: Andrew Lake <>
Sent: Thursday, September 6, 2018 6:06:44 AM
To: Zhi-Wei Lu
Cc:
Subject: Re: [perfsonar-user] Archiving to esmond failed after CentOS6 -> CentOS7 migration
 
Hi,

I think maybe one of the steps you have done has gotten things working again, possible it took some time to propagate. If I look at your newest tasks they seem to be archiving successfully to localhost. For example see:


Notice the first JSON blob labelled "archivings" and how it indicates it succeeded in the "diags" section. 

Andy




On Wed, Sep 5, 2018 at 1:46 PM, Zhi-Wei Lu <> wrote:

Hi all,

 

I have just upgraded and migrated a few of my CentOS 6 perfsonar servers to CentOS 7.  I don’t see any new tests recorded.  I then noticed error messages in /var/log/ pscheduler/pscheduler.log

 

Sep  5 10:28:37 mammoth-owamp-v4 journal: runner INFO     1575513: With ping: rtt --count 10 --dest grx-v4.noc.ucdavis.edu --interval PT1S --source mammoth-owamp-v4.noc.ucdavis.edu --length 1000

Sep  5 10:28:46 mammoth-owamp-v4 journal: runner INFO     1575513: Run succeeded.

Sep  5 10:28:47 mammoth-owamp-v4 journal: archiver WARNING  1473429: Failed to archive https://localhost/pscheduler/tasks/44d685eb-40c6-48eb-90c9-fc5dd427e3cc/runs/79c06354-dd20-48c1-afe4-e6bf4fdd0ae2 to esmond: 401: Invalid token.

Sep  5 10:29:02 mammoth-owamp-v4 journal: archiver WARNING  1472536: Failed to archive https://localhost/pscheduler/tasks/c2a13eb2-b022-443d-95b9-42fa62c1859c/runs/44381ff1-71c4-4875-922f-c9cc9b253c29 to esmond: 401: Invalid token.

Sep  5 10:29:47 mammoth-owamp-v4 journal: archiver WARNING  1473429: Failed to archive https://localhost/pscheduler/tasks/44d685eb-40c6-48eb-90c9-fc5dd427e3cc/runs/79c06354-dd20-48c1-afe4-e6bf4fdd0ae2 to esmond: 401: Invalid token.

Sep  5 10:34:47 mammoth-owamp-v4 journal: archiver WARNING  1473429: Failed to archive https://localhost/pscheduler/tasks/44d685eb-40c6-48eb-90c9-fc5dd427e3cc/runs/79c06354-dd20-48c1-afe4-e6bf4fdd0ae2 to esmond: Archiver permanently abandoned registering test after 3 attempt(s): 401: Invalid token.

Sep  5 10:34:47 mammoth-owamp-v4 journal: archiver WARNING  1473429: Gave up archiving https://localhost/pscheduler/tasks/44d685eb-40c6-48eb-90c9-fc5dd427e3cc/runs/79c06354-dd20-48c1-afe4-e6bf4fdd0ae2 to esmond

 

I guess that “localhost” probably should be the real host name, is that right? As I have a few hostnames for the server, which one I should use?  How do I configure perfsonar to use that name?

 

Thank you!

 

Zhi-Wei Lu

IET-CR-Network Operations Center

University of California, Davis

(530) 752-0155

 


--
To unsubscribe from this list: https://lists.internet2.edu/sympa/signoff/perfsonar-user





Archive powered by MHonArc 2.6.19.

Top of Page