perfsonar-user - AW: [perfsonar-user] meshconfig problems
Subject: perfSONAR User Q&A and Other Discussion
List archive
- From: "Garnizov, Ivan (RRZE)" <>
- To: Pete Siemsen <>, "" <>
- Subject: AW: [perfsonar-user] meshconfig problems
- Date: Tue, 14 Aug 2018 08:57:23 +0000
- Accept-language: en-GB, de-DE, en-US
- Ironport-phdr: 9a23:WDOVKBdkGNwayVvt7LHeTJCWlGMj4u6mDksu8pMizoh2WeGdxc24YBKN2/xhgRfzUJnB7Loc0qyK6/6mATRIyK3CmUhKSIZLWR4BhJdetC0bK+nBN3fGKuX3ZTcxBsVIWQwt1Xi6NU9IBJS2PAWK8TW94jEIBxrwKxd+KPjrFY7OlcS30P2594HObwlSizexfbJ/IA+qoQnNq8IbnZZsJqEtxxXTv3BGYf5WxWRmJVKSmxbz+MK994N9/ipTpvws6ddOXb31cKokQ7NYCi8mM30u683wqRbDVwqP6WACXWgQjxFFHhLK7BD+Xpf2ryv6qu9w0zSUMMHqUbw5Xymp4rx1QxH0ligIKz858HnWisNuiqJbvAmhrAF7z4LNfY2ZKOZycqbbcNgHR2ROQ9xRWjRBDI2icoUPE+QPM+VWr4b/plsBsRSxCBK2C+/zzjJFnGP60bE43uknDArI3BYgH9ULsHnMotn7Nb0dUfqzzKnJzDTDaO1W1S/g44bLbBAuu/eMVq90fcTN1UcgFAzLglCepYf4OD6V1/oNvHKd7+Z6WuKgkXQnqwdwojmuwccslJLEhowPxVDZ7yl23Zg6KMS+RUVmb9CkF55QuDubN4twWs4jTWBouD0mxb0Ap5G7YDIGyJI5yB7Dc/CHdo+I4gj9W+qLJzd3mm5ldKqihxa09Uis0PPzWdSp0FpSqSpFlMfDtnYT2BzP8MSHTOVy/lu51TaRywDf8P9LLl0plabDM54hxLAxmoMcsUTCES/5gl32jKiOdkk+/Oin9fjnbq36qZCCMY90ix3xPr4omsOhHeQ0Kg4OUHaB+eS7zrHj+1P2QKlSgv01iKbZtoraKtodpqKjBQ9azJoj5w6xDze839QYhWMLLFRbdxKbl4TpNUvBIO3iAfiinVugiDJrx/HaMb3hGJnNKGbMkKvncL1g6EBc0BI+wc5D655IF70MJfP+V0HyudHdFBA1LxC4z/riBdlgyo8SR2KCD6qcPa/PsVKE/v8jLueRaIMPpTrwKPso6+TtgHMkg1MRY6ak0J8PZHymAvhqP1iVbH/tj9oPEmoHuxAxQPD2hFGeVDNfemq+U7w86zw+DY+rA4TOS4awi7Kbxii7BIdZZmVeB1CMD3joc4KEVu8JaCOdO8NhiD0FWaG/R4M7zxGurg77xKR9IurJ/C0YtIvv28Zv5+3Xjx4y8zN0ANqb022XUm57gHsERz4w3K9hvUxw0kmP3bRkj/BFENFe5v1EXwQ/OJHG0+B2F8z+VwfcctuVVlqrQtCrDDQsQt8x29MOZlxyG9SmjhDNxSqqBLoVmqSKBJwq6K3c2WL9J8lnxHbc0qkhjlgnQsxRNWC9iK5w7RXTC5TUnEqHj6qqb7gT3DbR9GefymqDpFlYXxBqUaXLRnAfYU3Wos7j5k/bUr+uEqooMhFaxM6GK6tKccHpjU5YSPv5OdTeZX6xlHmqBRaO2LyMcJTmd38D0yrAFUhX2zwUqE6LKQV2PSaspnnTCnQ6DlPiZljh98Fjo3qhCEI40lfOJwd52qC74RkTjObZVugexJoFvjsssTN5AAz70t7LQZLUvwd7cr5bZ9onpUpc2HjxtgphM4amIrw4wFMSblIkkVnp0kA9JYxal8k4q21uhCtsIKTQkGl7RR7Zl9ikO6PeLC/99QqparzRxHnQ2cvQ9qpZu6dwkEnqoAz8ThlqyH5gydQAliLEvpg=
Hello Pete, Just by looking at the dashboard you shared it is impossible to identify your system. The failures are all over the place. From the messages applied I can only
deduce that your host is: “perfsonar-1850.frgp.net”. Having so many failures
on the dashboard means, that there are problems not only on your system, but there are problems on the remote systems as well. Meaning you most likely won’t be able to deal with all the problems you manage to diagnose. The messages you applied signify problems on coordination of tasks from the remote side towards your host. From the mesh dashboard it appears you have lots of
different problems. Immediately from your logs it is apparent that a remote system does not allow your system to negotiate a test. Meaning your system is identified as a “non-friendly
host” and cannot even submit a request for a test with “ps-svl-10g.cenic.net”.
The resolution for this failure requires a discussion with the remote admin, so that his Limits config allows your system inquiries. “2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(ps-svl-10g.cenic.net->perfsonar-1850.frgp.net),
continuing with rest of config: 403 FORBIDDEN: Task forbidden by limits: Hints: requester: 129.19.165.2 server: 137.164.28.121 Identified as everybody Classified as default Application: Defaults applied to non-friendly hosts Group 1: Limit 'innocuous-tests' failed: Test type not in list Group 1: Want any, 0/1 passed, 1/1 failed: FAIL Group 1: Failed; stopping here. Application FAILS Proposal does not meet limits ” For the rest of the test failures I would suggest to verify initial communication with make a test with each remote system through pS (pScheduler) not the direct
measurement tool: ·
pscheduler troubleshoot ·
pscheduler troubleshoot --host <host> ·
pscheduler task --debug trace --dest perfsonar-1850.frgp.net --source <host> The steps above will not resolve the problems on the dashboard, but you have so many of issues, which require us to establish a solid base. On top of these review the /var/log/pscheduler/pscheduler.log
There you will find information on service failures or communication problems, which need to be addressed in order to restore operation of your system Regards, Ivan Garnizov GEANT SA1T2: pS deployments GN Operations GEANT SA2T3: pS development team GEANT SA3T5: eduPERT team Jubiläumsjahr 2018 - IT in Bewegung Das RRZE - der IT-Dienstleister der FAU Von: [mailto:]
Im Auftrag von Pete Siemsen Ever since I upgraded to 4.0.2, my (the FRGP's) participation in the Quilt mesh at
http://quiltmesh.onenet.net/maddash-webui/ has been problematic. My whole row is orange. Embarrassing. I've appended the last 25 lines from /var/log/perfsonar/meshconfig-agent.log file. These messages
mystify me because I can ping and/or traceroute to most of the hosts that appear in these error messages, like these perfsonar-1850$ tail --lines=25 meshconfig-agent.log 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(ps-svl-10g.cenic.net->perfsonar-1850.frgp.net),
continuing with rest of config: 403 FORBIDDEN: Task forbidden by limits: Hints: requester: 129.19.165.2 server: 137.164.28.121 Identified as everybody Classified as default Application: Defaults applied to non-friendly hosts Group 1: Limit 'innocuous-tests' failed: Test type not in list Group 1: Want any, 0/1 passed, 1/1 failed: FAIL Group 1: Failed; stopping here. Application FAILS Proposal does not meet limits 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(ps-grand-bw.perfsonar.kanren.net->perfsonar-1850.frgp.net),
continuing with rest of config: 500 timeout: timeout 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(ps-grand-lt.perfsonar.kanren.net->perfsonar-1850.frgp.net),
continuing with rest of config: 500 timeout: timeout 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(nmon-aa.mich.net->perfsonar-1850.frgp.net), continuing
with rest of config: 500 timeout: timeout 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(kc-core-psr.mo.more.net->perfsonar-1850.frgp.net),
continuing with rest of config: 500 timeout: timeout 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(latency.eugn-perfsonar.nero.net->perfsonar-1850.frgp.net),
continuing with rest of config: 500 timeout: timeout 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(bandwidth.eugn-perfsonar.nero.net->perfsonar-1850.frgp.net),
continuing with rest of config: 500 timeout: timeout 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(perfsonar-1850.frgp.net->web100.pnw-gigapop.net),
continuing with rest of config: 500 INTERNAL SERVER ERROR: Unable to determine participants: Process took too long to run. 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(web100.pnw-gigapop.net->perfsonar-1850.frgp.net),
continuing with rest of config: 500 INTERNAL SERVER ERROR: Unable to determine participants: Process took too long to run. 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(perfsonar.unl.edu->perfsonar-1850.frgp.net), continuing
with rest of config: 500 timeout: timeout 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(perfsonar-msn.wiscnet.net->perfsonar-1850.frgp.net),
continuing with rest of config: 500 timeout: timeout 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(perfsonar-1850.frgp.net->noctuidae.cns.vt.edu), continuing
with rest of config: 500 INTERNAL SERVER ERROR: Unable to determine participants: Process took too long to run. 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(noctuidae.cns.vt.edu->perfsonar-1850.frgp.net),
continuing with rest of config: 500 INTERNAL SERVER ERROR: Unable to determine participants: Process took too long to run. 2018/08/13 18:06:47 (26630) WARN> perfsonar_meshconfig_agent:430 main:: - Problem adding test trace(psonar.arc.vt.edu->perfsonar-1850.frgp.net), continuing
with rest of config: 500 timeout: timeout Any clue appreciated :-) -- Pete |
- [perfsonar-user] meshconfig problems, Pete Siemsen, 08/14/2018
- AW: [perfsonar-user] meshconfig problems, Garnizov, Ivan (RRZE), 08/14/2018
- Re: [perfsonar-user] meshconfig problems, Michael Johnson, 08/14/2018
- Re: [perfsonar-user] meshconfig problems, Casey Russell, 08/14/2018
- Re: [perfsonar-user] meshconfig problems, Michael Johnson, 08/14/2018
Archive powered by MHonArc 2.6.19.