perfsonar-user - [perfsonar-user] Various issues installing PS at Amazon
Subject: perfSONAR User Q&A and Other Discussion
- From: Jim Nauer CWRU <>
- Subject: [perfsonar-user] Various issues installing PS at Amazon
- Date: Wed, 23 Aug 2017 13:52:52 -0400
We have attempted to set up a simple PS node at Amazon in order to do some latency testing with history/trend graphing, but it is not going well.
--
First of all, the documentation does not appear to match reality when installing from the "bundle" RPMs instead of the ISO.
The installation consisted of deploying the official CentOS 7 Server AMI, installing the EPEL & perfSONAR repos, then:
yum install perfsonar-toolkit
yum install perfsonar-toolkit-systemenv-testpoint
yum install perfsonar-toolkit-security
I then confirmed NTP config, ran the sysctl & firewall configs, and confirmed that SELinux is Disabled.
Finally, I ran nptoolkit-configure.py to configure the first user account. That script successfully created a web user account, but completely skipped the prompt to enable SSH login or sudo access.
Next, I tried the 'root' login step, which can't be done on a default AWS / CentOS 7 image, so I just ran 'sudo su -' from the 'centos' account. That prompted me to set up a sudo user (it skipped the "Website Administrator", since one had already been created) and to disable root login (which was already done by the base CentOS image).
After all of that, I was able to log in to the web GUI, but it has never quite worked right. On most screens, a "Loading..." spinner will appear but never go away until I click on the page - then it just disappears, without actually loading e.g. the "services" list on the home page (both "public" and "logged in" versions).
I had been able to define a couple of tests - ping and latency to an on-campus location, but none of them appear to run correctly, and I can't get the 'tests' page to load fully anymore (that "loading..." spinner thing again).
The "latency" test does collect & show traceroute data and a correct latency value from the ping/traceroute data, but also reports "100% loss ... (owamp) [powstream]".
Digging into the log files, I found that 'owampd' is trying to bind to the _outside_ Amazon Elastic IP [1:1 NAT] for the system, not its own "inside" (RFC-1918) IP address. I found a note on this list from last year indicating that 'bwctl' had (has?) the same issue, and that it might be fixed "in the next version" - but that was a year ago. Any progress on this front, or are OWAMP/BWCTL just not usable in a NAT/AWS environment?
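The bind failure described above can be reproduced outside of owampd. The following is a minimal sketch, not perfSONAR code: it assumes the Elastic IP is not configured on any local interface (typical of 1:1 NAT), and the function name `can_bind` and both sample addresses are hypothetical. It uses a TCP socket because OWAMP-Control runs over TCP, and port 0 so that no privileges are needed.

```python
import errno
import socket


def can_bind(address, port=0):
    """Try to bind a TCP socket to (address, port).

    Returns (True, None) on success, or (False, errno_name) on failure.
    Port 0 lets the kernel pick any free port, so root is not required.
    """
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        sock.bind((address, port))
        return True, None
    except OSError as exc:
        # Map the numeric errno to its symbolic name, e.g. EADDRNOTAVAIL.
        return False, errno.errorcode.get(exc.errno, str(exc.errno))
    finally:
        sock.close()


if __name__ == "__main__":
    # Hypothetical addresses: 127.0.0.1 stands in for an address the host
    # actually owns; 203.0.113.10 (documentation range) stands in for a
    # NAT'd public address that is not assigned to any local interface.
    for addr in ("127.0.0.1", "203.0.113.10"):
        print(addr, can_bind(addr))
```

On a host behind 1:1 NAT, running this with the real private and public addresses should show whether the public address is bindable at all; if it is not, any daemon told to listen on it will fail the same way.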
OR...did I configure the system wrong? After I found the bind-to-the-wrong-IP log message, I tried fixing things by setting up valid forward & reverse DNS for the "inside" IP (172.21.*), plus a DNS view that returns the outside IP (AWS Elastic IP) only to hosts querying from outside the AWS subnets. I then restarted perfSONAR, but that has made things worse. Now the web GUI won't show any of the "service" status information at all (where it did before). I also cannot find any config files (in /etc/{perfsonar,owamp-server,pscheduler}/*) with an IP address or the local host name in them, so I'm not sure where to look to get things back to the less-broken state (service status is displayed and tests can be configured).
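A forward/reverse DNS setup like the one described above can be sanity-checked from the node itself. This is a rough sketch using only the Python standard library; the function name `forward_reverse_consistent` and the `forward`/`reverse` parameters are hypothetical, added so the resolver calls can be swapped out. It reports, for each address the host name resolves to, whether the reverse lookup leads back to the same name. It says nothing about how perfSONAR itself picks its address.

```python
import socket


def forward_reverse_consistent(hostname,
                               forward=socket.gethostbyname_ex,
                               reverse=socket.gethostbyaddr):
    """Map each forward-resolved address to True if its reverse names
    include the original hostname, else False."""
    _, _, addresses = forward(hostname)
    wanted = hostname.rstrip(".").lower()
    results = {}
    for addr in addresses:
        try:
            name, aliases, _ = reverse(addr)
        except OSError:
            # socket.herror / socket.gaierror are OSError subclasses:
            # treat a failed reverse lookup as inconsistent.
            results[addr] = False
            continue
        names = {n.rstrip(".").lower() for n in [name, *aliases]}
        results[addr] = wanted in names
    return results


if __name__ == "__main__":
    # Uses the local host name as seen by the resolver on this machine.
    print(forward_reverse_consistent(socket.getfqdn()))
```

Run from inside the AWS subnet, the result should list only the "inside" address if the DNS view is working as intended; run from outside, it should list only the Elastic IP.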
Any helpful hints? Or should I just nuke & re-install with the current DNS config, and live with a PS node that can record & graph 'ping' results but not much else?
James A. Nauer | "I shall not yield one whit of maturity,
Engineer III, Enterprise Systems | not grace, not respectability, to the
University Technology, [U]Tech | passing of time. I declare that I shall
Case Western Reserve University | forever be, if not a child, certainly
(216) 368-MACS (368-6227) | childish" --Kennet Shardik
USPA D-25604
- [perfsonar-user] Various issues installing PS at Amazon, Jim Nauer CWRU, 08/23/2017
- Re: [perfsonar-user] Various issues installing PS at Amazon, Andrew Lake, 08/24/2017