perfsonar-user - [perfsonar-user] Re: perfS0NAR Services not running
Subject: perfSONAR User Q&A and Other Discussion
List archive
- From: "Yao,Rong" <>
- To: "Garnizov, Ivan (RRZE)" <>, "" <>
- Cc: "Moye,Roger V" <>, "Adams,Andrew M" <>, William J Allen <>
- Subject: [perfsonar-user] Re: perfS0NAR Services not running
- Date: Wed, 15 Jun 2016 15:03:32 +0000
- Accept-language: en-US
- Ironport-phdr: 9a23:SfWI8R/KOD51n/9uRHKM819IXTAuvvDOBiVQ1KB8 1+0cTK2v8tzYMVDF4r011RmSDdSdu6wP27OempujcFJDyK7JiGoFfp1IWk1Nou QttCtkPvS4D1bmJuXhdS0wEZcKflZk+3amLRodQ56mNBXsq3G/pQQfBg/4fVIs YL+lS8iC0Y/njqibwN76XUZhvHKFe7R8LRG7/036l/I9ps9cEJs30QbDuXBSeu 5blitCLFOXmAvgtI/rpMYwuwwZgf8q9tZBXKPmZOx4COUAVHV1e1wysYfEvAPF TBmI+D9Uc3sflFIIOTL3wVCwFsPwrCL88OV0wi+bL8roZb49RHKu4vEvAFXwhT 0JLDk/+XuSl9d9lopapg6svRpy39SSbY2IfrIqZq7Hc8gdQ2NbG9tKWjZpA4Wg YpEJAvZbe+tUst+uiUEJqE70NACrQNnozDACzivM7KQ9++EtGAfK0RY7GNRIt2 7b+oamfJwOWPy4mfGbhQ7IaOlbjHKksNDF
Greetings, Ivan,
Thank you for reply.
Please see attached screen shot of our toolkit GUI page. The services still have “Not Running” status.
Our ntp package is above 4.1
# yum list ntp Loaded plugins: fastestmirror, product-id, refresh-packagekit, rhnplugin, : search-disabled-repos, security, subscription-manager This system is receiving updates from RHN Classic or RHN Satellite. Loading mirror speeds from cached hostfile epel/metalink | 13 kB 00:00 * Internet2: mirror.ancl.hawaii.edu * epel: ftp.cse.buffalo.edu Internet2 | 2.9 kB 00:00 Internet2-web100_kernel | 2.9 kB 00:00 centos | 3.7 kB 00:00 epel | 4.3 kB 00:00 rhel-x86_64-server-6 | 1.5 kB 00:00 rhn-tools-rhel-x86_64-server-6 | 1.3 kB 00:00 Installed Packages ntp.x86_64 4.2.6p5-10.el6.1 @rhel-x86_64-server-6
# rpm -qa | grep ntp fontpackages-filesystem-1.41-1.1.el6.noarch perfsonar-toolkit-ntp-3.5.1.3-1.noarch nagios-plugins-ntp-2.0.3-3.el6.x86_64 ntpdate-4.2.6p5-10.el6.1.x86_64 nagios-plugins-ntp-perl-2.0.3-3.el6.x86_64 ntp-4.2.6p5-10.el6.1.x86_64
Since our servers synchronize within institutional time servers. We have:
# chkconfig --list | grep ntp ntpd 0:off 1:off 2:off 3:off 4:off 5:off 6:off ntpdate 0:off 1:off 2:on 3:on 4:on 5:on 6:off
Please advise.
Thanks! Rong
From: "Garnizov, Ivan (RRZE)" <>
Date: Wednesday, June 15, 2016 at 3:02 AM To: Rong Yao <>, "" <> Cc: "Moye,Roger V" <>, "Adams,Andrew M" <>, William J Allen <> Subject: RE: perfS0NAR Services not running Hi Rong, It appears to me that the picture on your toolkit GUI page must have changed ever since your previous report. Please update us on the status. Please verify and report the version of the ntp daemon:
yum list ntp. It should be > 4.1 Regards, Ivan From: Yao,Rong []
Greetings, Ivan, Thank you for your reply. I and my colleague had trouble shooting for a while. After I restarted the three services, I did restart several times, the following messages are associated the most recent restart: /var/log/messages Jun 14 11:45:02 r1prpps01 ntpdate[22023]: step time server 10.113.39.40 offset -0.014549 sec Jun 14 11:52:52 r1prpps01 fail2ban.filter[4260]: INFO [sshd] Found 172.18.38.136 Jun 14 11:56:04 r1prpps01 bwctld[3571]: FILE=bwctld.c, LINE=2751, bwctld: exiting... Jun 14 11:56:04 r1prpps01 bwctld[3571]: FILE=bwctld.c, LINE=2805, bwctld: exited. Jun 14 11:56:14 r1prpps01 bwctld[22461]: FILE=time.c, LINE=148, NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration
is good. Jun 14 11:56:14 r1prpps01 owampd[22463]: NTP: Status UNSYNC (clock offset issues likely) Jun 14 11:56:14 r1prpps01 owampd[22463]: NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration is good. Jun 14 11:56:35 r1prpps01 owampd[12594]: FILE=owampd.c, LINE=1848, owampd: exiting... Jun 14 11:56:35 r1prpps01 owampd[12594]: FILE=owampd.c, LINE=1895, owampd: exited. Jun 14 11:56:45 r1prpps01 owampd[22504]: FILE=time.c, LINE=112, NTP: Status UNSYNC (clock offset issues likely) Jun 14 11:56:45 r1prpps01 owampd[22504]: FILE=time.c, LINE=118, NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration
is good. Jun 14 12:00:01 r1prpps01 root: 12:00:01 up 20:22, 1 user, load average: 0.02, 0.05, 0.01 Jun 14 12:00:02 r1prpps01 ntpdate[23154]: step time server 10.111.39.40 offset -0.014596 sec Jun 14 12:15:01 r1prpps01 root: 12:15:01 up 20:37, 1 user, load average: 0.00, 0.03, 0.00 Jun 14 12:15:02 r1prpps01 ntpdate[23731]: step time server 10.113.39.40 offset -0.014555 sec Jun 14 12:29:41 r1prpps01 owampd[24263]: FILE=sapi.c, LINE=303, Connection to ([mdaccps01.mdanderson.edu]:861) from ([mdaccps01.mdanderson.edu]:46111) Jun 14 12:30:01 r1prpps01 root: 12:30:01 up 20:52, 1 user, load average: 0.12, 0.10, 0.03 Jun 14 12:30:02 r1prpps01 ntpdate[24300]: step time server 10.113.39.40 offset -0.014498 sec Jun 14 12:31:34 r1prpps01 fail2ban.filter[4260]: INFO [sshd] Found 172.18.38.136 /var/log/perfsonar/owamp_bwctl.log Jun 14 11:56:04 r1prpps01 bwctld[3571]: FILE=bwctld.c, LINE=2751, bwctld: exiting... Jun 14 11:56:04 r1prpps01 bwctld[3571]: FILE=bwctld.c, LINE=2805, bwctld: exited. Jun 14 11:56:14 r1prpps01 bwctld[22461]: FILE=time.c, LINE=148, NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration
is good. Jun 14 11:56:35 r1prpps01 owampd[12594]: FILE=owampd.c, LINE=1848, owampd: exiting... Jun 14 11:56:35 r1prpps01 owampd[12594]: FILE=owampd.c, LINE=1895, owampd: exited. Jun 14 11:56:45 r1prpps01 owampd[22504]: FILE=time.c, LINE=112, NTP: Status UNSYNC (clock offset issues likely) Jun 14 11:56:45 r1prpps01 owampd[22504]: FILE=time.c, LINE=118, NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration
is good. Jun 14 12:29:41 r1prpps01 owampd[24263]: FILE=sapi.c, LINE=303, Connection to ([mdaccps01.mdanderson.edu]:861) from ([mdaccps01.mdanderson.edu]:46111) /var/log/esmond/esmond.log 2016-06-13 15:42:48,838 [INFO] /usr/lib/esmond/esmond/cassandra.py: Schema check done 2016-06-13 15:42:48,843 [INFO] /usr/lib/esmond/esmond/cassandra.py: Connected to ['localhost:9160'] 2016-06-13 16:13:13,365 [INFO] /usr/lib/esmond/esmond/cassandra.py: Checking/creating column families 2016-06-13 16:13:13,366 [INFO] /usr/lib/esmond/esmond/cassandra.py: Schema check done 2016-06-13 16:13:13,372 [INFO] /usr/lib/esmond/esmond/cassandra.py: Connected to ['localhost:9160'] /var/log/cassandra/cassandra.log INFO 11:57:08,273 Logging initialized INFO 11:57:08,297 Loading settings from file:/etc/cassandra/default.conf/cassandra.yaml INFO 11:57:08,456 Data files directories: [/var/lib/cassandra/data] INFO 11:57:08,457 Commit log directory: /var/lib/cassandra/commitlog INFO 11:57:08,457 DiskAccessMode 'auto' determined to be mmap, indexAccessMode is mmap INFO 11:57:08,458 disk_failure_policy is stop INFO 11:57:08,458 commit_failure_policy is stop INFO 11:57:08,461 Global memtable threshold is enabled at 1956MB INFO 11:57:08,526 Not using multi-threaded compaction INFO 11:57:08,655 Loading settings from file:/etc/cassandra/default.conf/cassandra.yaml INFO 11:57:08,665 Loading settings from file:/etc/cassandra/default.conf/cassandra.yaml INFO 11:57:08,672 JVM vendor/version: OpenJDK 64-Bit Server VM/1.8.0_91 WARN 11:57:08,672 OpenJDK is not recommended. Please upgrade to the newest Oracle Java release INFO 11:57:08,672 Heap size: 8207532032/8207532032 ( seems no error ) Regarding NTP, our server currently synchronize with institutional time servers, I do not use external time servers. As the current documentation : http://docs.perfsonar.net/install_centos.html Step3 command /usr/lib/perfsonar/scripts/system_environment/enable_ntpd appears to be outdate? No such
script? Regarding our server, it’s a RHEL 6 box. We have due network interfaces on it in order to reach by two network systems. Not sure if this makes perfSONAR confuse at some point. eth0
eth2
I appreciate very much for your input! Rong From:
"Garnizov, Ivan (RRZE)" <> Hi Rong, The web_admin.log is a very high level log for the web interface of the toolkit. In order to give us a better picture of the current case, could you please restart: -
sudo service bwctl-server restart -
sudo service owamp-server restart -
sudo service cassandra restart and provide us the messages from the commands above and excerpts of a little while before the restart from: -
/var/log/messages -
/var/log/perfsonar/owamp_bwctl.log -
/var/log/esmond/esmond.log -
/var/log/cassandra/cassandra.log Please tell how did you get to this state? Is that a new install or an upgrade? What in your understanding led to this state of the toolkit? Please note the above commands are not sufficient to restore the full operation of the toolkit. Regards, Ivan From:
[]
On Behalf Of Yao,Rong Greetings, Services bwctl, regular_test, owamp and esmond in the perfSONAR shown on web interface are “Not Running”. Please see the attached screen shot. I looked at web_admin.log and saw those error messages: 2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for peer_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for iperf_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for iperf3_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for nuttcp_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for thrulay_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for owamp_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for test_ports
…
2016/06/10 15:39:57 (31269) ERROR> Host.pm:342 perfSONAR_PS::NPToolkit::DataService::Host::get_details - Unable to find host record in LS using hostname r1prpps01.mdanderson.edu
I do see those values are defined in the /etc/bwctl-server/bwctl-server.conf # bwctl control channel
peer_port 6001-6200
# bwctl measurement test ports
test_port 5001-5900
Can anyone please advise what’s wrong here?
Thanks,
Rong
------------------------ Rong Yao Research IS & Technology Services University of Texas MD Anderson Cancer Center Email:
| Tel: (713) 563-2687
The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected
health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution,
or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its
contents from your systems. The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal
and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein)
is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems. The information contained in this e-mail message may be
privileged, confidential, and/or protected from disclosure. This e-mail message
may contain protected health information (PHI); dissemination of PHI should
comply with applicable federal and state laws. If you are not the intended recipient,
or an authorized representative of the intended recipient, any further review,
disclosure, use, dissemination, distribution, or copying of this message or any
attachment (or the information contained therein) is strictly prohibited. If
you think that you have received this e-mail message in error, please notify
the sender by return e-mail and delete all references to it and its contents
from your systems. |
Attachment:
perfSONAR-GUI-06152016.png
Description: perfSONAR-GUI-06152016.png
- [perfsonar-user] perfS0NAR Services not running, Yao,Rong, 06/10/2016
- [perfsonar-user] RE: perfS0NAR Services not running, Garnizov, Ivan (RRZE), 06/13/2016
- [perfsonar-user] Re: perfS0NAR Services not running, Yao,Rong, 06/14/2016
- [perfsonar-user] RE: perfS0NAR Services not running, Garnizov, Ivan (RRZE), 06/15/2016
- [perfsonar-user] Re: perfS0NAR Services not running, Yao,Rong, 06/15/2016
- Re: [perfsonar-user] Re: perfS0NAR Services not running, Andrew Lake, 06/15/2016
- RE: [perfsonar-user] Re: perfS0NAR Services not running, Liu, Dengfeng (NIH/CIT) [C], 06/15/2016
- Re: [perfsonar-user] Re: perfS0NAR Services not running, Andrew Lake, 06/15/2016
- [perfsonar-user] Re: perfS0NAR Services not running, Yao,Rong, 06/15/2016
- [perfsonar-user] RE: perfS0NAR Services not running, Garnizov, Ivan (RRZE), 06/15/2016
- [perfsonar-user] Re: perfS0NAR Services not running, Yao,Rong, 06/14/2016
- [perfsonar-user] RE: perfS0NAR Services not running, Garnizov, Ivan (RRZE), 06/13/2016
Archive powered by MHonArc 2.6.16.