Hi,
Based on what you sent, I doubt that ntp is the source of your problem since that would at most affect BWCTL and not the other services. What is the output when you run the following:
/etc/init.d/bwctl-server restart /etc/init.d/perfsonar-regulartesting restart
also, what processes show-up in the process list when you do the following:
ps auxw | grep bwctl ps auxw | grep -i regular
Thanks, Andy On June 15, 2016 at 11:06:07 AM, Yao,Rong () wrote:
Greetings, Ivan,
Thank you for reply.
Please see attached screen shot of our toolkit GUI page. The services still have “Not Running” status.
Our ntp package is above 4.1
# yum list ntp
Loaded plugins: fastestmirror, product-id, refresh-packagekit, rhnplugin,
: search-disabled-repos, security, subscription-manager
This system is receiving updates from RHN Classic or RHN Satellite.
Loading mirror speeds from cached hostfile
epel/metalink | 13 kB 00:00
* Internet2: mirror.ancl.hawaii.edu
* epel: ftp.cse.buffalo.edu
Internet2 | 2.9 kB 00:00
Internet2-web100_kernel | 2.9 kB 00:00
centos | 3.7 kB 00:00
epel | 4.3 kB 00:00
rhel-x86_64-server-6 | 1.5 kB 00:00
rhn-tools-rhel-x86_64-server-6 | 1.3 kB 00:00
Installed Packages
ntp.x86_64 4.2.6p5-10.el6.1 @rhel-x86_64-server-6
# rpm -qa | grep ntp
fontpackages-filesystem-1.41-1.1.el6.noarch
perfsonar-toolkit-ntp-3.5.1.3-1.noarch
nagios-plugins-ntp-2.0.3-3.el6.x86_64
ntpdate-4.2.6p5-10.el6.1.x86_64
nagios-plugins-ntp-perl-2.0.3-3.el6.x86_64
ntp-4.2.6p5-10.el6.1.x86_64
Since our servers synchronize within institutional time servers. We have:
# chkconfig --list | grep ntp
ntpd
0:off 1:off
2:off 3:off
4:off 5:off
6:off
ntpdate
0:off 1:off
2:on 3:on
4:on 5:on
6:off
Please advise.
Thanks!
Rong
From: "Garnizov, Ivan (RRZE)" <>
Date: Wednesday, June 15, 2016 at 3:02 AM
To: Rong Yao <>, "" <>
Cc: "Moye,Roger V" <>, "Adams,Andrew M" <>, William J Allen <>
Subject: RE: perfS0NAR Services not running
Hi Rong,
It appears to me that the picture on your toolkit GUI page must have changed ever since your previous report.
Please update us on the status.
Please verify and report the version of the ntp daemon:
yum list ntp. It should be > 4.1
Regards,
Ivan
From: Yao,Rong []
Sent: Dienstag, 14. Juni 2016 20:11
To: Garnizov, Ivan (RRZE);
Cc: Moye,Roger V; Adams,Andrew M; William J Allen
Subject: Re: perfS0NAR Services not running
Thank you for your reply. I and my colleague had trouble shooting for a while.
After I restarted the three services, I did restart several times, the following messages are associated the most recent restart:
Jun 14 11:45:02 r1prpps01 ntpdate[22023]: step time server 10.113.39.40 offset -0.014549 sec
Jun 14 11:52:52 r1prpps01 fail2ban.filter[4260]: INFO [sshd] Found 172.18.38.136
Jun 14 11:56:04 r1prpps01 bwctld[3571]: FILE=bwctld.c, LINE=2751, bwctld: exiting...
Jun 14 11:56:04 r1prpps01 bwctld[3571]: FILE=bwctld.c, LINE=2805, bwctld: exited.
Jun 14 11:56:14 r1prpps01 bwctld[22461]: FILE=time.c, LINE=148, NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration
is good.
Jun 14 11:56:14 r1prpps01 owampd[22463]: NTP: Status UNSYNC (clock offset issues likely)
Jun 14 11:56:14 r1prpps01 owampd[22463]: NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration is good.
Jun 14 11:56:35 r1prpps01 owampd[12594]: FILE=owampd.c, LINE=1848, owampd: exiting...
Jun 14 11:56:35 r1prpps01 owampd[12594]: FILE=owampd.c, LINE=1895, owampd: exited.
Jun 14 11:56:45 r1prpps01 owampd[22504]: FILE=time.c, LINE=112, NTP: Status UNSYNC (clock offset issues likely)
Jun 14 11:56:45 r1prpps01 owampd[22504]: FILE=time.c, LINE=118, NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration
is good.
Jun 14 12:00:01 r1prpps01 root: 12:00:01 up 20:22, 1 user, load average: 0.02, 0.05, 0.01
Jun 14 12:00:02 r1prpps01 ntpdate[23154]: step time server 10.111.39.40 offset -0.014596 sec
Jun 14 12:15:01 r1prpps01 root: 12:15:01 up 20:37, 1 user, load average: 0.00, 0.03, 0.00
Jun 14 12:15:02 r1prpps01 ntpdate[23731]: step time server 10.113.39.40 offset -0.014555 sec
Jun 14 12:29:41 r1prpps01 owampd[24263]: FILE=sapi.c, LINE=303, Connection to ([mdaccps01.mdanderson.edu]:861) from ([mdaccps01.mdanderson.edu]:46111)
Jun 14 12:30:01 r1prpps01 root: 12:30:01 up 20:52, 1 user, load average: 0.12, 0.10, 0.03
Jun 14 12:30:02 r1prpps01 ntpdate[24300]: step time server 10.113.39.40 offset -0.014498 sec
Jun 14 12:31:34 r1prpps01 fail2ban.filter[4260]: INFO [sshd] Found 172.18.38.136
/var/log/perfsonar/owamp_bwctl.log
Jun 14 11:56:04 r1prpps01 bwctld[3571]: FILE=bwctld.c, LINE=2751, bwctld: exiting...
Jun 14 11:56:04 r1prpps01 bwctld[3571]: FILE=bwctld.c, LINE=2805, bwctld: exited.
Jun 14 11:56:14 r1prpps01 bwctld[22461]: FILE=time.c, LINE=148, NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration
is good.
Jun 14 11:56:35 r1prpps01 owampd[12594]: FILE=owampd.c, LINE=1848, owampd: exiting...
Jun 14 11:56:35 r1prpps01 owampd[12594]: FILE=owampd.c, LINE=1895, owampd: exited.
Jun 14 11:56:45 r1prpps01 owampd[22504]: FILE=time.c, LINE=112, NTP: Status UNSYNC (clock offset issues likely)
Jun 14 11:56:45 r1prpps01 owampd[22504]: FILE=time.c, LINE=118, NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration
is good.
Jun 14 12:29:41 r1prpps01 owampd[24263]: FILE=sapi.c, LINE=303, Connection to ([mdaccps01.mdanderson.edu]:861) from ([mdaccps01.mdanderson.edu]:46111)
/var/log/esmond/esmond.log
2016-06-13 15:42:48,838 [INFO] /usr/lib/esmond/esmond/cassandra.py: Schema check done
2016-06-13 15:42:48,843 [INFO] /usr/lib/esmond/esmond/cassandra.py: Connected to ['localhost:9160']
2016-06-13 16:13:13,365 [INFO] /usr/lib/esmond/esmond/cassandra.py: Checking/creating column families
2016-06-13 16:13:13,366 [INFO] /usr/lib/esmond/esmond/cassandra.py: Schema check done
2016-06-13 16:13:13,372 [INFO] /usr/lib/esmond/esmond/cassandra.py: Connected to ['localhost:9160']
/var/log/cassandra/cassandra.log
INFO 11:57:08,273 Logging initialized
INFO 11:57:08,297 Loading settings from file:/etc/cassandra/default.conf/cassandra.yaml
INFO 11:57:08,456 Data files directories: [/var/lib/cassandra/data]
INFO 11:57:08,457 Commit log directory: /var/lib/cassandra/commitlog
INFO 11:57:08,457 DiskAccessMode 'auto' determined to be mmap, indexAccessMode is mmap
INFO 11:57:08,458 disk_failure_policy is stop
INFO 11:57:08,458 commit_failure_policy is stop
INFO 11:57:08,461 Global memtable threshold is enabled at 1956MB
INFO 11:57:08,526 Not using multi-threaded compaction
INFO 11:57:08,655 Loading settings from file:/etc/cassandra/default.conf/cassandra.yaml
INFO 11:57:08,665 Loading settings from file:/etc/cassandra/default.conf/cassandra.yaml
INFO 11:57:08,672 JVM vendor/version: OpenJDK 64-Bit Server VM/1.8.0_91
WARN 11:57:08,672 OpenJDK is not recommended. Please upgrade to the newest Oracle Java release
INFO 11:57:08,672 Heap size: 8207532032/8207532032
( seems no error )
Regarding NTP, our server currently synchronize with institutional time servers, I do not use external time servers.
command /usr/lib/perfsonar/scripts/system_environment/enable_ntpd appears to be outdate? No such
script?
Regarding our server, it’s a RHEL 6 box. We have due network interfaces on it in order to reach by two network systems.
Not sure if this makes perfSONAR confuse at some point.
eth0
eth2
I appreciate very much for your input!
From:
"Garnizov, Ivan (RRZE)" <>
Date: Monday, June 13, 2016 at 3:19 AM
To: Rong Yao <>, "" <>
Cc: "Moye,Roger V" <>, "Adams,Andrew M" <>, William J Allen <>
Subject: RE: perfS0NAR Services not running
Hi Rong,
The web_admin.log is a very high level log for the web interface of the toolkit.
In order to give us a better picture of the current case, could you please restart:
-
sudo service bwctl-server restart
-
sudo service owamp-server restart
-
sudo service cassandra restart
and provide us the messages from the commands above and excerpts of a little while before the restart from:
-
/var/log/messages
-
/var/log/perfsonar/owamp_bwctl.log
-
/var/log/esmond/esmond.log
-
/var/log/cassandra/cassandra.log
Please tell how did you get to this state? Is that a new install or an upgrade? What in your understanding led to this state of the toolkit?
Please note the above commands are not sufficient to restore the full operation of the toolkit.
Regards,
Ivan
From:
[]
On Behalf Of Yao,Rong
Sent: Freitag, 10. Juni 2016 23:23
To:
Cc: Moye,Roger V; Adams,Andrew M; William J Allen
Subject: [perfsonar-user] perfS0NAR Services not running
Services bwctl, regular_test, owamp and esmond in the perfSONAR shown on web interface are “Not Running”. Please see the attached screen shot.
I looked at web_admin.log and saw those error messages:
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for peer_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for iperf_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for iperf3_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for nuttcp_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for thrulay_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for owamp_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for test_ports
…
2016/06/10 15:39:57 (31269) ERROR> Host.pm:342 perfSONAR_PS::NPToolkit::DataService::Host::get_details - Unable to find host record in LS using hostname r1prpps01.mdanderson.edu
I do see those values are defined in the /etc/bwctl-server/bwctl-server.conf
# bwctl control channel
peer_port 6001-6200
# bwctl measurement test ports
test_port 5001-5900
Can anyone please advise what’s wrong here?
Thanks,
Rong
------------------------
Rong Yao
Research IS & Technology Services
University of Texas MD Anderson Cancer Center
Email:
| Tel: (713) 563-2687
The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected
health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution,
or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its
contents from your systems.
The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal
and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein)
is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems.
The information contained in this e-mail message may be
privileged, confidential, and/or protected from disclosure. This e-mail message
may contain protected health information (PHI); dissemination of PHI should
comply with applicable federal and state laws. If you are not the intended recipient,
or an authorized representative of the intended recipient, any further review,
disclosure, use, dissemination, distribution, or copying of this message or any
attachment (or the information contained therein) is strictly prohibited. If
you think that you have received this e-mail message in error, please notify
the sender by return e-mail and delete all references to it and its contents
from your systems.
|