
perfsonar-user - RE: [perfsonar-user] Re: perfS0NAR Services not running

Subject: perfSONAR User Q&A and Other Discussion

  • From: "Liu, Dengfeng (NIH/CIT) [C]" <>
  • To: Andrew Lake <>, "" <>, "Yao,Rong" <>, "Garnizov, Ivan (RRZE)" <>
  • Cc: "Adams,Andrew M" <>, "Moye,Roger V" <>, William J Allen <>
  • Subject: RE: [perfsonar-user] Re: perfS0NAR Services not running
  • Date: Wed, 15 Jun 2016 20:09:51 +0000

try "ntpdate -u <ntpserver>" and then try "hwclock -w"
Dengfeng

From: [] on behalf of Andrew Lake []
Sent: Wednesday, June 15, 2016 3:53 PM
To: ; Yao,Rong; Garnizov, Ivan (RRZE)
Cc: Adams,Andrew M; Moye,Roger V; William J Allen
Subject: Re: [perfsonar-user] Re: perfS0NAR Services not running

Hi,

Based on what you sent, I doubt that ntp is the source of your problem since that would at most affect BWCTL and not the other services. What is the output when you run the following:

/etc/init.d/bwctl-server restart
/etc/init.d/perfsonar-regulartesting restart

Also, what processes show up in the process list when you run the following:

ps auxw | grep bwctl
ps auxw | grep -i regular

Thanks,
Andy

On June 15, 2016 at 11:06:07 AM, Yao,Rong () wrote:

Greetings, Ivan,

Thank you for your reply.

Please see attached screen shot of our toolkit GUI page. The services still have “Not Running” status.

Our ntp package version is above 4.1:


# yum list ntp

Loaded plugins: fastestmirror, product-id, refresh-packagekit, rhnplugin,

              : search-disabled-repos, security, subscription-manager

This system is receiving updates from RHN Classic or RHN Satellite.

Loading mirror speeds from cached hostfile

epel/metalink                                            |  13 kB     00:00     

 * Internet2: mirror.ancl.hawaii.edu

 * epel: ftp.cse.buffalo.edu

Internet2                                                | 2.9 kB     00:00     

Internet2-web100_kernel                                  | 2.9 kB     00:00     

centos                                                   | 3.7 kB     00:00     

epel                                                     | 4.3 kB     00:00     

rhel-x86_64-server-6                                     | 1.5 kB     00:00     

rhn-tools-rhel-x86_64-server-6                           | 1.3 kB     00:00     

Installed Packages

ntp.x86_64                4.2.6p5-10.el6.1                 @rhel-x86_64-server-6


# rpm -qa | grep ntp

fontpackages-filesystem-1.41-1.1.el6.noarch

perfsonar-toolkit-ntp-3.5.1.3-1.noarch

nagios-plugins-ntp-2.0.3-3.el6.x86_64

ntpdate-4.2.6p5-10.el6.1.x86_64

nagios-plugins-ntp-perl-2.0.3-3.el6.x86_64

ntp-4.2.6p5-10.el6.1.x86_64



Since our servers synchronize with institutional time servers, we have:

# chkconfig --list | grep ntp

ntpd           0:off 1:off 2:off 3:off 4:off 5:off 6:off

ntpdate        0:off 1:off 2:on 3:on 4:on 5:on 6:off
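
The chkconfig listing above shows ntpd off in every runlevel, with only the one-shot ntpdate enabled. A minimal sketch of pulling that answer out of "chkconfig --list" output follows; enabled_runlevels is a hypothetical helper name, and the sample lines are copied from this message.

```shell
# Print the runlevels in which a service is "on", given
# "chkconfig --list" output on stdin. Prints "none" if it is off everywhere.
# enabled_runlevels is a hypothetical helper, not part of perfSONAR.
enabled_runlevels() {
    awk -v svc="$1" '
        $1 == svc {
            out = ""
            for (i = 2; i <= NF; i++) {
                split($i, kv, ":")          # e.g. "3:on" -> kv[1]="3", kv[2]="on"
                if (kv[2] == "on")
                    out = out (out == "" ? "" : " ") kv[1]
            }
            if (out == "") out = "none"
            print out
        }'
}

# Sample lines copied from the chkconfig output in this message.
sample='ntpd           0:off 1:off 2:off 3:off 4:off 5:off 6:off
ntpdate        0:off 1:off 2:on 3:on 4:on 5:on 6:off'

printf '%s\n' "$sample" | enabled_runlevels ntpd      # prints: none
printf '%s\n' "$sample" | enabled_runlevels ntpdate   # prints: 2 3 4 5
```

On a live host the function could be fed directly with "chkconfig --list | enabled_runlevels ntpd". If ntpd turns out to be needed, "chkconfig ntpd on" followed by "service ntpd start" is the usual RHEL 6 way to enable it; whether that clears the "Not Running" status is not established in this thread.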


Please advise. 


Thanks!

Rong  




Hi Rong,

 

It appears to me that the picture on your toolkit GUI page must have changed since your previous report.

Please update us on the status.

 

Please verify and report the version of the ntp daemon: yum list ntp. It should be > 4.1

 

Regards,

Ivan

 

 

From: Yao,Rong [mailto:]
Sent: Tuesday, June 14, 2016 20:11
To: Garnizov, Ivan (RRZE);
Cc: Moye,Roger V; Adams,Andrew M; William J Allen
Subject: Re: perfS0NAR Services not running

 

Greetings, Ivan,

 

Thank you for your reply. My colleague and I have been troubleshooting for a while.

 

I restarted the three services several times; the following messages are associated with the most recent restart:

 

/var/log/messages

Jun 14 11:45:02 r1prpps01 ntpdate[22023]: step time server 10.113.39.40 offset -0.014549 sec

Jun 14 11:52:52 r1prpps01 fail2ban.filter[4260]: INFO [sshd] Found 172.18.38.136

Jun 14 11:56:04 r1prpps01 bwctld[3571]: FILE=bwctld.c, LINE=2751, bwctld: exiting...

Jun 14 11:56:04 r1prpps01 bwctld[3571]: FILE=bwctld.c, LINE=2805, bwctld: exited.

Jun 14 11:56:14 r1prpps01 bwctld[22461]: FILE=time.c, LINE=148, NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration is good.

Jun 14 11:56:14 r1prpps01 owampd[22463]: NTP: Status UNSYNC (clock offset issues likely)

Jun 14 11:56:14 r1prpps01 owampd[22463]: NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration is good.

Jun 14 11:56:35 r1prpps01 owampd[12594]: FILE=owampd.c, LINE=1848, owampd: exiting...

Jun 14 11:56:35 r1prpps01 owampd[12594]: FILE=owampd.c, LINE=1895, owampd: exited.

Jun 14 11:56:45 r1prpps01 owampd[22504]: FILE=time.c, LINE=112, NTP: Status UNSYNC (clock offset issues likely)

Jun 14 11:56:45 r1prpps01 owampd[22504]: FILE=time.c, LINE=118, NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration is good.

Jun 14 12:00:01 r1prpps01 root: 12:00:01 up 20:22, 1 user, load average: 0.02, 0.05, 0.01

Jun 14 12:00:02 r1prpps01 ntpdate[23154]: step time server 10.111.39.40 offset -0.014596 sec

Jun 14 12:15:01 r1prpps01 root: 12:15:01 up 20:37, 1 user, load average: 0.00, 0.03, 0.00

Jun 14 12:15:02 r1prpps01 ntpdate[23731]: step time server 10.113.39.40 offset -0.014555 sec

Jun 14 12:29:41 r1prpps01 owampd[24263]: FILE=sapi.c, LINE=303, Connection to ([mdaccps01.mdanderson.edu]:861) from ([mdaccps01.mdanderson.edu]:46111)

Jun 14 12:30:01 r1prpps01 root: 12:30:01 up 20:52, 1 user, load average: 0.12, 0.10, 0.03

Jun 14 12:30:02 r1prpps01 ntpdate[24300]: step time server 10.113.39.40 offset -0.014498 sec

Jun 14 12:31:34 r1prpps01 fail2ban.filter[4260]: INFO [sshd] Found 172.18.38.136
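
The NTP complaints from bwctld and owampd are easy to lose among the ntpdate and fail2ban lines in the excerpt above. A minimal sketch of counting them per daemon follows; ntp_warning_counts is a hypothetical helper name, and the sample lines are copied from the excerpt.

```shell
# Count syslog lines containing "NTP: " per daemon (bwctld or owampd),
# reading the log from stdin. Prints "daemon count", sorted by daemon name.
# ntp_warning_counts is a hypothetical helper, not part of perfSONAR.
ntp_warning_counts() {
    awk '
        /NTP: / {
            for (i = 1; i <= NF; i++)
                if ($i ~ /^(bwctld|owampd)\[[0-9]+\]:$/) {
                    name = $i
                    sub(/\[.*/, "", name)   # "owampd[22463]:" -> "owampd"
                    count[name]++
                }
        }
        END {
            for (name in count) print name, count[name]
        }' | sort
}

# Sample lines copied from the /var/log/messages excerpt above.
sample='Jun 14 11:56:04 r1prpps01 bwctld[3571]: FILE=bwctld.c, LINE=2751, bwctld: exiting...
Jun 14 11:56:14 r1prpps01 bwctld[22461]: FILE=time.c, LINE=148, NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration is good.
Jun 14 11:56:14 r1prpps01 owampd[22463]: NTP: Status UNSYNC (clock offset issues likely)
Jun 14 11:56:14 r1prpps01 owampd[22463]: NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration is good.
Jun 14 12:00:02 r1prpps01 ntpdate[23154]: step time server 10.111.39.40 offset -0.014596 sec'

printf '%s\n' "$sample" | ntp_warning_counts
```

Against the real log this would be run as "ntp_warning_counts < /var/log/messages". The ntpdate lines are deliberately not counted, since they report a successful step rather than a complaint.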

 

/var/log/perfsonar/owamp_bwctl.log

Jun 14 11:56:04 r1prpps01 bwctld[3571]: FILE=bwctld.c, LINE=2751, bwctld: exiting...

Jun 14 11:56:04 r1prpps01 bwctld[3571]: FILE=bwctld.c, LINE=2805, bwctld: exited.

Jun 14 11:56:14 r1prpps01 bwctld[22461]: FILE=time.c, LINE=148, NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration is good.

Jun 14 11:56:35 r1prpps01 owampd[12594]: FILE=owampd.c, LINE=1848, owampd: exiting...

Jun 14 11:56:35 r1prpps01 owampd[12594]: FILE=owampd.c, LINE=1895, owampd: exited.

Jun 14 11:56:45 r1prpps01 owampd[22504]: FILE=time.c, LINE=112, NTP: Status UNSYNC (clock offset issues likely)

Jun 14 11:56:45 r1prpps01 owampd[22504]: FILE=time.c, LINE=118, NTP: STA_NANO should be set. Make sure ntpd is running, and your NTP configuration is good.

Jun 14 12:29:41 r1prpps01 owampd[24263]: FILE=sapi.c, LINE=303, Connection to ([mdaccps01.mdanderson.edu]:861) from ([mdaccps01.mdanderson.edu]:46111)

 

/var/log/esmond/esmond.log

2016-06-13 15:42:48,838 [INFO] /usr/lib/esmond/esmond/cassandra.py: Schema check done

2016-06-13 15:42:48,843 [INFO] /usr/lib/esmond/esmond/cassandra.py: Connected to ['localhost:9160']

2016-06-13 16:13:13,365 [INFO] /usr/lib/esmond/esmond/cassandra.py: Checking/creating column families

2016-06-13 16:13:13,366 [INFO] /usr/lib/esmond/esmond/cassandra.py: Schema check done

2016-06-13 16:13:13,372 [INFO] /usr/lib/esmond/esmond/cassandra.py: Connected to ['localhost:9160']

 

/var/log/cassandra/cassandra.log

  INFO 11:57:08,273 Logging initialized

 INFO 11:57:08,297 Loading settings from file:/etc/cassandra/default.conf/cassandra.yaml

 INFO 11:57:08,456 Data files directories: [/var/lib/cassandra/data]

 INFO 11:57:08,457 Commit log directory: /var/lib/cassandra/commitlog

 INFO 11:57:08,457 DiskAccessMode 'auto' determined to be mmap, indexAccessMode is mmap

 INFO 11:57:08,458 disk_failure_policy is stop

 INFO 11:57:08,458 commit_failure_policy is stop

 INFO 11:57:08,461 Global memtable threshold is enabled at 1956MB

 INFO 11:57:08,526 Not using multi-threaded compaction

 INFO 11:57:08,655 Loading settings from file:/etc/cassandra/default.conf/cassandra.yaml

 INFO 11:57:08,665 Loading settings from file:/etc/cassandra/default.conf/cassandra.yaml

 INFO 11:57:08,672 JVM vendor/version: OpenJDK 64-Bit Server VM/1.8.0_91

 WARN 11:57:08,672 OpenJDK is not recommended. Please upgrade to the newest Oracle Java release

 INFO 11:57:08,672 Heap size: 8207532032/8207532032

(There seem to be no errors.)

 

 

Regarding NTP, our server currently synchronizes with institutional time servers; I do not use external time servers.

Step 3 of the current documentation (http://docs.perfsonar.net/install_centos.html) gives the command /usr/lib/perfsonar/scripts/system_environment/enable_ntpd, but it appears to be outdated. Is there no such script?

 

Regarding our server, it’s a RHEL 6 box. It has dual network interfaces so that it can be reached from two networks.

I am not sure whether this confuses perfSONAR at some point.

eth0

  • Speed:1000M
  • ipv4 addresses:10.113.113.62
  • MAC address:3C:A8:2A:14:72:50
  • mtu:1500

eth2

  • Speed:1000M
  • ipv4 addresses:10.113.117.13   
  • MAC address:3C:A8:2A:14:72:52
  • mtu:9000

 

I very much appreciate your input!

 

Rong

 

 

Hi Rong,

 

The web_admin.log is a very high level log for the web interface of the toolkit.

In order to give us a better picture of the current case, could you please restart:

-          sudo service bwctl-server restart

-          sudo service owamp-server restart

-          sudo service cassandra restart

 

and provide us with the messages from the commands above, plus excerpts starting shortly before the restart from:

-          /var/log/messages

-          /var/log/perfsonar/owamp_bwctl.log

-          /var/log/esmond/esmond.log

-          /var/log/cassandra/cassandra.log

 

Please tell us how you got to this state. Is this a new install or an upgrade? What, in your understanding, led to this state of the toolkit?

 

Please note the above commands are not sufficient to restore the full operation of the toolkit.

 

Regards,

Ivan

 

 

Greetings, 

 

The bwctl, regular_test, owamp, and esmond services are shown as “Not Running” on the perfSONAR web interface. Please see the attached screenshot.

I looked at web_admin.log and saw these error messages:

 

2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for peer_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for iperf_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for iperf3_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for nuttcp_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for thrulay_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for owamp_ports
2016/06/10 15:39:55 (31270) ERROR> BWCTL.pm:646 perfSONAR_PS::NPToolkit::Config::BWCTL::get_port_range - No port range for test_ports
2016/06/10 15:39:57 (31269) ERROR> Host.pm:342 perfSONAR_PS::NPToolkit::DataService::Host::get_details - Unable to find host record in LS using hostname r1prpps01.mdanderson.edu
I do see that those values are defined in /etc/bwctl-server/bwctl-server.conf:
# bwctl control channel
peer_port       6001-6200
# bwctl measurement test ports
test_port       5001-5900
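
One thing visible in the text above: the error messages name keys such as peer_ports and test_ports (plural), while the config excerpt uses peer_port and test_port (singular). Whether that difference matters is not established in this thread. A minimal sketch of listing the port ranges actually present in the file follows; port_ranges is a hypothetical helper name, and the sample is the excerpt above.

```shell
# Print "key low high" for each uncommented setting whose name ends in
# _port or _ports and whose value is a low-high range. Reads config from stdin.
# port_ranges is a hypothetical helper, not part of bwctl or perfSONAR.
port_ranges() {
    awk '
        /^[ \t]*#/ { next }                          # skip comment lines
        $1 ~ /_ports?$/ && $2 ~ /^[0-9]+-[0-9]+$/ {
            split($2, r, "-")
            print $1, r[1], r[2]
        }'
}

# Sample copied from the bwctl-server.conf excerpt above.
sample='# bwctl control channel
peer_port       6001-6200
# bwctl measurement test ports
test_port       5001-5900'

printf '%s\n' "$sample" | port_ranges
# prints:
# peer_port 6001 6200
# test_port 5001 5900
```

Against the real file this would be run as "port_ranges < /etc/bwctl-server/bwctl-server.conf", which makes it easy to compare the keys defined there with the keys named in the web_admin.log errors.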
Can anyone please advise what’s wrong here? 
Thanks,
Rong

------------------------

Rong Yao

Research IS & Technology Services

University of Texas MD Anderson Cancer Center

Email: Tel: (713) 563-2687

 
 

The information contained in this e-mail message may be privileged, confidential, and/or protected from disclosure. This e-mail message may contain protected health information (PHI); dissemination of PHI should comply with applicable federal and state laws. If you are not the intended recipient, or an authorized representative of the intended recipient, any further review, disclosure, use, dissemination, distribution, or copying of this message or any attachment (or the information contained therein) is strictly prohibited. If you think that you have received this e-mail message in error, please notify the sender by return e-mail and delete all references to it and its contents from your systems.




