Skip to Content.
Sympa Menu

perfsonar-user - [perfsonar-user] Perfsonar (automatically) upgraded to 5.0 and now broken

Subject: perfSONAR User Q&A and Other Discussion

List archive

[perfsonar-user] Perfsonar (automatically) upgraded to 5.0 and now broken


Chronological Thread 
  • From: Onno Zweers <>
  • To: "" <>
  • Subject: [perfsonar-user] Perfsonar (automatically) upgraded to 5.0 and now broken
  • Date: Tue, 18 Apr 2023 14:36:11 +0000
  • Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=surf.nl; dmarc=pass action=none header.from=surf.nl; dkim=pass header.d=surf.nl; arc=none
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=OnmM5JfKBc4WrWp9eyAL1mhl4g9wvli+sWmZ1XH2/qE=; b=G+fIbU2X/FjVp0GNrztjl8Ezg7FulVE2zo1sQ1alpzmA/lqHQokUvSGPyLMEaRLT4lH69nlSzoherBn6+/MblgdNn57eH8Lq7gk0Ak7kVsr3yi8ctHJjD5C9d8RRQ9Y2fqptiFGn01qp49Db8l4DbKx9jduyo10SvVc8CLEpFA4ugTIwnzYiDtqvqo1hfuonLFute5YIMKaUJ8s3NAy2I7s6YW5Fn0gFnDzOiFNk7MnM6eEiG1m9l8ldfu6yOc1jFNQcozyss8Q+QqEU4mkIMi+cbKVLYjFvLh99+gTwU1XZaELdvDYOeNAZYPa9alxzR6LxQa+wu0FQ/UCEAYEZbQ==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=fi+MpcT8yOpvnGyEUAqakJpP8/brXlzdEVqvkI/GOdoVQn6Zr9TSad/iADjchR/TLJOrl8bsHycPHGur/db9ugP7Drqb3qzXXAar1LhNxX8ItMlKc6sVGz/kOAy9lzsAn/3Rfkw5eFmRC/7RatadoBT/Q7T7B2FEMT36cSDz4mfAnO98PxPdbxePvyJVT5pudxjyqLFk0gzFsr5IZTWSLSxzL63yjm8S7qcFI3/AalM8Ono1eQOOQdikAeTH86l3GcHrQDrcIDbhCvjvKGIjxdFiljRibBMtRpp6NJWxxa86wdgxouOVoAZsfv4fTLJYwyUtJfWCfW//bu79siFmEw==

Hi all,

 

On our perfsonar system, we have an automatic yum update running every Tuesday. This is to make sure that any security issues are patched very soon. We consider this good practice since the box is open to the world.

 

Today our monitoring warned me that /tmp on our perfsonar node was full. After investigation, I discovered the upgraded packages. I had missed the announcement, apologies for that (too busy to read emails haha).

 

So now we have a system that says “Error loading test listing; measurement archive unreachable:

https://perfsonar-bandwidth.grid.surfsara.nl/esmond/perfsonar/archive/

 

I tried some simple troubleshooting steps, like this:

 

[root@perfsonar /var/log/esmond]# pscheduler troubleshoot

Performing basic troubleshooting of perfsonar.grid.surfsara.nl.

 

perfsonar.grid.surfsara.nl:

 

  Measuring MTU... 65535 (Local)

  Looking for pScheduler... OK.

  Fetching API level... 5

  Checking clock... OK.

  Exercising API... Archivers... Contexts... Tests... Tools... OK.

  Fetching service status... OK.

  Checking services... Ticker... Scheduler... Runner... Archiver... OK.

  Checking limits... Failed.

 

Limit processor is not initialized: Limit 'idleex-default' has unsupported type 'test'

 

And I found a reference to perfsonar-pscheduler-limit-checker-for-5.0 which I tried:

 

[root@perfsonar /tmp/check/perfsonar-pscheduler-limit-checker-for-5.0-c8d6f4d]# /usr/bin/env python3 check-limits

NOTE: This limit configration could not be fully validated on a perfSONAR 5.0.0  system.  This program will still attempt to find problems related to the removal of

the 'test' limit.

 

ACTION REQUIRED.

 

This limit system configuration contains limits that are deprecated in

the 4.x family and will not be supported in release 5.0.

 

Limit 'idleex-default':

 

This limit has been removed in 5.0.  The idleex test is now disallowed for untrusted hosts by the 'allowed-tests' limit.

 

 

Limit 'throughput-default-time':

 

This limit has been rewritten to be compatible with perfSONAR 5.0.  If yours has not been customized, it can be replaced in its entirety with the following:

    

    {

        "name": "throughput-default-time",

        "description": "Throughput tests of reasonable duration",

        "type": "jq",

        "data": {

            "script": [

                "import \"pscheduler/iso8601\" as iso;",

                "if .test.type != \"throughput\"",

                "then true  # Don't care.",

                "else",

          "  if .test.spec.duration != null",

          "  and iso::duration_as_seconds(.test.spec.duration) > 60",

                "  then \"Duration for throughput must be 60 seconds or less.\"",

                "  else true end",

                "end"

            ]

        }

    }

    

Note that any of the modifications described above will be fully-compatible with 4.4.x  Getting a revised version tested and into production prior to upgrading to

5.0 is strongly-recommended.

 

It says “replace” but I can’t seem to find which file or config I need to replace.

 

I also noptice that opensearch was installed together with perfsonar 5.0. But it failes to start.

 

[root@perfsonar ~]# systemctl status opensearch.service

opensearch.service - OpenSearch

   Loaded: loaded (/usr/lib/systemd/system/opensearch.service; enabled; vendor preset: disabled)

   Active: failed (Result: exit-code) since Tue 2023-04-18 16:31:43 CEST; 4s ago

     Docs: https://opensearch.org/

  Process: 88115 ExecStart=/usr/share/opensearch/bin/systemd-entrypoint -p ${PID_DIR}/opensearch.pid --quiet (code=exited, status=1/FAILURE)

 Main PID: 88115 (code=exited, status=1/FAILURE)

 

Apr 18 16:31:40 perfsonar.grid.surfsara.nl systemd-entrypoint[88115]: at org.opensearch.bootstrap.OpenSearch.execute(OpenSearch.java:171)

Apr 18 16:31:40 perfsonar.grid.surfsara.nl systemd-entrypoint[88115]: at org.opensearch.cli.EnvironmentAwareCommand.execute(EnvironmentAwareCommand.java:104)

Apr 18 16:31:40 perfsonar.grid.surfsara.nl systemd-entrypoint[88115]: at org.opensearch.cli.Command.mainWithoutErrorHandling(Command.java:138)

Apr 18 16:31:40 perfsonar.grid.surfsara.nl systemd-entrypoint[88115]: at org.opensearch.cli.Command.main(Command.java:101)

Apr 18 16:31:40 perfsonar.grid.surfsara.nl systemd-entrypoint[88115]: at org.opensearch.bootstrap.OpenSearch.main(OpenSearch.java:137)

Apr 18 16:31:40 perfsonar.grid.surfsara.nl systemd-entrypoint[88115]: at org.opensearch.bootstrap.OpenSearch.main(OpenSearch.java:103)

Apr 18 16:31:43 perfsonar.grid.surfsara.nl systemd[1]: opensearch.service: main process exited, code=exited, status=1/FAILURE

Apr 18 16:31:43 perfsonar.grid.surfsara.nl systemd[1]: Failed to start OpenSearch.

Apr 18 16:31:43 perfsonar.grid.surfsara.nl systemd[1]: Unit opensearch.service entered failed state.

Apr 18 16:31:43 perfsonar.grid.surfsara.nl systemd[1]: opensearch.service failed.

 

 

I’m not sure what to do now. Any suggestions?

 

Cheers,

Onno




Archive powered by MHonArc 2.6.24.

Top of Page