
perfsonar-user - [perfsonar-user] Test scheduling behavior post 4.4.0 upgrade

Subject: perfSONAR User Q&A and Other Discussion

  • From: "Uhl, George D. (GSFC-423.0)[Arctic Slope Technical Services, Inc.]"
  • Cc: "Jackson, Wayne P. (GSFC-590.0)[Arctic Slope Technical Services, Inc.]"
  • Subject: [perfsonar-user] Test scheduling behavior post 4.4.0 upgrade
  • Date: Thu, 15 Jul 2021 19:43:46 +0000

Hi,

 

A number of my tests began to fail after I upgraded my pS testnode software from 4.3.4 to 4.4.0 on the night of July 8/9. Something fairly dramatic appears to have changed in test scheduling: outbound tests fail, while inbound tests run on a haphazard schedule, sometimes only a few minutes apart. The graphs below show results for tests that are scheduled to run on a 2-hour cycle. These tests are generated through my psconfig test mesh. There are several instances of this in my mesh, and one commonality is that the remote perfSONAR servers are designated as no-agent.

 

One week’s throughput test results prior to upgrade:

[Image: line chart]

 

One week’s throughput test results post upgrade:

[Image: scatter chart]

 

Latest one day’s worth of test results:

[Image: line chart]

 

Running pscheduler troubleshoot against the remote host reports no issues:

 

$ pscheduler troubleshoot 198.124.238.154
Performing basic troubleshooting of localhost and 198.124.238.154.

localhost:

  Measuring MTU... 65535 (Local)
  Looking for pScheduler... OK.
  Fetching API level... 5
  Checking clock... OK.
  Exercising API... Archivers... Clock... Contexts... Tests... Tools... OK.
  Fetching service status... OK.
  Checking services... Ticker... Scheduler... Runner... Archiver... OK.
  Checking limits... OK.
  Idle test.... 9 seconds.... Checking archiving... OK.

xxx.xxx.xxx.xxx:

  Measuring MTU... 1500+
  Looking for pScheduler... OK.
  Fetching API level... 5
  Checking clock... OK.
  Exercising API... Archivers... Clock... Contexts... Tests... Tools... OK.
  Fetching service status... OK.
  Checking services... Ticker... Scheduler... Runner... Archiver... OK.
  Checking limits... OK.
  Idle test.... 5 seconds.... Checking archiving... OK.

localhost and xxx.xxx.xxx.xxx:

  Checking IP addresses... IPv4
  Measuring MTU... 1500+
  Checking timekeeping... OK.
  Simple stream test.... 13 seconds.... OK.

 

pScheduler on both hosts appears to be functioning normally.

 

When I run tests between the same two servers from the command line with pscheduler, they fail. Every so often I get a successful test using the CLI, but it’s rare and inconsistent. They usually fail with the following errors:

Gave up after too many scheduling conflicts.

Run not found; task may have been canceled.
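
To put a number on "rare and inconsistent", the CLI test could be repeated and the failures counted. The following is only a minimal sketch: count_failures is a hypothetical helper (not part of pScheduler), and it assumes that the pscheduler command exits non-zero when a run fails, which should be verified before relying on the count.

```shell
# Run a command N times and report how many runs exited non-zero.
# count_failures is a hypothetical helper, not part of pScheduler.
count_failures() {
    runs=$1
    shift
    failures=0
    i=0
    while [ "$i" -lt "$runs" ]; do
        # Discard the command's output; only its exit status is counted.
        if ! "$@" >/dev/null 2>&1; then
            failures=$((failures + 1))
        fi
        i=$((i + 1))
    done
    echo "$failures/$runs runs failed"
}

# Example (assumes pscheduler is installed; replace the placeholder host):
# count_failures 10 pscheduler task throughput --dest xxx.xxx.xxx.xxx
```

Running the same loop before and after a configuration change would give a rough failure rate to compare, rather than relying on individual attempts.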

 

Thanks,

George Uhl

NASA GSFC

 

From: <> on behalf of Szymon Trocha <>
Organization: PCSS
Reply-To: Szymon Trocha <>
Date: Thursday, July 15, 2021 at 10:44 AM
To: Raul Lopes <>
Cc: "" <>
Subject: [EXTERNAL] Re: [perfsonar-user] Node not reporting to Maddash

 

Hi Raul,

 

On 15.07.2021 at 16:03, Raul Lopes wrote:

Hi Szymon,

 

Just to say thank you.

 

All hosts running stable for a week. This link was the solution for the two nasty ones:

 

Perfect! Glad I could help

 

regards,

--

Szymon Trocha
Poznań Supercomputing & Networking Center

General NOC phone +48 61-858-2015 | noc.pcss.pl
Personal desk phone +48 61-858-2022


 



