Skip to Content.
Sympa Menu

perfsonar-user - Re: [perfsonar-user] pscheduler problems on Rocky 9 after reboot

Subject: perfSONAR User Q&A and Other Discussion

List archive

Re: [perfsonar-user] pscheduler problems on Rocky 9 after reboot


Chronological Thread 
  • From: Otto J Wittner <>
  • To: "Contardo, Gianni Carlo" <>, Andrew Gallo <>, Mark Feit <>, "" <>
  • Subject: Re: [perfsonar-user] pscheduler problems on Rocky 9 after reboot
  • Date: Thu, 25 Jan 2024 21:26:41 +0100
  • Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=sikt.no; dmarc=pass action=none header.from=sikt.no; dkim=pass header.d=sikt.no; arc=none
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=G5o/jtkjUSb5Vr/I3yj5vLL4TeJe3A/Yg603D17BneY=; b=KuyTLOgzjVzKmS3D5T4CKDqxRohy3XjRf9OVbiS3bLRrpaSliuxraR97GF5cuFkSsAEGPw5HyaG3c6itLHs9WOZcV1+5U1iuJjxSLEbc6WCg7Kb42TzKY9WPCabmT7U1Q4e4SWoOVxqcpbgk+7BaEFEBS2BD6HhC1iXPoWu55OLmDNTehlQul97B60ALVtQQInyrgDoPOEJMrRI8fpOIx5bJpvGO00sizjXasy9HzuEX1wrxZjtj+u2CGUyTWc89h1fswY5qjn/bIuttTyE+IrcdjDIq4UdIbcVFvvNMQ9K6ZmlW9UL3MqFz4uinI8YcU3sCILhAMbLoaEomvufWlQ==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=nnHmdk1CSXf+iyHN+7hXOdFIgNpJfCr0L0GmGUezr6lFBCjjJhDCbrC0EhcG76aHN/ZSGEvhfKlv+k+xRG+D6EmkVc9la2PT3hP08/4CyKOFlAxZZw+Yc+6+wN1Dhfzaw34ohS/bjyLnSNfJ9TrV4v+Dj95/5sd0sS4s7GNqFX+YwOLU0OEafipw2TY3ZRR9BTOdNAOcg2Iv3GlMaVvvSVqnbSkUPTBuUbxbKtD3Stl0o0RnXbomdI9cH+ySiyGXF3+jzMRiYGJcRmF+lOrauzknZulK8JB2/jNUSMfra1y6HQrfwHOEliK+OBMIp0wgGmiEQB3nbMin/dz0No3Pag==

Hi,

I have a similar issue in my container base test environment running almalinux 9 nodes. "start request repeated too quickly" is reported by systemctl status pscheduler*.service.

Adding StartLimitIntervalSec=30 and StartLimitBurst=1000 after the Restart=always statement in the pscheduler*.service descriptions seem to have solved the issue.

I assumed this was a container "thing", but perhaps it is more general...

O2

On 25.01.2024 18:59, "Contardo, Gianni Carlo" (via perfsonar-user Mailing List) wrote:
Since we upgraded to release 5.0.7 we've had issues. We are running on EL 7.9 and
things were working fine for months. The only recent changes have been the upgrade and
a reboot. Things were fine until we rebooted and then we noticed that pscheduler shows
up as "Not Running" in the toolkit web page.

Here's what shows up when I query for failed services:


$ systemctl --failed
UNIT LOAD ACTIVE SUB DESCRIPTION
● lm_sensors.service loaded failed failed Initialize hardware
monitoring sensors
● pscheduler-archiver.service loaded failed failed pScheduler server -
archiver
● pscheduler-runner.service loaded failed failed pScheduler server - runner
● pscheduler-ticker.service loaded failed failed pScheduler server - ticker

We manually started the services and then pscheduler showed up as "Running" in
the toolkit web page. Everything appears to be in the “Running” status on the toolkit web
page. Pscheduler monitor shows tasks in the Finished, Running, & Pending states, but
the MaDDash dashboards, which were previously working fine, show “Unable to find test data.”

Any guidance on how to get this working would be appreciated. Thank you.


Gianni

-----Original Message-----
From:
<> On Behalf Of Andrew Gallo
Sent: Monday, January 15, 2024 7:40 PM
To: Mark Feit <>;
Subject: Re: [perfsonar-user] pscheduler problems on Rocky 9 after reboot



On 1/11/2024 2:35 PM, Mark Feit wrote:
systemctl status postgresql



Here's the reported status pof postgresql

[agallo@acad-synclab ~]$ systemctl status postgresql ●
postgresql.service - PostgreSQL database server
Loaded: loaded (/usr/lib/systemd/system/postgresql.service; enabled;
preset: disabled)
Active: active (running) since Thu 2024-01-11 12:58:34 EST; 3
days ago



That's what I'm not understanding...everything looks OK.

After trying a reboot and restarting some services by hand, things seem to be
working.

I can try to reinstall and see how it goes.

Thanks for your help



--
________________________________
Andrew Gallo
The George Washington University


--
To unsubscribe from this list:
https://lists.internet2.edu/sympa/signoff/perfsonar-user

--
Otto J Wittner, PhD, Senior Engineer
https://sikt.no +47 99550566



Archive powered by MHonArc 2.6.24.

Top of Page