perfsonar-user - Re: [perfsonar-user] Node not reporting to Maddash
Subject: perfSONAR User Q&A and Other Discussion
List archive
- From: Raul Lopes <>
- To: Mark Feit <>, "" <>
- Subject: Re: [perfsonar-user] Node not reporting to Maddash
- Date: Tue, 6 Jul 2021 14:00:30 +0000
- Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=jisc.ac.uk; dmarc=pass action=none header.from=jisc.ac.uk; dkim=pass header.d=jisc.ac.uk; arc=none
- Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=kWf3Forj8Cuu8BuANkM58KsWJ1Gb/pB55zQfXjGGxeA=; b=DvxcRgqGFEe3wRBkELujpoCnX/X6HQor+wGDzWk1C7JqDhOHIinhQgEKH1NrL2sa2yxNoa+XKDIAPUE0acGymC2KCAKH/xWVatbLkeHAuPXXsRRF3ElFlcy6Cj1kw56y0WRD59leOyKgSGjIIyc2PvIa834K1aKV1rKvbtDZyAn3Tba583AiL56uqz45OrWyflaydQi+fZN9lkuJ2MUYxoWmJ1Rb+q6UoKWxRpTWa9TTdOkzdJjRvllv74gEE4zYYAQ0rM4ks/lMtT3w6cZK7/hBg5D6O7BDdoU6LhXQHGzq545G1O2jjyZjb5X1i4gg7flEWVY5c/g8GhOND4O3Fg==
- Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=kaz7Uz8M3TfnidmEENDf3ExzGrjCA2Nrzg+eYJod+zKdmhz/T1uNomWFcWA3Y3dQiam8wFOA7BFRl9eSiEYgHjRWJvaiInlIUA1mL0pOVtJG+ufo4cbIDz/2Fb5pHC4y0fJ3vfDHdKohtFMccLX9suN9deo6gmExW2PGWVpsUh5vEjEwDJzrIAbYt0JSZ/wWNhU0bANvlK7jUCdaIkVrKUksf1PuvTsQhndIZGej3jOZldg3+SZ7M80XtPWfjgLmbO/sMr7sCOiFrq5PToW+9oWwopoRnSSGLL3YiJ8/AK0K+xj5Nn1KjF3TJCNmh+AXM+vvllI1rSo4gPifXwOfFg==
Sent: 06 July 2021 14:25
To: Raul Lopes <>; <>
Subject: Re: Node not reporting to Maddash
Raul Lopes writes:
I have a couple of nodes in a 20 nodes mesh that run the tests, but do not report to Maddash.
The logs are full of lines like
Jul 5 19:19:25 ps03-em1 journal: runner INFO 489461: Posted result to
https://ps03-lat.rl.ac.uk/pscheduler/tasks/0203bb86-10af-4d77-a607-a51de117f9cf/runs/defa4e86-a17
7-41a5-aebc-e1256683bd07
I can use pscheduler on the command line and get latency and bandwidth run with success. However, they do not send anything to Maddash. There is nothing going from that host to the Maddash server.
The usual first step would be to grab a few of the run URLs for the posted results and ask pScheduler how it thinks the archvings went:
pscheduler result --archivings RUN-URL
At the bottom, you’ll see something that looks like this that should provide diagnostic information like this contrived example:
Archivings:
To failer, Unfinished
2021-07-06T13:05:51Z Possibly-permanent archiver error: Program failed to start after 3 tries: Program exited 1: Forced to fail badly.
However also shows
2021-07-06 09:58:14,944 [INFO] /usr/lib/python3.6/site-packages/pycassa/logging/pool_logger.py: Connection 139755773060824 (localhost:9160) in pool 139755772886656 failed:
TSocket read 0 bytes
esmond/django.log has an error
2021-07-06 10:50:25,567 [ERROR] /usr/lib/esmond/lib/python3.6/site-packages/django/core/handlers/exception.py: Internal Server Error: /esmond/perfsonar/archive/4dbdcd5297fa41eaa01b5474ff5b186f/
Traceback (most recent call last):
…
File "/usr/lib64/python3.6/site-packages/thrift/transport/TSocket.py", line 143, in read
message='TSocket read 0 bytes')
thrift.transport.TTransport.TTransportException: TSocket read 0 bytes
Has anyone seen a situation like that?
I haven’t seen that specifically, but zero-byte reads are often a sign that the server closed the connection while the client was expecting it to send something. Andy might have something more specific since he’s our Esmond maven. He’s just back from vacation and should get to this while digging out from under a week’s worth of mail.
--Mark
Jisc is a registered charity (number 1149740) and a company limited by guarantee which is registered in England under company number. 05747339, VAT number GB 197 0632 86. Jisc’s registered office is: 4 Portwall Lane, Bristol, BS1 6NB. T 0203 697 5800.
Jisc Services Limited is a wholly owned Jisc subsidiary and a company limited by guarantee which is registered in England under company number 02881024, VAT number GB 197 0632 86. The registered office is: 4 Portwall Lane, Bristol, BS1 6NB. T 0203 697 5800.
Jisc Commercial Limited is a wholly owned Jisc subsidiary and a company limited by shares which is registered in England under company number 09316933, VAT number GB 197 0632 86. The registered office is: 4 Portwall Lane, Bristol, BS1 6NB. T 0203 697 5800.
For more details on how Jisc handles your data see our privacy notice here: https://www.jisc.ac.uk/website/privacy-notice
- [perfsonar-user] Node not reporting to Maddash, Raul Lopes, 07/06/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Mark Feit, 07/06/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Raul Lopes, 07/06/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Mark Feit, 07/06/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Raul Lopes, 07/06/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Raul Lopes, 07/07/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Raul Lopes, 07/07/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Szymon Trocha, 07/08/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Raul Lopes, 07/07/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Raul Lopes, 07/07/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Raul Lopes, 07/06/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Mark Feit, 07/06/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Raul Lopes, 07/06/2021
- <Possible follow-up(s)>
- Re: [perfsonar-user] Node not reporting to Maddash, Raul Lopes, 07/08/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Szymon Trocha, 07/08/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Raul Lopes, 07/08/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Szymon Trocha, 07/08/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Raul Lopes, 07/08/2021
- Re: [perfsonar-user] Node not reporting to Maddash, Mark Feit, 07/06/2021
Archive powered by MHonArc 2.6.24.