Skip to Content.
Sympa Menu

perfsonar-user - RE: [perfsonar-user] Encountered end of file

Subject: perfSONAR User Q&A and Other Discussion

List archive

RE: [perfsonar-user] Encountered end of file


Chronological Thread 
  • From: Louis-Berthier Soullière <>
  • To: Mark Feit <>, "" <>
  • Subject: RE: [perfsonar-user] Encountered end of file
  • Date: Tue, 16 Jul 2019 12:42:13 +0000
  • Arc-authentication-results: i=1; mx.microsoft.com 1;spf=pass smtp.mailfrom=ssss.gouv.qc.ca;dmarc=pass action=none header.from=ssss.gouv.qc.ca;dkim=pass header.d=ssss.gouv.qc.ca;arc=none
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=pxcNGzScHZMV8GoGu1Wq5hKFQRjx+31xN1I3DsakiLQ=; b=MA/EF6Gm+OwlX+Mjav0RWiC4g5v4+Zea8MZNiQ5Gebwuh/tU4vFlXWCy9Tm/soesBtwJxqY3LnzxakVsXm0qKm2LnBCk01MDS/vJ29vZXfJfOcgBrlasOWOfVITKEkTZ4TAvV+mWFfc4BQaXhl4UUXiYqLX5GyWueSRCTw2aFNMSjRHbdc0UEiarPK/WFWrs0WvPkNehHMqRSmTh5o0oEI1CYVS3BWVe56Z2sR/R2IYKZ12iotp/B4xnG1qleH3swNN0dIVcWDlb8/qwTeENt9qjVFhZwk3GBuvBmiR4mRZC+Rjo3iKSaHNSPJj71mFxIhRzQLCclrzsNW6+DwImKQ==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=QAA77lgwlqRiN8Hh42WQBOx7zTvG7oNroivrz58g8p6vIyX9s1P2YnGhTKIMDKKpzOG/SoaRqDY9ZwqZ5PmGmPeBfwl8sPmm3PIhVAaXvRBrW8SrrkwndmZX29g6K2eeSByW6UiG9ksqjQrnGZKp9bbuFRmSYsOFhiMtZVAG3JzJHZqYd7JP/PCqa7wugXqMsht89DRSGUWObmFwCtBipSO8wfiTJMwTJ3jjQs38yZ9+eMFt84zIuRCz7uePPvOGNXxvUn2Y5va55xCJ2H6aikCzXU0TAAmOw24DrodurO0B99eoTRU5AmK20pDcRwggy2wChevKd259Dx645aMP7w==

Hi Mark,

 

I restarted the httpd service and it is now working.

 

But I got another error when doing the troubleshoot (btw, i changed the MTU on the interfaces of both server to 1460)

 

localhost:

 

  Measuring MTU... 65535 (Local)

  Looking for pScheduler... OK.

  Checking clock... OK.

  Exercising API... Status... Tests... Tools... OK.

  Idle test.... 4 seconds.... Checking archiving... OK.

 

10.x.x.1 :

 

  Measuring MTU... 1460+

  Looking for pScheduler... OK.

  Checking clock... OK.

  Exercising API... Status... Tests... Tools... OK.

  Idle test.... 1 seconds.... Checking archiving... OK.

 

localhost and 10.x.x.2 :

 

  Checking IP addresses... IPv4

  Measuring MTU... 1460+

  Checking timekeeping... OK.

  Simple stream test.... 9 seconds.... Failed.

 

Task failed to run properly.

 

2019-07-16T12:06:32Z on 127.0.0.1 and 10.x.x.2 with simplestreamer:

 

simplestream --dest 10.x.x.2

 

Run did not complete: Failed

 

This run likely failed because the clocks on participants differed by 0:00:01.941681.

 

Limit system diagnostics for this run:

 

  No limits were applied

 

Diagnostics from 127.0.0.1:

  Try 1 failed: Failed to connect: [Errno 111] Connection refused

  Try 2 failed: Failed to connect: [Errno 111] Connection refused

  Try 3 failed: Failed to connect: [Errno 111] Connection refused

  Try 4 failed: Failed to connect: [Errno 111] Connection refused

  Try 5 failed: Failed to connect: [Errno 111] Connection refused

  Ran out of time trying to connect.

 

Error from 127.0.0.1:

  Failed to connect: [Errno 111] Connection refused

 

Diagnostics from 10.x.x.1 :

  No result was produced

 

Error from 10.x.x.2:

  No result was produced

 

============

 

I synchronized both clock manually before the operation, and both use the same server.

 

# service ntpd stop
# ntpdate ntp.rtss.qc.ca
# service ntpd start

 

I even tried to switch to chrony because it is suppose to handle the network latency better (satellite), no luck.

 

On both host :

 

[]# psc ping localhost

localhost: pScheduler is alive

 

If I try to fetch manually the api version from the host 10.x.x.2, it work :

 

# curl -ks https://10.x.x.2/pscheduler/api

4

 

Log of 10.x.x.1 :

 

Jul 16 08:09:54 perfsonar journal: runner INFO     205654: With simplestreamer: simplestream --dest 10.x.x.1

Jul 16 08:09:55 perfsonar journal: runner INFO     23875: Posted result to https://10.x.x.1/pscheduler/tasks/de96c854-9933-472d-b8f7-c52612c8b2b9/runs/805a53b3-d80d-4b53-bb35-9f3da25b667d

Jul 16 08:09:55 perfsonar journal: runner INFO     23877: Posted result to https://10.x.x.1/pscheduler/tasks/b3c3941e-fac9-4d03-9f5a-e359244e2db8/runs/056dcc00-a9e0-4903-95fc-47b50d9cda98

Jul 16 08:09:59 perfsonar journal: runner INFO     205654: Run succeeded.

Jul 16 08:10:05 perfsonar journal: runner INFO     22182: Posted result to https://10.x.x.1/pscheduler/tasks/02f151e9-18ae-4370-b7fb-bc29eb263a25/runs/49c23c92-e7da-4d62-8012-2f816a72c467

Jul 16 08:10:05 perfsonar journal: runner INFO     22180: Posted result to https://10.x.x.1/pscheduler/tasks/936ac75d-0020-49be-93f2-2a37743ff900/runs/e1a798c8-5a90-4cc6-b399-661fc6347edf

Jul 16 08:10:14 perfsonar journal: runner WARNING  205654: Unable to retrieve run https://10.x.x.2/pscheduler/tasks/bf27fdfe-7c89-4db3-ab34-a5180152b35c/runs/fb8a42c9-f614-4029-a474-d2fea305c0bf

 

Log of 10.x.x.2 :

 

Nothing

 

Louis

 

De : Mark Feit [mailto:]
Envoyé : 15 juillet 2019 18:57
À : Louis-Berthier Soullière <>;
Objet : Re: [perfsonar-user] Encountered end of file

 

Louis-Berthier Soullière writes:

 

I'm trying to run this command with no success :

 

# pscheduler ping 10.x.x.1

10.x.x.1 : Encountered end of file

- Running pS-Toolkit-4.2.0b1-CentOS7-FullInstall-x86_64-2019Jul09.iso on both side

 

One key difference between the beta and production versions is the introduction of different library to do the HTTP(S) operations.  It’s cURL, which has an established track record, and we’ve had it in the test bed for months.  The error you’re seeing comes from cURL.

 

“pscheduler ping 10.x.x.1” does the equivalent of “curl -sk https://10.x.x.1/pscheduler/”.  If you can do that on the command line and get the same result, the problem isn’t likely a problem with pScheduler.

 

- High latency between the two server : 530ms

- Tried to simulate latency between two local server and cannot replicate the issue (not a latency problem)

- The test work one way but not the other. (Server A to B = OK, B to A = issue)

 

Is this the same pair of systems you were working with in ticket #884?  Your last comment indicates that there’s a MTU mismatch between the two systems, which would cause problems in the larger-to-smaller direction.  If you’re on satellite, some providers have been known to do things that make path MTU discovery not work.

 

Is it possible to disable all limits for testing purpose?

 

This isn’t a limits-related problem, but you can disable the limit system by deleting or renaming /etc/pscheduler/limits.conf.  The change will take effect within 30 seconds.

 

--Mark

 




Archive powered by MHonArc 2.6.19.

Top of Page