Skip to Content.
Sympa Menu

grouper-users - Re: [grouper-users] Error on job: CHANGE_LOG_consumer_psp

Subject: Grouper Users - Open Discussion List

List archive

Re: [grouper-users] Error on job: CHANGE_LOG_consumer_psp


Chronological Thread 
  • From: "Bee-Lindgren, Bert" <>
  • To: Scott Koranda <>, grouper-users <>
  • Subject: Re: [grouper-users] Error on job: CHANGE_LOG_consumer_psp
  • Date: Thu, 12 Jan 2017 17:07:30 +0000
  • Accept-language: en-US
  • Authentication-results: spf=none (sender IP is ) ;
  • Ironport-phdr: 9a23:X4+pghebgx+rA/0hcIWUurAllGMj4u6mDksu8pMizoh2WeGdxc27ZBWN2/xhgRfzUJnB7Loc0qyN4vymAjRLsMjJmUtBWaQEbwUCh8QSkl5oK+++Imq/EsTXaTcnFt9JTl5v8iLzG0FUHMHjew+a+SXqvnYdFRrlKAV6OPn+FJLMgMSrzeCy/IDYbxlViDanb75/KBq7oR/fu8UKjodvLqc8wQbVr3VVfOhb2XlmLk+JkRbm4cew8p9j8yBOtP8k6sVNT6b0cbkmQLJBFDgpPHw768PttRnYUAuA/WAcXXkMkhpJGAfK8hf3VYrsvyTgt+p93C6aPdDqTb0xRD+v4btnRAPuhSwaLDMy7n3ZhdJsg6JauBKhpgJww4jIYIGOKfFyerrRcc4GSWZdW8pcUTFKDIGhYIsVF+cPPfhWoZThp1UArhW+CwujBOLzxTBHnXL5x7E23/g7HA3awAAsAtQDu2nUotXvM6cSVPi4wKfVwDrdcfxawDHz6InOch88rvGMR6x/cczJxUIyCg7KkE+QqZDrMziI0ekCqHaU7/B8Ve61kGEotR9+riGrxsctjInGm40Vykra+iV/24s1Ity4SFJhbd6rCpRQqz+VOJd2Qs84XW5npTs6x6QAtJWmciYKz5EnyATea/yBa4WI7QjjVOGMITdgmn1lYqiziAyq8Ui40uH8VM60301QripYlNnMrXAN2ALX6siDUPd95Fqh2TCJ2gvO6e9EOVg5mbTBJJI937I9k4cfvVneEiLzlkX7g7Oaelkh9+im9evqbbHrq5qZOoJ2lA3xLrwiltK/DOgiLwQDXWaW9f6h2LDj/ED1WqhGg/MonqTfrZvUP94UprSjDA9Qyosj6wiwDzOh0NkAhXcKMFVLdA6JgoTwNV/AJ/71Ae64g1u3jjhn3ffGPqD9AprWKXjDjbHhcqtn505E0gozysxf6IxIBbEdIfLzXUnxuMbfDh8kLwy0x+HnCNJ+1o8ERW2PBaqZPLvTsV+O+O0vP/GBaJEPtzrhNvQo4uPigWIklVIYc6mk0ocbZGy9E/RoPUqUbn7hj9IEHGoIoAYxUPTmiFiYXj5SY3ayUbg85jY+CI++FojMWJqtgKCb0CigAJJZeHlKClSNEXfuaoqEVO0MZzyUIsJ6lTwEVKShR5Uu1R60qA/11aZoLurO9iIGqJ3jycB55/fPmhEq6Tx0E8Od3nmCT2FumWMIWic2075loUBk11eDyrZ3g+ZDFdFI4/NJUxw6NYLHz+BkEdzyWwTBfsuXR1a8RNWpHy0xQswrz9AQfkl9BobqsheW5CGtH6NdtLuRDZo4++qIxHv2Pc97x3/u26wojl1gScxKYynuibR46hDeHcvUiEiDjO7+er4bwTbA7iKe1meUp2lZVhJ9S6PIQSpZa0fL+4fX/ETHGvWEBLliFgpHxcGYJ60OIvfkl0kMDKPpNc7CJWi8liK0BBCEyamBaqLtenlb0yzBBUME1Q0f4CDVZkAFGi69rjeGX3RVHlX1bhapqLEmpQ==
  • Spamdiagnosticmetadata: NSPM
  • Spamdiagnosticoutput: 1:99

Hello,


> ... Did not get all the way through the batch! 88720 != 88721 ...


Changelog entry 88721 failed to be processed. It might help to know what that change was, and it might help try to duplicate the problem. Can you query the changelog for some details about it?


Of course, PSP should have logged something about its failure to process that change; I would have expected an error message to be there before the log lines you sent. Also, it's a long shot, but might there be some information in the grouper_event log?  Or search for 88720 and 88721 in the log directory, obviously ignoring the "Did not get all the way through the batch!" messages.


The "Did not get all the way through the batch! 88720 != 88721" repeated over and over, right? If not, then retryOnError might not have been working.


Finally, if there aren't any other error messages, then we'll probably have to try to duplicate the problem with more verbose logging.



--Bert



From: <> on behalf of Scott Koranda <>
Sent: Thursday, January 12, 2017 11:09 AM
To: grouper-users
Subject: [grouper-users] Error on job: CHANGE_LOG_consumer_psp
 
Hello,

We have a Grouper 2.2.2 running in production. The deployment
uses the PSP to provision grouper memberships to OpenLDAP,
including the 'isMemberOf' attribute on a person record.

The deployment includes this configuration in
grouper-loader.properties:

changeLog.changeLogTempToChangeLog.quartz.cron = 0/10 * * * * ?
changeLog.consumer.psp.class = edu.internet2.middleware.psp.grouper.PspChangeLogConsumer
changeLog.consumer.psp.quartzCron = 5/10 * * * * ?
changeLog.consumer.psp.retryOnError = true
changeLog.psp.fullSync.class = edu.internet2.middleware.psp.grouper.PspChangeLogConsumer
changeLog.psp.fullSync.quartzCron = 0 0 0 * * ?

So temporary change logs are moved every 10 seconds.
The PSP also fires every 10 seconds, with 5 seconds of offset
from the temporary change log job. This gives us on average an
upper limit on latency of 15 seconds.

Last night we saw these errors in grouper_error.log:

2017-01-12 01:01:15,059: [DefaultQuartzScheduler_Worker-7] ERROR ChangeLogHelper.processRecords(281) -  - Did not get all the way through the batch! 88720 != 88721
2017-01-12 01:01:15,059: [DefaultQuartzScheduler_Worker-7] ERROR GrouperLoaderJob.runJob(433) -  - Error on job: CHANGE_LOG_consumer_psp
java.lang.RuntimeException: Error in loader job: null, check logs: Did not get all the way through the batch! 88720 != 88721
    at edu.internet2.middleware.grouper.app.loader.GrouperLoaderJob.runJob(GrouperLoaderJob.java:422)
    at edu.internet2.middleware.grouper.app.loader.GrouperLoaderJob.execute(GrouperLoaderJob.java:315)
    at org.quartz.core.JobRunShell.run(JobRunShell.java:202)
    at org.quartz.simpl.SimpleThreadPool$WorkerThread.run(SimpleThreadPool.java:529)
2017-01-12 01:01:15,081: [DefaultQuartzScheduler_Worker-7] ERROR GrouperLoaderJob.execute(318) -  - Error running up job
java.lang.RuntimeException: Error in loader job: null, check logs: Did not get all the way through the batch! 88720 != 88721
    at edu.internet2.middleware.grouper.app.loader.GrouperLoaderJob.runJob(GrouperLoaderJob.java:422)
    at edu.internet2.middleware.grouper.app.loader.GrouperLoaderJob.execute(GrouperLoaderJob.java:315)
    at org.quartz.core.JobRunShell.run(JobRunShell.java:202)
    at org.quartz.simpl.SimpleThreadPool$WorkerThread.run(SimpleThreadPool.java:529)
2017-01-12 01:01:20,014: [DefaultQuartzScheduler_Worker-10] INFO  EventLog.info(156) -  - [81bd39d7cdb946a8b44306cb0e5de3d3,'GrouperSystem','application'] session: start (2ms)
2017-01-12 01:01:25,014: [DefaultQuartzScheduler_Worker-2] INFO  EventLog.info(156) -  - [58af231ef1ec47d98e1dbaa9f16c5e66,'GrouperSystem','application'] session: start (3ms)

A restart of the Grouper loader process made these errors go
away and PSP provisioning proceeded as we expect.

What would have caused those errors?

Why did a restart of the Grouper loader clear those errors?

What can we do in the future to prevent similar errors?

Thanks,

Scott K



Archive powered by MHonArc 2.6.19.

Top of Page