grouper-users - RE: [grouper-users] Maintaining Grouper database size
Subject: Grouper Users - Open Discussion List
List archive
- From: "Black, Carey M." <>
- To: "Hyzer, Chris" <>
- Cc: Shilen Patel <>, David Langenberg <>, Gail H Lift <>, "" <>, Rory Larson <>
- Subject: RE: [grouper-users] Maintaining Grouper database size
- Date: Wed, 31 Jan 2018 18:32:25 +0000
- Accept-language: en-US
- Authentication-results: spf=pass (sender IP is 128.146.138.11) smtp.mailfrom=osu.edu; duke.edu; dkim=none (message not signed) header.d=none;duke.edu; dmarc=pass action=none header.from=osu.edu;
- Ironport-phdr: 9a23:vJ/G9hPRETfJl2qHUm8l6mtUPXoX/o7sNwtQ0KIMzox0K/z4rsbcNUDSrc9gkEXOFd2Cra4c0qyO6+jJYi8p2d65qncMcZhBBVcuqP49uEgeOvODElDxN/XwbiY3T4xoXV5h+GynYwAOQJ6tL1LdrWev4jEMBx7xKRR6JvjvGo7Vks+7y/2+94fcbglUmTaxe69+IAmrpgjNq8cahpdvJLwswRXTuHtIfOpWxWJsJV2Nmhv3+9m98p1+/SlOovwt78FPX7n0cKQ+VrxYES8pM3sp683xtBnMVhWA630BWWgLiBVIAgzF7BbnXpfttybxq+Rw1DWGMcDwULs5Qiqp4bt1RxD0iScHLz85/3/Risxsl6JQvRatqwViz4LIfI2ZMfxzdb7fc9wHX2pMRsZfWTJcDIOgYYUBDOQBMuRZr4bhqFQDtgGxCRWoCe711jNEmn370Ksn2OohCwHG2wkgEsoAvHnJqNX6LrsdUeOtwKLV0zjMdelW1in96YPVdR4tu/+AVq93fMXKzUkgDR/KjlaKpYD4IT+Y2P8As2+A7+p9T+6glXMoqxxorzWp28wihI7JhocPxVDF8yV02Ic1JdukSEFle96kFoVftz2EO4dsXMwtXnxotD4iyr0cvp67eyYKyIwgxx7ebfyHb5KE7Q7kVOaUOTt4i2hleKqmixqs7Ees1+zxW8u73VtMsyFLkcHMu2gT2xPJ9sSLV/5w8lq81TqS0g3T5O5JLVwomafeJZ4u3LE9m5sWvEnGHCL6hlv5gLeUe0gr4OSk9uvqbqn8qpKSKYN4kAXzPro2lsyxH+s1MRQCUm2e9Om8yLLs4UP5TbBUgvA5k6TWrZXXKMsbq6O7DQJY15os5A2kADi81tkTgGMJI0hfeB2diojkI1HOL+78Dfe4m1miiCtmyffaMrH/G5nAM37NnKn4cbZ68EFT1hA/zddC55JIEb4BJ+/zWknsu9DCFh85KQu0w/r5B9phyoMeWGWPAqmDPKPVrF+I+uYvI+6LZI8WojryN/8l5/v2gX84n18SY7Wp0IMJZH+iAvhqPkCUbWfxjtscFGoKsAUzQPD2hFCHXz5ceXm/UqM56zw+Co+qE5nMSpiogLOb3Se7GpNWZnpBClCJCXrnbIGEVO0QZyKROMNtjyIIWLa6Ro8mzhGuqRX2xKR6IerJ4iEYr47s1MBp5+3PkhE/7TN0D96a02GQV2F7gHkISyYr3KBkv0N90EyD3LN8g/xZDtxT++hJXhkgOZ7dyex6F879WhjHftiXVFaqXM+qDi8sQdIskJcyZBM3O8SwgwqHlwGqGb4O3fTfAZc047DRxVDwPM071m7L0q9nglU7FJhhL2qj0+RV8wHYBMqBuEyDmrfiUOJWlHrH8G6IzizX5hpwVxVtF6jJQCZMNQPtsd3l6xaaHPeVArM9P14EkJbaJw==
- Spamdiagnosticmetadata: NSPM
- Spamdiagnosticoutput: 1:99
In general, I think there is a clear need here for operational tools/processes to manage the DB data growth. However, I also hate losing data. ( Delete is a form of “loss”. Hopefully a willful choice, but still a loss.)
Mostly because we lose the ability to ask a whole range of questions about “what really happened”? ( While looking back instead of planning ahead.
J ) Maybe it would be better to have a model where this kind of audit data is moved from “Active” to “Archived” then off to “delete”? Maybe a shadow table(s) where the “Archived data” can be held just out of sight of the operation of the UI/WS, but still around for other reporting? Your schedule of a configuration to define the duration of “Active” (Days/weeks/months, move from “Active” to “Archive” on that
schedule.) and “Achieved” (Days/weeks/months/years) data sounds good. Then add a later schedule to more from Archived to delete. I also think there is the possibility for some to want to treat any membership change ( regardless of source [UI/WS/Loader/etc…]) as equally valuable, and others
might see “non-human” process as less necessary to have in their active audit trail. So maybe the definition of that should be a separate config item? (AKA: “has a subject id”
vs “no subject id” for the change) Maybe even special groups that need more monitoring/carve outs for extra ( or reduced) retention too. Also, I also wonder if there are some reports/summary/monitoring that should be done before the delete that would preserve some details/trends while still letting
go of the volume of data? Maybe there are some groups that it would be nice to monitor the count of members once a day, month, etc.. across the cycles of the academic/finical
calendar? Maybe seeing spikes/dips in Loader loaded data by group/job? Maybe seeing growth/shrinking basis, ref, access control policy groups in the system over time? Etc… So I think it may be harder than just “archive/delete every N days”. Might even be a opportunity to tag with attributes to signal what to do for each group? (
maybe with a system config default if not tagged? ) .. Thinking like Attestation, but for the definition of things like: “ArchiveAfter”, ‘DeleteAfter”, “CollectStatsEvery”…. --
Carey Matthew From: [mailto:]
On Behalf Of Rory Larson Agreed. That would be a very nice feature. Would time-based deletes be based on create-date or last-mod-date? There seems to be a difference between these in the grouper_audit_entry table, though I'm
not sure why a log record or point-in-time record would ever be modified. Thanks, Rory From: Gail H Lift []
Sounds good here too. The configurable time intervals will make it easy to adjust to local needs. On Wed, Jan 31, 2018 at 11:55 AM, David Langenberg <> wrote:
--
|
- RE: [grouper-users] Maintaining Grouper database size, (continued)
- RE: [grouper-users] Maintaining Grouper database size, Hyzer, Chris, 01/29/2018
- RE: [grouper-users] Maintaining Grouper database size, Rory Larson, 01/29/2018
- RE: [grouper-users] Maintaining Grouper database size, Hyzer, Chris, 01/29/2018
- RE: [grouper-users] Maintaining Grouper database size, Rory Larson, 01/29/2018
- Re: [grouper-users] Maintaining Grouper database size, Gettes, Michael, 01/29/2018
- Re: [grouper-users] Maintaining Grouper database size, David Langenberg, 01/30/2018
- RE: [grouper-users] Maintaining Grouper database size, Hyzer, Chris, 01/31/2018
- Re: [grouper-users] Maintaining Grouper database size, David Langenberg, 01/31/2018
- Re: [grouper-users] Maintaining Grouper database size, Gail H Lift, 01/31/2018
- RE: [grouper-users] Maintaining Grouper database size, Rory Larson, 01/31/2018
- RE: [grouper-users] Maintaining Grouper database size, Black, Carey M., 01/31/2018
- RE: [grouper-users] Maintaining Grouper database size, Rory Larson, 01/29/2018
- RE: [grouper-users] Maintaining Grouper database size, Jim Fox, 01/31/2018
- RE: [grouper-users] Maintaining Grouper database size, Hyzer, Chris, 01/29/2018
- RE: [grouper-users] Maintaining Grouper database size, Rory Larson, 01/29/2018
- RE: [grouper-users] Maintaining Grouper database size, Hyzer, Chris, 01/29/2018
Archive powered by MHonArc 2.6.19.