17:01:33 #startmeeting keystone-office-hours 17:01:34 Meeting started Tue Mar 27 17:01:33 2018 UTC and is due to finish in 60 minutes. The chair is lbragstad. Information about MeetBot at http://wiki.debian.org/MeetBot. 17:01:35 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 17:01:37 The meeting name has been set to 'keystone_office_hours' 17:01:45 bridge is open btw: https://redhat.bluejeans.com/8559013623/ 17:02:18 * lbragstad grabs water quick 17:02:29 i'll be on in about 3 minutes 17:02:34 * hrybacki visits a water closet 17:03:58 :/ guys i love your faces but i can't have a multi-hour video conference every week, that starts to feel a lot like work 17:05:18 when you have them everyday, why not have another one? 17:05:20 :( 17:05:40 video calls are the only way I can stay on topic these days -_- my current role is wrecking me 17:08:43 i'm starting an etherpad for the help wanted list https://etherpad.openstack.org/p/keystone-help-wanted-list 17:08:56 if you need my input on the call just ping me 17:10:27 adding that to the OO etherpad cmurphy 17:10:51 cmurphy: can i entice you with doggo cam again? (soon to be puppy cam too) 17:11:02 (2 more weeks and puppy arrives) 17:11:26 kmalloc: hmmmmmmmm maybe 17:11:44 cmurphy: ooh sec. 17:12:56 puppies aside a video call isn't easily reconsumeable even if it's recorded, so it's harder for say wxy to come back to it and figure out what happened 17:17:45 true 17:18:27 https://usercontent.irccloud-cdn.com/file/ss1edaf7/OMG%20PUPPY 17:18:39 For your puppy consumption needs. 17:19:18 The video call was so that someone can drive a specific conversation e.g. look at this review with me. Like, someone outside of the core team for example could ask for more immediate input and get feedback if they are having troubles with just comments on reviews 17:19:55 they can do that on irc 17:20:10 and we can jump on a call if irc isn't cutting it for a given discussion 17:20:32 I guess I was shooting for some consistency 17:21:31 but it is what the team wants :) 17:21:48 yeah I like the video call for focused conversation (ie reviews,specs) 17:22:09 but otherwise irc can probably cover most of what we need 17:22:17 please go ahead with it, it's just evening here for me so i'm going to relax and stuff and am available if needed specifically 17:31:37 jgrassler: updated https://review.openstack.org/#/c/396331/20 to summarize the meeting 17:34:14 lbragstad: Thanks! 17:34:23 jgrassler: no problem 17:34:39 lbragstad: I'll update the spec tomorrow morning (gotta dash now) 17:34:57 jgrassler: yeah - no worries 17:45:23 hrybacki: reviewed https://review.openstack.org/#/c/523973/ 17:45:35 lbragstad: ack, thank you! looking now 17:45:47 looks good, just a few suggestions, but I think we can probably take this to some of the other projects and get feedback 17:46:29 lbragstad: ack. Need folks to ask a few hard questions so we can flesh it out in a meaningful way rather than just speculating imo 18:00:44 lbragstad: the mysql and pgsql errors are different 18:00:54 one is an issue, loosk like with the dataset 18:00:57 the other is a deadlock 18:01:03 huh 18:01:28 for mysql, are you specifically referencing the issue brought up this morning by tmcm? 18:01:37 yep 18:01:54 that looks to be an issue with the FK constraint cannot be made, there is bad data 18:02:05 like project.id reference in a column that doesn't exist in the project table 18:02:07 (example) 18:02:17 kmalloc: yeah - we found out that the domain being referenced didn't actually exist 18:02:23 yep 18:02:43 the hard part here is... we don't really test pgsql 18:02:51 this might be a pgsql issue 18:03:04 there was some discussion about postgres support, but i don't know where that ended up 18:03:25 well, let me check the gate, but... i think we aren't testing pgsql meaningfully 18:03:30 Lance Bragstad proposed openstack/keystone master: Log warning when using token_flush https://review.openstack.org/556889 18:03:48 i thought i remember various TC members being involved there 18:04:03 iirc - it was a long discussion and i never kept up with it 18:04:32 lbragstad: https://bugs.launchpad.net/keystone/+bug/1758460 18:04:34 Launchpad bug 1758460 in OpenStack Identity (keystone) "UUID (or any persistent) token providers unable to validate federation token" [Undecided,New] 18:04:53 tell me with a straight face, how much do we care about UUID provider at this point, even in stable/pike :-) 18:05:08 gyee: not 18:05:14 gyee: it was deleted in Rocky 18:05:18 heh 18:05:32 gotta ask 18:05:33 if it is a major bug, we can fix as long as P isn't EOL 18:05:44 and if the bug is in Q, we can address it 18:05:46 P is near EOL 18:05:49 gyee: i already pulled that plug 18:06:06 lbragstad: it got a resolution https://governance.openstack.org/tc/resolutions/20170613-postgresql-status.html but it was not really a real decision 18:06:07 and started rewriting all the interfaces in that part of keystone 18:06:11 but frankly, i wouldn't care about the P bug unless it's critical, and if the bug is in Q too 18:06:19 since technically we support uuid in both P and Q 18:06:33 that bug's been there since P 18:06:43 only for UUID provider though 18:06:51 as a stable core, i'd merge a fix for Q and backport to P 18:07:16 but i wouldn't write the code myself. 18:07:19 if that helps you out 18:08:04 that's enough info for me to convey back to the decision makers :-) 18:08:06 thanks guys 18:08:20 gyee: and i say that as the only keystone-stable-core member :P 18:08:44 gyee: my recommendation to the decision makers is "fernet" 18:09:14 kmalloc, agree 18:09:23 gyee: if you are writing the fix, propose straight to Q but comment that it cannot be merged to master because uuid has been removed. 18:09:53 kmalloc, nah, I'll push for fernet, no point of touching UUID 18:09:54 and once it's good (inc. a test) port to P 18:09:58 i figured 18:10:02 but just in case you have to ;) 18:10:13 and then ping me directly so we can push it through (if you end up needing it) 18:10:32 gyee: with my internal hat on, our other product already does fernet so you could copy that implementation :) 18:11:17 cmurphy, most of the stuff are already there, just the rotation bit needs work 18:12:15 lbragstad: i'm also going to reference the TC resolution regarding PGSQL there. 18:12:30 lbragstad: it's going to take a bit more work to know what and why PG is having the issues. 18:12:43 lbragstad: i'm guessing it is an issue with load on the DB and a table lock. 18:12:52 lbragstad: possibly a delete operation 18:19:12 gyee: in case you're interested https://review.openstack.org/#/q/status:open+project:openstack/keystone+branch:master+topic:token-provider-refactor 18:19:28 i'm not sure if you are maintaining an out-of-tree token provider anywhere 18:20:24 kmalloc: you mean reference it in the bug? 18:20:30 lbragstad, no out-of-tree, I got pulled in to reverify the federation stuff recently 18:20:38 gyee: oh 18:20:39 but I haven't touch that stuff in awhile 18:20:44 lbragstad: yeah 18:20:51 kmalloc: ok 18:21:25 lbragstad: just commented on the code 18:21:47 lbragstad: i think we might need zzzeek to help us here. 18:22:07 lbragstad: where was that community tag stuff doc'd again? 18:22:29 lbragstad: disregard 18:22:34 here? https://governance.openstack.org/tc/reference/tags/index.html 18:22:48 kmalloc: thanks for digging 18:22:55 aye 18:24:28 lbragstad: update pushed 18:24:33 * hrybacki fetches lunch 18:26:41 sweet 18:27:54 lbragstad: i downgraded the bug to medium 18:28:00 it's PGonly and it's contract 18:28:27 kmalloc: ack - so we're waiting on feedback then? 18:28:30 zzzeek: if you could help out some, trying to chase down potential deadlocks in a contract phase https://bugs.launchpad.net/keystone/+bug/1755906 18:28:30 Launchpad bug 1755906 in OpenStack Identity (keystone) "Occasional deadlock during db_sync --contract during Newton to Pike live upgrade" [Medium,Confirmed] 18:28:58 zzzeek: i just don't see it, and it's happening in PG but not MySQL AFAICT 18:35:58 kmalloc: we're spuporting postgresql again? 18:36:26 zzzeek: no, just a best effort, feel free to say "not my problem/can't help" 18:36:36 kmalloc: postgresql is very locky at the DDL level 18:36:49 just said I would ask -- mostly to be sure we aren't doing something dumb that could bite us in MySQL as well 18:36:59 zzzeek: yeah PG is very locky for integrity reasons(tm) 18:37:54 kmalloc: so, what did you have in mind here? 18:38:17 if you could look at the migration and give a "yeah no clear issues that would impact, this is an edge case" 18:38:19 that is good for me 18:38:37 kmalloc: are the deadlocks against the normal app server running SELECT statements? 18:38:55 the deadlock is happening in a contract phase (new FK constraint) while selects are happening 18:39:05 Gage Hugo proposed openstack/keystone master: Make tags filter match subset rather than exact https://review.openstack.org/553108 18:39:12 lbragstad added a releasenote 18:39:32 this is the no-downtime(limited downtime) upgrade thing 18:40:21 zzzeek: this is the migrtation in question https://github.com/openstack/keystone/blob/master/keystone/common/sql/contract_repo/versions/014_contract_add_domain_id_to_user_table.py 18:41:41 kmalloc: this is adding a foreign key in the contract, huh 18:42:06 because we can't add the forign key in expand... lets just say the no-downtime thing has been headaches 18:42:20 and pivoting the whole table is also... very detrimental 18:42:49 kmalloc: i dont see a quick win on this thered' ahve to be some hey make sure the app server isn't running while the migration happens thing, e.g. more locks 18:43:00 zzzeek: and for all i know the triggers is causing issues 18:43:03 kmalloc: online schema migrations for PG seems like a non-starter given the status of PG 18:43:08 good to know 18:43:15 i'll respond with that and reference this convo 18:43:15 kmalloc: yo're doing the triggers w/ PG as well? 18:43:20 sigh 18:43:32 against all my protests 18:43:47 kmalloc: oh I was arguing in *favor* of the triggers :) 18:43:56 i greatly dislike them. 18:44:12 they're so very hard to debug. 18:44:19 esp. when the app can do all the logic. 18:44:22 kmalloc: im not saying this bug cant be fixed but it would require thinking, looking, coding, and testing and i dont know we have resources for that for PG 18:44:28 yeah thats fine 18:44:44 i'll reference this as well confirming my statement, it's just not something we have resources for 18:44:48 yes 18:44:53 but we will happily accept external help 18:45:00 and if they have a fix, we'll evaluate it 18:45:06 and include it if we can. 18:49:28 lbragstad: ^, marked the bug as incomplete 18:49:34 kmalloc: zzzeek thanks 18:49:38 lbragstad: if they can supply help, we'll accept it 18:49:41 gagehugo: works for me locally 18:50:19 lbragstad: otherwise, we should update our documentation, live-schema changes only supported/tested under MySQL, PGSQL is recommended that live-schema changes not be performed (downtime-only) 18:50:28 lbragstad: acutally,w e sould update docs for that anyway 18:50:33 right 18:52:56 kmalloc: when should we remove keystone-manage token_flush? 18:52:58 Solar? 18:53:08 lbragstad: sure. 19:04:59 Lance Bragstad proposed openstack/keystone master: Log warning when using token_flush https://review.openstack.org/556889 19:14:57 lbragstad: cmurphy perhaps we could conduct an audit of our API vs scope levels 19:16:26 for example, I assumed user related actions would be domain-scoped as opposed to system-scoped. BUT if user/group actions are system-scoped that would resolve our earlier issue lbragstad 19:20:25 Lance Bragstad proposed openstack/keystone master: Removal of deprecated direct driver loading https://review.openstack.org/350815 19:23:40 hrybacki: i think some of that would audit would have been taken care of when we implement scope_types 19:24:18 lbragstad: ack. I'm just thinking doing part of it now would make sure we have a sane, consistent messgae in the spec 19:24:27 e.g. are user operations domain or system level discussion 19:24:52 sure 19:26:12 we could put a disclaimer for that specific case in the spec 20:31:11 is Solar the official name? 20:32:36 not yet 20:34:29 oh - shoot, i should probably wip my review then 20:37:38 I liked stein 21:18:48 knikolla: are you happy with response here https://review.openstack.org/#/c/556022/ ? 21:23:45 lbragstad: want me to update the auth receipt spec and move that down? 21:23:57 or would you prefer doing it as another patch? 21:24:22 adriant: oh - we can do that in another patch set... we talked about that in today's meeting and i said i was going to propose a follow on 21:24:30 but i haven't gotten to it yet 21:24:35 cool :) 21:24:59 sorry I wasn't able to attend 21:25:29 i think kmalloc was also hoping for a final look from ayoung 21:26:19 adriant: no worries - it's not the best time for APAC 21:26:55 As I start making progress on the implementation WIP I'll come by for office hours and ask silly questions! 21:28:51 Lance Bragstad proposed openstack/keystone-specs master: Log queens specifications with previous releases https://review.openstack.org/557060 21:35:44 Merged openstack/keystone master: Make tags filter match subset rather than exact https://review.openstack.org/553108 21:45:56 cmurphy: only because ayound was commenting on it 21:46:00 cmurphy: but i'm fine with it +A now 21:46:06 lbragstad, adriant: ^ 21:46:35 Merged openstack/keystone master: Fix integer -> method conversion for python3 https://review.openstack.org/555339 21:51:09 lbragstad: pushed it :) 21:52:28 Merged openstack/keystone-specs master: Add spec for MFA auth receipts https://review.openstack.org/553670 21:53:56 woo! 21:54:11 now I have to actually implement it. Which shouldn't be so bad