14:00:25 #startmeeting glance 14:00:26 Meeting started Thu Mar 14 14:00:25 2019 UTC and is due to finish in 60 minutes. The chair is jokke_. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:00:27 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:00:29 The meeting name has been set to 'glance' 14:00:31 #topic roll-call 14:00:34 o/ 14:00:56 #link https://etherpad.openstack.org/p/glance-team-meeting-agenda 14:01:19 o/ 14:01:43 o/ 14:02:20 o/ 14:02:40 hey all 14:03:02 #topic updates 14:03:34 Soo we're on FF after the extended merge them features now we agreed last week. 14:04:04 #link https://etherpad.openstack.org/p/glance-stein-milestone-3 14:04:12 current status ^^ 14:04:30 I think lpetrut's work on the Windows compatibility is the last thing that is not purely bugfix we will consider 14:04:51 yes 14:05:00 I don't think we have any FFE requests on anything else. We have some bugfixes pending but in general we're looking pretty good 14:05:08 also do we need FF for visibility? 14:05:17 was just going to ask 14:05:25 that may need to be postponed to train 14:05:30 We need to drop Tempest gating for that 14:05:32 agree 14:05:38 thanks a lot for your help on this. some updates on this matter: we're almost done with the CI 14:06:06 cool 14:06:33 Which is yet again topic on my PTG list. 14:06:49 i will help with that, we need a strategy 14:07:14 But lets move on and we can discuss the more pressing current matters under their slots 14:07:23 #topic release updates 14:07:47 We need to release glance milestone 3 today as agreed in last week 14:07:59 Client got release last week on time and we did not release the milestone 3 14:08:17 python-glanceclient --> Version 2.16.0 released last week 14:08:56 \o/ 14:09:14 for milestone 3, we need releasenotes patch, then need to regenerate sample config, and release patch 14:09:16 I think the big question is do we have something we need the milestone 3 tag for? 14:09:21 RC1 is next week 14:10:13 tags are cheap, I'm not saying we definitely should not do that, but do we have any benefit for tagging it? 14:10:25 is it possible that we can release RC1 direct? 14:10:42 that's a question for sean, but i believe we can 14:10:49 abhishekk: yes, as of this cycle forward there is no need to tag milestones 14:11:06 only reason we did need the milestone tagging was for the db migrations to work 14:11:35 cool, then I guess apart from cache-manage utility we don't have any major feature in this release 14:11:58 As of release model, the Integrated release with milestones changed to kind of Integrated release with possible milestones 14:12:11 yeah, then we need to submit the patch to open the migration for Train release 14:12:31 yes, that will be around Train-1 14:12:39 ack 14:13:03 or when that test breaks the gate 14:13:12 which i forget what exactly it looks for 14:13:15 or that ^^ ;D 14:13:28 :D 14:13:43 I think we did merge the instructions into the release liaison document 14:14:05 jokke_, it was already there 14:14:28 but it definitely should not blow up this week so lets move on unless there is something else about releases? 14:14:50 Ohh, how are the periodical gates looking? Still just sporadical timeouts? 14:15:05 yes, 6 failures in last week 14:15:15 all are timeouts 14:15:29 I have added this topic in PTG discussion etherpad 14:15:37 hmm-m that's like double what we had week before iirc 14:15:55 sounds like the gate load plays a role in there as well 14:15:58 we might opt help from infra guys? 14:16:27 that's it from release updates topic 14:16:41 are we going to tag m3 today or RC1 directly 14:16:50 I think it's at least worth of discussing. We definitely have the data to show now when we have had stable situation for a while 14:17:33 right 14:17:43 Like said I don't mind either way. Does anyone have any compelling arguments for or against tagging m-3? 14:18:56 I guess not. Lets skip it and focus to get good RC1 out next week then! 14:18:58 sorry to say, suddenly there are 8-10 failures for periodic jobs 14:18:59 functional-py35 create: /home/zuul/src/git.openstack.org/openstack/glance/.tox/functional-py35 14:18:59 ERROR: InterpreterNotFound: python3.5 14:18:59 ___________________________________ summary ____________________________________ 14:18:59 ERROR: functional-py35: InterpreterNotFound: python3.5 14:19:19 ^^ looks definitely like Infra issue 14:19:30 they changed some images or something recently 14:19:40 #link http://zuul.openstack.org/builds?pipeline=periodic&project=openstack%2Fglance 14:19:47 yes 14:19:48 thanks abhishekk 14:19:53 thierry said something about it on the ML, though not that exact issue 14:20:24 forget that, it was a gpg key issue 14:20:29 ok, moving on 14:20:39 #topic feature Freeze 14:21:07 So we have those couple of patches lpetrut had to rebase there still hanging 14:21:16 How is it looking ot get those reviewed? 14:21:44 those have entered the gate 14:22:05 one patch needs to be reviewed where he has added time.sleep in tests 14:22:19 #link https://review.openstack.org/#/c/643336/1 14:22:27 yep, I've marked that as WIP as there's still one case that I'll need to fix 14:22:43 so this got me wondering, did you have such problems before? I mean, due to the timestamps 14:23:04 AFAIK, never 14:23:12 lpetrut: I can't recall at least. The good thing about that patch is that it's touching tests only 14:23:15 no, usually the functional tests are slow enough 14:23:18 interesting, good to know 14:23:56 what i mean is, we haven't had to worry about instantaneous updates in the functional tests 14:24:07 ^^ 14:24:43 yeah, I'm not sure if it's just very fast or the clock is inaccurate :) 14:25:13 At this point we need to put full focus on nailing the bugs down. Stuff like the test hardenings are great for this time of the cycle 14:25:24 and sorry to ask, but do gate tests run on bare metal? 14:25:40 Shall we have bug smash around the weekend, Monday maybe? 14:25:55 lpetrut: i don't think the tests run on bare metal 14:26:01 lpetrut: no they mostly don't 14:26:01 though, maybe occasionally 14:26:15 i think in general they are running on "donated" VMs 14:26:34 yes 14:26:35 there is some baremetal testing but like rosmaita said mainly zuul respins VMs 14:27:15 got it, thanks 14:27:32 abhishekk: was your yes for lpetrut or bug smash? Sorry I mixed up the discussion here a bit 14:27:48 bug smash 14:28:04 cool. And I hope you're feeling better 14:28:05 quick question about lpetrut: how do we feel about the sleeps? just let them through? 14:28:20 yes, better and better 14:28:23 they are very small sleeps 14:28:44 I thought that a few ms should be negligeable. mocking complicate the code and cause other issues 14:29:02 I can make them optional (e.g. have a separate method for that, something like .add_delay) 14:29:07 i agree, don't want to mock too much stuff in functional tests 14:29:37 maybe let's just leave them in so we can get the rest merged and people can start testing it 14:29:46 do the optimizations as a followup patch 14:29:55 As long as the delays are not in a places where we would expect catching real issues like race conditions I'm fine having them in tests 14:30:31 well, they are db updates, so race condition city 14:31:24 ok, lets review them carefully and see if we should perhaps make sure the test reflects real life scenario enough. 14:31:38 ok 14:32:02 if it's testing issue I'm fine with delaying them, if it's something we should fix to make sure it's atomic change, lets fix it :D 14:32:10 Do lpetrut need to send FFE mail as per standard? 14:32:47 abhishekk: like said earlier these are testing hardening/bug fixing. Perfect for this time. I don't see new features introduced here 14:33:06 cool 14:33:18 yeah, but we need these test fixes for the other stuff to merge, right? 14:33:55 rosmaita: I think they are not prerequisite for the stuff gating atm. but for the windows ci to run 14:34:02 lpetrut: correct me if I'm wrong 14:35:25 ok, moving on 14:35:32 in that case, it would be cool if lpetrut did the optimization of only sleeping if running on windows 14:35:37 #topic py3 glance_store issue 14:35:50 rosmaita: this was yours 14:36:23 #link https://review.openstack.org/#/c/620234/ 14:36:26 rosmaita, I will try to have a look at test patch today (or may be tomorrow) 14:36:41 abhishekk: don't i have another idea on that 14:36:54 great 14:36:58 i will be changing the patch a bit, had an idea while at the dentist this morning 14:37:14 :D 14:37:15 anyway, i just want to push the above 14:37:25 rosmaita: ok what you had in mind? 14:37:41 well, mainly i need two +2s 14:37:44 :) 14:37:57 or need to find out that this is unacceptable 14:38:04 i would like to close this out ASAP 14:38:33 Just bit of a background. We had discussion with rosmaita about this patch and what Tim said about monitoring the result returned from that read 14:39:32 yeah, the basic idea is that i want to keep this patch as small as possible, that is, don't want it to mask any other problems 14:39:45 So Tim's proposal was instead of faking the zero read to check every response we get from the reads and if it's anything but actual data coming in, replacing it 14:39:55 such a bad time to run out of battery :) without 634007, we cannot run the tests on Widndows (currently in the gate), while 643336 fixes unit tests. sorry for going off topic 14:40:36 and that's exctly why we came to the conclusion that faking the zero read instead of replacing something that executed code returns to us would be the better way to go 14:41:00 That way we will catch possible future issues while we try to find out what is the actual root cause 14:41:08 yes, so we just need sean or abhishek to agree with jokke_ 14:41:28 yes makes sense to me 14:41:48 in the meantime, i am trying to get functional tests that catch this 14:42:13 and we left the bug open, so the commit message is pointing to related-bug not closes-bug 14:42:21 and the note added by rosmaita helps better to understand it 14:43:05 so we have reminder that we actually need to figure out what's crapping us in the first place. This change will just fix this specific issue so we can release and indeed work with py3 14:43:25 i'll add some comments to the bug 14:43:43 great, I think this is beaten, lets move on 14:43:43 great 14:44:02 #topic data remains in staging 14:44:13 abhishekk: this is something you've been working on 14:44:22 jokke_, yes 14:44:24 i put it on the agenda for abhishekk 14:44:39 i want to make sure we are all ok with the way the problem is handled 14:44:43 And I think rosmaita pointed out very well that the staging should not be relying deployment store enablements 14:44:44 i think it's fine 14:44:45 yesterday Me and rosmaita had a discussion about how to fix this 14:44:59 and we will be refactoring this for Train 14:45:03 ok 14:45:06 ++ 14:45:13 but this fix i think needs to go back to rocky 14:45:28 so we go as it is for now and keep it in the agenda of PTG discussions how we do this finally? 14:45:32 I have tested this for single store and multiple store and it is working 14:45:38 so i do have a question for abhishek, though 14:45:53 please shoot 14:46:11 you found in testing that using _build_store() to create the store to delete from modified teh "real" store 14:46:23 but why does it work in the api in the /stage call? 14:47:07 just give me a minute 14:47:14 sure, or we can discuss later 14:47:30 just wanted to determine whether we need to change the code in /stage also for now 14:47:41 i don't think so 14:47:48 but it is kind of weird, i am missing something 14:48:20 The staging code at least used to work in a way that it created new store object, owerwrote the config on that, so the config change stayed local on that object 14:48:52 right 14:48:54 in staging code we are creating file store instance and calling add method of file store directly 14:49:16 it is worst kind of black magic wizardry you can do in OOP 14:50:07 abhishekk: and I think that was rosmaita's point on the comment, we should do the same on the cleanup. Just create the object with overwritten config and call it's delete directly 14:50:38 to call delete method of filestore we need to pass location object to delete method 14:50:49 instead of relying the store library to figure out how to get to the path 14:50:56 abhishekk: ohhh 14:51:08 and it is so weired to create a location object on the fly 14:51:12 this is related, i think this will be a problem: https://github.com/openstack/glance/blob/master/glance/api/v2/image_data.py#L76 14:51:16 so we would need to do even more hackstery to create that bloody object 14:51:45 ok, then I agree. Lets keep this simple for now and do it properly in Train 14:51:46 i think get_location_from_uri requires the scheme to be "registered" 14:51:55 yes, so instead of that I have kept it simple 14:52:42 i think we need to do unlink that that _unstage function 14:52:42 abhishekk: would you mind to write me an mail about this whole scenario and the reasons why it sucks? ;) I'd like to have this explained in known issues section of the release notes 14:52:49 rosmaita, right, and if I register it and if there is filestore in the 'stores' conf option then all images will be stored in staging area :D as it will override 14:52:58 jokke_, sure 14:53:10 great 14:53:48 ok, so lets stick with the current and explain in documentation. 14:53:51 I will draft a mail within a hour after the meeting 14:54:23 this should get easier once we get to utilize those reserved stores on multistore for it 14:54:35 yes, way easy 14:54:45 abhishekk: no rush, I need it like before the final release :D 14:54:50 ok, so i get it ... the problem is having to use the location for delete 14:55:01 have i mentioned that i really hate image locations recently? 14:55:13 #MeToo 14:55:14 :D 14:55:23 meaning "recently mentioned" not "recently hated" 14:55:25 rosmaita: and not only location, which we know, but the store delete expecting properly formed location object 14:55:37 i have hated them for a very long time 14:55:45 yeah same 14:55:52 ok, so to summarize: 14:56:00 creating the store in /stage is fine 14:56:27 probably a problem wiht the _unstage method in the image_data controller for the delete, though 14:56:47 in the task, abhishek's code is doing the right thing to just manually delete the stuff from teh staging area 14:56:55 the end 14:57:05 yeah 3 min for open discussion 14:57:11 #topic open discussion 14:57:57 rosmaita: so just to clarify, the glance_store needs the store type to be enabled to be able to form the location object from the uri to pass to the delete ... in nutshell that's the current problem :D 14:58:24 correct 14:58:26 right 14:58:38 anything else? 14:58:44 nope 14:58:46 oh yeah 14:59:04 You need to send mail for PTL candidacy 14:59:16 #agreed Bug scrub day at Monday! Lets go through our open bugs and make sure we haven't missed anything critical 14:59:32 jokke_: what abhishekk said!!! 14:59:37 just to update, I have submitted specs for nova and cinder to use multiple backned of glance 14:59:40 abhishekk: I already submitted my candidacy to the elections repo ;) 14:59:48 abhishekk: Amazing! 14:59:59 jokke_, great 15:00:36 ok, time ... we can continue on #os-glance 15:00:39 thanks all 15:00:40 thank you all 15:00:43 #endmeeting