09:01:09 <flwang1> #startmeeting magnum 09:01:10 <openstack> Meeting started Wed Oct 23 09:01:09 2019 UTC and is due to finish in 60 minutes. The chair is flwang1. Information about MeetBot at http://wiki.debian.org/MeetBot. 09:01:11 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 09:01:13 <openstack> The meeting name has been set to 'magnum' 09:01:21 <flwang1> #topic roll call 09:01:59 <flwang1> o/ 09:02:13 <strigazi> o/ 09:02:43 <brtknr> o/ 09:03:56 <flwang1> let's stop typing on etherpad for now 09:04:02 <flwang1> and focus here? 09:04:16 <flwang1> #topic updates 09:04:28 <flwang1> as i mentioned in the ehterpad 09:04:43 <flwang1> the ignition patch has been merged and cherrypicked to train 09:05:11 <flwang1> we may need a tiny patch to drop the copy local-data to cfn-init-data 09:06:01 <flwang1> strigazi: brtknr: ^ 09:06:26 <strigazi> I don't know if we need it 09:06:42 <flwang1> strigazi: brtknr: i think we should focus on irc 09:06:46 <flwang1> strigazi: why? 09:06:55 <strigazi> flwang1: why we need it 09:06:56 <strigazi> ? 09:07:08 <brtknr> if its copying, it will behave the same as before? 09:07:26 <brtknr> does the heat patch prioritise local-data? 09:07:49 <flwang1> strigazi: see https://github.com/openstack/magnum/blob/master/magnum/drivers/k8s_fedora_coreos_v1/templates/user_data.json#L80 09:08:09 <flwang1> you added ExecStartPre=-mv /var/lib/os-collect-config/local-data /var/lib/cloud/data/cfn-init-data 09:08:23 <flwang1> not copy, it's a mv 09:08:36 <brtknr> doh! its mv /var/lib/os-collect-config/local-data /var/lib/cloud/data/cfn-init-data, not cp! 09:08:50 <brtknr> either we should change it to cp.... 09:08:59 <flwang1> we should drop it 09:09:00 <brtknr> because heat will have already cut a release 09:09:17 <brtknr> when magnum cuts a release, it will break compatibility 09:09:22 <flwang1> ah, i see your point 09:09:30 <flwang1> that's shit 09:09:48 <brtknr> yeah :( 09:10:21 <flwang1> anyway, we can keep it maybe 09:10:34 <brtknr> thats assuming heat has cut a release 09:10:35 <strigazi> ExecStartPre=- means fail safely 09:10:39 <brtknr> we should check with them 09:10:51 <flwang1> strigazi: that's good 09:10:56 <strigazi> why we need it? 09:11:12 <flwang1> strigazi: you're confusing me again, why we need what? 09:11:18 <strigazi> I did it this way because I was expecting this 09:11:32 <strigazi> We don't need to drop the mv 09:11:51 <flwang1> because it's like a patching 09:11:58 <flwang1> not a solution 09:12:34 <strigazi> ;et 09:12:51 <flwang1> but as brtknr said, heat have already released a version, so we can keep it 09:13:28 <flwang1> don't you think it's ugly to have a mv like this? 09:13:28 <strigazi> we can drop it, the agent will work unless someone has a broken heat. we can keep it, the agent will work with a fixed or broken heat 09:13:48 <flwang1> yep, that's why i think now we can live with that 09:14:09 <flwang1> after I realized that heat have already released 09:14:19 <flwang1> clear now? 09:15:02 <strigazi> yes. That's why I said, "why we need it?" 09:15:27 <strigazi> move on, we keep it. 09:15:27 <flwang1> ok, fail enough 09:15:30 <brtknr> lol... 09:16:29 <strigazi> flwang1: let's move on? 09:16:44 <flwang1> as for fa29 support, personally i would say support it until we think fa30 is stable 09:17:15 <brtknr> #topic FA29 support 09:17:26 <flwang1> #topic fa29 support 09:17:32 <brtknr> :( I dont have the wizardry 09:17:53 <flwang1> brtknr: i can sell it to you if you can buy me a beer 09:18:11 <brtknr> seems like a good trade 09:18:27 <flwang1> strigazi: how do you think? 09:18:35 <brtknr> flwang1: I am happy with that proposal 09:19:03 <brtknr> or when FA31 enters development? 09:20:06 <flwang1> brtknr: either works i think, and i think it won't take too long, because the coreos driver is sharing most of the atomic driver 09:20:44 <brtknr> works for me 09:21:52 <flwang1> next one? 09:21:54 <brtknr> I would prefer is use_podman=true 09:22:18 <strigazi> brtknr: no, default behaviour is better to not change. 09:22:19 <brtknr> is that doable? or would it cause issues for upgrade? 09:22:30 <brtknr> okay fine 09:22:55 <strigazi> user creates a cluster in stein with a CT. In train they should get the same. makes sense? 09:22:56 <flwang1> strigazi: i would like to understand why didn't you introduce this label at the first time? 09:23:12 <strigazi> flwang1: because I got complains later :) 09:23:26 <flwang1> personally, like this 09:23:58 <flwang1> as i mentioned in the comments, it gives us more time to get the confidence to migrate to fedora coreos 09:24:42 <brtknr> sure, I'm happy with it 09:24:47 <flwang1> the only thing i'd like to propose is, just skip this label for fedora coreos 09:24:49 <brtknr> but needed some convincing 09:25:20 <brtknr> maybe we should drop podman support for atomic altogether? 09:25:30 <brtknr> so its a clean separation 09:25:43 <flwang1> it's a bit late at this moment, i would say 09:26:07 <brtknr> its a lot of extra code for a feature with a short shelf life 09:26:20 <flwang1> we can have a separate, fully isolated code, as long as we deprecate the fedora atomic driver 09:26:57 <flwang1> are you suggesting only for master? 09:26:58 <brtknr> we just need to drop the label... default to using podman for coreos and atomic for fa29 09:27:09 <brtknr> with the same driver 09:27:21 <strigazi> give me a moment 09:28:17 <strigazi> I didn't mention this in the commit message unfortunately, but i mentioned it in a meeting. 09:28:22 <brtknr> so USE_PODMAN is an internal label rather than something that the user specifies 09:28:29 <strigazi> brtknr: please waot 09:28:31 <strigazi> brtknr: please wait 09:28:58 <strigazi> kubelet in an atomic container, doesn't work 09:29:10 <strigazi> that is why we have podman in the atomic driver. 09:29:29 <flwang1> iirc, that's for v1.16.x, right? 09:29:54 <strigazi> If you don't want to have fedora atomic and and >= 1.16 we can drop it 09:30:32 <strigazi> brtknr: flwang1 what do you prefer? 09:30:49 <flwang1> strigazi: i'm ok for have it for fedora atomic driver 09:30:54 <strigazi> brtknr: ? 09:31:07 <strigazi> you have some concerns 09:31:19 <flwang1> but i prefer to use true for fedora coreos 09:31:28 <flwang1> instead of letting user to set it for fedora coreos 09:31:42 <strigazi> flwang1: https://review.opendev.org/#/c/690053/2/magnum/drivers/k8s_fedora_coreos_v1/templates/kubecluster.yaml@775 09:32:21 <strigazi> flwang1: https://review.opendev.org/#/c/690053/2/doc/source/user/index.rst@1385 09:32:43 <brtknr> hmm let me thing 09:34:03 <brtknr> i think by the time we start deploying train release to customer sites, which is at least 3 months away, fcos30 should be stable 09:34:28 <flwang1> strigazi: what happened if i set use_podman=false for driver=fedora_coreos? 09:34:33 <brtknr> we dont need the use_podman label, if flwang1 needs it, lets feep it 09:34:40 <strigazi> we also have users, so don't forget us too :) 09:34:54 <strigazi> flwang1: brtknr creation will fail 09:34:56 <strigazi> with: 09:35:24 <brtknr> strigazi: yes you too! if your users need it! 09:35:35 <flwang1> yep, but i don't want to see that kind of "surprise" 09:35:55 <flwang1> wait 09:36:05 <strigazi> flwang1: | status_reason | ERROR: Parameter 'use_podman' is invalid: "false" is not an allowed value [true] | 09:36:27 <strigazi> it is pretty clear and prertty fast 09:36:49 <strigazi> http://paste.openstack.org/show/785502/ 09:36:51 <flwang1> unless i set use_podman=false for a fedora_coreos template 09:36:58 <flwang1> ok, i'm good then 09:37:55 <strigazi> brtknr: thoughts? 09:38:21 <brtknr> is it better to call it podman_enabled to keep with the convention 09:38:48 <flwang1> brtknr: good point? 09:38:53 <flwang1> good point 09:38:54 <strigazi> brtknr: anything else than naming? 09:39:01 <brtknr> nope 09:39:08 <strigazi> are you sure? 09:39:13 <flwang1> hah 09:39:23 <flwang1> strigazi: don't be pushy for our lovely brtknr 09:39:27 <brtknr> i am outnumberd 09:40:02 <strigazi> I mean, I might miss something 09:40:17 <brtknr> we need to document it 09:40:31 <strigazi> https://review.opendev.org/#/c/690053/2/doc/source/user/index.rst 09:40:51 <brtknr> explain in the doc that we might need use_podman in atomic to support 1.16.x+ 09:40:59 <brtknr> explain in the doc that we need use_podman in atomic to support 1.16.x+ 09:41:24 <brtknr> i will add this to review 09:42:04 <brtknr> i need to look at the code again 09:42:15 <brtknr> i was off yesterday so havent had a chance to review 09:43:13 <flwang1> let's move? 09:43:19 <flwang1> we have 17 mins left 09:43:42 <flwang1> heat-agent: Check if scripts exists https://review.opendev.org/#/c/689704/ 09:43:47 <flwang1> i have +2ed 09:44:48 <brtknr> I will review today 09:45:08 <brtknr> But happy with it in principle 09:45:48 <brtknr> Should address the weird behaviour during reboot in heat-contaner-agent log 09:46:01 <flwang1> brtknr: yep 09:46:07 <flwang1> i notice that recently as well 09:46:29 <strigazi> weird behaviour during reboot in heat-contaner-agent log 09:46:35 <strigazi> please elaborate 09:47:28 <flwang1> my case is not reboot, i saw heat-container-agent stuck several times 09:47:30 <brtknr> strigazi: once the heat-container-agent finishes its thing, it starts writing: Oct 22 20:48:26 k8s-dlgmc7wmwxlo-master-0.novalocal podman[1492]: /var/lib/os-collect-config/local-data not found. Skipping to journald 09:47:33 <flwang1> not useful log 09:47:35 <flwang1> no 09:47:47 <brtknr> but after reboot, it would complain about a missing file 09:47:50 <strigazi> brtknr: I mean about reboot 09:47:57 <strigazi> ok 09:48:14 <brtknr> just rebooting the node in the dev environment now 09:48:15 <strigazi> which file? I think my patch fixes it 09:48:42 <brtknr> strigazi: one sec, just waiting for the reboot to complete 09:50:01 <strigazi> https://github.com/openstack/magnum/commit/3674b3617a770bd71d09e23137ff96f90eb1241a My mistake, it wasn't verbose enough. Using the atomic cli to install kubelet breaks mount 09:50:06 <strigazi> propagation of secrets, configmaps and so on. Using podman 09:50:10 <strigazi> in a systemd unit works. 09:50:15 <brtknr> strigazi: why do you strongly dislike PODMAN_ENABLED btw? 09:50:31 <brtknr> i am not that fussed 09:50:59 <strigazi> it doesn't enable anything. But maybe from the user/ops pov it is better 09:52:29 <strigazi> logs? 09:52:30 <flwang1> strigazi: that's a valid point 09:52:42 <flwang1> generally, we use XXX_enabled only for addons 09:52:45 <brtknr> look at the end: https://seashells.io/p/WseYPMUE 09:52:47 <flwang1> not for this kind of case 09:53:23 <brtknr> lets leave it as USE_PODMAN then 09:53:44 <strigazi> brtknr: after the stop? 09:54:07 <strigazi> brtknr: which line? 09:54:16 <brtknr> Oct 23 09:50:29 09:54:41 <strigazi> brtknr: my patch fixes this 09:54:44 <flwang1> strigazi: brtknr: can we discuss details offline? 09:54:56 <brtknr> strigazi: yes, i already ack this 09:55:07 <strigazi> flwang1: 09:55:09 <strigazi> flwang1: ok 09:55:17 <flwang1> sorry, guys 09:55:20 <strigazi> anything last comments? 09:55:28 <flwang1> it's late here 09:55:45 <brtknr> okay, there's the issue with octavia and master lb fip 09:55:47 <strigazi> oh, have a look at the three patches from Theodoros I added in the etherpad 09:55:50 <brtknr> its working for flwang 09:55:54 <brtknr> not for me 09:56:10 <brtknr> could you please test this strigazi 09:56:15 <flwang1> strigazi: i will leave that for brtknr 09:57:41 <strigazi> one final thing from me 09:58:30 <strigazi> We need the patches I proposed, for podman and heat-agent, plus the three fixes from Theodoros: 09:58:35 <strigazi> https://review.opendev.org/#/c/687879/ 09:58:35 <strigazi> https://review.opendev.org/#/c/688400/ 09:58:35 <strigazi> https://review.opendev.org/#/c/688346/ 09:58:47 <strigazi> in the train branch 09:58:53 <flwang1> CERN is trying to upgrade to train ? 09:58:55 <strigazi> and release 9.1.0 09:59:00 <strigazi> yes 09:59:06 <flwang1> ok, i see then 09:59:28 <flwang1> to use the v1.16.x and later, right? 09:59:36 <strigazi> yes 09:59:44 <strigazi> 17 in two months 09:59:48 <flwang1> same here 10:00:04 <flwang1> catalyst is on stable/stein 10:00:11 <flwang1> with several private patches 10:00:11 <strigazi> we as well 10:00:18 <flwang1> cherrypicked from master 10:00:21 <strigazi> flwang1: how many? 10:00:29 <flwang1> more than 10 10:00:37 <strigazi> 48 here 10:00:39 <flwang1> from master(train) 10:00:43 <strigazi> who wins? 10:00:48 <flwang1> you win 10:00:54 <strigazi> xD 10:00:56 <flwang1> my friend, so you're keen than me 10:01:13 <strigazi> CherryPickaaS 10:01:18 <flwang1> ok, let's wrap up this meeting 10:01:21 <brtknr> :D 10:01:41 <flwang1> anything else? 10:01:42 <brtknr> we have only started upgrading to stein 10:02:02 <strigazi> I'm good, before closing 10:02:14 <strigazi> +1 or -1 to: We need the patches I proposed, for podman and heat-agent, plus the three fixes from Theodoros: 10:02:21 <strigazi> brtknr: flwang1 ^^ 10:02:28 <strigazi> and then 9.1.0 10:02:30 <strigazi> brtknr: flwang1 ^^ 10:02:31 <flwang1> strigazi: buy me a beer 10:02:47 <strigazi> sure thing? IPA? 10:02:51 <strigazi> sure thing, IPA? 10:03:16 <flwang1> hopefully i can join the next openstack summit 10:03:22 <flwang1> do you know where is it? 10:03:24 <flwang1> after shanghai 10:03:37 <strigazi> this is a good local one https://www.magictomato.ch/fr/geneve/item_details/531-biere-pale-ale-tangente-brasserie-du-virage 10:03:56 <strigazi> flwang1: I think Vancouver, but not sure 10:04:07 <flwang1> the logo looks cool 10:04:18 <flwang1> if it's Vancouver, i want to go 10:04:27 <strigazi> brtknr: flwang1 agree with the plan for 9.1.0? 10:04:28 <flwang1> if it's EU, no go 10:04:34 <brtknr> strigazi: sounds good 10:04:36 <flwang1> strigazi: works for me 10:04:40 <flwang1> i have to go 10:04:41 <strigazi> excellent 10:04:55 <flwang1> have fun, guys 10:04:59 <strigazi> good night, get some rest 10:05:04 <flwang1> o/ 10:05:18 <brtknr> good nite! 10:05:53 <strigazi> #endmeeting