09:01:09 #startmeeting magnum 09:01:10 Meeting started Wed Oct 23 09:01:09 2019 UTC and is due to finish in 60 minutes. The chair is flwang1. Information about MeetBot at http://wiki.debian.org/MeetBot. 09:01:11 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 09:01:13 The meeting name has been set to 'magnum' 09:01:21 #topic roll call 09:01:59 o/ 09:02:13 o/ 09:02:43 o/ 09:03:56 let's stop typing on etherpad for now 09:04:02 and focus here? 09:04:16 #topic updates 09:04:28 as i mentioned in the ehterpad 09:04:43 the ignition patch has been merged and cherrypicked to train 09:05:11 we may need a tiny patch to drop the copy local-data to cfn-init-data 09:06:01 strigazi: brtknr: ^ 09:06:26 I don't know if we need it 09:06:42 strigazi: brtknr: i think we should focus on irc 09:06:46 strigazi: why? 09:06:55 flwang1: why we need it 09:06:56 ? 09:07:08 if its copying, it will behave the same as before? 09:07:26 does the heat patch prioritise local-data? 09:07:49 strigazi: see https://github.com/openstack/magnum/blob/master/magnum/drivers/k8s_fedora_coreos_v1/templates/user_data.json#L80 09:08:09 you added ExecStartPre=-mv /var/lib/os-collect-config/local-data /var/lib/cloud/data/cfn-init-data 09:08:23 not copy, it's a mv 09:08:36 doh! its mv /var/lib/os-collect-config/local-data /var/lib/cloud/data/cfn-init-data, not cp! 09:08:50 either we should change it to cp.... 09:08:59 we should drop it 09:09:00 because heat will have already cut a release 09:09:17 when magnum cuts a release, it will break compatibility 09:09:22 ah, i see your point 09:09:30 that's shit 09:09:48 yeah :( 09:10:21 anyway, we can keep it maybe 09:10:34 thats assuming heat has cut a release 09:10:35 ExecStartPre=- means fail safely 09:10:39 we should check with them 09:10:51 strigazi: that's good 09:10:56 why we need it? 09:11:12 strigazi: you're confusing me again, why we need what? 09:11:18 I did it this way because I was expecting this 09:11:32 We don't need to drop the mv 09:11:51 because it's like a patching 09:11:58 not a solution 09:12:34 ;et 09:12:51 but as brtknr said, heat have already released a version, so we can keep it 09:13:28 don't you think it's ugly to have a mv like this? 09:13:28 we can drop it, the agent will work unless someone has a broken heat. we can keep it, the agent will work with a fixed or broken heat 09:13:48 yep, that's why i think now we can live with that 09:14:09 after I realized that heat have already released 09:14:19 clear now? 09:15:02 yes. That's why I said, "why we need it?" 09:15:27 move on, we keep it. 09:15:27 ok, fail enough 09:15:30 lol... 09:16:29 flwang1: let's move on? 09:16:44 as for fa29 support, personally i would say support it until we think fa30 is stable 09:17:15 #topic FA29 support 09:17:26 #topic fa29 support 09:17:32 :( I dont have the wizardry 09:17:53 brtknr: i can sell it to you if you can buy me a beer 09:18:11 seems like a good trade 09:18:27 strigazi: how do you think? 09:18:35 flwang1: I am happy with that proposal 09:19:03 or when FA31 enters development? 09:20:06 brtknr: either works i think, and i think it won't take too long, because the coreos driver is sharing most of the atomic driver 09:20:44 works for me 09:21:52 next one? 09:21:54 I would prefer is use_podman=true 09:22:18 brtknr: no, default behaviour is better to not change. 09:22:19 is that doable? or would it cause issues for upgrade? 09:22:30 okay fine 09:22:55 user creates a cluster in stein with a CT. In train they should get the same. makes sense? 09:22:56 strigazi: i would like to understand why didn't you introduce this label at the first time? 09:23:12 flwang1: because I got complains later :) 09:23:26 personally, like this 09:23:58 as i mentioned in the comments, it gives us more time to get the confidence to migrate to fedora coreos 09:24:42 sure, I'm happy with it 09:24:47 the only thing i'd like to propose is, just skip this label for fedora coreos 09:24:49 but needed some convincing 09:25:20 maybe we should drop podman support for atomic altogether? 09:25:30 so its a clean separation 09:25:43 it's a bit late at this moment, i would say 09:26:07 its a lot of extra code for a feature with a short shelf life 09:26:20 we can have a separate, fully isolated code, as long as we deprecate the fedora atomic driver 09:26:57 are you suggesting only for master? 09:26:58 we just need to drop the label... default to using podman for coreos and atomic for fa29 09:27:09 with the same driver 09:27:21 give me a moment 09:28:17 I didn't mention this in the commit message unfortunately, but i mentioned it in a meeting. 09:28:22 so USE_PODMAN is an internal label rather than something that the user specifies 09:28:29 brtknr: please waot 09:28:31 brtknr: please wait 09:28:58 kubelet in an atomic container, doesn't work 09:29:10 that is why we have podman in the atomic driver. 09:29:29 iirc, that's for v1.16.x, right? 09:29:54 If you don't want to have fedora atomic and and >= 1.16 we can drop it 09:30:32 brtknr: flwang1 what do you prefer? 09:30:49 strigazi: i'm ok for have it for fedora atomic driver 09:30:54 brtknr: ? 09:31:07 you have some concerns 09:31:19 but i prefer to use true for fedora coreos 09:31:28 instead of letting user to set it for fedora coreos 09:31:42 flwang1: https://review.opendev.org/#/c/690053/2/magnum/drivers/k8s_fedora_coreos_v1/templates/kubecluster.yaml@775 09:32:21 flwang1: https://review.opendev.org/#/c/690053/2/doc/source/user/index.rst@1385 09:32:43 hmm let me thing 09:34:03 i think by the time we start deploying train release to customer sites, which is at least 3 months away, fcos30 should be stable 09:34:28 strigazi: what happened if i set use_podman=false for driver=fedora_coreos? 09:34:33 we dont need the use_podman label, if flwang1 needs it, lets feep it 09:34:40 we also have users, so don't forget us too :) 09:34:54 flwang1: brtknr creation will fail 09:34:56 with: 09:35:24 strigazi: yes you too! if your users need it! 09:35:35 yep, but i don't want to see that kind of "surprise" 09:35:55 wait 09:36:05 flwang1: | status_reason | ERROR: Parameter 'use_podman' is invalid: "false" is not an allowed value [true] | 09:36:27 it is pretty clear and prertty fast 09:36:49 http://paste.openstack.org/show/785502/ 09:36:51 unless i set use_podman=false for a fedora_coreos template 09:36:58 ok, i'm good then 09:37:55 brtknr: thoughts? 09:38:21 is it better to call it podman_enabled to keep with the convention 09:38:48 brtknr: good point? 09:38:53 good point 09:38:54 brtknr: anything else than naming? 09:39:01 nope 09:39:08 are you sure? 09:39:13 hah 09:39:23 strigazi: don't be pushy for our lovely brtknr 09:39:27 i am outnumberd 09:40:02 I mean, I might miss something 09:40:17 we need to document it 09:40:31 https://review.opendev.org/#/c/690053/2/doc/source/user/index.rst 09:40:51 explain in the doc that we might need use_podman in atomic to support 1.16.x+ 09:40:59 explain in the doc that we need use_podman in atomic to support 1.16.x+ 09:41:24 i will add this to review 09:42:04 i need to look at the code again 09:42:15 i was off yesterday so havent had a chance to review 09:43:13 let's move? 09:43:19 we have 17 mins left 09:43:42 heat-agent: Check if scripts exists https://review.opendev.org/#/c/689704/ 09:43:47 i have +2ed 09:44:48 I will review today 09:45:08 But happy with it in principle 09:45:48 Should address the weird behaviour during reboot in heat-contaner-agent log 09:46:01 brtknr: yep 09:46:07 i notice that recently as well 09:46:29 weird behaviour during reboot in heat-contaner-agent log 09:46:35 please elaborate 09:47:28 my case is not reboot, i saw heat-container-agent stuck several times 09:47:30 strigazi: once the heat-container-agent finishes its thing, it starts writing: Oct 22 20:48:26 k8s-dlgmc7wmwxlo-master-0.novalocal podman[1492]: /var/lib/os-collect-config/local-data not found. Skipping to journald 09:47:33 not useful log 09:47:35 no 09:47:47 but after reboot, it would complain about a missing file 09:47:50 brtknr: I mean about reboot 09:47:57 ok 09:48:14 just rebooting the node in the dev environment now 09:48:15 which file? I think my patch fixes it 09:48:42 strigazi: one sec, just waiting for the reboot to complete 09:50:01 https://github.com/openstack/magnum/commit/3674b3617a770bd71d09e23137ff96f90eb1241a My mistake, it wasn't verbose enough. Using the atomic cli to install kubelet breaks mount 09:50:06 propagation of secrets, configmaps and so on. Using podman 09:50:10 in a systemd unit works. 09:50:15 strigazi: why do you strongly dislike PODMAN_ENABLED btw? 09:50:31 i am not that fussed 09:50:59 it doesn't enable anything. But maybe from the user/ops pov it is better 09:52:29 logs? 09:52:30 strigazi: that's a valid point 09:52:42 generally, we use XXX_enabled only for addons 09:52:45 look at the end: https://seashells.io/p/WseYPMUE 09:52:47 not for this kind of case 09:53:23 lets leave it as USE_PODMAN then 09:53:44 brtknr: after the stop? 09:54:07 brtknr: which line? 09:54:16 Oct 23 09:50:29 09:54:41 brtknr: my patch fixes this 09:54:44 strigazi: brtknr: can we discuss details offline? 09:54:56 strigazi: yes, i already ack this 09:55:07 flwang1: 09:55:09 flwang1: ok 09:55:17 sorry, guys 09:55:20 anything last comments? 09:55:28 it's late here 09:55:45 okay, there's the issue with octavia and master lb fip 09:55:47 oh, have a look at the three patches from Theodoros I added in the etherpad 09:55:50 its working for flwang 09:55:54 not for me 09:56:10 could you please test this strigazi 09:56:15 strigazi: i will leave that for brtknr 09:57:41 one final thing from me 09:58:30 We need the patches I proposed, for podman and heat-agent, plus the three fixes from Theodoros: 09:58:35 https://review.opendev.org/#/c/687879/ 09:58:35 https://review.opendev.org/#/c/688400/ 09:58:35 https://review.opendev.org/#/c/688346/ 09:58:47 in the train branch 09:58:53 CERN is trying to upgrade to train ? 09:58:55 and release 9.1.0 09:59:00 yes 09:59:06 ok, i see then 09:59:28 to use the v1.16.x and later, right? 09:59:36 yes 09:59:44 17 in two months 09:59:48 same here 10:00:04 catalyst is on stable/stein 10:00:11 with several private patches 10:00:11 we as well 10:00:18 cherrypicked from master 10:00:21 flwang1: how many? 10:00:29 more than 10 10:00:37 48 here 10:00:39 from master(train) 10:00:43 who wins? 10:00:48 you win 10:00:54 xD 10:00:56 my friend, so you're keen than me 10:01:13 CherryPickaaS 10:01:18 ok, let's wrap up this meeting 10:01:21 :D 10:01:41 anything else? 10:01:42 we have only started upgrading to stein 10:02:02 I'm good, before closing 10:02:14 +1 or -1 to: We need the patches I proposed, for podman and heat-agent, plus the three fixes from Theodoros: 10:02:21 brtknr: flwang1 ^^ 10:02:28 and then 9.1.0 10:02:30 brtknr: flwang1 ^^ 10:02:31 strigazi: buy me a beer 10:02:47 sure thing? IPA? 10:02:51 sure thing, IPA? 10:03:16 hopefully i can join the next openstack summit 10:03:22 do you know where is it? 10:03:24 after shanghai 10:03:37 this is a good local one https://www.magictomato.ch/fr/geneve/item_details/531-biere-pale-ale-tangente-brasserie-du-virage 10:03:56 flwang1: I think Vancouver, but not sure 10:04:07 the logo looks cool 10:04:18 if it's Vancouver, i want to go 10:04:27 brtknr: flwang1 agree with the plan for 9.1.0? 10:04:28 if it's EU, no go 10:04:34 strigazi: sounds good 10:04:36 strigazi: works for me 10:04:40 i have to go 10:04:41 excellent 10:04:55 have fun, guys 10:04:59 good night, get some rest 10:05:04 o/ 10:05:18 good nite! 10:05:53 #endmeeting