15:01:09 <mgoddard> #startmeeting kolla
15:01:19 <mgoddard> mgoddard mnasiadka hrw egonzalez yoctozepto rafaelweingartne cosmicsound osmanlicilegi bbezak parallax Fl1nt
15:01:21 <mgoddard> ^ meeting
15:01:25 <mgoddard> #topic rollcall
15:05:38 <mgoddard> #topic agenda
15:06:14 <mgoddard> #topic announcements
15:06:16 <mgoddard> #info TC & PTL election season looms
15:06:18 <mgoddard> #link http://lists.openstack.org/pipermail/openstack-discuss/2021-August/024093.html
15:06:37 <mgoddard> Nominations start on 17th August
15:06:53 <mgoddard> I will not run again for PTL
15:07:15 <priteau> You will always stay our BDFL mgoddard
15:07:33 <mgoddard> is there no escape? :)
15:07:35 <priteau> (in our hearts)
15:09:18 <yoctozepto> <3
15:09:34 <mgoddard> small plug: I will be participating in tomorrow's opendev.live session on ironic
15:09:45 <yoctozepto> yeah, I will be watching you
15:10:38 <yoctozepto> psst: let's increase the tempo a bit; we are already 10 mins into the meeting and no meat still :-)
15:11:25 <yoctozepto> (I also wonder where mnasiadka is :-) )
15:11:27 <mgoddard> #link https://www.youtube.com/channel/UCQ74G2gKXdpwZkXEsclzcrA
15:11:44 <mgoddard> #topic Review action items from the last meeting
15:11:55 <mgoddard> none
15:12:10 <mgoddard> #topic CI status
15:12:52 <yoctozepto> I updated it today for k and k-a
15:12:59 <yoctozepto> not sure about kayobe
15:13:05 <mgoddard> anything new?
15:13:28 <yoctozepto> not really, I deleted some old stuff; zun seems to be failing on ubuntu
15:13:31 <yoctozepto> but that's it
15:13:47 <yoctozepto> I wanted to discuss general issues we have listed at the top in the whiteboard
15:13:56 <yoctozepto> (this is the extra subtopic for CI status)
15:14:01 <priteau> I think Kayobe is green at the moment
15:14:25 <mgoddard> +1
15:14:30 <priteau> Are ussuri and train still red?
15:14:44 <mgoddard> unsure, probably not
15:15:10 <priteau> There are no open patches against them
15:15:25 <mgoddard> last ussuri runs look ok
15:15:25 <yoctozepto> no periodic jobs to check?
15:16:08 <priteau> No periodic
15:16:10 <mgoddard> no
15:16:20 <mgoddard> go ahead yoctozepto
15:16:48 <yoctozepto> ok
15:17:11 <yoctozepto> I have collected 5 general issues
15:17:24 <yoctozepto> i.e. mostly job-type-independent
15:17:39 <yoctozepto> the first one is SIGPIPE rc=-13 error
15:17:48 <yoctozepto> I have never got this outside of CI
15:17:53 <yoctozepto> have you?
15:18:10 <mgoddard> not that I remember
15:18:12 <yoctozepto> (for the notes please see the whiteboard; I will not be repasting them in the chat)
15:18:24 <yoctozepto> priteau and you?
15:18:35 <yoctozepto> (nobody else to ask today)
15:18:46 <mgoddard> osmanlicilegi is here
15:19:24 <yoctozepto> sorry! missed due to the same colour as priteau in my client :-)
15:19:26 <priteau> yoctozepto: I don't think I have ever seen this error
15:19:38 <mgoddard> yoctozepto: do you have some proposal for it?
15:19:49 <yoctozepto> yeah, I'm pretty sure it's something really weird going on in the CI
15:20:00 <yoctozepto> mgoddard: no, for this issue I'm just collecting others' feedback
15:20:07 <yoctozepto> not much to work on
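Note on reading rc=-13: under the POSIX convention used by Python's subprocess module, a negative return code -N means the child process was terminated by signal N, and signal 13 is SIGPIPE on Linux. A minimal sketch, assuming the code reported in CI follows that convention; the helper name describe_rc is hypothetical and not part of kolla-ansible:

```python
import signal


def describe_rc(rc: int) -> str:
    """Describe a process return code using the subprocess convention."""
    if rc < 0:
        # A negative code means "terminated by signal number -rc".
        try:
            return f"killed by {signal.Signals(-rc).name}"
        except ValueError:
            # The number does not map to a known signal on this platform.
            return f"killed by unknown signal {-rc}"
    return f"exited with status {rc}"


# On Linux, signal 13 is SIGPIPE (a write to a pipe with no reader).
print(describe_rc(-13))
```

This only decodes the number; it does not explain why the pipe was closed in CI.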
15:20:15 <yoctozepto> ok, let's move onto the next issue
15:20:25 <yoctozepto> failing pulls
15:20:31 <yoctozepto> this has a fix
15:20:34 <yoctozepto> it has been discussed
15:20:39 <yoctozepto> so just go and merge :-)
15:20:59 <osmanlicilegi> yoctozepto: I'm trying to catch up on what I've missed for a while :/
15:20:59 <yoctozepto> I happens outside of CI so it's legit and fixing is nice for end-users as well
15:21:20 <yoctozepto> osmanlicilegi: sure; the question was whether you have ever seen rc=-13 error from kolla-ansible when running it locally
15:21:35 <yoctozepto> osmanlicilegi: so it's independent of the upstream knowledge :-)
15:21:47 <yoctozepto> ^^^ it* happens
15:22:01 <osmanlicilegi> never seen this but I'll recheck
15:22:04 <yoctozepto> any thoughts on issue #2? if not, let's move on to 3.
15:22:17 <yoctozepto> osmanlicilegi: thanks, no sweat :-)
15:22:54 <yoctozepto> so for issue #2 just merge the proposal
15:23:04 <yoctozepto> and issue #3 is about weird attach behaviour
15:23:20 <yoctozepto> this has not happened to me in prod but seems more legit than issue #1
15:23:28 <priteau> yoctozepto: retry is good. Never seen it with local registry but improved CI stability will be good
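Note on retrying pulls: the actual fix is on the whiteboard and is not reproduced in this log. As a generic illustration of the retry idea only, here is a minimal sketch with exponential backoff; the names retry and flaky_pull are hypothetical, and no real registry or kolla-ansible code is involved:

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry(action: Callable[[], T], attempts: int = 3, delay: float = 1.0,
          sleep: Callable[[float], None] = time.sleep) -> T:
    """Call action until it succeeds or the attempts are exhausted."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return action()
        except Exception:
            if attempt == attempts:
                raise  # out of attempts: surface the last error
            # Wait delay, 2*delay, 4*delay, ... between attempts.
            sleep(delay * 2 ** (attempt - 1))
    raise AssertionError("unreachable")


# Stand-in for a pull that fails twice before succeeding.
calls = []


def flaky_pull() -> str:
    calls.append(1)
    if len(calls) < 3:
        raise RuntimeError("pull failed")
    return "pulled"


print(retry(flaky_pull, attempts=3, sleep=lambda seconds: None))
```

Passing sleep as a parameter keeps the sketch testable without real waiting.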
15:23:28 <yoctozepto> though it's probably an upstream bug
15:24:06 <mgoddard> did you manage to reproduce the attachment issue?
15:24:52 <yoctozepto> mgoddard: locally not; but it seems to be luck-based so it could be that one-more-try could trigger it
15:24:57 <yoctozepto> but one has to stop somewhere lol
15:25:06 <yoctozepto> it's repeatable in CI though
15:25:15 <yoctozepto> I have a proposal that can be rechecked to trigger it
15:25:21 <yoctozepto> as it only runs the affected jobs
15:25:58 <yoctozepto> well, at least I *had*
15:26:01 <mgoddard> waiting longer didn't help?
15:26:06 <yoctozepto> nope, it did not
15:26:12 <yoctozepto> it seems like some process just does not complete
15:26:16 <yoctozepto> no errors, no nothing
15:26:20 <yoctozepto> the volume gets stuck
15:26:24 <yoctozepto> only ever seen this on ubuntu
15:26:32 <yoctozepto> and the cinder backend is irrelevant
15:26:43 <yoctozepto> (always nice to be able to blame ceph but not this time, fellas)
15:27:10 <mgoddard> raise with cinder?
15:27:28 <yoctozepto> yeah, that is my proposal as well
15:27:32 <yoctozepto> just enquiring first
15:27:36 <yoctozepto> ok, thanks
15:28:02 <yoctozepto> oh, I see I switched the order a bit
15:28:16 <yoctozepto> so that was actually issue #4
15:28:21 <yoctozepto> the #3 is about logging mess
15:28:45 <yoctozepto> I guess it's obvious we need a volunteer for better logs
15:29:00 <yoctozepto> can we link to any related proposals? do any come to your minds?
15:30:19 <mgoddard> there are a few patches proposed for logging
15:30:52 <mgoddard> it depends which part of the mess you want to fix
15:31:34 <yoctozepto> well, at least the one making it hard to find issues in the log files themselves
15:31:49 <yoctozepto> please just add any links you deem valuable
15:32:24 <yoctozepto> nothing else to discuss for this particular issue
15:32:31 <yoctozepto> as for issue #5:
15:32:42 <yoctozepto> do we agree to ignore this in CI?
15:32:57 <yoctozepto> because otherwise we would have to nicely stop all the services in proper order
15:33:07 <yoctozepto> and only then start them again
15:33:14 <mgoddard> how many retries do we use?
15:33:47 <yoctozepto> that's a good question; more precisely: how many retries does placement use in this config
15:34:04 <yoctozepto> the issue is because keystone goes down during upgrade
15:34:16 <yoctozepto> and haproxy also needs to pick it back up once it is alive
15:34:29 <yoctozepto> so a few retries should help
15:34:33 <yoctozepto> perhaps it's not doing enough
15:34:35 <yoctozepto> let's check
15:34:40 <mgoddard> https://docs.openstack.org/placement/wallaby/configuration/config.html#keystone_authtoken.http_request_max_retries
15:35:20 <yoctozepto> raise to 5 in CI?
15:35:24 <yoctozepto> or 6
15:35:42 <mgoddard> could do
15:35:52 <mgoddard> or maybe we just ignore it :)
15:36:22 <yoctozepto> mgoddard: I'll try with that bump
15:36:35 <yoctozepto> I have only ever seen placement and (more rarely) neutron hit this
15:36:42 <yoctozepto> they are the most talkative, it seems
15:36:48 <yoctozepto> for whatever reason
15:36:55 <yoctozepto> could use bumping for all core services
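Note on the proposed bump: the option linked above is http_request_max_retries in the [keystone_authtoken] section. A minimal sketch of rendering such an override fragment, assuming the value 5 floated in the discussion; the helper name render_override is hypothetical, and where the fragment is applied in CI is not stated in the log:

```python
import configparser
import io


def render_override(max_retries: int) -> str:
    """Render an INI fragment that sets keystone_authtoken retries."""
    config = configparser.ConfigParser()
    config["keystone_authtoken"] = {
        "http_request_max_retries": str(max_retries),
    }
    buffer = io.StringIO()
    config.write(buffer)
    return buffer.getvalue()


# 5 is one of the values suggested in the meeting, not a tested setting.
print(render_override(5))
```

The same fragment would apply to any service that uses the keystone_authtoken section, which matches the idea of bumping it for all core services.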
15:37:01 <yoctozepto> all right
15:37:03 <yoctozepto> the plan is there
15:37:10 <yoctozepto> thank you for the fruitful discussion
15:37:16 <yoctozepto> precisely what I wanted :-)
15:38:57 <mgoddard> thanks yoctozepto for bringing it up
15:39:04 <yoctozepto> my pleasure!
15:39:07 <mgoddard> #topic Release tasks
15:39:17 <mgoddard> Finally, it is R-8
15:39:26 <yoctozepto> :O
15:39:27 <mgoddard> so according to https://docs.openstack.org/kolla/latest/contributor/release-management.html
15:39:41 <mgoddard> we must Switch binary images to current release
15:39:57 <mgoddard> would anyone like to do it?
15:40:25 <yoctozepto> "like" is a strong word :-)
15:41:09 <priteau> I can give it a try if no one else is already on it
15:41:21 <yoctozepto> I have other stuff to do as well but I guess it won't hurt me to bump; though I'm looking forward to broader participation :-)
15:41:33 <yoctozepto> oh, priteau already volunteered, good!
15:41:51 <mgoddard> #action priteau to Switch binary images to current release
15:41:53 <mgoddard> thanks priteau
15:41:57 <yoctozepto> thanks ++
15:42:10 <mgoddard> we need to think about cycle highlights soon
15:42:13 <mgoddard> but it can wait
15:42:19 <yoctozepto> yeah, next meeting
15:42:24 <yoctozepto> let's add to the agenda though
15:42:30 <yoctozepto> I will add then
15:42:31 <mgoddard> #topic Clean up old priorities from the whiteboard to get a fresher look at it
15:42:38 <yoctozepto> mine again
15:42:48 <yoctozepto> I suggest we simply clean all completed one
15:42:51 <yoctozepto> ones*
15:42:58 <yoctozepto> simple topic :-)
15:43:03 <yoctozepto> just gathering your approval
15:44:24 <mgoddard> makes sense
15:44:35 <mgoddard> the list wasn't really updated for xena
15:44:43 <yoctozepto> yup
15:44:50 <yoctozepto> ok, so I can clean this up
15:44:54 <yoctozepto> no problem
15:45:01 <yoctozepto> you can action me on it
15:45:11 <mgoddard> #action yoctozepto clean up whiteboard priorities
15:45:33 <mgoddard> CentOS Stream 9...
15:45:51 <mgoddard> we should probably check in with RDO on that one
15:46:38 <mgoddard> it's cutting it quite fine for a major upgrade
15:46:49 <yoctozepto> argh, noez
15:46:58 <yoctozepto> I forgot we are awaiting a landslide
15:47:08 <mgoddard> #action mgoddard check in with RDO re CS9
15:47:37 <mgoddard> #topic Kolla operator pain points https://etherpad.opendev.org/p/pain-point-elimination
15:48:08 <mgoddard> I don't see any new ones since last time
15:48:27 <yoctozepto> me neither
15:48:30 <mgoddard> #topic Kolla Ansible Framework and its QA https://etherpad.opendev.org/p/kolla-ansible-framework
15:48:47 <yoctozepto> so, I have done a larger writeup
15:49:12 <yoctozepto> that we should do some core-involving exercises to maintain better posture :-)
15:49:32 <yoctozepto> my goal for today is to share this with you
15:49:44 <yoctozepto> and ask you for collaboration
15:50:11 <yoctozepto> who would want to drive/discuss this with me? (even independently of our general meetings, so as not to eat up their time)
15:50:56 <yoctozepto> I see only mgoddard is lurking in the etherpad so not many ppl to ask :-)
15:51:13 <priteau> Sorry, was working on my action :P
15:51:38 <yoctozepto> priteau: no problem
15:52:29 <mgoddard> it makes sense to me to expand this model
15:53:10 <mgoddard> it could easily be used for DB setup, check-containers.yml, etc.
15:53:31 <mgoddard> config would be more work
15:53:42 <mgoddard> but could start simple with config.json
15:54:19 <yoctozepto> yeah, we have config-check in the works (on me) but config itself, especially config.json, is a nice candidate
15:54:43 <yoctozepto> so, as you figured, the work is threefold: document, refactor, test
15:55:00 <yoctozepto> but first obviously decide on the scope etc.
15:55:17 <yoctozepto> put any relevant idea somewhere in that etherpad
15:55:18 <yoctozepto> any time
15:55:35 <priteau> So it's about moving more K-A code into a framework, so there is less duplication of code between roles? So that, ultimately, services could just be defined as a dict of their config?
15:56:36 <mnasiadka> yoctozepto: missing me?
15:56:44 <yoctozepto> priteau: +/- yeah
15:56:56 <yoctozepto> but for starters to clean up what we already have and decide on the next steps
15:57:02 <yoctozepto> which ultimately could work as you described
15:57:12 <mgoddard> I suppose my question would be, how do we prioritise this against other work?
15:57:54 <yoctozepto> tough nut; I can drive most of this because it feels important to me
15:58:11 <yoctozepto> but need you for discussion and review of course
15:58:20 <yoctozepto> bring your ideas, thoughts, comments
15:58:33 <yoctozepto> mnasiadka: I'm always missing fellow cores :-)
15:58:52 <yoctozepto> *you = you all
16:00:50 <yoctozepto> all right, we are past time unfortunately :-(
16:00:57 <mgoddard> indeed
16:01:05 <mgoddard> thanks for driving discussions today yoctozepto
16:01:15 <yoctozepto> mgoddard: you are welcome; thanks for chairing
16:01:41 <mgoddard> #endmeeting