Wednesday, 2021-09-08

ianwi'll think on it and see if i can come up with something; if it's not working or the change is looking ridiculous we can move to phase 200:01
clarkbphase 2 == meetpad? that works for me. I didn't want to forget about it then it be the weekend again if others wanted to dig into it sooner. I'm happy to think on it over the weekend too00:01
ianwyep or longer threads or whatever works.  i feel like it might slot ontop of 807672 fairly logically.  but i may be wrong00:02
clarkbok sounds like a plan then. Will wait and see how it shakes out00:03
ianwalso matches my philosophy of nothing getting people more interested than an obviously incorrect change :)00:03
ianwspeaking of https://review.opendev.org/c/opendev/system-config/+/807672, i guess zuul has decided it has nothing to do there?00:10
Clark[m]Most of those jobs run post merge so ya maybe syntax checking is all we'll get?00:29
opendevreviewIan Wienand proposed opendev/base-jobs master: infra-prod: split out setup steps  https://review.opendev.org/c/opendev/base-jobs/+/80780701:37
opendevreviewIan Wienand proposed opendev/system-config master: Refactor infra-prod jobs for parallel running  https://review.opendev.org/c/opendev/system-config/+/80767201:42
opendevreviewIan Wienand proposed opendev/system-config master: infra-prod: clone source once  https://review.opendev.org/c/opendev/system-config/+/80780801:42
opendevreviewIan Wienand proposed opendev/base-jobs master: infra-prod: split out setup steps  https://review.opendev.org/c/opendev/base-jobs/+/80780701:43
opendevreviewIan Wienand proposed opendev/base-jobs master: infra-prod: split out setup steps  https://review.opendev.org/c/opendev/base-jobs/+/80780702:03
opendevreviewIan Wienand proposed opendev/base-jobs master: infra-prod: split out setup steps  https://review.opendev.org/c/opendev/base-jobs/+/80780702:37
*** ysandeep|out is now known as ysandeep05:54
*** slaweq_ is now known as slaweq07:06
*** pjakuszew7 is now known as pjakuszew07:09
*** jpena|off is now known as jpena07:38
opendevreviewSorin Sbârnea proposed zuul/zuul-jobs master: Make default tox run more strict about interpreter version  https://review.opendev.org/c/zuul/zuul-jobs/+/80770208:19
*** elodilles_pto is now known as elodilles08:44
opendevreviewgnuoy proposed openstack/project-config master: Add openstack-loadbalancer charm and interfaces  https://review.opendev.org/c/openstack/project-config/+/80783808:48
opendevreviewMerged openstack/project-config master: Update Neutron's Grafana dashboard  https://review.opendev.org/c/openstack/project-config/+/80580909:09
opendevreviewSlawek Kaplonski proposed opendev/irc-meetings master: Remove Neutron L3 subteam meeting  https://review.opendev.org/c/opendev/irc-meetings/+/80784309:30
opendevreviewgnuoy proposed openstack/project-config master: Add openstack-loadbalancer charm and interfaces  https://review.opendev.org/c/openstack/project-config/+/80783809:33
opendevreviewMerged opendev/irc-meetings master: Move Keystone meeting to 1500 UTC  https://review.opendev.org/c/opendev/irc-meetings/+/80772909:51
opendevreviewMerged opendev/irc-meetings master: Remove Neutron L3 subteam meeting  https://review.opendev.org/c/opendev/irc-meetings/+/80784309:54
opendevreviewMerged opendev/irc-meetings master: Move openstack-helm meeting to #openstack-helm  https://review.opendev.org/c/opendev/irc-meetings/+/80509409:55
*** jpena is now known as jpena|lunch11:30
*** jpena|lunch is now known as jpena12:34
opendevreviewSorin Sbârnea proposed zuul/zuul-jobs master: ensure-tox: avoid using a reliable version of tox  https://review.opendev.org/c/zuul/zuul-jobs/+/80786712:52
opendevreviewTristan Cacqueray proposed opendev/statusbot master: Introduce a BackendInterface  https://review.opendev.org/c/opendev/statusbot/+/80787113:14
opendevreviewSorin Sbârnea proposed zuul/zuul-jobs master: ensure-tox: avoid using a reliable version of tox  https://review.opendev.org/c/zuul/zuul-jobs/+/80786713:25
opendevreviewSorin Sbârnea proposed zuul/zuul-jobs master: Fix historical ansible-lint errors  https://review.opendev.org/c/zuul/zuul-jobs/+/80787713:43
opendevreviewArtem Goncharov proposed zuul/zuul-jobs master: [DNM] Test upload_logs_s3 role  https://review.opendev.org/c/zuul/zuul-jobs/+/80713213:54
*** diablo_rojo is now known as Guest669514:25
*** Guest6695 is now known as diablo_rojo14:46
opendevreviewMichal Nasiadka proposed opendev/irc-meetings master: kolla: update chair  https://review.opendev.org/c/opendev/irc-meetings/+/80789214:57
*** tosky is now known as Guest670215:22
*** tosky_ is now known as tosky15:22
opendevreviewArtem Goncharov proposed zuul/zuul-jobs master: Fix upload_logs_s3 role  https://review.opendev.org/c/zuul/zuul-jobs/+/80713215:35
opendevreviewArtem Goncharov proposed zuul/zuul-jobs master: Fix zuul-jobs-test-upload-logs-s3 job  https://review.opendev.org/c/zuul/zuul-jobs/+/80713215:36
clarkbI forwarded the most recent certcheck warning about the linaro cert to kevinz15:44
opendevreviewMerged opendev/irc-meetings master: kolla: update chair  https://review.opendev.org/c/opendev/irc-meetings/+/80789215:52
*** marios is now known as marios|out15:58
opendevreviewArtem Goncharov proposed zuul/zuul-jobs master: Revert "Revert "Update upload-logs roles to support endpoint override""  https://review.opendev.org/c/zuul/zuul-jobs/+/77667716:38
opendevreviewArtem Goncharov proposed zuul/zuul-jobs master: Revert "Revert "Update upload-logs roles to support endpoint override""  https://review.opendev.org/c/zuul/zuul-jobs/+/77667716:38
opendevreviewTristan Cacqueray proposed opendev/statusbot master: Introduce a BackendInterface  https://review.opendev.org/c/opendev/statusbot/+/80787116:42
*** jpena is now known as jpena|off17:09
*** ysandeep is now known as ysandeep|out19:24
clarkbI've discovered that you can explicitly subscribe to github releases on repos so I have done that for etherpad and github19:26
clarkber etherpad and gitea on github19:27
fungioh neat19:29
fungii didn't realize that was possible19:29
clarkbits an option in the watch drop down menu19:29
clarkbyou select custom and then in custom can select releases19:29
funginoted19:30
clarkbI'm hoping that helps us stay on top of those tools and get releases pushed out more frequently19:32
corvusthe zuul bugfix change has merged; i'd like to restart all of zuul soon.  did the gerrit stuff happen already?19:34
fungithe gerrit server migration has not happened yet19:34
fungimnaser: ^ can we coordinate a gerrit server migration and zuul upgrade/restart soonish?19:35
fungithat way we minimize lost events from gerrit19:35
fungii guess the order would be shut down gerrit and start server migration, then restart all of zuul onto new code, then start gerrit again?19:36
clarkbfungi: ++ I was just going to say similar19:36
clarkbI can help though lunch should be here shortly19:37
funginow that config is persisted in zk, we don't have to worry about cat jobs being unable to clone from gerrit, right?19:37
corvusfungi: i think we still get the branch list from gerrit, so best to start zuul after gerrit is up19:37
corvusso: save zuul queue, down gerrit, down zuul, up gerrit, up zuul, restore queue?19:38
clarkbcorvus: sounds good to me19:41
clarkbnow we just need mnaser to do the migration after down gerrit and before up gerrit19:41
corvusi await a ping :)19:41
clarkbI do have a conflict for about 45 minutes at 22:20 UTC. Hopefully we can be done by then19:44
fungiyep, that sounds like a perfect plan19:44
fungii'm basically around, just doing yardwork (with frequent breaks because it's hot, muggy and there's no breeze)19:44
corvusbeing next to the sea with no seabreeze seems like a weather malfunction19:45
fungiit only does this when i want to knock out the yardwork. nature is conspiring to return this to a natural habitat19:56
fungi(which i'm trying to encourage, within the limit allowed by the town's codes, at least until i find time to lobby to change them)19:57
fungiweather like this almost puts me in the mood for civil disobedience though19:58
*** timburke__ is now known as timburke20:04
*** elodilles is now known as elodilles_pto20:12
opendevreviewTristan Cacqueray proposed opendev/statusbot master: Introduce a BackendInterface  https://review.opendev.org/c/opendev/statusbot/+/80787120:38
opendevreviewTristan Cacqueray proposed opendev/statusbot master: Add Etherpad backend  https://review.opendev.org/c/opendev/statusbot/+/80794620:38
opendevreviewTristan Cacqueray proposed opendev/statusbot master: Add use_ssl option  https://review.opendev.org/c/opendev/statusbot/+/80794720:38
opendevreviewTristan Cacqueray proposed opendev/statusbot master: Handle exception for unprivileged commands  https://review.opendev.org/c/opendev/statusbot/+/80794820:38
mnaserclarkb, fungi, corvus: i am doing the 'initial' online migrate now, i will ping when im ready to 'punt' the system over20:43
mnaserfrom then it should be a brief reboot20:43
clarkbsounds good20:43
clarkbI'll go warn the openstack release team20:44
fungithanks! i'm on hand to help20:44
fungijust heating up some leftovers20:45
clarkber I math'd poorly my conflict is at 21:20 for about 45 minutes (have to go get the kids from school)20:49
fungidon't sweat it, i'm here to stop/restart things20:50
corvusi'm also around20:54
clarkbI need to convince the school to give us the schedule in UTC20:55
fungihah20:55
fungialso convince them to teach utc20:55
fungithe abbreviation alone is amusing, since it was chosen to be neither the english (cut) nor french (tuc) abbreviation, so it's actually nonsense for the sake of fairness20:56
corvuswait is the english cut or uct?20:58
clarkbrecently I watched a tom scott video on how officially denmark's official time is up to like .11 seconds off from utc due to an old law they haven't changed that based the time on a the 15 degree meridian rather than utc. But the country happily syncs to utc clocks and ignores the problem :)20:58
clarkbcorvus: "Coordinated Univeral Time" iirc20:58
corvushuh.  well.  hrm.  i'd like to propose a third option!  :)20:59
fungicorvus: utc is "coordinated universal time" in english and "temps universel coordonné" in french20:59
fungiyou clearly never paid attention to the time announcements on the bbc world service shortwave broadcasts after they switched off gmt ;)21:00
corvusno; wcpe did carry the world news which had them back in the gmt days tho :)21:01
corvusi have run the zull_pull playbook after confirming the lastest image build jobs succeeded21:02
corvuswow we really should call it the zull_pull playbook.  so much sense.21:03
mnaserclarkb, fungi, corvus: i'm ready if/when you are21:03
fungithanks, i'll do a quick status notice and then stop gerrit21:03
clarkbcorvus: ^ I'll let you drive since I have to pop out soon if you want to grab queues and do the zuul side?21:03
corvusi have saved zuul queues.  so fungi has the next call.21:03
corvusclarkb: ack21:04
clarkbsoudns good. I'm around if I can help but will let you two drive21:04
fungi#status notice The Gerrit service on review.opendev.org is going offline momentarily for a host migration and zuul upgrade, downtime should be only a few minutes.21:04
opendevstatusfungi: sending notice21:04
-opendevstatus- NOTICE: The Gerrit service on review.opendev.org is going offline momentarily for a host migration and zuul upgrade, downtime should be only a few minutes.21:04
fungii downed the containers on review.o.o and then powered off the server instance gracefully from the operating system for good measure21:05
fungimnaser: you should be clear to trigger final migration21:05
mnaserok, starting21:05
fungicorvus: stop zuul containers at will21:05
corvusstopping zuul21:05
mnaserfungi: i see it pingable again21:07
fungithanks, checking21:08
fungigerrit container is starting now21:08
fungilooks like the gerrit service is online again21:09
corvusi'll start zuul21:09
fungicorvus: you should be clear to start zuul services again if they've finished stopping21:09
fungiperfect21:09
mnaserwee, that was smooth (at least for me :p)21:09
fungithanks for working with us on it!21:10
corvus++21:10
clarkbyup thank you for being available to do it when it was good for us21:12
corvusmnaser: just happened to notice this in the logs in case you're unaware: https://zuul.opendev.org/t/vexxhost/config-errors21:13
corvustenants loaded; re-enqueueing21:19
fungiawesome21:21
corvusre-enqueue complete21:25
fungithanks corvus!21:28
corvusi don't see any new exceptions in the log21:28
fungii'll consider that great news21:29
corvus(i do see one harmless one about unlocking nodesets for canceled jobs, but it's a pre-existing bug, and i suspect the next set of patches will address it)21:29
corvusso i think next step is just keep an eye out for unusual behavior.  i'll check back in a bit and look for stuck jobs, etc.21:29
fungiwill do21:35
corvus#status log restarted all of zuul on commit 04678e25e666c5e97b76e68838a0ce1cf076114421:52
opendevstatuscorvus: finished logging21:52
Clark[m]I'm taking this as an opportunity for system updates locally. But will follow-up with the openstack release team if that hasn't happened already22:04
clarkbI've notified the openstack release team that the work is odne and things are happy22:25
fungithanks22:30
clarkbianw: if you'd like me to switch diskimage-builder-core group ownership in gerrit from infra-ptl to diskimage-builder-core to make it self owned I can do that now22:30
clarkband I can do it without escalating privs as I'm the current owner22:30
clarkbI'm happy for that group to continue to self organize as it has largely done so for the last couple of years at least22:31
ianwclarkb: i think so, thank you.  we'll add stevebaker and TheJulia and possibly contact old members to see if they want to be removed22:31
fungisounds great to me. the more reviewers there the better22:32
clarkbthat is done now22:32
ianwthank you22:32
clarkblooking at the membership I suspect many of those would prefer to be removed since they havne't done openstacky things or even cloudy things in a long time22:33
ianwindeed, it's a trip down memory lane, that list22:34
stevebakerclarkb, ianw: awesome, thanks. dib is a dependency on an unusual collection of projects, this will help a lot22:35
ianwclarkb: i think the idea of having a single job checkout system-config works and slots in fairly easily; https://review.opendev.org/q/topic:%22s-c-prod-parallel%22+(status:open%20OR%20status:merged) is the collection22:35
clarkbianw: thanks I've been all over the place today. Let me take a look22:36
ianwno rush, it is one you want to read after a coffee :)22:36
clarkbianw: the first thing I notice is the order of the changes seems backwards to me22:36
clarkbor maybe the end result here gets squashed together?22:37
clarkband the third change is largely independnet and can land today? I'll start with that one I guess22:39
ianwif it matches what's in my head; we add new jobs to opendev/base-jobs, but keep the existing base job as is by including both playbooks22:39
ianwthen system-config modifies itself to run the code-checkout job first, and to parent it's base job to the one that only setups the executor login22:40
ianwand yeah, the other one that does the initial split up can be separate; although note it still keeps the semaphore at this point to keep things running in order just while we confirm it doesn't go crazy22:41
clarkb++ I think changing the semaphores is the very last thing we want to do22:41
clarkbianw: left a thought on 807080722:47
clarkber 80780722:47
ianwhrm, yes, it probably should do that "is this periodic i'll use master bit"22:49
ianwit might be clearer to do that in a follow-on?22:49
clarkbya thats fine too as long as we land that before we try to use the new jobs22:52
clarkbbecause that will cause updates in all the periodic jobs that we don't want if we don't clean up the existing stuff for that22:52
ianwclarkb: urgh, now i think about it 807672 needs up update periodic-hourly and periodic, doesn't it ...23:00
clarkbianw: yup we have to do everything all at once23:02
clarkbthis is part of what makes it complicated23:02
ianwi think i'll have to move the dependencies into the job definitions in infra-prod.yaml23:03
ianwthat is indeed the more logical place for them.  it just makes it quite difficult to see what the relationships are at a glance23:04
clarkbbut also I think keeping the semaphores doing one job at a time is a good sanity checking when we do a big cutover to the new job dep listings and all that23:08
clarkbthen once we're happy with that state we can update just the semaphores so that it enforces one buildset at a time and some upper cap of jobs in a buildset (to not overwhelm bridge). That will require two semaphores23:08
clarkbone that a parent paused job holds (I called this the anchor job in previous discussion) and then a semaphore with say a limt of 5 to start that each other job holds23:09
fungiyeah, i like the idea of ratcheting up the semaphore allowance gradually in an effort not to swamp the bastion23:10
fungiwe've certainly seen it get overwhelmed with ansible processes before23:10

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!