Tuesday, 2020-04-21

mordredianw: cool!00:02
ianwexcept it failed to build, but we'll get there :)00:02
*** DSpider has quit IRC00:03
mordredone step at a time ;)00:03
openstackgerritMohammed Naser proposed zuul/zuul-jobs master: helm-template: enable using values file  https://review.opendev.org/72136500:11
openstackgerritMohammed Naser proposed zuul/zuul-jobs master: helm-template: allow users to disable wait-for-pods  https://review.opendev.org/72136900:11
ianwit built against focal00:15
*** factor has joined #opendev00:20
ianwyay, it seems to be working in a manual test00:34
fungiyay!00:34
openstackgerritMohammed Naser proposed zuul/zuul-jobs master: helm-template: enable using values file  https://review.opendev.org/72136500:35
ianwnope, spoke too soon ... it gets further though :/00:35
*** dmsimard1 has joined #opendev00:50
*** dmsimard has quit IRC00:51
*** dmsimard1 is now known as dmsimard00:52
ianwok it works with a combination of pull request 26 + 27 ... sigh i don't want to think about it too much00:57
*** mlavalle has quit IRC01:03
*** factor has quit IRC02:49
*** factor has joined #opendev03:15
openstackgerritIan Wienand proposed openstack/diskimage-builder master: [wip] switch func tests to containers  https://review.opendev.org/72151103:18
openstackgerritIan Wienand proposed openstack/diskimage-builder master: [wip] switch func tests to containers  https://review.opendev.org/72151103:57
*** ysandeep|afk is now known as ysandeep04:20
*** olaph has quit IRC04:22
*** ykarel|away is now known as ykarel04:45
*** ykarel is now known as ykarel|afk05:16
*** sgw has quit IRC05:22
AJaegersgw, config-core, FYI, the docs jobs now fail if no HTML is generated, see https://review.opendev.org/721221 - and that's what sgw run into. That's on purpose, we had a few broken repos that failed in publishing05:32
*** ykarel|afk is now known as ykarel05:33
*** sgw has joined #opendev05:40
fricklercorvus: the deploy on meetpad failed https://zuul.opendev.org/t/openstack/build/6b06e2b3356a4422a8ed2b88aa9b9074 , the log on bridge points to some ensure-pip issue05:50
openstackgerritOpenStack Proposal Bot proposed openstack/project-config master: Normalize projects.yaml  https://review.opendev.org/72152706:06
*** dpawlik has joined #opendev06:08
openstackgerritMerged openstack/project-config master: Normalize projects.yaml  https://review.opendev.org/72152706:37
*** rpittau|afk is now known as rpittau07:03
*** DSpider has joined #opendev07:09
*** olaph has joined #opendev07:20
*** olaph has quit IRC07:29
*** tosky has joined #opendev07:44
*** ralonsoh has joined #opendev07:57
*** ysandeep is now known as ysandeep|lunch08:17
openstackgerritPaul Albertella proposed zuul/zuul-jobs master: Add Bazel build and install roles  https://review.opendev.org/69351309:08
*** ykarel is now known as ykarel|lunch09:14
openstackgerritPaul Albertella proposed zuul/zuul-jobs master: Add Bazel build and install roles  https://review.opendev.org/69351309:26
*** ysandeep|lunch is now known as ysandeep09:39
*** sshnaidm|afk is now known as sshnaidm09:54
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Use cached 'tox_executable' in fetch-tox-output  https://review.opendev.org/72119210:01
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Use cached 'tox_executable' in fetch-tox-output  https://review.opendev.org/72119210:06
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Use cached 'tox_executable' in fetch-tox-output  https://review.opendev.org/72119210:10
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Use cached 'tox_executable' in fetch-tox-output  https://review.opendev.org/72119210:11
openstackgerritMaciej Józefczyk proposed openstack/project-config master: Retire openstack/networking-ovn repository  https://review.opendev.org/72156510:18
*** ykarel|lunch is now known as ykarel10:20
*** rpittau is now known as rpittau|bbl10:31
openstackgerritSorin Sbarnea proposed zuul/zuul-jobs master: fetch-sphinx-tarball: install bzip2  https://review.opendev.org/72157110:42
hrwheh. forgot to try to catch ianw during his afternoon.10:44
hrwianw: can you look what is missing to get centos8/aarch64 on zuul?10:44
openstackgerritMaciej Józefczyk proposed openstack/project-config master: Retire openstack/networking-ovn repository  https://review.opendev.org/72156510:55
*** ysandeep is now known as ysandeep|afk11:01
openstackgerritJan Zerebecki proposed openstack/diskimage-builder master: Retry git clone/fetch on timeout  https://review.opendev.org/72158111:29
*** ysandeep|afk is now known as ysandeep11:32
openstackgerritSorin Sbarnea proposed zuul/zuul-jobs master: WIP: Add testing of fetch-sphinx-tarball role  https://review.opendev.org/72158411:36
*** hashar has joined #opendev11:52
openstackgerritJan Zerebecki proposed openstack/diskimage-builder master: Retry zypper when refresh failed  https://review.opendev.org/72158711:53
openstackgerritJan Zerebecki proposed openstack/diskimage-builder master: Retry zypper when refresh failed  https://review.opendev.org/72158712:01
openstackgerritPaul Albertella proposed zuul/zuul-jobs master: Add Bazel build and ensure roles  https://review.opendev.org/69351312:11
openstackgerritSorin Sbarnea proposed zuul/zuul-jobs master: WIP: Add testing of fetch-sphinx-tarball role  https://review.opendev.org/72158412:12
openstackgerritSorin Sbarnea proposed zuul/zuul-jobs master: WIP: Add testing of fetch-sphinx-tarball role  https://review.opendev.org/72158412:13
openstackgerritJan Zerebecki proposed openstack/diskimage-builder master: Retry git clone/fetch on timeout  https://review.opendev.org/72158112:18
openstackgerritSorin Sbarnea proposed zuul/zuul-jobs master: Make linting use of find portable  https://review.opendev.org/72159512:28
*** rpittau|bbl is now known as rpittau12:31
openstackgerritSorin Sbarnea proposed zuul/zuul-jobs master: WIP: Add testing of fetch-sphinx-tarball role  https://review.opendev.org/72158412:33
ttxcorvus, fungi: more on my refs/changes cleaning adventures. Apparently it is the enormous number of refs in refs/changes that prevents me from cleaning them up. It's very likely that the per-repo mirroring would just fail for anything more active than release-test. So cleaning up changes before we switch to per-repo replication is a must-do rather than a nice-to-have. I found a way to do it in smaller12:49
ttxchunks, which hopefully should work even on larger repos. It's pretty slow but should work. Testing now.12:49
ttx(Basically github fails when you do a push that results in deleting thousands of references)12:51
corvusttx: ack12:51
corvusfrickler: that's really weird, i wonder what's different about that in prod vs test?12:52
ttxIt's going to be very long, but the only alternative would be to continue to explicitly push refs/changes in the git replication zuul job.12:53
corvusfrickler: the 'pip' ansible module is failing with 'ImportError: No module named pkg_resources' it's running under /usr/bin/python (python2)12:54
corvusttx: zuul doesn't have all the refs changes, so that would make the replication job take a long time12:54
ttxok so it's probably the only way to do it. Running a larger tset now12:55
ttxtest*12:55
corvusttx: ++12:55
openstackgerritJan Zerebecki proposed openstack/diskimage-builder master: Retry zypper when refresh failed  https://review.opendev.org/72158712:57
openstackgerritJan Zerebecki proposed openstack/diskimage-builder master: Retry git clone/fetch on timeout  https://review.opendev.org/72158112:59
*** ykarel is now known as ykarel|afk13:08
openstackgerritSorin Sbarnea proposed zuul/zuul-jobs master: Add testing of fetch-sphinx-tarball role  https://review.opendev.org/72158413:10
openstackgerritJan Zerebecki proposed openstack/diskimage-builder master: Retry zypper when refresh failed  https://review.opendev.org/72158713:15
openstackgerritJan Zerebecki proposed openstack/diskimage-builder master: Retry git clone/fetch on timeout  https://review.opendev.org/72158113:17
*** mlavalle has joined #opendev13:58
openstackgerritMonty Taylor proposed opendev/system-config master: Move cloud-init removal to its own playbook  https://review.opendev.org/72110614:03
openstackgerritMonty Taylor proposed opendev/system-config master: Just move cloud-init removal into base-server  https://review.opendev.org/72110714:03
openstackgerritMonty Taylor proposed opendev/system-config master: Remove some extra bits from site.pp  https://review.opendev.org/72110114:03
openstackgerritMonty Taylor proposed opendev/system-config master: Split codesearch into its own playbook  https://review.opendev.org/72110214:03
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109814:03
mordredclarkb, corvus: ^^ that last one goes ahead and docker/ansiblizes accessbot14:08
mordredit also splits running of accessbot from installing it - so we can just not run the run playbook in the gate14:08
*** roman_g has joined #opendev14:10
openstackgerritMonty Taylor proposed openstack/project-config master: Run accessbot script on accessbot channels update  https://review.opendev.org/72162714:11
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109814:13
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109814:14
*** ykarel|afk is now known as ykarel14:22
openstackgerritMonty Taylor proposed opendev/system-config master: Move in-tree hiera settings to ansible vars  https://review.opendev.org/72162914:26
openstackgerritMonty Taylor proposed opendev/system-config master: Move in-tree hiera settings to ansible vars  https://review.opendev.org/72162914:37
openstackgerritMerged zuul/zuul-jobs master: Use main.yaml, not .yml  https://review.opendev.org/72124514:42
openstackgerritMonty Taylor proposed opendev/system-config master: Add new etherpad to cacti  https://review.opendev.org/72163314:44
openstackgerritSorin Sbarnea proposed openstack/project-config master: Enable promote to unarchive gz archives in addition to bz2  https://review.opendev.org/72165215:10
*** dzho has joined #opendev15:18
*** ysandeep is now known as ysandeep|away15:40
*** ykarel is now known as ykarel|away15:40
openstackgerritMonty Taylor proposed opendev/system-config master: Split codesearch into its own playbook  https://review.opendev.org/72110215:44
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109815:44
openstackgerritMonty Taylor proposed opendev/system-config master: Move in-tree hiera settings to ansible vars  https://review.opendev.org/72162915:44
openstackgerritMonty Taylor proposed opendev/system-config master: Add new etherpad to cacti  https://review.opendev.org/72163315:44
openstackgerritMonty Taylor proposed opendev/system-config master: Stop cloning a bunch of puppet modules we don't use  https://review.opendev.org/72089215:44
openstackgerritMonty Taylor proposed opendev/system-config master: Run set-hostnames in run-base  https://review.opendev.org/72167615:44
*** markmcclain has joined #opendev15:56
mordredclarkb: so - that stack is green much further up now15:57
clarkbmordred: k I've been distracted by like 10 things already this morning. Going to try and reset than catch up on that work15:58
mordredcorvus, fungi: if you're bored - https://review.opendev.org/#/c/721107/ and parents are green and landable15:58
mordredclarkb: it's eversomuch fun15:58
mordredclarkb: also: https://review.opendev.org/#/c/717620/ and https://review.opendev.org/#/c/720527/ and https://review.opendev.org/#/c/720709/ are ready ... although obviously do not land them - we'll need to do a coordinated rollout15:59
mordredactually - maybe if I take out the docker compose up we can land them and then just to the restarts16:00
clarkbmordred: I don't think we want the ups anyway beacuse we land a lot of zuul changes16:01
mordredclarkb: yeah ... well, we don't on scheduler for sure... but we do need them in the gate. maybe I should do the gerrit trick and add a start variable16:02
mordredset it in the gate, don't set it in prod16:02
openstackgerritMonty Taylor proposed opendev/system-config master: Run nodepool launchers with ansible and containers  https://review.opendev.org/72052716:07
mordredclarkb, corvus : ^^ like that - if you like that approach, I'll apply it to the zuul patch too16:08
corvusmordred: it looks like you did the hostname/cloud-init moves in 3 changes when you probably could have done them in one.  i've +2d them, but i dunno if maybe you want to think about squashing them?16:09
corvusmordred: 721676, 721107, 72110616:09
corvusmordred: i'm fine with them as-is, but considering how test intensive they're likely to be, could take a while to land16:10
mordredcorvus: yeah - actually - if y'all like the final version there I can totally squash that16:10
mordredwhy don't I just go ahead and squash them - I think the end result is solid yeah?16:11
corvusmordred: i do like the final version, i almost suggested that on the first change16:11
openstackgerritMonty Taylor proposed opendev/system-config master: Rearrange set-hostnames and cloud-init removal  https://review.opendev.org/72110616:14
openstackgerritMonty Taylor proposed opendev/system-config master: Remove some extra bits from site.pp  https://review.opendev.org/72110116:14
openstackgerritMonty Taylor proposed opendev/system-config master: Split codesearch into its own playbook  https://review.opendev.org/72110216:14
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109816:14
openstackgerritMonty Taylor proposed opendev/system-config master: Move in-tree hiera settings to ansible vars  https://review.opendev.org/72162916:14
openstackgerritMonty Taylor proposed opendev/system-config master: Add new etherpad to cacti  https://review.opendev.org/72163316:14
openstackgerritMonty Taylor proposed opendev/system-config master: Stop cloning a bunch of puppet modules we don't use  https://review.opendev.org/72089216:14
mordredcorvus: thereyago16:14
clarkbmordred: ensuring that cloud-init isn't present should noop on our images. Was that causing problems (because it shouldn't)16:21
corvusclarkb: regardless, we still run it in 721106.  i think the commit message just has extra text now.16:23
clarkbhrm good point I think I'm reading too much into the rearrangement16:24
mordredclarkb: no - it wasn't causing problems - it just seemed inappropriate in a playbook called "set-hostnames"16:24
clarkbmordred: left a note on a different thing on why I Think this wasn't part of base16:24
corvusclarkb: expensive in test or prod?16:25
corvusalso, i'm not sure i've ever seen autoremove clean up old kernels16:25
clarkbcorvus: prod16:25
mordredclarkb: I can take out that autoremove if you want16:25
corvusclarkb: well, that would just happen once, right? if it happens at all?16:25
clarkbcorvus: it would happen everytime we upgrade kernels16:26
corvusclarkb: right, but i mean, it'll take a couple of seconds?16:26
corvusand keep our disks from filling up?16:26
clarkbcorvus: no its usually several minutes per kernel  because it rebuilds initramfs things16:26
clarkb(I think its initramfs)16:26
corvus(if it happens at all, which i'd love for it to, but i'm still skeptical)16:26
clarkbcorvus: what are we skeptical about?16:27
corvusi'm still skeptical autoremove will actually remove old kernels16:27
corvusi have lots of machines where i have to run a special script16:27
corvusbut let's ignore that for now16:27
clarkbit does for me on my xenial machine. Let me update and autoremove to dobuel check16:27
corvusbecause i want to be wrong on that :)16:27
corvuslet's assume it does happen: i think it's worth it and we should do it16:27
mordredI just ran apt-get autoremove on review-dev16:27
mordredit was a noop16:28
mordredthere are more than one kernel there16:28
corvusmordred: more than 2?16:28
mordredyes16:28
corvus(current and latest are generally what we should end up with)16:28
mordred173, 176 and 17716:28
clarkbmordred: are we booted on 173?16:29
mordredyes16:29
corvuslooks like something may be cleaning them already16:29
clarkbya so it won't remove 17316:29
corvus(unattended-upgrades?)16:29
clarkbcorvus: and yes I agree that implies something is already cleaning them16:29
mordredyeah - so - I think it's safe - and probably a good hygiene thing16:29
mordredbtw - I'm excited about how the puppet manifest split is turning out16:31
fungimordred: revisiting one of the backup cron errors from review-dev, it looks like mysql-client isn't installed on it (but is installed on the production server). do we need to add that package, or are we supposed to not be backing up that server?16:31
mordredfungi: hrm. you know - maybe we should update the backup cron to run mysqldump in the mariadb container16:32
mordredwith an exec16:32
mordredthat said - we DO have mysql-client on review16:32
fungiyeah, right now its running outside16:32
fungiand yes, it's working on production16:32
clarkbmordred: there is no mariadb container on review(-dev)16:32
clarkbI think that is the issue16:33
fungiahh16:33
mordredclarkb: oh - we need to redeploy review-dev don't we?16:33
mordredwait - no - i'm stupid16:33
clarkbmordred: well its still using trove is the reason16:33
mordredyeah16:33
mordredlet's add the mysql-client16:34
clarkbcorvus: fwiw just confirmed on my local xenial machine that autoremove does remove kernels16:34
clarkbcorvus: note that ubuntu default partitioning is notorious for not allocating enough space in /boot for 3 kernels (or even 2 sometimes)16:34
clarkbpossible that your need for special script is driven by ^16:34
clarkb(I manually partition /boot on my ubuntu machines now because the installer never gives it enough room)16:34
fungisame, i split /boot and /boot/efi even16:35
openstackgerritMonty Taylor proposed opendev/system-config master: Install mysql-client on gerrit servers  https://review.opendev.org/72168916:35
clarkbfungi: ya the installer does too, but it gives /boot/efi like 10x the space when it needs 1/100th the room16:36
fungiso that /boot can be formatted differently from the limited filesystems efi supports16:36
mordredfungi, clarkb : ^^16:36
fungiclarkb: yep, my /boot/efi is ~1% utilized according to df16:36
mordredoh good! the logstash filters job just started failing because apparently installing cryptography now isn't going to use wheels!16:38
fungimordred: i think i also accidentally deleted some related cronspam about bup failing on review-dev, but will try to hold onto the next one16:38
mordredhttps://zuul.opendev.org/t/openstack/build/81779d15dc634e3ea756c306851c560b16:38
mordredor something16:38
clarkbmordred: usually they fix that within a fwe minutes I think16:38
fungioh, or i bet i can check syslog ;)16:38
mordredok. so I just got unlucky it sounds like16:38
clarkb(they do upload wheel after sdists occasionally and that causes fallout)16:38
mordredsigh16:38
clarkbit would be nice if it was a thing to always upload a wheel first16:38
clarkbbut since wheels are optional you cna't really enforce that in pypi16:38
*** rpittau is now known as rpittau|afk16:39
clarkbmordred: https://pypi.org/project/cryptography/2.9.1/#files note the wheels vs tar.gz timestamp16:39
clarkbmordred: you got luck!16:39
fungihrm, syslog seems to indicate bup ran successfully, so maybe it's just noisy16:39
mordredis zuul having a sad?16:40
clarkbmordred: it won't let me ssh16:41
*** jrosser has quit IRC16:41
fungiclarkb: lots of projects build many wheels for different environments and interpreters, some of which may come way after the release depending on their build infrastructure16:41
clarkb(we saw this with bridge the other day too)16:41
clarkblive migrations maybe?16:41
clarkbfungi: ya they should all upload wheels first is my point16:41
clarkbbut its not something that is considered by people doing releases16:41
mordrednm16:41
mordredit's back16:41
*** mnaser has quit IRC16:41
*** vblando has quit IRC16:42
fungiright, though that effectively means avoiding uploading your source code until possibly days later when your build infrastructure catches up16:42
*** Open10K8S has quit IRC16:42
*** diablo_rojo_phon has quit IRC16:42
clarkbfungi: yup, but then your users don't break16:42
*** Open10K8S has joined #opendev16:42
clarkbmordred: oh I may not have my ssh key loaded now that I double check16:42
clarkbpebkac16:43
fungiload average and memory utilization look reasonable on the scheduler16:43
funginothing at all in dmesg since yesterday, so doesn't look like the vm paused16:44
*** jrosser has joined #opendev16:44
*** vblando has joined #opendev16:44
mordredI think maybe I just caught it doing a reconfig or something16:44
*** mnaser has joined #opendev16:44
fungiinfra-root: new pip beta is up if anyone wants to take it for a spin: https://www.mail-archive.com/distutils-sig@python.org/msg28968.html16:45
*** mnaser has quit IRC16:46
*** mnaser has joined #opendev16:47
*** mnaser has quit IRC16:49
*** mnaser has joined #opendev16:50
mordredclarkb: jeez. the carnage of dead jobs from that cryptography release :)16:51
corvusmordred: looks like some test failures coming in on that stack16:51
mordredcorvus: yeah - I believe they're all from the cryptography release16:52
mordred(they released tarball before wheels)16:53
mordredcorvus: left a follow up comment on https://review.opendev.org/#/c/721098/ to your comment16:53
*** hashar has quit IRC17:01
openstackgerritMerged opendev/system-config master: Remove old etherpad.openstack.org  https://review.opendev.org/71749217:05
openstackgerritMerged opendev/system-config master: Remove unused gerrit puppet things  https://review.opendev.org/71400117:05
openstackgerritMerged opendev/system-config master: Remove global variables from manifest/site.pp  https://review.opendev.org/72080017:05
mordredwoot!17:07
openstackgerritMonty Taylor proposed opendev/zone-opendev.org master: Remove etherpad-dev CNAME  https://review.opendev.org/72169317:08
mordredinfra-root: I think we're good to delete the old etherpad.openstack.org and etherpad-dev.openstack.org servers ... do we have any concerns with that?17:09
funginope, i haven't left anything there17:09
AJaegerclarkb, mordred, regarding etherpad-dev: Are we fine to take everything done for them? Then I can merge https://review.opendev.org/720900 to remove the puppet modules.17:10
corvusmordred: wfm17:10
AJaegermordred: will you empty the repos or shall I help you with those?17:10
mordredAJaeger: I can do it - unless you have a script already or something17:11
AJaegermordred: I have no script... Just seeing that you're busy with other fires and therefore offered to help...17:11
clarkbmordred: I think thats ok. Dont' forget there are trove instances assoicated with them, maybe clean those up a little later?17:12
mordredclarkb: yeah - I think let's leave them for now just in case?17:13
fungiyou can snapshot the trove instances to back them up if you want to delete17:13
mordredI think we're far enough down the road that falling back to them is unlikely17:13
AJaegermordred, clarkb , approved the first retirement change...17:13
mordredAJaeger: \o/17:13
AJaegermordred: ok, you go ahead with emptying repo - let me fix your second project-config change17:13
fungialso holding onto an archival snapshot is probably nicer to the provider than keeping an unused trove instance running17:13
mordred++17:14
AJaegermordred: I'll fix 720901 now...17:14
mordredclarkb: actually - for openstackci and gerrit and zuul - should we maybe just leave them as-is - I think people may still be using them to run 3pci?17:14
mordredor should we maybe tag their current tip with an eol tag, then land the retirement patch so that people are, they can pin to the eol patch?17:15
clarkbmordred: ya we can alawys tag after the fact too if necessary17:16
mordredgood point17:17
mordredok. I'm going to delete the old servers17:17
mordredinfra-root: also - we have kube-master and 3 kube minion servers in rax - I think those can be deleted too - we're not doing anything with them17:19
corvusmordred: i don't remember us setting up a k8s in rax17:20
mordredwe didn't17:20
mordredI think we booted servers in anticipation of doing so and then never used them17:20
openstackgerritAndreas Jaeger proposed openstack/project-config master: Retire unused puppet modules  https://review.opendev.org/72090117:21
AJaegerclarkb: could you put https://review.opendev.org/#/c/721312/ on your review queue to remove an unused job from system-config, please?17:22
fungimordred: my recollection is that while there are probably still rather a lot of ci systems out there deployed from our puppet modules, few if any were continuously consuming the updates from those modules' git repositories and rarely if ever upgraded (usually just when something in the software broke which we had a corresponding behavior change in the modules for)17:23
mordredfungi: ++17:24
openstackgerritMerged openstack/project-config master: Stop running jobs on unused puppet repos  https://review.opendev.org/72090017:26
corvusmordred: i'm still confused on https://review.opendev.org/721098 fwiw17:26
AJaegermordred: before we can fully retire the repos, you need to update https://review.opendev.org/#/c/720892/ - and get it merged17:29
mordredcorvus: so - the existing state of the world in the puppet of eavesdrop is that we create a jenkins user on eavesdrop using the ::jenkins::jenkinsuser puppet module. the private key for that user is then used as a zuul secret in the yaml2ical publication job17:30
mordredcorvus: I didnt' want to port "create a jenkins" user to the ansible17:30
openstackgerritAndreas Jaeger proposed openstack/project-config master: Retire unused puppet modules  https://review.opendev.org/72090117:30
mordredso figured this would be a good time to switch to using zuul deployment keys instead of putting an ssh private key into zuul as a secret17:30
mordredso the change here is to create a zuul user for the yaml2ical job to use instead of the jenkins user it was using, and then to update the job in project-config to use add_host along with per-project ssh key to connect to that zuul user17:31
corvusmordred: oooh, it's the jenkins user.  got it.17:32
mordred\o/17:33
corvusmordred: will this require a change to the yaml2ical job, or is the 'jenkins' user key already the project-config key?17:33
mordredcorvus: https://review.opendev.org/#/c/721099/17:34
corvusmordred: got it :)17:34
mordredI shoudl maybe fix that patch :)17:34
corvusyeah, we'll want to land that not too long after the other one :)17:34
mordredooh - that job might be being run on irc-meetings changes17:35
mordredwhich I don't think we want as much17:35
mordredmaybe we remove it from irc-meetings and run publication hourly from project-config17:36
openstackgerritMonty Taylor proposed openstack/project-config master: Use zuul deployment keys for yaml2ical  https://review.opendev.org/72109917:41
mordredcorvus: maybe something like that would be safer ^^17:41
openstackgerritMonty Taylor proposed opendev/system-config master: Split codesearch into its own playbook  https://review.opendev.org/72110217:47
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109817:48
openstackgerritMonty Taylor proposed opendev/system-config master: Move in-tree hiera settings to ansible vars  https://review.opendev.org/72162917:48
openstackgerritMonty Taylor proposed opendev/system-config master: Add new etherpad to cacti  https://review.opendev.org/72163317:48
openstackgerritMonty Taylor proposed opendev/system-config master: Stop cloning a bunch of puppet modules we don't use  https://review.opendev.org/72089217:48
openstackgerritMonty Taylor proposed opendev/system-config master: Run nodepool launchers with ansible and containers  https://review.opendev.org/72052717:49
openstackgerritMonty Taylor proposed opendev/system-config master: Run zookeeper cluster in nodepool jobs  https://review.opendev.org/72070917:50
openstackgerritMonty Taylor proposed opendev/system-config master: Run nodepool launchers with ansible and containers  https://review.opendev.org/72052717:50
openstackgerritSorin Sbarnea proposed zuul/zuul-jobs master: POC: download-artifacts: provide a dictionary with tests  https://review.opendev.org/72170317:52
openstackgerritSorin Sbarnea proposed zuul/zuul-jobs master: POC: download-artifacts: provide a dictionary with tests  https://review.opendev.org/72170317:54
mordred#status log deleted old etherpad.openstack.org and etherpad-dev.openstack.org servers17:54
openstackstatusmordred: finished logging17:54
fungithanks mordred!!!17:55
mnasermordred: Open10K8S is trying to use the image at https://review.opendev.org/#/c/713953/ -- is there any reason why we can't rebase and merge? (should we use it out-of-tree so we can land code and if it seems stable enough, we can psuh it back after?)17:59
mordredmnaser: oh - yeah - we should rebase that18:00
mordredmnaser: lemme do that real quick18:00
openstackgerritMerged opendev/system-config master: Remove system-config-puppet-beaker-rspec-puppet-4-centos-7-infra  https://review.opendev.org/72131218:09
openstackgerritMonty Taylor proposed opendev/system-config master: Add a uwsgi-base container image  https://review.opendev.org/71395318:11
mordredmnaser: ^^ thereyougo18:11
mnasermordred: ok awesome, i'll try to provide a "it works" signal..18:12
mnaserthinking out loud tho that might be hard for the images we build, bleh!18:13
mnaserbecause opendev/system-config is not in the vexxhost tenant18:13
*** ralonsoh has quit IRC18:16
openstackgerritMonty Taylor proposed opendev/system-config master: Rearrange set-hostnames and cloud-init removal  https://review.opendev.org/72110618:17
mordredclarkb, corvus : sorry ^^ I left out a change when I extracted that from the etherpad patch18:17
openstackgerritSorin Sbarnea proposed opendev/base-jobs master: Enable promote to unarchive gz archives in addition to bz2  https://review.opendev.org/72170618:19
mordredcorvus, clarkb: also - https://review.opendev.org/#/c/720527/ is green now - and I think it should be safe to land18:25
openstackgerritMonty Taylor proposed opendev/system-config master: Rearrange set-hostnames and cloud-init removal  https://review.opendev.org/72110618:26
openstackgerritMonty Taylor proposed opendev/system-config master: Remove some extra bits from site.pp  https://review.opendev.org/72110118:26
openstackgerritMonty Taylor proposed opendev/system-config master: Split codesearch into its own playbook  https://review.opendev.org/72110218:26
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109818:26
openstackgerritMonty Taylor proposed opendev/system-config master: Move in-tree hiera settings to ansible vars  https://review.opendev.org/72162918:26
openstackgerritMonty Taylor proposed opendev/system-config master: Add new etherpad to cacti  https://review.opendev.org/72163318:26
openstackgerritMonty Taylor proposed opendev/system-config master: Stop cloning a bunch of puppet modules we don't use  https://review.opendev.org/72089218:26
clarkbmordred: there are like 3 stacks now? do we have a preference in priority for one over the other? I've been distracted by testing my nodepool change (which we should land btw to address inap issue, I can restart launcher if it lands) as well as devstack thing in #openstack-qa18:27
mordredclarkb: the nodepool one is likely the simplest - and shouldn't cause things to restart ...18:29
mordredclarkb: so - if we land it, then we could use the need to restart launchers anyway to pick up your change as the opportunity to restart them18:29
clarkbk I'll try to take a quick look at it (but also need to prep for meeting)18:30
mordredclarkb: (a quick re-review on https://review.opendev.org/721106 would be nice - I just missed a single line)18:30
clarkbmordred: done on the quick rereview18:30
mordredclarkb: thanks!18:31
fricklercorvus: mordred: it was passing on the previous patch that got merged and I cannot find what changed over the weekend /var/log/ansible/service-meetpad.yaml.log.2020-04-18T18\:54\:06Z vs. /var/log/ansible/service-meetpad.yaml.log.2020-04-20T18\:33\:15Z18:32
mordredfrickler: looking18:33
mordredfrickler: Failed to import the required Python library (setuptools) on meetpad01's Python /usr/bin/python. Please read module documentation and install in the appropriate location. If the required library is installed, but Ansible is using the wrong Python interpreter, please consult the documentation on ansible_python_interpreter18:34
mordredfrickler: we rolled out the clarkb change to install docker-compose from pip18:35
clarkbmordred: frickler do we need to set python3 as the ansible_python_interpreter?18:35
mordredperhaps we shoudl be setting python_interpreter there18:35
mordredyeah18:35
mordredclarkb: I thnik so18:35
clarkbI want to say bionic has python3 but not python218:35
mordredpython3 is solid on that host18:35
clarkb(I don't know why it cares all of a sudden)18:35
mordredpython2 is there18:35
mordredbut yeah - I'm confused - maybe something accidentlaly installed python2 where it wasn't installed before?18:36
mordredclarkb, frickler: I'll push up a patch real quic18:37
openstackgerritMonty Taylor proposed opendev/system-config master: Set ansible_python_interpreter to python3 on meetpad  https://review.opendev.org/72170718:39
mordredclarkb, frickler :^^18:39
fricklermordred: oh, unattended upgrades http://paste.openstack.org/show/792499/18:39
mordredfrickler: ah yes - because before we had docker-compose installed which pulled in extra things18:40
mordredbut since we don't install that from distro anymore, u-a cleaned up for us (yay)18:41
mordredfrickler: glad that explains why it worked then stopped working18:41
fricklerthat also explains why one run passed even after the removal of docker-compose, it failed after the nightly cron18:41
prometheanfiremordred: ianw mind tagging a new glean release (if https://review.opendev.org/367487 can't get merged beforehand)18:41
openstackgerritMonty Taylor proposed opendev/ansible-role-puppet master: Stop logging puppet to syslog  https://review.opendev.org/72171018:44
mordredsigh18:44
clarkbmordred: what is wrong with syslog?18:45
openstackgerritMonty Taylor proposed opendev/system-config master: Stop logging puppet to syslog  https://review.opendev.org/72171118:45
mordredclarkb: it makes looking at job output hard18:45
clarkbmordred: well the reason we syslogged it was to have the logs on the host iirc18:46
mordredyeah- but I think these days it's more of a pita now that we have more and more logging on bridge because of ansible18:46
mordredit makes the ansible easy to see and the puppet hard18:46
clarkbmordred: ya I don't really care either way just wanted to call out that on host logs was an explicit design choice there before18:49
clarkbmordred: my only other concern is that the stdout/stderr logs may be less verbose than syslog by default?18:49
clarkbI think we get the most verbose set of logs in syslog18:49
mordredclarkb: do we? the docs on logdest don't call that out - but they're also super sparse18:50
clarkbmordred: I think default for stdout is just errors?18:51
clarkbmaybe warnings but I know we get all the info stuff in syslog too18:51
mordredclarkb: I could stick that in the tree after the eavesdrop patch and we could see the difference18:51
clarkbmordred: ++18:51
openstackgerritMonty Taylor proposed opendev/system-config master: Stop logging puppet to syslog  https://review.opendev.org/72171118:53
mordredclarkb: there - that'll be sure to trigger it and we can see what it's like18:53
mordredit's almost like this whole testing system works ;)18:53
mordredcorvus: if you have a sec, I'm getting a containers issue in https://zuul.opendev.org/t/openstack/build/4e475259a12b4e9383cd1afdb18760cb18:55
mnasermordred: hmm, ran into "fatal error: stdio.h: No such file or directory" before?  i'm assuming because of some of the clean ups done, its missing that18:55
mnasergcc is installed and i dont feel like adding build-essential in there18:56
mordredcorvus: and it makes me think I've got the job set up wrong - but I did parent it on system-config-run-containers18:56
mordredmnaser: you need somethign else ... one sec18:56
mnaserlibc6-dev maybeh?18:56
mordredyup18:57
mordredlibc6-dev18:57
mordredmnaser: nodepool's bindep is pretty minimal and a good one to look at for the basics18:57
mordred(it has a few more than you need for other thigns - but it's been minimized pretty strongly)18:58
mnaserok cool i'll add-as-i-go18:58
clarkbif you split build from consumption you should be able to safely add build essential and everything else you need without worrying about size too much18:58
clarkbI think the bindep env is called "compile"18:59
mordredthat's right18:59
mordredclarkb: but you actually don't need build-essential even in compile - because it pulls in all the stuff you need to build debian packages18:59
openstackgerritMohammed Naser proposed opendev/system-config master: Add a uwsgi-base container image  https://review.opendev.org/71395318:59
mordredmost of the time gcc and libc6-dev is all you need18:59
mnaseryeah that's true i guess18:59
mnaserit was just a matter of keeping it quick'n'efficent18:59
mordredI like avoiding it - even though ti's just build stage, because it is a waste18:59
mordredyup19:00
clarkbmordred: ya definitely just use what you need just wanted to call out we've addressed the underlying concern pretty well19:00
mordred++19:00
mnaserbtw, i wanted to propose at some point to add git to python-builder19:00
mnaserit seems like 99% of the times you end up having to isntall it19:00
mnaserand if you dont then assemble will19:00
mordredit should already be in builder19:00
mnaserhm19:00
corvusmeeting time in #opendev-meeting19:01
mordredit's added after the bindep stuff is installed so that if you have git in your bindep file you end up with it in your final image19:01
mnaserat the time i had to install it if i tired to do a git inside of it19:01
mnaser(trying to build openstack images using python-builder/python-base by cloning repos into it)19:01
ianwhrw: hrm, i think the centos-8 arm nodes should be ready?19:01
mordredmnaser: assemble will install it - but it's not there before assemble19:01
mnasermordred: yeah i kinda needed it to be there before assemble in my case, but its a weird unicorn case19:02
mordredmnaser: I highly recommend cloning the repos externally and pointing builder at them19:02
mnasermordred: yes i was going to follow up with ideally id like to do COPY from teh zuul checked out repos19:02
mordredyup19:02
mnasermordred: but i wasn't sure _yet_ how to bake that into the dockerfile while still making it buildable locally19:02
mordredbecause we explicitly don't have git in the builder image - becaues if we do it winds up excluded from the final image and that's bad :)19:02
ianwhrw: did you just try a centos-8-arm64 node type?19:03
mordredmnaser: I think for these you might just have to have the instructions be "cd src/opendev.org/openstack/nova ; docker build . -f ~/src/openev.org/vexxhost/openstack-images/docker/nova/Dockerfile"19:03
mnasermordred: ah yes that seems a bit reasonable19:04
mnaseror maybe you could do docker build inside nova but.. one day :)19:04
mordredone day :)19:04
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109819:07
openstackgerritMonty Taylor proposed opendev/system-config master: Stop logging puppet to syslog  https://review.opendev.org/72171119:07
openstackgerritMonty Taylor proposed opendev/system-config master: Move in-tree hiera settings to ansible vars  https://review.opendev.org/72162919:07
openstackgerritMonty Taylor proposed opendev/system-config master: Add new etherpad to cacti  https://review.opendev.org/72163319:07
openstackgerritMonty Taylor proposed opendev/system-config master: Stop cloning a bunch of puppet modules we don't use  https://review.opendev.org/72089219:07
hrwianw: o, indeed. started working.19:15
hrwianw: sorry for bothering19:16
prometheanfiremordred: ianw should I make a release request to releases for glean?19:18
fungiprometheanfire: no, we can just push a tag19:20
corvusfungi: when you said "this is true" in your comment, what does "this" refer to?19:28
corvusi totally get that the two comments form a conversation :)19:28
corvusi just don't understand the antecedent19:28
fungicorvus: "this" == "Its possible this is no longer a concern..."19:32
fungisorry19:32
openstackgerritMerged zuul/zuul-jobs master: Use cached 'tox_executable' in fetch-tox-output  https://review.opendev.org/72119219:32
prometheanfirefungi: ah, cool, wasn't totally sure how it was managed19:36
fungiit's no longer in openstack at all19:36
fungiso wouldn't be handled by the openstack release team regardless19:36
clarkbmordred: root screen has run flock -n /var/run/reprepro/ubuntu.lock bash in window 019:40
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109819:45
ianwhrw: excellent :)  lmn ... it seemed too easy!19:45
openstackgerritMonty Taylor proposed opendev/system-config master: Stop logging puppet to syslog  https://review.opendev.org/72171119:47
ianwprometheanfire: i can do a tag soon.  will this fix the builds in the dib gate?19:48
prometheanfireianw: It along with the DIB patch will help I think19:51
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109819:55
openstackgerritMonty Taylor proposed opendev/system-config master: Stop logging puppet to syslog  https://review.opendev.org/72171119:55
corvusmordred: re https://review.opendev.org/721707 -- if we just remove python2 on the host, then testing == prod again, right?19:55
mordredyeah - I think that's the better idea19:56
mordreddoing that on meetpad now19:56
corvusokay, i -2d the change because our streams crossed and frickler +3d it19:57
mordred++19:57
mordredcorvus: when you have a sec, https://review.opendev.org/#/c/721106/ needs a re-review - I missed a line when I split it out earlier19:59
clarkbinfra-root I have manually held all the affected lockfiles in https://review.opendev.org/#/c/720718/2/modules/openstack_project/manifests/mirror_update.pp via a root shell on mirror-update.openstack.org. I am going to approve this change now20:00
corvusmordred: +320:00
openstackgerritMerged zuul/zuul-jobs master: Make linting use of find portable  https://review.opendev.org/72159520:00
mordredcorvus: thanks20:00
mordredclarkb: ++20:00
clarkbalso I've bumped the quotas for ubuntu-ports and ubuntu mirrors as they both add focal20:00
clarkbonce that change lands and applies we can then manaully run all of those sync commands, then manually vos release -localauth on the afs servers20:01
clarkbI'll work on all that over the course of today20:01
corvusclarkb: thanks!20:01
corvusmordred: did you remove python2?  i'll manually run the meetpad playbook if so20:01
mordredcorvus: I did20:01
ianwclarkb: ++ can give a hand if it gets late for you20:01
clarkbianw: thanks!20:01
corvus<2001:4800:7819:104:be76:4eff:fe04:892f> Failed to connect to the host via ssh: /bin/sh: 1: /usr/bin/python: not found20:02
corvusmordred: i thought it was supposed to try python3 in that case?20:02
mordredcorvus: neat. I thought it was too20:02
corvusmordred: oh, maybe i need to refresh the fact cache?20:02
fungidepends on the version of ansible, right?20:02
mordredoh - maybe so20:02
mordredcorvus: yeah - /usr/bin/python is definitely in the fact cache for meetpad20:03
ianwi think that container based functional tests for platforms of interest are failing due to test configuration at this point, particularly volume maps, rather than issues with dib/containers as such.  investigating20:04
mordredas is discovered_interpreter_python20:04
corvusmordred: i tried "-m setup" and "-m gather_facts" and no joy20:04
clarkbianw: I've stuck a copy of reprepro-mirror-update in my homedir on mirror-update.openstack.org and commented out the vos release20:04
corvusmordred: got any hints?20:04
clarkbianw: I'll be using that script once reprepro configs update20:04
openstackgerritAndreas Jaeger proposed openstack/project-config master: Retire i18n-specs repo  https://review.opendev.org/72171920:05
mordredcorvus: why don't we try moving the fact cache file out of the way20:05
corvusmordred: https://docs.openstack.org/openstack-ansible/12.2.6/install-guide/ops-troubleshooting-ansiblecachedfacts.html#forcing-regeneration-of-cached-facts20:05
corvusmordred: our osa friends appear to recommend that20:05
mordred+20:05
mordredwant me to do that?20:05
corvusmordred: i will20:05
mordredk20:05
clarkband with that I'm going to get a bike ride in as everything should be stable at the moment20:06
clarkbback in an hour or so20:06
corvusmordred: that made everything better20:07
mordredcorvus: woot!20:07
clarkbcorvus: mordred ok you know what my explicit trigger of replication for openstack/requirements may have fixed the review/p/ thing20:08
clarkbbecause I agree ti seems up to date nwo20:08
clarkbmordred: so maybe we just need to fix those new repos and then retrigger replication for all of /opt/git?20:09
clarkband really popping out now. back soon20:09
openstackgerritAndreas Jaeger proposed openstack/project-config master: Retire i18n-specs repo  https://review.opendev.org/72171920:10
openstackgerritAndreas Jaeger proposed openstack/project-config master: Finish retiring i18n-specs  https://review.opendev.org/72172220:10
openstackgerritAndreas Jaeger proposed openstack/project-config master: Finish retiring i18n-specs  https://review.opendev.org/72172220:13
ianwprometheanfire / mordred: i just pushed glean 1.18.0 ... only functional change really is the gentoo resolv.conf thing20:14
corvusfrickler: okay meetpad seems to be working again20:16
corvuser, restarting again real quick20:16
openstackgerritMerged opendev/system-config master: Rearrange set-hostnames and cloud-init removal  https://review.opendev.org/72110620:18
openstackgerritMerged opendev/system-config master: Remove some extra bits from site.pp  https://review.opendev.org/72110120:18
openstackgerritMerged opendev/system-config master: Start mirroring focal, stop mirroring trusty  https://review.opendev.org/72071820:19
openstackgerritIan Wienand proposed openstack/diskimage-builder master: [wip] switch func tests to containers  https://review.opendev.org/72151120:27
openstackgerritIan Wienand proposed openstack/diskimage-builder master: [wip] strip env vars in yum-minimal chroot calls  https://review.opendev.org/72172620:27
mordredclarkb: yeah - they're definitely not owned right - but I don't know why _why_ they aren't owned right, becaues jeepyb sure does do a chown20:35
mordredI'm just going to fix them for now - let's keep an eye out next time we make a new project - the system was known to be in an inconsistent state, so I don't know that debugging why the three repos created during that time are weird is worth it20:38
openstackgerritIan Wienand proposed openstack/diskimage-builder master: [wip] strip env vars in yum-minimal chroot calls  https://review.opendev.org/72172620:46
openstackgerritIan Wienand proposed openstack/diskimage-builder master: [wip] switch func tests to containers  https://review.opendev.org/72151120:46
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109820:54
*** DSpider has quit IRC20:57
openstackgerritMonty Taylor proposed opendev/puppet-hound master: Update git url to hound-search  https://review.opendev.org/72172920:59
mordredclarkb, fungi, corvus : ^^ sigh20:59
openstackgerritIan Wienand proposed openstack/diskimage-builder master: [wip] strip env vars in yum-minimal chroot calls  https://review.opendev.org/72172620:59
openstackgerritIan Wienand proposed openstack/diskimage-builder master: [wip] switch func tests to containers  https://review.opendev.org/72151120:59
mordredclarkb: incidentally: https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_31b/721711/5/check/system-config-run-codesearch/31b107e/bridge.openstack.org/ara-report/result/a673a179-9ab6-41db-8ade-de5952604587/21:01
openstackgerritMonty Taylor proposed opendev/ansible-role-puppet master: Add color parameter  https://review.opendev.org/72173221:05
openstackgerritMonty Taylor proposed opendev/system-config master: Stop logging puppet to syslog  https://review.opendev.org/72171121:07
mordredclarkb: that's the results of the "log puppet to stdout" change. there's sadly some ansi color - so there's the abilitu to disable so that it's readable21:07
mordredinfra-root: logstash-worker13 seems down21:13
mordred(noticed this from a failed infra-prod-base)21:14
corvusit appears to have gone down on april 1821:14
mordredneat.21:15
mordredcorvus: I'm gonna look at its console on rax - and if there's no smoking gun issue, I figure just restarting it yeah?21:17
corvusmordred: sgtm; i confirm no ssh from here21:17
mordredkk21:17
fungiat best you'll probably see console messages about cpu cycle skips21:18
fungiprobably another live migration gone wrong21:18
mordredit OOMed21:19
funginice21:19
fungioom'ed hard enough to take ssh offline?21:19
fungior just an outright panic?21:19
mordredrebooting21:19
corvusand snmp21:19
mordredoomed and then there were jdb stuck tasks21:19
corvusah21:20
fungiwere the timestamps close together? otherwise the oom could have been from some other time21:20
mordredthe rax showed last updated april 18 - so I'm also guessing live migrate - that then caused something to get stuck - and the oom is just. symptom21:20
fungiregardless, yeah, needs reboot21:20
mordredsshing again21:20
mordredworking21:20
fungithanks!21:20
mordredfungi, corvus: do y'all have opinions on the puppet-log-to-stdout patch?21:20
mordredah - I see you do!21:21
* mordred goes to read21:21
corvusmordred: i just wrote a followup comment too21:21
fungii had opinions? i've already forgotten if so21:21
mordredcorvus: I agree with your statement21:22
mordredI would prefer both too - but I also agree, I don't see that as an option21:22
corvusbasically, there have been times recently where the only way i could find out what happened on a host was on the hosts syslog, so i'd have more warm fuzzies if that could remain while we got used to how it's sposed to work :)21:22
mordred++21:22
mordredI mean - it's a variable - we could set it differently in the gate21:23
corvusnah, let's try the thing that makes the most sense, and change it if it's bonkers21:23
mordredwhich is where my main concern is - it's harder to find the logs amongst the syslog in the gate jobs21:23
mordredkk21:23
corvusso frowny face +221:23
mordredfungi: https://review.opendev.org/#/c/721732/ and https://review.opendev.org/#/c/721711/ if you feel you might have an opinion21:23
mordredhttps://zuul.opendev.org/t/openstack/build/31b107e4fef348ba9c4d002d2d689a9e is a job with the logging to stdout - albeit still with ansi codes21:24
mordredbut you can click through to the ara report and then see the output in the failed task21:24
fungimordred: my opinion is yes21:25
fungiat least on the first one21:25
mordred\o/21:25
* mordred is writing so many puppet related patches21:26
fungi721711 is somewhat of a lack of visibility for folks who have ssh access to some servers but not bridge21:26
fungialso, like corvus, i've relied on this in the past but agree it won't prevent me from being able to investigate at least21:27
corvusfungi: but if we can verify the log is clean, we could make that public21:27
fungiyep, true21:27
corvus(by including it in the job)21:27
fungiso potentially improved visibility for people with no ssh access21:27
fungiwhich on balance is nicer21:27
fungialso, what's the story for launch-node?21:27
fungiwill the puppet output come back to stdout of the calling terminal now?21:27
fungipreviously we had to --keep and then ssh into a broken launched node and look in syslog to find out what puppet did (or didn't)21:28
mordredfungi: I'm gonna say "yes"21:28
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: cabal-test: add first haskell job  https://review.opendev.org/72173521:28
fungiif that's how launch-node starts behaving, might prevent needing to rerun with --keep21:29
fungiso actually nice21:29
*** dpawlik has quit IRC21:34
*** sgw has quit IRC21:34
clarkbcorvus: ya restarting those workers is generally the best thing if its sad21:39
clarkbinfra-root it seems that the focal update is on mirror-update now21:39
clarkbI'm going to start with some of the smaller repo as we should be able to clear them out quickly21:40
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: cabal-test: add first haskell job  https://review.opendev.org/72173521:41
clarkbinfra-root that results in http://paste.openstack.org/show/792506/ so I'm trying to figure out clearvanished now21:42
clarkbthis was on ceph hammer21:42
openstackgerritMerged opendev/ansible-role-puppet master: Add color parameter  https://review.opendev.org/72173221:43
clarkbinfra-root: `k5start -t -f /etc/reprepro.keytab service/reprepro -- reprepro --confdir /etc/reprepro/debian-ceph-hammer clearvanished` appears to be what I want goign for htat if i don't hear objection21:46
ianwclarkb: that seems about right, although i have to re-learn all the reprepro magic whenever i have to touch it21:47
clarkbnwo it says There are still packages in 'trusty|main|amd64', not removing (give --delete to do so)!21:48
clarkbso I'm going to give it the --delete21:48
openstackgerritIan Wienand proposed openstack/diskimage-builder master: [wip] strip env vars in yum-minimal chroot calls  https://review.opendev.org/72172621:49
openstackgerritIan Wienand proposed openstack/diskimage-builder master: [wip] switch func tests to containers  https://review.opendev.org/72151121:49
clarkband now it says (dumpunreferenced lists such files, use deleteunreferenced to delete them.)21:49
clarkbso I think the sequence is reprepro --delete clearvanished && reprepro deleteunreferenced21:49
clarkbI'm doing that now21:49
clarkb(still only operating on the small ceph repo so that if I get it wrong rebuilding is easy21:50
clarkbyou have to do reprepro --nokeepunreferencedfiles deleteunreferenced otherwise it doesn't actually delete unreferenced files21:52
clarkbso now I'm gonna rerun reprepro normally and it should be happy since trusty has been removed21:52
*** sgw has joined #opendev21:53
prometheanfireianw: looks like the gentoo dib build is passing, so, good sign21:53
clarkbalright that looks happy so now I'm gonna vos release it and we can double check the trusty cleanup results21:54
openstackgerritMerged zuul/zuul-jobs master: helm-template: allow users to disable wait-for-pods  https://review.opendev.org/72136921:55
fungiwow, reprepro is such an example of arcane incantations21:55
clarkbinfra-root https://mirror.dfw.rax.opendev.org/ceph-deb-hammer/ was my first victim. Can you check if that looks like trusty cleaned up properly?21:56
clarkblooks like we may need to manually remove https://mirror.dfw.rax.opendev.org/ceph-deb-hammer/dists/trusty/main/binary-amd64/ ?21:56
clarkb(I think the man page actually warns of this)21:56
clarkb"Do not forget to remove all exported package indices manually."21:56
clarkbbut also those are much smaller can probably be done whenever we like without much worry21:57
clarkb(rather than needing to incorporate it as part of the manual flow but I can as well)21:57
ianwhrm, i thought pretty much what was described in https://docs.openstack.org/infra/system-config/reprepro.html would get rid of it21:57
ianwthe bit at "In some situations where things are very out of sync, it may be easier to remove and replace an entire section of the repository." anwyay ... but ... yeah, reprepro is all black magic ...21:57
fungiwell, don't forget reprepro keeps a stateful database in, i think, /var/lib about what it's synced21:58
fungiso just deleting the files which were mirrored by hand can get it all nice and confused21:59
clarkbya I think what ianw links to is how we keep the reprepro db in sync21:59
clarkbto deltee packages I had to run the deleteunreferenced command21:59
clarkbhttps://mirror.dfw.rax.opendev.org/ceph-deb-hammer/pool/main/c/ceph/ shows no more trusty packages so I think that was working21:59
fungiahh, yeah, our doc talks about the db files too21:59
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: cabal-test: add first haskell job  https://review.opendev.org/72173522:00
fungianyway, if you're going for clean slate, you have to delete those too22:00
clarkbshould I manually delete the indexes as suggested by the manpage then rerun reprepro and a vos release?22:00
clarkbfungi: those?22:00
openstackgerritNate Johnston proposed openstack/project-config master: Retire the Congress project  https://review.opendev.org/72174222:00
fungiclarkb: no, i suspect the manpage is talking about empty package indices22:00
clarkbfungi: any idea how I get the trusty indices to go away?22:01
fungibut i'm not in a good state to try to digest it at the moment (i've only ever had to troubleshoot it once, when we were trying to add debian/buster backports i think, and got really turned around)22:01
clarkbhttps://mirror.dfw.rax.opendev.org/ceph-deb-hammer/dists/trusty/ basically all that22:01
clarkbmy reading of the manpage is that I have to manually delete that22:02
fungioh, hrm, yeah those aren't empty22:02
clarkband I'm happy to rm that tree and then rerun reprepro to make sure its still happy22:02
fungiyou could move the trusty dir out of the way and rerun reprepro to see if ti complains or recreates it22:02
clarkbok let me try that22:03
clarkbI renamed trusty to trusty-old, reran reprepro. It seems to have completed successfully and not error'd. There is no new trusty/ dir under dists22:04
clarkbnow I will rm -rf trusty-old/ ?22:04
clarkbI'll rerun reprepro after that just to be double sure22:04
openstackgerritNate Johnston proposed opendev/irc-meetings master: Remove the congress team meeting  https://review.opendev.org/72174422:06
fungiclarkb: sounds fine, yep22:06
clarkbhttps://mirror.dfw.rax.opendev.org/ceph-deb-hammer/dists/ all done that looks good to me22:07
fungii guess it just doesn't clean up package metadata files or remove any entries from them if you tell it to delete the packages22:07
clarkbalright I'll proceed with the other smaller repos and get a complete process written down22:07
openstackgerritMonty Taylor proposed opendev/puppet-hound master: Update git url to hound-search  https://review.opendev.org/72172922:09
mordredcorvus, fungi : ^^ sorry - puppet lint - re-review plz?22:09
mordredclarkb: reading scrollback22:09
corvusmordred: i will happily approve changes both with and without the :: :)22:10
corvusbecause i have no idea22:10
fungii will even more happily approve changes removing puppet though ;)22:11
mordredfwiw: https://github.com/voxpupuli/puppet-lint-absolute_classname-check#relative-class-name-inclusion22:11
clarkbinfra-root https://etherpad.opendev.org/p/tMfB9VRcqe7NhS9a4-ZX thats the process I think22:11
mordredclarkb: looks like things are going decently for you?22:11
clarkbmordred: yup I think I have a process that works now22:12
clarkbI just need to work through the list. I'll leave ubuntu and ubuntu-ports for last as they are scariest :)22:12
clarkbI want things to go wrong in the smaller repos22:12
mordredclarkb: lgtm22:12
fungiclarkb: what's with the custom /home/clarkb/reprepro-mirror-update ?22:13
clarkbfungi: it doesn't vos release because we want to do that on the afs server with localauth22:14
fungioh, yep!22:14
mordredcorvus: if you have a sec: at the end of the failed task here: https://zuul.opendev.org/t/openstack/build/8b290fa9faf44657b5367a1e22e49e2f/console22:14
fungigood call22:14
clarkbalso note that the deb ceph jewel repo also has an old jessie repo in it22:15
clarkbgonna clean that up with trusty because may as well22:15
mordredcorvus: the system-config-build-image-accessbot job builds opendevorg/accessbot - but that docker pull is not finding it22:16
fungigood call22:16
mordredclarkb: ++22:16
mordredcorvus: this is for https://review.opendev.org/#/c/72109822:16
clarkbjewel hasn't actually updated in a while because of the jessie thing so this is a good exercise of all the stuff22:18
mordredcorvus: do I need to explicitly put a latest tag in there?22:19
mordredcorvus: OH22:21
mordredwait - no22:21
mordredcorvus: yeah - pull-from-intermediate-registry is running on localhost - not on the jobs' eavesdrop.o.o host22:22
mordredcorvus: do we need to do things to make sure the docker we install with system-config on the job's eavesdrop.o.o is configured to pull from the buildset registry? I thought we'd already worked on this before22:24
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: cabal-test: add first haskell job  https://review.opendev.org/72173522:26
corvusmordred: erm, let me page stuff in :)22:30
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109822:32
mordredcorvus: I'm taking a stab at something there - but definitely page things in because I think there's still maybe a fundamental thing here22:33
clarkbceph jewel and ubuntu cloud archive are vos releasing now22:33
clarkbthen I'll do ports22:33
clarkbthen I'll do ubuntu22:33
clarkbI expect these to take signficiantly more time22:33
mordredcorvus: what I *think* I've determined is that we have an issue because we don't run use-buildset-registry on our nodes22:33
mordredso nodes we have that are using images aren't getting speculative images becuase the docker we install as part of our production ansible is not configured to talk to the buildset registry22:34
clarkbalright those two are done done now22:34
corvusmordred: i think we do -- https://zuul.opendev.org/t/openstack/build/8b290fa9faf44657b5367a1e22e49e2f/console#1/0/41/eavesdrop01.openstack.org22:34
corvusmordred: we run it in run-base-pre22:35
mordredcorvus: oh - gotcha22:35
clarkbnote: ubuntu-ports does not remove any dists only adds focal22:35
clarkbso I'm skipping the first two steps and going straight ot running our script22:36
mordredcorvus: so my latest patchset is poo22:36
clarkbWarning parsing /etc/reprepro/ubuntu-ports/updates, line 13: unknown architecture 'amd64' will be ignored!22:37
clarkbI think that is because ports is everything but amd6422:37
clarkb(so thats all fine and can be ignored)22:37
clarkbalso I did bump quota on ubuntu-ports by a fair bit so that should be fine. Can reduce it down if I gave it too mcuh room after22:39
mordredclarkb: I think so - it's just a quota, not an actual FS allocation22:39
corvusmordred: i think we reinstall docker22:39
mordredcorvus: yeah - because we install docker as part of the playbook22:40
corvusmordred: it might overwrite the config22:40
mordredsigh22:40
corvusor actually, that might not be a reinstall22:40
mordredit's the first install22:40
corvusyeah; so we should verify the daemon config22:40
mordredthere is no docker when we run use-buildset-registry22:40
mordredis it safe to pull back to the log server?22:40
corvusyep22:41
mordredk. adding it to the mix22:41
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109822:42
corvusmordred: add /etc/hosts too22:43
corvushttps://zuul.opendev.org/t/openstack/build/8b290fa9faf44657b5367a1e22e49e2f/console#1/0/39/eavesdrop01.openstack.org22:43
corvusthat's what the docker config should be; i just noticed it relies on /etc/hosts22:43
mordredcorvus: oh - you know what - I think we write out /etc/hosts22:44
mordredlemme check if it does lineinfile22:44
mordredcorvus: nope! that would be it22:44
corvusah in set-hostname22:44
mordredyeah22:45
mordredmaybe we should move set-hostname to the pre-playbook22:45
corvusmordred: and we weren't using that role earlier right (before your change?)22:45
mordredand run it before use-buildset registry22:45
mordredyup, that's right22:45
corvuscool, then we only just broke it :)22:45
corvusmordred: yeah, i kinda like that, since it's something we usually run in launch node22:46
mordredcorvus: oh - except wanna hear somethign funny?22:46
corvusmordred: and maybe add a note that use-buildset-registry wants to run after it22:46
mordredcorvus: it's in playbooks/roles, not roles - so it's not accessible by zuul22:46
mordredand the pre playbook is run by zuul22:46
mordredlemme try symlink22:46
clarkbI'm realizing this might take all day. So I'm going to start ubuntu concurrent with ubuntu-ports22:46
corvusmordred: the pre playbook is in playbooks/zuul -- is that not good enough?22:47
openstackgerritMerged opendev/puppet-hound master: Update git url to hound-search  https://review.opendev.org/72172922:47
corvusmordred: (would that only allow access to playbooks/zuul/roles, not playbooks/roles ?)22:47
mordredyup22:47
clarkbianw: ^ fyi, I'll try pop in over the evening and check on things but chances are you may end up needing to do vos release later? or maybe I'll even do that in the morning22:47
mordredwe could move it to playbooks22:47
mordredor - let's start with symlink - verify it fixes the issue - then we can take a step back and think about what the right way to organize is22:48
corvusk22:48
corvusmordred: could also probably do lineinfine :)22:48
mordredcorvus: yeah22:49
mordredcorvus: but I think we overwrite on purpose because we don't know the incoming state of hostnames from our cloud providers22:49
ianwclarkb: so it's syncing ubuntu?  let me pull up a window22:49
clarkbianw: not yet its doing the cleanup bits now22:50
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109822:50
clarkbianw: root screen window 0 is ubuntu and window 1 is ubuntu-ports22:50
clarkbalso I see that we sync source packages22:50
mordredcorvus: trying symlink. I moved the role to roles and then symlinked to it from playbooks/roles22:50
clarkbwe maybe don't want to do that?22:50
corvusmordred: good point, then that sounds best22:50
mordredclarkb: I think we did that on purpose for a reason?22:50
clarkbmordred: oh maybe if it is on purpose I'll just accept it will take longer :)22:51
mordredclarkb: well - I mean - I think - but I don't remember what the reason was22:51
clarkb106.9k trusty packages are about to be deleted22:51
clarkbI'm largely trusting reprepro here fwiw (and the fact it seemed happy on the 3 previous repos)22:52
ianwclarkb: so i see ubuntu-ports running, is there another window for ubuntu?22:53
ianwoh sorry, i see your prior comment now22:53
clarkbianw: windows 2,3,4 can all be closed which will release their locks and allow those volumes to be updated normally22:53
clarkbianw: I'll do that once I've got ubuntu running the normal update script22:53
ianwok22:54
clarkbianw: I'm hoping I can reduce the problem set to "run vos release command on afs01 once this script finishes)22:54
ianwok, i won't go touching anything yet :)22:56
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: cabal-test: add initial haskell job  https://review.opendev.org/72173522:56
ianwthe first thing we need to move this to ansible is gpg import stuff ... i started looking around to see if that was a solved problem a while ago, ansible didn't have anything built in22:57
clarkbfwiw the kernel versions it is deleting look good22:57
clarkb(they are all reaelly old trusty era versions)22:57
openstackgerritMonty Taylor proposed opendev/system-config master: Stop logging puppet to syslog  https://review.opendev.org/72171122:58
mordredianw: I feel like we've got some gpg import code somewhere that can be cargo culted22:58
ianwthe repropro roles are pretty much just templating22:58
mordredianw: ah - no - all we have is apt-key imports, which are different22:59
mordredianw: roles/add-gpgkey in zuul-jobs23:00
mordredianw: I knew I'd seen something somewhere23:00
mordredianw: don't know if that'll be useful - but maybe it's helpful23:01
ianwcool, yeah i'll have to context switch it back in but it was something like that23:03
*** tosky has quit IRC23:03
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: cabal-test: add initial haskell job  https://review.opendev.org/72173523:05
ianwok, i think i've fixed the problem with the functional-containers build for yum-minimal (https://review.opendev.org/721726) -- using wrong TMPDIR when inside initial chroot23:05
ianwwhy this doesn't fail in a non-container environment ... i don't know23:05
ianwi've noticed some other stuff that can be cleaned up in the job now though, so will beat it all into order today23:06
mordredianw: sweet23:08
mordredianw: so - do you think after that's in shape that just installing deboostrap from the ppa you made is the next step in the debootstrap thing?23:08
* mordred has only half-paged in the stuff you did - is curious23:09
ianwmordred: i think so ... for getting debuntu working with the status quo minimal elements23:11
clarkbianw: https://etherpad.opendev.org/p/tMfB9VRcqe7NhS9a4-ZX I'm taking notes there on current states of things23:12
ianwmordred: no issues looking deeper into the container based build elements -- but personally i don't think we should try bringing in that, and container builders all at the same time23:12
mordredyeah - I think fixing one thing at a time is a good idea23:13
ianwit's always better if we have something to clearly roll-back to23:13
mordredand in general - I'd like to be able to say "the zuul/nodepool-builder images generally work" - which they generally do other than this23:13
ianwi.e. what did *not* happen with the suse stuff last week or so23:14
mordredyeah23:14
ianwi think so -- soon we'll have centos,fedora,xenial,bionic,suse in dib gate and in experimental buster,gentoo using the builder images23:15
clarkbianw: I've closed the other windows and removed their assocaited locks so the screen is very focused now on the remaining repos23:15
ianwthat should be enough distributions for anyone :)23:15
ianwok, i'm watching a tail of the ports update log23:17
mordredianw: ++23:18
mordredianw: who would ever need more that 640k of RAM???23:18
mordredzomg!23:19
mordredclarkb, ianw : https://review.opendev.org/#/c/721102 is green! it's our first "split a puppeted service into its own playbook"23:19
clarkbianw: I think I'm going to start dinner early. That way I can come back to the mirrors when they've gotten further along23:32
clarkbianw: both seem to be happily chugging through the packages right now and quota headroom looks fine23:32
clarkbI'll check in periodically through my evening and will be sure to let you know when I'm permanently afk23:33
clarkbthe etherpad should be up to date onw where we are right now23:33
ianwclarkb: ok, thanks ... i have a window up with the logs so will keep watching too23:34
openstackgerritMonty Taylor proposed opendev/system-config master: Split eavesdrop into its own playbook  https://review.opendev.org/72109823:36
openstackgerritMonty Taylor proposed opendev/system-config master: Stop logging puppet to syslog  https://review.opendev.org/72171123:37
*** mlavalle has quit IRC23:37
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: Add remove-zuul-sshkey  https://review.opendev.org/68071223:45

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!