Monday, 2021-02-22

ianwfungi: yeah, i don't have any particular insight over your testing.  i'm happy to babysit a restart00:01
fungihappy to do a restart myself too, just wanted to make sure i wasn't overlooking anything obvious00:01
fungiand clearly it's not crippling us as it's been broken since tuesday and nobody reported it until saturday00:02
fungior wednesday/sunday au time00:02
fungii can restart it now if you're cool with that00:03
ianwyep i am00:04
fungiianw: okay, doing it now00:05
*** cloudnull has quit IRC00:05
fungiit's starting back up now00:05
*** cloudnull has joined #opendev00:06
fungiianw: also i saw there were some backup errors e-mailed to our shared root mail address00:06
fungigerrit webui seems to be back again00:07
fungiand `docker-compose exec gerrit lslocks -u` includes /var/gerrit/index/accounts_0011/write.lock again00:07
fungiif brinzhang happens back by, they should be able to try generating an http password again00:08
fungii'll mention it over in #openstack-nova since that's where it seems to have been initially mentioned00:09
fungi#status log Restarted the Gerrit container on review.o.o to address a recurrence of https://bugs.chromium.org/p/gerrit/issues/detail?id=1372600:10
openstackstatusfungi: finished logging00:10
*** brinzhang has joined #opendev00:43
*** tosky has quit IRC00:53
*** LowKey has quit IRC00:56
*** LowKey has joined #opendev00:56
openstackgerritIan Wienand proposed opendev/system-config master: [wip] gerrit : add mariadb_container option  https://review.opendev.org/c/opendev/system-config/+/77596101:26
openstackgerritIan Wienand proposed opendev/system-config master: gerrit: download latest mysql connector  https://review.opendev.org/c/opendev/system-config/+/77685701:26
*** LowKey has quit IRC02:14
*** fdegir has quit IRC02:58
openstackgerritBrin Zhang proposed openstack/project-config master: Create inspur/venus-dashboard repository  https://review.opendev.org/c/openstack/project-config/+/77686403:01
*** ykarel has joined #opendev03:14
*** lbragstad_ has joined #opendev03:41
*** lbragstad has quit IRC03:43
openstackgerritIan Wienand proposed opendev/system-config master: [wip] gerrit : add mariadb_container option  https://review.opendev.org/c/opendev/system-config/+/77596104:54
*** ysandeep|away is now known as ysandeep|ruck05:25
*** zoharm has joined #opendev05:27
brinzhangfrickler, infra-root: generate password for HTTP Credentials works fine now, thanks^05:47
*** dtantsur|afk is now known as dtantsur06:14
*** whoami-rajat has joined #opendev06:23
*** dtantsur is now known as dtantsur|brb07:05
*** hashar has joined #opendev07:09
*** lpetrut has joined #opendev07:17
*** slaweq has joined #opendev07:17
*** guillaumec has quit IRC07:34
*** guillaumec has joined #opendev07:34
*** eolivare has joined #opendev07:38
*** ralonsoh has joined #opendev07:38
*** zoharm has quit IRC07:40
*** zoharm has joined #opendev07:40
*** hashar has quit IRC07:52
*** hashar has joined #opendev07:54
*** ysandeep|ruck is now known as ysandeep|lunch08:08
*** andrewbonney has joined #opendev08:10
*** ykarel_ has joined #opendev08:30
*** ykarel has quit IRC08:33
*** ykarel_ is now known as ykarel08:33
*** rpittau|afk is now known as rpittau08:34
*** tosky has joined #opendev08:43
*** jpena|off is now known as jpena08:58
*** fressi has joined #opendev08:59
*** ysandeep|lunch is now known as ysandeep|ruck09:03
*** ykarel has quit IRC09:23
*** ykarel has joined #opendev09:38
*** DSpider has joined #opendev09:59
*** slaweq has quit IRC10:11
*** slaweq has joined #opendev10:15
*** sshnaidm__ is now known as sshnaidm10:22
*** ykarel has quit IRC10:36
*** jhesketh has quit IRC10:37
*** jhesketh has joined #opendev10:37
*** ykarel has joined #opendev10:42
*** tosky_ has joined #opendev10:53
*** tosky has quit IRC10:54
*** tosky_ is now known as tosky10:54
*** slaweq_ has joined #opendev11:27
*** dviroel has joined #opendev11:30
*** slaweq has quit IRC11:31
openstackgerritSorin Sbârnea proposed zuul/zuul-jobs master: Upgrade ansible-lint to 5.0  https://review.opendev.org/c/zuul/zuul-jobs/+/77324511:40
*** dtantsur|brb is now known as dtantsur11:50
*** hashar is now known as hasharLunch11:59
openstackgerritGuillaume Chauvel proposed opendev/system-config master: Increase autogenerated comment width to avoid line wrap  https://review.opendev.org/c/opendev/system-config/+/77144512:01
openstackgerritGuillaume Chauvel proposed opendev/system-config master: [DNM] test comment width: review without autogenerated tag  https://review.opendev.org/c/opendev/system-config/+/77179812:01
*** jpena is now known as jpena|lunch12:31
fungibrinzhang: thanks for confirming!12:42
*** iurygregory_ has joined #opendev13:14
*** iurygregory has quit IRC13:15
*** lbragstad_ is now known as lbragstad13:16
*** tosky has quit IRC13:18
*** iurygregory_ is now known as iurygregory13:24
*** tosky has joined #opendev13:24
*** jpena|lunch is now known as jpena13:25
*** lbragstad has quit IRC13:35
*** lbragstad has joined #opendev13:37
*** ykarel has quit IRC13:56
*** hasharLunch is now known as hashar14:06
openstackgerritGuillaume Chauvel proposed opendev/system-config master: Increase autogenerated comment width to avoid line wrap  https://review.opendev.org/c/opendev/system-config/+/77144514:14
openstackgerritGuillaume Chauvel proposed opendev/system-config master: [DNM] test comment width: review without autogenerated tag  https://review.opendev.org/c/opendev/system-config/+/77179814:14
*** tosky has quit IRC14:29
*** tosky has joined #opendev14:33
openstackgerritJeremy Stanley proposed opendev/git-review master: Test/assert Python 3.9 support  https://review.opendev.org/c/opendev/git-review/+/77258914:48
*** brinzhang_ has joined #opendev14:52
*** brinzhang_ has quit IRC14:53
*** brinzhang_ has joined #opendev14:53
*** brinzhang has quit IRC14:54
*** tosky has quit IRC15:15
*** tosky has joined #opendev15:15
*** tosky_ has joined #opendev15:18
*** tosky is now known as Guest4362915:19
*** tosky_ is now known as tosky15:19
*** Guest43629 has quit IRC15:21
*** fressi has left #opendev15:22
*** lpetrut has quit IRC15:29
*** redrobot has quit IRC15:30
*** redrobot has joined #opendev15:30
*** brinzhang_ has quit IRC15:32
*** brinzhang_ has joined #opendev15:33
*** ysandeep|ruck is now known as ysandeep|dinner15:42
openstackgerritClark Boylan proposed opendev/system-config master: Try to make gatling-git work with our test gerrit  https://review.opendev.org/c/opendev/system-config/+/77588316:05
*** whoami-rajat has quit IRC16:19
*** ysandeep|dinner is now known as ysandeep|away16:21
*** LowKey has joined #opendev16:26
openstackgerritGuillaume Chauvel proposed opendev/system-config master: Increase autogenerated comment width to avoid line wrap  https://review.opendev.org/c/opendev/system-config/+/77144516:36
openstackgerritGuillaume Chauvel proposed opendev/system-config master: [DNM] test comment width: review without autogenerated tag  https://review.opendev.org/c/opendev/system-config/+/77179816:36
*** dtantsur is now known as dtantsur|afk16:38
*** mlavalle has joined #opendev17:10
openstackgerritClark Boylan proposed opendev/system-config master: Try to make gatling-git work with our test gerrit  https://review.opendev.org/c/opendev/system-config/+/77588317:19
*** fbo is now known as fbo|off17:23
clarkbguillaumec: note if you are writing that plugin the copyright should be for you (or possibly your employer) but not the OSF17:26
clarkbI had assumed this was something android was already doing hence the request for attribution given the previous copyright line17:26
*** sgw has left #opendev17:26
*** zoharm has quit IRC17:34
*** rpittau is now known as rpittau|afk17:49
*** sgw has joined #opendev17:54
*** jpena is now known as jpena|off17:55
*** hashar has quit IRC18:07
clarkbnow that the zuul release is largely done, any reason to not try and build new ze/zm instances on focal? All of the currenty instances are z(e|m)XY.openstack.org. I think I can make z(e|m)XY.opendev.org and once they are up and running remove teh corresponding .openstack.org host18:09
clarkbthe nodepool launchers and and zk servers are also on my list but a bit more complicated as they are more stateful18:09
clarkbfor the launchers I think we may need to put the old on in emergency and disable it by hand, then land .opendev.org addition change and when ahppy remove the old .openstack.org launcher18:10
*** hamalq has joined #opendev18:10
clarkbfor zk I think we can rotate servers out and add new ones in without downtime. The gotcha there is going to be the configs that tell zuul and nodepool and zk itself where the zk servers are18:10
clarkbnot sure what that will look like yet18:11
guillaumecclarkb, "ze/zm instances on focal", perhaps this: https://review.opendev.org/c/opendev/gear/+/74128818:11
clarkbguillaumec: I think we'll be ok since gear runs in a container on the scheduler (whcih means upadting the scheduler too should be fine), but that is a good thing to be aware of18:14
*** mlavalle has quit IRC18:21
*** _mlavalle_1 has joined #opendev18:21
clarkbI'll start with a zm01.opendev.org since that should have the smallest impact if something like gear is unhappy18:21
openstackgerritOleksandr Kozachenko proposed zuul/zuul-jobs master: Revert "Revert "Update upload-logs roles to support endpoint override""  https://review.opendev.org/c/zuul/zuul-jobs/+/77667718:22
fungiyeah, seems like we should be all set to start doing system upgrades18:26
*** sboyron has joined #opendev18:33
*** ralonsoh has quit IRC18:35
*** sboyron has quit IRC18:37
openstackgerritGuillaume Chauvel proposed opendev/system-config master: [DNM] test comment width: review without autogenerated tag  https://review.opendev.org/c/opendev/system-config/+/77179818:43
openstackgerritClark Boylan proposed openstack/project-config master: Add an nl01.opendev.org config  https://review.opendev.org/c/openstack/project-config/+/77697918:51
openstackgerritClark Boylan proposed opendev/system-config master: Update test platform for zm and ze servers  https://review.opendev.org/c/opendev/system-config/+/77698018:54
openstackgerritClark Boylan proposed opendev/system-config master: Update nodepool launchers to focal in testing  https://review.opendev.org/c/opendev/system-config/+/77698118:54
openstackgerritClark Boylan proposed opendev/system-config master: Update zookeeper servers in testing to focal  https://review.opendev.org/c/opendev/system-config/+/77698218:54
openstackgerritClark Boylan proposed opendev/system-config master: Update zuul01 to focal in testing  https://review.opendev.org/c/opendev/system-config/+/77698318:54
clarkbthere are enough moving parts in ^ that I'm sure I will get something wrong.18:54
clarkbI'll also WIP https://review.opendev.org/c/opendev/system-config/+/776980 to prevent it from merging until we are ready18:54
openstackgerritClark Boylan proposed opendev/system-config master: Add new zm01.opendev.org server  https://review.opendev.org/c/opendev/system-config/+/77698418:59
openstackgerritClark Boylan proposed opendev/zone-opendev.org master: Add zm01.opendev.org to DNS  https://review.opendev.org/c/opendev/zone-opendev.org/+/77698519:05
clarkbI think these two changes are ready for review unless we want to wait for test results from https://review.opendev.org/c/opendev/system-config/+/776980 furst19:05
clarkbone thing I'm noticing is that there is a bit of config intertwining particularly for the gearman server specification that will be fun to untangle when we get around to updating zuul0119:06
clarkbanyway one step at a time, we'll start with the mergers and executors19:07
clarkband with zm01.opendev.org above :)19:08
fungii'm really stumped on https://review.opendev.org/77258919:15
fungii think something about the gerrit setup in the git-review functional jobs must be breaking the dev mode default creds19:16
fungii've even tried outright replacing the test_user/test_pass with admin/secret and not trying to create a new user, but when i try to post to a/accounts/self/sshkeys to write the ssh public key as admin/secret (like zuul's quickstart job does), i still get a 401 unauthorized19:17
fungiand if i comment out all the test cleanup so that the gerrit instance is left running after a test, i can't authenticate to its rest api as admin/secret either, so i don't think it's anything as simple as i've called requests wrong19:18
openstackgerritOleksandr Kozachenko proposed zuul/zuul-jobs master: Revert "Revert "Update upload-logs roles to support endpoint override""  https://review.opendev.org/c/zuul/zuul-jobs/+/77667719:21
openstackgerritGuillaume Chauvel proposed opendev/system-config master: [DNM] test comment width: review without autogenerated tag  https://review.opendev.org/c/opendev/system-config/+/77179819:50
clarkbhttps://review.opendev.org/c/opendev/system-config/+/776980/1 got a +1 from zuul so I think we should be good to proceed with https://review.opendev.org/c/opendev/system-config/+/776984 and https://review.opendev.org/c/opendev/zone-opendev.org/+/77698520:02
*** slaweq_ has quit IRC20:07
openstackgerritClark Boylan proposed opendev/system-config master: Try to make gatling-git work with our test gerrit  https://review.opendev.org/c/opendev/system-config/+/77588320:09
clarkbguillaumec: for the gear tls thing I wonder if ssl.create_default_context would be better?20:23
clarkbor just construct a mask for TLS 1, 1.1 and 1.2?20:29
clarkbhrm it doesn't appear to be a mask20:31
clarkb1 is value 3, 1.1 is value 4 and 1.2 is value 520:32
openstackgerritGuillaume Chauvel proposed opendev/gear master: Create SSL context using PROTOCOL_TLS: highest supported version  https://review.opendev.org/c/opendev/gear/+/74128820:34
guillaumecjsut rebased to recheck :) ,  no idea atm20:34
*** andrewbonney has quit IRC20:39
clarkbpoking around at it I think create_default_context may be the closest thing to what we're looking for without just requiring tlsv2. However, I suppose that may also be an option20:41
clarkbmaybe with a fallback eg if ssl.PROTOCOL_TLS is defined use that else use ssl.PROTOCOL_TLSv1_220:41
clarkbfungi: corvus: ianw: have time for https://review.opendev.org/c/opendev/system-config/+/776984 and https://review.opendev.org/c/opendev/zone-opendev.org/+/776985 ? I can keep an eye on that and then also cleanup zm01.openstack.org assuming zm01.opendev.org joins the cluster happily20:43
fungiyeah, i can take a peek20:48
*** LowKey has quit IRC20:51
*** LowKey has joined #opendev20:51
clarkbthen I expect I'll try to get through all of the mergers then look at the executors next. The executors are a bit more complciated as their disk layout is not quitedefault, but other than that I expect it to be very similar to the mergers20:54
fungiwent ahead and approved those, they should be entirely safe on their own20:55
clarkbthanks20:55
ianwo/20:59
ianwi can help out with some upgrades if time is getting away20:59
clarkbhasn't gotten away yet :) I'll let you know if that chagnes later today20:59
clarkbat this point the server is up and just need to land those changes, have them run config management against the server then see what it looks like after the fact21:00
*** eolivare has quit IRC21:01
ianwfungi: unless you've started with DEVELOPMENT_BECOME_ANY_ACCOUNT i don't think you'll have the admin user21:03
ianwi guess https://review.opendev.org/c/opendev/git-review/+/772589/17/git_review/tests/__init__.py used to "fake" this by directly inserting a user into the db21:03
openstackgerritMerged opendev/zone-opendev.org master: Add zm01.opendev.org to DNS  https://review.opendev.org/c/opendev/zone-opendev.org/+/77698521:03
fungiianw: yeah, it does set auth.type=DEVELOPMENT_BECOME_ANY_ACCOUNT in the config for each "site" but i've also tried to go farther and add --dev to the gerrit init command line for when it sets up the "golden site" as well as using sed to inline replace the OPENID with DEVELOPMENT_BECOME_ANY_ACCOUNT in its default config too21:07
fungii'm assuming none of that is sufficient though21:07
fungibecause it still seemed to make no difference in the end21:07
*** slaweq_ has joined #opendev21:09
ianwfungi: are we talking about line 162 there?21:09
fungiyeah, i've tried adding --dev to that21:11
fungiwhich in theory should be happening when the database deson't exist yet21:12
openstackgerritMerged opendev/system-config master: Add new zm01.opendev.org server  https://review.opendev.org/c/opendev/system-config/+/77698421:12
ianwfungi: you'll definitely need that i think -- that is what i found with https://opendev.org/opendev/system-config/commit/d1694d4c98eabc5fda7ea4a38334fdffa6be2cc921:12
fungiso the check in gerrit-pgm/src/main/java/com/google/gerrit/pgm/init/InitAdminUser.java for existing users list being empty should still succeed21:12
ianwi suspect either the --dev or the DEVELOPMENT_BECOME_ANY_ACCOUNT somehow isn't being applied21:12
fungigit_review/tests/utils.py is what adds DEVELOPMENT_BECOME_ANY_ACCOUNT to the config and has been there since ages, but apparently wasn't doing anything21:13
ianwon line 165, it seems to have an extra "download-commands" on the end of the init?21:15
openstackgerritJeremy Stanley proposed opendev/git-review master: Test/assert Python 3.9 support  https://review.opendev.org/c/opendev/git-review/+/77258921:15
fungiianw: that's ^ what i've been trying locally21:16
fungilooks like download-commands was added by https://review.openstack.org/93951 in 201421:17
fungii can remove it, the commit message seems to imply it was needed for plugin stuff21:17
ianwyeah, i can't see that it takes any arguments, i wonder if it's making your init silently fail?21:18
fungii'm trying locally with it stripped out. if that was it i'm gonna scream21:19
fungis/scream/start drinking early/21:19
fungiStdout: fatal: Option "--install-plugin" takes an operand21:19
fungii guess that's saying install the download-commands plugin?21:19
ianwoh, ok, the argument before it21:20
ianwstill, i don't think you need it21:20
fungii'll drop --install-plugin too in that case21:20
openstackgerritClark Boylan proposed opendev/system-config master: Try to make gatling-git work with our test gerrit  https://review.opendev.org/c/opendev/system-config/+/77588321:20
clarkbeventually I'll figure out the java ssh stuff21:21
fungimmm... i still get 401/Unauthorized21:21
ianwfungi: umm, is it setting DEVELOPMENT_... *after* init?21:23
fungiianw: that was a belt and suspenders hail mary21:24
fungithe init includes --dev which should have the same effect21:24
fungiat least based on my reading of the docs21:25
ianwumm, i seem to have a strong memory that it does not21:25
ianwyou need both21:25
ianwi think line 166 needs to move above the init21:25
fungiwill that file exist before the init is run?21:27
fungiwe make the etc dir a few lines earlier so we can precreate the hostkey in it21:28
fungithe gerrit init seems to unpack the tree which includes the default config, then i stream edit that unpacked config just for added security since gerrit is started via the unpacked gerrit.sh script later which doesn't have any command-line option to tell it to start in --dev mode21:31
ianwhrm, i guess it's sufficiently different to the setup of our container that ... i don't know :)21:32
mordredianw: the upstream gerrit containers are quite a bit different than ours21:37
mordredthey do a bunch of things in a container start script that my OCD found annoying and wanted to do at build time :)21:38
ianwmordred: apropos nothing to do with that ... i hit an issue connecting gerrit to mariadb with key lengths fixed with  https://gerrit-review.googlesource.com/c/gerrit/+/297522  ... anyway given your skills in db optimisation might be of interest21:40
ianwas far as i can tell, all the upstream gerrit CI has failed for reasons not related to the change ...21:41
mordredianw: the patch looks good. out of curiosity, how did that come up?21:42
ianwmordred: i'm looking at connecting us up to a container ... for several of the reasons presented in https://review.opendev.org/c/opendev/system-config/+/775961 :)21:42
ianwi've run some prelim memory stats on that too, importing the production tables it sits @21:43
ianwdf82e25bbb8c   gerrit-compose_mariadb_1   0.09%     462.6MiB / 7.61GiB21:43
ianwsimilar after dumping the whole db.  i'm not sure if maybe it sets its limits % wise based on total memory though, so putting it in a 64gb machine *may* have other affects21:44
mordredianw: ++ cool. well - you mention it in the commit message, but this is likely one of the few cases where just using h2 would be fine - the data is "neat" but if it all DIAF nobody would care21:44
clarkbmordred: we were explicitly told to not use h221:44
mordredbut also - just spinning up a local mariadb container seems great!21:44
clarkbbecause the performance is terrible I think it impacts user web browsing as it wants to update that for each file you look at21:44
clarkbI'm still very skeptical a local mariadb will work for us given the memory situation, but this sort of investigating can help us evaluate that21:45
mordrednod. I mean - gerrit also uses h2 for several of the caches21:45
mordredso I've been considering it a reasonable choice for stuff that has cache-level data importance in my brainhole21:46
mordredbut I don't have strong opinions - mostly that that table is super annoying :P)21:46
clarkbya its definitely not the most important data21:46
clarkbbut luca specifically warned against it for performance reasons21:46
mordrednod21:47
mordredpaying attention to luca is important21:47
mordredso - the table as it stands now is 460M ?21:47
ianwno, but that's the container memory stats after importing it and dumping it21:48
clarkbI think the raw data is closer to 5GB21:48
clarkbbut I haven't looked at it recently21:48
mordredoh - gross21:48
ianwwhich i figure has read/written the whole thing, so probably reflects about worst case21:48
ianw(sorry, gotta run kids to school, back later)21:48
clarkbI think it tends to have very hot data and very cold data too21:49
ianwdf82e25bbb8c   gerrit-compose_mariadb_1   0.09%     67.32MiB / 7.61GiB21:49
clarkbbecause its going to be what people are actively reviewing at the moment21:49
ianwthat's the quiescent container just started21:49
ianwsmaller than i thought21:49
clarkbnot the sort of db where you'd expect the whole thing to need to be in memory at all times21:50
mordredyeah. and in innodb, it'll really just be the hot pages loaded21:53
mordredmostly poking there to think about innodb buffer pool size - in this case it could be pretty small I think21:55
ianwmordred: in the usual yak shaving way, this all actually started from https://bugs.launchpad.net/ubuntu/+source/mysql-5.7/+bug/1914695 where an update stopped making mysqldump --all-databases stop working against our very ancient instance ... you might know somebody who might care about that?22:04
openstackLaunchpad bug 1914695 in mysql-5.7 (Ubuntu) "mysqldump --all-databases not dumping any databases with 5.7.33" [Undecided,Fix released]22:04
ianwclarkb: changing the base job exposes us very widely to docker limit issues @ https://review.opendev.org/c/opendev/system-config/+/775051 ...22:07
ianwwe're almost at the point there where the unreliability of it makes it impractical to merge reliably22:09
clarkbianw:ya22:12
clarkbianw: in the latest run though it failed to stop the container for some reason22:14
clarkband the nested ara doesn't seem to have any std(out22:14
clarkber std(out|err) to explain why22:14
openstackgerritClark Boylan proposed opendev/system-config master: Try to make gatling-git work with our test gerrit  https://review.opendev.org/c/opendev/system-config/+/77588322:29
*** slaweq_ has quit IRC22:40
ianwclarkb: it looks like for graphite it couldn't load a certificate https://zuul.opendev.org/t/openstack/build/3094763cc5294a17beb6b03464120f24/log/graphite02.opendev.org/docker/graphite-docker_graphite_1.txt22:52
clarkbianw: fyi waiting zuul to update zm01.opendev.org and service-borg-backup failed. It is unhappy about ansible_ssh_host_key_ed25519_public being undefined22:53
ianwhrm, is that a bad match?  we shouldn't be running backup roles on zm hosts?22:54
ianwbad group match i mean22:54
clarkbianw: it wasn't trying to run on zm01, but the zm01 change triggered all the jobs because the inventory updated22:54
clarkbit was running against hte proper hosts I think22:55
ianwoh right, ok.  yeah something up, will look22:55
clarkbI wonder if that is a fact loading problem from one host to another22:55
clarkblike maybe we need a play in the playbook that just looks up facts fo the backup hosts first22:56
clarkbbefore doing anything on the other hsots22:56
ianwto the graphite job issue; it doesn't appear to collect acme.sh logs so i can't quite tell what went wrong.  i really need to work on running a container with the fake acme thing for testing purposes22:57
guillaumecianw, is it possible to pull an image from intermediate registry ?    I have different behavior for system-config-run-review-3.2 and system-config-run-review-3.3. I tried to get the console log but it doesn't seem possible. So I would like to debut run the docker image myself22:59
clarkbguillaumec: it is the images are listed in the job artifacts list23:00
clarkbfor the build I think23:00
ianwguillaumec: yes it should be an artifact, e.g. https://zuul.opendev.org/t/openstack/build/dd8a7627904045bdae8296a01cd05945/artifacts23:00
guillaumecah nice, never clicked on that tab yet :)  good to know23:01
ianwi was thinking it would be cool to make the zuul summary tab, for each job be a "clickdown" arrow (i don't know what the real term for that is) that then queries zuul and shows the job artifacts23:02
ianwif it was async and only queried when clicked, it wouldn't be too much load on the zuul web side23:02
ianwit's a little above my current polymer/javascript/gerrit-plugin skillset though :)23:03
ianwguillaumec: although it's some work to get it setup to the point that it has users and projects and changes.  if you make a failing job with your change, i can arrange to put it on hold for you.  that way zuul does all that setup23:05
fungiianw: not sure if you saw me mention yesterday, but gitea01 and translate01 have been sending backup failure messages to us23:12
fungii haven't had time to dig into logs, the e-mails don't really provide detail23:13
clarkbzm01.opendev.org has finished ansibling. Things look about right, but we don't auto start the service. I will start it now. Then if it looks happy stop the merger on zm01.openstack.org23:13
clarkbit appears to have started happily and is awaiting jobs23:14
clarkbI guess we have 7 other dedicated mergers so I should be fine to stop zm01.openstack.org now23:15
clarkbI'll do that23:15
clarkbthats done. Assuming everything continues to look happy I'll plan to try and do the other 7 tomorrow23:16
clarkbprobably just do them all in one batch? they are small servers so doing them like that shouldn't be a huge deal23:16
fungido we need to open gearman access for it in the scheduler's iptables rules?23:17
clarkbwe shouldn't, that is all handled by the iptables ansibel role23:17
clarkb2021-02-22 23:14:19,982 DEBUG zuul.BaseMergeServer: Registering 4 jobs <- that implies to connected to gearman just fine23:17
clarkb*it connected to23:17
fungiahh, great23:18
clarkbupdating the zuul scheduler itself becomes problematic for this reason though23:18
clarkbas everything needs to connect to it23:18
clarkbhowever, I think we may be able to get away with it since we'll give it a dns alias for the old openstack.org name.23:18
clarkbwhcih means we can redeploy it, switch things over in dns, restart other services, they connect with the old name, then update services to use the new name23:19
clarkbI've been puzzling through some of that today and definitely don't have all the answers yet, but it should be doable23:19
clarkbnow I just need to see zm01.opendev.org handle some jobs23:19
ianwfungi: ok, will look into23:28
ianwi'm just pulling up the backup logs for the other issue now23:28
clarkbI am having the hardest time convincing jsch to ssh to gerrit properly23:38
clarkbit says a bunch of "closing immediately" on the client side23:39
openstackgerritGuillaume Chauvel proposed opendev/system-config master: [DNM] test comment width: review without autogenerated tag  https://review.opendev.org/c/opendev/system-config/+/77179823:40
clarkbnow it is almost like 29418 isn't listening23:41
openstackgerritGuillaume Chauvel proposed opendev/system-config master: [DNM] test comment width: review without autogenerated tag  https://review.opendev.org/c/opendev/system-config/+/77179823:41
clarkbya got a bunch of connection refused23:42
* clarkb waits for the 3.2 run to finish to check its logs23:42
*** DSpider has quit IRC23:43
fungilocal container-to-container networking?23:43
fungior host-to-container?23:43
clarkbfungi: possibly, though I've split the job up to run the http pass first then do ssh (since it was hard to read the logs when they were all mixed together. And http was fine23:43
clarkbalso I'm using host networking to avoid these problems23:43
clarkband it should all be connections to localhost so the firewall shouldn't be getting in the way23:44
fungiahh, okay, so probably not23:44
clarkbzm01.opendev.org has found some jobs \o/ it looks fine too23:46
clarkbwe dno'23:47
clarkber23:47
clarkbwe don't need to preclone repos like nova do we?23:47
clarkbwe fixed that timeout problem right?23:47
fungiwe doubled the timeout to, like, 60023:49
guillaumecianw, understood why, gerrit has merged https://gerrit-review.googlesource.com/c/gerrit/+/296851 (80ch to 120ch) which is stable-3.2 and it's not in 3.3 atm,  and sample reviews were not longer than 120 characters :)23:49
clarkbin the 3.2 case most pushes failed, but most clones and pulls succeeded23:51
clarkbthat at least is progress23:51
clarkbI wonder if the http pass is polluting the repo and that is causing: Ref master already exists - null23:52
clarkbI guess I can try it with another psuh specific repo23:52
* clarkb writes that up23:52
openstackgerritClark Boylan proposed opendev/system-config master: Try to make gatling-git work with our test gerrit  https://review.opendev.org/c/opendev/system-config/+/77588323:54
ianwservice-borg-backup.yaml.log.2021-02-22T22:47:02Z failed with 'ansible.vars.hostvars.HostVarsVars object' has no attribute 'ansible_ssh_host_key_ed25519_public', but service-borg-backup.yaml.log.2021-02-22T07:10:24Z did not23:58
ianwi.e. sometimes it seems ansible is not populating the hostvars of the backup servers with the ssh_host_key info23:58
clarkbianw: ya my previous comments were assuming it was something along those lines. I think that this has to do with how ansible loads facts for other hosts23:59

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!