Thursday, 2019-04-25

openstackgerritClark Boylan proposed zuul/zuul master: Make paused status bar blue  https://review.opendev.org/65558800:02
openstackgerritClark Boylan proposed zuul/zuul master: Tiny cleanup in change panel js  https://review.opendev.org/65558900:02
clarkbI think that should make the paused bar blue and I think I found a valid cleanup too00:02
*** Weifan has joined #openstack-infra00:03
fungiooh, neat, we get e-mail from the docker registry gc cron00:03
fungii hadn't noticed these before just now00:03
corvushopefully it didn't just delete an orphan layer from one of our jobs00:04
*** jamesmcarthur has quit IRC00:04
*** dave-mccowan has quit IRC00:05
fungiblobs /docker/registry/v2/blobs/sha256/78/78b43f9b8784fa68dd97dcf055ac03735d10f8a78ca52b5e0d174f1aacf984d5 and /docker/registry/v2/blobs/sha256/d9/d9b93e201cee5b09d7af25ba9ba5e420f54b0581e32cc91e967c1db5e2562b15 it says00:07
fungiif those mean anything to anybody00:07
corvuswell the good news is that zuul-upload-image succeeded00:09
corvusthe bad news is that zuul-upload-image succeeded so we don't get to find out what's up with the EOF thing.  also tox-py36 failed so it didn't merge.00:10
corvusthe good news is since it didn't merge, we have another convenient test change00:10
corvusi'll re-enqueue both again00:10
*** slaweq has joined #openstack-infra00:11
*** diablo_rojo has quit IRC00:21
*** dikonoor has joined #openstack-infra00:21
*** slaweq has quit IRC00:24
*** gyee has quit IRC00:24
*** Goneri has quit IRC00:30
*** jamesmcarthur has joined #openstack-infra00:33
mriedemi just wanted to say, we'll probably have a working multi-cell ci job for nova by tomorrow https://review.opendev.org/#/c/655222/ and it was surprisingly easy due to zuulv3 and the zuulv3-ification of the devstack/tempest jobs by the QA team (andreaf/gmann/others)00:35
mriedemsdague said a couple of years ago when talking about enabling a multi-cell job with devstack-gate something along the lines of "i think we've reached our limits for this kind of thing with d-g and will be needing to ansible this all up to be sane"00:36
*** igordc has quit IRC00:40
*** mattw4 has quit IRC00:42
*** yamamoto has joined #openstack-infra00:46
*** yamamoto has quit IRC00:47
*** yamamoto has joined #openstack-infra00:48
*** rlandy|bbl is now known as rlandy00:48
*** michael-beaver has quit IRC00:49
*** jamesmcarthur has quit IRC00:50
*** Weifan has quit IRC00:51
*** jamesmcarthur has joined #openstack-infra00:52
*** jamesmcarthur has quit IRC00:53
*** jamesmcarthur has joined #openstack-infra00:54
*** jamesmcarthur has quit IRC00:56
*** jamesmcarthur has joined #openstack-infra00:56
*** whoami-rajat has joined #openstack-infra01:02
*** ricolin has joined #openstack-infra01:05
*** diablo_rojo has joined #openstack-infra01:07
*** armax has quit IRC01:10
*** mriedem has quit IRC01:11
*** slaweq has joined #openstack-infra01:13
*** slaweq has quit IRC01:24
openstackgerritmelanie witt proposed openstack/project-config master: Fix ceph failure rate grafana dashboard  https://review.opendev.org/65559101:32
*** markvoelker has quit IRC01:34
*** ykarel|away has joined #openstack-infra01:37
*** smarcet has joined #openstack-infra01:37
*** diablo_rojo has quit IRC01:49
openstackgerritmelanie witt proposed openstack/project-config master: Fix ceph failure rate grafana dashboard  https://review.opendev.org/65559101:50
*** apetrich has quit IRC01:57
*** jamesmcarthur has quit IRC02:00
*** jamesmcarthur has joined #openstack-infra02:01
*** jamesmcarthur has quit IRC02:02
*** jamesmcarthur has joined #openstack-infra02:02
*** ykarel|away has quit IRC02:06
*** slaweq has joined #openstack-infra02:11
*** smarcet has quit IRC02:14
*** smarcet has joined #openstack-infra02:16
*** Emine has quit IRC02:16
*** slaweq has quit IRC02:24
*** dikonoor has quit IRC02:25
*** Weifan has joined #openstack-infra02:32
*** ykarel|away has joined #openstack-infra02:34
*** Weifan has quit IRC02:35
*** smarcet has quit IRC02:36
openstackgerritTrinh Nguyen proposed openstack/project-config master: Migrate telemetry to storyboard  https://review.opendev.org/65560002:38
*** armax has joined #openstack-infra02:41
*** ricolin has quit IRC02:46
rm_workhey, anyone know what the senlin channel is?02:51
rm_workoh wait i can look it up on the wiki prolly02:51
rm_workahh, just #senlin02:51
*** hongbin has joined #openstack-infra02:51
*** jamesmcarthur has quit IRC02:52
*** jamesmcarthur_ has joined #openstack-infra02:53
*** armax has quit IRC02:53
*** bgmccollum has quit IRC02:59
*** bgmccollum has joined #openstack-infra03:04
*** bhavikdbavishi has joined #openstack-infra03:05
*** smcginnis has quit IRC03:06
*** bgmccollum has quit IRC03:07
*** bhavikdbavishi1 has joined #openstack-infra03:08
*** bhavikdbavishi has quit IRC03:09
*** bhavikdbavishi1 is now known as bhavikdbavishi03:09
*** auristor has quit IRC03:11
*** dikonoor has joined #openstack-infra03:11
*** hongbin has quit IRC03:12
*** jamesmcarthur_ has quit IRC03:14
*** slaweq has joined #openstack-infra03:16
*** jamesmcarthur has joined #openstack-infra03:16
*** roman_g has quit IRC03:18
openstackgerritJason Lee proposed opendev/storyboard master: WIP: Migrates Users to Blueprints, Writer Script WIP #2  https://review.opendev.org/65481203:21
*** bhavikdbavishi has quit IRC03:21
*** auristor has joined #openstack-infra03:24
*** slaweq has quit IRC03:24
*** ekultails has joined #openstack-infra03:28
*** ykarel|away is now known as ykarel03:30
*** ekultails has quit IRC03:35
*** markvoelker has joined #openstack-infra03:35
*** Nisha_Agarwal has joined #openstack-infra03:45
*** michael-beaver has joined #openstack-infra03:46
*** bhavikdbavishi has joined #openstack-infra03:46
*** bhavikdbavishi has quit IRC03:49
*** psachin has joined #openstack-infra03:52
*** udesale has joined #openstack-infra03:55
*** jamesmcarthur has quit IRC03:58
*** ricolin has joined #openstack-infra04:10
*** pcaruana has joined #openstack-infra04:11
*** slaweq has joined #openstack-infra04:13
*** ykarel is now known as ykarel|afk04:14
*** ykarel|afk has quit IRC04:19
*** slaweq has quit IRC04:24
*** ykarel|afk has joined #openstack-infra04:35
*** ykarel|afk is now known as ykarel04:36
*** sthussey has quit IRC04:42
*** pcaruana has quit IRC04:43
*** uberjay has quit IRC05:03
*** e0ne has joined #openstack-infra05:05
*** kukacz has quit IRC05:06
*** kukacz has joined #openstack-infra05:08
*** e0ne has quit IRC05:08
*** uberjay has joined #openstack-infra05:08
*** quiquell|off is now known as quiquell05:09
*** slaweq has joined #openstack-infra05:11
*** slaweq has quit IRC05:21
*** gagehugo has quit IRC05:23
*** slaweq has joined #openstack-infra05:27
*** gagehugo has joined #openstack-infra05:31
*** Nisha_Agarwal has quit IRC05:36
*** adriant has quit IRC05:38
*** adriant has joined #openstack-infra05:38
*** slaweq has quit IRC05:41
*** Nisha_ has joined #openstack-infra05:51
*** michael-beaver has quit IRC05:55
*** slaweq has joined #openstack-infra05:56
*** slaweq has quit IRC06:02
openstackgerritOpenStack Proposal Bot proposed openstack/project-config master: Normalize projects.yaml  https://review.opendev.org/65561506:04
*** igordc has joined #openstack-infra06:08
*** electrofelix has joined #openstack-infra06:10
*** slaweq has joined #openstack-infra06:11
*** ramishra has joined #openstack-infra06:14
*** slaweq has quit IRC06:16
*** pcaruana has joined #openstack-infra06:20
*** iurygregory has joined #openstack-infra06:34
*** yamamoto has quit IRC06:39
*** slaweq has joined #openstack-infra06:44
*** AJaeger has quit IRC06:47
*** quiquell is now known as quiquell|brb06:48
*** apetrich has joined #openstack-infra06:52
*** igordc has quit IRC06:53
*** ricolin has quit IRC06:53
*** evrardjp has quit IRC06:55
*** evrardjp has joined #openstack-infra06:55
*** AJaeger has joined #openstack-infra06:56
*** evrardjp has quit IRC06:58
*** evrardjp has joined #openstack-infra06:58
*** aaronsheffield has quit IRC07:02
*** amoralej|off is now known as amoralej07:04
*** ccamacho has joined #openstack-infra07:08
*** ccamacho has quit IRC07:09
*** ccamacho has joined #openstack-infra07:10
*** dpawlik has quit IRC07:14
*** yamamoto has joined #openstack-infra07:16
*** happyhemant has joined #openstack-infra07:18
openstackgerritMerged openstack/project-config master: Normalize projects.yaml  https://review.opendev.org/65561507:19
amorinhey all07:21
AJaegermorning amorin! We have some dns issues with ovh-bhs1 ;(07:23
amorinAJaeger: ok, can you tell me more? do you have any instance running affected, so I can checl?07:23
amorincheck*07:23
AJaegeramorin: could you come back during US time and ask, please? WE disabled it for now...07:24
*** zbr has joined #openstack-infra07:24
*** Nisha_ has quit IRC07:24
AJaegerThe issue we noticed is that git.openstack.org could not be resolved07:24
amorinAJaeger: no problem, I'll be there full day07:24
amorinok in the meantime, I will do some checks07:24
amorinalso, Shrews asked me to delete some instances from GRA107:24
amorinwhich were stuck in deleting or something like that07:25
AJaegerWE disabled last week Thursday, reenabled yesterday - and disabled again after this log: http://logs.openstack.org/69/655369/1/gate/kolla-build-ubuntu-source/509c989/job-output.txt.gz#_2019-04-24_18_43_31_07478707:25
amorinnice, thnaks07:25
AJaegerinfra-root, anybody around to work with amorin on OVH DNS problems?07:25
amorindo you have any idea of the /etc/resolv.conf file there?07:26
AJaegerno, sorry. fungi, mnaser and clarkb might be able to help better - but are in US/Canada timezones07:27
*** ricolin has joined #openstack-infra07:28
amorinok no problem07:28
*** yamamoto has quit IRC07:28
AJaegeramorin: http://eavesdrop.openstack.org/irclogs/%23openstack-infra/%23openstack-infra.2019-04-18.log.html#t2019-04-18T22:18:02 was from Thursday07:29
AJaegerclarkb said "dig git.openstack.org @1.1.1.1 works most of the time but I managed to get it to hang"07:30
AJaegeramorin: thanks for looking into that - hope you had a good long weekend!07:30
*** yamamoto has joined #openstack-infra07:31
*** slaweq has quit IRC07:31
*** slaweq has joined #openstack-infra07:32
amorinAJaeger: ok07:32
*** kjackal has joined #openstack-infra07:32
*** slaweq has quit IRC07:34
*** slaweq has joined #openstack-infra07:34
*** slaweq has quit IRC07:35
*** slaweq has joined #openstack-infra07:36
*** slaweq has quit IRC07:36
*** dpawlik has joined #openstack-infra07:36
*** slaweq has joined #openstack-infra07:37
*** slaweq has quit IRC07:38
*** bhavikdbavishi has joined #openstack-infra07:38
*** slaweq has joined #openstack-infra07:38
*** jpena|off is now known as jpena07:43
*** kopecmartin|off is now known as kopecmartin07:46
*** ykarel is now known as ykarel|lunch07:49
*** jpich has joined #openstack-infra07:55
*** rpittau|afk is now known as rpittau07:58
*** ramishra_ has joined #openstack-infra07:59
*** ramishra has quit IRC08:02
*** ramishra_ is now known as ramishra08:04
*** ralonsoh has joined #openstack-infra08:05
amorinfunny,I spwaned 20 instances, doing some dig git.openstack.org against 1.1.1.1 and some of them are failing like clarkb said, doing the same with 8.8.8.8, and all of them are working08:05
*** quiquell|brb is now known as quiquell08:07
*** e0ne has joined #openstack-infra08:08
*** e0ne has quit IRC08:08
*** e0ne has joined #openstack-infra08:08
amorin1.0.0.1 is working better08:10
amorincc AJaeger08:10
*** zhangfei has joined #openstack-infra08:12
*** derekh has joined #openstack-infra08:14
*** roman_g has joined #openstack-infra08:14
*** zbr is now known as zbr|rover08:18
AJaegeramorin: thanks.08:18
AJaegerNow I wonder whether we should change DNS server08:19
*** hrw has joined #openstack-infra08:24
hrwmorning08:24
hrwCan http://mirror.london.linaro-london.openstack.org/debian repos get signed? Debian Buster refuses to use them08:27
hrwhttp://logs.openstack.org/59/557659/37/experimental/kolla-build-debian-source-arm64/15cf546/logs/build/000_FAILED_base.txt.gz08:27
*** zigo has quit IRC08:27
*** gmann has quit IRC08:28
*** yamamoto has quit IRC08:29
*** yamamoto has joined #openstack-infra08:30
*** Lucas_Gray has joined #openstack-infra08:32
*** lucasagomes has joined #openstack-infra08:33
*** ykarel|lunch is now known as ykarel08:39
*** tkajinam has quit IRC08:54
*** rcernin has quit IRC08:58
*** tobias-urdin has quit IRC09:01
*** tobias-urdin has joined #openstack-infra09:17
*** dims has quit IRC09:20
AJaegerhrw: See https://opendev.org/zuul/zuul-jobs/src/branch/master/roles/configure-mirrors/templates/etc/apt/apt.conf.d/99unauthenticated.j2 - we disable the signing.09:21
AJaegerWhy is that not working for Debian?09:22
AJaegerhrw: we don't want to sign our mirror that is a non-trivial task and not really needed09:22
*** dims has joined #openstack-infra09:26
*** tobias-urdin has quit IRC09:27
*** Lucas_Gray has quit IRC09:29
*** yamamoto has quit IRC09:31
*** dims has quit IRC09:33
*** dtantsur|afk is now known as dtantsur09:33
*** dims has joined #openstack-infra09:34
*** tobias-urdin has joined #openstack-infra09:40
*** zbr|rover has quit IRC09:42
*** zbr has joined #openstack-infra09:43
*** dklyle has quit IRC09:50
*** dklyle has joined #openstack-infra09:50
*** kota_ has quit IRC09:51
*** kota_ has joined #openstack-infra09:52
*** yamamoto has joined #openstack-infra10:07
*** ccamacho has quit IRC10:11
*** yamamoto has quit IRC10:11
*** bhavikdbavishi has quit IRC10:20
*** yamamoto has joined #openstack-infra10:22
openstackgerritWill Szumski proposed openstack/pbr master: Fix white space handling in file names  https://review.opendev.org/62916110:22
*** yamamoto has quit IRC10:35
*** yamamoto has joined #openstack-infra10:48
*** dpawlik has quit IRC10:52
*** jpena is now known as jpena|lunch10:55
hrwAJaeger: ok11:05
hrwAJaeger: will check then what is going on there. thanks11:05
*** aaronsheffield has joined #openstack-infra11:08
*** yamamoto has quit IRC11:09
*** Lucas_Gray has joined #openstack-infra11:10
hrwAJaeger: Buster is not fine with that ;(11:12
*** rfolco|ruck is now known as rfolco|ruck|doct11:12
hrwAJaeger: 'Acquire::AllowInsecureRepositories' is new flag11:12
*** yamamoto has joined #openstack-infra11:21
*** Lucas_Gray has quit IRC11:24
*** dpawlik has joined #openstack-infra11:25
*** zhangfei has quit IRC11:29
*** zbr has quit IRC11:31
*** zbr has joined #openstack-infra11:32
*** yamamoto has quit IRC11:32
*** udesale has quit IRC11:33
*** yamamoto has joined #openstack-infra11:36
*** bhavikdbavishi has joined #openstack-infra11:44
*** yamamoto has quit IRC11:46
*** yamamoto has joined #openstack-infra11:49
*** dpawlik has quit IRC11:52
*** zigo has joined #openstack-infra11:52
*** panda is now known as panda|lunch11:59
*** yamamoto has quit IRC12:00
*** rh-jelabarre has joined #openstack-infra12:03
*** fresta has quit IRC12:07
*** jcoufal has joined #openstack-infra12:12
*** markvoelker has quit IRC12:14
*** markvoelker has joined #openstack-infra12:15
*** dpawlik has joined #openstack-infra12:15
*** rlandy has joined #openstack-infra12:20
*** derekh has quit IRC12:23
*** tosky has joined #openstack-infra12:25
*** vabada has joined #openstack-infra12:28
*** kranthikirang has joined #openstack-infra12:29
*** altlogbot_1 has quit IRC12:32
*** yamamoto has joined #openstack-infra12:36
*** lseki has joined #openstack-infra12:37
*** yamamoto has quit IRC12:38
*** nicolasbock has joined #openstack-infra12:38
*** yamamoto has joined #openstack-infra12:38
*** altlogbot_3 has joined #openstack-infra12:38
fungialso, not signing the indices on our mirrors helps dissuade folks from relying on them outside our ci system12:43
fungiwe used to have problems with people pointing their *production* machines at our package mirrors and then freaking out when we would take them down for maintenance or rename/move them12:44
*** derekh has joined #openstack-infra12:44
fungiamorin: that sounds like basically what we were experiencing too, yes. i'm guessing some routing issue between bhs1 and those netblocks but really have no insight into that12:46
fungialso possible 8.8.8.8 and 1.1.1.1 have blocked access from there due to abuse, or are aggressively throttling because of the request volume they see from those address ranges12:47
*** altlogbot_3 has quit IRC12:47
*** altlogbot_0 has joined #openstack-infra12:48
*** kgiusti has joined #openstack-infra12:54
*** udesale has joined #openstack-infra12:56
*** eharney has quit IRC13:00
amorinfungi: 8.8.8.8 and 1.0.0.1 were working perfectly this morning while I was testing13:01
amorinonly 1.1.1.1 was affected by those timeouts13:01
amorinI am also suspecting some abuse blocking system13:01
fungiamorin: oh, i misread when you said "doing the same with 8.8.8.8" and thought you meant it was failing the same as 1.1.1.1, but now i see you said they were all working13:01
amorinyup13:02
fungiso 1.1.1.1 is problematic from bhs1 but 8.8.8.8 is not, sounds like?13:02
amorinyes13:02
amorinis the DNS server a parameter on your side ?13:02
amorinbased on region?13:02
amorinI mean, can it be updated easily, only for BHS1?13:03
fungiit's set consistently across all places we upload, modulo selecting ipv6 over ipv4 when usable13:03
fungiwe could perhaps look into swapping 1.1.1.1 for 1.0.0.113:03
fungiclarkb: ^ if you end up being around this morning (i know you have travel to prepare for)13:04
*** dikonoor has quit IRC13:06
clarkbwe switched off opendns for similar reloability issuesiirc. Moving to 1.0.0.1 should be fine13:06
clarkbneed to update our images then the base job that cleans up the config13:06
*** ykarel is now known as ykarel|mtg13:08
openstackgerritDirk Mueller proposed opendev/system-config master: Add mirroring for Stein packages  https://review.opendev.org/65568613:09
amorinfungi: clarkb ok13:09
*** smcginnis has joined #openstack-infra13:10
openstackgerritJeremy Stanley proposed openstack/project-config master: Switch from 1.1.1.1 to 1.0.0.1  https://review.opendev.org/65568713:16
fungiclarkb: amorin: AJaeger: ^13:16
fungipart 113:17
*** rfolco|ruck|doct is now known as rfolco|ruck13:18
openstackgerritJeremy Stanley proposed opendev/base-jobs master: Switch from 1.1.1.1 to 1.0.0.1  https://review.opendev.org/65568913:20
fungiand part 2 ^ which i'll wip awaiting image rebuilds/uploads13:20
*** ricolin has quit IRC13:21
*** mriedem has joined #openstack-infra13:22
*** altlogbot_0 has quit IRC13:23
*** michael-beaver has joined #openstack-infra13:24
*** altlogbot_2 has joined #openstack-infra13:27
*** altlogbot_2 has quit IRC13:28
AJaegerthanks, fungi13:30
mordredfungi: +2 on both13:30
*** altlogbot_0 has joined #openstack-infra13:32
*** eharney has joined #openstack-infra13:41
*** altlogbot_0 has quit IRC13:42
openstackgerritMerged openstack/project-config master: Switch from 1.1.1.1 to 1.0.0.1  https://review.opendev.org/65568713:47
fungizuul scheduler memory utilization looks way better since yesterday's restart13:47
fungihttp://cacti.openstack.org/cacti/graph.php?action=view&local_graph_id=64792&rra_id=all13:48
tinwoodhello.  So after the recent changes, the renamed x/nova-lxd project is no longer pushing to github.com/openstack/nova-lxd ... is it/should it be going somewhere else or do I need to config something? thanks13:52
fungitinwood: it's updating on https://opendev.org/x/nova-lxd but we have a script which we can use to transfer https://github.com/openstack/nova-lxd to another github organization of your choosing and there's a zuul job you can add to push new commits (and branches and tags) to it as soon as they merge13:54
fungithe org transfer is only actually necessary if you want github to set up transparent redirects so that old urls and git remotes will get forwarded to the new location there13:55
tinwoodfungi, is it "allowed" to still be mirrored at github.com/openstack/nova-lxd or does it need to be mirrored elsewhere? (i.e. not in the openstack org)?13:56
*** altlogbot_2 has joined #openstack-infra13:56
fungitinwood: the openstack tc has decided that going forward the openstack namespace will only be used for official openstack deliverable repositories, so any unofficial repos there will need to be transferred, deleted or archived13:57
fungitinwood: also, if you would like a custom namespace for it on opendev.org we can rename that independently of anything github-related, though we'll need to coordniate to do that after the ptg since i don't think any of us have time to do it in the next week-ish13:57
tinwoodfungi, right, that does make sense why it stopped then.  Thanks for that; I'll have a discussion in our charms group to work out where to put it.  it might go to openstack-charmers instead, but I'm not sure yet.  thanks very much for your help.  Is there an example of the job I will (eventually) need to add?13:58
clarkbara is using it, should be able to find it there14:01
mordredhttps://opendev.org/recordsansible/ara-web/src/branch/master/.zuul.yaml#L40-L5014:01
mordredtinwood: ^^14:01
fungiyeah, that's where i was hunting14:01
clarkbalso dmsimard sent email to the openstack-discuss list about how to set it up14:01
mordredalong with https://opendev.org/recordsansible/ara-web/src/branch/master/.zuul.yaml#L60-L6214:01
fungiaha, ara itself didn't have it yet but ara-web does14:01
mordredyeah14:01
tinwoodmordred, fungi that's brilliant; thanks.14:01
*** panda|lunch is now known as panda14:02
tinwood(also clarkb)!14:02
fungitinwood: to be clear, official openstack projects will be using something like that job to replicate to github in the near future too, but for ease of our migration maintenance we left their gerrit replication in place in the meantime14:02
tinwoodfungi, ok14:02
fungilevel playing field and all14:02
mordredyou're a level playing field14:03
tinwood:)14:03
mordredfungi, clarkb: how is that ara-web replication job working14:03
mordred?14:04
mordredit references ara_git_mirror_credentials which I don't see defined in the ara-web repo14:04
clarkbfungi: re zuul memory I reverted two suspect changes we could live without (leaving two other suspect changes we probably need) If memory holds up we will need to dig into those two changes that were reverted to figure out what is goig on14:04
mordredclarkb: ++14:04
clarkbmordred: I'm not sure. dmsimard should know14:05
mordredkk14:05
*** dpawlik has quit IRC14:06
fungipossible he hasn't finished adding them yet14:06
mordredgood point14:06
rpittauhi mordred, sorry to bug you, just wondering why https://review.opendev.org/655445 was not approved as the other backports. Asking because we're currently blocked by x/pyghmi and that's the last fix :/14:07
mordredrpittau: I don't know - I'm guessing it's just that corvus and dtroyer both hit +2 at the same time and so neither saw that they were second14:08
mordredclarkb: ^^ you have +A powers there14:08
rpittauoh.....14:08
clarkbI can take a look in a few14:09
rpittauthanks!14:09
clarkbbut then Im doi g yardwork and running errands and packing and stuff14:09
dtroyerrpittau, clarkb: got it14:10
mordredyeah. it's not going to be the MOST productive couple of days14:10
clarkbthanks14:10
rpittauthanks all, much appreciated :)14:10
*** Goneri has joined #openstack-infra14:14
*** e0ne has quit IRC14:16
*** yamamoto has quit IRC14:18
corvushas anyone looked into the regitry failures?14:26
*** gmann has joined #openstack-infra14:26
*** armax has joined #openstack-infra14:26
corvusi'll start there14:28
AJaegercorvus: they fail randomly - I did a couple of rechecks, sometimes they passed and py35/py36 failed instead ;(14:28
corvusAJaeger: yeah, we put autoholds in place yesterday and they caught some nodes14:29
corvusso the next step is to identify if any of those are useful14:29
AJaegercorvus: cool!14:30
fungithe unit test failures looked like individual test timeouts, though i haven't yet had time to correlate them14:30
AJaegerand py36 is failing again on 654238 ;(14:31
corvusfungi: yeah, that's a known problem14:31
fungiargh14:31
AJaegerfungi, corvus, want to requeue again once everything is finished? Or how to move forward with these ?14:31
corvusthings got a lot worse with the multi-ansible work14:32
corvushere's one node we caught: http://logs.openstack.org/38/654238/3/gate/zuul-upload-image/221aaa2/ara-report/result/9bc3c6c6-3df7-4bd5-a895-a32b36247448/14:32
corvusthat's a new error14:32
fungiAJaeger: i promoted 654238,3 so it'll just restart testing14:32
*** jpena|lunch is now known as jpena14:33
fungiugly, but faster than alternatives (other than bypassing testing entirely)14:33
AJaegerfungi: thanks!14:33
dmsimardreading backlog14:36
dmsimardmordred, fungi, clarkb: according to zuul builds page, upload-git-mirror hasn't had the opportunity to run for ara-infra or ara-web yet14:38
dmsimardIt is entirely in the realm of possibility that I have not set them up properly14:38
dmsimardara works for sure, though14:38
fungidmsimard: where is that job added to the ara repo? i couldn't find it14:39
dmsimardthough I did notice something14:39
dmsimardwhen the opendev .gitreview patch was pushed, that didn't trigger the post pipeline14:39
fungiit wouldn't14:40
fungiwe didn't push those through gerrit14:40
fungithat would have taken faaaaar too long14:40
dmsimardfungi: it's in the feature/1.0 branch: https://opendev.org/recordsansible/ara/src/branch/feature/1.0/.zuul.d/zuul.yaml#L115-L17314:40
fungioh! got it. i was looking in master14:41
dmsimardmaster has been mostly frozen but I should probably make sure the job is there as well to prevent issues in the future14:41
fungitinwood: ^ see dmsimard's link for a better working example14:41
tinwoodfungi, thanks14:42
corvusfungi, mordred, AJaeger, clarkb: i'm going to start an etherpad to keep notes on the registry issue debugging14:43
corvushttps://etherpad.openstack.org/p/akSFrd8Oh714:43
fungithe ssh_key secret would be encrypted to your zuul project per https://docs.openstack.org/infra/manual/zuulv3.html#secret-variables14:44
fungitinwood: ^14:44
fungithanks corvus!14:44
tinwoodfungi, ah, I see. to be able to push to a nominated repo.14:46
fungiyep14:46
*** electrofelix has quit IRC14:48
*** electrofelix has joined #openstack-infra14:48
*** electrofelix has quit IRC14:49
openstackgerritBen Nemec proposed openstack/pbr master: Stop using pbr sphinx integration  https://review.opendev.org/65556514:51
corvusmordred: we might have logs from the registry that you could correlated with that error14:52
mordredcorvus: nod14:52
dtroyerI am trying to figure out what is causing a POST_FAILURE on a devstack job where devstack runs to completion: http://logs.openstack.org/42/655542/1/gate/flock-devstack-config/b5401da/  (from review https://review.opendev.org/#/c/655542/).  This has happened a couple of times this week and I'm not sure I know where to look.14:54
corvusmordred: the second error is the one we kept seeing yesterday14:55
corvusmordred: the third is the same as the first -- the bad blob14:59
*** iurygregory has quit IRC15:00
*** _erlon_ has joined #openstack-infra15:00
fungibtw, zuul-upload-image hit another post_failure on 654238,3: http://logs.openstack.org/38/654238/3/gate/zuul-upload-image/bbbabdf/15:01
fungire-promoting now15:01
*** sthussey has joined #openstack-infra15:03
*** Goneri has quit IRC15:05
dmsimardfwiw, just confirmed that upload-git-mirror still works for ara, will double check for ara-web and ara-infra15:06
*** ykarel|mtg is now known as ykarel15:06
dmsimardmordred: so the secret for ara_git_mirror_credentials should be redefined in both ara-web and ara-infra then ?15:07
fungidmsimard: yes15:07
fungidmsimard: if you want, jroll just sent a message to the openstack-discuss ml which mentions that job if you're keen to follow up with an example and indication that it's already working: http://lists.openstack.org/pipermail/openstack-discuss/2019-April/005629.html15:07
fungidtroyer: i think it's this: http://logs.openstack.org/42/655542/1/gate/flock-devstack-config/b5401da/ara-report/result/aeee74d0-e49f-4f36-9861-99711650c2cf/ "Timeout (32s) waiting for privilege escalation prompt:"15:11
dtroyerfungi: ok, thanks.  not that there is anything I can do about that (is there??) but I'm getting frustrated devs who just want to make jobs non-voting because of that…15:12
fungihere's where it appears in the console stream: http://logs.openstack.org/42/655542/1/gate/flock-devstack-config/b5401da/job-output.txt.gz#_2019-04-24_21_07_42_50853115:13
*** quiquell is now known as quiquell|off15:14
fungihttps://github.com/ansible/ansible/issues/14426 might be relevant15:15
fungiseeing some suggestions that it could be related to systemd deciding that services started on-demand by the login process are possibly broken and throttling them15:18
*** Goneri has joined #openstack-infra15:18
fungidmsimard: maybe you've seen this behavior before?15:19
*** igordc has joined #openstack-infra15:19
fungiunfortunately it's resulting in a failure to collect other logs from the problem node, so can't go digging in syslog to find out15:20
*** jamesmcarthur has joined #openstack-infra15:20
fungicould be that iterating over a massive list of files with that particular action is a bad idea and we should find a more efficient solution to that15:21
*** lpetrut has joined #openstack-infra15:25
*** jamesmcarthur has quit IRC15:32
*** jamesmcarthur has joined #openstack-infra15:34
fungilooks like if zuul-quick-start succeeds for 654238,3 we'll need to reenqueue 55491,2 (it's currently hit another unit test timeout)15:35
*** yamamoto has joined #openstack-infra15:35
*** liuyulong has quit IRC15:37
fungier, reenqueue 655491,2 i mean15:40
fungithough if it hasn't finished testing by the time the one ahead of it merges, i'll just promote it so we don't have to wait for it to report first15:40
*** fresta has joined #openstack-infra15:42
*** yamamoto has quit IRC15:42
*** ramishra has quit IRC15:42
*** efried has quit IRC15:43
*** efried has joined #openstack-infra15:43
pabelangerfungi: it looks like it might have passed15:45
fungiyeah, i'm watching the console stream for the registry upload15:45
fungiit's just wrapping up now15:45
openstackgerritMerged zuul/zuul master: Update references for opendev  https://review.opendev.org/65423815:45
fungithere we go15:45
pabelangerneed to look at zuul_console to see why we don't output LOOP tasks15:45
fungirestarting builds for 655491,2 now15:45
fungioh, looks like it ended up kicked out on a post_failure so need to reenqueue it15:47
fungiand done15:47
AJaeger\o/15:48
openstackgerritsebastian marcet proposed osf/openstackid-resources master: Updated validation for slides  https://review.opendev.org/65572415:53
openstackgerritMerged osf/openstackid-resources master: Updated validation for slides  https://review.opendev.org/65572415:54
*** jcoufal has quit IRC15:56
*** ykarel is now known as ykarel|away15:57
*** weshay|rover is now known as weshay15:59
openstackgerritsebastian marcet proposed osf/openstackid-resources master: Updated validation for slides  https://review.opendev.org/65572516:00
*** amoralej is now known as amoralej|off16:00
openstackgerritMerged osf/openstackid-resources master: Updated validation for slides  https://review.opendev.org/65572516:01
*** rpittau is now known as rpittau|afk16:05
*** ykarel|away has quit IRC16:08
AJaegerconfig-core, could you review  https://review.opendev.org/655591 and https://review.opendev.org/#/c/651668/ , please?16:08
imacdonnhi infra ... I need to ask a stupid question. Apologies in advance, but I've timed out searching for the answer....16:11
*** kjackal has quit IRC16:11
imacdonnso I have this cinder third party CI system .. it uses devstack master, but sets CINDER_BRANCH (in local.conf) to ${GERRIT_REFSPEC} for the change to be tested16:11
imacdonnthis no longer works, since the opendev migration (I guess?):16:12
imacdonn+ functions-common:git_timed:626           :   timeout -s SIGINT 0 git fetch https://git.openstack.org/openstack/cinder.git refs/changes/41/627941/3816:12
imacdonnfatal: Couldn't find remote ref refs/changes/41/627941/3816:12
fungiimacdonn: yes, for the moment refs/changes isn't being redirected to opendev.org (where git.openstack.org is redirected to)16:13
fungiyou would need to fetch refs/changes from https://review.opendev.org/p/openstack/cinder.git for now16:13
imacdonnfungi: OK .... but, I'm not doing that; devstack is ....16:14
fungiimacdonn: oh, there are recent fixes to devstack for that16:14
fungisome for stable branches may have just merged this morning16:14
fungii think the fixes were for that, at least16:14
imacdonnI'm using devstack master, though16:15
*** dklyle has quit IRC16:15
*** dklyle has joined #openstack-infra16:15
*** adriancz has quit IRC16:15
mordredwe have a WIP patch to fix the refs/changes/ thing in the gitea layer16:17
openstackgerritsebastian marcet proposed osf/openstackid-resources master: Updated validation for slides  https://review.opendev.org/65573316:17
clarkbalso CI systems are expected to set error on clone iirc16:17
clarkbbut maybe that is more a thing we do and less uni ersal16:18
*** jamesmcarthur has quit IRC16:18
*** sshnaidm|off has quit IRC16:18
*** pcaruana has quit IRC16:19
mordredcorvus: I added some notes to the etherpad - they may be largely useless16:19
*** quiquell|off has quit IRC16:20
imacdonnmordred: is it completely broken until then? i.e. no way to get refs/changes in devstack?16:20
fungiahh, got it, so devstack assumes CINDER_BRANCH will be a ref available on the origin remote, but some third-party ci systems pass the change ref as CINDER_BRANCH (or similar)?16:20
corvusgitea update:  the git clone redirect patch has merged upstream and should be included in 1.9.0.  i've opened a PR for refs/changes: https://github.com/go-gitea/gitea/pull/675816:20
mordredcorvus: but - tl;dr - error 1 seems to be "bad size" - and then 2 is an EOF from the buildset registry - this makes me think perhaps both problems are actually somehow linked to a flaky buildset registry and the intermediate registry is actually fine now16:20
*** Lucas_Gray has joined #openstack-infra16:20
mordredcorvus: should we go ahead and put yoru refs/changes patch into jeblair/1.8.0-opendev ?16:20
openstackgerritMerged osf/openstackid-resources master: Updated validation for slides  https://review.opendev.org/65573316:20
mordredcorvus: if that very-unsubstantiated-hypothesis holds, perhaps a retry: on the skopeo task would be worthwhile?16:21
corvusmordred: it takes 2 hours to replicate nova into gitea, even with my change.  so i think that will take some planning.16:21
mordredcorvus: nod16:21
*** igordc has quit IRC16:21
corvusmordred: yeah, maybe our socat thing isn't working?16:21
corvusthat seems surprising to me... it's so simple16:21
*** lucasagomes has quit IRC16:22
mordredcorvus: oh - right - because in this case localhost is actually socat16:22
fungirestarted gate testing for 655491,2 again due to yet more timeoutexceptions in unit tests16:22
mordredcorvus: but yeah - it could be read-issues over socat rather leading to EOF/data-validation16:22
imacdonnfungi: I wonder if I can force devstack by setting GIT_BASE <insert=chin-scratching-emoji>16:23
corvusmordred: these are held nodes -- we could hop on an executor, run socat, and do a bunch of skopeo copy to local disk operations16:23
mordredcorvus: on the other, out of curiosity - is that replicating all of nova AND refs/changes/* - or replicating refs/changes into gitea with nova-minus-refs/changes already there?16:23
corvusdid that make sense or should i use more words?16:23
mordredcorvus: I think that made sense16:23
corvusmordred: 'git push --mirror' -- everything16:23
fungiimacdonn: it may be worth bringing up with the devstack maintainers in #openstack-qa16:23
*** igordc has joined #openstack-infra16:23
corvusmordred: i believe the time is related to the number of refs, so i don't expect substantial improvement16:24
mordredcorvus: nod. I mean, I doubt it'll be significantly less - yeah16:24
imacdonnfungi: OK, thanks16:24
fungiimacdonn: i would personally expect CINDER_BRANCH to be separate from the change you're testing, and instead have your ci system attempt to merge the change ref into that branch before starting tests16:24
hrwfungi: I am fine with whatever policies infra has as it is not my job to change them but to obey them. (signed repos thing). Pointing out just a fact that current way used in zuul jobs applies for Debian:stretch (stable) and not good for Debian:buster (testing, soon stable)16:24
corvusmordred: because git-receive-pack still processes each ref and calls the gitea post-receive hook, even though with my change that hook now noops.16:24
corvusthat still takes (a very small amount of) time16:24
fungihrw: yep, understood that part, and thanks for bringing it to our attention16:25
hrwfungi: thanks16:25
imacdonnfungi: it's basically following https://wiki.openstack.org/wiki/Cinder/tested-3rdParty-drivers - at least as far as the CINDER_BRANCH part16:26
fungia very small amount of time multiplied by a very large amount of git objects16:26
imacdonnfungi: CINDER_REPO might be interesting ... I'm not currently setting that16:27
corvusmordred: you want to pick one of those nodes, grab it's ip and which executor originally was talking to it (we may as well make the test as close as possible) and set up a screen there?16:27
fungiimacdonn: yep, it's possible that document relied on convenience of assuming the origin also had a complete copy of unreviewed changes in gerrit16:27
mordredcorvus: yes - but I need to fix my coffee situation first - I will do that in about 5 minutes16:28
*** jpich has quit IRC16:28
corvusk16:28
fungii need to go find lunch, but if 655491,2 finally manages to merge i should be available again soonish to do executor restarts16:29
* fungi disappears for a bit16:29
hrwfungi, AJaeger: can I get pinged once that Debian apt thing gets implemented? will then resume work on adding arm64/debian ci job to kolla project.16:30
*** jamesmcarthur has joined #openstack-infra16:31
corvusmordred: i have a screen session running as root on ze0116:31
*** ykarel|away has joined #openstack-infra16:32
corvusmordred: i've chosen the last build/host in the list16:32
mordredcorvus: I am on it now - but still not 100% back16:33
corvusmordred: ack.  i've got 2 windows.  the first is now running socat16:34
corvusi copied this command: http://logs.openstack.org/91/655491/1/gate/zuul-upload-image/fbff568/ara-report/result/679c7f1d-676b-4610-befd-38553d43d192/16:34
corvusgetting ready to run skopeo in the second16:34
mordredk. I'm back - what's the switch-window command again?16:35
corvusmordred: c-a n16:35
mordredc-a something yeah?16:35
mordredawesome16:35
openstackgerritMerged openstack/project-config master: Fix ceph failure rate grafana dashboard  https://review.opendev.org/65559116:35
corvussweet!  instafail16:35
openstackgerritFatih Degirmenci proposed opendev/glean master: Sync when writing the file  https://review.opendev.org/65223816:35
mordredyay!16:36
corvus2019/04/25 16:35:50 socat[13207] E connect(5, AF=10 [2607:ff68:0100:0054:f816:3eff:fea9:51df]:5000, 28): Permission denied16:36
mordred2019/04/25 16:35:50 socat[13208] E connect(5, AF=10 [2607:ff68:0100:0054:f816:3eff:fea9:51df]:5000,|16:37
mordred 28): Permission denied16:37
mordred:)16:37
mordredyour paste was more fully formed than mine16:37
*** zbr is now known as zbr|rover16:37
corvusis it due to the firewall?  if so, how did this ever work?16:37
mordredmaybe? I've got this:16:38
mordredhttps://superuser.com/questions/282976/ssh-over-ipv6-got-permission-denied-error16:39
*** jcoufal has joined #openstack-infra16:39
mordredwhich isn't the same thing - but is similarly an ipv6 permission error with a socat reference in the answer16:39
mordredhttps://unix.stackexchange.com/questions/413278/socat-gives-error-read6-0xf97acc0-8192-permission-denied has no answers16:39
mordredwell - that telnet certainly didn't work16:40
*** pcaruana has joined #openstack-infra16:40
corvushow about i try opening up the port in iptables16:40
mordred++16:40
corvusis the buildset registry using host networking?16:41
corvuslooks like no:16:41
corvus    ports:16:41
corvus     - "5001:5000"16:41
mordredis there a reason we're not using host networking16:42
mordred?16:42
corvusso....16:42
mordredotherwise, do we need to tell docker to "expose" that port?16:42
* mordred goes tso read16:42
corvusmordred: yeah sorry that's what i meant by that paste -- we are telling docker to expose that port16:42
mordredah - gotcha16:43
corvusmordred: https://opendev.org/zuul/zuul-jobs/src/branch/master/roles/run-buildset-registry/tasks/main.yaml#L8616:43
corvusi see iptables rules for docker regarding port 5000; i do not see ip6tables rules for docker and port 500016:43
corvusthis is a limestone host so only has ip616:43
mordredwell - that would certainly explain that16:44
mordredI mean - lack of ip6 rules would certainly explain connection refused16:44
corvusperhaps this only works on ipv4 because of that?16:44
*** gyee has joined #openstack-infra16:44
corvusand we didn't notice this because it only worked on ipv4 because of the *other* docker problem16:44
mordredyeah16:44
mordredso we maybe need to also open 5000 for ip6 outside of docker - assuming it's not going to do *anything* with ip6 rules16:45
corvusyeah, let's start by opening this port, but i think there's a good chance that if docker isn't doing ipv6 port forwarding rules, it still won't work.16:46
corvusbut lets get to that point, and then see if the solution is something like "tell docker to ipv6 too" or "use host networking"16:46
mordredif we switch to host networking - does docker get out of the business of trying to do iptables rules?16:46
mordredyeah16:46
mordred++16:46
* corvus works on iptables incantations16:47
mordredcorvus: honestly, since it's port 5000 - host networking seems like it might be better anyway - less pieces in the middle of the chain16:47
mordredbut - I agree, one step at a time16:48
corvusmordred: hrm, that's progress on the iptables front16:49
mordredyes, I agree16:50
*** udesale has quit IRC16:50
corvusi added rules to v4 and v6; let me double check it was the v6 one that made the difference16:50
*** jpena is now known as jpena|off16:50
rpiosoAfter a change has been merged, can comments be sent to the owner and reviewers? Gerrit is permitting me to create draft comments.16:51
*** igordc has quit IRC16:52
corvusmordred: okay, i think we confirmed that the EOF issue is that ip6tables rule16:52
*** Goneri has quit IRC16:52
mordred\o/16:52
mordredcorvus: --src-tls-verify=false will disable tls verification16:52
imacdonnfungi: looks like CINDER_REPO=https://review.opendev.org/p/openstack/cinder.git made it all better ... thanks for helping me find my way ;)16:52
*** kopecmartin is now known as kopecmartin|off16:54
mordredcorvus: yes, that16:54
corvushrm, i wonder what the unauthorized is about16:55
*** mattw4 has joined #openstack-infra16:55
mordredyeah16:55
corvusthe buildset registry is reporting an auth error16:55
corvusha16:56
corvusthe password serious had a period in it16:56
mordredhahah16:56
mordredok - so that seemed to work much betterer16:57
corvusyeah16:57
corvusmordred: i'm really curious about the bad blob error16:57
mordredunfortunately it's not widely applicable enough to support my earlier hypothesis that the other error is from the same cause16:57
mordredyeah16:57
corvusmordred: that happened on limestone too, which should be in the same situation16:57
mordredbest I can tell it's length validation related16:57
corvusbut it also happened on gra1 which should be ipv4 i think?16:57
mordredcorvus: yeah - but also on ovh-gra116:57
mordredyeah16:57
corvusit's probably worth repeating this on one or both of those; which should we start with?16:58
mordredcorvus: I think ovh-gra1? wait ... how did we get a bad blob error on limestone at all?16:58
mordredwe shouldnm't have been able to connect16:58
corvusmordred: yeah, that's why i think that one is interesting too16:58
corvusi think maybe i want to start there16:59
corvussince it's more like changing one variable than two16:59
corvus(there == limestone)16:59
mordredyeah16:59
corvusthat job ran on ze06; i'm going to start a new screen session there17:00
corvusoops, wrong job17:01
corvusmake that ze0317:01
corvussorry, i may have exited a screen, but i've got one up now on ze0317:02
mordredI'm in17:02
corvusmordred: oh i think there's an error in the etherpad17:03
mordredyeah?17:03
corvusmordred: i think that first job ran in rax-org17:03
corvusord17:03
*** sshnaidm has joined #openstack-infra17:03
corvusmordred: double check me on that17:03
*** sshnaidm has quit IRC17:03
mordredcorvus: yes. I agree weith you17:04
mordredI'm not sure what I checked originally17:04
corvusmordred: that's great news!  it means that ipv6 errors with EOF and ipv4 errors with validation17:04
mordred++17:04
corvuslet's just continue with this host then17:04
corvusi'm in the next window now17:05
mordredyup17:05
corvusokay, i'm weirded out that iptables doesn't matter here but i'll just file that away17:05
mordred:)17:06
corvusi'm logging into the host to look at the registry logs17:06
mordredwell - doesn't docker properly set iptables for ipv4?17:06
corvusmordred: yeah, but i'm surprised it does so in a way that bypasses our ingress filters...17:06
corvusvery helpful of it17:06
corvusokay, so the registry saw my telnet as a failed tls handshake, that looks good17:07
corvusi'll try skopeo now17:07
mordred++17:07
corvusthat's annoying17:08
mordredyeah17:08
*** derekh has quit IRC17:08
mordredcorvus: so - the failed job was copying to the intermediate registry17:09
corvusoh17:09
mordredskopeo --insecure-policy copy docker://127.0.0.1:38143/zuul/zuul-executor:latest docker://insecure-ci-registry.opendev.org:5000/zuul/zuul-executor:221aaa239c584e3885f3a2e3371a7ee2_latest17:09
corvusderp17:09
mordredcorvus: also - pull from zuul-executor17:10
corvusmordred: oh thx17:11
mordredye17:11
mordredyes17:11
*** Lucas_Gray has quit IRC17:12
corvusgrumble17:12
mordredyeah17:12
corvusthat's probably going to keep working17:13
mordredwell - the copy is an idempotent operation ... maybe there's just occasional network derps and a retries would help it?17:13
corvusyeah.  i agree, retries:3 or however that's spelled might be called for here17:13
mordredsince it DID skip the ones that worked before and then we watched it transfer the one that broke17:13
mordredcorvus: yeah17:13
*** dtantsur is now known as dtantsur|afk17:14
corvusi'm looking at the buildset registry logs from the build17:14
mordredcorvus: I'll make a retries: patch so we've got it17:15
corvusmordred: maybe do that for both skopeos -- push and pull17:15
mordredyeah17:16
*** happyhemant has quit IRC17:17
corvusnothing's jumping out at me in the buildset registry log (though there are a bunch of expected errors (blob not found) and something could easily be hiding among them)17:18
*** ricolin has joined #openstack-infra17:20
*** ralonsoh has quit IRC17:20
openstackgerritMonty Taylor proposed zuul/zuul-jobs master: Add retries to skopeo copy operations  https://review.opendev.org/65573917:20
mordredcorvus: I *think* that's the right way to do that in a loop context17:21
mordreddmsimard: ^^ you're good with the ansibles, right?17:21
mordredoh - I guess I can do is success17:21
openstackgerritMonty Taylor proposed zuul/zuul-jobs master: Add retries to skopeo copy operations  https://review.opendev.org/65573917:22
*** Weifan has joined #openstack-infra17:24
openstackgerritJames E. Blair proposed openstack/project-config master: Allow incoming access to port 5000 on test nodes  https://review.opendev.org/65574117:24
corvusmordred: for the ipv6 issue... one way to address that would be to open the port on all our images/build nodes ^17:24
corvusmordred: or we could do it in the opendev buildset registry job17:25
corvussince that's opendev specific, i think we can encode information like that in it  (we can't put it in the role though)17:25
corvuslet me try that change to see how it looks17:26
*** psachin has quit IRC17:28
mordredcorvus: yeah. I think in the opendev buildset registry job17:28
*** Lucas_Gray has joined #openstack-infra17:28
openstackgerritJames E. Blair proposed opendev/base-jobs master: Open the firewall port for the buildset registry  https://review.opendev.org/65574417:32
corvusmordred: i think ^ should do it17:33
mordredcorvus: ++17:33
*** lpetrut has quit IRC17:34
corvusAJaeger, fungi: are you around to review https://review.opendev.org/655739 ?17:34
*** igordc has joined #openstack-infra17:35
openstackgerritsebastian marcet proposed osf/openstackid-resources master: Fixed multipart form request for slide updates (PUT)  https://review.opendev.org/65574517:35
*** jcoufal has quit IRC17:35
openstackgerritMerged osf/openstackid-resources master: Fixed multipart form request for slide updates (PUT)  https://review.opendev.org/65574517:37
AJaegercorvus: on it...17:42
AJaegercorvus: too slow ;(17:43
*** igordc has quit IRC17:43
*** ijw has joined #openstack-infra17:43
*** Lucas_Gray has quit IRC17:43
corvusAJaeger: how about https://review.opendev.org/655744 ? :)17:44
mordredinfra-root: I need to step away for a few - biab17:44
*** igordc has joined #openstack-infra17:45
*** Weifan has quit IRC17:46
*** ijw has quit IRC17:48
AJaegerpabelanger: why did you remove your +A from 655744? I can add mine - just wondering...17:49
*** ijw has joined #openstack-infra17:49
pabelangerAJaeger: I didn't see 2nd patch17:49
pabelangerwas just about to say I prefer 655744 over other17:49
pabelangerif other infra-roots wanted to look17:49
AJaegercorvus: do we need https://review.opendev.org/#/c/655741/ as well as 744?17:50
AJaegersorry, now caught up on reviews17:51
corvusAJaeger, no it's an alternate to 74417:51
corvusseems that so far everyone who has a preference like 744, so i think approving that now and wiping or abandoning the other is good.  if we decide we want the other, we can move to it, but it will take a few days to do.17:51
corvusthey are compatible (we merge the jobs change, and later the image change, and then remove the jobs change) if we wanted17:52
*** Weifan has joined #openstack-infra17:53
pabelanger+117:53
*** Weifan has quit IRC17:53
*** nicolasbock has quit IRC17:53
openstackgerritMerged zuul/zuul-jobs master: Add retries to skopeo copy operations  https://review.opendev.org/65573917:54
*** Weifan has joined #openstack-infra17:54
AJaegercorvus: great17:54
*** Weifan has quit IRC17:59
*** igordc has quit IRC18:00
*** hwoarang has quit IRC18:00
*** bhavikdbavishi has quit IRC18:00
openstackgerritMerged opendev/base-jobs master: Open the firewall port for the buildset registry  https://review.opendev.org/65574418:01
*** hwoarang has joined #openstack-infra18:01
*** nicolasbock has joined #openstack-infra18:02
corvuswe have 2 zuul changes in check that could exercise that; i think 655474 is too old to get that, but i'm optimistic that 655491 will18:04
*** igordc has joined #openstack-infra18:06
*** ijw has quit IRC18:08
*** ijw has joined #openstack-infra18:09
*** ijw has quit IRC18:18
*** ijw has joined #openstack-infra18:18
*** lpetrut has joined #openstack-infra18:18
*** ijw has joined #openstack-infra18:19
rpiosocorvus: After a change has been merged, can comments be sent to the owner and reviewers? The Gerrit web UI is permitting me to create draft comments, but I don't see a Reply button.18:19
*** ijw has joined #openstack-infra18:19
*** tosky has quit IRC18:20
corvusrpioso: comments may be left on changes after they have merged18:20
rpiosocorvus: How are they sent to the owner and other reviewers?18:20
rpiosoPresently, they're marked as draft comments.18:20
corvusrpioso: draft comments are not. but if you publish them, they will appear as normal comments.18:20
openstackgerritJames E. Blair proposed opendev/base-jobs master: sudo add iptables rules  https://review.opendev.org/65575918:21
corvusAJaeger, mordred: ^18:21
rpiosocorvus: How are they published? There's no Reply... button displayed.18:21
corvusrpioso: what url are you at?18:22
openstackgerritsebastian marcet proposed osf/openstackid-resources master: Fix error on multipart parsing  https://review.opendev.org/65576018:22
openstackgerritMerged osf/openstackid-resources master: Fix error on multipart parsing  https://review.opendev.org/65576018:22
AJaegerJeffrey4l: +2A18:23
AJaegerJeffrey4l: sorry, complete wrong autocompltion18:23
AJaegertrying again...18:23
AJaegercorvus: +2A18:23
*** corvus is now known as jeblair18:23
jeblairAJaeger: thanks!18:23
*** jeblair is now known as corvus18:23
AJaeger;)18:23
*** lpetrut has quit IRC18:23
AJaegermordred: do we still need https://review.opendev.org/580871 or time to abandon?18:24
openstackgerritJames E. Blair proposed openstack/project-config master: Don't announce osf/ projects in openstack-infra/opendev  https://review.opendev.org/65576118:25
rpiosocorvus: Never mind. Chrome hides the button when  it's not full screen.18:26
rpiosoIt was half screen on a 23" display :-(18:27
corvushelpful!18:27
fungiback and checking in before i try to wrestle my yard into submission... looks like 655491,2 failed out on more unit test timeouts?18:27
AJaegercorvus: time to abandon https://review.opendev.org/587178 and https://review.opendev.org/556885 ?18:28
corvusfungi: fixes for registry errors are in progress and currently broken; i'll recheck that soon.18:28
corvusAJaeger: i'm not in a position to deal with that right now18:29
corvusAJaeger: we can WIP those if that would help18:29
AJaegercorvus: I can ping in a few weeks again ;) no worries...18:29
corvuscurrently, my stack overfloweth18:30
fungitaking a look at 655739 now18:31
fungiwow, there's more scrollback than i thought, that's already merged. continuing18:31
fungi655744 too18:31
corvusfungi: post-merge reviews still welcome :)18:32
openstackgerritMerged opendev/base-jobs master: sudo add iptables rules  https://review.opendev.org/65575918:32
corvusokay, i'm going to re-enqueue some changes now18:32
fungiwell, my post-merge activities also have to compete with yardwork, so...18:33
fungii figure if it's broken i'll just find out when problems surface18:33
*** ijw has quit IRC18:35
*** ijw has joined #openstack-infra18:35
*** ijw has quit IRC18:37
*** ijw has joined #openstack-infra18:38
*** ijw has quit IRC18:41
*** ijw has joined #openstack-infra18:42
openstackgerritTobias Henkel proposed zuul/zuul master: WIP add repl  https://review.opendev.org/57996218:45
*** ijw has quit IRC18:47
*** ijw has joined #openstack-infra18:48
openstackgerritPaul Belanger proposed zuul/zuul master: Support Ansible 2.8  https://review.opendev.org/63193318:48
*** ijw has quit IRC18:54
*** ijw has joined #openstack-infra18:54
*** jamesmcarthur has quit IRC19:02
*** e0ne has joined #openstack-infra19:06
*** jamesmcarthur has joined #openstack-infra19:14
*** lpetrut has joined #openstack-infra19:14
openstackgerritJason Lee proposed opendev/storyboard master: WIP: Adds Blueprints to DB, Errors in Task Creation  https://review.opendev.org/65481219:16
*** ijw has quit IRC19:17
*** jamesmcarthur has quit IRC19:17
*** e0ne has quit IRC19:17
*** igordc has quit IRC19:17
*** ijw has joined #openstack-infra19:17
*** jcoufal has joined #openstack-infra19:17
*** jamesmcarthur has joined #openstack-infra19:18
*** ijw has quit IRC19:20
*** ijw has joined #openstack-infra19:20
*** ijw has quit IRC19:21
*** ijw has joined #openstack-infra19:21
*** ykarel|away has quit IRC19:22
corvusAJaeger, mordred: progress!  the most recent run of 655491 ran the new iptables roles successfully19:24
corvuss/roles/tasks/19:24
*** eharney has quit IRC19:25
corvusthat was on ipv4, so we haven't demonstrated conclusively that solves the v6 problem (EOF), but we've at least unbroken ourselves19:25
mordredcorvus: \o/19:25
corvusmordred: your retries fixed the blob issue!19:25
corvusmordred: http://logs.openstack.org/91/655491/2/gate/zuul-upload-image/8b4d7af/ara-report/result/8ccff3b6-849c-446b-8a47-91803af172b2/19:26
mordredcorvus: double-\o/19:26
AJaegercorvus, \o/19:26
mordredcorvus: that's very exciting19:26
corvushow should i read "Retries 4" in the ara output?19:26
corvusit looks like that output is saying there were 2 attempts; failed on the first passed on the second19:27
corvusthey are numbered "Attempts" 1 and 219:27
mordredcorvus: I can come up with no explanation of how to read retries 419:29
corvusmordred: i'm going to assume ansible is bad at math19:29
mordredunless it's a bug that's somehow adding attempt 1 to retries19:29
mordredyeah19:29
corvusit looks like all the other copies succeeded without retries19:29
mordredcool. so sometimes something just derps19:30
*** jamesmcarthur has quit IRC19:32
corvusi'm assuming someone keeps enqueing 655491 into gate despite check failures, right?19:37
corvus(that's fine, it's just -- it's not me and wanted to make sure that's what's happening)19:38
*** yamamoto has joined #openstack-infra19:40
fungicorvus: yes, that's me ;)19:42
fungisorry, trying to at least keep an eye on something useful in between laps with the mower19:42
fungiand cheering on the registry troubleshooting from the sidelines19:43
corvusi'd like to point out that hideci is making it all but impossible for the zuul team to do it's work19:44
*** yamamoto has quit IRC19:44
*** samueldmq has joined #openstack-infra19:44
corvuswe're having conversations in #zuul which we should not have to be having because most of the team can't even see zuul's output on its own code review system19:45
openstackgerritTobias Henkel proposed zuul/zuul master: WIP: Fix requirements loop warning  https://review.opendev.org/65578119:45
fungias in having to unhide the results? yeah, maybe we should trim that back to not filter out our gating ci system19:46
fungiand yeah, trying to follow along with conversation there too19:47
corvusthere are so many problems with it and no one seems interested in fixing them.  maybe we can exclude zuul for now and remove it entirely after the gerrit upgrade (where we should be able to use the reporting system plugin)19:49
*** igordc has joined #openstack-infra19:49
openstackgerritJames E. Blair proposed opendev/system-config master: Do not hide Zuul comments  https://review.opendev.org/65578219:50
corvusfungi: there's that ^19:50
mnasercorvus: is this some js-fu that I can help with?19:50
corvusmnaser: indeed it is19:50
mnaseroh I see that file19:51
mnasercorvus: where do I have to read to get context?19:51
mnasermaybe I can help fix quickly..19:51
corvusmnaser: there are 2 problems that we identified in a recent #zuul conversation19:52
corvusthe first is only going to be addressed by not hiding zuul comments, so i think 655782 is important regardless19:52
corvusthe second is that if a job is reported without a log link, it doesn't match the regex that causes it to appear in the table at the top19:53
openstackgerritTobias Henkel proposed zuul/zuul master: WIP: Fix requirements loop warning  https://review.opendev.org/65578119:53
corvussee https://screenshots.firefox.com/gOvoTW5JW5PvymIT/review.opendev.org19:53
mnasercorvus: ah yes, I've ran into that several times too in other context (i.e. RETRY_LIMIT with finger:// url might do the same)19:55
mnaserI think that's the same issue there?19:55
corvusmnaser: probably19:55
openstackgerritJeremy Stanley proposed opendev/system-config master: Don't hide Zuul CI comments  https://review.opendev.org/65578319:55
mnaserhttps://usercontent.irccloud-cdn.com/file/FF60bE1g/image.png19:55
fungicorvus: ^ a stab at it19:55
mnaser1 changes set apart19:56
fungioh, you were also writing one19:56
mnaseralmost the same commit id19:56
mnaser:P19:56
fungii'll abandon19:56
corvusfungi's is better19:56
mnasers/id/title/19:56
fungioh, okay19:56
fungiwe have two convergent solutions for comparison ;)19:56
fungii needed an excuse to sit down for a few minutes anyway19:57
fungiand fwiw i do more often than not end up un-hiding ci comments just so i can see when zuul leaves a new comment20:00
fungi(even when i expect its results to also be exposed in the ci table)20:00
mnaserok so20:02
mnaserI've narrowed it down here20:02
mnaserhttps://opendev.org/opendev/system-config/src/branch/master/modules/openstack_project/manifests/review.pp#L176-L18020:02
fungithere's a bunch which looks like it could be cleaned up in hideci.js, but our time is probably better spent on newer gerrit's ci integration feature than trying to polish that obsolete thing too heavily20:03
corvusi'm way overdue for lunch; biab.20:03
fungienjoy!20:03
mnaserI think its failing to parse that to convert it to the url that is annotated with span/etc20:04
mnaserwhich then makes hideci fail20:04
fungii'm going to continue vacillating between cutting my lawn and reenqueuing the zuul regression revert so we can hopefully unblock pypi uploads20:04
*** _erlon_ has quit IRC20:09
openstackgerritsebastian marcet proposed osf/openstackid-resources master: Fix on multiparse request data * bug fixin * refactoring  https://review.opendev.org/65578620:10
openstackgerritMerged osf/openstackid-resources master: Fix on multiparse request data * bug fixin * refactoring  https://review.opendev.org/65578620:12
*** eharney has joined #openstack-infra20:18
openstackgerritFabien Boucher proposed zuul/zuul master: A reporter for Elasticsearch with the capability to index build and buildset results in an index.  https://review.opendev.org/64492720:18
openstackgerritFabien Boucher proposed zuul/zuul master: A reporter for Elasticsearch with the capability to index build and buildset results in an index.  https://review.opendev.org/64492720:19
*** Weifan has joined #openstack-infra20:19
*** eernst has joined #openstack-infra20:22
*** dave-mccowan has joined #openstack-infra20:26
*** eernst has quit IRC20:26
*** kgiusti has left #openstack-infra20:30
*** panda is now known as panda|off20:30
*** jamesmcarthur has joined #openstack-infra20:32
*** igordc has quit IRC20:36
*** Weifan has quit IRC20:41
*** eernst has joined #openstack-infra20:43
*** eernst has quit IRC20:45
*** kranthikirang has quit IRC20:45
*** eernst_ has joined #openstack-infra20:45
*** eharney has quit IRC20:46
*** pcaruana has quit IRC20:48
*** ricolin has quit IRC20:49
*** gmann is now known as gmann_afk21:01
*** Weifan has joined #openstack-infra21:03
openstackgerritsebastian marcet proposed osf/openstackid-resources master: Fixed bool parsing on multipart form  https://review.opendev.org/65579621:09
openstackgerritMerged osf/openstackid-resources master: Fixed bool parsing on multipart form  https://review.opendev.org/65579621:10
*** Goneri has joined #openstack-infra21:14
*** ijw_ has joined #openstack-infra21:15
*** smarcet has joined #openstack-infra21:15
*** ijw has quit IRC21:18
openstackgerritJames E. Blair proposed zuul/zuul master: Fix requirements loop warning  https://review.opendev.org/65578121:24
*** Weifan has quit IRC21:29
*** Goneri has quit IRC21:33
openstackgerritJames E. Blair proposed zuul/zuul-jobs master: Don't repeat the etc/alias setup for buildset registry pushes  https://review.opendev.org/65580221:33
*** ijw_ has quit IRC21:34
corvusthat one can wait until next week; it's just an optimization.21:34
*** ijw has joined #openstack-infra21:34
*** ijw has quit IRC21:35
*** ijw has joined #openstack-infra21:36
*** rlandy has quit IRC21:36
*** Goneri has joined #openstack-infra21:37
openstackgerritJames E. Blair proposed zuul/zuul master: Fix requirements loop warning  https://review.opendev.org/65578121:40
*** Weifan has joined #openstack-infra21:42
*** jamesmcarthur has quit IRC21:44
*** eharney has joined #openstack-infra21:44
*** Goneri has quit IRC21:46
*** Goneri has joined #openstack-infra21:47
*** Lucas_Gray has joined #openstack-infra21:48
*** jamesmcarthur has joined #openstack-infra21:51
*** jamesmcarthur has quit IRC21:53
*** jamesmcarthur has joined #openstack-infra21:56
openstackgerritJames E. Blair proposed zuul/zuul master: Halve stestr concurrency  https://review.opendev.org/65580421:56
*** slaweq has quit IRC21:57
*** Goneri has quit IRC21:57
corvusclarkb: zuul has been backlogged all day so far with no appreciable memory increase21:58
*** Weifan has quit IRC21:58
fungiyeah, it's been holding steady21:59
fungiso i think one of the missing commits is likely the culprit21:59
*** ccamacho has joined #openstack-infra21:59
clarkbok that implies on or both of the reverts are to blame21:59
fungiyeah, could be both i suppose21:59
corvusdid jobs for 655491 just restart?22:00
*** imacdonn has quit IRC22:02
*** imacdonn has joined #openstack-infra22:02
fungiyeah, the py35 unit tests hit test timeouts again so i "promoted" it as the only change in the queue22:03
corvusyes, tox-docs finished at 21:26 and 21:5922:03
corvusah ok22:03
corvusfungi: registry jobs okay?22:03
fungiseemed to be22:03
fungiso i think that's fixed22:03
*** hwoarang has quit IRC22:03
fungiit's just the slow unit tests which are problematic now22:04
corvusk; i've been checking on the reports to keep an eye on that.  if you promote, let me know if you see anything other than a tox failure, since i won't see it otherwise.22:04
fungihappy to!22:04
*** jamesmcarthur has quit IRC22:04
*** kaiokmo has quit IRC22:05
*** ijw has quit IRC22:06
*** ijw has joined #openstack-infra22:07
*** hwoarang has joined #openstack-infra22:07
*** ijw has quit IRC22:09
*** ijw has joined #openstack-infra22:10
openstackgerritJames E. Blair proposed zuul/zuul master: DNM: exercise halving concurrency  https://review.opendev.org/65580522:11
*** slaweq has joined #openstack-infra22:11
openstackgerritJames E. Blair proposed zuul/zuul master: DNM: exercise halving concurrency  https://review.opendev.org/65580522:12
*** jcoufal has quit IRC22:16
*** lpetrut has quit IRC22:26
*** jamesmcarthur has joined #openstack-infra22:27
*** ijw has quit IRC22:29
*** ijw has joined #openstack-infra22:29
*** ijw has joined #openstack-infra22:31
*** ijw has joined #openstack-infra22:31
*** jamesmcarthur has quit IRC22:32
fungicorvus: enqueuing 655491,2 into the gate again, this time hit unit test timeouts under py36 instead of py35 but everything else passed22:35
openstackgerritJames E. Blair proposed zuul/zuul master: Fix race in test_job_pause_pre_skipped_child  https://review.opendev.org/65580822:36
corvusfungi: that's a fix for the error it hit ^22:36
*** slaweq has quit IRC22:37
fungiooh!22:38
fungithanks22:38
fungioh, yeah, i wonder if this isn't the reason for most of them22:39
openstackgerritmelanie witt proposed openstack/project-config master: Add glance, tempest, and nova stable* to ceph grafana  https://review.opendev.org/65580922:41
*** smarcet has quit IRC22:41
*** Weifan has joined #openstack-infra22:44
corvusfungi: i think at least half the failures are due to resource contention22:44
*** smarcet has joined #openstack-infra22:44
corvusfungi: however, i have seen this test fail before22:44
corvususually if a bunch of tests fail and there are messages about zookeeper or gearman disconnections, it's contention22:45
*** smarcet has quit IRC22:45
corvusif one test fails, it's probably a race22:45
openstackgerritMerged zuul/nodepool master: Update devstack settings and docs for opendev  https://review.opendev.org/65423022:45
funginew life experience: debugging python source code over a voice call while pushing a mower around the lawn22:46
*** Weifan has quit IRC22:48
*** whoami-rajat has quit IRC22:51
mordredfungi: I have never debugged python source code while pushing a lawn mower22:57
*** tkajinam has joined #openstack-infra23:01
*** lathiat has quit IRC23:02
*** rcernin has joined #openstack-infra23:03
*** Lucas_Gray has quit IRC23:04
mordredcorvus: I had a thought re: refs/changes and the long replication - which is that we could add the patch to our gitea, then make a script that we run that pushes refs/changes refs/notes to gitea outside of the gerrit context - just in the background - then once it has caught up, remove the gerrit exclusion. it doesn't really help with making a new gitea server - but maybe for that we just need to23:07
mordredrsync the filesystem of an existing one and copy the mysql db23:07
*** lathiat has joined #openstack-infra23:16
*** lseki has quit IRC23:16
*** slaweq has joined #openstack-infra23:16
fungii've moved on from yardwork to laundry, and 655491,2 is getting really close to merging23:18
fungiactually passed all its tox jobs this time23:18
fungijust needs the quickstart and image uploads to finish without incident23:18
*** jamesmcarthur has joined #openstack-infra23:18
*** harlowja has quit IRC23:19
mordredfingers crossed23:19
openstackgerritMerged zuul/zuul master: Revert "Prepend path with bin dir of ansible virtualenv"  https://review.opendev.org/65549123:20
clarkbthereit is23:20
*** Weifan has joined #openstack-infra23:21
fungiwoo!!!23:23
fungii guess once puppet rolls that out i may be freed up enough for executor restarts23:23
*** slaweq has quit IRC23:24
openstackgerritJason Lee proposed opendev/storyboard master: RC1: Release Candidate 1  https://review.opendev.org/65481223:28
corvusmordred: i think when we make a new gitea server, maybe it's okay to bring it up and let it sit for a day getting replicated to before we add it to the LB23:28
smcginnis\o/23:28
corvusmordred: if we're okay with that, then i think the complexity around the current situation is just making sure we do something in a way that doesn't take out the existing system for any length of time.23:29
*** Weifan has quit IRC23:29
corvusmordred: so, yeah, i think the background script idea may be a good one for getting through the current state23:30
corvusthe gitea folks haven't been in quite so much of a hurry to review the refs change yet, though they have seen it and thrown some labels/milestones at it23:31
mordredcorvus: yeah - I think I'm ok with that - as I also think it's hopefully a short to medium term thing - and eventually we'll back to a cluster where adding/deleting nodes != needing to re-replicate23:38
corvusya that too23:40
*** yamamoto has joined #openstack-infra23:42
*** yamamoto has quit IRC23:46
*** jamesmcarthur has quit IRC23:48
*** jamesmcarthur has joined #openstack-infra23:49
mordredok - it's that time for me - see y'all tomorry23:50
openstackgerritJames E. Blair proposed zuul/zuul master: Fix race in test_job_pause_pre_skipped_child  https://review.opendev.org/65580823:54
*** gyee has quit IRC23:57

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!