Thursday, 2019-09-12

*** aaronsheffield has quit IRC00:02
*** adriant has quit IRC00:11
*** iokiwi has quit IRC00:11
*** goldyfruit_ has joined #openstack-infra00:12
donnydfungi: guess I will need to get a logo made up. I will get back to you on it00:18
*** igordc has quit IRC00:23
*** markvoelker has quit IRC00:29
*** markvoelker has joined #openstack-infra00:29
clarkbguilhermesp: root@23.253.245.7900:33
*** Goneri has quit IRC00:37
*** adriant has joined #openstack-infra00:42
*** ricolin has joined #openstack-infra00:48
*** gyee has quit IRC00:56
*** happyhemant has quit IRC01:00
*** markvoelker has quit IRC01:03
*** markvoelker has joined #openstack-infra01:04
cmurphyprometheanfire: seems like the configparser update in https://review.opendev.org/680914 broke keystone's py27 tests https://zuul.opendev.org/t/openstack/build/a858551fdb4749e0aa25a29abcad3e73/log/job-output.txt there's no configparser 4.0.1 https://pypi.org/project/configparser/#history01:05
clarkbcmurphy: they may have deleted that release01:06
cmurphymust have01:06
cmurphythey still have the git tag01:06
prometheanfirecmurphy: https://github.com/jaraco/configparser/releases01:08
gmanncmurphy: clarkb prometheanfire that is breaking all py27 jobs, may be we should cap the version01:08
cmurphyhttps://review.opendev.org/68163001:09
prometheanfireya, if they unpublished it then we need to roll it back01:09
*** pcaruana has quit IRC01:09
prometheanfireI'm guessing because of https://github.com/jaraco/configparser/issues/4501:09
cmurphyugh yeah https://github.com/jaraco/configparser/issues/45#issuecomment-53059595101:10
prometheanfireI'll leave it up to infra if they want to rush it but I +2+w'd it01:10
clarkbI'll enqueue and promote that change now that it is approved01:10
clarkbprometheanfire: jinx01:10
*** slaweq has joined #openstack-infra01:11
prometheanfireI'm just gonna leave this link there then01:11
prometheanfirehttps://doughellmann.com/blog/2016/02/25/so-youve-released-a-broken-package-to-pypi-what-do-you-do-now/01:11
cmurphylol01:12
prometheanfireyep, unpublishing things is one of the things I hate most, makes me want to remove the dep from openstack to not have to deal with it :P01:12
clarkbI also reenqued the nova fix that got kicked out due to this01:12
clarkbIf I was a smarter person I would've put it just behind the requirements fix but this is likely good enough for now01:13
cmurphyclarkb: we're also still having timeout problems in keystone https://review.opendev.org/681621 if you're feeling generous01:14
clarkbcmurphy: done01:14
cmurphytyvm01:15
*** slaweq has quit IRC01:16
prometheanfireI thought the timeout bump merged (for reqs at least)01:16
clarkbthis is a second bump01:17
gmannprometheanfire: i am worried about stestr version up now. how did you test that (if all other project working fine)? with workaround in Tempest it will fix it in Tempest but every tox use stestr and what project raise skip exception from setUpClass we do not know01:17
prometheanfiregmann: it's just our tempest job that's failing with it01:17
prometheanfireall the other checks were passing01:18
gmannok.01:18
gmannI will try to get Tempest workaround in by today.01:19
prometheanfirethanks :D01:20
clarkbneutron's functional job doesn't use speculative future states https://606e7073949a555d6ce7-a5a94d16cd1e63fdf610099df3afaf88.ssl.cf5.rackcdn.com/679813/2/gate/neutron-functional-python27/e20cb78/job-output.txt it failed onthe configparser issue behind the change that fixes that01:30
*** roman_g has quit IRC01:31
clarkb(I'm not going to debug that, its time for evening things. The neutron team should fix that though)01:32
*** mrda has joined #openstack-infra01:34
openstackgerritIan Wienand proposed zuul/zuul master: [wip] Test and expand documentation for executor-only jobs  https://review.opendev.org/67918401:37
*** dklyle has quit IRC01:42
*** david-lyle has joined #openstack-infra01:42
*** david-lyle has quit IRC01:46
*** dklyle has joined #openstack-infra01:47
*** dchen has quit IRC01:50
*** jcoufal has quit IRC01:54
guilhermespclarkb: thanks!01:56
fungii'm going to wager it directly retrieves the constraints file instead of using the copy zuul provides on the node's filesystem01:58
*** diablo_rojo has quit IRC02:02
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-tarball: introduce zuul_use_fetch_output  https://review.opendev.org/68160302:05
*** slaweq has joined #openstack-infra02:11
*** slaweq has quit IRC02:16
*** diablo_rojo has joined #openstack-infra02:17
*** rlandy|bbl has quit IRC02:37
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-tarball: introduce zuul_use_fetch_output  https://review.opendev.org/68160302:40
*** exsdev has quit IRC02:45
*** exsdev has joined #openstack-infra02:46
*** bobh has joined #openstack-infra02:49
*** tinwood has quit IRC02:50
*** tinwood has joined #openstack-infra02:52
*** xinranwang has joined #openstack-infra02:57
*** diablo_rojo has quit IRC03:06
*** bobh has quit IRC03:07
*** ykarel|away has joined #openstack-infra03:13
*** armax has quit IRC03:29
openstackgerritIan Wienand proposed zuul/zuul master: [wip] Test and expand documentation for executor-only jobs  https://review.opendev.org/67918403:32
*** Garyx_ has joined #openstack-infra03:33
*** Garyx has quit IRC03:33
*** dchen has joined #openstack-infra03:43
*** KeithMnemonic has quit IRC04:04
*** rh-jelabarre has quit IRC04:04
*** ykarel|away is now known as ykarel04:06
*** slaweq has joined #openstack-infra04:11
*** slaweq has quit IRC04:16
*** ricolin has quit IRC04:27
*** dolpher has joined #openstack-infra04:30
*** udesale has joined #openstack-infra04:31
*** pcaruana has joined #openstack-infra04:32
cmurphyhttps://review.opendev.org/681630 needs to be promoted again :/04:42
*** kjackal has joined #openstack-infra04:47
*** markvoelker has quit IRC04:48
*** pcaruana has quit IRC04:58
*** pots has joined #openstack-infra05:10
*** jaosorior has joined #openstack-infra05:20
*** sshnaidm|off is now known as sshnaidm|afk05:25
*** ykarel has quit IRC05:27
*** georgk has quit IRC05:28
*** fdegir has quit IRC05:28
*** fdegir has joined #openstack-infra05:29
*** georgk has joined #openstack-infra05:29
*** ramishra has joined #openstack-infra05:31
*** ccamacho has quit IRC05:33
*** hamzy_ has quit IRC05:35
openstackgerritIan Wienand proposed zuul/zuul master: Discuss executor-only jobs, add unit-test  https://review.opendev.org/67918405:38
openstackgerritIan Wienand proposed zuul/zuul master: Remove auto-add of localhost from unit test inventory  https://review.opendev.org/68164105:38
*** ykarel has joined #openstack-infra05:40
*** raukadah is now known as chandankumar05:41
ianwcmurphy: i just merged it ... i don't think there's much point waiting for it and making the queues worse05:45
ianw#status log force merged https://review.opendev.org/681630  "Revert configparser update" to avoid further breakage after it failed gate on it's prior promoted run05:46
openstackstatusianw: finished logging05:46
prometheanfireianw: ack05:47
ianwit had it's chance :)05:47
prometheanfire:D05:47
prometheanfireblame upstream05:48
*** slaweq has joined #openstack-infra05:53
*** kjackal has quit IRC05:58
cmurphythanks ianw05:58
*** slaweq has quit IRC05:59
*** tkajinam has quit IRC06:00
*** markvoelker has joined #openstack-infra06:06
*** tkajinam has joined #openstack-infra06:06
*** pcaruana has joined #openstack-infra06:08
*** markvoelker has quit IRC06:11
*** pgaxatte has joined #openstack-infra06:11
*** slaweq has joined #openstack-infra06:13
*** jaicaa has quit IRC06:16
*** jaicaa has joined #openstack-infra06:17
*** pgaxatte has quit IRC06:18
*** pgaxatte has joined #openstack-infra06:19
AJaegerianw: saw your comments on the review. I suggest to add to the etherpad the structure you have in mind for publishing - and I'll mirror that to the jobs...06:31
*** dchen has quit IRC06:31
AJaegerianw: left a comment...06:34
*** ccamacho has joined #openstack-infra06:35
*** rpittau|afk is now known as rpittau06:36
*** pcaruana has quit IRC06:41
*** ricolin has joined #openstack-infra06:41
AJaegerianw: I still see 681630 on http://zuul.opendev.org/t/openstack/status - why is Zuul not dequeing it?06:46
*** pgaxatte has quit IRC06:52
*** tkajinam_ has joined #openstack-infra06:53
*** tkajinam has quit IRC06:56
*** kjackal has joined #openstack-infra06:57
*** ricolin has quit IRC06:59
*** ralonsoh has joined #openstack-infra07:00
*** soniya29 has joined #openstack-infra07:04
*** spsurya has joined #openstack-infra07:07
*** aedc has joined #openstack-infra07:09
*** pcaruana has joined #openstack-infra07:11
*** trident has quit IRC07:15
*** ykarel is now known as ykarel|lunch07:15
*** ricolin has joined #openstack-infra07:18
*** ralonsoh has quit IRC07:23
*** happyhemant has joined #openstack-infra07:23
*** ralonsoh has joined #openstack-infra07:24
*** ralonsoh has quit IRC07:24
*** trident has joined #openstack-infra07:24
*** ralonsoh has joined #openstack-infra07:24
ianwAJaeger: i think it will still go through the motions but it's just already merged07:25
ianwAJaeger: yeah, thanks.  the change has prompted me to think a bit harder; i can come up with a bit of a spec, we can discuss it07:26
*** aedc has quit IRC07:27
*** aedc has joined #openstack-infra07:28
*** trident has quit IRC07:28
AJaegerianw: ok, looking forward to it.07:29
*** ralonsoh has quit IRC07:32
*** ralonsoh has joined #openstack-infra07:32
openstackgerritSorin Sbarnea proposed opendev/bindep master: Improve tox.ini setup  https://review.opendev.org/60561307:33
openstackgerritSorin Sbarnea proposed opendev/bindep master: Improve tox.ini setup  https://review.opendev.org/60561307:33
*** pgaxatte has joined #openstack-infra07:35
*** soniya29 has quit IRC07:36
*** xenos76 has joined #openstack-infra07:37
*** trident has joined #openstack-infra07:37
*** hwoarang has quit IRC07:40
*** dolpher has quit IRC07:41
*** hwoarang has joined #openstack-infra07:42
*** hamzy has joined #openstack-infra07:42
*** jpena|off is now known as jpena07:42
*** exsdev has quit IRC07:44
*** exsdev0 has joined #openstack-infra07:44
*** exsdev0 is now known as exsdev07:44
*** xinranwang has quit IRC07:56
*** e0ne has joined #openstack-infra07:57
*** electrofelix has joined #openstack-infra07:57
*** soniya29 has joined #openstack-infra08:01
*** e0ne has quit IRC08:08
*** tkajinam_ has quit IRC08:09
*** ykarel|lunch is now known as ykarel08:09
*** kopecmartin|off is now known as kopecmartin08:10
*** ykarel is now known as ykarel|meeting08:14
*** pkopec has joined #openstack-infra08:15
*** whoami-rajat has joined #openstack-infra08:15
*** yolanda has quit IRC08:16
*** markvoelker has joined #openstack-infra08:16
*** threestrands has quit IRC08:16
*** ykarel has joined #openstack-infra08:18
*** ykarel|meeting has quit IRC08:19
*** markvoelker has quit IRC08:20
*** derekh has joined #openstack-infra08:28
*** yolanda has joined #openstack-infra08:39
*** jaosorior has quit IRC08:45
*** piotrowskim has joined #openstack-infra08:50
*** gfidente has joined #openstack-infra08:53
*** priteau has joined #openstack-infra08:56
*** kjackal has quit IRC08:57
*** kjackal has joined #openstack-infra08:57
*** xenos76 has quit IRC08:59
*** ociuhandu has joined #openstack-infra09:00
*** e0ne has joined #openstack-infra09:03
*** e0ne has quit IRC09:03
*** udesale has quit IRC09:04
*** ociuhandu has quit IRC09:04
*** e0ne has joined #openstack-infra09:07
*** udesale has joined #openstack-infra09:07
*** ralonsoh has quit IRC09:13
*** FlorianFa has quit IRC09:13
*** e0ne has quit IRC09:15
*** lpetrut has joined #openstack-infra09:16
*** rcernin has quit IRC09:16
*** sshnaidm|afk is now known as sshnaidm|rover09:18
*** ociuhandu has joined #openstack-infra09:19
*** e0ne has joined #openstack-infra09:19
*** ociuhandu has quit IRC09:21
*** e0ne has quit IRC09:21
*** ociuhandu has joined #openstack-infra09:21
*** lpetrut has quit IRC09:23
*** prometheanfire has quit IRC09:24
*** lpetrut has joined #openstack-infra09:24
*** pgaxatte has quit IRC09:26
*** prometheanfire has joined #openstack-infra09:26
*** ricolin has quit IRC09:37
*** e0ne has joined #openstack-infra09:39
*** ykarel_ has joined #openstack-infra09:39
*** ykarel has quit IRC09:40
*** ralonsoh has joined #openstack-infra09:46
*** udesale has quit IRC09:49
*** udesale has joined #openstack-infra09:50
*** efried has joined #openstack-infra09:52
fricklerdonnyd: infra-root: I just saw three gate failures with mirror issues in FN, maybe the current load is too high?09:53
*** kjackal has quit IRC09:54
*** Tengu has quit IRC10:03
*** Tengu has joined #openstack-infra10:04
*** gfidente has quit IRC10:07
fricklerlooks like some kind of major issue since 08:30 according to https://grafana.fortnebula.com/d/9MMqh8HWk/openstack-utilization?orgId=2&refresh=30s , I think I'll disable that cloud until donnyd is awake again10:07
cgoncalveshey infra team! we in octavia are seeing high failure rates in CI due to connectivity issues with apt and pip in fortnebula10:08
*** dtantsur|afk is now known as dtantsur10:08
cgoncalvesthis is blocking us from merging bug fixes and features. feature freeze is this week10:08
cgoncalvese.g. https://ab6aa80517d8c71f588c-e65fbda5c4a8fc14eb81d398bd7b0a80.ssl.cf2.rackcdn.com/681144/8/check/openstack-tox-functional/0ac11dd/job-output.txt10:09
openstackgerritJens Harbott (frickler) proposed openstack/project-config master: Disable fortnebula cloud  https://review.opendev.org/68169710:10
fricklercgoncalves: yeah, I think I'll disable that cloud until it can be fixed ^^10:10
cgoncalves^ this might do it :)10:10
cgoncalvesfrickler, thanks!10:10
*** aedc has quit IRC10:13
*** lpetrut has quit IRC10:17
*** lpetrut has joined #openstack-infra10:19
*** noama has joined #openstack-infra10:22
*** pgaxatte has joined #openstack-infra10:23
*** ykarel_ is now known as ykarel10:24
*** markvoelker has joined #openstack-infra10:26
*** panda is now known as panda|ruck10:28
*** kjackal has joined #openstack-infra10:29
openstackgerritMerged openstack/project-config master: Disable fortnebula cloud  https://review.opendev.org/68169710:32
*** markvoelker has quit IRC10:32
*** jaosorior has joined #openstack-infra10:32
*** lucasagomes has joined #openstack-infra10:58
*** aedc has joined #openstack-infra10:58
*** ykarel is now known as ykarel|afk10:59
*** gfidente has joined #openstack-infra11:01
*** ykarel|afk is now known as ykarel11:08
*** lucasagomes has quit IRC11:12
*** lucasagomes has joined #openstack-infra11:25
donnydare there any other jobs with this issue?11:25
*** lucasagomes has quit IRC11:26
*** lucasagomes has joined #openstack-infra11:27
*** xenos76 has joined #openstack-infra11:30
donnydfrickler: I don't see any issue on any of the provider dashboard11:30
donnydbut I surely want to see jobs get through the gate11:31
*** jaosorior has quit IRC11:31
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-tarball: introduce zuul_use_fetch_output  https://review.opendev.org/68160311:31
*** roman_g has joined #openstack-infra11:32
fricklerdonnyd: other samples: https://aa7391dc0053b2e32f36-449df4e845769ef2b91c503daa7699ef.ssl.cf1.rackcdn.com/680354/2/gate/cloudkitty-tempest-full-ipv6-only/12b5704/job-output.txt https://storage.gra1.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_50a/681498/1/gate/neutron-grenade/50ae937/logs/grenade.sh.txt.gz11:35
donnydyea thats no good11:36
*** jpena is now known as jpena|lunch11:37
donnydFrom the logs it doesn't look like a dns issue, it looks like the mirror is just being overloaded11:39
donnydbut i don't have access to the mirrors logs11:39
donnydI was going to replace my edge router with a much faster one this weekend when the load calmed down anyways. Maybe this is a good time to get it done.11:40
*** pots has quit IRC11:41
*** pots has joined #openstack-infra11:42
donnydhttp://logstash.openstack.org/#/dashboard/file/logstash.json?query=node_provider:%5C%22fortnebula-regionone%5C%22%20AND%20filename:%5C%22job-output.txt%5C%22%20AND%20message:%5C%22RUN%20END%20RESULT_TIMED_OUT%5C%22&from=3h11:42
donnydyea there are a bunch of them11:43
*** ykarel is now known as ykarel|afk11:45
donnydfrickler: can you look at the logs from the FN mirror11:49
fricklerdonnyd: that's an extract from the mirror error log when the issue seems to have started, goes on like that until 11:27 http://paste.openstack.org/show/775408/11:49
donnydI am going to wait for the remaining jobs to finish and i will replace the router. Looking at the logs and metrics I don't see any real reason for it..11:51
*** elod has quit IRC11:51
*** elod has joined #openstack-infra11:52
*** whoami-rajat has quit IRC11:54
fricklerdonnyd: is that the same router that does the nat between the instances and the mirror? seems suboptimal that they connect via natted v4 instead of v611:54
donnydYea that tenant is special because I don't have public v611:55
donnyds/v411:55
donnydSo it hits my edge and then comes back in for v4 related things11:55
donnydIt's a 10G router and the load on it is well below that... however I already have a much more powerful one sitting in front of me.  Trying to get everything up to 40G11:56
donnydThere are also other things I can do for it, but the way we have the mirror designed this is how it works in all the other clouds too11:57
donnydall traffic leaves, hits a provider router on the outside of the tenant and then comes back down11:57
donnydIf you look at the graphs you can see the load on the edge router is pretty low11:59
donnydand the network node where traffic leaves is also not even close to hitting a limit12:00
donnydWith all that said there is clearly an issue12:00
*** markvoelker has joined #openstack-infra12:01
fricklerdonnyd: it may be that the nat is running out of ports to nat to. can you check nat table size on the router? although it might have recovered by now.12:01
*** soniya29 has quit IRC12:02
fungiif connections from the nodes to the mirror are funneled through a single layer 4 overload snat/pat then you may want to double-check that the pool of available ports it's using to map connections isn't getting exhausted12:02
donnydfrickler: Oh i adjusted that a couple days back12:02
fungier, what frickler said12:02
donnydI do have another option and that is to use openstack routers12:02
fungiusually those will incorporate a cooldown period for each port to handle stray/duplicate rst and fin packets, which can be on the order of minutes by default12:03
donnydI can keep it off the edge by connecting the zuul tenant directly to the same public network as the mirror tenant12:03
donnydor instead of NAT I can also route traffic12:04
fungiif necessary we can add a static route for the node network on the mirror server, i expect12:04
fungior... why aren't they talking to the mirror via ipv6?12:05
fungii've likely already forgotten12:05
donnydyea that too12:05
donnydthey should just resolve the v6 addr and then its direct12:05
donnydno NAT12:06
donnydI can fix that by providing a different DNS... but I know we are wary of using non public domain DNS servers for various reasons12:06
donnydi think it uses google right now12:07
donnydbut yea all the jobs seem to be resolving the mirror to a v4 addr12:07
fungiahh, right, we stopped using ipv6 per https://review.opendev.org/675156 because of excessive retransmits12:08
fungiif there's a chance that's gotten better we could just revert it12:08
donnydahhhhh... yea that makes mo sense12:08
donnydwell  I need to replace my edge router anyways12:08
donnydI was hoping for a time that wouldn't impact the jobs12:09
openstackgerritClint 'SpamapS' Byrum proposed zuul/zuul-jobs master: intercept-job -- self-service SSH access  https://review.opendev.org/67930612:09
*** rh-jelabarre has joined #openstack-infra12:10
donnydbut fungi I think I may know the reason for the retransmits, I need to provide a lower MTU for v6 to the tenant12:10
fungioh, interesting12:11
fungibecause it's tunneled?12:11
donnydneeds to be 145012:11
donnydyep12:11
donnydAt least that is what I am thinking12:11
fungimakes sense. anyway, i need to disappear to knock out some more storm cleanup before it gets too hot out, but will be back around soon12:11
donnydthanks fungi12:11
*** rh-jelabarre has quit IRC12:12
fungiyw!12:12
*** rh-jelabarre has joined #openstack-infra12:12
*** jamesmcarthur has joined #openstack-infra12:13
*** goldyfruit_ has quit IRC12:16
openstackgerritClint 'SpamapS' Byrum proposed zuul/zuul-jobs master: Add upload-logs-s3 role to send logs to S3  https://review.opendev.org/68173012:17
*** ykarel|afk is now known as ykarel12:23
donnydrebooting the mirror to pick up the new mtu12:23
*** jamesmcarthur has quit IRC12:24
*** owalsh is now known as owalsh_brb12:27
donnydOk all the jobs are done, replacing edge router now12:28
openstackgerritAndreas Jaeger proposed openstack/project-config master: Revert "Disable fortnebula cloud"  https://review.opendev.org/68173112:29
AJaegerthanks, donnyd. Let me prepare a change once you're ready ^12:30
donnydkk12:30
donnydthanks AJaeger12:30
*** iurygregory has joined #openstack-infra12:30
AJaegerdonnyd: Please tell us when you feel that everything is fine and we can merge it.12:31
*** rlandy has joined #openstack-infra12:31
*** elod has quit IRC12:32
*** elod has joined #openstack-infra12:33
*** eharney has joined #openstack-infra12:36
*** soniya29 has joined #openstack-infra12:36
*** jpena|lunch is now known as jpena12:38
*** derekh has quit IRC12:42
*** mriedem has joined #openstack-infra12:43
*** owalsh_brb is now known as owalsh12:43
*** e0ne has quit IRC12:44
*** e0ne has joined #openstack-infra12:44
*** larainema has quit IRC12:45
*** jamesmcarthur has joined #openstack-infra12:48
*** ociuhandu has quit IRC12:50
*** soniya29 has quit IRC12:50
fricklercorvus: fungi: regarding daily digests on lists.o.o, I have now manually tested running the senddigests cron with MAILMAN_SITE_DIR being set properly and it generated a release-announce digest as expected12:53
*** mriedem has quit IRC12:54
fungiAJaeger: donnyd: should we then also revert https://review.opendev.org/675156 now that the new mtu is in place?12:54
fricklerI have manually created /etc/cron.d/mailman-senddigest-sites that should take over starting tomorrow. a full solution would add creating that cron file to https://opendev.org/opendev/puppet-mailman/src/branch/master/manifests/site.pp but I don't know enough puppet for that, in particular staggering the start times12:54
*** soniya29 has joined #openstack-infra12:54
frickler#status log generated /etc/cron.d/mailman-senddigest-sites on lists.o.o in order to re-enable daily digests being sent12:55
openstackstatusfrickler: finished logging12:55
*** derekh has joined #openstack-infra12:56
*** ociuhandu has joined #openstack-infra12:58
*** derekh has quit IRC12:59
*** mriedem has joined #openstack-infra13:00
*** derekh has joined #openstack-infra13:00
donnydfungi I'm thinking it wouldn't be a bad idea and we can watch to see if it's still an issue13:04
*** jcoufal has joined #openstack-infra13:09
*** Goneri has joined #openstack-infra13:14
iceyI'm seeing a lot of POST_FAILUREs recently13:15
openstackgerritMatt Riedemann proposed opendev/elastic-recheck master: Add query for configparser 4.0.1 missing bug 1843715  https://review.opendev.org/68174113:18
openstackbug 1843715 in OpenStack-Gate "CI jobs failing Sept 11 due to "ERROR: No matching distribution found for configparser===4.0.1"" [Undecided,New] https://launchpad.net/bugs/184371513:18
*** aaronsheffield has joined #openstack-infra13:18
*** dolpher has joined #openstack-infra13:24
*** njohnston has joined #openstack-infra13:27
*** pcaruana has quit IRC13:30
*** aedc has quit IRC13:31
*** aedc has joined #openstack-infra13:32
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-tarball: introduce zuul_use_fetch_output  https://review.opendev.org/68160313:32
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-output: introduce zuul_use_fetch_output  https://review.opendev.org/68174813:32
*** efried is now known as efried_afk13:36
donnydicey: do you have links for a job?13:37
donnydfungi: AJaeger it should be back up13:37
iceydonnyd: https://review.opendev.org/669617 https://review.opendev.org/67825213:37
iceyboth hit it in the last few minutes13:37
donnydicey: its likely because FN swift logs were still in operation and I just replaced the edge rtr13:38
*** aedc has quit IRC13:38
donnydits back up now13:38
iceythanks donnyd - will recheck :)13:39
fungiahh, oops, i should have thought to suggest disabling log uploads to there temporarily13:39
*** aedc has joined #openstack-infra13:39
donnydWe should be good to go now13:39
*** ricolin has joined #openstack-infra13:39
*** markvoelker has quit IRC13:40
*** ociuhandu has quit IRC13:41
*** markvoelker has joined #openstack-infra13:42
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-tarball: introduce zuul_use_fetch_output  https://review.opendev.org/68160313:44
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-output: introduce zuul_use_fetch_output  https://review.opendev.org/68174813:44
openstackgerritMerged opendev/elastic-recheck master: Add query for configparser 4.0.1 missing bug 1843715  https://review.opendev.org/68174113:45
openstackbug 1843715 in OpenStack-Gate "CI jobs failing Sept 11 due to "ERROR: No matching distribution found for configparser===4.0.1"" [Undecided,Fix released] https://launchpad.net/bugs/184371513:45
donnydfungi: I do have one more option and that is to use an Openstack router for the zuul tenant13:47
*** happyhemant has quit IRC13:47
donnydit would work the same way, the traffic would just be NAT by an Openstack router13:47
donnydit would eliminate the edge router entirely13:47
fungiwell, if ipv6 works then there wouldn't be any nat needed between the nodes and the mirror13:48
donnydand may be a really good option as the edge router is processing lots of rules, where the Openstack router would just need rules for that single tenant13:48
donnydand states13:48
donnydI agree13:48
donnydI will watch the logs for ipv6 on the edge rtr13:49
donnydthat is when the DNS resolution is merged13:49
*** KeithMnemonic has joined #openstack-infra13:53
*** goldyfruit_ has joined #openstack-infra13:53
openstackgerritThierry Carrez proposed opendev/system-config master: [opendev][gitea] Add a link to open changes  https://review.opendev.org/68175313:54
*** goldyfruit_ has quit IRC13:54
*** tkajinam has joined #openstack-infra13:55
*** goldyfruit has joined #openstack-infra13:56
tkajinamHi. Can I ask a question about the usage of gerrit/zuul here?13:57
*** hamzy has quit IRC13:58
*** ykarel is now known as ykarel|afk13:59
*** ociuhandu has joined #openstack-infra14:00
*** whoami-rajat has joined #openstack-infra14:01
*** jbadiapa has quit IRC14:04
*** jbadiapa has joined #openstack-infra14:07
*** pcaruana has joined #openstack-infra14:08
*** goldyfruit_ has joined #openstack-infra14:09
*** aedc has quit IRC14:09
*** aedc has joined #openstack-infra14:10
*** goldyfruit has quit IRC14:12
*** aedc has quit IRC14:12
*** aedc has joined #openstack-infra14:13
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-output: introduce zuul_use_fetch_output  https://review.opendev.org/68174814:14
*** soniya29 has quit IRC14:20
smcginnistkajinam: There are likely folks here that can answer, but there is also a #zuul channel if it is not related to OpenStack/OpenDev usage of Zuul.14:21
tkajinamsmcginnis, thanks14:22
tkajinamI'm currently looking for the way to retry gate job for the patch14:23
*** mriedem has quit IRC14:23
smcginnistkajinam: That is done by leaving a comment on the patch that starts with "recheck".14:23
tkajinamI know that putting "recheck" on the gerrit can work here, but it can trigger the execution also for check job14:23
tkajinamI remember we had something like reverify just to rerun only gate job. Is it still valid?14:24
smcginnistkajinam: We used to have a way to only retrigger gate jobs, but that led to some issues. Now there is only recheck and it needs to go through check and gate again.14:24
donnydok I am pretty happy,  I think we can go ahead and put FN back online14:24
smcginnisA little more overhead, but it ensures we are not missing issues.14:24
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-output: introduce zuul_use_fetch_output  https://review.opendev.org/68174814:25
tkajinamsmcginnis, Understood. Thank you for your confirmation!14:25
smcginnisNo problem14:25
*** hamzy has joined #openstack-infra14:27
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: zuul-test: update jenkins-job-builder location  https://review.opendev.org/68176414:30
*** mriedem has joined #openstack-infra14:32
dtantsursmcginnis: a huge overhead, you meant? :)14:33
dtantsurI really miss the reverify button..14:33
* dtantsur issues the 8th recheck of a critical gate-fixing patch14:33
dtantsuranother feature request would be a fail-fast mode14:34
smcginnisdtantsur: Fail fast: abandon the patch, then restore it. ;)14:34
dtantsursmcginnis: that's what I'm about to do, yes14:35
pabelangerdtantsur: zuul supports it, I don't believe there is a want to enable it14:35
dtantsursigh14:35
smcginnisdtantsur: I do miss reverify as well though. Especially when the gate gets so backed up during these milestone times.14:35
dtantsursometimes I want to have fewer voting CI jobs..14:35
dtantsuranywa14:35
dtantsuris anybody else observing POST_FAILUREs in unit tests? we're seeing them a lot.14:35
pabelangerdtantsur: if critical, you can also discuss with infra-root to enqueue into gate directly. But usually try to avoid that14:35
dtantsurwe cannot merge anything for days. and our jobs keep using a lot of disk space on testing nodes.14:36
pabelangerPOST_FAILURE usually means issue with upload of logs, maybe swift related14:36
pabelangerdtantsur: I think clarkb was looking into that14:36
dtantsurpabelanger: yep, we have a temporary fix for the problem clarkb discovered, but we cannot merge it14:37
pabelangerdtantsur: have link?14:37
dtantsurpabelanger: https://review.opendev.org/#/c/680652/14:37
clarkbclean check exists to help prevent flaky code from landing14:37
pabelangerit looks like maybe swift issue again14:38
clarkbpabelanger: I think it was mentioned that FN swift was off temporarily?14:38
fricklerpabelanger: we had a short issue with fn swift when donnyd changed the router14:38
pabelangerkk, might have missed that14:38
pabelangerI'm going to spend some time working on retries for upload-logs-swift14:39
fricklersadly we missed turning off that swift site, only nodepool was/is off14:39
clarkbdtantsur: but as pabelanger says we can directly enqueue gate fixes to try and cut down on that flakyness14:39
clarkbdtantsur: I think configparser error threw everything into a mess last night though14:39
dtantsuryeah14:39
dtantsurclarkb: it would be great to try it with this patch. with Murphy's laws in full effect, we've hit all sorts of transient problems with it.14:40
clarkbI'm not caught up yet but if configparser fix isnt landed yet we need to start there14:41
clarkbthen keystone and nova also hadbug fixes weshould enqueue if they havent landed14:41
clarkband neutrons functional job doesnt test speculative future states14:41
clarkbslaweq: ^ fyi that needs fixing14:42
* dtantsur loves release time14:42
*** aedc has quit IRC14:43
slaweqclarkb: sorry but I don't understand what "speculative future states"14:43
fungidtantsur: and to clarify, some of our zuul tenants do indeed enqueue directly into their gate pipeline (and cancel/abort any pending or running check pipeline jobs corresponding to the change) immediately upon change approval. the openstack tenant is not configured for that behavior because we observed a propensity for buggy changes to get approved into shared gate queues with projects running very14:43
fungilengthy jobs, which caused significant additional disruption and delays merging changes14:43
dtantsuryeah, I understand that14:44
clarkbslaweq: last night I promoted the config parser change to the head of the gate. Then neutron functional python27 change next in the queue failed becauseit used master requirements and not the change ahead of it in the gate14:44
fungislaweq: neutron's functional job is not using (at least) a copy of the upper-constraints.txt file provided by zuul and is probably independently retrieving it from a git remote, resulting in it not being tested wit the state of that file provided by changes ahead of it in a dependent pipeline (would also break use of depends-on to such requirements repo changes)14:45
clarkbslaweq: the expectation is that jobs will use the unmerged zuul future state and test that otherwise our integrated gate testing isnt valid14:45
dtantsurI'd just like to have an option for situations like this14:45
*** exsdev has quit IRC14:45
slaweqclarkb: fungi: ok, I think I understand now14:46
*** exsdev0 has joined #openstack-infra14:46
*** exsdev0 is now known as exsdev14:46
fungidtantsur: absolutely, the option is to ask us to enqueue changes into the gate skipping check results and, if especially critical, promote them to the front so all changes already in the gate are not impacted by whatever those fix14:46
slaweqclarkb: fungi: I will take a look into that and will get back to You if I will have any questions14:46
*** armax has joined #openstack-infra14:47
clarkbslaweq: thank you and happy to answer questions14:47
dtantsurfungi: got it. so could you enqueue https://review.opendev.org/#/c/680652/ into the gate once it's clear that we're no longer affected by the configparse thing?14:47
*** ykarel|afk is now known as ykarel|away14:48
fungii think the configparser blacklist change landed in requirements hours ago, but i'll double-check14:48
*** hamzy has quit IRC14:48
fricklerianw force-merged it14:48
clarkbya its in git history now14:48
*** hamzy has joined #openstack-infra14:49
fricklerhttps://review.opendev.org/68163014:49
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-output: introduce zuul_use_fetch_output  https://review.opendev.org/68174814:50
clarkbI'm impressed that the upper constraints update merged so easy and quick14:52
fungiokay, then i'll skip 680652 into the gate pipeline14:52
clarkbthen as soon aswe have to revert its all pain14:52
*** bdodd has joined #openstack-infra14:53
*** ykarel|away has quit IRC14:53
donnydOk I think the proper way forward for FN is moving the tenant router to Openstack so connections to the mirror don't have to go all the way up and then back down the FN. This should solve v4 related issues14:54
fungi--tenant=openstack --project=openstack/ironic --change=680652,3 is now in the gate14:54
fungilooks like ironic has its own gate queue so there's no need to promote that change14:54
donnydthe v6 related issues should be solved via the reduction in MTU for the tenant network14:54
fungiit's the only ironic change in the queue currently14:55
donnydI think maybe we should separate out the jobs that currently depend on FN into a separate pool so the jobs that need to run there can and the generic jobs can remain disabled until everyone is happy14:57
*** pcaruana has quit IRC14:57
clarkblooks like both the keystone and nova fixes landed14:58
clarkbmriedem: how important is that stein backport of the fix?14:59
dtantsurthanks fungi. now fingers crossed for no more transient problems..14:59
mriedemclarkb: let me check logstash15:00
mriedemclarkb: it's just nova functional tests so it won't break master by way of grenade, and logstash says it's only hitting on master15:01
mriedemi did the backport b/c the test was obviously racy from inspection but for whatever reason something is tickling that now on master15:02
donnydDoes anyone have any issues with turning the custom jobs for FN back on clarkb fungi AJaeger15:02
*** pgaxatte has quit IRC15:02
openstackgerritDonny Davis proposed openstack/project-config master: Re-enable FN for custom jobs  https://review.opendev.org/68177315:02
clarkbdonnyd: I dont. Are we thinking use that to show its stable then add the other jobs bavk in?15:02
donnydclarkb: it will be difficult to T/S the issues with no load because they don't appear until there is a full load15:03
donnydbut I also don't think we should disable the jobs that are using FN from a custom resource perspective15:04
donnydmakes sense to me, so hopefully everyone else see's it the same way15:04
clarkbsure I'm just wondering what the plan is for enabling the other jobs15:04
*** weshay is now known as weshay_passport15:05
*** armstrong has joined #openstack-infra15:05
donnydclarkb: I would imagine slowly15:05
clarkbgotcha so add nodes back in at a measured pace. wfm15:06
donnydAnd closely monitor the network to see if the retransmit issues come back15:06
*** tkajinam has quit IRC15:06
donnydyea, if we keep the job count low to begin with I can monitor the network to see if there are issues and if so it won't hit a ton of jobs if the issue is still there15:07
donnydI think a good start is to re-enable the custom jobs so we can not hold back those who are depending on it15:07
*** noama has quit IRC15:08
donnydso if we could abandon https://review.opendev.org/#/c/681731/ and merge https://review.opendev.org/#/c/681773/ we would be there15:08
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-output: introduce zuul_use_fetch_output  https://review.opendev.org/68174815:09
AJaegerdonnyd: I abandoned 681731, not sure about the "custom" 681773, want to have infra-root look at it.15:10
donnydsure that makes sense to me15:10
donnydIf we need that name changed to something that makes more sense then I am happy to re-submit15:10
clarkbI'm making tea then will review15:11
* AJaeger will cycle a bit now...15:11
openstackgerritMerged zuul/zuul-jobs master: zuul-test: update jenkins-job-builder location  https://review.opendev.org/68176415:12
*** hamzy has quit IRC15:14
*** hamzy has joined #openstack-infra15:14
*** spsurya has quit IRC15:16
*** njohnston has quit IRC15:19
*** njohnston has joined #openstack-infra15:19
*** guimaluf has quit IRC15:21
clarkbdonnyd: AJaeger change lgtm15:22
*** gyee has joined #openstack-infra15:23
*** jamesmcarthur has quit IRC15:23
*** lpetrut has quit IRC15:25
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-output: introduce zuul_use_fetch_output  https://review.opendev.org/68174815:26
Shrewsdonnyd: +Ad. the pool name can be any arbitrary string15:28
clarkbslaweq: https://606e7073949a555d6ce7-a5a94d16cd1e63fdf610099df3afaf88.ssl.cf5.rackcdn.com/679813/2/gate/neutron-functional-python27/e20cb78/job-output.txt is the example job where I noticed functional wasn't testing with the proposed requirements update15:32
clarkb(finally able to dig that out of my scrollback)15:32
clarkbslaweq: looks like the openstack-tox-python27 job may suffer the same problem in neutron15:33
clarkboh hrm maybe not I think the one I found for openstack-tox-python27 was due to the requirements update fialing and getting removed from the gate queue15:34
yoctozeptohey infra, kolla going ipv6 has issues with centos somehow not getting ipv6 address, see: https://c66a5275cc7b20f05ed9-882fbdc9765e1a9e81809c5a97a4ce6a.ssl.cf1.rackcdn.com/681573/6/check/kolla-ansible-centos-source-ipv6/f107f39/job-output.txt15:36
clarkbslaweq: I think we need to ensure that UPPER_CONSTRAINTS_FILE is set in your functional test https://opendev.org/openstack/neutron/src/branch/master/tox.ini#L1715:36
yoctozeptoubuntu gets it fine fine15:36
*** jamesmcarthur has joined #openstack-infra15:36
*** ykarel has joined #openstack-infra15:37
clarkbyoctozepto: not every cloud has a public ipv6 address15:37
yoctozeptoclarkb: should not nodepool public_ipv6 be empty then?15:37
clarkband in some clouds that depends on the base image because of whether or not that platform supports the ip configuration necessary in that cloud15:37
yoctozeptoah, so we might be doomed with respect to centos then15:37
clarkbI think in this case rackspace gives us an ipv6 address but we don't support rackspace ipv6 ip assignment on centos/fedora15:38
yoctozeptosad news for us, any way to resolve this soon? ipv6 in centos works pretty well normally15:39
openstackgerritMerged openstack/project-config master: Re-enable FN for custom jobs  https://review.opendev.org/68177315:39
openstackgerritMerged zuul/zuul master: Fix: prevent usage of hashi_vault  https://review.opendev.org/68104115:39
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-output: introduce zuul_use_fetch_output  https://review.opendev.org/68174815:39
clarkbyoctozepto: you could add support to glean for configuring ipv6 statically via config drive on centos15:39
clarkbyoctozepto: out of curiousity why does it matter?15:39
clarkbyou can test the ipv6 goal without global ipv6 connectivity15:40
rosmaitacan someone approve this devstack-gate patch that has one +2: https://review.opendev.org/#/c/679610/1 ... thanks!15:40
clarkb(in fact I impressed upon the TC that if they went with the goal they would have to make that possible bceause some clouds just don't ipv6)15:40
clarkbrosmaita: done. We really need to get that data out of d-g and into devstack proper15:40
rosmaitaclarkb: ty15:41
openstackgerritSean McGinnis proposed openstack/project-config master: Tighten formatting for new branch reno page title  https://review.opendev.org/68178515:41
*** shachar has joined #openstack-infra15:42
*** snapiri has quit IRC15:42
*** rpittau is now known as rpittau|afk15:44
clarkbslaweq: the zuul tox role (which is used by the unittest tox jobs, but I'm not sure about the neutron-functional job as that parents to devstack-minimal instead) accepts this tox_constraints_file var to set that value https://opendev.org/openstack/openstack-zuul-jobs/src/branch/master/zuul.d/jobs.yaml#L8315:44
yoctozeptoclarkb: can we do multinode with private IPv6 addressing - there seem to be none provided?15:45
clarkbyoctozepto: yes, multinode jobs can put ipv6 addresses on the multinode network overlay. I believe octavia does this15:46
clarkbjohnsom: ^ can you confirm? I believe that was somethign you were working on at the last ptg15:46
fungidonnyd: sorry for the delay, but sounds fine to me. been quite a busy thursday for me so far, apologies!15:46
openstackgerritMerged zuul/zuul master: Pass zuul_success to cleanup playbooks  https://review.opendev.org/68155215:47
yoctozeptoclarkb: you mean create our own overlay then?15:47
clarkbyoctozepto: "yes" there exist zuul roles to do it for you already. I don't remember if ipv6 is already done on them or not15:47
clarkbjohnsom: ^ likely knows more about that than me at this point15:47
yoctozeptoclarkb: I see, thanks15:47
donnydnp fungi15:47
clarkbor rm_work ^ do you remember details on setting up ipv6 on the test overlay network for octavia?15:48
johnsomclarkb It hasn't been stable for us frankly15:48
clarkbjohnsom: in what way?15:48
johnsomSometimes the IPv6 doesn't seem to pass between the nodes.15:48
clarkbjohnsom: does ipv4? (that could be a bug in the ovs implementation?)15:48
johnsomYes, the IPv4 seems stable.15:49
clarkbinteresting15:49
johnsomI haven't had time to figure out *why* yet. This is why our multi-node jobs are still non-voting15:49
fungicould it be an mtu issue then? maybe pmtud is working for v4 but something is not getting the path mtu correct for v6?15:49
clarkbin any case we don't have ipv6 in every cloud so you'll have to figure this out regardless of centos not supporting static ipv6 in clouds that do support ipv615:50
yoctozeptothanks guys for the insights15:50
clarkbfungi: I don't think we can rely on pmtud here because some hops don't have ip addresses15:50
clarkbfungi: but there is a good chance ipv6 mtus are at fault15:50
johnsomyoctozepto https://github.com/openstack/octavia-tempest-plugin/blob/master/zuul.d/jobs.yaml if that is any help for you15:50
johnsomYeah, it seems to be in the neutron layer. Really I can't talk much to it as I haven't really dug into the problem yet.15:51
fungiclarkb: ahh, right, no source to generate the need-to-frag message with15:51
clarkbslaweq: I think you needt o apply https://opendev.org/openstack/openstack-zuul-jobs/src/branch/master/zuul.d/jobs.yaml#L82-L83 to https://opendev.org/openstack/neutron/src/branch/master/playbooks/run_functional_job.yaml#L10 and also add openstack/requirements to the job's required proejcts list15:52
*** hamzy_ has joined #openstack-infra15:52
*** hamzy has quit IRC15:52
yoctozeptojohnsom: no idea where to look in that file, though mgoddard has proposed vxlan for kolla in this patch: https://review.opendev.org/67069015:53
yoctozeptoseems we are not using any zuul stuff in there15:53
yoctozeptomgoddard to confirm15:53
*** kjackal has quit IRC15:53
yoctozeptomaybe we can use that for ipv6, sure15:53
mgoddardI didn't use the zuul roles to avoid dependency on OVS15:54
mgoddardlinux bridge worked fine15:54
*** kjackal has joined #openstack-infra15:54
clarkbmgoddard: fwiw we used OVS because linux bridge did not work with vxlan for the longest time and GRE traffic isn't passed in some clouds15:54
clarkbif linux bridge + vxlan works where you need it then it should be fine15:54
donnydyoctozepto: if you can build a job that can use one of the custom FN labels then you can find out15:54
clarkbthe original implementation of all of this was linux bridge + GRE as a lowest common denominator for linux support15:55
clarkbthen we discovered some clouds block gre traffic even if youtell security groups not to15:55
mgoddardalso went for a simpler topology with one shared overlay rather than the bridge plus tunnel approach in zuul15:55
clarkbmgoddard: doesn't that require multicast?15:55
clarkbmgoddard: the reason its a bridge plus tunnel appraoch is we can only do point to point overlays without multicast15:56
mgoddardnot if you add static fdb entries15:56
mgoddardhttps://review.opendev.org/#/c/670690/10/tests/run.yml15:56
donnydto at least make sure the job lands somewhere with ipv615:56
clarkbugh15:56
mgoddardbridge fdb append 00:00:00:00:00:00 dev vxlan0 dst {{ dest_ip }}15:56
clarkbso you are broadcasting always?15:56
mgoddardno15:57
mgoddardthat's just a catch all for BUM traffic15:57
mgoddardit will learn unicast15:58
clarkbI see it is a rule for broadcast frames15:58
mgoddardright15:58
mgoddardif it works I could tidy up and push into zuul15:59
clarkbfwiw improvements to the existing roles and discussion about them are welcome :)15:59
mgoddardyeah it's still a PoC15:59
clarkbok. The big two gotchas were linux bridge (at the time) could not vxlan hence ovs (and we can't gre in some clouds). And the decision to use a bridge was made because multicast doesn't work either16:00
mgoddards/if it works//16:00
mgoddardI'm pretty sure it does work :)16:00
clarkbthe bridge allows us to do point to point and tie everything together. It looks a lot like a real switch on a rack if you squint and pretend :)16:00
mgoddardmakes sense, given the constraints16:01
*** diablo_rojo has joined #openstack-infra16:02
*** ykarel is now known as ykarel|away16:02
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-output: introduce zuul_use_fetch_output  https://review.opendev.org/68174816:02
mgoddardI'm not actively working on it right now, but if/when I (or yoctozepto) do, I'll look at how to get it into the zuul role16:02
*** dave-mccowan has joined #openstack-infra16:03
*** ricolin has quit IRC16:03
*** elod has quit IRC16:04
*** ricolin has joined #openstack-infra16:04
*** ricolin has quit IRC16:05
*** elod has joined #openstack-infra16:05
clarkbfwiw I'd personally much prefer linux bridge simply because tcpdump works out of the box with it16:06
clarkb(I'm always amazed at how painful it is to tcpdump an ovs interface)16:06
slaweqclarkb: thx for links, I will check that later today or tomorrow morning16:07
*** e0ne has quit IRC16:08
*** dtantsur is now known as dtantsur|afk16:09
*** ociuhandu has quit IRC16:13
*** ccamacho has quit IRC16:16
donnydclarkb: shouldn't nodepool be showing 10 max from turning on the separate pool?16:17
donnydhttp://grafana.openstack.org/d/3Bwpi5SZk/nodepool-fortnebula?orgId=1&from=now-3h&to=now16:17
zbrclarkb: https://review.opendev.org/#/c/677971/ info-banner? thanks.16:17
clarkbdonnyd: it is for me16:17
donnydI just needed to refresh my page i guess. doh16:18
*** electrofelix has quit IRC16:19
clarkbzbr: in that change, looking at it again, do we haev to do anything to ensure the ansible_distribution fact is gathered? we explicitly gather the python info16:22
openstackgerritDavid Shrewsbury proposed zuul/nodepool master: Log new image upload external ID  https://review.opendev.org/68181216:24
zbrclarkb: AFAIK, zuul collects default facts on all nodes so we should be safe.16:24
pabelangerthat isn't 100% true, for localhost we try not too (to avoid leaking data)16:25
zbrclarkb: but what I did in other cases was to add a conditional to run "setup" at the start of the role when one of used facts was not defined, can do that too.16:25
pabelangerso we need to be careful about adding that to emit-job-header, which runs on localhost16:25
pabelangerthat would be my concern using setup their, data bout executor leaks some how16:26
pabelangerthere*16:26
clarkbpabelanger: can you leave a comment on the change about that?16:26
pabelangersure, will do that shortly16:26
clarkbty16:26
zbrhaha, already did it https://review.opendev.org/#/c/677971/8/roles/emit-job-header/tasks/main.yaml16:28
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-output: introduce zuul_use_fetch_output  https://review.opendev.org/68174816:28
*** DinaBelova has quit IRC16:30
*** goldyfruit_ has quit IRC16:32
*** udesale has quit IRC16:32
*** dave-mccowan has quit IRC16:34
clarkbdigging into the swift upload log error we noticed in rax. I see only one occurence of that in the last 10 hours16:34
clarkbpabelanger: ^ fyi since you had mentioned it earlier16:34
clarkbdefinitely something to get more debugging for, but not majorly catastrophic16:34
clarkbI determined this via grep of the current executor debug logs16:34
*** mattw4 has joined #openstack-infra16:34
*** dave-mccowan has joined #openstack-infra16:37
*** DinaBelova has joined #openstack-infra16:39
zbrpabelanger: re info-banner, i really doubt that collecting "gather_subset: python" is leaking any secrets, or maybe we want to keep a secret the python version being used?16:44
jrosserare the infra mirrors getting packages for debian buster?16:45
*** gfidente has quit IRC16:46
*** igordc has joined #openstack-infra16:46
clarkbjrosser: http://mirror.dfw.rax.openstack.org/debian/dists/buster/ says yes. and http://mirror.dfw.rax.openstack.org/debian/timestamp.txt says that was last updated today16:46
jrosserhttp://mirror.dfw.rax.opendev.org/debian/pool/main/e/exim4/exim4-config_4.92-8_all.deb  404  Not Found16:46
*** pkopec has quit IRC16:48
*** goldyfruit_ has joined #openstack-infra16:48
jrosseri have a log here https://openstack.fortnebula.com:13808/v1/AUTH_e8fd161dc34c421a979a9e6421f823e9/zuul_opendev_logs_f66/681777/5/check/openstack-ansible-deploy-aio_lxc-debian-stable/f66a770/job-output.txt16:48
clarkbhttp://mirror.dfw.rax.openstack.org/debian/pool/main/e/exim4/exim4-config_4.92-8+deb10u1_all.deb exists16:49
jrosserhmm16:50
clarkbreprepro is supposed to produce a valid index16:51
clarkbI wonder where the other name comes from16:51
fungicould a long-running job have straddled a delete?16:53
fungiwould have to be long enough to have done an apt install/upgrade from data two pulses after it did an apt update, in theory16:54
fungior it's doing an apt install without an apt update first and so relying on a package index contemporary with when the image was generated?16:55
clarkbfungi: oh ya that could be16:55
clarkbjrosser: ^16:55
jrosserah so check that we have an apt-update at some point before it does that16:55
clarkbthe role that is doing this is very rhel centric16:55
clarkbeven the debian package list is called the rhel7 package list16:56
clarkband with yum you don't need to update explicitly first16:56
fungiyikes16:56
clarkbhttps://opendev.org/openstack/ansible-hardening/src/branch/master/vars/debian.yml#L53 note the filename16:56
*** efried_afk is now known as efried16:56
*** weshay_passport is now known as weshay16:57
jrosserit does apt update at 2019-09-12 16:36:07.89358816:58
jrosserand again at 2019-09-12 16:36:24.52311016:58
clarkbmight need to check the index files then?17:00
clarkbsee where its gone wrong17:00
fungiyeah, i can confirm that's definitely a 404 currently17:00
*** derekh has quit IRC17:00
* jrosser has to travel, thanks for the help, bbl17:01
fungithose packages were deleted in the pulse which vos released at 2019-09-07T18:50:02,415682984+00:00 but it errored with "VLDB: vldb entry is already locked"17:03
fungii wonder if we later manually performed a vos release of a broken volume or something17:03
clarkbfungi: ya maybe worth rerunning to see if the vos release from today is good17:04
fungithough we're currently updating successfully as of 2019-09-12T16:53:43,137317847+00:0017:04
fungithe last broken vos release was 2019-09-09T22:49:50,220254695+00:0017:05
fungithe first successful index regen and vos release was 2019-09-10T18:58:49,163568873+00:0017:07
*** xarses has quit IRC17:08
*** xarses has joined #openstack-infra17:08
*** hamzy_ has quit IRC17:09
*** hamzy_ has joined #openstack-infra17:11
*** ociuhandu has joined #openstack-infra17:11
*** armax has quit IRC17:14
*** pkopec has joined #openstack-infra17:15
*** jpena is now known as jpena|off17:15
*** ociuhandu has quit IRC17:16
*** soniya29 has joined #openstack-infra17:16
openstackgerritClark Boylan proposed zuul/zuul-jobs master: Log swift upload tracebacks with ansible  https://review.opendev.org/68184317:16
fungiunfortunately http://files.openstack.org/mirror/debian/dists/buster/main/binary-amd64/Packages.gz does indicate there should be a Version: 4.92-8 for Package: exim4-base but the last modified timestamp on the index is 2019-07-03 22:4817:17
openstackgerritMerged zuul/nodepool master: Log new image upload external ID  https://review.opendev.org/68181217:17
fungiso i wonder if reprepro isn't writing the package indices where we expect them for some reason17:17
openstackgerritDonny Davis proposed openstack/project-config master: Slowly scale FN back up  https://review.opendev.org/68184517:17
clarkbfungi: the package is exim4-config fwiw17:17
fungiexim4-base also failed to download17:18
clarkbah17:18
*** ykarel|away has quit IRC17:19
fungibasically all the binary packages built from the exim4 source package are disjunct because there's been an update since then17:19
fungiohh17:19
*** ykarel|away has joined #openstack-infra17:19
fungii bet today is when buster-updates activated17:20
fungithe buster and buster-updates suites share a common package pool17:20
*** ralonsoh has quit IRC17:20
fungihttp://files.openstack.org/mirror/debian/dists/17:20
fungii wonder if reprepro is incorrectly configured to limit the number of package versions it keeps17:20
fungiand so we're seeing this break because jobs are using the buster suite but not the buster-updates suite (which contains newer exim4 packages they would grab instead if they knew about them)17:21
sean-k-mooneyam what is the sate of vexxhost in ci currently17:21
clarkband that is because we disabled updates as it didn't exist yet?17:21
clarkbsean-k-mooney: I think it is leaking boot from volume root disks and we run out of quota17:22
clarkbsean-k-mooney: on top of that the quota tracking system seems to think we use 2x what we actually use for volumes17:22
corvusclarkb: re testing swift upload -- i don't have a great idea for that; everything else is testable from the cli except for that function which has the ansible wrapper.  given that it's an exception handler, i think we can look real close to make sure it has all the closing parens, then throw it into prod and wait for it to hit.17:22
fungiclarkb: we didn't disable updates, repropro was simply refusing to generate any empty indices for buster-updates17:23
sean-k-mooneyclarkb: ok :) that would explain the node_failures17:23
clarkbcorvus: k I await your real close inspection then :)17:23
openstackgerritMerged openstack/devstack-gate master: Prepare for stable branching in devstack-gate  https://review.opendev.org/67961017:23
corvusclarkb: (if it were the non-error case, i'd say make a test-zuul-swift-upload role and do the base-test dance with it, but that wouldn't really help here)17:23
clarkbsean-k-mooney: Shrews has been workign to debug if the volume leaks are on our end17:23
fungii think projects worked around the lack of buster-updates suite by omitting it from their sources.list files17:23
fungibut now it exists, and there are newer package versions, and repropro is configured to only keep the new packages even though they technically belong to a different suite17:24
sean-k-mooneyclarkb: while FN was offline i moved a some other temp job to vexhost only flavors. ill swap them to either limestone or FN17:24
corvusclarkb: my mental python parser caught an error :)17:25
sean-k-mooneythese specific temp jobs need nested vert but noting else special17:25
clarkbcorvus: perfect :)17:25
sean-k-mooney*lables not flavors17:25
openstackgerritClark Boylan proposed zuul/zuul-jobs master: Log swift upload tracebacks with ansible  https://review.opendev.org/68184317:25
logan-sean-k-mooney: regarding the numa flavors, can you give me specs what I need to add for that?17:25
*** soniya29 has quit IRC17:26
sean-k-mooneysure one sec its basically just hw:numa_node=2 but i think i have alink to the ones that donnyd created17:27
AJaegerclarkb: you reviewed https://review.opendev.org/#/c/678573/ previously, should I update following the comments by frickler ? Or what's your take on it?17:28
fungiconfirmed, looking in http://files.openstack.org/mirror/debian/pool/main/e/exim4/ it's only providing the versions with 4.92-8+deb10u1 versions which are for debian 10 update 1 in buster-updates17:29
sean-k-mooneylogan-: https://www.irccloud.com/pastebin/FWxMEIqc/17:29
sean-k-mooneyso the last two are the main ones17:29
funginot the 4.92-8 versions from buster17:29
*** jcoufal has quit IRC17:29
sean-k-mooneythe first one was a hack to make sure it worked17:29
*** e0ne has joined #openstack-infra17:29
logan-ok, got it, thanks17:29
sean-k-mooneylogan-: if you can only provdie the 8G one that is ok too i plan to move to only that one in the future17:30
sean-k-mooneywhile testing the job the extra ram is useful on the contoler but i think i can optimise down17:30
clarkbAJaeger: frickler huh frickler seems to be correct. I guess that is a bug in python's docs?17:31
clarkbAJaeger: frickler I think we should catch OSError in that case17:31
sean-k-mooneylogan-: the other thing i want to add is a nodepool lable for nest-virt-ubuntu-bionic. if we had that common acorss FN limesotne and vexhost we coudl more reliable run jobs arouss all tree. that lable would jsut be an alise for the normal bionic one with no extraspecs added17:32
AJaegerthanks, clarkb - will try to update17:33
*** xek has joined #openstack-infra17:34
openstackgerritClark Boylan proposed zuul/zuul-jobs master: Log swift upload tracebacks with ansible  https://review.opendev.org/68184317:37
*** jamesmcarthur has quit IRC17:38
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-puppet-module-output: introduce zuul_use_fetch_output  https://review.opendev.org/68185517:38
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add tests for manifest generation for missing files  https://review.opendev.org/67857317:39
openstackgerritDavid Shrewsbury proposed zuul/nodepool master: Do not overwrite image upload zk data on delete  https://review.opendev.org/68185717:40
Shrewscorvus: Can you think of any reason why the code I change in 681857 would be using a new ImageUpload object to update an existing object??17:41
clarkbAJaeger: lgtm thanks!17:42
*** priteau has quit IRC17:42
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-python-sdist-output: introduce zuul_use_fetch_output  https://review.opendev.org/68185917:45
*** markvoelker has quit IRC17:45
*** markvoelker has joined #openstack-infra17:46
fungion an unrelated topic, clarkb as infra ptl you might be interested in at least skimming https://review.opendev.org/68126017:46
AJaegerthanks, clarkb.17:46
Shrewscorvus: i think this may have been an accidental carryover from before we used proper objects to represent the data, just looking at the git history of that code  :/17:46
Shrewsthis also does not appear to be #zuul   :)17:47
AJaegerdo we want to scale FN slowly up? https://review.opendev.org/681845 adds 10 nodes...17:47
logan-sean-k-mooney: yep makes sense17:53
AJaegerconfig-core, could you review https://review.opendev.org/681785 https://review.opendev.org/681276 https://review.opendev.org/#/c/681259/ https://review.opendev.org/681361 https://review.opendev.org/680901 , please?17:53
openstackgerritNoorul Islam K M proposed zuul/zuul master: Fixed pull request URL and canMerge interface  https://review.opendev.org/68186017:55
clarkbfungi: will do thanks17:56
sean-k-mooneyAJaeger: it seams to be working ok17:56
sean-k-mooneyi have a few jobs running on the sepcial flavors in the other pool17:57
sean-k-mooneythose are running well17:57
*** pcaruana has joined #openstack-infra17:58
*** armax has joined #openstack-infra17:59
corvusShrews: i can not -- if you've looked at git history and don't see anything then lgtm17:59
AJaegerfrickler: is https://review.opendev.org/678573 now good?18:00
*** pkopec has quit IRC18:03
*** pkopec has joined #openstack-infra18:03
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-python-sdist-output: introduce zuul_use_fetch_output  https://review.opendev.org/68185918:04
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-tox-output: introduce zuul_use_fetch_output  https://review.opendev.org/68186418:04
*** jamesmcarthur has joined #openstack-infra18:07
clarkbcorvus: zuul is happy with https://review.opendev.org/#/c/681843/ now (it tripped over linting on previous ps)18:08
*** dolpher has quit IRC18:13
*** hamzy_ has quit IRC18:16
*** hamzy_ has joined #openstack-infra18:17
*** iurygregory has quit IRC18:19
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-python-sdist-output: introduce zuul_use_fetch_output  https://review.opendev.org/68185918:25
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-tox-output: introduce zuul_use_fetch_output  https://review.opendev.org/68186418:25
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-sphinx-tarball: introduce zuul_use_fetch_output  https://review.opendev.org/68187018:25
*** pkopec has quit IRC18:27
*** njohnston is now known as njohnston|lunch18:28
prometheanfireare we having problems with the opensuse-15.1 builds?18:36
fungiprometheanfire: switch to opensuse-1518:38
fungii believe the opensuse-15.0 and opensuse-15.1 nodes are deprecated in favor of generic opensuse-15 since a few weeks18:38
prometheanfireok, I'll try that18:38
prometheanfiredocs should be updated :D https://github.com/openstack/diskimage-builder/tree/master/diskimage_builder/elements/opensuse18:39
*** mriedem is now known as mriedem_afk18:39
AJaegernote that opensuse-15 *is* 15.1 currently, so if you have problems with 15.1 builds, that won't help. Best ask dirk, cmurphy, or evrardjp for help18:39
*** jamesmcarthur has quit IRC18:40
*** jamesmcarthur has joined #openstack-infra18:40
prometheanfireah, ya, DIB_RELEASE set to 15 doesn't fetch the image even18:40
* prometheanfire is trying to just get a generic non-cloud qcow2 image18:41
*** armstrong has quit IRC18:44
openstackgerritMerged zuul/zuul-jobs master: Log swift upload tracebacks with ansible  https://review.opendev.org/68184318:49
*** jamesmcarthur has quit IRC18:50
*** kjackal has quit IRC18:51
*** kjackal_v2 has joined #openstack-infra18:51
*** jamesmcarthur has joined #openstack-infra18:52
*** pkopec has joined #openstack-infra18:53
*** jamesmcarthur has quit IRC18:58
*** jamesmcarthur has joined #openstack-infra19:03
redrobotHi infra friends!19:03
*** armax has quit IRC19:03
*** jcoufal has joined #openstack-infra19:04
redrobotI'm trying to debug the Fedora gate failure in Barbican19:04
*** dciabrin has joined #openstack-infra19:06
*** armax has joined #openstack-infra19:06
redrobotIt was driving me crazy because the changes I'm making to the playbook didn't seem to get picked up19:06
redrobotand I finally narrowed it down to this ansible task that is running the stable/stein branch for some reason: https://zuul.opendev.org/t/openstack/build/ab3577be51234895a664bd283bfc10e2/log/job-output.txt#237219:07
*** dciabrin_ has quit IRC19:07
clarkbredrobot: if you look in the zuul inventory file there is a job inheritance stack19:08
clarkbthat tends to be useful to see why these things happen19:08
*** lucasagomes has quit IRC19:09
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-tox-output: introduce zuul_use_fetch_output  https://review.opendev.org/68186419:09
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-sphinx-tarball: introduce zuul_use_fetch_output  https://review.opendev.org/68187019:09
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-subunit-output: introduce zuul_use_fetch_output  https://review.opendev.org/68188219:09
clarkbits due to your branch matcher on barbican-devstack-functional-base19:12
clarkbyou shouldnt use a branch matcher there and instead rely on the implicit match for the current branch19:13
clarkbredrobot: ^19:13
redrobotclarkb, hmmm... I'm not sure I understand how that works... You're talking about this branch matcher, right? https://opendev.org/openstack/barbican/src/branch/master/.zuul.yaml#L419:15
redrobotI'm looking at the _inhertance_path: list in the inventory file you pointed out19:16
redrobotbut I'm not sure how to interpret it?19:16
*** JorgeFranco has joined #openstack-infra19:17
clarkbyes that says 'this config applies to these branches but since the stein branch has that too the stein config applies to master too19:17
clarkbby default without a branch matcher the config in the current branch applies to the current branch only19:17
*** ykarel|away has quit IRC19:18
donnydsean-k-mooney: Are your jobs running in FN ok?19:18
clarkbthat line should likely just be deleted in all of your branches and if ocata needa something special change the config on ocata19:19
redrobotclarkb, ack.  I think I understand now...19:19
sean-k-mooneydonnyd: yes19:20
sean-k-mooneyi see you put them back in a seperate pool too19:20
sean-k-mooneyim not sure that was required but it does make running them faster19:20
donnydwell there is no other load19:20
donnydWe did that so we could re-enable the NUMA jobs without the general pool being included19:21
donnydhow many have you ran?19:21
* redrobot sits down in the corner to read https://docs.openstack.org/infra/manual/zuulv3.html#job-inheritance19:21
donnydlooks like 8 are running right now, but not sure if that is all you19:21
donnydclarkb: do you think its safe to re-enable a small step in FN19:22
donnydand also we should probably revert the change that remove the ipv6 record19:22
sean-k-mooneyits al me19:23
sean-k-mooney1 2 node job finsihes and i have 4 running19:23
sean-k-mooneyall without the connection issue we were seeing before19:23
yoctozeptoseems ubuntu got hit by "ipv6 public promised but not present" too: https://storage.gra1.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_e3d/681573/12/check/kolla-ansible-ubuntu-source-ipv6/e3d170f/zuul-info/19:24
yoctozeptodifferent cloud now - ovh than rack19:24
clarkbyoctozepto: did that job run on ovh?19:24
yoctozeptoseems so19:25
yoctozeptoclarkb19:25
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-tox-output: introduce zuul_use_fetch_output  https://review.opendev.org/68186419:25
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-sphinx-tarball: introduce zuul_use_fetch_output  https://review.opendev.org/68187019:25
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-subunit-output: introduce zuul_use_fetch_output  https://review.opendev.org/68188219:25
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-translation-output: introduce zuul_use_fetch_output  https://review.opendev.org/68188719:25
clarkbya that happens because ovh neutron api says 'here is your ipv6 address' but the instancehas no info of that address19:25
clarkbthere is no router advertisements and the info is not in config drive19:25
clarkbwe've had a long standing ask from them to expose that info to the instances19:25
yoctozeptoah, ok19:25
yoctozeptothanks for quick response19:26
yoctozeptothen we are bound to that vxlan thingy19:26
openstackgerritBob Fournier proposed openstack/diskimage-builder master: Use x86 architeture specific grub2 packages for RHEL  https://review.opendev.org/68188919:26
yoctozeptoeh, switching course then19:26
clarkbyes since inap at least doesnt have ipv6 nor does ovh19:26
yoctozeptooh, we have inap as provider?19:27
yoctozeptonever seen19:27
clarkbits there but currently not launching instances19:27
mgagneyoctozepto: it's been disabled for a couple months now. =(19:27
clarkbwe need to syncup with mgagne on turning it back on19:27
yoctozeptomgoddard: vxlan is a must - but as promised the job works as soon as the underly does :-)19:28
yoctozeptoclarkb, mgagne: roger that19:28
yoctozeptomgoddard: so it seems it's vxlan either way19:29
clarkbmgagne: any more info on that? even if it will disabled for a while longer?19:29
yoctozeptoclarkb: one last question before I go to sleep - where to look at that zuul role you made for vxlan (for comparison sake)19:29
donnydhow do i update the serial for the opendev zone? what is used to generate that?19:29
mgagneclarkb: we are awaiting a network maintenance which takes longer than expected.19:29
*** factor has quit IRC19:30
*** factor has joined #openstack-infra19:30
clarkbyoctozepto: https://opendev.org/zuul/zuul-jobs/src/branch/master/roles the ones with multi-node prefix19:30
clarkbdonnyd: current unix epoch timestamp19:30
clarkbdonnyd: I use date +%s output19:30
mgagneclarkb: we need to change some network topology and it's unfortunately not a high priority atm. I poked my coworker about it. Lets see where it goes...19:31
clarkbmgagne: thank you for the update19:31
yoctozeptoclarkb: thanks, multi-node-bridge looks like the one19:31
slaweqclarkb: hi, can You take a look at https://review.opendev.org/681893 - I hope it will do what You talked with fungi earlier19:32
openstackgerritDonny Davis proposed opendev/zone-opendev.org master: revert "Temporarily remove AAAA RR for fortnebula mirror"  https://review.opendev.org/68189519:32
clarkbslaweq: that looks good. to confirm it works check the logs of that job against that change for the -c arguments to pip install in tox19:34
clarkbit should be a local file path not an https url19:34
slaweqclarkb: sure, I will check that tomorrow morning :)19:34
*** jcoufal has quit IRC19:36
donnydTo re-enable FN I am thinking we should do https://review.opendev.org/681895 first and then https://review.opendev.org/#/c/681845/19:36
clarkbI cant double check the IP currently but dns change lgtm otherwise19:37
clarkbfungi: ^ can probably verify?19:38
clarkboh actually can you set the ttl to 3600 like with the A record?19:38
clarkbdonnyd: ^19:38
donnydhttps://www.irccloud.com/pastebin/C2BvHcCS/19:39
donnydyea np19:39
openstackgerritDonny Davis proposed opendev/zone-opendev.org master: revert "Temporarily remove AAAA RR for fortnebula mirror"  https://review.opendev.org/68189519:40
fungican do, just a sec19:41
*** jamesmcarthur has quit IRC19:43
*** jamesmcarthur has joined #openstack-infra19:44
fungiclarkb: donnyd: i've double-checked that locally on the server and via the nova api19:46
fungi`ssh 2001:470:e045:2:f816:3eff:fee6:691d hostname -f` also returns "mirror01.regionone.fortnebula.opendev.org" for me19:47
clarkbI think you can approve it then19:47
fungidone19:49
*** jamesmcarthur has quit IRC19:49
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-tox-output: introduce zuul_use_fetch_output  https://review.opendev.org/68186419:51
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-sphinx-tarball: introduce zuul_use_fetch_output  https://review.opendev.org/68187019:51
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-subunit-output: introduce zuul_use_fetch_output  https://review.opendev.org/68188219:51
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-translation-output: introduce zuul_use_fetch_output  https://review.opendev.org/68188719:51
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-javascript-content-tarball: introduce zuul_use_fetch_output  https://review.opendev.org/68190319:51
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-coverage-output: introduce zuul_use_fetch_output  https://review.opendev.org/68190419:52
openstackgerritTristan Cacqueray proposed zuul/zuul-jobs master: fetch-sphinx-output: introduce zuul_use_fetch_output  https://review.opendev.org/68190519:52
*** goldyfruit_ has quit IRC19:52
*** goldyfruit has joined #openstack-infra19:52
*** pkopec has quit IRC19:53
openstackgerritMerged opendev/zone-opendev.org master: revert "Temporarily remove AAAA RR for fortnebula mirror"  https://review.opendev.org/68189519:53
*** icarusfactor has joined #openstack-infra19:53
*** factor has quit IRC19:55
donnydfungi: can we move forward on scaling FN back up to 10 test nodes19:55
donnydhttps://review.opendev.org/#/c/681845/19:56
*** jbadiapa has quit IRC19:57
openstackgerritJeremy Stanley proposed opendev/git-review master: Support spaces in topic  https://review.opendev.org/68190619:57
*** jtomasek has quit IRC19:58
fungidonnyd: yep, approved just now19:59
donnydthanks fungi19:59
*** njohnston|lunch is now known as njohnston19:59
donnydI will watch the edge and see if we get the same retries. If so I will kill it19:59
fungithanks!20:01
*** e0ne has quit IRC20:03
*** jamesmcarthur has joined #openstack-infra20:04
openstackgerritMerged openstack/project-config master: Slowly scale FN back up  https://review.opendev.org/68184520:08
pabelangerslowly! :)20:09
*** dolpher has joined #openstack-infra20:09
redrobotclarkb, ok, so I just want to make sure I'm understanding the config correctly.  Barbican is an "unstrusted-project" which is why Zuul is looking at the config in all branches (including stable/stein) for a change to master?20:10
redrobotas in https://zuul-ci.org/docs/zuul/user/config.html#configuration-loading20:10
*** markvoelker has quit IRC20:11
clarkbits looking at all branchea because the branch matcher tells it to20:11
*** markvoelker has joined #openstack-infra20:11
redrobothmm... maybe I'm looking into this too much.... my understanding from reading the above is that zuul is looking at the config in all branches, but because we define that matcher in all branches then all branches apply to master?20:12
rm_workoh hey, question -- with the new zuul log display stuff, the old way of colorizing/filtering/linkifying the logs doesn't work?20:12
clarkball branches with that matcher apply to all branaches the matcher matches20:12
rm_workis there a way to make that happen again or is that just dead until more work is done on the zuul side?20:12
clarkbredrobot: if you delete the branch matcher then only the config in that branch applies to that branch20:13
clarkbbut you need to do that for all brnches20:13
fungizuul checks all branches of the project for configuration, and when it finds branch explicit matchers (on any branch of the project) is incorporates them. if it doesn't find explicit branch matchers it uses implicit branch matching based on what branch it found the config on20:13
clarkbrm_work: it should work if you use the zuul rendered logs20:13
donnydLoad is low, but there is traffic20:13
redrobotclarkb, ok, I think I'm starting to grok this.  Thanks for the help! I really appreciate it. :)20:13
donnydhttps://usercontent.irccloud-cdn.com/file/7vLodPmg/image.png20:13
rm_workhmmmmm20:13
donnydfungi: ^^^^20:13
clarkbdont load the raw version20:13
donnydthat looks mo betta20:13
rm_workwhich page is the zuul rendered one20:13
fungirm_work: give us an example20:14
fungidonnyd: that's a lot of zero20:14
donnydyea, that should be what it looks like20:14
fungigood!20:14
fungiyep20:15
donnydI will keep watching as load increases... but before every single flow had some issue20:15
clarkbrm_work: the zuul.openetack.org/build/uuid/path url20:15
clarkbthe line numbers are clickable and you can filter severity there20:16
rm_worksorry yeah doing like 5 things, sec, looking20:16
*** xenos76 has quit IRC20:17
*** xek has quit IRC20:21
fungii think it's confused at least a few people that there's more than one way to pull up log files20:21
rm_workhttps://371dc00fd45f0dbfe46c-53453ebbb69cd33a875a83010b4c2a5c.ssl.cf2.rackcdn.com/681144/9/check/octavia-v2-dsvm-py2-scenario-centos-7/b4586ac/controller/logs/screen-n-api.txt.gz20:23
rm_workfor example20:23
clarkbya thats the raw url, if toy open the url returned to gerrit you get thosefeatures back20:23
clarkbfungi can probably type better than me20:23
rm_workerrr20:24
rm_workwhere's that?20:24
rm_workhow can I get to that file via the other way?20:24
fungiwhat's the initial link gerrit gave you?20:24
rm_workto get to this I click the Logs tab and navigate to the logs?20:24
rm_workso I go here: https://zuul.opendev.org/t/openstack/build/dbc0ecfbc95d40769c536392f05400dc/logs20:24
rm_workand click "controller" -> "logs" -> "screen-o-api.txt.gz"20:25
rm_workI wasn't aware of another way to get to those log files20:25
fungiwhen i do that it takes me to https://zuul.opendev.org/t/openstack/build/dbc0ecfbc95d40769c536392f05400dc/log/controller/logs/screen-n-api.txt.gz20:26
fungiare you clicking on the little linkaway icon to the right of the filename, or on the filename?20:26
fungithe former will go to the url you pasted, the latter to the url i pasted20:26
rm_workuhhh20:27
rm_workclicking on the filename?20:27
rm_workahh you have to click the little *arrows* next to the directory names20:27
rm_workif you click on "controller" it takes you out20:27
rm_workwtf20:27
fungioh, yeah, to expand the contents20:27
rm_workthis is not intuitive20:28
fungii concur, it could probably be improved on20:28
rm_workI wasn't even aware I could click on those, or that it would do something different, and it is freaking tiny20:28
rm_workT_T20:28
rm_workbut thanks, at least now that I know, I can get there now20:29
jrosseri find those controls very difficult on a mobile device20:29
fungimaybe the directory names should link to the same thing the > does and then we should add little linkaway icons next to the directory names to go to the file indices like we do to go to the raw log urls20:29
rm_workyesplz20:29
rm_workthere's already a linkaway icon, it just isn't clickable20:29
*** lpetrut has joined #openstack-infra20:30
fungioh, indeed, we now link the word "raw"20:30
rm_worki bet you like 3 beers that more than 50% of users believe you've "re-added the feature", not just made it more accessible, lol20:30
fungiso maybe we add that in parentheses20:30
fungirm_work: you are probably correct20:30
*** kjackal_v2 has quit IRC20:31
rm_workyeah that's funny, on FILES it shows the (raw) thing20:31
fungii'm going re-ask in #zuul since we're using the upstream feature as it ships, and see if this is something folks think is easy to change20:31
rm_workbut on directories it doesn't do anything and it just looks like you'd click on the dirname20:31
rm_workalso on files I feel like the linkaway image should be part of the hyperlink tag20:32
rm_work¯\_(ツ)_/¯20:32
*** pcaruana has quit IRC20:32
rm_workI guess I could always put up a patch, lol20:32
fungiyeah, i mean, i *would* put up a patch but my javascript knowledge is just a hair past nonexistent so...20:33
rm_workyeah i only know http://vanilla-js.com/20:35
*** prometheanfire has quit IRC20:36
*** prometheanfire has joined #openstack-infra20:36
*** e0ne has joined #openstack-infra20:37
*** markvoelker has quit IRC20:37
*** trident has quit IRC20:39
*** markvoelker has joined #openstack-infra20:39
*** mriedem_afk is now known as mriedem20:39
*** Goneri has quit IRC20:42
fungii learned a little javascript back in the mid-1990s to up my webmaster game and do fancy things with hyperlinks20:42
fungii think that's about where it ended for me20:42
fungibeen meaning to get up to speed on what's happened with it since the old netscape navigator days20:43
*** lpetrut has quit IRC20:46
*** hamzy_ has quit IRC20:49
*** trident has joined #openstack-infra20:51
openstackgerritDonny Davis proposed openstack/project-config master: Retransmit count  with IPv6 is still unacceptable on FN  https://review.opendev.org/68192820:54
donnydfungi: yea so its still doing it.. .doesn't seem to be effecting the jobs at the current scale, but this needs to get fixed properly20:55
donnydI will rework the ipv6 sid e of the network and probably cut the edge rtr out of path for the mirror20:57
fungioh well20:58
donnydright now it goes from tenant - > edge -> other tenant for mirror traffic20:58
donnydi will get it sorted out later tonight20:58
fungino rush, thanks for working on it!20:59
* donnyd frustrated with ipv6 - switching to overwhelming kinetic force mode21:00
*** slaweq has quit IRC21:00
*** diablo_rojo has quit IRC21:01
fungipercussive maintenance is often surprisingly effective21:08
openstackgerritMerged openstack/project-config master: Retransmit count  with IPv6 is still unacceptable on FN  https://review.opendev.org/68192821:21
*** ramishra has quit IRC21:23
*** dustinc has joined #openstack-infra21:23
*** e0ne has quit IRC21:32
*** markvoelker has quit IRC21:32
openstackgerritJames E. Blair proposed zuul/zuul master: WIP: Support HTTP-only Gerrit  https://review.opendev.org/68193621:34
*** bdodd has quit IRC21:34
*** hamzy_ has joined #openstack-infra21:37
openstackgerritJeremy Stanley proposed opendev/git-review master: Support spaces in topic  https://review.opendev.org/68190621:38
*** whoami-rajat has quit IRC21:40
*** exsdev0 has joined #openstack-infra21:45
*** exsdev has quit IRC21:45
*** exsdev0 is now known as exsdev21:45
*** markvoelker has joined #openstack-infra21:50
*** efried has quit IRC21:50
*** jcoufal has joined #openstack-infra21:51
*** xenos76 has joined #openstack-infra21:59
*** weshay is now known as weshay|ruck22:04
*** jcoufal has quit IRC22:06
*** eharney has quit IRC22:19
*** rcernin has joined #openstack-infra22:19
*** armax has quit IRC22:24
*** markvoelker has quit IRC22:32
*** whoami-rajat has joined #openstack-infra22:37
*** rf0lc0 has joined #openstack-infra22:40
*** diablo_rojo has joined #openstack-infra22:41
*** rfolco has quit IRC22:41
*** slaweq has joined #openstack-infra22:42
*** markvoelker has joined #openstack-infra22:46
*** slaweq has quit IRC22:48
*** diablo_rojo has quit IRC22:51
*** markvoelker has quit IRC22:51
*** mattw4 has quit IRC22:52
*** aaronsheffield has quit IRC22:57
fungipaladox: i don't suppose you know how to encode spaces in gerrit push options, for example to set a change topic with spaces in it? the webui lets me set a topic with spaces but the docs are unclear on actually setting any values with spaces through git push23:01
paladoxwould enclosing it with quotes work?23:02
fungidoesn't seem to, from what i can tell23:02
paladoxhmm23:02
fungii tried both " and '23:02
fungialso took a shot in the dark with substituting %20 for space, but that also didn't do it23:03
*** tkajinam has joined #openstack-infra23:03
*** mattw4 has joined #openstack-infra23:03
*** diablo_rojo has joined #openstack-infra23:04
fungiit's come up because a user unwittingly tried to set a (quoted) topic string with git review -t and at first i was like, "nah gerrit doesn't allow spaces in change topics" but i went looking in the docs and they didn't say, so i tried it via the webui and was surprised to discover it worked23:05
paladoxfungi yeh, i remember fixing this in polygerrit's ui23:06
fungibut if it has a way to interpolate spaces in a topic from git push, i can't for the life of me figure out what it is23:06
paladoxfungi i guess we could call this a *bug*23:07
paladoxsince the docs doin't show me an example of a topic on the command line containing a space.23:07
paladoxhttps://github.com/GerritCodeReview/gerrit/commit/efcce2f7ee6f5e50f54fd6371e27d661c392135d is the change i did :)23:08
*** mriedem is now known as mriedem_afk23:08
fungihuh, neat23:08
corvusfungi, paladox: under ssh, sometimes quoting is accomplished with "{}"  eg:  "ssh review gerrit query message:{commit message with spaces}"23:20
corvusvariations on that might be worth a try in case that applies here23:21
paladoxwould that work with git push (topic=)?23:21
corvusno idea, i'm hoping fungi will try and tell us :)23:21
paladoxsomething like git push origin HEAD:refs/for/master%topic={multi space master}23:21
paladoxheh :)23:21
fungimebbe, worth a try for sure. thanks corvus!23:22
fungigoing to find out in a sec23:22
*** jamesmcarthur has quit IRC23:26
fungierror: src refspec spaces} does not match any23:26
funginope, probably not23:27
fungialso worth noting, spaces in a topic (added via the webui) break `git review -d ...` with an error like "fatal: 'review/jeremy_stanley/topic spaces' is not a valid branch name."23:28
fungiso likely we need robustness in both directions with it23:29
fungiat this stage, unless folks have other ideas on how we might solve it, i think postel's law should be applied23:31
*** dchen has joined #openstack-infra23:31
fungiadd a clear error when a user tries to pass spaces in a topic string, and map them to something else when trying to create local git branches from found gerrit topics23:32
paladoxheh23:32
*** icarusfactor has quit IRC23:32
*** icarusfactor has joined #openstack-infra23:33
fungiprobably remap anything we expect git won't handle in a branch string... there's probably a clear specification for those23:33
*** igordc has quit IRC23:34
*** armax has joined #openstack-infra23:35
*** factor has joined #openstack-infra23:37
*** icarusfactor has quit IRC23:37
* paladox is excited, we've got a new server to replace our existing gerrit server.23:40
paladoxwe've outgrown the server too, so this upgrade is well needed!23:40
*** rlandy has quit IRC23:41
clarkbok I'm finally back to a computer after a long afternoon out. Is there anything I can help with? anything with eg FN?23:42
fungiclarkb: we turned fn back down while donnyd ragebuilds a soft router to shuttle traffic between the nodes and the bridge so they don't trombone in and out of his border router23:44
clarkbI guess I should check if my swift logging change has caught anything yet23:44
*** JorgeFranco has quit IRC23:46
donnydfungi: rage builds... I about fell out of my chair23:46
donnydLOL23:46
donnydThere have been a steady stream of jobs so I havent tinkered with it yet23:47
donnydhttp://grafana.openstack.org/d/3Bwpi5SZk/nodepool-fortnebula?orgId=123:47
clarkbI told my kids that servers are computers that talk to other computers. They now think that servers literally speak vocally to other computers23:47
clarkball that to say we should be nice to the servers and routers :)23:47
donnydhttps://usercontent.irccloud-cdn.com/file/TUdxgDFC/image.png23:48
donnydfungi:23:48
paladoxlol23:48
donnydthis is no good23:48
fungiclarkb: postel's law is all about polite network communication, so they might be closer to the truth than you realize23:52
clarkbI've not caught any new swift errors23:52
fungidonnyd: indeed, that's rather a big number23:52
clarkbfungi: I almost pulled out a photo of putting a phone receiver in the modem then realized they don't know that that is a phone either23:52
fungii mean, i've seen some big numbers in my time, but that one's up there23:53
fungiclarkb: funny-sad23:53
clarkbfungi: the best part to me is how similar that stuff is to what we do today. Layer 1 has changed but the things above aren't all that different23:57
paladoxclarkb bt use to have that type of modem/router.23:59
fungiindeed23:59
paladoxthat's long been dead :)23:59
paladox(discontinued in the bt home hub 3 (when bt infinity was launched)23:59
fungi*i* used to have an acoustic coupler modem23:59

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!