Monday, 2018-12-10

*** cgoncalves has quit IRC00:00
*** pbourke_ has quit IRC00:13
*** pbourke_ has joined #openstack-infra00:13
*** hwoarang has quit IRC00:17
openstackgerritMerged openstack-infra/git-review master: tests/__init__.py: ssh-keygen -m PEM for bouncycastle  https://review.openstack.org/62263600:17
*** hwoarang has joined #openstack-infra00:19
*** dave-mccowan has quit IRC00:23
openstackgerritIan Wienand proposed openstack-infra/system-config master: Fix enumerated list in gerrit.rst  https://review.openstack.org/55075700:24
*** jamesmcarthur has joined #openstack-infra00:42
*** jamesmcarthur has quit IRC00:47
*** dave-mccowan has joined #openstack-infra00:57
openstackgerritMerged openstack-infra/system-config master: Fix spelling mistakes and reST typos in the doc  https://review.openstack.org/62366801:04
openstackgerritMerged openstack-infra/system-config master: Fix dead link  https://review.openstack.org/54693001:04
openstackgerritIan Wienand proposed openstack-infra/system-config master: Trivial: Update pypi url to new url  https://review.openstack.org/56341501:20
*** markvoelker has quit IRC01:21
*** markvoelker has joined #openstack-infra01:22
*** markvoelker has quit IRC01:26
openstackgerritIan Wienand proposed openstack-infra/system-config master: fix somes typos in doc file.  https://review.openstack.org/54274001:27
*** jamesmcarthur has joined #openstack-infra01:37
*** jamesmcarthur has quit IRC01:40
*** jamesmcarthur has joined #openstack-infra01:41
*** hongbin has quit IRC01:45
openstackgerritMerged openstack-infra/system-config master: Fix enumerated list in gerrit.rst  https://review.openstack.org/55075701:47
openstackgerritIan Wienand proposed openstack-infra/system-config master: Ectomy some Jenkins out of the docs  https://review.openstack.org/43645201:49
*** yamamoto has quit IRC01:50
*** yamamoto has joined #openstack-infra01:53
*** yamamoto has quit IRC01:53
*** yamamoto has joined #openstack-infra01:56
*** yamamoto has quit IRC01:56
*** yamamoto has joined #openstack-infra01:57
*** wolverineav has joined #openstack-infra02:02
openstackgerritMerged openstack-infra/system-config master: fix somes typos in doc file.  https://review.openstack.org/54274002:04
*** yamamoto has quit IRC02:05
openstackgerritMerged openstack-infra/system-config master: Trivial: Update pypi url to new url  https://review.openstack.org/56341502:09
*** mrsoul has quit IRC02:11
*** lbragstad has quit IRC02:13
*** jamesmcarthur has quit IRC02:15
*** jamesmcarthur has joined #openstack-infra02:16
*** lbragstad has joined #openstack-infra02:16
*** bobh has joined #openstack-infra02:18
*** agopi-pto has joined #openstack-infra02:28
*** agopi-pto is now known as agopi02:29
openstackgerrit98k proposed openstack-dev/hacking master: Don't quote {posargs} in tox.ini  https://review.openstack.org/60919202:37
*** yamamoto has joined #openstack-infra02:39
*** yamamoto has quit IRC02:46
*** bhavikdbavishi has joined #openstack-infra02:47
openstackgerritIan Wienand proposed openstack-infra/system-config master: [DNM] kafs testing  https://review.openstack.org/62397402:48
*** yamamoto has joined #openstack-infra02:50
openstackgerritIan Wienand proposed openstack-infra/system-config master: [DNM] kafs testing  https://review.openstack.org/62397402:50
*** wolverineav has quit IRC02:55
*** wolverineav has joined #openstack-infra02:56
*** psachin has joined #openstack-infra02:59
openstackgerritIan Wienand proposed openstack-infra/system-config master: [DNM] kafs testing  https://review.openstack.org/62397403:00
*** hongbin has joined #openstack-infra03:05
*** wolverineav has quit IRC03:13
*** jamesmcarthur has quit IRC03:14
*** bobh has quit IRC03:31
openstackgerritIan Wienand proposed openstack-infra/system-config master: [DNM] kafs testing  https://review.openstack.org/62397403:33
*** jamesmcarthur has joined #openstack-infra03:34
*** lbragstad has quit IRC03:38
*** jamesmcarthur has quit IRC03:39
*** udesale has joined #openstack-infra03:40
*** bobh has joined #openstack-infra03:42
*** wolverineav has joined #openstack-infra03:44
openstackgerritIan Wienand proposed openstack-infra/system-config master: [DNM] kafs testing  https://review.openstack.org/62397403:44
*** jamesmcarthur has joined #openstack-infra03:45
*** jamesmcarthur has quit IRC03:48
*** jamesmcarthur has joined #openstack-infra03:48
ianwhrm, the gate is not busy but my job has been waiting 10+ minutes for a fedora node03:55
openstackgerritIan Wienand proposed openstack-infra/zuul master: Rework zuul nodepool stats reporting  https://review.openstack.org/62028503:57
openstackgerritIan Wienand proposed openstack-infra/zuul master: Add a statsd check for clashing keys  https://review.openstack.org/62043603:57
*** hongbin has quit IRC04:00
ianw2018-12-10 03:45:11,006 DEBUG zuul.Pipeline.openstack.check: Adding node request <NodeRequest 300-0000774611 <NodeSet [<Node None ('base',):fedora-28>]>>04:04
ianwit's now Mon Dec 10 04:05:02 UTC 201804:05
ianw2018-12-10 04:05:24,101 ERROR nodepool.NodeLauncher-0001088594: Request 300-0000774611: Launch attempt 2/3 failed for node 0001088594:04:05
ianw2018-12-10 03:45:14,583 DEBUG nodepool.driver.NodeRequestHandler[nl01-26872-PoolWorker.rax-iad-main]: Locked building node 0001088594 for request 300-000077461104:09
openstackgerritIan Wienand proposed openstack-infra/system-config master: [DNM] kafs testing  https://review.openstack.org/62397404:11
ianw... let's see if this goes to a different provider and works ... either something just broke with fedora or we have some sort of fedora rax issue :/04:12
ianwseeing as the images are about 1/2 hour old, i'm not confident04:15
*** wolverineav has quit IRC04:19
*** wolverineav has joined #openstack-infra04:19
ianw8c99c67791054cc697b2edc3b06377aa was limestone and worked04:20
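The wait ianw describes can be measured directly from the timestamps on the two log lines quoted above. A minimal sketch, assuming the `YYYY-MM-DD HH:MM:SS,mmm` prefix shown in those lines; the helper name `log_time` is hypothetical and the log lines are abbreviated:

```python
from datetime import datetime

# Timestamp layout used by the zuul and nodepool log lines quoted in the channel.
LOG_TIME_FORMAT = "%Y-%m-%d %H:%M:%S,%f"


def log_time(line):
    """Parse the leading 23-character timestamp of a log line (hypothetical helper)."""
    return datetime.strptime(line[:23], LOG_TIME_FORMAT)


# Abbreviated copies of the two lines pasted in the channel.
requested = "2018-12-10 03:45:11,006 DEBUG zuul.Pipeline.openstack.check: Adding node request"
failed = "2018-12-10 04:05:24,101 ERROR nodepool.NodeLauncher-0001088594: Launch attempt 2/3 failed"

# Time between the node request being added and the second failed launch attempt.
waited = log_time(failed) - log_time(requested)
print(waited)
```

The difference comes out at a little over 20 minutes, which fits the "10+ minutes" reported earlier.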
openstackgerritIan Wienand proposed openstack-infra/system-config master: [DNM] kafs testing  https://review.openstack.org/62397404:24
*** jamesmcarthur has quit IRC04:26
*** bobh has quit IRC04:33
*** ramishra has joined #openstack-infra04:34
*** hwoarang has quit IRC04:34
*** hwoarang has joined #openstack-infra04:36
*** ykarel has joined #openstack-infra04:40
*** zzzeek has quit IRC04:41
*** zzzeek has joined #openstack-infra04:41
openstackgerritMerged openstack-infra/zuul master: Rework zuul nodepool stats reporting  https://review.openstack.org/62028504:42
openstackgerritMerged openstack-infra/zuul master: Add a statsd check for clashing keys  https://review.openstack.org/62043604:42
*** jamesmcarthur has joined #openstack-infra04:47
*** bobh has joined #openstack-infra04:47
*** dave-mccowan has quit IRC04:53
*** chkumar|off is now known as chandan_kumar04:54
*** ykarel_ has joined #openstack-infra04:59
*** bobh has quit IRC05:00
*** ykarel has quit IRC05:02
openstackgerritJea-Min, Lim proposed openstack-infra/project-config master: Add new project called ku.stella  https://review.openstack.org/62339605:11
openstackgerritJea-Min, Lim proposed openstack-infra/project-config master: Add new project called ku.stella  https://review.openstack.org/62339605:12
*** hrubi has joined #openstack-infra05:14
*** jamesmcarthur has quit IRC05:22
*** wolverineav has quit IRC05:26
openstackgerritJea-Min, Lim proposed openstack-infra/project-config master: Add new project called ku.stella  https://review.openstack.org/62339605:32
*** yamamoto has quit IRC05:36
*** jamesmcarthur has joined #openstack-infra05:39
*** yamamoto has joined #openstack-infra05:39
*** wolverineav has joined #openstack-infra05:43
*** jamesmcarthur has quit IRC05:44
*** yamamoto has quit IRC05:44
*** wolverineav has quit IRC05:46
*** jamesmcarthur has joined #openstack-infra05:49
*** jbadiapa has joined #openstack-infra06:08
LinkidHi06:19
LinkidThanks ianw for your quick reviews :)06:19
Linkid(and sorry for digging up some old commits ^^)06:21
Linkid(I procrastinated and found some typos all the week-end...)06:24
*** wolverineav has joined #openstack-infra06:24
*** _alastor_ has joined #openstack-infra06:29
*** ykarel_ has quit IRC06:30
ianwLinkid: np, we might as well just fix such things06:32
*** yamamoto has joined #openstack-infra06:37
openstackgerritIan Wienand proposed openstack-infra/system-config master: [DNM] kafs testing  https://review.openstack.org/62397406:39
*** yamamoto has quit IRC06:42
*** jamesmcarthur has quit IRC06:49
*** wolverineav has quit IRC07:00
openstackgerritneilsun proposed openstack-infra/zuul master: Add type check for zuul conf  https://review.openstack.org/59191707:04
*** jamesmcarthur has joined #openstack-infra07:06
*** jamesmcarthur has quit IRC07:11
*** yamamoto has joined #openstack-infra07:12
*** quiquell has joined #openstack-infra07:13
*** alexchadin has joined #openstack-infra07:16
*** aojea has joined #openstack-infra07:18
*** gfidente has joined #openstack-infra07:20
*** jamesmcarthur has joined #openstack-infra07:22
*** dpawlik has joined #openstack-infra07:22
*** rcernin has quit IRC07:23
*** apetrich has joined #openstack-infra07:24
*** udesale has quit IRC07:24
*** jamesmcarthur has quit IRC07:26
*** neilsun has joined #openstack-infra07:27
*** aojea has quit IRC07:31
*** jamesmcarthur has joined #openstack-infra07:38
*** pgaxatte has joined #openstack-infra07:41
*** jamesmcarthur has quit IRC07:42
*** slaweq has joined #openstack-infra07:47
*** ykarel_ has joined #openstack-infra07:48
*** ahosam has joined #openstack-infra07:49
*** quiquell is now known as quiquell|brb07:51
*** jamesmcarthur has joined #openstack-infra07:54
*** ccamacho has joined #openstack-infra07:55
*** jamesmcarthur has quit IRC07:58
*** aojea has joined #openstack-infra07:59
*** _alastor_ has quit IRC08:00
*** adriancz has joined #openstack-infra08:03
amorinhey fungi clarkb frickler and all08:05
amorinabout BHS1 issue08:05
amorinmy proposal is to08:05
amorinspawn 20 instances; i'll make sure that only one is running per host, then you'll be able to run some performance tests on each instance, and we can try to figure out which hypervisor is giving bad results.08:05
amorinwe have 20 hypervisors on the aggregate08:05
amorinI updated the etherpad with this proposal08:05
amorinhttps://etherpad.openstack.org/p/bhs1-test-node-slowness08:05
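Once one instance per hypervisor has been benchmarked, picking out the bad host is a simple outlier check. A minimal sketch, assuming the results are collected as a mapping from hypervisor name to measured throughput; the host names, numbers, function name and the 50%-of-median threshold are all made up for illustration:

```python
from statistics import median


def slow_hypervisors(throughput_mb_s, threshold=0.5):
    """Return hosts whose throughput is below threshold * median of all hosts."""
    mid = median(throughput_mb_s.values())
    return sorted(host for host, value in throughput_mb_s.items()
                  if value < threshold * mid)


# Hypothetical results: one test instance per hypervisor, throughput in MB/s.
results = {"hv01": 410.0, "hv02": 395.0, "hv03": 120.0, "hv04": 402.0}
print(slow_hypervisors(results))
```

Comparing against the median rather than the mean keeps a single very slow host from dragging the baseline down.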
*** quiquell|brb is now known as quiquell08:08
*** jamesmcarthur has joined #openstack-infra08:10
*** evrardjp_ is now known as evrardjp08:11
*** shardy has joined #openstack-infra08:14
*** ykarel_ is now known as ykarel08:16
*** wolverineav has joined #openstack-infra08:16
*** ginopc has joined #openstack-infra08:17
*** ykarel is now known as ykarel|lunch08:17
*** xek has joined #openstack-infra08:18
*** wolverineav has quit IRC08:20
*** imacdonn has quit IRC08:22
*** imacdonn has joined #openstack-infra08:22
*** fresta_ has joined #openstack-infra08:30
*** jonher_ has joined #openstack-infra08:30
*** jonher has quit IRC08:31
*** jonher_ is now known as jonher08:31
*** fresta has quit IRC08:31
*** bhavikdbavishi has quit IRC08:34
*** yboaron has quit IRC08:36
*** ykarel|lunch is now known as ykarel08:42
*** jamesmcarthur has quit IRC08:43
*** tosky has joined #openstack-infra08:45
*** yamamoto has quit IRC08:47
*** udesale has joined #openstack-infra08:49
*** bhavikdbavishi has joined #openstack-infra08:51
*** jpich has joined #openstack-infra08:51
*** jamesmcarthur has joined #openstack-infra08:55
*** d0ugal has quit IRC08:55
*** d0ugal has joined #openstack-infra08:56
*** jtomasek has joined #openstack-infra08:57
*** jamesmcarthur has quit IRC09:00
*** jpena|off is now known as jpena09:01
*** jamesmcarthur has joined #openstack-infra09:11
*** apetrich has quit IRC09:13
*** apetrich has joined #openstack-infra09:14
*** jamesmcarthur has quit IRC09:16
*** yamamoto has joined #openstack-infra09:17
*** rossella_s has joined #openstack-infra09:17
*** jpich has quit IRC09:18
*** apetrich has quit IRC09:19
*** eumel8 has joined #openstack-infra09:21
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: Implement zookeeper-auth  https://review.openstack.org/61915609:23
*** jamesmcarthur has joined #openstack-infra09:27
*** derekh has joined #openstack-infra09:31
*** jamesmcarthur has quit IRC09:32
*** apetrich has joined #openstack-infra09:32
*** bhavikdbavishi has quit IRC09:37
*** jpich has joined #openstack-infra09:42
*** jpich has quit IRC09:42
*** ahosam has quit IRC09:43
*** jamesmcarthur has joined #openstack-infra09:43
*** AJaeger has quit IRC09:45
*** jpich has joined #openstack-infra09:45
*** jamesmcarthur has quit IRC09:48
*** electrofelix has joined #openstack-infra09:49
*** yboaron has joined #openstack-infra09:49
*** AJaeger has joined #openstack-infra09:53
*** yamamoto has quit IRC09:58
*** jamesmcarthur has joined #openstack-infra09:59
*** e0ne has joined #openstack-infra10:03
*** jamesmcarthur has quit IRC10:03
*** adriancz has quit IRC10:04
*** adriancz has joined #openstack-infra10:04
*** yboaron_ has joined #openstack-infra10:04
*** yboaron has quit IRC10:07
*** jamesmcarthur has joined #openstack-infra10:15
*** udesale has quit IRC10:18
*** jamesmcarthur has quit IRC10:19
*** betherly has joined #openstack-infra10:22
*** cgoncalves has joined #openstack-infra10:27
*** priteau has joined #openstack-infra10:28
*** e0ne has quit IRC10:30
*** kaisers has left #openstack-infra10:30
*** jamesmcarthur has joined #openstack-infra10:31
*** yamamoto has joined #openstack-infra10:31
*** jamesmcarthur has quit IRC10:35
*** e0ne has joined #openstack-infra10:36
frickleramorin: not sure how you want us to proceed, would you like us to spawn 20 instances first, and then distribute them?10:37
*** yamamoto has quit IRC10:39
*** dpawlik has quit IRC10:39
*** cgoncalves has quit IRC10:40
*** yboaron_ has quit IRC10:41
*** yboaron_ has joined #openstack-infra10:41
*** dpawlik has joined #openstack-infra10:45
*** ldnunes has joined #openstack-infra10:45
*** jamesmcarthur has joined #openstack-infra10:46
*** jamesmcarthur has quit IRC10:51
*** yamamoto has joined #openstack-infra10:53
*** jamesmcarthur has joined #openstack-infra11:02
*** pbourke_ has quit IRC11:05
*** jamesmcarthur has quit IRC11:06
*** pbourke_ has joined #openstack-infra11:08
*** ahosam has joined #openstack-infra11:08
*** udesale has joined #openstack-infra11:11
*** larainema has quit IRC11:15
*** sshnaidm has quit IRC11:15
*** sshnaidm has joined #openstack-infra11:16
*** jamesmcarthur has joined #openstack-infra11:18
*** ahosam has quit IRC11:20
*** ahosam has joined #openstack-infra11:20
*** jamesmcarthur has quit IRC11:23
openstackgerritneilsun proposed openstack-infra/zuul master: Add type check for zuul conf  https://review.openstack.org/59191711:24
*** dtantsur|afk is now known as dtantsur11:25
*** kjackal has joined #openstack-infra11:25
amorinfrickler: yes11:29
*** jamesmcarthur has joined #openstack-infra11:31
*** dayou_ has quit IRC11:34
*** jtomasek has quit IRC11:34
*** jamesmcarthur has quit IRC11:35
*** jamesmcarthur has joined #openstack-infra11:43
*** dayou has joined #openstack-infra11:43
*** yamamoto has quit IRC11:47
*** yamamoto has joined #openstack-infra11:47
*** jamesmcarthur has quit IRC11:47
*** dayou has quit IRC11:51
*** quiquell is now known as quiquell|brb11:55
*** tpsilva has joined #openstack-infra11:55
*** jamesmcarthur has joined #openstack-infra11:55
*** bhavikdbavishi has joined #openstack-infra11:58
*** dayou has joined #openstack-infra11:58
*** e0ne has quit IRC11:58
*** jamesmcarthur has quit IRC12:00
*** e0ne has joined #openstack-infra12:00
*** e0ne has quit IRC12:00
*** yamamoto has quit IRC12:04
*** jamesmcarthur has joined #openstack-infra12:08
*** quiquell|brb is now known as quiquell12:10
*** jamesmcarthur has quit IRC12:12
*** yamamoto has joined #openstack-infra12:16
*** jtomasek has joined #openstack-infra12:17
*** larainema has joined #openstack-infra12:19
*** jamesmcarthur has joined #openstack-infra12:21
*** yamamoto has quit IRC12:22
*** dpawlik has quit IRC12:23
*** dpawlik has joined #openstack-infra12:23
*** jamesmcarthur has quit IRC12:26
*** bhavikdbavishi has quit IRC12:26
frickleramorin: o.k., I started 20 hosts, waiting 30 seconds in between them. one still failed in the middle with "There are not enough hosts available.", I left that one in place in case you want to further debug.12:31
*** kjackal has quit IRC12:32
frickleramorin: you can distribute the others, once finished I would start running the simple dd tests again and some devstack setup afterwards12:32
*** kjackal_v2 has joined #openstack-infra12:32
*** dayou has quit IRC12:32
*** jamesmcarthur has joined #openstack-infra12:33
*** dayou has joined #openstack-infra12:34
*** bhavikdbavishi has joined #openstack-infra12:34
*** kjackal_v2 has quit IRC12:36
*** kjackal has joined #openstack-infra12:36
*** jpena is now known as jpena|lunch12:37
*** jamesmcarthur has quit IRC12:37
*** jamesmcarthur has joined #openstack-infra12:45
*** tosky has quit IRC12:47
*** rh-jelabarre has joined #openstack-infra12:47
*** jistr is now known as jistr|medchk12:57
*** bobh has joined #openstack-infra12:58
*** jamesmcarthur has quit IRC12:58
*** rlandy has joined #openstack-infra12:58
*** weshay has joined #openstack-infra12:59
*** dayou has quit IRC13:01
*** ahosam has quit IRC13:02
*** psachin has quit IRC13:05
*** jamesmcarthur has joined #openstack-infra13:06
*** boden has joined #openstack-infra13:11
*** jtomasek_ has joined #openstack-infra13:14
*** jamesmcarthur has quit IRC13:15
*** jtomasek has quit IRC13:15
*** jamesmcarthur has joined #openstack-infra13:15
*** dave-mccowan has joined #openstack-infra13:17
*** dpawlik has quit IRC13:17
*** dayou has joined #openstack-infra13:18
*** jtomasek_ is now known as jtomasek13:19
*** dpawlik has joined #openstack-infra13:20
*** priteau has quit IRC13:23
*** dpawlik has quit IRC13:30
fungiinfra-root: per tonyb's message to openstack-discuss about release job failures, i'm stopping ze12 for a few minutes to troubleshoot why afsd isn't running on it13:32
fungii have a feeling this could be fallout from not rebooting after bootstrapping it13:32
*** dpawlik has joined #openstack-infra13:32
fungiyeah, confirmed, the openafs lkm isn't loaded either13:33
pabelangerfungi: yes, would agree. the server needs rebooting to pick up latest kernel also13:33
fungiwell, that could also pose a problem. i see that we've got openafs.ko built for the running 4.4.0-137-generic kernel, but not for the 4.15.0-42-generic which is the newest installed one13:34
*** jpena|lunch is now known as jpena13:35
fungii'll try rerunning the postinst for it13:35
fungiopenafs-modules-dkms is definitely installed13:36
fungirunning `sudo dpkg-reconfigure openafs-modules-dkms` now13:38
*** kgiusti has joined #openstack-infra13:39
fungithis is going to take a while13:39
*** kgiusti has left #openstack-infra13:41
sean-k-mooneyo/ quick question. can nodepool be configured to prefer one cloud over another? e.g. if i have a local private cloud and a limited public cloud, can i prefer to use the local resources and only use the public resources if i run out, or weight it in some way so that 75% run locally and 25% remotely?13:41
*** mdrabe has joined #openstack-infra13:41
*** kgiusti has joined #openstack-infra13:41
*** bhavikdbavishi has quit IRC13:42
fungisean-k-mooney: i think i've heard that requested before but i don't recall if anyone has a solution worked out for it. you might ask in #zuul instead where more of the zuul/nodepool maintainers hang out13:42
pabelangersean-k-mooney: currently no, node requests are random across clouds ATM13:42
sean-k-mooneyok thats fine13:42
sean-k-mooneyfungi: thanks13:42
pabelangerbut agree, it would be nice. especially in the case where you might have 1 donated cloud and one you pay for13:43
fungii can think of a variety of reasons for wanting that sort of node scheduling, for that matter, sure13:43
fungithe reconfig has finally gotten to "Building initial module for 4.15.0-42-generic"13:44
fungifingers crossed13:44
sean-k-mooneyya, it's not an issue for now as i only have the local cloud. i'm considering adding a public cloud for burstable resources in the future.13:45
fungisean-k-mooney: nodepool is written in python and is pretty easy to dig into if you get the urge. i hear the maintainers are really welcoming too! ;)13:46
*** owalsh_ has joined #openstack-infra13:47
sean-k-mooneyperhaps after i have deployed and run it for a bit. i find it's always easiest to learn a new codebase when you're trying to scratch an itch that's bugging you13:47
fungii have a feeling most of the work would be in designing the configuration option(s) for selecting an alternative node scheduling scheme and getting it plumbed through, but again #zuul is a much better place to talk about it13:47
fungior zuul-discuss@lists.zuul-ci.org13:47
sean-k-mooneyya13:47
sean-k-mooneyim on the zuul channel too, i just thought it may have come up as a use case for the upstream ci at some point. anyway, thanks13:48
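The two scheduling ideas raised in this exchange (prefer the local cloud until it is full, or split requests roughly 75/25) can be sketched in a few lines. This is only an illustration of the idea, not how nodepool works: per the discussion, nodepool had no such option at the time, and every name below is hypothetical.

```python
import random


def pick_weighted(weights, rng=random):
    """Pick a provider at random, in proportion to its weight."""
    names = list(weights)
    return rng.choices(names, weights=[weights[n] for n in names], k=1)[0]


def pick_preferred(preferred, fallback, free_slots):
    """Use the preferred provider while it has capacity, else fall back."""
    return preferred if free_slots.get(preferred, 0) > 0 else fallback


# Hypothetical providers: a local private cloud and a paid public cloud.
weights = {"local": 75, "public": 25}
rng = random.Random(0)
picks = [pick_weighted(weights, rng) for _ in range(10000)]
print(picks.count("local") / len(picks))

print(pick_preferred("local", "public", {"local": 0, "public": 5}))
```

The weighted variant spreads load statistically, while the strict-preference variant only touches the public cloud once local capacity is exhausted.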
fungiERROR: Cannot create report: [Errno 17] File exists: '/var/crash/openafs-modules-dkms.0.crash' Error! Bad return status for module build on kernel: 4.15.0-42-generic (x86_64) Consult /var/lib/dkms/openafs/1.6.15/build/make.log for more information.13:49
fungiugh13:49
fungii guess that's why we didn't have the lkm built yet. it probably already crashed once when we were bringing up the server13:50
*** owalsh has quit IRC13:50
*** priteau has joined #openstack-infra13:50
*** owalsh has joined #openstack-infra13:50
fungi/var/lib/dkms/openafs/1.6.15/build/src/afs/LINUX/osi_machdep.h:73:3: error: #error Not sure what to do about rlim (should be in the Linux task struct somewhere....)13:51
*** owalsh_ has quit IRC13:52
fungistrange, our other executors seem to have built it successfully13:53
fungioh, they're using openafs-modules-dkms 1.6.22.2-1ppa1 and ze12 has 1.6.15-1ubuntu113:54
fungiianw: ^ if you're still around, i vaguely recall you having more insight into the ppa situation there than i do13:55
*** jistr|medchk is now known as jistr13:56
fungilooks like we have http://ppa.launchpad.net/openstack-ci-core/openafs-amd64-hwe/ubuntu active in /etc/apt/sources.list.d/openstack-ci-core-ubuntu-openafs-amd64-hwe-xenial.list13:56
fungiso i probably just need to explicitly request apt pull in that version13:56
*** jamesmcarthur has quit IRC13:57
fungilooks like `sudo apt install openafs-modules-dkms=1.6.22.2-1ppa1` is working13:57
*** jamesmcarthur has joined #openstack-infra13:58
*** mriedem has joined #openstack-infra14:00
*** rh-jelabarre has quit IRC14:06
openstackgerritMerged openstack-infra/git-review master: tox: Remove dead settings/targets  https://review.openstack.org/61057614:07
fungiit's up to "Building initial module for 4.15.0-42-generic" again finally14:08
fungiand it's finished. rebooting ze12 now14:12
*** rh-jelabarre has joined #openstack-infra14:12
fungiit's back up now running 4.15.0-42-generic with the openafs lkm loaded and afsd running14:13
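The "is the openafs lkm loaded" check above can be scripted by looking for the module name in the first column of `/proc/modules`. A minimal sketch, assuming the usual whitespace-separated layout of that file; the function name and the sample text are illustrative, and on a real host the text would be read from `/proc/modules`:

```python
def module_loaded(name, modules_text):
    """Return True if `name` appears as a module name in /proc/modules text."""
    for line in modules_text.splitlines():
        fields = line.split()
        # The first whitespace-separated field is the module name.
        if fields and fields[0] == name:
            return True
    return False


# Illustrative sample in the shape of /proc/modules (values are made up).
sample = (
    "openafs 1234567 2 - Live 0xffffffffc0a00000 (OE)\n"
    "nfsd 345678 13 - Live 0xffffffffc0900000\n"
)
print(module_loaded("openafs", sample))
```

Matching on the whole first field avoids false positives from modules whose names merely contain the string being searched for.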
fungi#status log upgraded openafs on ze12 with `sudo apt install openafs-modules-dkms=1.6.22.2-1ppa1` and rebooted onto the latest hwe kernel14:14
*** e0ne has joined #openstack-infra14:15
fungii guess we're missing statusbot too14:15
fungiping timeouts according to its debug log as of 2018-12-08 00:09:20,95014:16
fungithat's also roughly when we lost gerritbot over the weekend, so must have been the same situation14:16
*** lbragstad has joined #openstack-infra14:17
*** openstackstatus has joined #openstack-infra14:17
*** ChanServ sets mode: +v openstackstatus14:17
fungiand it's back14:18
fungi#status log restarted statusbot to recover from connectivity issues from saturday14:18
openstackstatusfungi: finished logging14:18
fungi#status log upgraded openafs on ze12 with `sudo apt install openafs-modules-dkms=1.6.22.2-1ppa1` and rebooted onto the latest hwe kernel14:18
openstackstatusfungi: finished logging14:18
openstackgerritMerged openstack-infra/git-review master: CONTRIBUTING.rst, HACKING.rst: fix broken link, minor flow updates  https://review.openstack.org/62336214:22
openstackgerritMerged openstack-infra/python-storyboardclient master: Change openstack-dev to openstack-discuss  https://review.openstack.org/62236814:24
*** chandan_kumar is now known as chkumar|off14:25
*** aspiers has quit IRC14:26
*** jamesmcarthur has quit IRC14:29
*** quiquell is now known as quiquell|off14:30
openstackgerritMerged openstack-infra/zuul master: Fixed the necesssary to necessary  https://review.openstack.org/62364614:37
*** aspiers has joined #openstack-infra14:38
openstackgerritMerged openstack-infra/zuul master: Fixed the word from congfiguration to configuration  https://review.openstack.org/62363414:40
*** alexchadin has quit IRC14:42
fungize12 seems to have equalized its running builds count with the other executors as of the past few minutes, so i expect all is well there now14:45
fungistill curious why we're down 4 mergers according to our graphs (at 16, should be 20)14:46
fungibut the remaining mergers seem to be keeping up fine14:47
*** gfidente has quit IRC14:49
*** gfidente has joined #openstack-infra14:53
*** jamesmcarthur has joined #openstack-infra14:59
*** yboaron_ has quit IRC15:01
*** yboaron_ has joined #openstack-infra15:02
*** jamesmcarthur has quit IRC15:06
*** jamesmcarthur has joined #openstack-infra15:07
*** gagehugo has joined #openstack-infra15:07
*** priteau has quit IRC15:10
corvusfungi: thanks for the ze12 fix, sorry i missed that15:20
corvusfungi: we can probably find out which mergers are missing by interrogating gearman15:20
fungiyeah, i was considering checking gearman status to see which are registered, just hadn't become urgent yet15:21
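Interrogating gearman for missing mergers amounts to parsing the reply to the admin `workers` command, which lists one worker per line as `FD IP-ADDRESS CLIENT-ID : FUNCTION ...` and ends with a line containing a single `.`. A minimal sketch that works on that text, assuming merger functions carry a `merger:` prefix and that each worker's client ID identifies the host; the sample reply and host names are made up:

```python
def merger_workers(workers_reply):
    """Return client IDs that have at least one 'merger:' function registered."""
    found = set()
    for line in workers_reply.splitlines():
        line = line.strip()
        if line == ".":  # end-of-response marker
            break
        # Pad so a worker with no functions ("... :") still splits cleanly.
        head, _, funcs = (line + " ").partition(" : ")
        parts = head.split()
        if len(parts) < 3:
            continue
        client_id = parts[2]
        if any(f.startswith("merger:") for f in funcs.split()):
            found.add(client_id)
    return sorted(found)


def missing_mergers(workers_reply, expected):
    """Return expected client IDs that are not registered as mergers."""
    return sorted(set(expected) - set(merger_workers(workers_reply)))


# Hypothetical reply text; real client IDs and addresses will differ.
sample = (
    "30 10.0.0.5 zm01 : merger:merge merger:cat\n"
    "31 10.0.0.6 ze01 : executor:execute\n"
    "32 10.0.0.7 - :\n"
    ".\n"
)
print(missing_mergers(sample, ["zm01", "zm02"]))
```

Feeding it the expected host list turns "we're down 4 mergers" into the names of the specific hosts to look at.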
*** kjackal has quit IRC15:21
*** kjackal_v2 has joined #openstack-infra15:22
openstackgerritDoug Hellmann proposed openstack-infra/python-storyboardclient master: add script for tagging stories  https://review.openstack.org/60870715:23
*** kjackal_v2 has quit IRC15:26
*** kjackal has joined #openstack-infra15:26
openstackgerritJonathan Herlin proposed openstack-infra/zuul master: Add spacing to Queue lengths line  https://review.openstack.org/62396015:28
*** sthussey has joined #openstack-infra15:34
*** aojeagarcia has joined #openstack-infra15:36
*** aojea has quit IRC15:38
*** aojeagarcia__ has joined #openstack-infra15:39
*** aojeagarcia has quit IRC15:42
*** aojeagarcia__ has quit IRC15:45
*** neilsun has quit IRC15:47
fungiinfra-root: i'm still in the process of writing up the forum summary for our opendev session (yeah, i know it's a month late) but it's becoming increasingly obvious that a lot of the things going into it are also things we're planning to say in the official announcement... should i keep this summary on hold pending that announcement so we don't prematurely start the same conversations on the ml we15:51
fungiexpect the announcement to trigger?15:51
*** bhavikdbavishi has joined #openstack-infra15:58
openstackgerritMerged openstack-infra/project-config master: Add the os-resource-classes project  https://review.openstack.org/62166616:01
*** bhavikdbavishi has quit IRC16:02
*** bhavikdbavishi has joined #openstack-infra16:03
*** jamesmcarthur has quit IRC16:04
*** yboaron_ has quit IRC16:06
*** yboaron_ has joined #openstack-infra16:07
*** lpetrut has joined #openstack-infra16:12
openstackgerritMerged openstack-dev/pbr master: Change openstack-dev to openstack-discuss  https://review.openstack.org/62232116:17
ttxIf anyone is interested in ptgbot, you can review https://review.openstack.org/#/q/status:open+project:openstack/ptgbot+branch:master+topic:all-configurable -- otherwise I'll soon self-approve those, which would be a bit sad. aspiers maybe?16:17
ttxIt's mostly tech debt reduction fwiw16:18
aspiersttx: I can take a look tomorrow but if that's not quick enough I'm sure everyone would be fine with your self-approval :)16:18
ttxI implemented all suggestions except the topic subscription one, which I may get to next.16:19
*** bhavikdbavishi has quit IRC16:20
*** jamesmcarthur has joined #openstack-infra16:22
aspiersooh, sounds cool - looking forward to taking a look at that :)16:25
aspiersgotta dash now, sorry16:25
*** bhavikdbavishi has joined #openstack-infra16:26
*** jamesmcarthur has quit IRC16:27
openstackgerritBenoît Bayszczak proposed openstack-infra/zuul master: add fetch_vault_secrets Ansible module  https://review.openstack.org/62031116:27
clarkbfungi The website publication you mean?16:28
clarkbI sent the email announcement pre summit (session summary would be good followup to that maybe?)16:28
fungiclarkb: i thought we also planned a more formal announcement of the opendev plans once there was a website up, but oh i guess i misremembered and that was indeed pre-summit?16:29
fungiif so, no concern, i'll go ahead with sending this out16:29
clarkbYa, pre summit was the official email thing. We should send more emails for sure but I don't think that has to wait16:30
fungiperfect16:30
fungihave a link to the ml post?16:30
fungithe one in the session etherpad was to the winterscale announcement back in may16:31
*** chkumar|off has quit IRC16:32
clarkbwhere'd the -dev archive end up? though maybe it was forwarded to discuss16:32
clarkbhrm doesnt look like it16:33
fungiit's still where it was: http://lists.openstack.org/pipermail/openstack-dev/16:33
clarkbah not on the index page though?16:33
*** ginopc has quit IRC16:33
fungiit no longer appears on the list of active mailing lists, right16:34
clarkbhttp://lists.openstack.org/pipermail/openstack-dev/2018-November/136403.html16:34
fungithanks!16:34
fungiand i suppose we could tack on a set of links to the archives for the old lists we've decommissioned if someone is interested in working on that (there are something like a dozen already, and probably will be more as time goes on too)16:35
*** openstackgerrit has quit IRC16:35
*** psachin has joined #openstack-infra16:42
*** yboaron_ has quit IRC16:46
*** gyee has joined #openstack-infra16:47
*** gfidente has quit IRC16:51
*** ccamacho has quit IRC16:52
*** ykarel has quit IRC16:52
*** ykarel has joined #openstack-infra16:53
*** jamesmcarthur has joined #openstack-infra16:53
*** yamamoto has joined #openstack-infra16:55
*** gfidente has joined #openstack-infra16:56
*** openstackgerrit has joined #openstack-infra16:57
openstackgerritBenoît Bayszczak proposed openstack-infra/zuul master: add fetch_vault_secrets Ansible module  https://review.openstack.org/62031116:57
*** jamesmcarthur has quit IRC16:57
*** udesale has quit IRC16:58
clarkbfrickler: amorin let me know if I can help with the benchmarking but seems like you have it under control?16:59
*** yamamoto has quit IRC16:59
clarkbfungi: while you've got opendev paged in care to look at https://review.openstack.org/622624 ? that is my draft of website content17:03
*** ykarel is now known as ykarel|away17:03
fungididn't i? thought i left a comment17:07
clarkboh I didn't see any +1 or -1 so assumed there wasn't any, bad assumption I take it17:07
*** jamesmcarthur has joined #openstack-infra17:07
clarkbyup, thanks, I'll read it over shortly17:07
fungiwell, i figured it won't land until it's reworked into html and we have a job to run against it (at least noop)17:07
fungiso there was little point in leaving a vote17:08
clarkbI've updated the meeting agenda and will set out the set agenda at 1900UTC today (you still have time to add other topics if you like)17:14
clarkb*send out the set  agenda17:14
fungiyou know you've written a long session summary when you don't want to proofread it17:14
clarkbha17:14
corvusfungi: your readers will do it for you!17:14
fungihah17:15
*** eumel8 has quit IRC17:15
clarkbmordred: are you about today? One of the things on my list from friday is further debugging of the inap image upload issues17:16
clarkbit appeared to be buggy openstacksdk or keystoneauth1. Updating to latest keystoneauth1 on the nodepool builders did not fix things17:16
clarkbshort of pdb'ing the process any thoughts on debugging that?17:17
clarkbI guess we could try a manual upload outside of nodepool with all the debugging turned on17:17
*** _alastor_ has joined #openstack-infra17:17
clarkbanyone else want to review a procedure docs update for infra-specs https://review.openstack.org/#/c/623211/6 before I approve that? it's mostly updates that take into account that storyboard has changed a bit since we first wrote those docs17:19
clarkbalso if any infra-root are able to review the ara stack at https://review.openstack.org/#/q/topic:inner-ara-results and the fedora-29/networkmanager stack at https://review.openstack.org/#/q/status:open+topic:fedora29 that will likely make ianw and dmsimard very happy (I've reviewed the changes and I think they are mostly ready for quick double check and go)17:20
*** dtantsur is now known as dtantsur|afk17:21
*** jpich has quit IRC17:22
*** e0ne has quit IRC17:22
*** e0ne has joined #openstack-infra17:22
*** e0ne has quit IRC17:23
*** e0ne has joined #openstack-infra17:26
*** kjackal has quit IRC17:26
*** kjackal has joined #openstack-infra17:26
*** derekh has quit IRC17:29
mordredclarkb: I am around today - finishing up a patch update, then I need to do a quick phone call, then I can totally help with that17:37
*** bhavikdbavishi has quit IRC17:37
openstackgerritClark Boylan proposed openstack-infra/opendev-website master: Add some initial content thoughtso  https://review.openstack.org/62262417:37
openstackgerritClark Boylan proposed openstack-infra/opendev-website master: Add .zuul.yaml  https://review.openstack.org/62413917:37
clarkbmordred: thanks17:37
*** bhavikdbavishi has joined #openstack-infra17:37
clarkbfungi: corvus ^ Content now with gating so things can merge. Any thoughts on where to start with htmlification? I can sort of manually html something out I guess17:38
corvusclarkb: i suggest simple manual htmlification17:39
fungi<html><head>...</head><body><h1>...</h1><p>...</p><p>...</p></body></html>17:39
fungiyeah, basic sgml tags for now and then we can always make it "pretty" later17:40
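A minimal sketch of the "simple manual htmlification" suggested above, using only the Python standard library. The function name `render_page` and the sample text are hypothetical and not taken from the opendev-website change; this only illustrates wrapping a title and plain-text paragraphs in the basic tags from fungi's skeleton.

```python
from html import escape


def render_page(title, paragraphs):
    """Wrap a title and plain-text paragraphs in minimal HTML tags."""
    safe_title = escape(title)
    # Escape each paragraph so characters like & and < stay literal text.
    body = "".join(f"<p>{escape(p)}</p>" for p in paragraphs)
    return (
        f"<html><head><title>{safe_title}</title></head>"
        f"<body><h1>{safe_title}</h1>{body}</body></html>"
    )


# Hypothetical sample content, for illustration only.
page = render_page(
    "OpenDev",
    ["First paragraph.", "Ampersands & <angle brackets> are escaped."],
)
```

Output this plain should be easy to convert later to whichever publishing approach is chosen.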
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Ectomy some Jenkins out of the docs  https://review.openstack.org/43645217:40
corvusclarkb: there are 2 other ideas in play and i don't know which way we might want to go: a) gatsby static site generation; b) make this page the homepage of a gitea instance.  those are very different, so until we have an idea of which way to go, doing one may be wasted effort if we do the other.  so really simple html is probably the safest thing right now as we can upconvert it to whichever later.17:41
mordred++ totally agree17:41
clarkbok17:41
corvusi hope to have more info so we can make a decision on that soon :)17:41
mordredclarkb, corvus, fungi: ^^ jenkins-ectomy patch should be ready for review - and is a little long in the tooth and easy to ignore17:42
clarkbthere are a bunch of simple md to html formatters so I'll actually probably use one of them to make this simpler17:43
corvusmordred: i reviewed it17:46
corvusclarkb: pandoc rocks.17:46
corvusclarkb: also handles rst input17:46
fungiyeah, pandoc can do it all17:46
corvusclarkb: and it's haskell17:46
fungithough you can also mark it up with rst and then use docutils.core.publish_string(rst, writer_name="html")17:50
fungiprobably supports md too, i've never tried17:50
clarkbya at this point I'm trying a few things to see if anything renders something that is nicer than the others17:51
clarkbrst + sphinx would be familiar17:51
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Ectomy some Jenkins out of the docs  https://review.openstack.org/43645217:55
mordredcorvus: thanks! updated17:55
mordredclarkb: given the size of the content, I could probably just make you a gatsby patch in almost no time - then any jobs we write to do the gatsby publish step could be re-used for zuul-website even if we don't wind up using gatsby for opendev and use gitea instead17:56
mordredclarkb: lemme hop on this phone call I need to do real quick, and when I'm off I can do that patch for you real quick17:57
clarkbmordred: I've got something that mostly works but needs some css. I figure figuring that out is worthwhile for myself. But feel free to push up a thing for the other thing if you like17:57
clarkb(also inap image uploads :P)17:57
mordrednod18:01
fungiseeing a _lot_ fewer job timeouts today18:01
fungi17 in the past 6 hours18:01
fungithough ~50% of those are still in limestone-regionone18:02
*** e0ne has quit IRC18:09
*** e0ne has joined #openstack-infra18:11
*** e0ne has quit IRC18:11
openstackgerritClark Boylan proposed openstack-infra/opendev-website master: Convert initial content to html for publication  https://review.openstack.org/62414918:13
clarkbcue dog meme "I have no idea what I'm doing" ^ :)18:13
clarkbthat seems to look ok in firefox if quite simple18:13
*** jpena is now known as jpena|off18:14
*** adam_g has quit IRC18:15
*** adam_g has joined #openstack-infra18:15
*** jamesmcarthur has quit IRC18:19
clarkbfungi: late friday I started looking at some limestone failure and some of them were to pypi through our mirror cache18:19
clarkbpypi doesn't have AAAA records so we'll be NATing there18:20
*** wolverineav has joined #openstack-infra18:20
clarkblogan-: ^ any insight on if the NAT is having trouble? With the mirror it should use the 1:1 fip NAT though18:20
*** psachin has quit IRC18:23
*** wolverineav has quit IRC18:24
clarkber I take that back they do have AAAA records18:27
clarkbso must be direct access with network blips. We have seen those in pypi before18:27
*** diablo_rojo has joined #openstack-infra18:29
clarkbmordred: for when you do get to looking at upload failures, http://paste.openstack.org/show/736850/ was the traceback from friday. We should double check it persists in case it was a cloud side issue18:32
*** e0ne has joined #openstack-infra18:34
fungiclarkb: earlier last week there was a point where both limestone-regionone and rax-dfw were failing to reach pypi.org for the same span of time. wonder if there's a local fastly cdn endpoint misbehaving for both of them given their relative global proximity18:40
AJaeger~.18:40
fungi...NO CARRIER18:41
clarkbfungi: that seems likely if they both experienced it together18:41
*** wolverineav has joined #openstack-infra18:41
*** jamesmcarthur has joined #openstack-infra18:42
*** bhavikdbavishi has quit IRC18:43
*** bhavikdbavishi has joined #openstack-infra18:43
*** wolverineav has quit IRC18:44
*** wolverineav has joined #openstack-infra18:47
AJaegerconfig-core, mnaser has a change up for tweaking vexxhost - could you review, please? https://review.openstack.org/62360518:56
AJaegerconfig-core, there are a couple of changes queued up with +2s - could you grab some of these, please?18:57
*** diablo_rojo has quit IRC18:57
*** jcoufal has joined #openstack-infra19:01
*** josephrsandoval has joined #openstack-infra19:02
*** wolverineav has quit IRC19:03
*** wolverineav has joined #openstack-infra19:03
*** e0ne has quit IRC19:06
*** ykarel|away has quit IRC19:08
*** diablo_rojo has joined #openstack-infra19:09
*** wolverineav has quit IRC19:09
*** gfidente is now known as gfidente|adk19:10
*** gfidente|adk is now known as gfidente|afk19:10
*** electrofelix has quit IRC19:11
*** wolverineav has joined #openstack-infra19:11
*** josephrsandoval has quit IRC19:14
*** bobh has quit IRC19:15
*** bhavikdbavishi has quit IRC19:20
*** mguiney has joined #openstack-infra19:20
*** anteaya has joined #openstack-infra19:21
pabelangerAJaeger: fungi: left a -1 on 623605. I _think_ we need to have it be a 2 step process19:23
pabelangerotherwise, we'll have nodes online but no nodepool provider config19:23
fungioh, yep, we need to do max-servers:0 first?19:23
fungii've unapproved it for now19:24
pabelangeryah, once landed you then need to manually delete the online VMs or wait for them to be used.  I am unsure if max-server: -1 is still a config setting, because that would delete any READY nodes19:24
*** bobh has joined #openstack-infra19:25
AJaegerpabelanger: oh - thanks!19:27
AJaegermnaser: ^19:28
*** bobh has quit IRC19:30
fungionce the max-servers:0 goes in, we can manually delete ready nodes anyway19:31
*** bobh has joined #openstack-infra19:34
fungi#status log provider indicates the host on which ze01 resides has gone offline19:35
openstackstatusfungi: finished logging19:35
fungiand i've confirmed i can't ping it19:36
fungii guess those jobs should all get rescheduled?19:37
clarkbfungi: yes unless they had already failed two times previously for similar network related issues19:37
*** harlowja has joined #openstack-infra19:38
*** bobh has quit IRC19:38
fungifair point, then they get a retry_limit19:38
clarkbactually in this case it will be the executor going away so maybe it jsuit works out regardless of previous failures19:39
clarkbsince it can differentiate ebtween the two class of failure in the schedulure19:39
clarkb(I can't spell)19:39
fungii can ping it again now19:39
fungiit's been rebooted19:40
fungi"...up 0 min..."19:40
clarkbfungi: ns2 is still spamming, related to that is stack at https://review.openstack.org/#/c/623041/1 care to review those?19:40
*** jamesmcarthur has quit IRC19:41
fungiclarkb: i only see ns2 reporting the same situation as ns1 now. it's notifying us there's an available update which will pull in a new dependency on upgrade so it's not acting on it automatically19:42
fungii guess netplan.io is related to lxd though?19:43
clarkboh is it?19:43
clarkbI looked it up and it's a yaml based network config setup19:43
clarkbI could see how that would be useful to lxd19:43
openstackgerritClark Boylan proposed openstack-infra/system-config master: Update nsd systemd unit deps  https://review.openstack.org/62262019:44
clarkbthat should fix a bug that testing caught19:44
fungiwow, the motd19:44
*** jamesmcarthur has joined #openstack-infra19:44
fungii'd switch to debian if for no other reason than to no longer get subjected to advertising in our motds19:45
*** jamesmcarthur has quit IRC19:45
mordredclarkb: ok. I'm now in a position to help debug the image uploads ... where should I start?19:45
fungiclarkb: anyway, the cronspam for the past few days is basically letting us know that upgrading netplan.io will also install python3-netifaces which is not yet installed19:46
*** wolverineav has quit IRC19:46
*** wolverineav has joined #openstack-infra19:47
clarkbmordred: I pasted a paste link above why don't you digest that while I double check behavior hasn't changed since friday19:48
fungicorvus: heh, ze01 was apparently home to one of the missing mergers... our merger count bumped back up to 17 when it was rebooted19:48
clarkbfungi: if we merge the change to remove lxd apt will still want to update netplan.io right? it won't recursively remove netplan if that is where it is pulled in from19:48
clarkb(or will it?)19:48
mordredclarkb: reading19:48
clarkbmordred: I have confirmed it is still happening as of 19:18UTC today19:49
fungiclarkb: i was only speculating that netplan was installed as a dependency of lxd. i haven't looked at its list of deps to confirm19:49
clarkb(half an hour ago)19:49
clarkbfungi: ah19:49
mordredclarkb: uhm.19:49
clarkbfungi: https://packages.ubuntu.com/bionic/lxd doesn't list netplan19:49
clarkbmordred: yes it's saving to saving :)19:49
clarkbmordred: it's a fun one19:49
mordredoh - hrm. I have a thought ...19:50
clarkbmordred: where I got to was the image content upload call is what triggers that and that does not seem to provide any header/metadata info for the state19:50
fungiclarkb: agreed, i looked with apt show and didn't see it as a depends nor recommends19:50
clarkbmordred: the sdk native stuff might but we are still using the shade layer which doesn't19:50
fungiclarkb: it was more of a question as to why the change to stop installing lxd would silence this particular current cronspam on the authoritative nameservers19:51
fungi(i don't personally object to that stack of changes though and am reviewing anyway)19:51
clarkbfungi: not installing lxd would fix the lxd cronspam19:51
mordredclarkb: it'll almost certainly be related to the reorg of the image code we did in anticipation of using the lower-level stuff19:51
clarkbmay also need to add netplan.io to that list19:52
*** wolverineav has quit IRC19:52
fungiclarkb: agreed, but it hasn't complained about lxd for days, ever since i uninstalled it19:52
*** xek has quit IRC19:52
clarkbfungi: yup, the ansible change is mostly a followup to that so that future hosts don't complain either19:52
mordredclarkb: my hunch is that we're half-way in - so the self.get_image() is returning an image with status['saving'] - and that we're running update_image on that to update the metadata19:52
*** xek has joined #openstack-infra19:52
mordredclarkb: when we should be waiting on the image to be in a ready state before moving forward with the update19:52
clarkbI can't help but feel this is more evidence the multistep process is just broken19:53
clarkbthat said I don't see where it's calling those methods19:54
clarkbnodepool uses the shade layer which is doing a POST to create the image record then PUTting to it with the raw client stuff19:55
clarkbwhich is why I couldn't see where we might be adding bits in19:55
clarkb(the native sdk stuff definitely tries to be smarter about that stuff though)19:55
mordredclarkb: yeah - this will be cleaner to follow once we finish the transition19:56
corvuspabelanger, fungi, AJaeger: would you please re-evaluate 623605 based on my comments?19:56
clarkbfungi: "Netplan replaced ifupdown as the default configuration utility starting with Ubuntu 17.10 Artful." I don't think we want to remove it19:59
clarkbfungi: in this case probably best to just push it past that upgrade step and let it be happy?19:59
*** diablo_rojo has quit IRC20:00
*** _alastor_ has quit IRC20:00
*** jamesmcarthur has joined #openstack-infra20:01
fungiclarkb: yeah, just strange to see a package upgrade in an lts release suddenly pull in a new dependency20:01
clarkbfungi: https://code.launchpad.net/~usd-import-team/ubuntu/+source/netplan.io/+git/netplan.io/+ref/ubuntu/bionic-updates shows it being added at least20:03
fungiahh, nplan is marked as a transitional package20:03
fungiso i guess they decided to rename it to netplan.io but still a strange choice within an already released lts20:04
mordredclarkb: oh - wait - the words I said above don't make 100% sense ... still poking20:05
fungii guess this will clear up after the next bionic point release or whatever20:05
clarkbfungi: or even if $cloud updates its bionic image20:06
clarkb?20:06
clarkbassuming the image has transitioned to the new name20:06
fungiyeah20:06
mordredclarkb: remote:   https://review.openstack.org/624188 Don't try to upload to images in saving or queued state20:11
clarkbmordred: oh, is it trying to upload to an image from a previous upload attempt?20:13
clarkbmordred: won't that just result in not uploading any images then?20:14
mordredclarkb: yes, I believe that is what is going on - there is an image in saving state and it's trying to upload data to it20:14
*** wolverineav has joined #openstack-infra20:14
clarkbnothing should be uploading to that from nodepool but the thread that is failing (due to zk locks)20:14
mordredclarkb: I agree with you - unless the uploaders were restarted - or maybe an upload attempt was aborted halfway?20:15
mordredclarkb: cause there is nothing to prevent the sdk from trying to upload the file contents again even if openstack isn't in the right state for that20:15
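A rough sketch of the kind of guard named in the change title above (do not send data to an image that is in a saving or queued state). This is not the openstacksdk code; the function name `upload_blocked` and the dict shape are hypothetical, and only the two status names come from the discussion.

```python
# Statuses in which this sketch refuses to send image data
# (taken from the change title; everything else is an assumption).
BUSY_STATUSES = {"saving", "queued"}


def upload_blocked(existing_image):
    """Return True when data should not be sent to an existing image record.

    existing_image is None when no image with that name exists, otherwise
    a dict with a "status" key (hypothetical shape, for illustration).
    """
    if existing_image is None:
        # Nothing to collide with, so this guard does not apply.
        return False
    return existing_image.get("status") in BUSY_STATUSES
```

A caller would check this before the PUT of the file contents, so a leftover record from an aborted attempt is reported up front rather than after the whole transfer.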
*** ldnunes has quit IRC20:15
*** wolverineav has quit IRC20:19
fungi#status log manually invoked `apt upgrade` on ns1 and ns2.opendev.org in order to silence cronspam about unattended-upgrades not upgrading netplan.io due to introducing a new dependency on python3-netifaces20:19
clarkbmordred: ya I think either of those cases are possible20:19
openstackstatusfungi: finished logging20:19
clarkbmordred: does nodepool need to try and delete that preexisting record of an image then start uploads?20:20
openstackgerritMerged openstack-infra/project-config master: vexxhost: tweak nodepool settings  https://review.openstack.org/62360520:20
spotzfungi clarkb are either of you still about?20:20
*** wolverineav has joined #openstack-infra20:20
fungiyep20:20
fungi'sup?20:20
spotzfungi - I got this: Channel #openstack-ansible is linked to another channel and was thus disabled.20:21
spotzMay just be me but with a lot of the team on vacation I'm wondering if y=our channel is broke as it's quiet except for system messages20:21
openstackgerritMerged openstack-infra/system-config master: Don't install lxd on our servers  https://review.openstack.org/62304020:22
fungispotz: i'm afraid i'm not following20:23
fungispotz: did someone say this in the #openstack-ansible channel in the last few minutes or something?20:23
spotzfungi - I think the openstack-ansible channel is broken20:23
openstackgerritMerged openstack-infra/system-config master: Configure packages on ubuntu arm servers  https://review.openstack.org/62304120:23
spotzThat was in a status message from Freenode over the weekend20:23
fungispotz: i see people talking in it today: http://eavesdrop.openstack.org/irclogs/%23openstack-ansible/%23openstack-ansible.2018-12-10.log.html20:24
spotzfungi: Ok then I'm just broken, may need to re-register as I haven't seen anyone20:24
fungidiscussion in there as of the past few minutes even20:24
spotzOk just me then20:25
fungilet us know if you need help troubleshooting20:25
spotzfungi: Will do thanks:)20:25
clarkbmordred: as an alternative maybe always upload to a new image?20:25
clarkbmordred: in shade I mean. I'm not sure why shade would try to be optimizing that there20:26
clarkbmordred: is it when the names match?20:26
*** lpetrut has quit IRC20:27
*** e0ne has joined #openstack-infra20:28
*** e0ne has quit IRC20:28
mordredclarkb: yeah - this is when someone is asking shade to upload an image with a name that already exists20:30
*** bobh has joined #openstack-infra20:30
mordredclarkb: in general we try to make that a no-op when we can20:30
mordredso that if you're actually asking to re-upload an image that already exists with the same name, we don't actually attempt to upload the data a second time20:31
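The no-op behaviour described above can be sketched as a checksum comparison. This is an illustration under stated assumptions, not shade's implementation: the name `needs_upload`, the in-memory `catalog` mapping, and the choice of SHA-256 are all hypothetical.

```python
import hashlib


def needs_upload(catalog, name, data):
    """Decide whether image data has to be sent.

    catalog is a hypothetical mapping of image name to the SHA-256 hex
    digest of the data already stored under that name.
    """
    digest = hashlib.sha256(data).hexdigest()
    # Same name and same digest: treat the request as a no-op.
    # Unknown name or different digest: the data must be uploaded.
    return catalog.get(name) != digest


# Hypothetical catalog with one image already stored.
catalog = {"centos-7": hashlib.sha256(b"image-bytes").hexdigest()}
```

With this shape, asking to upload identical bytes under an existing name is skipped, while new names or changed contents still trigger an upload.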
*** jamesmcarthur has quit IRC20:31
clarkbya and if we upload a new image with a new record and uuid the two names will be ambiguous20:31
*** jamesmcarthur has joined #openstack-infra20:33
clarkbmordred: given your debugging I think I may try manually deleting the "SAVING" broken images or would you rather we wait and make sure that whatever fixes go in fix it?20:33
mordredclarkb: no - go ahead and try manually deleting20:33
clarkbok20:34
mordredclarkb: I *think* it's going to error on you - and that we might need to get a cloud admin to do something20:34
mordredclarkb: that said - we should at some point be attempting to upload a whole new image with a new name20:34
clarkbya when we build a new dib image iirc20:35
clarkbthere is likely a different failure mode we should watch for once I clean some of these up20:35
mordredyeah20:35
*** bobh has quit IRC20:35
clarkbmordred: I'm starting by deleting the centos7 images that are stuck/stale in inap20:37
openstackgerritMatt Riedemann proposed openstack-infra/elastic-recheck master: Modify job timeout query to exclude tempest-all/slow build_name  https://review.openstack.org/62394920:37
clarkbsince centos 7.6 uploads not working is how we caught this20:37
*** ahosam has joined #openstack-infra20:37
mordredclarkb: awesome20:37
clarkbmordred: another frustrating thing is I think its not returning that error until after the PUT is complete20:38
*** ahosam has quit IRC20:38
*** ahosam has joined #openstack-infra20:38
clarkbso glance isn't verifying data until it has the entire transfer :/20:38
mordredclarkb: of course20:38
clarkbok the cleanup seems to have worked. I'll do it for the other images now too20:40
clarkb(then we wait and see if uploads give us a different error)20:40
clarkbthinking out loud a bit more too we may want to delete the old centos 7 images in inap that are active too. This way osa stops having problems with 7.5 and 7.6 both being around (they use virtualenvs in bad ways and have discovered they are non-portable)20:44
ianwfungi: hrm, i have to remember why we're on the ppa version20:52
ianwoh right, hwe kernel20:53
ianwclarkb: can you catch me up on the image issues?20:54
clarkbianw: ya last week mnaser/osa noticed that we still had some centos 7.5 images floating around. After digging in I figured out that it was because inap images were not getting successful uploads so that one cloud region had 7.5 from when things were working20:56
clarkbianw: mordred thinks the behavior is due to openstacksdk/shade trying to reupload to an image if it already exists and the hashes don't match20:56
clarkbexcept that glance won't let you update an image in a saving state. Also it doesn't check that until after the upload has completed (so likely at least one bug in glance too)20:56
*** yamamoto has joined #openstack-infra20:57
clarkbianw: the problem is every time we build a new dib image we should try to upload to a new name and shade won't try to update the existing image record in that case. So I am cleaning out the stale records in the cloud now and we should see new errors hopefully20:57
clarkbthe new error being the actual cause of the problem20:57
ianwah ok, sounds good.  I'll keep watching, LMN if i can help :)20:57
*** diablo_rojo has joined #openstack-infra20:58
*** kjackal has quit IRC20:58
*** rockyg has joined #openstack-infra20:58
openstackgerritMerged openstack-infra/elastic-recheck master: Modify job timeout query to exclude tempest-all/slow build_name  https://review.openstack.org/62394921:00
*** yamamoto has quit IRC21:01
*** gfidente|afk has quit IRC21:03
pabelangerianw: any idea why python3-virtualenv is missing on the fedora images? http://logs.openstack.org/02/623602/1/check/ansible-role-virtualenv-pkg-fedora-latest/8f80ecc/job-output.txt.gz#_2018-12-08_00_09_39_929607 Somehow I think DIB is doing something here21:06
*** jamesmcarthur has quit IRC21:08
*** eernst has joined #openstack-infra21:09
clarkbhow is it already 2100UTC21:09
*** jamesmcarthur has joined #openstack-infra21:11
*** yboaron_ has joined #openstack-infra21:12
ianwpabelanger: hrm ... no immediate thoughts21:13
*** cgoncalves has joined #openstack-infra21:15
fungiclarkb: i think i fell into a time warp today too21:16
pabelangerianw: yah, something is blocking dnf, but haven't looked into it too hard. Thanks, I'll dig more21:16
clarkbmordred: ianw ok nb01 is uploading 4c5eee26-114e-4674-ae0a-c62ff7f46ab8 centos7 image now. It started at 2018-12-10 20:42:26,852 which according to irc logs is after I cleaned up the centos7 queued/saving images21:19
ianwpabelanger: oohhhh, then yes i know, we pin that21:20
*** eernst has quit IRC21:21
ianwpabelanger: here -> https://git.openstack.org/cgit/openstack/diskimage-builder/tree/diskimage_builder/elements/pip-and-virtualenv/install.d/pip-and-virtualenv-source-install/04-install-pip#n14621:21
pabelangerianw: oh, so excludes seems to remove them completely from package indexes. And dnf returns 40421:23
ianwyeah, i think it pretends it doesn't exist, but since it's installed it resolves as a dependency21:24
ianwthe only case this is a problem is when you try to install it directly, which seems to be exactly what you're doing :)21:24
*** yboaron_ has quit IRC21:24
ianw(i am under no illusions that this is not a big mess ...)21:25
pabelangerianw: https://dnf-plugins-core.readthedocs.io/en/latest/versionlock.html might be a better way to fix that.  reading up on it now21:25
ianw++ ... first i've heard of that, but looks promising21:26
*** kgiusti has left #openstack-infra21:27
*** efried_cya_jan has quit IRC21:29
openstackgerritBen Nemec proposed openstack-dev/pbr master: Ignore --find-links in requirements file  https://review.openstack.org/59729021:29
mordredclarkb: awesome. fingers crossed21:30
*** eernst has joined #openstack-infra21:31
*** eernst has quit IRC21:32
*** jtomasek has quit IRC21:32
*** bobh has joined #openstack-infra21:36
*** wolverineav has quit IRC21:44
*** wolverineav has joined #openstack-infra21:46
*** jamesmcarthur has quit IRC21:52
openstackgerritBen Nemec proposed openstack-dev/pbr master: Ignore --find-links in requirements file  https://review.openstack.org/59729021:56
clarkbmordred: it failed with the same saving to saving 409 error21:57
clarkbit's possible I didn't get the ordering right21:58
clarkbmordred: maybe we should test with a manual upload with a completely different name and see if it has the same failure?21:58
clarkbI'm going to try that with openstackclient22:00
clarkbthen if that works we can pretend to be nodepool22:00
mordredok. if that's still failing - that's unhappy making22:01
*** tpsilva has quit IRC22:02
clarkbError finding address for https://image.api.mtl01.cloud.iweb.com/v2/images/457f4cd0-780a-4d90-9848-70e044ab9292/file: (32, 'EPIPE')22:05
clarkbdoesn't work with openstackclient either but fails different22:05
*** _alastor_ has joined #openstack-infra22:06
clarkbI can create an image with no file22:11
clarkbbut then osc doesn't seem to have a way to upload to the file22:12
clarkbthat appears to be an ssl error22:16
mordredgrump22:19
clarkbapparently EPIPE means the remote closed the connection22:20
clarkbso ssl by way of using tls on top of tcp22:20
clarkbI'm trying on nb01 instead of bridge.o.o in case its a python3.6 issue22:22
clarkbsame error though22:22
clarkbin that case osc likely not working then22:22
clarkbmgagne_: ^ not sure if you have noticed, but uploads using openstacksdk and osc to inap are currently failing (for what appear to be different reasons)22:23
mgagne_what are the reasons?22:23
mgagne_EPIPE ?22:24
clarkbmgagne_: glanceclient.exc.CommunicationError: Error finding address for https://image.api.mtl01.cloud.iweb.com/v2/images/0cfb3a49-2f51-4ac5-9653-1b9e918e324d/file: (32, 'EPIPE') is the osc reason22:24
clarkbopenstack.exceptions.ConflictException: ConflictException: 409: Client Error for url: https://image.api.mtl01.cloud.iweb.com/v2/images/ec5fab20-4ff7-4286-b166-fb81f0fa0699/file, 409 Conflict: Image status transition from saving to saving is not allowed is the sdk reason22:24
clarkbthere is a non zero chance it is bugs in both tools22:25
clarkbbut maybe you are aware of something recent in the cloud that may explain that? like in the last 2 weeks22:25
mgagne_have problems started 2 weeks ago or more recently?22:25
clarkbmgagne_: the last successful image from nodepool using sdk was 13 days ago. I only just tried osc now22:26
mgagne_clarkb: ok, let me look into it.22:27
clarkbthank you22:28
fungisounds vaguely reminiscent of when one provider had some sort of proxy in front of the glance api endpoint and it was timing out while uploads were in progress22:32
mgagne_fungi: I'm looking into timeout issues. I'm trying to figure out what part is timing out as several lbs are involved.22:34
*** boden has quit IRC22:34
mgagne_I'm able to reproduce the issue so there is that.22:37
*** efried has joined #openstack-infra22:39
*** diablo_rojo has quit IRC22:44
clarkbcorvus: do we want to try restarting the zuul scheduler today to pick up the queue organizational stuff?22:45
corvusclarkb: i think that would be a good idea, but i'm deep into learning some k8s right now and don't have a lot of attention.  if you or others have some cycles, i think it's ready.22:46
corvusclarkb: if you feel like doing it, but something goes wrong, you can ping me and i can drop what i'm doing to help fix.22:47
clarkbcorvus: ok, any idea if we can get away with a scheduler only restart or if the executors should be restarted as well?22:47
corvusclarkb: scheduler-only should be fine22:48
*** wolverineav has quit IRC22:49
clarkbok I'll get set up to do that here in a bit22:49
clarkbprocess is notify release team, dump queues, stop scheduler, start scheduler, restore queues22:50
clarkbI've asked the release team about it, if I don't hear back soon I'll assume they are all afk enjoying their evenings and that it is safe to restart the scheduler now22:52
clarkbzuul==3.3.2.dev54  # git sha 6497fa3 is the version of zuul installed on zuul01 which appears to include corvus' change to specify group membership for relative priority22:55
*** _alastor_ has quit IRC22:55
*** rcernin has joined #openstack-infra22:59
clarkbThere is a dragonflow change close to merging, I'll begin once that merges22:59
mgagne_clarkb: I think I found the issue.23:00
mgagne_I'm now able to upload an image in inap-mtl01 in my private account.23:01
clarkbmgagne_: great, I will retest with osc on our end once the zuul restart is completed23:01
clarkbdragonflow change merged I'll proceed with zuul-scheduler restart now23:02
*** diablo_rojo has joined #openstack-infra23:03
*** panda is now known as panda|off23:06
fungiclarkb: should we plan a gerrit restart for the same time?23:08
fungithough https://review.openstack.org/471078 would be nice to get in at some point as well and needs a gerrit restart too23:09
*** wolverineav has joined #openstack-infra23:10
clarkbfungi: too late, zuul is already done :P or almost done, the web portion seems unhappy23:10
fungino worries23:11
fungii can probably swing a gerrit restart any time i think of it when not much is going on. it's not like those take long23:12
clarkbok its back now23:12
clarkbreenqueuing changes now23:12
clarkbI needed to restart zuul-web but then stop zuul-web didn't clean out its pid file so I had to rm the pid file, stop zuul web again (because systemd?) then start it23:13
clarkb#status log Restarted Zuul scheduler to pick up changes to how projects are grouped into relative priority queues.23:14
openstackstatusclarkb: finished logging23:14
clarkbmriedem: dansmith ^ fyi I'm hoping that allows us to put into place some of the ideas from your feedback23:15
dansmithcool23:15
EmilienMgerrit is slow for me only? (take 20s to push a patch)23:16
EmilienMyou must have firewall rules against Quebec IPs, I'm sure :-P it's always just me23:17
clarkbEmilienM: its possible that zuul restart hits it pretty hard23:18
*** dave-mccowan has quit IRC23:19
clarkbmgagne_: my image upload isn't failing as quickly as before23:19
clarkbwith osc23:19
mgagne_clarkb: but is it still failing?23:19
*** slaweq has quit IRC23:19
clarkbmgagne_: no sorry, it hasn't finished yet23:19
mgagne_cool23:20
clarkboh it just finished successfully so osc is working now I think23:20
mgagne_:D23:20
clarkbI'll watch nodepool to see if it improves too23:20
clarkbEmilienM: there is a spike in gerrit cpu usage that roughly correlates to when I restarted zuul23:22
clarkbI expect that now that zuul is started this will taper off23:22
EmilienMclarkb: thanks23:22
*** ahosam has quit IRC23:26
*** shardy has quit IRC23:29
*** diablo_rojo has quit IRC23:30
clarkbI've not noticed any weird zuul behavior yet and we have a change in post which implies we've merged stuff23:37
clarkbI think the next step is to set queue: values on various check queues for projects23:38
*** jamesmcarthur has joined #openstack-infra23:39
*** bobh has quit IRC23:39
clarkbremote: https://review.openstack.org/62424623:42
*** bobh has joined #openstack-infra23:43
clarkbcorvus: ^ any idea if it should work out of a project-template like that? I seem to recall that may have caused problems in the past (but is easier than updating every project)23:43
*** jamesmcarthur has quit IRC23:43
corvusclarkb: should be fine23:44
*** yamamoto has joined #openstack-infra23:45
pabelangerclarkb: if we roll back or disable the feature for priority, will 624246 just be noop, or does that also need to be reverted?23:51
clarkbpabelanger: good question I think zuul may treat it as a config error? but unsure23:52
pabelangerkk23:53
*** bobh has quit IRC23:53
*** mriedem is now known as mriedem_away23:59
