Friday, 2018-09-07

*** ijw has joined #openstack-infra00:05
*** dhinesh has joined #openstack-infra00:12
*** ijw has quit IRC00:13
*** longkb has joined #openstack-infra00:16
*** sthussey has quit IRC00:17
*** gyee has quit IRC00:21
*** graphene has quit IRC00:31
*** threestrands has joined #openstack-infra00:32
*** threestrands has quit IRC00:32
*** threestrands has joined #openstack-infra00:32
*** spsurya has quit IRC00:35
openstackgerritMerged openstack-infra/project-config master: Revert "Revert "Revert "Disable inap-mtl01 provider"""  https://review.openstack.org/60057700:42
clarkbI'll keep an eye on ^00:42
clarkbanyone will to review 600575 to speed up puppet runs, it passes tests00:43
*** spartakos has quit IRC00:45
*** dave-mccowan has joined #openstack-infra00:51
*** hamzy has joined #openstack-infra00:55
*** hongbin_ has joined #openstack-infra00:55
*** akhila has quit IRC01:01
*** Emine has quit IRC01:11
*** Emine has joined #openstack-infra01:11
ianwclarkb: yeah, sorry ... think it looks good, just getting my head around globs01:17
openstackgerritKendall Nelson proposed openstack-infra/storyboard-webclient master: Show Email Addresses when Searching  https://review.openstack.org/58971301:18
clarkbonce the current remote puppet afs finishes we'll run the else playbook and should update nl03 with inap max servers01:19
*** markvoelker has quit IRC01:19
clarkbhave logstash query ready to go as well `node_provider:"inap-mtl01" AND filename:"job-output.txt" AND message:"gzip console log and json output"`01:19
*** markvoelker has joined #openstack-infra01:20
clarkbthe message filter there is just to get single return per job to quickly check success vs failure01:20
*** bobh has joined #openstack-infra01:22
*** markvoelker has quit IRC01:24
zxiirocan I use shade to share an image with another tenant? or do I need to dig into the sdk for that01:24
clarkbzxiiro: openstackcloud.update_image_properties is probably what you want in shade01:27
clarkbit will do the json diff patching for you, you just supply the new values that you want iirc01:27
*** camunoz has quit IRC01:28
zxiirocool will dig into that01:28
*** ijw has joined #openstack-infra01:35
*** bobh has quit IRC01:37
clarkbnl03's nodepool.yaml has reenabled inap01:39
*** markvoelker has joined #openstack-infra01:41
clarkbnodes are coming online and getting used, I'll watch logstash for any abnormalities01:43
clarkbI wonder if this is the first time we've had our periodic jobs wrap around01:46
clarkbor are in danger of that at least01:46
clarkban openstack-tox-docs job has succeeded in inap-mtl0101:49
clarkbthat is a good first sign01:49
clarkbthats neat you can see gate resets in the nodepool utiliation graphs01:54
*** ijw has quit IRC01:55
clarkbreset did fail on inap but appears to be valid pylint fail in neutron01:55
clarkbhttp://logs.openstack.org/47/583847/2/gate/openstack-tox-pep8/f5daa43/job-output.txt.gz#_2018-09-07_01_52_01_80408401:55
clarkbthere are a lot of neutron changes too so we should expect a bunch of resets I bet01:56
*** ijw_ has joined #openstack-infra01:58
clarkbthat is the only inap failure indexed so far02:00
clarkb20 successes02:01
*** dhinesh has quit IRC02:01
*** ijw_ has quit IRC02:03
*** apetrich_ has joined #openstack-infra02:04
clarkbianw: ^ anything else you'd like checked before I call this good? I've also pinged the neutron channel about their gate resets02:04
ianwclarkb: nope, i'll keep an eye.  the only thing i think was wrong was the mirror "timeouts" (i was a little surprised when that turned out to be an infinite timeout with the server gone ;)02:05
*** apetrich has quit IRC02:07
*** diablo_rojo has quit IRC02:08
clarkbalright then, I'm finding dinner02:11
*** Bhujay has joined #openstack-infra02:12
*** longkb has quit IRC02:16
*** longkb has joined #openstack-infra02:17
*** ramishra has joined #openstack-infra02:22
*** r-daneel has quit IRC02:23
*** jamesmcarthur has joined #openstack-infra02:34
*** bhavikdbavishi has joined #openstack-infra02:36
*** psachin has joined #openstack-infra02:48
*** Bhujay has quit IRC02:56
*** jamesmcarthur has quit IRC02:59
*** ijw has joined #openstack-infra03:01
*** ijw has quit IRC03:05
openstackgerritMerged openstack-infra/storyboard-webclient master: Expose ordering criteria in search UI  https://review.openstack.org/57787003:07
*** r-daneel has joined #openstack-infra03:13
*** jamesmcarthur has joined #openstack-infra03:21
*** dave-mccowan has quit IRC03:25
*** eernst has quit IRC03:27
*** eernst has joined #openstack-infra03:30
*** annp has joined #openstack-infra03:32
*** eernst has quit IRC03:32
*** eernst has joined #openstack-infra03:34
*** nicolasbock has quit IRC03:36
*** eernst has quit IRC03:39
*** jamesmcarthur has quit IRC03:39
*** jamesmcarthur has joined #openstack-infra03:43
*** r-daneel has quit IRC03:47
*** r-daneel has joined #openstack-infra03:47
*** toabctl has quit IRC03:48
*** toabctl has joined #openstack-infra03:57
*** udesale has joined #openstack-infra04:05
*** spsurya has joined #openstack-infra04:10
*** pgadiya has joined #openstack-infra04:13
*** pgadiya has quit IRC04:13
*** rlandy has quit IRC04:16
*** jamesmcarthur has quit IRC04:16
*** jamesmcarthur has joined #openstack-infra04:17
*** Bhujay has joined #openstack-infra04:17
*** Bhujay has quit IRC04:18
*** Bhujay has joined #openstack-infra04:19
*** hongbin__ has joined #openstack-infra04:22
*** hongbin_ has quit IRC04:25
*** spartakos has joined #openstack-infra04:29
*** dhajare has joined #openstack-infra04:31
*** psachin has quit IRC04:34
AJaegerconfig-core, https://review.openstack.org/#/c/597598/ https://review.openstack.org/597590 and https://review.openstack.org/597582 are  repos that have migrated jobs now as part of python3-first, please +2A04:36
*** psachin has joined #openstack-infra04:36
AJaegerclarkb: no, this is not the first time with periodic jobs wrap around - but it's rare04:36
*** jamesmcarthur has quit IRC04:38
AJaegerconfig-core, please also review https://review.openstack.org/598487 https://review.openstack.org/600457 and https://review.openstack.org/60045704:41
*** jamesmcarthur has joined #openstack-infra04:42
*** dhinesh has joined #openstack-infra04:44
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: Retire developement-proposals (Step 2)  https://review.openstack.org/60064404:45
*** dhinesh_ has joined #openstack-infra04:46
*** dhinesh has quit IRC04:49
*** psachin has quit IRC04:51
*** psachin has joined #openstack-infra04:54
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: Retire developement-proposals (Step 2)  https://review.openstack.org/60064404:56
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: Retire developement-proposals (Step 4)  https://review.openstack.org/60064804:56
*** rcernin has quit IRC04:57
*** spartakos has quit IRC04:57
*** kjackal has joined #openstack-infra05:06
*** kjackal_ has joined #openstack-infra05:06
*** psachin` has joined #openstack-infra05:07
AJaegertimburke, corvus, the test results are back - http://logs.openstack.org/40/600540/1/check/swift-dsvm-functional/0a61993/job-output.txt.gz shows that while we initally install for devstack python-swiftclient from git, the tox role uses the package. The other job is fine. timburke, so we can move forward with your change!05:08
*** psachin has quit IRC05:09
*** masber has quit IRC05:12
*** spartakos has joined #openstack-infra05:12
*** jamesmcarthur has quit IRC05:12
*** jamesmcarthur has joined #openstack-infra05:16
*** akhila has joined #openstack-infra05:21
*** ssbarnea|bkp has quit IRC05:22
*** e0ne has joined #openstack-infra05:23
*** hongbin__ has quit IRC05:23
*** e0ne has quit IRC05:24
*** e0ne has joined #openstack-infra05:28
*** e0ne has quit IRC05:29
*** rcernin has joined #openstack-infra05:30
*** akhila has quit IRC05:41
*** e0ne has joined #openstack-infra05:43
*** r-daneel has quit IRC05:44
*** quiquell has joined #openstack-infra05:45
*** jamesmcarthur has quit IRC05:46
*** e0ne has quit IRC05:47
*** e0ne has joined #openstack-infra05:48
*** jamesmcarthur has joined #openstack-infra05:50
*** psachin` has quit IRC05:50
*** e0ne has quit IRC05:50
*** e0ne has joined #openstack-infra05:51
*** e0ne has quit IRC05:57
*** ijw has joined #openstack-infra06:00
*** ijw has quit IRC06:04
*** ijw has joined #openstack-infra06:05
*** ijw_ has joined #openstack-infra06:06
openstackgerritOpenStack Proposal Bot proposed openstack-infra/project-config master: Normalize projects.yaml  https://review.openstack.org/60065806:09
*** ijw has quit IRC06:10
*** cshastri has joined #openstack-infra06:10
*** pcaruana has joined #openstack-infra06:13
*** dhinesh_ has quit IRC06:14
*** jamesmcarthur has quit IRC06:15
*** jamesmcarthur has joined #openstack-infra06:16
*** e0ne has joined #openstack-infra06:16
*** e0ne has quit IRC06:18
openstackgerritTobias Henkel proposed openstack-infra/nodepool master: Fix label name in reported label stats  https://review.openstack.org/60065906:22
*** bhavikdbavishi has quit IRC06:23
*** bhavikdbavishi has joined #openstack-infra06:24
openstackgerritMerged openstack-infra/project-config master: remove job settings for storlets repositories  https://review.openstack.org/59759806:26
*** holser_ has joined #openstack-infra06:29
openstackgerritMerged openstack-infra/project-config master: remove job settings for kolla repositories  https://review.openstack.org/59759006:30
openstackgerritMerged openstack-infra/project-config master: remove job settings for ec2-api repositories  https://review.openstack.org/59758206:30
gmannclarkb: corvus ianw frickler last patch to finish Rocky testing setup work. It is passing now- https://review.openstack.org/#/c/591594/06:30
*** quiquell is now known as quique|rover|bbl06:36
*** hashar has joined #openstack-infra06:42
*** aojea has joined #openstack-infra06:43
*** jamesmcarthur has quit IRC06:51
*** ginopc has joined #openstack-infra06:57
*** jamesmcarthur has joined #openstack-infra06:58
*** jtomasek has joined #openstack-infra07:00
AJaegerfrickler: could you put https://review.openstack.org/598487 and https://review.openstack.org/600644 and https://review.openstack.org/#/c/600476/ on your review queue, please?07:03
AJaegerianw: thanks for reviews!07:03
*** quique|rover|bbl has quit IRC07:06
openstackgerritMerged openstack-infra/openstack-zuul-jobs master: Use tox-docs for specs repositories  https://review.openstack.org/60045707:06
*** dhajare has quit IRC07:06
*** quiquell has joined #openstack-infra07:06
*** quiquell has quit IRC07:07
*** quiquell|rover has joined #openstack-infra07:07
*** quiquell|rover has quit IRC07:07
*** quiquell|rover has joined #openstack-infra07:08
*** quiquell|rover has quit IRC07:11
AJaegerdirk: could you +A https://review.openstack.org/#/c/598973/ , please?07:11
*** florianf has joined #openstack-infra07:12
*** quiquell has joined #openstack-infra07:12
*** quiquell is now known as quiquell|rover07:12
dirkAJaeger: done07:12
AJaegerthanks, dirk07:13
openstackgerritMerged openstack-infra/project-config master: Normalize projects.yaml  https://review.openstack.org/60065807:13
openstackgerritMerged openstack-infra/project-config master: Remove references to pypi mirroring from dashboard  https://review.openstack.org/60036407:14
openstackgerritMerged openstack-infra/project-config master: Use tox-docs for specs publishing  https://review.openstack.org/60045807:16
*** ccamacho has joined #openstack-infra07:17
*** dhajare has joined #openstack-infra07:20
*** tesseract has joined #openstack-infra07:27
evrardjpmorning07:30
*** pguimaraes has quit IRC07:30
*** jamesmcarthur has quit IRC07:33
*** dhajare has quit IRC07:33
* SotK notices that the publish-openstack-javascript-content job appears to have become broken: http://zuul.openstack.org/builds.html?job_name=publish-openstack-javascript-content07:34
*** dhajare has joined #openstack-infra07:34
*** dhajare has quit IRC07:35
openstackgerritThierry Carrez proposed openstack-infra/puppet-ptgbot master: Make the FC SIG freed slots actually reservable  https://review.openstack.org/60066507:35
*** dhajare has joined #openstack-infra07:35
*** pblaho has joined #openstack-infra07:35
AJaegerSotK: http://logs.openstack.org/67/6776d05c12596c4dbd2b4c737b1cf6bb23884ef2/post/publish-openstack-javascript-content/2ebe59d/job-output.txt.gz#_2018-09-07_03_33_23_588274 is the problem...07:37
AJaegerSotK: let's ask fungi, he looked into some of this already earlier this week (or last one) (gpg temp signing key)07:38
openstackgerritTobias Henkel proposed openstack-infra/zuul master: WIP: Match tag items against containing branches  https://review.openstack.org/57855707:38
openstackgerritThierry Carrez proposed openstack/ptgbot master: Update unbook documentation  https://review.openstack.org/60052007:39
*** pguimaraes has joined #openstack-infra07:43
*** dtantsur|afk is now known as dtantsur07:46
openstackgerritAndreas Jaeger proposed openstack-infra/openstack-zuul-jobs master: Remove legacy-cloudkitty-dsvm-install  https://review.openstack.org/60066807:48
*** jpena|off is now known as jpena07:51
*** alexchadin has joined #openstack-infra07:52
*** ramishra has quit IRC07:58
*** tosky has joined #openstack-infra07:59
*** jpich has joined #openstack-infra08:01
*** shardy has joined #openstack-infra08:08
*** slaweq_ is now known as slaweq08:10
*** ssbarnea|bkp has joined #openstack-infra08:11
*** ramishra has joined #openstack-infra08:14
*** electrofelix has joined #openstack-infra08:14
*** ethfci has joined #openstack-infra08:16
*** spartakos has quit IRC08:18
hwoarangi notice that some opensuse jobs are failing with RETRY_LIMIT such as https://review.openstack.org/600272 but i can't figure out what exactly it is retrying for08:23
hwoarangany clues?08:23
AJaegerhwoarang: that's an indication that the pre.yaml failed08:25
AJaegerhwoarang: http://logs.openstack.org/72/600272/1/check/openstack-ansible-functional-opensuse-423/b23c425/ara-report/08:25
AJaegerSee the red x in front of pre.yaml? Check which one failed...08:25
hwoarangoh good point08:25
hwoarangoh mirrors again :(08:26
AJaegeryeah ;(08:26
hwoarangok infra mirrors are out of sync again08:28
hwoarang*sigh*08:28
* hwoarang will wait until the next sync before he investigates more08:28
*** olivierb has joined #openstack-infra08:29
*** e0ne has joined #openstack-infra08:32
*** threestrands has quit IRC08:33
openstackgerritMerged openstack-infra/project-config master: Add development-proposals to spec index  https://review.openstack.org/59848708:33
openstackgerritMerged openstack-infra/project-config master: Remove system-config project config  https://review.openstack.org/60047608:33
openstackgerritMerged openstack-infra/project-config master: Retire developement-proposals (Step 2)  https://review.openstack.org/60064408:33
*** dhajare has quit IRC08:33
hwoarangeven mirrors.kernel.org (the source of infra mirrors) seems to be lagging by a day compared to downloads.opensuse.org :/08:34
*** derekh has joined #openstack-infra08:40
*** ssbarnea|bkp has quit IRC08:43
*** psachin has joined #openstack-infra08:46
*** ssbarnea|bkp has joined #openstack-infra08:54
*** dhajare has joined #openstack-infra08:54
*** rcernin has quit IRC09:16
*** mriedem has joined #openstack-infra09:23
*** nicolasbock has joined #openstack-infra09:39
*** nicolasbock has quit IRC09:40
*** pblaho has quit IRC09:45
*** e0ne has quit IRC09:47
*** ssbarnea1 is now known as ssbarnea09:53
hwoarangdirk fyi^09:54
*** jtomasek has quit IRC09:55
*** gfidente has joined #openstack-infra10:07
hwoaranginfra-root could you please check again if there is something wrong with the opensuse-mirror script? opensuse mirrors seem to not have been synced since yesterday afternoon10:11
*** roman_g has joined #openstack-infra10:12
*** jd_ has quit IRC10:25
openstackgerritMarkos Chandras (hwoarang) proposed openstack-infra/system-config master: modules: mirror: opensuse: Ensure that old distros are wiped from mirrors  https://review.openstack.org/60070110:25
openstackgerritMarkos Chandras (hwoarang) proposed openstack-infra/system-config master: modules: mirror: opensuse: Ensure that old distros are wiped from mirrors  https://review.openstack.org/60070110:27
openstackgerritTobias Henkel proposed openstack-infra/zuul master: WIP: Match tag items against containing branches  https://review.openstack.org/57855710:27
*** jd_ has joined #openstack-infra10:28
*** alexchadin has quit IRC10:32
*** alexchadin has joined #openstack-infra10:33
*** alexchadin has quit IRC10:33
*** alexchadin has joined #openstack-infra10:34
*** alexchadin has quit IRC10:34
*** alexchadin has joined #openstack-infra10:35
*** alexchadin has quit IRC10:35
*** alexchadin has joined #openstack-infra10:35
*** alexchadin has quit IRC10:36
*** bhavikdbavishi has quit IRC10:38
*** dave-mccowan has joined #openstack-infra10:51
*** ramishra_ has joined #openstack-infra10:51
*** jtomasek has joined #openstack-infra10:53
*** ramishra has quit IRC10:53
*** pbourke has quit IRC11:00
*** jpena is now known as jpena|lunch11:01
*** pbourke has joined #openstack-infra11:02
*** annp has quit IRC11:13
*** udesale has quit IRC11:16
*** jamesdenton has joined #openstack-infra11:20
*** EmilienM is now known as EvilienM11:25
*** takamatsu has joined #openstack-infra11:27
*** nicolasbock has joined #openstack-infra11:29
*** eernst has joined #openstack-infra11:32
*** eernst has quit IRC11:37
*** r-daneel has joined #openstack-infra11:39
*** ssbarnea|bkp has quit IRC11:43
*** jamesmcarthur has joined #openstack-infra11:48
*** jpena|lunch is now known as jpena11:54
*** jamesmcarthur has quit IRC11:54
*** andreaf has joined #openstack-infra12:00
pabelangerrsync: failed to connect to rsync.opensuse.org (62.146.92.202): Connection timed out (110)12:01
pabelangerhwoarang: ^12:01
pabelangerthat's about all I see right now12:01
*** alexchadin has joined #openstack-infra12:05
odyssey4mepabelanger: I think https://review.openstack.org/#/c/600701/2 might be to address the issue?12:07
odyssey4mehwoarang: ^ ?12:07
*** alexchadin has quit IRC12:09
*** alexchadin has joined #openstack-infra12:11
*** elod has quit IRC12:13
*** elod has joined #openstack-infra12:13
*** efried is now known as fried_rice12:14
*** trown|outtypewww is now known as trown12:15
*** eharney has quit IRC12:17
*** jtomasek has quit IRC12:20
*** elod has quit IRC12:25
*** kjackal has quit IRC12:25
*** kjackal_ has quit IRC12:25
*** sthussey has joined #openstack-infra12:29
*** camunoz has joined #openstack-infra12:29
*** psachin has quit IRC12:31
*** longkb has quit IRC12:40
*** rlandy has joined #openstack-infra12:40
*** kjackal has joined #openstack-infra12:44
*** hashar is now known as hasharAway12:44
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: remove job settings for watcher repositories  https://review.openstack.org/59334012:49
*** e0ne has joined #openstack-infra12:49
AJaegerconfig-core, next python3-first migration to review, please - https://review.openstack.org/59334012:52
*** jamesmcarthur has joined #openstack-infra12:54
*** tmorin has joined #openstack-infra12:57
tmorinhi infra-root: I have https://review.openstack.org/599321 which is W+1 but not pushed to gate queue by zuul12:59
*** ssbarnea|bkp has joined #openstack-infra12:59
*** ssbarnea|bkp2 has joined #openstack-infra12:59
tmorininfra-root: I suspect this is because zuul believes that it's Depends-On is not merged12:59
*** ssbarnea|bkp2 has quit IRC12:59
*** ssbarnea|bkp has joined #openstack-infra13:00
tmorininfra-root:  but the Depends-On gerrit change *has merged* in this branch13:00
tmorinbut this Depends-On still ain't merged in other branches: is this what is confusing zuul ?13:00
toskynot an infra core, but maybe that's one of the reasons for using the new syntax of Depends-On13:01
AJaegertmorin: was the depends-on merged when the W+1 given13:01
tmorinAJaeger: yes13:01
AJaegertmorin: and yes, if you use "Ixxx", it waits for *all* changes to merge.13:01
tmorinAJaeger: ah ok... I thought it was doing that branch per branch13:02
*** bobh has joined #openstack-infra13:02
AJaegertmorin: either use new syntax Depends-On: URL for a single change - or remove and reapprove ;)13:02
tmorinAJaeger: wasn't that the behavior, at least some time ago ?13:02
AJaegertmorin: no, never13:02
tmorinAJaeger: ok...13:02
fungitmorin: the behavior with change-id type depends-on has always been that you're depending on every change in any project and any branch with that id13:03
openstackgerritMarkus Hosch proposed openstack-infra/zuul master: Reload tenant in case of new project branches  https://review.openstack.org/60008813:03
AJaegerfungi, could I trouble you to review 593340 , please?13:03
tmorinfungi: so it means that when backporting a fix with a Depends-On, the Depends-On very often has to be adjusted to the right change URL13:04
*** ansmith has joined #openstack-infra13:04
fungitmorin: sure, that could be seen as a feature i suppose since more people wanted to be able to depend on changes for specific branches rather than on all changes with the same id13:05
AJaegerah, frickler did it - thanks. fungi, no need for that one...13:05
fungicool, i was wondering why it needed looking at as it was already approved ;)13:06
AJaegerconfig-core, some more repos are ready for python3-first: https://review.openstack.org/598974 https://review.openstack.org/597597 https://review.openstack.org/59757413:06
fungiSotK: AJaeger: i'll see if we somehow regressed the js jobs by adding the artifact signing role13:06
AJaegerthanks, fungi13:06
SotKfungi: thanks13:06
tmorinfungi: I can leave with that, but if the behavior was "merge if the change with this I... in the same branch has merged" , that would possibly make this more confortable13:07
*** yamamoto has joined #openstack-infra13:08
fungitmorin: well, we've never had that. also zuul v3 adds the ability for us to do cross-project dependencies between gerrit changes and github pull requests. the latter need a url-specific dependency identifier anyway which is why using gerrit ids for depends-on is deprecated13:09
*** agopi has quit IRC13:10
fungiAJaeger: SotK: looking at an example failure, it looks like we failed to declare the gpg_key variable the sign-artifacts role expects13:11
fungibut also, this is a job which runs on non-release events so i don't think we should use that role there13:11
openstackgerritMerged openstack-infra/project-config master: remove job settings for watcher repositories  https://review.openstack.org/59334013:11
*** elod has joined #openstack-infra13:11
fungii wonder if we're reusing the same job for both branch tip tarballs and tagged release tarballs when it comes to javascript projects?13:11
*** aojea has quit IRC13:12
fungihttp://logs.openstack.org/67/6776d05c12596c4dbd2b4c737b1cf6bb23884ef2/post/publish-openstack-javascript-content/2ebe59d/ara-report/result/1b961c7f-6c10-4324-85ee-687cae71560e/13:12
tmorinfungi: really ? I would have thought the pattern to be not uncommon: fix X in project A allowing a fix in project B: merge X in master, stable/x, stable/y -- in parallel, merge the fix in B master, stable/x, stable/y ...13:12
*** myoung has joined #openstack-infra13:13
fungitmorin: it might be a common case, but it's not a feature we've had in the past13:13
tmorinfungi, AJaeger: ok... fair enough.. thanks for your explanations!13:13
AJaegerfungi, we have publish-openstack-javascript-tarball as post job13:14
fungitmorin: more often it seems like we ran into problems where a fix needed to merge to be backported to different branches to solve upgrade failures and so changes needed to depends-on fixes in other branches13:15
AJaegerfungi, indeed we use that content job in both post and release pipelines - while the nodejs4-publish-to-npm template uses different jobs13:15
fungii.e. backport to stable/queens needed a backport merged on stable/pike first and then the stable/rocky change depended on merging a fix to stable/queens...13:16
fungiAJaeger: nodejs4-publish-to-npm doesn't take care of our releases published to tarballs.o.o i guess, which are the ones we actually want merged13:17
fungii guess we need different versions of this job so that ones triggered from tags get signed and those triggered from ref updates don't13:17
AJaegerfungi, those are different - just wanted to point out that I think you're on the right track that we run same job in both post and release13:18
*** tmorin has left #openstack-infra13:19
*** eharney has joined #openstack-infra13:23
*** agopi has joined #openstack-infra13:30
*** jistr is now known as jistr|call13:32
*** kjackal has quit IRC13:35
*** rh-jelabarre has joined #openstack-infra13:36
*** dhajare has quit IRC13:38
*** jtomasek has joined #openstack-infra13:42
*** kjackal has joined #openstack-infra13:44
AJaegerdhellmann: I think OpenStackClient is also ready - want to un-WIP https://review.openstack.org/#/c/597577 ?13:53
dhellmannAJaeger : you're right; done13:54
AJaegerconfig-core, some more repos are ready for python3-first: https://review.openstack.org/598974 https://review.openstack.org/597597 https://review.openstack.org/597574 https://review.openstack.org/#/c/597577 - please review13:55
AJaegerdhellmann: that means roughly half of teams are done?13:56
dhellmann28/6613:56
dhellmann42%13:56
dhellmannonly 7 teams haven't started yet13:57
AJaegerplus those 4 - makes 32/66 -> 49 per cent...13:57
dhellmannwe have 406 open patches to import settings13:57
dhellmannyeah, I don't count them done until those cleanup patches merge :-)13:57
AJaegerfair13:57
AJaeger406 ;(13:58
dhellmanndone-ish :-)13:58
* AJaeger tried pushing some patches through the last days...13:58
dhellmannwe have several teams with 10s of open patches13:58
dhellmannonly 2 of the teams that haven't started have more than 10 repos13:58
dhellmannyeah, you've been a review machine13:58
dhellmannwe have a pretty high recheck rate on these, though13:59
dhellmannthe stable branches are in bad shape13:59
dhellmannit's even hard to land anything in master for a few projects13:59
AJaegerdhellmann: yeah, indeed, stable is in bad shape for many branches and repos13:59
AJaegerdhellmann: we should ask a couple of projects to retire repos ;)14:00
*** TheJulia is now known as needssleep14:00
dhellmannI need to pull some stats about which jobs caused rechecks14:00
dhellmanncharms hasn't started and they have 80 :-)14:00
* dhellmann has to step out for pre-travel errands14:00
AJaegerfun...14:00
*** eernst has joined #openstack-infra14:04
ssbarneai see lots of timeout failures on post collect which is apparently 30mins. Can we make this 35-40min as I bet this would avoid 9/10 timeouts happening there, as I seen it happening very close to the last tasks.14:05
openstackgerritMerged openstack-infra/nodepool master: Add metric for image build result  https://review.openstack.org/59041214:06
*** quiquell|rover has quit IRC14:08
*** alexchadin has quit IRC14:10
*** kjackal has quit IRC14:13
*** kjackal has joined #openstack-infra14:16
*** cshastri has quit IRC14:16
*** lbragstad is now known as elbragstad14:18
mnaserAJaeger, dhellmann: pushed those out14:21
*** holser_ has quit IRC14:22
fungissbarnea: which jobs?14:22
*** holser__ has joined #openstack-infra14:22
ssbarnea|bkpfungi the magic http://logs.openstack.org/84/583984/4/check/tripleo-ci-centos-7-containerized-undercloud-upgrades/d89eaff/job-output.txt.gz#_2018-09-07_13_28_44_19803614:23
Shrewsssbarnea|bkp: you should be able to change that by changing the value for https://zuul-ci.org/docs/zuul/user/config.html#attr-job.post-timeout14:24
hwoarangpabelanger: ok thank you. it seems that the kernel.org mirror is outdated14:25
AJaegerthanks, mnaser14:25
*** eernst has quit IRC14:25
ssbarnea|bkpShrews thanks, i am trying to see if this works. I seen timeouts too often.14:26
*** eernst has joined #openstack-infra14:26
AJaegerconfig-core, a trivial job removal for review, please - https://review.openstack.org/60066814:26
openstackgerritMerged openstack-infra/project-config master: remove job settings for Packaging-rpm repositories  https://review.openstack.org/59897414:29
openstackgerritMerged openstack-infra/nodepool master: Improve static provider to check non-ssh hosts  https://review.openstack.org/60007914:32
*** pcaruana has quit IRC14:33
openstackgerritMerged openstack-infra/project-config master: remove job settings for qinling repositories  https://review.openstack.org/59759714:35
openstackgerritMerged openstack-infra/project-config master: remove job settings for OpenStack-Helm repositories  https://review.openstack.org/59757414:35
openstackgerritMerged openstack-infra/project-config master: remove job settings for OpenStackClient repositories  https://review.openstack.org/59757714:35
openstackgerritMerged openstack-infra/nodepool master: Fix missing node state refresh in static driver  https://review.openstack.org/60008014:35
openstackgerritMerged openstack-infra/nodepool master: Fix use of wrong label static test fixture  https://review.openstack.org/60008114:35
dhellmannmnaser : thanks!14:41
corvusssbarnea|bkp: what is that playbook doing that takes 30+ minutes?14:42
corvusssbarnea|bkp: collect logs, i see -- but why does it take 30+ minutes to do that?14:43
*** anteaya has joined #openstack-infra14:49
*** spartakos has joined #openstack-infra14:52
ssbarneacorvus: this is something i plan to investigate and address (or at least ping those that can fix it). mainky upgrades to generate two set of tempest results which are converted to xml. also ARA xml generation is taking more as there are more tasks to run.14:54
*** apetrich_ has quit IRC14:58
corvusssbarnea: do you mean you run tempest in the post playbook?14:58
ssbarneano way, that should be part of the tempest. I will check and ping you. I am on PTO all next week so now I am trying to get wip work sorted.15:00
*** holser__ has quit IRC15:03
*** jistr|call is now known as jistr15:03
*** holser_ has joined #openstack-infra15:04
*** d0ugal has quit IRC15:04
*** gyee has joined #openstack-infra15:06
fungissbarnea: is it just the tripleo-ci-centos-7-containerized-undercloud-upgrades job which is timing out this way, or others too?15:06
*** ginopc has quit IRC15:07
openstackgerritDoug Hellmann proposed openstack-infra/project-config master: remove job settings for heat repositories  https://review.openstack.org/59728915:08
clarkbcorvus: ianw +2'd https://review.openstack.org/#/c/600575/ (afs ansible node selection fix) if you want to take another look and approve? I don't think we need to add yet another group15:09
fungissbarnea: another build which didn't time out for that same patchset seems to contain 53mib of data in 4392 files under the logs subdir... does that seem like a lot compared to your other jobs?15:09
clarkbhttps://github.com/openstack/tripleo-quickstart-extras/blob/master/roles/collect-logs/tasks/collect.yml it is a rather extensive set of things being collected. I'm guessing the -containers jobs are slower as they grab all the container logs15:13
clarkbfungi: ssbarnea probably need insight into how long each of those individual tasks within the play are taking15:14
*** jtomasek has quit IRC15:17
ssbarnea|bkpfungi not really, sounds like average to me. this reminded me of something i seen in the last two weeks, the task that was building the list of files was timing out, it was looking as stuck trying to create the log size stats.15:17
fungipossible we're more constrained for bandwidth/throughput collecting files from nodes in some providers than others?15:18
ssbarnea|bkpthe problem is that in this case we don't have the http://logs.openstack.org/84/583984/4/check/tripleo-ci-centos-7-containerized-undercloud-upgrades/5ac6e52/logs/quickstart_collect_logs.log15:18
ssbarnea|bkpthis was the success run, with 10min only, so there is a chance that the timeout is caused by some kind of stuck task.15:19
*** e0ne has quit IRC15:21
fungiyeah, ara reports the elapsed time on the post-logs.yaml playbook in that run as a whipping 6 seconds15:21
fungis/whipping/whopping/15:21
*** shardy has quit IRC15:21
corvusclarkb: yes, sorry i thought i got that yesterday.  +315:21
*** akhila has joined #openstack-infra15:22
corvusfungi: https://review.openstack.org/600558 could use a look from you when you have a sec15:23
fungissbarnea|bkp: oh, yeah so even on that 5ac6e52 build we're missing details on the fnial post-logs.yaml task15:23
*** akhila has quit IRC15:23
*** d0ugal has joined #openstack-infra15:25
clarkblooks like we cleared the backlog overnight15:28
clarkbappears neutron may have addressed the pep8 job failures too that were reseting the gate often15:29
*** fried_rice has quit IRC15:29
*** ramishra_ has quit IRC15:30
openstackgerritMerged openstack-infra/openstack-zuul-jobs master: Remove legacy-cloudkitty-dsvm-install  https://review.openstack.org/60066815:30
clarkbfungi: https://review.openstack.org/#/c/600557/ is likely also of interest15:31
clarkbshould fix cloud launcher on bridge.o.o, looks like this particular issue was affecting cloud launcher on puppetmster too15:32
*** efried has joined #openstack-infra15:32
dmsimardfungi: the post playbook in ara is sort of flawed because the report is saved before the playbook actually ends15:33
fungiright, that last task is the one which collects it15:33
fungii don't know that i'd call it "flawed" just a known caveat15:33
dmsimardYup, just wanted to make sure that was understood15:33
*** Dobroslaw has quit IRC15:33
dmsimardSure15:33
clarkbthis is why the old jenkins scp plugin console log copy had a hack of spawning a thread that would continue running after the job completed to finish copying the console log in its entireety15:35
clarkbwhat this did was move the hard to debug point from within the job context to the jenkins master15:35
clarkbmore use friendly but ugh pain to debug15:35
hwoaranginfra-root: could you confirm with me that mirror-update.openstack.org/23.253.92.251 is the host which runs the various distro mirror scripts?15:37
fungihwoarang: for the moment it is. also 2001:4800:7818:103:be76:4eff:fe04:5f3a15:37
fungihwoarang: it can change over time though, we don't guarantee a stable source address there15:38
pabelanger104.130.141.75 is mirror-update for me15:38
fungier, right15:38
fungimirror-update.openstack.org is an alias for mirror-update01.openstack.org15:38
fungiand mirror-update01.openstack.org is 104.130.141.75 and 2001:4800:7818:103:be76:4eff:fe04:5f3a15:38
hwoarangoh it changed because on stage.opensuse.org we had 23.253.92.251 on whitelist15:38
hwoarangok so a dynamic ip will not work  in that case. need to find another way then15:39
fungihwoarang: aha, yes whitelisting the address for our mirror updater won't really be a good solution15:39
clarkbcorvus: https://review.openstack.org/#/c/600562/1 does make the logging much better, but ianw points out we continue to leak that info to the main console log. I guess ansible writes to the logfile and to stdout by default? I +2'd because I think it is an incremental improvement as is15:39
fungihwoarang: it's not so much that it's a dynamic ip address, more that our infrastructure is dynamic and we upgrade servers by rebuilding and replacing them15:40
fungiso servers are getting replaced with other servers which have different ip addresses15:40
hwoaranghmm i see15:40
fungiwe keep a static cname resource record in dns pointing to whatever the current server is for that service15:41
fungiand update the cname whenever the service moves to a different/new server15:41
hwoarangok so the mirror-update.openstack.org hostname is static15:42
hwoarangok i will check if we can do whitelisting based on that15:42
openstackgerritMerged openstack-infra/system-config master: Serialize puppet on afs servers properly  https://review.openstack.org/60057515:42
fungistatic for the foreseeable future, though i could imagine replacing it with a ci job at some point so we no longer relied on a persistent server to do that work at all15:43
*** ssbarnea|bkp has quit IRC15:43
hwoarangi see15:43
*** camunoz has quit IRC15:43
fungiand i think we don't want our mirror servers to be in any sort of privileged position for the data we mirror. it's something that has to be kept up/communicated over time if we end up trying to maintain such a relationship, and also means additional hurdles for anyone else who wants to deploy a similar solution15:44
corvusclarkb: yeah, let's push it in so it's there when we need to debug apply test15:44
clarkbansible run just started 15 minutes ago so about an hour and a half until we see 600575 tak effect15:46
corvusclarkb: oh?  what did our full runtime end up being after switching back to -f20?15:47
clarkbcorvus: it lowered it by a few minutes but because we trigger on a 15 minute cron period the effective rotation was the same iirc15:47
clarkbcorvus: I think the -f 1 with ~30 nodes is dominating the runtime as that cost is about half an hour15:48
corvusclarkb: hrm, last 6 runs look to be 75 mins15:48
corvusclarkb: ok.  let's let that land, then do -f5015:48
pabelangercorvus: clarkb: fungi: mordred: With ptg next week, do we have any thoughts are ara-reports for bridge.o.o? EG: enable apache2 on bridge.o.o to host them, or some other process with new server?15:52
clarkbpabelanger: I think we should have reporting like that, we just need to make sure we aren't leaking sensitive info first. Apparently the sensitive info we are logging is a bug and mordred intended on fixing that so this should be doabl15:53
corvuspabelanger: good question!  we also need a database.15:53
clarkbwe probably want a bigger server if we are going to run a webserver on it too15:53
corvusclarkb: i don't think the openstacksdk stuff will go into ara.  that's python logging module stuff; shouldn't touch the ara callback.15:53
*** yamamoto has quit IRC15:53
corvusbut we may need to think about setting no_log in places15:54
*** yamamoto has joined #openstack-infra15:54
*** dtantsur is now known as dtantsur|afk15:54
clarkbya I think ara will dump host and group vars into output in places15:54
* corvus looks at cloud-launcher15:54
corvuswhatever we do -- we should set it up firewalled off to start so we can inspect it manually15:55
pabelangerclarkb: re security, I figure once we decided which server ara runs on, we can first keep it firewalled off to http, then infra-root and do some audit of playbook runs (kinda how we first disabled jenkins UI). Then confirm things good and open to public15:55
pabelangerwhat corvus said15:55
clarkbcorvus: ++15:55
corvuspabelanger: good idea :)15:55
clarkband pabelanger too15:55
pabelangerwould we want local DB on server or trove?15:56
corvusi guess trove is still what we're doing15:56
openstackgerritMerged openstack-infra/system-config master: Use git-servers group in remote_puppet_else  https://review.openstack.org/60057615:56
openstackgerritMerged openstack-infra/system-config master: Up profiled tasks limit  https://review.openstack.org/60025615:56
openstackgerritMerged openstack-infra/system-config master: Update launch README for bridge  https://review.openstack.org/60055815:56
corvusi think the main question is -- how to install and run ara?  do we want to make it one of the first 'run in container' services?15:56
clarkbfor something like this I'm not sure it matters too much, it isn't state we need to persist carefully15:57
clarkbre db15:57
clarkbas for install and run ara, if it is going on the bridge I think that it would be ansibled15:57
corvusyeah, but trove is still pretty easy15:57
corvusclarkb: right, but installed globally?  in a venv?  seems like container would be better?15:57
*** Bhujay has quit IRC15:58
*** yamamoto has quit IRC15:59
openstackgerritMarkos Chandras (hwoarang) proposed openstack-infra/system-config master: Revert "Use Kernel.org mirror for OpenSUSE"  https://review.openstack.org/60080516:00
hwoaranginfra-root: can you help me get ^^ in to resolve the CI opensuse failures for the time being? :/ TIA16:00
hwoarangdirk fyi^16:00
fungigonna grab lunch and run pre-trip errands but should be back soonish16:00
clarkbcorvus: maybe? I don't actually know what ara looks like as a service16:00
clarkbI've only used it as generate a thing or weird apache middleware16:00
*** jamesmcarthur has quit IRC16:06
openstackgerritMerged openstack-infra/system-config master: Write ansible log to fileout in apply test  https://review.openstack.org/60056216:06
*** jamesmcarthur has joined #openstack-infra16:06
corvusbacklog is okay today (everyone is packing?)  so how about i restart zuul in a few minutes?16:09
clarkbwfm16:09
pabelanger31280416:09
pabelangeroops16:10
*** jamesmcarthur has quit IRC16:10
openstackgerritMerged openstack-infra/system-config master: Don't set security groups in openstackci-packethost  https://review.openstack.org/60055716:10
pabelangerclarkb: corvus: fungi: I think we can do virtualenv first for ara, and if that goes well move into container later?16:11
pabelangerbut yah, ara on bridge.o.o, which then stores into trovedb16:11
corvuspabelanger: hrm.  i wonder why we don't just do container?  isn't that what we decided in the spec?16:11
pabelangerwe then should be able to stand up ara.openstack.org, another minimal server16:11
pabelangercorvus: I mean, i guess we could. I'm not sure yet how that looks from ansible pov, I guess docker_image task should be okay16:12
corvuspabelanger: seems like a great thing to work out at the ptg :)16:12
pabelangerindeed16:12
pabelangerand pbrx jobs for ara also16:12
corvusdmsimard: is there an existing published ara server container image?16:13
*** holser_ has quit IRC16:14
*** felipemonteiro has joined #openstack-infra16:14
*** felipemonteiro_ has joined #openstack-infra16:16
corvusdo we think the zuul_restart playbook is working now?  i think the issues were the init scripts which got fixed?16:16
corvusi'm going to give it a shot :)16:17
*** felipemonteiro has quit IRC16:19
corvusthe web.pid file was not removed16:19
*** mgagne has left #openstack-infra16:20
*** mgagne has joined #openstack-infra16:20
corvuswe're generating ssh keys for projects16:21
*** mriedem is now known as mriedem_bbq16:22
pabelangercool!16:23
*** ssbarnea|bkp has joined #openstack-infra16:23
corvusthat's done; loading branches now16:24
*** bnemec is now known as beekneemech16:24
*** olivierb has quit IRC16:24
corvuscat jobs16:25
corvusthe rest of the restart playbook seems to be going well; most of the ze's have stopped.  it's just waiting for the last few to restart them16:25
corvusmergers are already up16:25
clarkbwhat was the fix for the init scripts, I don't remember that going in. I'm guessing waiting for service to actually stop and clean up pids on its own?16:26
corvusclarkb: yep16:26
corvusbut it looks like zuul-web didn't delete its own pidfile for some reason16:26
*** dhinesh has joined #openstack-infra16:26
corvusthe restart playbook is pretty usable in the current state though.  you just have to delete web.pid manually once it stops.  that's the only hiccup so far.16:27
corvusand the executors have started again16:27
clarkband then start zuul-web manually?16:27
*** spartakos has quit IRC16:27
corvusclarkb: no, playbook will do that16:27
corvushttp://paste.openstack.org/show/729697/16:27
*** e0ne has joined #openstack-infra16:28
corvusclarkb: it just gets stuck at "wait to stop zuul web", then you go delete web.pid, and then it resumes16:28
clarkbgotcha16:28
corvusit's back up; re-enqueing now16:28
corvuswe need to check on the logs for a job to make sure we can still upload them correctly (and haven't broken secrets somehow)16:29
*** dhinesh has quit IRC16:30
*** florianf has quit IRC16:30
*** jamesmcarthur has joined #openstack-infra16:32
clarkbcorvus: http://logs.openstack.org/27/596427/4/check/tox-py35/1e8a9e6/ other than a suspicuously short url that lgtm16:32
corvus\o/16:32
corvusenqueue is finished16:33
*** jpich has quit IRC16:34
*** yamamoto has joined #openstack-infra16:35
corvusso i think we're ready to try some CD stuff next week :)16:36
corvushttp://zuul.openstack.org/api/project-ssh-key/openstack-infra/system-config.pub16:36
corvuswell, as soon as https://review.openstack.org/599073 lands16:37
*** e0ne has quit IRC16:42
*** hasharAway is now known as hasharDinner16:43
clarkbrun with the afs fix in it has just started16:46
dirkhwoarang: thanks. I'm a bit concerned about gwdg.de again, see inline comment16:46
dirkhwoarang: we had issues with tumbleweed, which is pretty large, and gwdg.de is pretty slow in the  US16:46
dirkit took more than the AFS timeout to sync which tripped something over16:47
*** felipemonteiro_ has quit IRC16:53
hwoarangdirk: but tw is still using rsync.o.o so it should be ok right?16:54
hwoarangdirk: maybe we can try the leaseweb mirrors in the US? i have 0 metrics on which mirrors are stable and which are not so it's a shot in the dark16:56
openstackgerritMarkos Chandras (hwoarang) proposed openstack-infra/system-config master: Revert "Use Kernel.org mirror for OpenSUSE"  https://review.openstack.org/60080516:58
hwoarang^^ with the leaseweb mirror16:59
*** myoung is now known as myoung|dr16:59
*** Swami has joined #openstack-infra17:01
*** derekh has quit IRC17:01
*** jpena is now known as jpena|off17:01
*** e0ne has joined #openstack-infra17:02
*** psachin has joined #openstack-infra17:02
evrardjpthanks hwoarang17:09
*** trown is now known as trown|lunch17:09
*** e0ne has quit IRC17:11
clarkbdoing more ansible performance debugging and significant amount of time seems to be spent generating inventory from our inventory sources. `sudo ansible logstash-worker01.openstack.org -vvv -m ping` illustrates this17:16
clarkbthis cost is probably relatively low for our puppet_run_else playbook but high for others that execute against a small number of nodes17:17
*** tesseract has quit IRC17:20
corvusclarkb: there's a cache, but i'm not sure how it works or what the timeout situation is17:21
*** caphrim007 has joined #openstack-infra17:22
dmsimardcorvus: I know that harlowja and SpamapS run ara in a container but I haven't set up anything "official"17:23
dmsimardThere's two ansible roles that I know of, https://github.com/openstack/ansible-role-ara (that I'm trying to make a thing eventually) as well as https://github.com/robertdebock/ansible-role-ara17:23
corvusdmsimard: interested in publishing containers automatically on every commit to dockerhub?17:23
corvuss/containers/images/17:24
clarkbcorvus: oddly strace seems to imply that reading the cache for all nodes is at least part of the slowness17:24
*** jtomasek has joined #openstack-infra17:24
corvusShrews: ^ both of these conversations may interest you17:24
*** ssbarnea|bkp2 has joined #openstack-infra17:25
dmsimardcorvus: I don't have a personal use case for that yet but sure17:26
*** ssbarnea|bkp has quit IRC17:26
corvusdmsimard: so ara (short for ansible-role-ara) installs ara in a venv, yeah?17:26
clarkbit appears to read all the contents of /var/cache/ansible/facts to run against a single node. I'm guessing it does this as part of determining group membership to determine where to run the module/play17:26
*** ijw_ has quit IRC17:27
*** jmorgan1 has quit IRC17:28
dmsimardcorvus: the default behavior of the openstack ara role is to install the latest version from pypi in a virtualenv and have the built-in webserver run as a systemd service, yes.17:28
dmsimardcorvus: ideally I want to get to a point where the role can be used to test ara itself, I've been experimenting on and off when time allows: https://review.openstack.org/#/c/557933/17:28
openstackgerritMerged openstack-infra/zuul-jobs master: add-build-sshkey: Remove only the master key  https://review.openstack.org/59907317:29
corvusdmsimard: thanks -- we might use that to set up our system-config ara, or we might go the container route, and if you don't object, throw some patches at the ara repo to do those image builds17:29
dmsimardFWIW I'm going to take a first good look at the usability of the ara 1.0 backend, API and callback next week with tristanC17:29
dmsimardI landed what I think are the last big schema changes17:30
dmsimardlet me show you something17:31
corvuscan plays have names?17:33
clarkbcorvus: Shrews reading the docs on fact_caching config option implies it is host specific (because well facts are host specific) so I don't think ansible should be reading all of the fact caches when running against a single node. It should load that node's facts from cache only17:33
clarkbI don't know how much a real world timesaving this would be with our large puppet_run_else playbook but seems like a bug in ansible? maybe someone with a bit more knowledge of how ansible is supposed to work on startup can check17:34
clarkbcorvus: we also seem to rarely use as many forks as we have allowed for17:35
corvusclarkb: seems like the big win on forks would be puppet_else, yeah?17:35
clarkbbut that may just be difference between human poll rate and cpu execution rate :)17:35
clarkbcorvus: ya17:35
clarkbI've watched else do 1-19 forks just doing a poll via ps17:36
clarkbcorvus: also base since that runs against everything17:36
*** psachin has quit IRC17:36
*** jmorgan1 has joined #openstack-infra17:37
*** spartakos has joined #openstack-infra17:39
*** diablo_rojo has joined #openstack-infra17:40
clarkbfor those of you in channel attending the PTG I just sent email to the list making Monday evening beer garden team dinner official. Suggest we meet at 6pm in the PTG hotel lobby to group up into taxi/uber/lyft travel17:45
*** roman_g has quit IRC17:45
clarkbdhellmann: diablo_rojo smcginnis ttx infra-root config-core ^17:45
diablo_rojoclarkb, cool :) Thanks!17:46
dhellmannclarkb : sounds good17:47
dmsimardclarkb: http://ovh.dmsimard.com:8000/api/v1/17:48
dmsimarder, that was meant for corvus17:49
dmsimardah, found a bug :D17:49
dmsimardsec17:50
clarkbcorvus: ianw afs fix got our period down to one hour, was an hour 15-30 minutes prior17:50
clarkbnot bad, but plenty more to improve upon I think. corvus is there a -f 50 change yet?17:51
corvusclarkb: not yet, about to write it17:51
*** mriedem_bbq is now known as mriedem17:52
openstackgerritJames E. Blair proposed openstack-infra/system-config master: Name plays in playbooks  https://review.openstack.org/60082717:52
openstackgerritJames E. Blair proposed openstack-infra/system-config master: Increase forks to 50  https://review.openstack.org/60082817:53
corvusclarkb: ^17:54
clarkbinfra-root ^ any chance you can review 600828 real quick? We are making progress on improving ansible performance pre PTG which will be helpful as we try to do things at the PTG17:55
*** electrofelix has quit IRC17:55
clarkbcorvus: reading 600827 I think we may want to set strategy: free on more of the base plays17:56
clarkbthey are largely indepednent and we don't need to do things in lock step17:56
*** electrofelix has joined #openstack-infra17:56
corvusclarkb: yeah, i think you're right17:56
clarkbthat should get us better thread utilization17:57
clarkbI'll write that change17:59
*** electrofelix has quit IRC17:59
dmsimardcorvus: okay, sorry -- here you go: http://ovh.dmsimard.com:8000/api/v1/17:59
openstackgerritJames E. Blair proposed openstack-infra/system-config master: Move the !ci-backup play next to the rest of the servers  https://review.openstack.org/60083317:59
dmsimardsent a patch to add support for names :) https://review.openstack.org/#/c/600834/18:01
openstackgerritClark Boylan proposed openstack-infra/system-config master: Run base playbook with strategy free  https://review.openstack.org/60083518:02
*** ijw has joined #openstack-infra18:02
corvusdmsimard: how does a playbook name work?18:03
corvusdmsimard: and hey, look, it's an ara 1.0!  :)18:03
corvusdmsimard: i mean, where would you put the name for a playbook?18:04
corvus(since a playbook is strictly a list of plays aiui)18:04
openstackgerritJames E. Blair proposed openstack-infra/system-config master: Remove !ci-backup play  https://review.openstack.org/60083718:04
clarkbcorvus: the filename maybe? thought you can use --- to have multiple documents in a single file18:05
dmsimardcorvus: What you're looking at is the backend and the API -- I haven't settled on the best approach to give playbooks labels and names from a client perspective yet.18:05
dmsimardcorvus: It could be a simple ansible module or something else, what would be a convenient way for you to do it ?18:06
dmsimardFrom an API perspective, they can be done as POST or PATCH so it doesn't need to be at runtime18:07
*** bharath_t has joined #openstack-infra18:07
*** eharney has quit IRC18:07
corvusdmsimard: i'd love it if that were an ansible thing, like if ansible allowed you to name playbooks and that was the ara interface.  if i were to suggest an implementation, i'd suggest a sort of "meta" play that you add to a playbook to name it (a la zuul's pragma config item).  or maybe a special comment syntax.18:08
*** trown|lunch is now known as trown18:09
corvusi think it would feel weird to have special ara modules in playbooks.  i like that ara's heisenburg factor is low.  :)18:09
dmsimardsomething I was thinking about was to use ansible vars18:10
dmsimardi.e, ansible-playbook -i foo -e "playbook_name=foo playbook_label=bar"18:11
corvusdmsimard: oh, for the zuul use case i think that would be great18:11
*** dhinesh has joined #openstack-infra18:12
dmsimardI'm pretty excited about the use of labels FWIW, it's what I ended up going for to categorize playbooks. It's very generic so it could be used to identify phases (such as pre/run/post), or results (success/failure), etc.18:12
*** anteaya has quit IRC18:13
dmsimardoh, a new addition is the notion of host aliases -- to track a single host across multiple playbook runs18:13
*** anteaya has joined #openstack-infra18:14
dmsimardi.e, click on a host to see what playbooks it ran and what were the results18:14
*** jamesmcarthur has quit IRC18:16
SpamapSdmsimard: coming to Denver? I'd love to get some ARA hacking time in with you.18:17
SpamapSIncluding possibly contributing our containerized ara stuff somewhere.18:17
*** mshalamov has quit IRC18:17
dmsimardSpamapS: I'm not, I'll be in Ausstin next month though18:18
dmsimardAustin*18:18
SpamapSOh we can do it there then.18:18
SpamapSI'll be at AnsibleFest too18:18
SpamapSlabels++ btw18:18
bharath_tclarkb, do we need to create project manually in launchpad.net for bugs/blueprints?18:19
*** ijw has quit IRC18:19
*** _Cyclone_ has quit IRC18:19
clarkbbharath_t: I think your projects specifies they would use storyboard in the yaml file. Storyboard projects will have been auto created for you. The project creators doc should have all this info for you too18:20
*** gfidente has quit IRC18:21
*** _Cyclone_ has joined #openstack-infra18:23
bharath_tclarkb, okay got it ... I misunderstood18:24
dhineshis there any latest documentation\guide which would help in building a third-party CI from scratch?18:26
clarkbdhinesh: https://docs.openstack.org/infra/system-config/third_party.html18:26
openstackgerritGoutham Pacha Ravi proposed openstack-infra/project-config master: remove job settings for manila repositories  https://review.openstack.org/59389818:26
fungidhinesh: does the current one not work?18:26
*** jamesmcarthur has joined #openstack-infra18:29
dhineshfungi: unfortunately the working version was compromised unintentionally by someone when I was on break18:32
dhineshfungi: I am using the original https://docs.openstack.org/infra/openstackci/third_party_ci.html to bring it up. i am using older versions of nodepool like 0.4.0 and zuul 2.5.1 and they seem to be be having issues, when i try to push images built images using nodepool-builder18:34
fungidhinesh: ahh, that's what i was wondering. we've of course moved on to newer zuul/nodepool without jenkins which means job rewrites and all that, so had hoped the openstackci third-party ci setups would continue to use the older versions of stuff18:35
*** diablo_rojo has quit IRC18:35
*** diablo_rojo has joined #openstack-infra18:35
*** bobh has quit IRC18:36
*** bobh has joined #openstack-infra18:36
openstackgerritMerged openstack-infra/devstack-gate master: Update grenade settings for stable/rocky  https://review.openstack.org/59159418:39
dhineshfungi: yeah that is what i was hoping as well, to build the way i had originally setup, but am running into errors with python modules18:41
smcginnisLooks like we may have an oddity with the docs.openstack.org site and the publish-openstack-tox-docs-direct job.18:41
smcginnisIt's getting a redirect to /latest/ when it shouldn't.18:41
smcginnisSo using https://docs.openstack.org/operations-guide/index.html works.18:41
smcginnisUsing https://docs.openstack.org/operations-guide/ redirects and 404's.18:42
fungidhinesh: can you link those "errors with python modules" from details on http://paste.openstack.org/ ?18:42
smcginnisIs that an openstackdocstheme thing that does the redirect or something with our site config?18:42
fungismcginnis: i think the redirects to latest/ are in a .htaccess file in the openstack-manuals repo18:43
smcginnisfungi: This isn't in -manuals.18:43
fungiis the operations guide intended to be unversioned or release-specific?18:43
smcginnisUnversioned.18:43
smcginnisIt uses publish-openstack-tox-docs-direct, which appears to be right.18:43
dhineshfungi: sure http://paste.openstack.org/show/729706/18:44
smcginnisAs the redirected docs are not there as expected.18:44
smcginnisSo really just need to understand where the redirect is coming from.18:44
fungismcginnis: and you're sure it's not https://git.openstack.org/cgit/openstack/openstack-manuals/tree/www/.htaccess18:45
openstackgerritsebastian marcet proposed openstack-infra/openstackid-resources master: Added new endpoints for Track Tag Groups  https://review.openstack.org/60084518:45
smcginnisfungi: I don't see anything similar in this repo - http://git.openstack.org/cgit/openstack/operations-guide/tree/18:46
fungismcginnis: what i'm trying to say is the openstack-manuals www/.htaccess file is deployed to the root of https://docs.openstack.org/18:47
smcginnisIt had been working.18:47
fungiwhen did it first break?18:47
smcginnisNot sure really. It had been fine and we've gotten a few updates.18:47
smcginnisThen today I was prep'ing an etherpad for the ops meetup to talk about it and realized it isn't working anymore.18:48
corvushttps://docs.openstack.org/.htaccess18:48
fungihuh, interesting that apache doesn't refuse to serve that file18:48
fungii thought normally it wouldn't serve dotfiles18:48
smcginnisSure enough, operations guide is in the list of redirectmatch 30118:50
smcginnisSo it's now pulled into here: https://git.openstack.org/cgit/openstack/openstack-manuals/tree/www/.htaccess#n19818:50
smcginnisWhere is REGULAR_REPOS set?18:51
fungihttp://codesearch.openstack.org/?q=REGULAR_REPOS18:51
smcginnisYeah, looking at those.18:52
fungiseems tools/www-generator.py sets it18:52
openstackgerritJames E. Blair proposed openstack-infra/project-config master: Add cherrypy to third-party ci  https://review.openstack.org/60084618:52
fungismcginnis: hah, i wonder if the readdition to governance is what broke it ;)18:53
smcginnisOh, that could be. I don't think I've looked at it since that patch was still pending out there.18:53
fungiseems there's a loop over the governance sigs list in the _get_official_repos() function18:54
fungii wonder if it's no longer parsing that file successfully18:54
*** ijw has joined #openstack-infra18:54
fungithe schema for it did change with the addition of the security-sig repos18:54
smcginnisI don't see any kind of exclusion list there.18:54
*** efried is now known as fried_rice18:56
fungino, _get_official_repos() does also seem to add the _GOVERNANCE_SIGS_URL repos to the regular_repos list after all18:56
fungidhellmann: ^ if you're still around, you had some hand in designing that docs transition... do you recall if there was an escape hatch built in for repos publishing documentation which didn't want a redirect to latest/ ?18:57
smcginnisoperations-guide appears to be the only one using that publish-direct job18:57
*** r-daneel has quit IRC18:58
dhellmannfungi : I don't know if that case came up specifically18:59
smcginnisLooks like it would be easy enough to add an exlusion check in the www-generator. But where should it go instead?18:59
smcginnisErr, I suppose nowhere, since it shouldn't redirect at all?19:00
dhellmannyeah19:00
smcginnisOK, I'll add an exclusion list of one for now.19:01
dhellmannmaybe this repo shouldn't be in the REGULAR_REPOS list19:02
smcginnisExactly.19:02
fungiright, i think smcginnis is looking at how to exclude it from REGULAR_REPOS19:03
dhellmannok19:03
fungiright now that list is autogenerated by https://git.openstack.org/cgit/openstack/openstack-manuals/tree/tools/www-generator.py#n36419:03
dhellmannI wonder what other redirects that's going to disable19:03
fungiand just includes everything official19:03
dhellmannoh, that seems to be the only place REGULAR_REPOS is used19:03
clarkbcorvus: any idea if cherrypy is interested in that? (or are you planning to not report for now (I think that requires a new pipeline though))19:03
smcginnisIf this repo clone ever finishes, you can check if I did it right. ;)19:04
dhellmannI'm going to be going offline shortly, but I can look at it over the weekend once I'm in denver19:04
smcginnisThanks dhellmann19:05
smcginnisI'm presenting it Tuesday morning, so I hope we have plenty of time to clear things up by then.19:05
dhellmannyeah, we should be able to fix it by then19:05
corvusclarkb: not planning on reporting; just want to use it in a required-projects line19:05
dhellmannplan to talk slowly ;-)19:05
corvus(at some point, i'll fix zuul so you don't need to do that to add something to required-projects...)19:06
smcginnisdhellmann: ;)19:06
*** jamesmcarthur has quit IRC19:06
*** eernst has quit IRC19:07
smcginnisdhellmann: Oh, I can just add it to http://git.openstack.org/cgit/openstack/openstack-manuals/tree/tools/www-generator.py#n349 I think?19:08
openstackgerritMerged openstack-infra/system-config master: Name plays in playbooks  https://review.openstack.org/60082719:11
openstackgerritJeremy Stanley proposed openstack-infra/project-config master: Revert "add signing steps to javascript publishing job"  https://review.openstack.org/60085019:13
openstackgerritJeremy Stanley proposed openstack-infra/project-config master: Revert "add signing steps to javascript publishing job"  https://review.openstack.org/60085019:14
openstackgerritMerged openstack-infra/system-config master: Increase forks to 50  https://review.openstack.org/60082819:14
*** jamesmcarthur has joined #openstack-infra19:15
*** anteaya has quit IRC19:16
*** graphene has joined #openstack-infra19:18
*** graphene has quit IRC19:23
*** graphene has joined #openstack-infra19:25
*** beekneemech has quit IRC19:29
*** r-daneel has joined #openstack-infra19:30
zxiiroanyone here know how to deal with https://docs.openstack.org/bandit/latest/blacklists/blacklist_calls.html#b310-urllib-urlopen19:30
zxiiroIt says to audit for URL schemes, and I do a re.match() to check that it's only what I want but I'm not sure how to satisfy whatever the checker is checking for.19:30
clarkbzxiiro: I'm not sure you may have to flag the line with # noqa or similar?19:31
clarkb(and assert you've preventing unexpected local opens)19:31
zxiiroyeah I figure that'd be the easiest solution but was curious if there was a more correct way to assert that I've done my homework.19:32
*** EvilienM is now known as EmilienM19:33
fungizxiiro: is the code hitting that public?19:34
clarkbfungi: care to review https://review.openstack.org/#/c/600835/ ? I think that may help speed up ansible runtime19:35
zxiirofungi: yep, take a look at this https://gerrit.linuxfoundation.org/infra/#/c/12498/4/lftools/openstack/image.py19:35
*** r-daneel_ has joined #openstack-infra19:35
zxiiroI added "# nosec" to the line in question to make bandit happy but I'm not sure if that's the right approach19:36
*** jbadiapa has quit IRC19:36
*** r-daneel has quit IRC19:37
*** r-daneel_ is now known as r-daneel19:37
corvusclarkb: looks like the run after the current one will be -f5019:38
clarkbcorvus: yup19:38
fungizxiiro: i'll see if i have any ideas, though the bandit authors are officially hanging out in ##python-code-quality these days19:38
*** ijw has quit IRC19:39
zxiirofungi: sure no worries I can ask them too. Thanks for the direction.19:39
clarkbfwiw my concern there would be more worrying about verifying the downloaded image hash prior to uploading to the cloud more so than if someone wanted to upload a file using ftp or file://19:39
clarkbbut I also don't have quite the security paranoia of others19:39
zxiiroclarkb: that's a good idea i should check the checksum at least heh19:40
fungiyeah, i have a feeling what's tripping that is passing a variable straight into the urlretrieve parameters19:41
fungibut without digging into the bandit source hard to say what it's actually expecting instead19:41
fungii agree it could do with some improved documentation on that check19:41
fungialso, to clarkb's point, not making some cursory checks of the image file does seem risky19:43
*** e0ne has joined #openstack-infra19:45
*** spartakos has quit IRC19:49
*** hasharDinner is now known as hashar19:55
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: Move coverage jobs to voting check job  https://review.openstack.org/43283619:55
AJaegerianw: updated description ^19:56
AJaegerclarkb, fungi, I would value your input on 432836 overall - whether general direction is fine.19:56
clarkbI've found coverage in check as being valuable for inspecting test results. I do wonder about applying it globally, but projects can always remove it after if they don't care for it19:58
*** bharath_t has quit IRC19:59
fungiclarkb: any idea why system-config-run-base is failing on 600835?19:59
corvusi was just starting to look at that myself20:00
fungiAJaeger: one counterpoint to the argument you make in 432836 is that the linked discussion from 2016 predates the zuul v3 builds dashboard20:00
AJaegerclarkb: I found quite a few broken cover jobs - that never worked in post ;(20:00
AJaegerfungi: yeah, it does. Still, nobody checks if I look at post jobs that fail completely ;(20:01
fungii don't think finding the latest post pipeline coverage runs is as hard as it was back then, but i do expect that people still aren't looking at them much20:01
corvusfungi, clarkb: http://logs.openstack.org/35/600835/1/check/system-config-run-base/680f906/job-output.txt.gz#_2018-09-07_19_44_47_600584  is that the failure?20:01
clarkbfungi: http://logs.openstack.org/35/600835/1/check/system-config-run-base/680f906/job-output.txt.gz#_2018-09-07_19_44_47_60058420:01
fungiyou both pasted the same url, so i'm guessing that's it ;)20:01
clarkbapt must've failed or maybe it tried using the wrong package manager again?20:03
clarkb(I don't actually know how we fixed that()20:03
fungihttps://packages.ubuntu.com/bionic/logrotate20:03
fungiindeed20:03
fungioh, wait, centos720:04
fungiwhat?20:05
fungii guess that next line is output from the centos7 node in the job (indirectly) and not the bridge node?20:05
clarkbbionic and centos7 are multiple nodes running the same task, bionic failed but centos7 succeeded I think20:05
fungiahh, yeah, my eyeballs were skipping past the [bionic] tag at the beginning of that line20:06
corvusi will say that free makes things a bit harder to follow20:08
corvus(but ara will help with that)20:08
clarkbwe can compare the base playbook runtime too in production before and after to decide if it is worthwhile20:09
corvusi'm trying to figure out whether/when the "Update apt cache" handler ran on bionic20:09
fungidhinesh: any chance you can confirm what version of openstackclient you have installed there? that AttributeError exception is raised by openstack._adapter.request making a call to what looks like probably one of its own class methods...20:10
corvusthat not having run (yet) for some reason would cause the error20:10
clarkbcorvus: http://logs.openstack.org/35/600835/1/check/system-config-run-base/680f906/job-output.txt.gz#_2018-09-07_19_41_21_178872 it ran there as part of job setup at least20:11
corvusclarkb: yeah, but that's before we overwrite the config20:12
*** ijw has joined #openstack-infra20:12
clarkbcorvus: grepping cache_updated I don't see an entry for bionic20:13
*** agopi is now known as agopi|off20:14
corvusclarkb: i agree.  i see http://logs.openstack.org/35/600835/1/check/system-config-run-base/680f906/job-output.txt.gz#_2018-09-07_19_44_34_608943 for xenial (after writing the apt config and before installing logrotate).  but not for bionic.20:15
corvusclarkb: i wonder if there's some weird interaction between handlers and free20:15
clarkbhandlers do seem to have weird interactions. Like that reorder of the unbound play so that handlers run at the right time20:16
corvushttps://github.com/ansible/ansible/issues/3150420:16
clarkbhttps://github.com/ansible/ansible/commit/5f953d1129925579b7bf6ea2cce1ccf583199801 is the commit we need which I guess isn't in 2.6.220:17
corvusconfirmed, it's not in 2.6.220:18
corvusit's in the 2.7.0 rc20:19
fungiwell, at least the good news is that change is what's causing the failure, and our testing kept it from breaking our deployments20:19
*** agopi|off has quit IRC20:19
fungibad news is we're stuck waiting until ansible 2.7 to make use of that feature i guess?20:19
corvusfungi: yeah.  this whole test like production thing is kinda working out :)20:20
fungiall things being equal, i'm fine with this result ;)20:20
corvusdoes anyone know why users has to be in the base-repos play?20:21
clarkbya its great to see the testing catch real problems20:21
corvusmaybe we can move users to the second play and then just have the first play be linear?20:21
fungiseems like a reasonable enough workaround20:22
corvus(users is long and slow and may benefit from free)20:22
clarkbcorvus: do we need linear for other roles like exim and unbound (they install packages too)20:22
corvusi don't see anything in base-repos which suggests the need for users20:23
clarkbI guess we can rely on that happening soon enough after base-repos to be fine most of the time20:23
corvusclarkb: it'd only be for handlers.  we do have several roles with handlers.  most of them are for restarting services after config file changes.20:23
corvusbase-server, snmpd, timezone, iptables, and exim all have handlers20:23
*** anteaya has joined #openstack-infra20:24
corvushrm.  maybe we should avoid free :(20:24
clarkbor handlers20:24
clarkbwe can restart services and update package caches explicitly in tasks20:24
clarkb(I don't know why that isn't normal though)20:24
corvusit's hard to do the "restart on config file change" without a handler20:24
corvusi think you'd have to do a bunch of register + when stuff20:25
corvus(and do it for every config file; so you'd end up with "when: config_file.changed or defaults_file.changed" i think.)20:26
*** diablo_rojo has quit IRC20:26
corvusi wonder how hard it would be to install 2.7.0rc120:27
corvuswe use the ansible pip module to install it20:28
clarkbI want to say ansible doesn't publish rc's to pypi20:28
corvushttps://pypi.org/project/ansible/2.7.0rc1/20:28
clarkbbut you can download the sdist off github and install that? it was weird in the past when git submodules were used20:28
clarkboh neat20:28
corvushttps://docs.ansible.com/ansible/2.6/modules/pip_module.html20:29
corvusi don't see anything there about enabling pre-releases...20:29
*** ijw has quit IRC20:29
corvusmaybe we just need to do 'version==' ?20:29
clarkbya pip will install an exact version if you give it one20:29
corvuspatch incoming20:29
clarkbthat overrides the no rc's by default policy20:29
openstackgerritJames E. Blair proposed openstack-infra/system-config master: Install ansible 2.7.0rc1 on bridge  https://review.openstack.org/60086120:31
corvusthat should be rather self-testing20:32
*** trown is now known as trown|outtypewww20:32
corvus-f50 started at 20:0020:33
*** r-daneel has quit IRC20:34
corvusclarkb: mind if i update your patch to depends-on that one?  might get some more useful testing20:36
clarkbgo for it20:36
openstackgerritJames E. Blair proposed openstack-infra/system-config master: Run base playbook with strategy free  https://review.openstack.org/60083520:36
*** jtomasek has quit IRC20:36
*** ijw has joined #openstack-infra20:38
*** ijw has quit IRC20:39
*** ansmith has quit IRC20:42
dhineshfungi: don't think openstackclient is installed on the CI machine, did you mean the version of openstackclient on the private openstack cloud side20:46
*** r-daneel has joined #openstack-infra20:47
fungidhinesh: sorry, openstacksdk. that traceback you pasted showed shade calling the openstack sdk20:47
clarkbcorvus: I think we have to wait for the 2100 run to see the -f50 because the run_all.sh script is what updates system-config on disk20:50
clarkbcorvus: the playbooks themselves will use the new version on the first run because it updates then runs plabooks but run_all doesn't reexec itself so has to wait for run + 120:50
corvusclarkb: ah right20:51
*** e0ne has quit IRC20:57
*** jtomasek has joined #openstack-infra20:59
*** spartakos has joined #openstack-infra21:00
*** jento has quit IRC21:01
*** rh-jelabarre has quit IRC21:02
*** spartakos has quit IRC21:02
*** diablo_rojo has joined #openstack-infra21:03
clarkbyup see it running with -f 50 now21:03
corvushrm, we don't run the bridge playbook with -v so i don't see a way to independently confirm that 2.7.0 was installed in the job21:05
corvusoh, i think that's in zuul, we may have it in the zuul ara report then21:06
corvuswoot! http://logs.openstack.org/61/600861/1/check/system-config-run-base/6437436/ara-report/result/2711b6f4-2fb7-4cd6-9ee1-a43b7bb4ec6d/21:06
clarkband the depends on in 600835 should check that it works for fixing that bug21:07
corvusclarkb, fungi: 2.7.0 appears to work with the base playbook and makes clarkb's change happy.  of course, that's very likely a non-deterministic error, so it's hard to confirm success there.  but we have at least failed to confirm failure.21:07
clarkbwatchign the run_all.sh I am somewhat skeptical that -f 50 is doing what we think it does21:08
corvusi'm game to try it out21:08
corvusclarkb: oh, why?21:08
clarkbwe have ~130 nodes which should cause it to be running at 50 forks most of the time but looking at ps its well under 10 most of the time looks like21:09
dmsimardcorvus: the ansible version displayed at the top right in ARA is probably misleading21:09
dmsimardTop right = webapp ansible version21:09
dmsimardThe real ansible version that was used can be found in the parameters or when clicking on the playbook file21:10
corvusdmsimard: yeah, i'm looking at the bottom of http://logs.openstack.org/61/600861/1/check/system-config-run-base/6437436/ara-report/result/2711b6f4-2fb7-4cd6-9ee1-a43b7bb4ec6d/ which confirms that it installed 2.7.0rc1 on fake bridge.21:10
dmsimardAh, cool21:10
dmsimardI manage to confuse myself sometimes working on ara, it gets a bit inception-ish21:11
corvusdmsimard: that's zuul's ansible bootstrapping the ansible for a node which is standing in for our bastion host which runs ansible on the control plane21:11
dmsimardOh boy21:12
corvusclarkb: okay now we're at a place which should be able to do a lot in parallel21:15
*** holser_ has joined #openstack-infra21:17
corvusclarkb: i agree the number is very small.  i wonder if these tasks are very quick and ansible can't launch them fast enough?  like, would we see better utilization if we had a bunch of slow tasks?21:17
clarkbpossibly, we do seem to have better utilization in the run puppet else playbook and puppet is a reltively slow task21:18
*** bobh has quit IRC21:19
*** ssbarnea|bkp2 has quit IRC21:19
clarkbI also wonder if each fork does its own startup process reaing all of the cache data21:19
*** vaidy has quit IRC21:21
clarkbtesting manually with `sudo ansible logstash-worker* -m shell -a "sleep 20" -f 10` implies the slow start is a one time cost21:21
clarkbI see two waves of succeses ~20 seconds apart21:21
clarkb(that slow start does make ansible less appealing than an ssh for loop for foreground work though :/)21:22
corvusit's interesting to see the 'ok' lines in the log at a relatively consistent about 4/sec21:23
corvusclarkb: what if it's the inventory?  ansible isn't supposed to create more forks than needed for the inventory... our inventory is dynamic21:26
clarkboh interesting21:26
clarkbya maybe its only reading N out of the inventory at a time?21:27
corvusi'm digging into code for that now21:27
*** mriedem is now known as mriedem_mowtown21:28
corvuswow that really does take a long time to start up21:31
corvusclarkb: if i run your logstash command with 'sleep 0' it looks more like what we see in production21:32
corvusclarkb: so maybe the idea that it only matters for sufficiently slow tasks is the better theory21:32
pabelanger51783221:33
pabelangergah21:33
pabelangerI hit it everytime I move my laptop21:33
clarkbseems like that would be a bug in ansible right? If you've told it you can run more threads then it should do that and not be hindered by task startup time?21:34
clarkbI don't know enough about ansible itnernals though21:35
*** holser_ has quit IRC21:35
*** diablo_rojo has quit IRC21:39
*** tosky has quit IRC21:40
corvusclarkb: i think the forked worker process only lasts as long as a single task21:41
clarkbif that is the case then python startup is likely to hurt us quite a bit21:42
corvusit looks like there's a lot of stuff that happens for each host-task21:43
*** jamesmcarthur has quit IRC21:45
*** jamesmcarthur has joined #openstack-infra21:46
*** anteaya has quit IRC21:46
*** anteaya has joined #openstack-infra21:47
*** eernst has joined #openstack-infra21:48
*** graphene has quit IRC21:49
*** myoung|dr is now known as myoung21:49
corvusi don't expect free to behave differently in this respect.21:49
clarkbfree could potentially reuse forks since it runs to completion for each host, but I have no idea if the implemetnation does that21:50
corvus(of course, it should improve parallelism in general, but it still uses the same task queue system)21:50
corvusclarkb: i think it's basically a fully lineraized task queue.  the different strategies just decide what order the tasks run in and where the checkpoints are21:51
corvusbut ultimately, each host-task goes onto a queue and assigned to a worker (which will then fork, run that task, and exit)21:51
*** agopi|off has joined #openstack-infra21:52
*** bobh has joined #openstack-infra21:52
*** diablo_rojo has joined #openstack-infra21:52
clarkbthat is an unfortuantely slow way to do it with python given startup times21:54
corvusclarkb: you don't incur a python startup time with a fork, though, right?21:54
*** jamesmcarthur has quit IRC21:54
clarkboh right for some reason I had in my head fork + exec21:55
clarkbif not doing the exec then ya should be fine21:55
corvusyeah, it's multiprocessing.Process under the hood21:56
*** spartakos has joined #openstack-infra21:56
*** jtomasek has quit IRC21:58
*** kukacz_ has quit IRC22:02
*** kukacz_ has joined #openstack-infra22:04
corvusi'm looking at the openstacksdk docs for cache, and i'm not clear about expiration: https://docs.openstack.org/openstacksdk/latest/user/config/configuration.html22:04
*** yamamoto has quit IRC22:05
corvuswhat does the expiration dictionary mean?22:05
clarkbcorvus: the keys are the different cached objects and the values are ttls in seconds22:05
clarkb-1 is a special ttl that means never expire looks like22:05
clarkb(I'm not sure what the valid list of keys is there)22:06
corvuslooks like that's the extent of documentation about expiration22:06
dhinesh:fungi: after some digging, got access to my old CI setup, will continue to work on that. Thanks for help.22:08
corvusi'm editing clouds.yaml on bridge to see if tweaking the cache params makes startup faster22:08
corvusso far it's not22:09
*** bobh has quit IRC22:10
*** diablo_rojo has quit IRC22:12
*** r-daneel has quit IRC22:12
*** spartakos has quit IRC22:14
*** rfolco has quit IRC22:15
*** spartakos has joined #openstack-infra22:15
*** gema has quit IRC22:18
*** gema has joined #openstack-infra22:20
*** gema has quit IRC22:20
*** gema has joined #openstack-infra22:20
*** spartakos has quit IRC22:23
*** hashar has quit IRC22:25
*** spartakos has joined #openstack-infra22:27
*** dave-mccowan has quit IRC22:38
*** jamesmcarthur has joined #openstack-infra22:44
*** jamesmcarthur has quit IRC22:48
corvusclarkb: our 2 vcpus may have an impact too.22:51
corvusclarkb: it's the constructed inventory plugin that's taking so long, not openstack.22:51
clarkbthat is unexpected22:52
*** rlandy has quit IRC22:55
*** caphrim007_ has joined #openstack-infra22:58
*** caphrim00_ has joined #openstack-infra22:59
*** eernst has quit IRC23:01
*** caphrim007 has quit IRC23:01
fungiweird23:01
corvusi'm trying to profile it now23:01
*** caphrim007_ has quit IRC23:02
*** caphrim00_ has quit IRC23:03
clarkbcorvus: reading docs on constructed inventory it uses the fact cache to builds its variable set. That probably explains why we read the entire fact cache for every ansible run23:05
clarkbregardless of what host we run against23:05
corvusclarkb: yep23:06
clarkbI wonder if we shouldn't do a more traditional inventory23:06
clarkband use globbing and other pattern matching23:06
corvusclarkb: i'm not convinced it's the cache that's slow23:07
corvusthe slowest call is self._add_host_to_composed_groups(self.get_option('groups')23:08
corvusthis is the profile: http://paste.openstack.org/show/729716/23:08
corvuslet me git github links23:09
corvushttps://github.com/ansible/ansible/blob/devel/lib/ansible/plugins/inventory/constructed.py#L12123:09
corvuswhich is https://github.com/ansible/ansible/blob/devel/lib/ansible/plugins/inventory/__init__.py#L29323:10
corvusi'm wondering if it's the fact that we're compiling that template for every group for every host23:10
corvushow many hosts do we have? 100?  we have 55 groups.  that's like 5500 template compilations.23:11
clarkb~130 hosts23:12
corvus715023:12
*** sthussey has quit IRC23:13
clarkbit is reading the yaml entry and evaluating it in a dynamic jinja2 template huh23:13
clarkband then it does that hosts * groups times23:13
corvusyeah, if i comment out this line it's super fast: https://github.com/ansible/ansible/blob/devel/lib/ansible/plugins/inventory/__init__.py#L30123:14
corvusand it's still loading all the facts from disk23:14
corvusso it's the jinja templating that's killing it23:14
clarkbI wonder if we can memoize/cache jinja2 intermediate results23:15
clarkbthen only reevaluate with new variables23:15
*** ansmith has joined #openstack-infra23:16
*** ijw has joined #openstack-infra23:17
*** spartakos has quit IRC23:17
*** mriedem_mowtown is now known as mriedem23:19
corvusclarkb: i think i'm going to stop digging into this now.  i think we've got good data we can take into next week and discuss inventory/grouping ideas.23:21
clarkbcorvus: I think we could get away with the yaml inventory fwiw, we'd jsut be left using patterns23:21
*** ijw has quit IRC23:21
corvusclarkb: the yaml inventory does not call that method, so that's promising.23:22
openstackgerritMerged openstack-infra/puppet-ptgbot master: Make the FC SIG freed slots actually reservable  https://review.openstack.org/60066523:23
corvusclarkb: that looks mostly like good ole compiled regexes23:23
*** mriedem has quit IRC23:28
*** bobh has joined #openstack-infra23:45
*** bobh has quit IRC23:49
*** anteaya has quit IRC23:59

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!