Wednesday, 2019-09-18

*** goldyfruit_ has joined #openstack-infra00:06
clarkbhttps://review.opendev.org/#/c/682755/1 has a +1 from zuul now. Thanks for getting that fix in00:18
*** lbragstad has quit IRC00:19
*** lbragstad has joined #openstack-infra00:19
*** zhurong has quit IRC00:38
*** jamesmcarthur has joined #openstack-infra00:42
*** zhurong has joined #openstack-infra00:49
*** gyee has quit IRC00:50
*** xenos76 has quit IRC00:53
*** jamesmcarthur has quit IRC01:07
*** markvoelker has quit IRC01:08
*** slaweq has joined #openstack-infra01:11
*** slaweq has quit IRC01:16
*** rkukura has quit IRC01:20
*** markvoelker has joined #openstack-infra01:20
openstackgerritAkihiro Motoki proposed openstack/project-config master: Stop gerrit notification to neutron-stadium specific channels  https://review.opendev.org/68277101:40
*** yamamoto has joined #openstack-infra01:56
openstackgerritMohammed Naser proposed zuul/zuul-operator master: Create zookeeper operator  https://review.opendev.org/67645802:08
*** apetrich has quit IRC02:10
openstackgerritMerged openstack/project-config master: Add the gerrit checks plugin repo to Zuul  https://review.opendev.org/68275202:17
*** yamamoto has quit IRC02:21
*** yamamoto has joined #openstack-infra02:21
openstackgerritMerged openstack/project-config master: Add upstream Gerrit projects to OpenDev Zuul tenant  https://review.opendev.org/68275402:21
openstackgerritMerged openstack/project-config master: Add system-config and zuul to OpenDev tenant for jobs  https://review.opendev.org/68275502:21
*** jamesmcarthur has joined #openstack-infra02:22
*** roman_g has quit IRC02:35
*** jamesmcarthur has quit IRC02:38
*** jamesmcarthur has joined #openstack-infra02:47
*** larainema has joined #openstack-infra02:49
ianwdmsimard: fwiw i've had good luck importing svg's into tinkercad and extruding, then stl into cura02:49
ianwand i can't remember what it was, something microsoft related, you can run an stl file through it to make it "watertight" and that worked well for me once too02:50
ianwwas this -> https://tools3d.azurewebsites.net/02:51
dmsimardianw: thanks for the advice ! I'll check tinkercad out tomorrow02:55
*** zhangfei has joined #openstack-infra02:57
ianwyeah i've found it enough to do all manner of simple to moderate designs, with a very shallow learning curve02:58
*** markvoelker has quit IRC03:09
*** slaweq has joined #openstack-infra03:11
*** slaweq has quit IRC03:16
*** jamesmcarthur has quit IRC03:24
*** weshay|ruck has quit IRC03:24
*** jamesmcarthur has joined #openstack-infra03:36
*** dave-mccowan has quit IRC03:36
*** rh-jelabarre has quit IRC03:47
*** rh-jelabarre has joined #openstack-infra03:47
*** zhangfei has quit IRC03:49
*** jamesmcarthur has quit IRC03:50
*** ricolin has joined #openstack-infra03:54
openstackgerritIan Wienand proposed zuul/zuul master: zuul_console: fix python 3 support  https://review.opendev.org/68255603:59
openstackgerritIan Wienand proposed zuul/zuul master: Support nodes setting 'auto' python-path  https://review.opendev.org/68227503:59
*** ramishra has joined #openstack-infra04:01
*** zhangfei has joined #openstack-infra04:07
auristorianw: was the delay between last volume update and release enabled for mirror.opensuse ?04:07
ianwauristor: no, i've only done that for the fedora script at the moment04:08
ianwhonestly i haven't had a chance to check back in on it ... let me see04:08
auristorok.  mirror.opensuse has been released at least twice today and the last release started almost six hours ago04:09
ianw2019-09-17 21:06:17  | Starting ForwardMulti from 536871007 to 536871007 on afs02.dfw.openstack.org (as of Tue Sep 17 13:03:58 2019).04:09
ianw2019-09-18 03:34:27  | updating VLDB ... done04:09
ianwthat's a good 7+ hours :/04:10
auristorit completed in the last few minutes04:10
ianw2019-09-17 13:04:11  | Starting ForwardMulti from 536871007 to 536871007 on afs02.dfw.openstack.org (as of Tue Sep 17 05:04:17 2019).04:10
ianw2019-09-17 18:51:20  | updating VLDB ... done04:10
ianw^ that one was a "zero" update (rsync didn't touch anything)04:11
ianwfor opensuse, we merged https://review.opendev.org/#/c/682576/ which brought in some extra repo bits ...04:12
ianwi think, if i'm reading the logs correctly, sent 23,618 bytes  received 4,694,912,393 bytes  5,219,495.29 bytes/sec04:13
ianwan extra 5gb or so added to the volume04:13
ianwhttp://files.openstack.org/mirror/logs/rsync-mirrors/opensuse.log04:14
ianwi think it might be better if these files rotate weekly04:14
*** ykarel has joined #openstack-infra04:24
*** ociuhandu has joined #openstack-infra04:30
*** auristor has quit IRC04:33
openstackgerritIan Wienand proposed opendev/system-config master: logrotate: better rotation options  https://review.opendev.org/68278604:33
*** janki has joined #openstack-infra04:34
*** auristor has joined #openstack-infra04:34
*** ociuhandu has quit IRC04:35
AJaegerconfig-core, https://review.opendev.org/#/c/682215/ and https://review.opendev.org/682216 should be ready - please review04:37
*** whoami-rajat has joined #openstack-infra04:38
*** lajoskatona has joined #openstack-infra04:42
*** lajoskatona has left #openstack-infra04:44
*** udesale has joined #openstack-infra04:45
*** pcaruana has joined #openstack-infra04:46
*** jaosorior has quit IRC04:57
*** jaosorior has joined #openstack-infra04:57
*** markvoelker has joined #openstack-infra05:10
*** slaweq has joined #openstack-infra05:11
*** jtomasek has joined #openstack-infra05:13
*** e0ne has joined #openstack-infra05:14
*** e0ne has quit IRC05:14
*** markvoelker has quit IRC05:14
*** slaweq has quit IRC05:16
*** e0ne has joined #openstack-infra05:16
*** ramishra has quit IRC05:39
*** e0ne has quit IRC05:50
*** kopecmartin|off is now known as kopecmartin05:54
openstackgerritIan Wienand proposed zuul/nodepool master: Set default python-path to "auto"  https://review.opendev.org/68279705:57
openstackgerritIan Wienand proposed opendev/system-config master: logrotate: better rotation options  https://review.opendev.org/68278606:02
*** slaweq has joined #openstack-infra06:11
*** pgaxatte has joined #openstack-infra06:13
*** slaweq has quit IRC06:16
*** igordc has joined #openstack-infra06:17
*** igordc has quit IRC06:19
*** e0ne has joined #openstack-infra06:21
*** xek_ has joined #openstack-infra06:22
*** e0ne has quit IRC06:26
*** xek_ has quit IRC06:30
*** zhangfei has quit IRC06:32
*** shachar has quit IRC06:35
*** snapiri has joined #openstack-infra06:35
*** armax has joined #openstack-infra06:39
openstackgerritAndreas Jaeger proposed openstack/project-config master: Add horizon to translation jobs  https://review.opendev.org/68280806:54
*** slaweq has joined #openstack-infra06:55
*** armax has quit IRC06:56
*** rcernin has quit IRC06:57
*** zhangfei has joined #openstack-infra06:57
*** apetrich has joined #openstack-infra06:59
*** roman_g has joined #openstack-infra07:00
openstackgerritAndreas Jaeger proposed openstack/openstack-zuul-jobs master: Use on-disk horizon dir for translations  https://review.opendev.org/68280907:03
openstackgerritAndreas Jaeger proposed openstack/project-config master: Add horizon to translation jobs  https://review.opendev.org/68280807:03
openstackgerritAndreas Jaeger proposed openstack/openstack-zuul-jobs master: Use on-disk horizon dir for translations  https://review.opendev.org/68280907:07
AJaegerconfig-core, please review 682808 and 682809 to fix train translations - and make them more robust.07:07
*** trident has quit IRC07:08
openstackgerritAndreas Jaeger proposed openstack/project-config master: Add horizon to translation jobs  https://review.opendev.org/68280807:08
*** tosky has joined #openstack-infra07:18
*** trident has joined #openstack-infra07:19
*** sshnaidm|pto is now known as sshnaidm|rover07:24
sshnaidm|roverbefore this big logs change I could leave ara.sqlite of my nested ansible in my custom ara-report directory and have nice html reports, is it possible to get this now in some way?07:31
*** beagles has quit IRC07:31
*** mmedvede has quit IRC07:31
*** b3nt_pin has joined #openstack-infra07:32
*** rpittau|afk is now known as rpittau07:32
*** arxcruz has quit IRC07:32
*** mmedvede has joined #openstack-infra07:34
*** arxcruz has joined #openstack-infra07:36
*** jpena|off is now known as jpena07:41
*** ykarel is now known as ykarel|lunch07:44
AJaegermordred: https://review.opendev.org/682809 removes a "TODO(mordred)", want to review that together with https://review.opendev.org/682808, please?07:46
AJaegersshnaidm|rover: best discuss later with corvus07:47
sshnaidm|roverAJaeger, ack07:48
*** gfidente has joined #openstack-infra07:50
*** gfidente has quit IRC07:51
*** e0ne has joined #openstack-infra07:57
*** dchen has quit IRC08:03
*** ralonsoh has joined #openstack-infra08:04
*** tkajinam has quit IRC08:04
*** AJaeger has quit IRC08:14
ianwsshnaidm|rover: the answer is currently no, because that relied on the log server running wsgi middleware to read the sqlite file.  now the files are served directly from buckets by the hosting providers08:14
ianwsshnaidm|rover: the current practice for nested ansible is to switch ara to static html generation.  it creates a lot of little files, but for one project it should be OK08:15
*** gfidente has joined #openstack-infra08:15
ianwas AJaeger mentioned, future options are under discussion ... there is a thread i think on infra list08:15
sshnaidm|roverianw, ack, thanks08:15
sshnaidm|roverianw, btw, can I generate something as zuul artifact so I can show this in build logs interface?08:16
sshnaidm|roverlike file or link to something08:16
*** xenos76 has joined #openstack-infra08:17
ianwsshnaidm|rover: yes ... i'm trying to think of an example08:18
ianwi think if you drop it in the "artifacts" directory; i.e. sibling to logs.  codesearching for things like that you'll probably find some examples08:18
*** AJaeger has joined #openstack-infra08:27
fricklerhttps://zuul-ci.org/docs/zuul/user/jobs.html#return-artifacts08:27
openstackgerritMerged zuul/zuul master: Add no-jobs reporter action  https://review.opendev.org/68127808:36
*** ociuhandu has joined #openstack-infra08:37
*** kaiokmo has joined #openstack-infra08:38
*** dciabrin_ has quit IRC08:38
*** derekh has joined #openstack-infra08:40
*** ociuhandu has quit IRC08:41
*** iurygregory has joined #openstack-infra08:48
openstackgerritMerged zuul/zuul master: Add report time to item model  https://review.opendev.org/68132308:50
*** ociuhandu has joined #openstack-infra08:52
openstackgerritMerged openstack/cookiecutter master: Update URLs  https://review.opendev.org/67999808:52
*** ociuhandu has quit IRC08:53
*** KeithMnemonic1 has joined #openstack-infra08:55
*** KeithMnemonic has quit IRC08:55
*** KeithMnemonic1 has quit IRC08:55
*** KeithMnemonic1 has joined #openstack-infra08:56
*** pcaruana has quit IRC08:57
*** ociuhandu has joined #openstack-infra08:58
*** ramishra has joined #openstack-infra08:59
openstackgerritMerged openstack/project-config master: Add horizon to translation jobs  https://review.opendev.org/68280809:00
*** pcaruana has joined #openstack-infra09:01
*** priteau has joined #openstack-infra09:07
openstackgerritMerged zuul/zuul master: Add Item.formatStatusUrl  https://review.opendev.org/68132409:11
stephenfinI assume there's a reason that rechecks on approved patches run both check and gate jobs?09:13
sshnaidm|roverstephenfin, firstly check jobs should run09:14
openstackgerritMerged openstack/openstack-zuul-jobs master: Use on-disk horizon dir for translations  https://review.opendev.org/68280909:19
*** kjackal has joined #openstack-infra09:21
stephenfinsshnaidm|rover: Once, sure, but if they've already passed and it's just a flaky gate failure, why do we need to run them again?09:22
sshnaidm|roverstephenfin, I assume to be sure that it's just flaky gate failure and not some bug.. But I also find it not convenient.09:24
*** ramishra_ has joined #openstack-infra09:25
*** ramishra has quit IRC09:26
*** ramishra has joined #openstack-infra09:26
*** yamamoto has quit IRC09:27
*** ramishra_ has quit IRC09:30
*** panda|ruck|off is now known as panda|ruck09:37
*** tesseract has joined #openstack-infra09:44
AJaegerinfra-root, any idea why the translation jobs now fail with 682809 merged, see http://zuul.opendev.org/t/openstack/build/60c2674924914f57a176b3e5f69bea9f09:45
AJaegerit works fine here http://zuul.opendev.org/t/openstack/build/0978b7bb113945d8ac70f1e20e234ab6/log/job-output.txt#128909:46
AJaegerbut fails here http://zuul.opendev.org/t/openstack/build/60c2674924914f57a176b3e5f69bea9f/log/job-output.txt#1008 with horizon checked out via required-projects09:48
AJaegerfound it "horizon/horizon"09:48
openstackgerritAndreas Jaeger proposed openstack/openstack-zuul-jobs master: Fix horizon translation setup  https://review.opendev.org/68285009:49
*** dtantsur|afk is now known as dtantsur09:49
AJaegermordred: could you +2, please? ^09:49
mordredAJaeger: whoops! +3d it. (that's an obvious error)09:50
* mordred feels silly for missing that09:50
openstackgerritMerged openstack/project-config master: Announce opendev/project-config changes in channels  https://review.opendev.org/68275709:50
* AJaeger feels silly for leaving it in - thanks, mordred09:51
AJaegerthanks, frickler !09:51
*** yamamoto has joined #openstack-infra09:58
*** markvoelker has joined #openstack-infra09:59
openstackgerritFabien Boucher proposed zuul/zuul master: Pagure - fix wrong commit gitweb url  https://review.opendev.org/67994610:02
openstackgerritFabien Boucher proposed zuul/zuul master: Pagure - handle initial comment change event  https://review.opendev.org/68031010:02
openstackgerritFabien Boucher proposed zuul/zuul master: Pagure - handle Pull Request tags (labels) metadata  https://review.opendev.org/68105010:02
openstackgerritFabien Boucher proposed zuul/zuul master: Pagure - reference pipelines add open: True requirement  https://review.opendev.org/68125210:02
openstackgerritFabien Boucher proposed zuul/zuul master: Pagure - handles pull-request.closed event  https://review.opendev.org/68127910:02
*** pgaxatte has quit IRC10:03
*** markvoelker has quit IRC10:04
*** openstackgerrit has quit IRC10:06
*** ociuhandu has quit IRC10:07
*** yamamoto has quit IRC10:07
*** openstackgerrit has joined #openstack-infra10:10
openstackgerritMerged openstack/openstack-zuul-jobs master: Fix horizon translation setup  https://review.opendev.org/68285010:10
*** zbr has quit IRC10:11
*** dciabrin has joined #openstack-infra10:12
*** zbr has joined #openstack-infra10:12
*** kjackal has quit IRC10:12
*** whoami-rajat has quit IRC10:13
*** kjackal has joined #openstack-infra10:13
*** pgaxatte has joined #openstack-infra10:14
*** ociuhandu has joined #openstack-infra10:14
*** markvoelker has joined #openstack-infra10:16
*** dciabrin has quit IRC10:17
*** ociuhandu_ has joined #openstack-infra10:18
AJaegeryeah, fixed - just saw first successful run10:18
*** kjackal has quit IRC10:19
*** ramishra has quit IRC10:20
*** markvoelker has quit IRC10:20
*** ociuhandu has quit IRC10:22
openstackgerritFabien Boucher proposed zuul/zuul master: Pagure - add support for git.tag.creation event  https://review.opendev.org/67993810:30
*** pgaxatte has quit IRC10:30
*** ykarel|lunch is now known as ykarel10:30
mordredAJaeger: \o/10:31
*** whoami-rajat has joined #openstack-infra10:34
*** dciabrin has joined #openstack-infra10:37
*** yamamoto has joined #openstack-infra10:42
*** exsdev0 has joined #openstack-infra10:46
*** exsdev has quit IRC10:47
*** exsdev0 is now known as exsdev10:47
*** AJaeger has quit IRC10:55
openstackgerritMerged zuul/zuul master: Pagure - fix wrong commit gitweb url  https://review.opendev.org/67994610:57
*** udesale has quit IRC11:01
*** AJaeger has joined #openstack-infra11:01
*** udesale has joined #openstack-infra11:02
*** yamamoto has quit IRC11:12
*** lucasagomes has joined #openstack-infra11:13
*** pcaruana has quit IRC11:19
*** e0ne_ has joined #openstack-infra11:20
*** e0ne has quit IRC11:23
*** dave-mccowan has joined #openstack-infra11:23
*** armax has joined #openstack-infra11:24
*** psachin has joined #openstack-infra11:25
*** pcaruana has joined #openstack-infra11:28
*** iurygregory_ has joined #openstack-infra11:29
*** iurygregory has quit IRC11:29
*** jpena is now known as jpena|lunch11:35
*** zhangfei has quit IRC11:42
*** udesale has quit IRC11:44
ttxmgoddard: recent removal of #openstack-kayobe from accessbot is confusing irc-meetings, since you still have #openstack-kayobe in https://opendev.org/opendev/irc-meetings/src/branch/master/meetings/kayobe-team-meeting.yaml11:49
ttxmgoddard: should we remove / change that meeting?11:50
*** yamamoto has joined #openstack-infra11:59
fungii generally don't recommend removing channels from accessbot unless the channel owners really want infra to stop managing their access controls12:00
fungithere's nothing wrong with keeping access lists up to date on defunct channels, in my opinion12:01
AJaegerfungi, the channel redirects now to another one, so is dead12:01
*** yamamoto has quit IRC12:02
*** yamamoto has joined #openstack-infra12:02
AJaegerfungi: it redirects to #openstack-kolla - don't we remove it in that case?12:02
*** pgaxatte has joined #openstack-infra12:03
fungiwhat if they want to undo the redirect in the future but we've switched which account we use to manage channels?12:03
fungiit's not as if accessbot joins any channels, it just checks/updates chanserv acls, so additional channels aren't a significant overhead for it12:04
AJaegerfungi: Ah! Wasn't aware that the redirect is only limited in this way and therefore accessbot is useful. Let me propose a re-add12:05
*** markvoelker has joined #openstack-infra12:05
openstackgerritAndreas Jaeger proposed openstack/project-config master: Revert "Remove accessbot from #openstack-kayobe"  https://review.opendev.org/68288112:06
AJaegerfungi: ^12:06
AJaegermgoddard: sorry for wrong advice on the accessbot removal ^12:06
*** janki has quit IRC12:07
* fungi disappears for morning exercise, back in a while12:08
*** jamesmcarthur has joined #openstack-infra12:09
*** rh-jelabarre has joined #openstack-infra12:10
*** weshay has joined #openstack-infra12:12
*** goldyfruit_ has quit IRC12:15
*** jamesmcarthur has quit IRC12:16
*** armax has quit IRC12:20
*** jamesmcarthur has joined #openstack-infra12:22
*** derekh has quit IRC12:24
*** rlandy has joined #openstack-infra12:25
*** armax has joined #openstack-infra12:28
*** jamesmcarthur has quit IRC12:31
*** jpena|lunch is now known as jpena12:31
openstackgerritMerged openstack/project-config master: Revert "Remove accessbot from #openstack-kayobe"  https://review.opendev.org/68288112:31
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Add scheduler config options for hold expiration  https://review.opendev.org/68267512:42
mgoddardAJaeger: np12:44
*** xenos76 has quit IRC12:46
*** fdegir has quit IRC12:47
*** georgk has quit IRC12:47
*** fdegir has joined #openstack-infra12:47
*** georgk has joined #openstack-infra12:47
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Add scheduler config options for hold expiration  https://review.opendev.org/68267512:48
*** jamesmcarthur has joined #openstack-infra12:51
*** mriedem has joined #openstack-infra12:51
AJaegermgoddard: did you see ttx' ping for the IRC meeting?12:52
*** janki has joined #openstack-infra12:52
*** iurygregory has joined #openstack-infra12:53
*** yamamoto has quit IRC12:53
*** janki has quit IRC12:53
*** iurygregory_ has quit IRC12:55
*** yamamoto has joined #openstack-infra12:57
*** jcoufal has joined #openstack-infra13:01
*** ramishra has joined #openstack-infra13:01
*** JorgeFranco has joined #openstack-infra13:03
*** derekh has joined #openstack-infra13:04
AJaegerfungi: a few more repos for your system-config change: https://review.opendev.org/#/c/682771/ - see also email thread13:04
*** ociuhandu_ has quit IRC13:06
*** ricolin_ has joined #openstack-infra13:08
*** Goneri has joined #openstack-infra13:09
*** ricolin has quit IRC13:11
*** ricolin_ is now known as ricolin13:12
*** eharney has joined #openstack-infra13:17
*** ociuhandu has joined #openstack-infra13:18
*** eharney has quit IRC13:19
mgoddardAJaeger: oh no I didn't, thanks13:20
mgoddardttx: I'll remove the kayobe meeting from irc-meetings13:20
*** eharney has joined #openstack-infra13:20
openstackgerritMark Goddard proposed opendev/irc-meetings master: Remove kayobe team meeting  https://review.opendev.org/68289413:22
*** goldyfruit_ has joined #openstack-infra13:30
*** whoami-rajat has quit IRC13:33
*** psachin has quit IRC13:33
*** aaronsheffield has joined #openstack-infra13:34
*** armax has quit IRC13:35
*** ociuhandu has quit IRC13:35
*** armax has joined #openstack-infra13:36
ttxmgoddard: thanks!13:37
mgoddardttx: np13:38
AJaegerconfig-core, please review https://review.opendev.org/682216 and https://review.opendev.org/682215 to finish switching specs.o.o publishing to promote jobs13:41
*** yamamoto has quit IRC13:49
*** xenos76 has joined #openstack-infra13:55
openstackgerritMerged openstack/project-config master: Stop gerrit notification to neutron-stadium specific channels  https://review.opendev.org/68277114:02
*** ykarel is now known as ykarel|afk14:04
*** xek_ has joined #openstack-infra14:06
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Add scheduler config options for hold expiration  https://review.opendev.org/68267514:07
*** dtantsur is now known as dtantsur|afk14:10
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Mark nodes as USED when deleting autohold  https://review.opendev.org/66406014:12
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Auto-delete expired autohold requests  https://review.opendev.org/66376214:12
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Add autohold delete/info commands to web API  https://review.opendev.org/67905714:12
openstackgerritDavid Shrewsbury proposed zuul/zuul master: Remove outdated TODO  https://review.opendev.org/68242114:12
*** kjackal has joined #openstack-infra14:17
*** openstackgerrit has quit IRC14:21
*** pgaxatte has quit IRC14:27
*** yamamoto has joined #openstack-infra14:28
*** kjackal has quit IRC14:29
*** ociuhandu has joined #openstack-infra14:30
mnaserinfra-root: i think openstackgerrit bot died? or maybe it's still joining14:36
*** openstackgerrit has joined #openstack-infra14:36
openstackgerritJames E. Blair proposed zuul/zuul-jobs master: DNM: test prepare-workspace-git base-test  https://review.opendev.org/68291214:36
*** yamamoto has quit IRC14:37
corvusmnaser: ^ seems to work -- is it not working in some other channel?14:37
mnasercorvus: it literally joined ONE second before you made that change14:37
mnaserlol14:37
mnaserunless it joins-on-demand or something post restart (since 682771 merged)14:37
corvusmnaser: it leaves and joins channels based on activity14:37
mnaseraaaaaaaah14:37
*** yamamoto has joined #openstack-infra14:37
mnaserok that explains it, sorry i just saw it leave and never join again so i figured i'd let people know14:38
mnasersorry for my human nagios failure :)14:38
corvusmnaser: you're still ahead of real nagios :)14:38
*** yamamoto has quit IRC14:40
*** yamamoto has joined #openstack-infra14:40
fungimnaser: yeah, it's a workaround for freenode's limit of 120 simultaneous channels for a single account14:42
mnaserneat-o14:43
fungigerritbot leaves less-active channels when it needs to join others to announce things, in order to keep its joined channel count at or below 120 at all times14:43
fungislightly more efficient than just sharding the configuration across instances for separate freenode accounts14:44
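The channel-limit workaround fungi describes can be sketched as a least-recently-used pool. This is a minimal illustration, not gerritbot's actual code: the `ChannelPool` class and its `announce` method are hypothetical names, and treating "less-active" as "least recently announced to" is an assumption.

```python
from collections import OrderedDict


class ChannelPool:
    """Hypothetical sketch: keep the joined-channel count at or below a limit."""

    def __init__(self, limit=120):
        self.limit = limit
        # Ordered from least recently used to most recently used.
        self._joined = OrderedDict()

    def announce(self, channel):
        """Record an announcement to `channel`.

        Returns the list of channels that had to be parted to make room
        (empty if the bot was already in the channel or had spare capacity).
        """
        parted = []
        if channel in self._joined:
            # Already joined: just mark it as recently active.
            self._joined.move_to_end(channel)
            return parted
        while len(self._joined) >= self.limit:
            # Leave the channel with the oldest activity first.
            oldest, _ = self._joined.popitem(last=False)
            parted.append(oldest)
        self._joined[channel] = None
        return parted

    @property
    def joined(self):
        return list(self._joined)
```

With a limit of 2, announcing to `#a`, `#b`, `#a`, then `#c` parts `#b`, because `#a` was active more recently.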
*** priteau has quit IRC14:47
*** priteau has joined #openstack-infra14:51
mnaser++14:51
*** ykarel|afk is now known as ykarel14:54
*** priteau has quit IRC14:56
openstackgerritMerged opendev/irc-meetings master: Remove kayobe team meeting  https://review.opendev.org/68289414:57
*** jaosorior has quit IRC14:57
*** priteau has joined #openstack-infra14:58
openstackgerritMerged zuul/zuul master: Pagure - handle initial comment change event  https://review.opendev.org/68031014:59
*** eernst has joined #openstack-infra15:04
*** iurygregory has quit IRC15:05
openstackgerritMerged zuul/zuul-website master: Update to page titles and Users  https://review.opendev.org/68045915:06
*** arxcruz is now known as arxcruz|ruck15:08
*** gyee has joined #openstack-infra15:08
*** mattw4 has joined #openstack-infra15:09
*** TxGirlGeek has joined #openstack-infra15:10
*** xek__ has joined #openstack-infra15:12
*** zul has joined #openstack-infra15:13
*** xek_ has quit IRC15:14
clarkbmgoddard: fwiw ovs is namespaced and you can run a separate vswitchd in a different namespace aiui15:15
clarkbif however you are using the host network namespace then ya I think you have to defer to the host15:15
clarkbsimilar to how you wouldn't run a competing dhcpcd in a container15:16
openstackgerritMerged zuul/zuul master: zuul_console: fix python 3 support  https://review.opendev.org/68255615:17
*** michael-beaver has joined #openstack-infra15:19
clarkbstephenfin: "clean check" was applied to openstack's pipelines by sdague in an effort to keep flaky changes out of the gate15:20
*** goldyfruit_ has quit IRC15:20
clarkbstephenfin: there were two major issues he was trying to address with this (though this is all from memory so maybe I remember wrong)15:20
fungibasically if your change fails in the gate, then there's an increased chance it's introducing nondeterministic failure behavior so forcing it to go through check again helps make that more apparent15:21
clarkbstephenfin: first was people approving changes that had no hope of ever passing due to pep8 or other trivial errors. The change would then enter the gate and cause havoc with resets15:21
clarkband ya second thing is what fungi points out15:21
fungiwell, we also had a stale check guard in place for a while to help deal with that first point15:22
clarkbwe root caused how a handful of gate instability bugs managed to merge and most of them were the result of getting lucky after a number of reverifies15:22
fungibecause people would approve changes which had been sitting around with a 6-month-old passing check15:22
fungiand then wreck the gate as they went through unable to pass the current state of jobs for the repo15:23
corvusclarkb: can you join #opendev?15:23
stephenfinAh, those are all good points. I hadn't considered the knock-on effect to the queue of jobs in the gate15:24
clarkbstephenfin: this is why whenever this topic of queue times comes up I push so hard on starting with gate stability15:27
clarkbany other changes we make will be completely wiped out by a flaky gate15:27
clarkband a flaky gate implies our software is flaky15:27
clarkbfixing it is win win15:27
stephenfinyup, for sure15:27
stephenfinwell, _someone's_ software is flaky15:27
stephenfinmy software is perfect(TM)15:28
* stephenfin kids, obviously15:28
clarkbwe prioritize the gate because they are changes in a state reviewed by humans to both be "good" and ready to merge15:29
clarkbwhen the gate resets we stop all jobs behind the reset point and start them all again with a higher priority than all check jobs15:30
clarkbthis takes resources from your changes in check15:30
*** liuyulong has joined #openstack-infra15:30
clarkbstart doing that every 15-45 minutes and you never free up resources for check15:31
clarkbthis has been our situation all through feature freeze with openstack integrated and tripleo trading off on who gets to reset15:32
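The cost of a gate reset, as clarkb describes it, can be sketched roughly as follows. This is an illustrative model only, not Zuul's implementation: `reset_gate` and `wasted_node_minutes` are hypothetical helpers, and the per-change node count and elapsed minutes are made-up inputs.

```python
def reset_gate(queue, failed_change):
    """Split a gate queue at the failing change.

    Returns (unaffected, restarted): changes ahead of the failure keep
    running, while every change behind it has its jobs stopped and
    started again.
    """
    index = queue.index(failed_change)
    return queue[:index], queue[index + 1:]


def wasted_node_minutes(restarted, nodes_per_change, minutes_running):
    """Rough estimate of the test-node time thrown away by one reset."""
    return len(restarted) * nodes_per_change * minutes_running
```

For a queue of `["A", "B", "C", "D"]` where `B` fails, `C` and `D` restart; at an assumed 10 nodes per change and 30 minutes of progress, that is 600 node-minutes redone at gate priority, ahead of anything waiting in check.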
*** iurygregory has joined #openstack-infra15:34
*** markvoelker has quit IRC15:35
*** david-lyle has quit IRC15:35
*** dklyle has joined #openstack-infra15:35
*** zbr has quit IRC15:36
*** zbr has joined #openstack-infra15:36
*** zzehring has quit IRC15:37
*** sshnaidm|rover is now known as sshnaidm15:39
*** dtantsur|afk is now known as dtantsur15:39
*** panda|ruck is now known as panda15:39
*** zzehring has joined #openstack-infra15:40
*** trident has quit IRC15:41
liuyulongHi guys, how can I make a running instance's new NIC get the right IPv6 address with NetworkManager? Or is it impossible?15:44
clarkbliuyulong: are you asking about our test instances?15:44
liuyulongclarkb, no, Donny told me, our CI uses glean.15:45
*** jamesmcarthur has quit IRC15:45
liuyulongI mean upstream CI.15:45
liuyulongSo why not NetworkManager?15:46
openstackgerritAndreas Jaeger proposed openstack/infra-manual master: Document "clean check" requirement  https://review.opendev.org/68293315:46
AJaegerstephenfin, fungi, clarkb , what about this ^ ?15:46
clarkbliuyulong: we do use network manager on centos and fedora. Glean is the piece between reading cloud metadata and network manager15:47
*** ociuhandu has quit IRC15:47
*** jamesmcarthur has joined #openstack-infra15:47
clarkbon other platforms glean sits between metadata and the platform specific network config  /etc/network/interfaces on debuntu, systemd-networkd on gentoo and so on15:47
*** ociuhandu has joined #openstack-infra15:48
clarkbfor ipv6 specifically some platforms never had static configuration of ipv6 addresses added to glean15:48
openstackgerritJames E. Blair proposed openstack/project-config master: Symlink /opt/git to /opt/git/opendev.org  https://review.opendev.org/68293415:49
liuyulongclarkb, great news, we are tuning our local image NetworkManager settings. But every time, the running instance's new NIC ends up with an incorrect IPv6 address.15:49
openstackgerritJames E. Blair proposed openstack/project-config master: Cache repos in /opt/git/opendev.org  https://review.opendev.org/68293515:49
*** ociuhandu has quit IRC15:49
clarkbthey function if there are RAs but not if you are expected to manually configure the IPs15:49
liuyulongThe address is not the port fixed IPv6 addr.15:49
*** ociuhandu has joined #openstack-infra15:49
*** jamesmcarthur has quit IRC15:49
liuyulongThe image is CentOS7.5. FYI15:50
*** ykarel is now known as ykarel|away15:50
clarkbif we aren't talking about our images then I wouldn't know where to start. Are you receiving router advertisements?15:50
AJaegerclarkb, could you please review https://review.opendev.org/682216 and https://review.opendev.org/682215 to finish switching specs.o.o publishing to promote jobs?15:51
clarkbor are you statically configuring the IP?15:51
*** jamesmcarthur has joined #openstack-infra15:52
liuyulongRA packets are received. IP was configured automatically.15:52
*** trident has joined #openstack-infra15:53
*** markvoelker has joined #openstack-infra15:53
*** kjackal has joined #openstack-infra15:53
liuyulongIPv4 address is configured correctly. IPv6 address has the right prefix.15:54
openstackgerritJames E. Blair proposed openstack/project-config master: Cache repos in /opt/git/opendev.org  https://review.opendev.org/68293515:54
*** ramishra has quit IRC15:55
clarkbliuyulong: I think that points at a problem in neutron then? if it thinks the port has one mac address but another is used? The other thing to check is whether or not hardware address obfuscation is enabled on your image15:55
clarkbfungi: ^ I forget how to check that but you probably remember15:55
liuyulongclarkb, but the entire IPv6 address was incorrect. So, since the MAC is correct, we may want to see if NetworkManager can use the EUI64 to generate and config the address.15:56
clarkbliuyulong: you said it has the right prefix15:57
*** jamesmcarthur has quit IRC15:57
liuyulongclarkb, yes, our prefix len is /48, the Neutron subnet pool size is /64.15:58
clarkbthat sounds like a misconfigured neutron then? or something else is sending RAs too?15:58
*** armax has quit IRC15:58
*** TxGirlGeek has quit IRC15:58
*** jamesmcarthur has joined #openstack-infra15:59
*** eernst has quit IRC16:00
liuyulongclarkb, 240e:980:xxxx::/48 is for the subnet pool with a default /64 prefix len for our subnets.16:03
*** rpittau is now known as rpittau|afk16:03
liuyulongclarkb, Then I have a subnet with prefix 240e:980:xxxx:8009::/64, and a port 240e:980:xxxx:8009:f816:3eff:fee1:4af1 with MAC fa:16:3e:e1:4a:f1.16:04
liuyulong10: eth1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP group default qlen 100016:04
liuyulong    link/ether fa:16:3e:e1:4a:f1 brd ff:ff:ff:ff:ff:ff16:04
liuyulong    inet 192.168.1.30/24 brd 192.168.1.255 scope global noprefixroute eth116:04
liuyulong       valid_lft forever preferred_lft forever16:04
liuyulong    inet6 240e:980:xxxx:8009:fb5d:500:74bd:2700/64 scope global noprefixroute dynamic16:04
liuyulongThis is the new NIC inside the VM.16:04
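For reference, the address on the Neutron port above is the SLAAC modified EUI-64 form of the MAC: flip the universal/local bit of the first octet and insert ff:fe in the middle. A minimal Python sketch (the function name is hypothetical, not taken from any tool mentioned in the channel) shows why fa:16:3e:e1:4a:f1 should give the f816:3eff:fee1:4af1 suffix rather than the fb5d:500:74bd:2700 seen inside the VM:

```python
def eui64_interface_id(mac: str) -> str:
    """Return the modified EUI-64 interface identifier for a 48-bit MAC.

    Hypothetical helper for illustration only.
    """
    octets = [int(part, 16) for part in mac.split(":")]
    if len(octets) != 6:
        raise ValueError("expected a 48-bit MAC such as fa:16:3e:e1:4a:f1")
    # Flip the universal/local bit (0x02) of the first octet.
    octets[0] ^= 0x02
    # Insert ff:fe between the OUI half and the NIC-specific half.
    eui = octets[:3] + [0xFF, 0xFE] + octets[3:]
    # Join the eight octets into four 16-bit groups, as in an IPv6 address.
    groups = [f"{(eui[i] << 8) | eui[i + 1]:x}" for i in range(0, 8, 2)]
    return ":".join(groups)


print(eui64_interface_id("fa:16:3e:e1:4a:f1"))  # f816:3eff:fee1:4af1
```

Appending this identifier to the /64 subnet prefix gives the address recorded on the port, so an address with a different lower 64 bits was generated by some other mechanism.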
clarkbliuyulong: please use a paste service like http://paste.openstack.org for pasting large chunks of lines in the future. It helps with readability of the channel16:05
*** kjackal has quit IRC16:05
*** ykarel|away has quit IRC16:05
clarkbso ya that looks like you are getting the correct prefix16:05
clarkbI think that may be caused by hardware obfuscation?16:06
*** yamamoto has quit IRC16:06
clarkbI want to say it is a sysctl setting to disable that16:06
liuyulongclarkb, yes, the prefix is correct. Just have no idea where this "240e:980:xxxx:8009:fb5d:500:74bd:2700" came from.16:06
*** zbr is now known as zbr|ruck16:07
liuyulongnet.ipv6.conf.*.use_tempaddr and net.ipv6.conf.*.addr_gen_mode ?16:07
clarkbliuyulong: https://tools.ietf.org/html/rfc4941 is the rfc16:07
openstackgerritAndreas Jaeger proposed openstack/infra-manual master: Document "clean check" requirement  https://review.opendev.org/68293316:08
clarkbliuyulong: ya what is the value of use_tempaddr for eth1?16:08
*** zzehring has quit IRC16:09
openstackgerritJames E. Blair proposed openstack/project-config master: Symlink /opt/git to /opt/git/opendev.org  https://review.opendev.org/68293416:09
openstackgerritJames E. Blair proposed openstack/project-config master: Cache repos in /opt/git/opendev.org  https://review.opendev.org/68293516:09
liuyulongclarkb, this may be the problem: eth1 was newly added to the instance, so it cannot be set in advance. Will this value inherit from 'default' and 'all'?16:10
fungicorrect, you'll need to disable ipv6 tempaddr support via sysctl configuration16:10
liuyulongnet.ipv6.conf.all.use_tempaddr = 016:11
fungineutron doesn't work with it as far as i know16:11
*** xek__ has quit IRC16:11
clarkbliuyulong: what is it on your running instance? you were able to check the ip addr output, can you check the sysctl for that same eth1 interface?16:11
haleybhttps://docs.openstack.org/neutron/latest/admin/config-ipv6.html#configuring-interfaces-of-the-guest16:11
fungithanks haleyb!16:12
*** kjackal has joined #openstack-infra16:12
haleybwindows guests trip over that a lot, but lately linux distros have been enabling it as well16:12
*** gfidente has quit IRC16:12
*** efried is now known as efried_pto16:13
*** Zara has joined #openstack-infra16:13
AJaegercorvus: want to WIP 682935 ?16:13
AJaegercorvus: Ah, the depends-on has WIP - so, we're good. Ignore me...16:14
liuyulongfor the newly added NIC on the running instance, the value is net.ipv6.conf.*.use_tempaddr = 016:14
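The per-interface check clarkb asks for can be sketched in Python by reading the values the kernel exposes under /proc/sys. This is only an illustration: the function name and the `conf_root` parameter are hypothetical, and `addr_gen_mode` may not exist on every kernel, so a missing file is reported as None rather than treated as an error.

```python
from pathlib import Path


def read_ipv6_sysctls(iface, conf_root="/proc/sys/net/ipv6/conf"):
    """Return use_tempaddr and addr_gen_mode for one interface.

    Hypothetical helper; conf_root is a parameter only so the function
    can be pointed at a test directory.
    """
    values = {}
    for key in ("use_tempaddr", "addr_gen_mode"):
        path = Path(conf_root) / iface / key
        try:
            values[key] = int(path.read_text().strip())
        except FileNotFoundError:
            # The kernel does not expose this setting for the interface.
            values[key] = None
    return values


if __name__ == "__main__":
    # Compare the hot-plugged interface with the inherited defaults.
    for name in ("all", "default", "eth1"):
        print(name, read_ipv6_sysctls(name))
```

Running it once right after the NIC is attached and again a few seconds later would show whether the values change after the address has already been assigned.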
openstackgerritMerged opendev/irc-meetings master: Updated the Diversity WG to monthly  https://review.opendev.org/68272716:15
liuyulong# uname -r16:15
liuyulong3.10.0-862.el7.x86_616:15
liuyulong3.10.0-862.el7.x86_6416:15
fungiliuyulong: one possibility is, depending on how you set your sysctl variables at boot, interface autoconfiguration could be racing whatever disables privacy extensions and opaque identifiers16:16
*** kjackal has quit IRC16:16
fungiso by the time those variables get set back to 0 the interface has already set an address?16:17
openstackgerritMerged openstack/openstack-zuul-jobs master: Switch project-config to promote jobs [5]: Update specs, cleanup  https://review.opendev.org/68221516:17
spotzthanks ttx!16:18
fungiliuyulong: is addr_gen_mode being set to 0 too?16:19
clarkbliuyulong: fungi it is also possible that network manager configures it separately from sysctl16:19
fungi(per the document haleyb linked above)16:19
liuyulongfungi, sure, let me test it again to see if it has race...16:19
*** kjackal has joined #openstack-infra16:20
fungiclarkb: oh, yep there's that as well, we've seen kernel interface autoconfiguration and network-manager race each other to decide who gets to set the interface's v6 addresses16:20
*** zzehring has joined #openstack-infra16:22
liuyulongThe naked eye cannot distinguish...It is too quick...16:23
*** gmann is now known as gmann_afk16:23
liuyulongnet.ipv6.conf.eth3.use_tempaddr is 0 all the time before the address added.16:24
haleybliuyulong: and NM hasn't set addr-gen-mode or ip6-privacy?  https://developer.gnome.org/NetworkManager/unstable/settings-ipv6.html says it defaults the first to privacy16:27
liuyulong[connection]16:28
liuyulongipv4.route-metric=016:28
liuyulongipv6.method=auto16:28
liuyulongipv6.ip6-privacy=016:28
liuyulongipv6.addr-gen-mode=016:28
liuyulongaddr-gen-mode NM_SETTING_IP6_CONFIG_ADDR_GEN_MODE_EUI64 (0)16:28
*** goldyfruit_ has joined #openstack-infra16:28
liuyulongip6-privacy  -1: unknown, 0: disabled, 1: enabled (prefer public address), 2: enabled16:29
liuyulongThese are the settings in /etc/NetworkManager/NetworkManager.conf16:29
clarkbliuyulong: haleyb fungi as a sanity check liuyulong may want to boot our centos 7 image and see if it fares any better?16:29
*** kopecmartin is now known as kopecmartin|off16:29
openstackgerritMerged openstack/project-config master: Symlink /opt/git to /opt/git/opendev.org  https://review.opendev.org/68293416:30
clarkbliuyulong: https://nb02.openstack.org/images/centos-7-0000054888.qcow2 there are md5sums and sha256sums in that dir too16:30
liuyulongclarkb, cool16:30
clarkbglean will configure the root user with your supplied ssh key16:30
clarkbliuyulong: on my desktop the nm gui configurator tool does have options for enabling privacy extensions16:33
clarkbso it could be that nm is doing this independent of your sysctl values16:33
liuyulongclarkb, yes, nmcli connection show reports that the real value does not match the "NetworkManager.conf".16:34
*** dtantsur is now known as dtantsur|afk16:38
*** derekh has quit IRC16:41
liuyulongThank you guys, the image download is a bit slow. I will update my testing here.16:45
liuyulongIt's 0:45 in Beijing now. I have to go to bed now.16:45
*** liuyulong has quit IRC16:47
*** tosky has quit IRC16:49
*** e0ne_ has quit IRC16:50
mnaserout of curiosity16:52
mnaserhows nodepool coping with centos 7.716:52
*** tesseract has quit IRC16:53
fungii'm not sure we know yet16:55
clarkbmnaser: haven't seen any complaints yet, but also haven't checked that we've built and uploaded it anywhere either16:55
mnaseryeah i wonder if its just been failing to build for a while :P16:55
clarkbwell the image I linked liuyulong to is from today16:55
mnaseroh ok16:56
clarkbabout 10 hours ago16:56
mnaserso at least we know they're still building16:56
clarkbya should be building fine16:56
mnaserbut maybe mirrors havent syncd yet too16:56
mnaserthe infra ones at least16:56
fungiyeah, here's the image build log: https://nb02.openstack.org/centos-7-0000054888.log16:56
pabelangerI'm seeing mirror issues in zuul.a.c for centos-7, but haven't dug into it yet.16:56
fungi2019-09-18 06:24:22.626 | Build completed successfully16:56
fungiso we started uploading it to providers around 10.5 hours ago16:57
mnasersudo rpm --dbpath=/var/lib/rpm --root /opt/dib_tmp/dib_build.am3pYeSd/mnt --nodeps -ivh /tmp/tmp.IFHWiSygY6/basesystem-10.0-7.el7.centos.noarch.rpm /tmp/tmp.IFHWiSygY6/centos-release-7-7.1908.0.el7.centos.x86_64.rpm /tmp/tmp.IFHWiSygY6/filesystem-3.2-25.el7.x86_64.rpm /tmp/tmp.IFHWiSygY6/setup-2.8.71-10.el7.noarch.rpm16:57
mnaserit is installing centos-release-7.7.190816:57
mnasercool!16:57
*** xenos76 has quit IRC16:57
openstackgerritMerged zuul/zuul master: Add support for the Gerrit checks plugin  https://review.opendev.org/68077816:59
fungimnaser: http://zuul.opendev.org/t/openstack/build/0a18d82c6ceb495ba6b1d1e1b9225f69/log/logs/ansible/facts-all.log.txt.gz#1215-122117:04
fungiyep17:04
mnaserwewt prefect17:04
fungithat build was a successful centos-7 job for openstack-ansible btw17:05
fungijust grabbed at random from the zuul builds page17:05
mnaserwho else but us and tripleo mostly using centos-7 :p17:06
fungiright, and i wasn't 100% sure a tripleo job would provide a good example17:06
fungiso i filtered by openstack-ansible17:06
openstackgerritMerged zuul/zuul master: Update gerrit pagination test fixtures  https://review.opendev.org/68211417:08
*** jamesmcarthur_ has joined #openstack-infra17:09
*** jamesmcarthur has quit IRC17:12
*** jpena is now known as jpena|off17:19
*** whoami-rajat has joined #openstack-infra17:20
*** armax has joined #openstack-infra17:26
*** ricolin has quit IRC17:27
*** armax has quit IRC17:31
openstackgerritMerged openstack/project-config master: Switch project-config to promote jobs [6]: Remove publish-openstack-specs  https://review.opendev.org/68221617:31
AJaegerconfig-core, a couple of open changes for review, please: https://review.opendev.org/682639 https://review.opendev.org/680830 https://review.opendev.org/682613 https://review.opendev.org/#/c/682581/17:33
*** yamamoto has joined #openstack-infra17:33
AJaegerclarkb: what's the status of https://review.opendev.org/#/c/681322/ ? did we restart Zuul with the dependency merged and thus can merge the cleanup playbook?17:34
clarkbAJaeger:  on https://review.opendev.org/#/c/680830/ I asked pabelanger if he wanted to use one of the popular galaxy roles for ES instead and didn't really get a firm answer either way. if we can confirm that using one of those is undesirable then we should go ahead and merge it ( that's mostly me just double checking stuff and from an opendev perspective the change is fine )17:35
clarkbAJaeger: good question I believe we did restart with the necessary change. I'll rerun my test now17:35
*** iurygregory has quit IRC17:35
openstackgerritMerged zuul/zuul master: Support HTTP-only Gerrit  https://review.opendev.org/68193617:35
*** rkukura has joined #openstack-infra17:35
AJaegerclarkb: ok, let's wait for pabelanger to confirm.17:36
pabelangerclarkb: AJaeger: yah, ones on galaxy are too aggressive in what they configure. Looking to create a very thin shim to package install / config file laydown17:36
*** zbr|ruck is now known as zbr17:36
clarkbpabelanger: can you check the comment on that change I noticed one other small thing you should confirm or update before we merge17:37
pabelangersure17:38
*** yamamoto has quit IRC17:38
openstackgerritPaul Belanger proposed openstack/project-config master: Add windmill-ops / ansible-role-elasticsearch  https://review.opendev.org/68083017:40
*** armax has joined #openstack-infra17:42
*** jtomasek has quit IRC17:43
openstackgerritPaul Belanger proposed openstack/project-config master: Set requireContributorAgreement false for windmill roles  https://review.opendev.org/68295417:44
AJaegerclarkb: see what you triggered ;) ^17:45
clarkbAJaeger: :P17:46
denaitreHey everyone! I am quite new regarding Zuul and the OpenStack testing process, and I hope this chan is the right place to ask my questions. I would like to know how many times OpenStack is deployed per day/week by the CI process. If I understood correctly, OpenStack is deployed with devstack for each integration test. Given that nearly 2500 changes are committed each week, I guess OpenStack is deployed a large number of times. Is it possible to get this number from Zuul? Maybe by investigating the parent job http://zuul.openstack.org/job/devstack-base ?17:47
openstackgerritMerged openstack/project-config master: Update gentoo image to current profile  https://review.opendev.org/68263917:47
*** e0ne has joined #openstack-infra17:48
fungidenaitre: there are many different ways openstack can be deployed by jobs (devstack, openstack-ansible, kolla/loci, openstack-helm, tripleo, puppet-openstack...) so that complicates identifying a solid count17:49
*** e0ne has quit IRC17:49
mordredyeah. but - the data should ultimately be able to be mined :)17:49
openstackgerritClark Boylan proposed zuul/zuul-jobs master: DO NOT MERGE test cleanup phase playbook  https://review.opendev.org/68017817:49
fungii agree, it would just take a fair amount of knowing those sorts of details17:49
clarkbI think the last time I counted devstack-gate executions it was on the order of 10k/day17:50
mordredyeah. that said - taking a first stab at finding all of the jobs that have the devstack base job as a base job in their hierarchy17:50
clarkbbut this was a couple years ago so things may have changed17:50
AJaegerteam, FYI, specs publishing works http://specs.openstack.org/openstack/keystone-specs/ just got updated via promote job. So, all those changes are tested and look fine.17:50
mordredwell - actually - you'd want devstack and devstack-gate base jobs (to get modern and legacy versions)17:50
mordredit wouldn't be _perfect_ - but it would be a good first number17:51
mordredthen adding the list of base jobs like fungi mentions as a second pass to improve the numbers17:51
clarkbdenaitre: mordred fungi one way to get a quick count is to ask logstash for an indicator of a deployment. With the right string you'll be able to do it for all the deployment methods (though I have no idea if such a string exists)17:52
mordredclarkb: oh - that's a good idea17:52
mordredclarkb:  I mean - you could start by asking logstash for invocations of "./stack.sh"17:53
mordredthat would get all the devstacks17:53
clarkbmordred: ya though multinode jobs run stack.sh multiple times per job :/17:53
mordredthat's a good point17:53
mordredthere's probably some initial keystone bootstrap that always happens but only happens once17:53
mordredkmalloc cmurphy: ^^ ?17:53
fungithen there's the problem of defining "openstack" for purposes of this analysis. not all jobs deploy the same combinations of services... does a functional test job which just starts one or a handful of services count as an openstack deployment?17:54
openstackgerritMerged openstack/project-config master: Enable Review-Priority flag for Blazar  https://review.opendev.org/68261317:54
openstackgerritMerged openstack/project-config master: Fix release ACL for Kayobe  https://review.opendev.org/68258117:54
denaitresounds great! and I just found the link to logstash in the doc17:55
mordredfungi: I'd argue that, for the purposes of understanding how many times a week the CI system installs openstack, any combination of keystone + a catalog + a service would likely be good enough to count as the CI system doing such work17:55
clarkbdenaitre: note we have a hole in our data for the ~3 days prior to about 24 hours ago17:55
clarkbdenaitre: there was an elasticsearch outage17:55
fungimordred: yeah, that would be a reasonable definition17:56
clarkbbut data from about 24 hours ago through the present and forward should be accurate17:56
mordredfungi: I agree with you in the more specific sense17:56
*** ociuhandu has quit IRC17:56
fungii mainly wanted to point out that methodology makes a big difference on what you'll ultimately be counting17:56
mordredfungi: and a more refined report might want to show a rollup report reporting total number of "clouds" deployed, then breakdowns of the set of services in each "cloud"17:56
*** sshnaidm is now known as sshnaidm|bbl17:57
mordred"10k clouds, 9k clouds with keystone, nova, neutron, cinder, glance, 4k clouds with keystone, nova, neutron., cinder, glance, octavia ..."17:57
*** armax has quit IRC17:57
mordreddenaitre: good job of nerdsniping us to all ponder this :)17:58
* kmalloc looks17:59
clarkbAJaeger: corvus I think 681322 is ready http://paste.openstack.org/show/777443/ shows my test results (I'll leave that on the change too)17:59
openstackgerritMerged zuul/zuul master: Add autogenerated tag to Gerrit reviews  https://review.opendev.org/68247317:59
kmallocmordred: bootstrap should also be idempotent and relatively fast if the data exists18:00
*** armax has joined #openstack-infra18:00
kmallocbut bootstrap should also be light-weight... ideally devstack should know if keystone is bootstrapped up18:00
clarkbmordred: kmalloc: fwiw its incredibly slow with osc. That accounts for about 5-7 minutes of every devstack job18:00
kmallocand just never run rather than the get-or-create method18:00
mordredkmalloc: in general would bootstrap be a decent thing to troll for in logstash logs to find instances of "I installed an openstack" ?18:00
*** armax has quit IRC18:00
kmallocsure.18:01
mordredkmalloc: and it should be a thing that would be run by non-devstacks too, yeah?18:01
kmallocclarkb: right, i think stack.sh is inefficient vs keystone's manage-bootstrap mechanism.18:01
*** e0ne has joined #openstack-infra18:01
kmallocstack.sh does the get-or-create method over and over, which is slow(ish)18:01
*** priteau has quit IRC18:01
kmallocwe might need to write out some state stuff ... or something else to know things have been done and can just reload the info cleanly18:01
clarkbif keystone knows of a magical way to make that better I'm sure we would all be happy for patches18:02
kmallocit's def. not magical18:02
mordredclarkb: we had a discussion with johnsom yesterday about an issue octavia had with their devstack plugin due to an output format change for complex data from osc which they parse with awk and whether or not it would be a good idea to replace their whole plugin with python using sdk18:02
kmallocit's a lot of re-writing18:02
clarkbbut bnemec has been poking at using the osc as a service idea18:02
mordredclarkb: because they have a bunch of complex logic18:02
AJaegerclarkb: nice18:02
clarkbosc as a service can reuse tokens and remember some data so it helps18:03
kmallocyeah, it might help a bunch18:04
spotzfungi can I get topic permissions on openstack-uc please?18:05
denaitreclarkb: ok thanks for the warning18:05
mordredclarkb: https://opendev.org/openstack/octavia/src/branch/master/devstack/plugin.sh#L391-L39918:05
mordredenjoy line 39918:05
kmallocmordred: my eyes are bleeding18:05
johnsomIt isn't that bad... lol18:06
clarkbmordred: fwiw my sdk rewrite wasn't small, but ya I think in many ways it's more maintainable18:06
* AJaeger needs a larger monitor or two18:06
fungispotz: done, i gave you the same permissions in #openstack-uc as mrhillsman has18:06
mordredjohnsom: :)18:07
spotzfungi: Thank you Sir just got pointed out the unupdated meeting time is there18:07
bnemecI'm a bit hung up on figuring out how to disable the osc service hack when testing osc changes.18:07
kmallocjohnsom: well i guess it matters what you compare it to18:07
bnemecI briefly looked at the dependency resolution code in one of the other roles and it was...daunting.18:07
kmalloci've written worse in my career...usually in perl18:08
mordredclarkb: that's this yeah? https://review.opendev.org/#/c/673018/ <-- kmalloc, johnsom18:08
johnsomI pasted a readable version for the reviewers. That is a bit "minimized".18:08
clarkbmordred: johnsom kmalloc the child https://review.opendev.org/#/c/673108/6 is much more interesting18:08
clarkbbnemec: did you see my comment on the mailing list about using a proxy executable called openstack that uses osc as is or the service depending on what is being executed?18:09
clarkbbnemec: if you did that you could also have a flag to pass through to real osc when testing osc18:09
clarkbor maybe it was on the change. I know I posted that idea somewhere18:09
bnemecclarkb: I think I already did that. The problem is figuring out when to disable the service.18:10
bnemecIf osc or any of its deps are anywhere in the depends on for the change we shouldn't use the service.18:10
clarkbbnemec: that info is in the zuul dict in the jobs iirc18:10
bnemecI'm sure it's doable, it's just not trivial and I haven't had the time to really dig into it.18:11
clarkbif 'openstack/openstackclient' in zuul.projects: type of condition18:11
*** ociuhandu has joined #openstack-infra18:11
corvusclarkb: 681322+318:12
bnemecIt needs to be all of the deps too though.18:12
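The condition clarkb describes, extended to cover dependencies as bnemec asks, might look like the Python sketch below. It assumes the job's `zuul.projects` variable is available as a dict whose values carry a `name` field; the set of watched repositories is an illustrative, incomplete assumption, and the function name is hypothetical.

```python
# Assumed, incomplete list of repositories whose presence in the job
# should force the real osc CLI instead of the long-running service.
OSC_RELATED = {
    "openstack/python-openstackclient",
    "openstack/osc-lib",
}


def should_bypass_osc_service(zuul_projects, watched=OSC_RELATED):
    """Return True if any watched project appears in the job's projects.

    zuul_projects is expected to look like the zuul.projects job
    variable: a dict keyed by canonical name whose values include "name".
    """
    names = {info.get("name") for info in zuul_projects.values()}
    return bool(names & set(watched))


example = {
    "opendev.org/openstack/devstack": {"name": "openstack/devstack"},
    "opendev.org/openstack/osc-lib": {"name": "openstack/osc-lib"},
}
print(should_bypass_osc_service(example))  # True
```

Whether `zuul.projects` or the list of changes under test is the right input is exactly the open question in the discussion; the sketch only shows the shape of the check.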
clarkbcorvus: tyty18:12
bnemecAlthough maybe we could get around that by restarting the service to pick up test versions of the deps after they get installed.18:13
clarkbbnemec: well any python imports would just work because they'd find the new code via the python path18:13
clarkbthe only thing we have to worry about is cli invocation right?18:13
bnemecI was hoping to avoid that rabbit hole for the initial version, but maybe I can't.18:13
clarkbI guess the worry is that if there is a break to the cli input/output (like what johnsom is running into) then we'd not catch any of those cases18:14
bnemecRight now osc is installed from pip so it gets pip deps too.18:14
bnemecWhich ianw correctly noted means we might not actually test a change if it was to one of the things pulled in by the osc pip install.18:15
clarkbwe are still running osc though right?18:15
clarkbwe are only changing the interface to interact with osc18:15
*** ociuhandu has quit IRC18:16
openstackgerritMerged openstack/project-config master: Add windmill-ops / ansible-role-elasticsearch  https://review.opendev.org/68083018:18
*** xenos76 has joined #openstack-infra18:19
kmalloci think going to a single instance of OSC per standup vs the package_resource_load (it's expensive... for sure) a bazillion times is worth it18:19
openstackgerritMerged opendev/base-jobs master: Add cleanup playbook to all base jobs  https://review.opendev.org/68132218:19
kmallocit should absolutely speed things up18:19
openstackgerritMerged zuul/zuul master: Use robot_comments in Gerrit  https://review.opendev.org/68248718:19
kmallocthe only other thing i would consider is writing out a state and sourcing it in/exporting env for the values set, so even the OSC bits aren't needed once it's been run once for keystone.18:20
kmallocbasically run once, save somewhere, if values are set trust the values.18:20
bnemecI guess my concern is that as it works right now, if someone pushes a patch to osc-lib we won't actually use that test patch in the service because it uses a pip installed version.18:21
bnemecUnless we restart the service after installing the osc-lib patch being tested, which is something we'll need to do for projects that use a plugin to osc anyway.18:22
clarkbbnemec: I see. Are you changing how osc is installed in the change then?18:23
clarkbor is that already a problem?18:23
bnemecNo, I added a pip install of osc before running devstack so I could start the service in Ansible.18:23
bnemecBut maybe I should move that into devstack itself and just have the Ansible set a flag.18:24
clarkbya I would expect devstack to do that (it is already doing it for regular osc cli interaction)18:24
clarkbit could be the first "service" it installs18:24
*** kjackal has quit IRC18:31
*** ociuhandu has joined #openstack-infra18:33
*** openstackgerrit has quit IRC18:37
*** ociuhandu has quit IRC18:38
*** goldyfruit___ has joined #openstack-infra18:38
*** goldyfruit_ has quit IRC18:41
clarkbfwiw there were 863 completed devstack runs on a host called "controller" in the last hour or so18:41
clarkbI've asked for the count for the last 24 hours but I've given logstash a sad doing so18:42
clarkbbut 863 in an hour makes me think we are still in at least the 10k/day range18:42
clarkbonce you factor in daily work rhythm and such18:42
*** mattw4 has quit IRC18:44
clarkboh wait was that for 6 hours ugh logstash slowness making it hard to know what you got results for and what you didn't get results for18:46
* clarkb keeps fiddling with it18:46
clarkbok ya its 1550 ish per 12 hours so 863 must've been 6 hours18:48
clarkb24 hours now returns and says 356618:48
clarkbjust devstack jobs then is 3.5k/day18:49
clarkbmessage:"controller | DevStack Component Timing" AND filename:"job-output.txt"18:49
clarkbthat is my query18:49
clarkbdenaitre: ^18:49
clarkbthen you need to count tripleo, osa, etc18:49
*** jcoufal has quit IRC18:56
clarkbmessage:"export TOCI_JOBTYPE=" AND filename:"job-output.txt" for tripleo gives us another 85718:57
*** lucasagomes has quit IRC18:58
clarkbmessage:"Bootstrap the OSA AIO" AND filename:"job-output.txt" for OSA another 27718:59
clarkbballpark figure at least 4.7k jobs per day18:59
clarkbif the last 24 hours are representative18:59
AJaegerclarkb: do legacy jobs use controller as well?19:03
clarkbAJaeger: oh good question. I don't think they do. They use primary | iirc19:04
* clarkb checks19:04
clarkbhrm that string doesn't exist I'll need to find a legacy job log I can read19:05
AJaegerit should be primary19:06
AJaegerclarkb: http://zuul.opendev.org/t/openstack/build/a2e91aa3d5d44768b63f7ba067209ef1/log/job-output.txt#5319:06
clarkbthanks19:07
clarkbmessage:"primary | Mapping projects to workspace" AND filename:"job-output.txt" should capture d-g and that shows another 115719:12
clarkbtakes us to 5.8k range19:12
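The running total can be reproduced from the four 24-hour counts quoted above. A small Python tally follows; the dictionary labels are informal descriptions of each logstash query, not official job names.

```python
# 24-hour hit counts reported in the channel for each logstash query.
DAILY_COUNTS = {
    "devstack (controller | DevStack Component Timing)": 3566,
    "tripleo (export TOCI_JOBTYPE=)": 857,
    "openstack-ansible (Bootstrap the OSA AIO)": 277,
    "devstack-gate legacy (primary | Mapping projects to workspace)": 1157,
}

total = sum(DAILY_COUNTS.values())
for label, count in DAILY_COUNTS.items():
    print(f"{count:5d}  {label}")
print(f"{total:5d}  total")  # 5857
```

The first three counts add up to 4700, matching the "at least 4.7k" figure, and the legacy devstack-gate jobs bring it to 5857. The number is only as representative as the 24-hour window it was sampled from, and other deployment methods are not counted.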
mordredclarkb: you got nerdsniped good19:14
clarkbmordred: yup. But it's also useful show-off data so figured was worth poking at a bit19:14
mordredagree19:14
mordredit's a good problem to dig in to19:15
*** mriedem has quit IRC19:15
AJaegerinfra-root, I documented the "clean check" requirement in the infra-manual, enhancement suggestions welcome: https://review.opendev.org/68293319:15
*** mriedem has joined #openstack-infra19:16
AJaegerinfra-root, https://review.opendev.org/#/c/681353/1 documents our renames from Monday, please approve for bookkeeping19:17
*** openstackgerrit has joined #openstack-infra19:18
openstackgerritMerged opendev/project-config master: Add record of 20190916 gerrit project renames  https://review.opendev.org/68135319:18
AJaegerthanks, mordred ^19:18
mordredAJaeger: I think the manual patch looks good - I left a +2- feel free to land it - or keep it open if you want more comments19:19
clarkbinfra-root https://review.opendev.org/#/c/682732/ is also related to our recent renames if you want to take a look19:19
clarkbmordred: corvus ^ in particular since you were involved with the change this restores back to19:19
AJaegermordred: I'll +A tomorrow in case others want to improve. thanks19:20
clarkbNot sure I called it out in the meeting yesterday but my journey to ansiblefest actually starts on friday so I'll be out starting then19:21
clarkbI'm making a pitstop to attend a wedding on my way19:21
mordredclarkb: are you going to a random wedding?19:22
clarkbnot a random one no19:22
mordredok. I mean - I kind of like the idea that you have a hobby of stopping by weddings, so you figured you'd catch one on the way to ansiblefest ...19:22
*** eernst has joined #openstack-infra19:23
*** ociuhandu has joined #openstack-infra19:24
corvusyeah, i was waiting for an xkcd "my hobby..."19:29
openstackgerritAndreas Jaeger proposed openstack/project-config master: Remove obsolete docs publish jobs  https://review.opendev.org/68297219:34
*** prometheanfire has quit IRC19:36
corvusclarkb: comment on the setup-org change19:37
*** prometheanfire has joined #openstack-infra19:37
clarkbcorvus: good idea I'll do that19:38
*** armax has joined #openstack-infra19:38
*** eharney has quit IRC19:39
*** ociuhandu has quit IRC19:40
openstackgerritClark Boylan proposed opendev/system-config master: Restore setup-org.yaml  https://review.opendev.org/68273219:41
clarkbcorvus: mordred ^ that better?19:41
openstackgerritDavid Shrewsbury proposed zuul/nodepool master: Reduce upload threads in tests from 4 to 1  https://review.opendev.org/68297719:41
* clarkb finds lunch19:41
openstackgerritAndreas Jaeger proposed openstack/project-config master: Remove obsolete docs publish jobs  https://review.opendev.org/68297219:42
*** tosky has joined #openstack-infra19:43
*** jbadiapa has quit IRC19:44
openstackgerritAndreas Jaeger proposed openstack/project-config master: Remove obsolete docs publish jobs  https://review.opendev.org/68297219:46
*** armax has quit IRC19:47
kmallocoh i ... i think that should be a thing: Random wedding stopping on the way to ansiblefest19:49
*** mattw4 has joined #openstack-infra19:50
mordredkmalloc: I think I hear you suggesting that everyone who is going to ansiblefest should stop by a random wedding19:51
mordredthat wouldn't be a weird tradition to start at all19:52
*** sshnaidm|bbl is now known as sshnaidm19:52
*** Goneri has quit IRC19:52
kmallocthat is the correct interpretation of what I'm saying19:53
kmallocfor sure!19:53
*** ralonsoh has quit IRC19:56
*** jbadiapa has joined #openstack-infra19:57
* dtroyer isn't going to ansiblefest so will settle for an engagement party19:58
dtroyerWhile I'm here, I need a bit of assistance with an Intel 3rd party CI account in Gerrit…19:58
dtroyerI'm looking for the email address for intel_zuul in Gerrit19:59
fungithe gerrit rest api will let you query that, anonymously even20:02
fungii'd look it up but i'm about to disappear to grab dinner20:02
dtroyeraha!  teaching me to fish… no worries, thanks fungi20:02
fungimmm, now i want fish20:02
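The REST lookup fungi points to could be sketched as follows in Python. It assumes Gerrit's account query endpoint with the `username:` search operator and the `DETAILS` option; whether an email address is actually returned depends on the server's configuration and the account's visibility. The helper names are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Gerrit prepends this line to JSON responses to guard against XSSI.
GERRIT_XSSI_PREFIX = ")]}'"


def account_query_url(base, username):
    """Build an account query URL for one username (hypothetical helper)."""
    query = urlencode({"q": f"username:{username}", "o": "DETAILS"})
    return f"{base}/accounts/?{query}"


def parse_gerrit_json(body):
    """Strip the XSSI guard line, then decode the JSON payload."""
    if body.startswith(GERRIT_XSSI_PREFIX):
        body = body[len(GERRIT_XSSI_PREFIX):]
    return json.loads(body)


def lookup_account(base, username):
    """Fetch matching accounts anonymously; performs a network request."""
    with urlopen(account_query_url(base, username)) as response:
        return parse_gerrit_json(response.read().decode("utf-8"))


if __name__ == "__main__":
    for account in lookup_account("https://review.opendev.org", "intel_zuul"):
        print(account.get("username"), account.get("email"))
```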
*** larainema has quit IRC20:06
*** sthussey has joined #openstack-infra20:07
* jrosser wonders again about the debian buster updates repo.....20:08
*** pcaruana has quit IRC20:10
*** jamesmcarthur_ has quit IRC20:12
*** jamesmcarthur has joined #openstack-infra20:17
*** jamesmcarthur has quit IRC20:17
*** jamesmcarthur has joined #openstack-infra20:18
rpiosoPlease remind me. Does a clean rebase of a change remove its existing approvals to merge?20:26
clarkbrpioso: yes20:29
rpiosoclarkb: Thank you!20:29
clarkbjrosser: I'm not sure where we ended up with that. The job was definitely using the updates repo20:29
clarkbfungi was looking at reprepro further20:30
clarkbjrosser: I don't know much about apt's repo format or why packages may be superseded or not so will have to defer to others. The entire config for reprepro is in config management (and I can dig up links to that if people want) and the mirrors themselves are publicly accessible (they shouldn't be used or relied on outside of CI but you can poke around them to see how they are set up)20:31
jrosseryes thats right - i couldnt see anything particularly wrong with the job setup20:32
*** xenos76 has quit IRC20:32
openstackgerritJames E. Blair proposed zuul/zuul-jobs master: RFC: Generic cache implementation  https://review.opendev.org/68299220:38
openstackgerritJames E. Blair proposed zuul/zuul-jobs master: RFC: Generic cache implementation  https://review.opendev.org/68299220:39
clarkbjrosser: comparing http://ftp.us.debian.org/debian/pool/main/e/exim4/ to http://mirror.dfw.rax.openstack.org/debian/pool/main/e/exim4/ it seems we've got the right packages (latest for each release)20:40
clarkbI think20:40
* clarkb downloads the contents file20:40
clarkber I want the packages file20:41
*** eharney has joined #openstack-infra20:43
*** goldyfruit___ has quit IRC20:44
clarkbneither the buster-updates nor the buster-backports packages files seems to point at the up to date exim4-base package and the package that buster proper points to isn't present20:44
clarkbis the problem that we need to be mirroring a buster security repo too maybe?20:45
clarkbexim4 had updates due to security issues /me pokes around in upstream mirror20:45
*** Goneri has joined #openstack-infra20:45
clarkbthe upstream mirror points to the package files we do have in its packages file20:47
clarkbso ya reprepro must be producing an invalid packages file20:47
clarkbanyone know about reprepro and where to start looking for why that may happen?20:47
corvusinfra-root: i'd like to restart zuul and, well, frankly see if it can still talk to gerrit after all the changes to the gerrit driver which just landed20:52
corvusi think we should do a full restart, just because i think the executors and mergers might switch from using ssh to http20:53
corvusfor git ops20:53
corvusso basically, the plan would be a full restart, watch some scheduler/executor logs, and if things don't work, reinstall from an older commit on all systems and restart again20:54
corvusi'll identify the revert commit now20:54
*** ociuhandu has joined #openstack-infra20:56
corvusactually, the revert commit should be the 3.10.2 tag.  so if things go poorly, i'll just reinstall that tag.20:56
*** goldyfruit___ has joined #openstack-infra20:56
corvusoh, we might be able to do this in multiple phases -- we can probably restart a small number of mergers first and make sure they work okay, then do all the mergers...  if those work out okay, then proceed to do the whole system.20:57
clarkbcorvus: can probably do the executors in a similar way?20:58
corvusclarkb: yeah, but i think the mergers would be sufficient test coverage for that20:58
corvusso i think we can skip it and not lose anything20:58
clarkbfwiw our reprepro options have export=changed set. reading the man page that seems to mean export the package indices for dists that have changed. Which I think means reprepro should've updated those package files20:58
clarkbthe reprepro log shows when the package we point at was replaced, but I don't see anything in the months of logs there about it updating packages.gz21:00
*** signed8bit has quit IRC21:01
*** signed8bit has joined #openstack-infra21:02
corvusi've stopped all mergers and restarted zm0121:07
corvusit hasn't picked up a job yet :/21:07
corvusapparently most of what our mergers do is fetch ansible refs21:08
*** eernst has quit IRC21:09
jrosserclarkb: thankyou for looking at it, i need to !computer now, will read back tomorrow if anything turns up21:10
clarkbjrosser: I'm quickly out of my depth here. Reprepro's behavior seems to need an understanding of apt's esoteric options and behaviors21:11
openstackgerritMerged opendev/system-config master: Restore setup-org.yaml  https://review.opendev.org/68273221:11
clarkbwe could try changing export=always21:11
clarkbmaybe?21:12
clarkbcorvus: is it working then?21:16
corvuszm01 just handled an openstack job and didn't log an error so... yes i think?21:18
corvusi think i want to see it handle one more job21:19
clarkbI guess the executors could've been grabbing the jobs too?21:19
clarkb(that would explain why zuul seems to be running jobs just fine)21:19
corvusyeah, and merger jobs also tend to come in waves21:19
Shrewscorvus: do you need to restart the scheduler?21:21
corvusShrews: yes, that has the biggest changes21:21
*** Goneri has quit IRC21:22
*** goldyfruit_ has joined #openstack-infra21:22
Shrewscorvus: then maybe wait until the autohold stuff all lands? guess it isn't necessary, but since it's in flight...21:23
corvusShrews: well, it'll take a while to get installed; i'd likely eod before i'm able to do that, so i was thinking we'd restart for that later in the week21:23
Shrewscorvus: yah, that's fine then21:23
*** goldyfruit___ has quit IRC21:25
*** Goneri has joined #openstack-infra21:26
corvusokay i give up waiting; i'm assuming that one job without errors is good enough.  i'll restart the whole shebang now.21:27
EmilienMclarkb: I have a short update on our outage21:27
EmilienMclarkb: we have identified the root cause and we have a few patches that we would need to land, on different repos21:28
clarkbEmilienM: ya I saw your email. Won't that bring back the authentication failures though?21:28
clarkb(also I thought cloudnull's change had been in place for a few weeks now but the problems you were seeing are much more recent?)21:28
EmilienMclarkb: no we don't want to revert that one yet21:28
EmilienMclarkb: we want to disable something in our testing, which isn't related to the docker.io thing21:28
clarkbah that is the third change you link21:28
clarkbhttps://review.opendev.org/#/c/682905/ that one21:29
EmilienMclarkb: could we exceptionally land https://review.opendev.org/#/c/683001/ now ?21:29
clarkbEmilienM: we can certainly enqueue and promote it to the gate21:29
EmilienMclarkb: I'm asking for a force merge21:29
clarkbEmilienM: why?21:29
clarkbcan you explain why that is necessary in this case?21:29
cjloadergates down?21:30
EmilienMit'll probably fail in gate21:30
EmilienMas we need other patches in tripleoclient21:30
clarkbusually force merges are reserved for unwedging dependencies between changes21:30
EmilienM2 actually21:30
EmilienMwe need https://review.opendev.org/#/c/682905/ and https://review.opendev.org/#/c/682943/ to land21:30
clarkbEmilienM: can we enqueue them both then?21:30
EmilienMwe need them 3 together..21:31
EmilienMthey passed check21:31
clarkbEmilienM: https://review.opendev.org/#/c/682943/ well that one sets the new behavior to what we want by default right?21:31
EmilienMwe would save hours...21:31
clarkbso if that one goes in first the others can follow?21:31
cjloaderhttps://zuul.opendev.org/t/openstack/status down....21:32
corvuscjloader: it's restarting21:32
cjloaderugh21:32
corvuscjloader: sorry?21:32
EmilienMclarkb: right but for that to be effective we need https://review.opendev.org/#/c/683001/21:32
clarkbEmilienM: I'm just trying to understand what the interdependencies are here. My read of the commit messages is that 682943 does what we want by default and it passes check21:33
cjloaderit was almost done with a job21:33
clarkbEmilienM: we need that even if disabled is the default?21:33
corvuscjloader: hang around, let's discuss this when i'm done with the maintenance21:33
EmilienMclarkb: right21:34
clarkbEmilienM: also we are now up to 4 changes we need and 2 of them are not approved. Why would force merging a single change help?21:34
EmilienMweshay: ^ can you help me here please21:34
weshayI'm here21:34
EmilienMI really want to leave my laptop for today21:35
weshayok..21:35
EmilienMwe need these patches merged, that's all I want21:35
weshayEmilienM,  you want to force merge those?21:35
EmilienMyes this is what I asked21:35
weshayok21:35
clarkbwell you asked to force merge one of them21:35
weshayWhat EmilienM said21:35
EmilienMit happens every 6 months21:35
clarkbbut apparently all 4 are necessary21:35
EmilienMI need https://review.opendev.org/#/c/683001/21:35
clarkband only 2 are approved21:35
EmilienMI just approved21:35
EmilienM682943 682905 and 68300121:36
clarkbso now I'm confused over what is actually necessary. It would be helpful if someone can explain that and why we can't sequence them and promote to the gate21:36
weshayok..21:37
corvusi'm seeing a problem with the re-enqueue process; we may need to revert21:37
weshaythis one can go in w/o any deps https://review.opendev.org/#/c/68300121:37
weshaytotally safe21:37
clarkbweshay: and if we enqueue that one at the top of the gate we don't expect it will pass on its own without the others?21:37
clarkbprobably because the others are what take action on that config change?21:38
weshaythis one is in the gate.. https://review.opendev.org/#/c/682905/ now .. and is safe21:38
EmilienMin theory yes they could be enqueued21:38
EmilienMbut will likely fail since 683001 isn't landed21:38
clarkbEmilienM: is the commit message wrong about the default being disabled then? or maybe these are two different flags that we are disabling?21:38
clarkb(reading the commit messages those two changes seem to be equivalent just in different locations)21:39
clarkbthey both disable validations21:39
mriedemshould i expect to be seeing this from zuul after the restart?21:39
mriedem"Unable to freeze job graph: 'dict_keys' object does not support indexing"21:39
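The error quoted above is the kind of TypeError Python 3 raises when code indexes the view returned by `dict.keys()`, which was a plain list on Python 2 (the exact wording of the message varies by Python version). The following is a minimal sketch of that failure and two common workarounds. It is not the zuul code or the eventual fix; `jobs` and `first_key` are hypothetical names chosen for illustration.

```python
def first_key(mapping):
    """Return the first key of a dict without indexing the keys view."""
    return next(iter(mapping))


jobs = {"build": "ok", "test": "pending"}  # hypothetical data

try:
    jobs.keys()[0]  # a list on Python 2, a non-indexable view on Python 3
    indexing_failed = False
except TypeError:
    indexing_failed = True

as_list = list(jobs.keys())[0]  # copy the view into a list, then index
as_iter = first_key(jobs)       # or take the first key lazily
```

Copying into a list is the smallest change when the code needs arbitrary positions; `next(iter(...))` avoids the copy when only the first key is wanted.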
clarkbmriedem: corvus just mentioned a problem with the requeue and that we may need to revert. I'm going to guess that is related21:40
EmilienMclarkb: one is to disable all validations (683001) and the other ones are to bind the new inflight validations which we now try to disable as well to the parameter set in 68300121:40
corvusmriedem: ack, thx21:40
*** panda has quit IRC21:41
clarkbEmilienM: ok so they are different sets of validations?21:41
*** mriedem is now known as mriedem_afk21:41
*** e0ne has quit IRC21:41
EmilienMclarkb: yes, that we try to configure at once21:42
EmilienMand we need to disable them21:42
clarkbEmilienM: and we think both of them cause the gate problems?21:42
EmilienMthey need to be reworked, they caused timeouts21:42
clarkb(eg disabling one or the other won't cause jobs to start passing)21:42
*** panda has joined #openstack-infra21:42
corvusrestarting scheduler on 3.10.221:42
corvusi'll leave the rest of the system on HEAD21:43
weshayclarkb, timeouts.. lots of things can contribute to timeouts21:43
clarkbweshay: right but is the impact of one of these sufficient to address the timeouts? If so it can be enqueued to the head of the gate and once merged should allow everything behind it to pass21:44
weshayhttp://dashboard-ci.tripleo.org/d/si1tipHZk/jobs-exploration?orgId=1&from=now-90d&to=now&fullscreen&panelId=1621:44
weshayno.. as EmilienM was saying.. we're trying to address it with a few patches.. that never should have been merged21:44
weshaysorry for the trouble21:45
weshayyou guys are probably busy atm21:45
EmilienMclarkb: could you please go ahead and merge https://review.opendev.org/#/c/683001/ https://review.opendev.org/#/c/682905/ and https://review.opendev.org/#/c/682943/21:45
*** nhicher has quit IRC21:46
*** nhicher has joined #openstack-infra21:47
clarkbEmilienM: weshay is this accurate for a message in gerrit for this "Force merging https://review.opendev.org/#/c/683001/ https://review.opendev.org/#/c/682905/ https://review.opendev.org/#/c/682943/ together to disable validations that cause jobs to timeout. Once disabled these can be added back in little by little as their functionality is verified" ?21:47
openstackgerritTristan Cacqueray proposed zuul/zuul master: Store a list of held node per held build in hold request  https://review.opendev.org/68246621:47
weshay+1 from .. well said21:48
EmilienMyes, +121:48
weshayme that is21:48
corvusre-enqueueing21:48
clarkbweshay: EmilienM ok I'll do that as soon as zuul restart is sorted21:49
cjloadercorvus: can we discuss?21:49
*** michael-beaver has quit IRC21:49
weshayaye.. bad timing21:50
weshaysorry21:50
EmilienMI need to step away for family stuff, I'll check later. Hopefully we can land that in the next hour then i'll look at the 401 errors21:51
corvusclarkb: re-enqueue is still in progress but otherwise i think we're stable; the first of EmilienM's changes is set to be re-enqueued in check, the second in gate, the third not.21:51
clarkbcorvus: ok I think I understand the problem well enough now and if indeed these tests are all contributing to timeouts then merging them to avoid them thrashing each other makes sense21:52
EmilienMcorvus: I haven't asked for a re-enqueue21:52
clarkbcorvus: I was planning to click the submit button on those three21:52
corvusEmilienM: i know, you're getting one anyway since that's the restart process; nothing i can do :)21:52
corvusEmilienM: clarkb can dequeue afterwards if he wants21:52
clarkbI'll do that now that you've said the startup is happy21:52
clarkbcorvus: ++21:53
corvuscjloader: yes.  our zuul is a very busy system, and it is nearly always on the verge of merging a change.  when we perform a restart, we take several things into consideration, including backlog and release activity.  but we can't wait until it's idle, because it never is.  instead, we save the queues and restore them so that, aside from some additional delay, it shouldn't be a big impact.  this21:55
corvusis further mitigated by zuul's dependency system which means you can continue to develop and even approve changes which depend on un-merged changes that are still in the queue.21:55
EmilienMcorvus, clarkb : thanks for the help today21:55
clarkbEmilienM: weshay the merging is done I'll work on dequeuing now21:55
EmilienM++ thanks again21:55
weshayrock on!21:55
corvusclarkb: gate re-enqueue is done; check in progress21:55
EmilienMwe can now focus on 401 and bring back our jobs21:55
cjloadercorvus: my initial +1 was seconds away from occurring on https://review.opendev.org/#/c/682446/ which occurred on a bad gateway21:57
clarkband now dequeues of those three changes are complete21:58
cjloaderI had already gotten approval from a core21:58
cjloadercorvus: I initiated the recheck, and was in the middle of "uploading swift logs"21:59
cjloaderon the last one21:59
cjloaderbut okay...22:00
*** eharney has quit IRC22:04
*** rlandy is now known as rlandy|bbl22:04
*** jamesmcarthur has quit IRC22:06
corvuscjloader: i agree that's bad timing.  but due to our scale, it is unavoidable that it will happen to some change in the system -- on average, at any given time, there are about 7 changes that zuul is 5 minutes or less away from completing.  but it needn't slow down your development process. if your work depends on that change, you can use git dependencies or the depends-on header to continue22:06
corvusbuilding on it.  if you would like to avoid that sort of thing entirely in the future, there is a work-in-progress spec for implementing a highly-available scheduler for zuul.  if you'd like to contribute to it, that would be welcome, i'd be happy to point you at it.22:06
*** jamesmcarthur has joined #openstack-infra22:09
cjloadercorvus: ok ty22:09
cjloadercorvus: sorry, i'm a bit frustrated because of all the blockers recently. I fully understand your point and i'll leave it at that.22:11
*** goldyfruit_ has quit IRC22:15
cjloadercorvus: that actually helped me out btw22:15
corvuscjloader: no worries.  we all have changes that have been sitting out there a while :)22:15
cjloaderall good now corvus -- https://review.opendev.org/#/c/680284/22:20
cjloaderused the Depends-On22:20
corvuscjloader: great!22:21
*** markvoelker has quit IRC22:24
*** JorgeFranco has quit IRC22:24
*** jamesmcarthur has quit IRC22:33
*** ociuhandu has quit IRC22:40
*** goldyfruit_ has joined #openstack-infra22:40
clarkbI've learned things from filing that gitea bug about timing stuff out22:41
clarkbthere already exist git command timeouts, but not all git commands use them22:41
*** rfolco has quit IRC22:43
clarkboh hrm if you don't specify a timeout it should make it to the default timeout which is 60 seconds22:44
clarkbso how do we end up servicing requests for hours?22:44
clarkbpossible the timeouts don't actually work or something else is eating the time22:46
*** jamesmcarthur has joined #openstack-infra22:50
clarkbreading the timeout code I think they should work22:51
clarkbthey rely on stdlib context.WithTimeout contexts and exec.CommandContext which should kill -9 a process once the context is marked done which happens when the timeout is reached22:51
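The mechanism described above (start a command under a deadline and kill it when the deadline passes) can be sketched in Python for illustration. This is an analogous example using Python's `subprocess` module, not gitea's Go code; `run_with_timeout` and the `slow`/`fast` commands are hypothetical stand-ins for a git invocation.

```python
import subprocess
import sys


def run_with_timeout(cmd, timeout_seconds):
    """Run cmd; return its exit code, or None if it was killed on timeout."""
    try:
        completed = subprocess.run(cmd, timeout=timeout_seconds)
    except subprocess.TimeoutExpired:
        # subprocess.run() has already killed and waited for the child here.
        return None
    return completed.returncode


# Hypothetical commands standing in for a slow and a fast git call.
slow = [sys.executable, "-c", "import time; time.sleep(30)"]
fast = [sys.executable, "-c", "pass"]
```

Calling `run_with_timeout(slow, 0.5)` returns `None` shortly after the deadline, while `run_with_timeout(fast, 10)` returns the command's exit code. On POSIX the kill is a SIGKILL, so a request that runs for hours would suggest the time is being spent somewhere the deadline does not cover.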
clarkbother things I've learned, we should have a 16 hour cache ttl on the objects we do cache22:53
openstackgerritJames E. Blair proposed zuul/zuul master: WIP: Fix gerrit errors from production  https://review.opendev.org/68300622:53
openstackgerritJames E. Blair proposed zuul/zuul master: DNM: Use http for all gerrit tests  https://review.opendev.org/68300722:53
*** mriedem_afk has quit IRC22:53
corvusclarkb: you mean that's the default cache ttl?22:55
clarkbcorvus: yes, but I think we've found we don't cache much22:55
clarkbthere are a number of defunct git processes on gitea01. i wonder if these are processes that hit those git timeouts22:58
clarkbI suppose they could be otherwise leaked because we don't have an init system in our docker container22:58
clarkbya their parent is gitea web22:58
clarkbI bet it needs to reap them22:58
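A defunct process is a child that has exited but whose parent has not yet collected its exit status. As a rough sketch of what reaping looks like, assuming a POSIX system, a parent can poll `os.waitpid` without blocking. This is a Python illustration, not gitea's code; `reap_exited_children` and `demo` are hypothetical names.

```python
import os
import subprocess
import sys
import time


def reap_exited_children():
    """Collect every child that has already exited; return their PIDs."""
    reaped = []
    while True:
        try:
            pid, _status = os.waitpid(-1, os.WNOHANG)
        except ChildProcessError:
            break  # this process has no children at all
        if pid == 0:
            break  # children exist, but none has exited yet
        reaped.append(pid)
    return reaped


def demo(count=2, deadline_seconds=10.0):
    """Spawn short-lived children without waiting on them, then reap them."""
    children = [subprocess.Popen([sys.executable, "-c", "pass"])
                for _ in range(count)]
    expected = {child.pid for child in children}
    reaped = set()
    give_up = time.monotonic() + deadline_seconds
    while reaped != expected and time.monotonic() < give_up:
        reaped.update(reap_exited_children())
        time.sleep(0.05)
    return reaped == expected
```

An init process in the container does the same job for orphaned processes that get reparented to PID 1; children whose parent is still alive have to be reaped by that parent.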
*** dchen has joined #openstack-infra22:59
*** tkajinam has joined #openstack-infra23:02
clarkbhttps://github.com/go-gitea/gitea/issues/491 has had interesting conversation in recent days too23:02
clarkbthere is a suggestion that adding commit graphs to repos will help, then testing that shows it doesn't help at all23:04
*** jamesmcarthur has quit IRC23:09
*** whoami-rajat has quit IRC23:10
*** rcernin has joined #openstack-infra23:16
*** mriedem has joined #openstack-infra23:17
clarkbtaking the idea of a config flag to not look for modified times, I think that is possible with a config flag that does not run the function to get that commit data and selects a different template23:19
*** goldyfruit_ has quit IRC23:24
*** goldyfruit_ has joined #openstack-infra23:29
*** jamesmcarthur has joined #openstack-infra23:34
*** sthussey has quit IRC23:35
*** tosky has quit IRC23:40
clarkbya, pretty sure we would need to update the templates otherwise we'll get errors doing this. But the code sort of already supports it by returning nil values on error. We could return nil values when disabled as well then handle them in the templates23:41
clarkbit's too late in the day for me to learn golang templating though23:41
clarkbI actually think the algorithm they use now goes through the entire repo history?23:44
*** efried_pto is now known as mriedem123:47
*** mriedem1 is now known as efried_pto23:47
