Friday, 2018-08-03

ianwpabelanger: all that job split up ready for review?  i'll look in on that too00:02
pabelangerianw: for base-minimal? yah, ready for some eyes00:02
*** hamzy has joined #openstack-infra00:09
*** tonyb has quit IRC00:16
*** dingyichen has joined #openstack-infra00:17
ianwyay, all working00:24
ianwi wonder how many people noticed that rtd publishing was failing, and then noticed they could hook it up to github, and now do a dance gerrit->github->rtd ping.  it bet it's !000:25
*** gyee has quit IRC00:36
*** edmondsw has joined #openstack-infra00:37
*** edmondsw has quit IRC00:42
*** jcoufal has joined #openstack-infra00:49
*** mriedem_afk has quit IRC01:34
*** hongbin has joined #openstack-infra01:44
*** ramishra has joined #openstack-infra02:00
*** yamamoto has joined #openstack-infra02:01
*** jcoufal has quit IRC02:05
*** ramishra has quit IRC02:08
*** yamamoto has quit IRC02:12
*** yamamoto has joined #openstack-infra02:18
tristanCcorvus: clarkb: isn't the logic of os-loganalyze (e.g. linkable timestamp) going to be implemented in the zuul dashboard?02:21
*** tonyb has joined #openstack-infra02:22
*** yamamoto has quit IRC02:23
*** edmondsw has joined #openstack-infra02:25
tristanCoh i see, the first step seems to be doing static HTMLification02:29
*** edmondsw has quit IRC02:30
tristanCthen either we add the reporting code to the htmlify role, either wait for the zuul dashboard enhancement02:30
*** psachin has joined #openstack-infra02:34
*** dave-mccowan has quit IRC02:35
*** yamamoto has joined #openstack-infra03:03
jheskethtristanC: small query in 550978 if you have time :-)03:07
*** yamamoto has quit IRC03:11
*** rlandy|bbl is now known as rlandy03:14
*** udesale has joined #openstack-infra03:31
*** yamamoto has joined #openstack-infra03:37
*** yamamoto has quit IRC03:41
*** yamamoto has joined #openstack-infra03:43
*** hongbin has quit IRC03:52
*** hwoarang has quit IRC03:52
*** yamamoto has quit IRC04:02
*** ramishra has joined #openstack-infra04:03
*** yamamoto has joined #openstack-infra04:03
*** mschuppert has joined #openstack-infra04:06
*** yamamoto has quit IRC04:14
*** yamamoto has joined #openstack-infra04:28
*** rlandy has quit IRC04:28
*** udesale has quit IRC04:35
*** viks_ has joined #openstack-infra04:37
*** yamamoto has quit IRC04:38
*** dklyle has quit IRC04:39
*** viks_ has quit IRC04:42
*** yamamoto has joined #openstack-infra04:43
*** yamamoto has quit IRC04:47
*** udesale has joined #openstack-infra04:55
*** yamamoto has joined #openstack-infra04:56
*** yamamoto has quit IRC04:58
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: web: add /{tenant}/job/{job_name} route  https://review.openstack.org/55097805:36
*** quiquell has joined #openstack-infra05:39
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: web: add /{tenant}/projects and /{tenant}/project/{project} routes  https://review.openstack.org/55097905:40
*** jaosorior has quit IRC05:40
*** jaosorior has joined #openstack-infra05:41
*** janki has joined #openstack-infra05:44
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: web: add /{tenant}/pipelines route  https://review.openstack.org/54152105:44
*** jrist has quit IRC05:49
*** xarses has joined #openstack-infra05:55
*** apetrich has joined #openstack-infra05:57
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: scheduler: add job's parent name to the rpc job_list method  https://review.openstack.org/57347306:01
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: web: add /{tenant}/labels route  https://review.openstack.org/55397906:01
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: web: add /{tenant}/nodes route  https://review.openstack.org/55399806:01
*** yamamoto has joined #openstack-infra06:03
*** jrist has joined #openstack-infra06:03
*** yamamoto_ has joined #openstack-infra06:07
*** yamamoto has quit IRC06:09
*** yamamoto_ has quit IRC06:10
*** jesusaur has quit IRC06:40
*** chason has joined #openstack-infra06:43
*** jesusaur has joined #openstack-infra06:45
*** ccamacho has joined #openstack-infra06:45
*** rcernin has quit IRC06:54
*** chason has quit IRC07:01
*** ginopc has joined #openstack-infra07:01
*** chason has joined #openstack-infra07:02
*** annp has quit IRC07:02
*** chason has quit IRC07:08
*** amoralej|off is now known as amoralej07:09
openstackgerritTristan Cacqueray proposed openstack-infra/zuul master: web: add /{tenant}/labels route  https://review.openstack.org/55397907:12
*** tosky has joined #openstack-infra07:26
openstackgerritSlawek Kaplonski proposed openstack-infra/project-config master: Move ironic-tempest job for Neutron to "in tree"  https://review.openstack.org/58818107:28
*** jtomasek_ has quit IRC07:29
*** jtomasek has joined #openstack-infra07:31
*** chason has joined #openstack-infra07:44
*** jpich has joined #openstack-infra07:53
*** Bhujay has joined #openstack-infra07:56
quiquellDo you know if we can 'curl' review.o.o to get the CI results of a change ?07:58
*** dtantsur|afk is now known as dtantsur08:01
*** tommylikehu is now known as tommylikehu208:02
*** tommylikehu2 is now known as tommylikehu08:03
*** derekh has joined #openstack-infra08:03
*** tommylikehu is now known as tommylikehu_afk08:04
*** Bhujay has quit IRC08:05
*** tommylikehu_afk is now known as tommylikehu08:09
*** jpena|off is now known as jpena08:20
*** vivsoni_ has quit IRC08:29
*** jiapei has joined #openstack-infra08:33
*** ginux has joined #openstack-infra08:34
*** ginux is now known as Guest1094508:34
*** xarses has quit IRC08:37
*** ginopc has quit IRC08:38
*** electrofelix has joined #openstack-infra08:49
*** jaosorior has quit IRC09:03
*** chason has quit IRC09:10
*** gfidente has joined #openstack-infra09:13
*** pbourke has joined #openstack-infra09:26
*** chason has joined #openstack-infra09:28
*** vivsoni has joined #openstack-infra09:31
*** zoli is now known as zoli|lunch09:32
*** chason has quit IRC10:06
*** chason has joined #openstack-infra10:07
*** agopi has quit IRC10:08
*** chason has quit IRC10:12
*** bradm has joined #openstack-infra10:37
*** zoli|lunch is now known as zoli10:50
*** e0ne has joined #openstack-infra11:01
*** e0ne has quit IRC11:01
*** e0ne has joined #openstack-infra11:02
*** e0ne has quit IRC11:02
*** jpena is now known as jpena|lunch11:03
*** e0ne has joined #openstack-infra11:17
*** e0ne has quit IRC11:17
*** vivsoni has quit IRC11:17
*** rh-jelabarre has joined #openstack-infra11:26
*** auristor has quit IRC11:36
*** dave-mccowan has joined #openstack-infra11:43
*** boden has joined #openstack-infra11:43
*** jiapei has quit IRC11:47
*** tpsilva has joined #openstack-infra11:48
*** jpena|lunch is now known as jpena11:57
*** hemna_ has quit IRC12:04
*** rosmaita has joined #openstack-infra12:12
*** auristor has joined #openstack-infra12:14
*** hemna_ has joined #openstack-infra12:15
*** panda|rover is now known as panda|rover|off12:17
*** sthussey has joined #openstack-infra12:20
*** kgiusti has joined #openstack-infra12:21
evrardjpcan someone explain me what is the purpose of project-config's bindep-fallback file ?12:22
evrardjpAJaeger: you are the last one to touch this, maybe you know? ^12:23
pabelangerevrardjp: it's a legacy file for when we used to have devstack based images.  The goal is to eventually delete it, and is under freeze for the most part12:25
evrardjppabelanger: well the thing is that it's used for bindep testing12:25
evrardjpand it's outdated12:26
pabelangerevrardjp: which job?12:26
pabelangerwhen we bring on newer versions of fedora, we just skipping adding a test for it12:26
evrardjpok12:27
pabelangerwhich means, newer OSes are broken for bindep-fallback and is a way that projects should add a bindep.txt file to their repo12:27
evrardjpthat sounds a bad idea12:27
evrardjppabelanger: oh i see12:27
evrardjpso the idea is to REALLY push the bindep into projects' repo?12:27
pabelangeryup12:27
pabelangerlast I checked a lot of projects already have bindep.txt file12:28
evrardjpproof is here that it doesn't work for SUSE tumbleweed and leap 15: https://review.openstack.org/#/c/588209/12:28
evrardjppabelanger: yeah.12:28
evrardjpso I will work on those projects across the board then12:28
evrardjpI will abandon my idea of changing the fallback12:28
evrardjpI should have asked earlier : p12:29
pabelangeryah, ideally all those jobs will go away, once we remove the file12:29
evrardjppabelanger: if you could help on those reviews, I'd be happy: https://review.openstack.org/#/q/topic:loci-suseleap15+status:open+project:openstack-infra/bindep12:30
evrardjpianw: as you were already a reviewer in some of those, I'd be happy to see a review too ^12:31
evrardjpif you need anything reviewed that I can help, shoot :)12:31
*** mriedem has joined #openstack-infra12:39
*** rlandy has joined #openstack-infra12:42
*** efried is now known as fried_rice12:48
*** rfolco|off is now known as rfolco|ruck12:52
*** nicolasbock has joined #openstack-infra13:00
*** edmondsw_ has joined #openstack-infra13:00
*** eharney has joined #openstack-infra13:07
*** amoralej is now known as amoralej|lunch13:07
*** agopi has joined #openstack-infra13:10
*** agopi_ has joined #openstack-infra13:11
*** quiquell is now known as quiquell|off13:13
*** agopi has quit IRC13:14
*** agopi_ is now known as agopi13:14
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Replace with_first_found with lookup first_found  https://review.openstack.org/58854613:16
*** quiquell|off has quit IRC13:17
*** ramishra has quit IRC13:22
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Remove old inactive users  https://review.openstack.org/58855313:22
*** jcoufal has joined #openstack-infra13:24
*** dave-mccowan has quit IRC13:25
*** mriedem is now known as mriedem_afk13:25
*** agopi_ has joined #openstack-infra13:25
*** lbragstad has quit IRC13:26
*** agopi has quit IRC13:28
*** stephenfin is now known as finucannot13:28
*** agopi_ is now known as agopi13:31
*** psachin has quit IRC13:33
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Add sudoers file and groups  https://review.openstack.org/58785413:51
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Add bridge.openstack.org to trusted ssh list  https://review.openstack.org/58785513:51
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Add emacs and vim to base-server packages  https://review.openstack.org/58798313:51
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Add pip and virtualenv to bridge.openstack.org  https://review.openstack.org/58798413:51
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Install and configure ansible on bridge  https://review.openstack.org/58798513:51
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Add an Ansible role to configure exim  https://review.openstack.org/58808913:51
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Update install_modules to not need puppet  https://review.openstack.org/58839413:51
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Rename update_puppet to update-system-config  https://review.openstack.org/58839613:51
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Fix some little ansible issues  https://review.openstack.org/58839713:51
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Replace with_first_found with lookup first_found  https://review.openstack.org/58854613:51
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Remove old inactive users  https://review.openstack.org/58855313:51
*** bnemec is now known as beekneemech13:51
*** Emine has joined #openstack-infra13:53
fungipabelanger: i didn't realize we were skipping testing the fallback file for some platforms. that job was serving as functional testing of bindep on those platforms, so without it we don't necessarily have any guarantees it'll run there do we? or was a different functional test job added to supplant it?13:55
fungii expected we'd eventually move that file into the bindep repo as a test fixture once we no longer needed it for actual fallback13:57
pabelangerfungi: yah, moving a file in repo should be good if we want to keep testing on specific distro.13:57
fungiwell, presumably we want to test against all the distros that we have13:57
pabelangerideally, ya. Some have slipped though13:58
* mordred looks forward to not needing the fallback file14:01
pabelanger++14:01
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Install and configure ansible on bridge  https://review.openstack.org/58798514:06
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Add an Ansible role to configure exim  https://review.openstack.org/58808914:06
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Update install_modules to not need puppet  https://review.openstack.org/58839414:06
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Rename update_puppet to update-system-config  https://review.openstack.org/58839614:06
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Fix some little ansible issues  https://review.openstack.org/58839714:07
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Replace with_first_found with lookup first_found  https://review.openstack.org/58854614:07
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Remove old inactive users  https://review.openstack.org/58855314:07
*** alex_xu has quit IRC14:08
mordredpabelanger: updated https://review.openstack.org/#/c/587854 to add a comment based on your review14:08
*** edmondsw_ is now known as edmondsw14:08
mordredclarkb, corvus, fungi, pabelanger: ^^ I updated the stack based on review comments and have re-tested on bridge.o.o14:09
*** amoralej|lunch is now known as amoralej14:10
*** rpittau has quit IRC14:18
clarkbfungi with bindeps switch to distro the in repo test suite has fixtures for all supported distro os-release files. Not a direct on going test of functionality on that distro but pretty close14:23
*** janki has quit IRC14:28
fungiyeah, i'd still feel more comfortable if we tested it on all the platforms where we might run it in other jobs, but its unit testing is fairly comprehensive14:33
*** hongbin has joined #openstack-infra14:33
*** jcoufal_ has joined #openstack-infra14:36
*** jcoufal has quit IRC14:37
*** Emine has quit IRC14:39
*** nhicher has joined #openstack-infra14:40
*** Emine has joined #openstack-infra14:40
*** mriedem_afk is now known as mriedem14:43
clarkbIn this case it looks like the problem is with the actual contents of the file and not bindep itself14:47
openstackgerritMonty Taylor proposed openstack-infra/system-config master: WIP Use ansible for openstack_project::server  https://review.openstack.org/58583614:48
mordredclarkb: I have responded to and/or fixed all of your review comments from yesterday14:49
clarkbmordred: thanks, I'll go back through that list when project renames are done14:50
clarkbI have disabled puppet cron on the puppetmaster14:50
mordredclarkb: sweet14:50
clarkbmordred: when doing ^ I noticed that hte cloud launcher ansible might be something we want ot move to bridge early since it is all ansible14:50
mordredclarkb: agree14:50
mordredclarkb: also - we still have infra cloud things in place14:51
*** janki has joined #openstack-infra14:51
clarkbmordred: ya noticed that too14:51
clarkbmordred: I think it may noop because we removed nodes from inventory14:52
clarkbhrm notice that the -1 on the chef change isn't due to zuul config update being unhappy its because we have linting things on projects.yaml14:53
clarkbscas: fungi ^ fyi I'm going t oaddress the two items that were caught now14:54
clarkbhttp://logs.openstack.org/71/585471/3/check/project-config-gerrit/d3a56e4/job-output.txt.gz#_2018-07-24_20_49_55_56861814:54
*** rpioso|afk is now known as rpioso14:55
mnaseris infra still maintaining openstack/ansible-role-cloud-launcher ?14:56
openstackgerritClark Boylan proposed openstack-infra/project-config master: Unretire the openstack-chef project  https://review.openstack.org/58547114:56
clarkbfungi: scas ^ that should make zuul happy. We might also want to relax those rules so that we can make project entries uniform across a larger project14:57
clarkbmnaser: in as much as we use it today and we fix the occasional bug. I don't think we've added features in a while14:57
mnaserclarkb: https://review.openstack.org/#/c/588332/ i proposed some test and i think the tests are failing :<14:58
toskymordred: hi! Regarding sahara-tests and local changes, I did some tests but I probably did some mistake, because I get the wrong result14:59
toskymordred: namely https://review.openstack.org/#/c/581781/ and https://review.openstack.org/#/c/588515/ (which should test it)14:59
scasclarkb: thanks. i wasn't 100% on the process, but wanted to get it on the schedule with haste14:59
*** Emine has quit IRC15:00
*** Emine has joined #openstack-infra15:00
clarkbmnaser: looks like it expects a devstack cloud to be running that it can talk to, but I don't see any devstack setup15:00
evrardjppabelanger: fungi I can fix it for new distros if you want.15:01
mnaserclarkb: yep, that seems to be the issue.15:01
mnaseris there an easy way to get devstack in there using a different basejob or so15:01
mordredmnaser: yah - there is a base job15:01
mnaserbrrr, guess i signed myself up for more work15:02
mnaser:<15:02
fungiclarkb: we're less than an hour to go time. should we proceed with stopping puppet?15:03
mordredmnaser: devstack-tox-functional-consumer will make a devstack in pre-run and is set upto run tox-based functional tests15:03
clarkbfungi: I already did :)15:03
fungioh, excellent!15:03
clarkbfungi: please double check ;)15:03
mnasermordred: wow, that's awesome15:03
mordredmnaser: behold the awesome power of zuul. bow down to its majesty.15:04
fungii wasn't following the state of our ansible changes super closely yesterday so didn't know if the process for that has changed now15:04
mnaserhttps://docs.openstack.org/devstack/latest/zuul_jobs.html i didnt get through my reading to get there, thats sweet15:04
mnaser!!15:04
openstackmnaser: Error: "!" is not a valid command.15:04
mordred:)15:04
clarkbfungi: it shouldn't have changed. Only hiera data organization changed15:04
fungiokay, cool15:04
clarkbI still don't know if corvus has had a chance to read through the updated process15:05
fungiit's not too far off from what we did last time15:09
fungihonestly, if we just take a safety snapshot of the pipeline contents in zuul when we're starting, it's probably all the insurance policy we need15:09
clarkbok, maybe add that as a step?15:10
fungiadded15:12
clarkbthanks15:12
mordredclarkb, fungi: I haven't been following the renames super closely, but I am here to help should I be useful15:14
clarkbmordred: https://etherpad.openstack.org/p/project-renames-2018-08-03 is the tracking/process document if yo uwant to give that a read15:15
*** jcoufal has joined #openstack-infra15:15
corvusclarkb: i'll give it a once over now15:16
*** jcoufal_ has quit IRC15:16
openstackgerritMonty Taylor proposed openstack-infra/zuul master: Switch storyboard url to be by name  https://review.openstack.org/58859715:18
corvushrm, we don't actually stop gerrit anymore do we?15:18
openstackgerritMohammed Naser proposed openstack-infra/project-config master: Add cloud launcher role to infra. channel  https://review.openstack.org/58859815:19
openstackgerritMohammed Naser proposed openstack-infra/project-config master: Remove Ansible function jobs from cloud-launcher  https://review.openstack.org/58859915:19
openstackgerritMonty Taylor proposed openstack-infra/nodepool master: Switch storyboard url to be by name  https://review.openstack.org/58860015:20
clarkbcorvus: the playbook stops it15:20
fungicorvus: not directly, but the playbook does15:20
mordredclarkb: thanks15:20
openstackgerritCarlos Goncalves proposed openstack/diskimage-builder master: Add netcat to redhat-common map-packages  https://review.openstack.org/58860115:20
fungiplaybook is basically 1. stop gerrit, 2. update db, 3. move files, 4. start gerrit, 5. rename groups, 6. initiate online reindex15:21
corvushow long is the downtime?15:21
fungi(some of those middle steps may be in a slightly different order)15:21
fungicorvus: roughly equivalent to a normal gerrit restart15:21
clarkbcorvus: I expect about 10 minutes max. I think it takes gerrit about 5 minutes to start now and that gives us 5 minutes to stop and update things (which should be plenty)15:21
corvusokay, since we're not restarting zuul, i guess the idea is we'll roll the dice on not many changes merging during the downtime?15:22
clarkbcorvus: ya, to avoid needing to manually push updated project-config everywhere while gerrit spends its time replicating the world15:22
corvus(we used to be able to pause zuul, we should add that back)15:22
fungia pause certainly would be convenient there15:23
clarkbmordred: one thing you can double check is that the groups/hosts used in the playbook map to the right hosts still. hosts: review for example15:23
fungibut worst case zuul fails to report some change(s) and those have to be requeued by their maintaniers15:23
fungi(i think?)15:24
corvusyeah.  i expect that to happen some, but i think this is a reasonable procedure for today.15:24
mordredclarkb: yah - host: review == review01.openstack.org15:24
clarkbmordred: thanks15:24
corvusit'll continue to get better in the future :)15:24
fungii wonder if the notedb situation will make it possible to do this without gerrit restarts, though i'm not holding out hope15:25
corvusthough... we may end up with a bunch of merge failures if a change fails to report and zuul can't re-merge changes behind it.15:25
mordredfwiw: "ansible --list-hosts review" is the way to check that15:25
*** janki has quit IRC15:29
*** zoli is now known as zoli|gone15:38
*** zoli|gone is now known as zoli15:38
openstackgerritJames E. Blair proposed openstack-infra/zuul master: Add pause/unpause support to scheduler  https://review.openstack.org/58861015:44
corvus^ for later :)15:45
clarkbmordred: in the rename playbook is the comment # TODO: gerrit startup exceeds the timeout, so this task fails15:45
clarkbmordred: will the play after that run even if the task fails?15:45
mordredclarkb: no, it shouldn't15:46
clarkbhrm ok15:46
clarkbthats ok, its the end of the playbook and only group renames and reindexing left. can do that mnaully if gerrit doesn't start quickly neough15:46
mordredclarkb: I think maybe last time we thught about splitting it into two playbooks15:46
mordredclarkb: ++15:46
clarkbI think we can ignore error there since subsequent play has you check gerrit is happy before hitting enter to coninue15:46
clarkbbut I won't do that for this run, it is a straightforward manual set of steps if necessary15:47
mordredagree15:47
fungisounds fine15:48
*** ccamacho has quit IRC15:57
*** mriedem is now known as hansmoleman15:57
* fungi is on hand for the maintenance, but also in release team meeting which is just winding up15:58
fungiwas someone going to send a #status notice? have any wording drafted yet?15:58
clarkbfungi: I hadn't drafted anything yet but maybe "The infra team is renaming projects in Gerrit. There will be a short ~10 minute Gerrit downtime in a few minutes as a result.15:59
clarkbalso can someone else review https://review.openstack.org/#/c/575478/ before we force merge?15:59
fungisounds good. looking at 575478 now16:00
clarkb#status notice The infra team is renaming projects in Gerrit. There will be a short ~10 minute Gerrit downtime in a few minutes as a result.16:01
openstackstatusclarkb: sending notice16:01
* clarkb wonders if the notice is actually sending16:02
-openstackstatus- NOTICE: The infra team is renaming projects in Gerrit. There will be a short ~10 minute Gerrit downtime in a few minutes as a result.16:02
clarkboh there it is, I'm just not in as many channels that get it early I guess16:02
clarkbshall I force merge the changes now? anyone willing to grab the zuul queues?16:03
corvusi'll grab zuul queues16:04
clarkbmaybe wait for notice to finish sending16:04
clarkbcorvus: thanks16:04
openstackstatusclarkb: finished sending notice16:04
clarkbI've added myself to the project bootstrappers group and am force merging those two changes now16:04
*** Guest10945 has quit IRC16:05
corvusgimme a second; i think there's a version mismatch between zuul-changes script and running server16:05
clarkbcorvus: ok16:05
openstackgerritMonty Taylor proposed openstack-infra/zuul-website master: WIP Add podcast.__init__ audio stream  https://review.openstack.org/58861516:05
fungioh, that's fun16:06
clarkb(for those following along changes have not been force merged yet)16:06
*** rlandy is now known as rlandy|mtg16:07
corvusokay, queues saved16:07
clarkbcorvus: ready for me to proceed?16:07
corvuspatched script in ~root/zuul-changes.py16:07
corvusclarkb: yep16:07
openstackgerritMerged openstack-infra/project-config master: Unretire the openstack-chef project  https://review.openstack.org/58547116:08
openstackgerritMerged openstack-infra/project-config master: Rename the API-WG to API-SIG  https://review.openstack.org/57547816:08
clarkbdone and I've rmeoved myself from bootstrappers16:08
clarkbcan someone check if replication is complete for those changes and I will get ready to run the playbook16:09
*** ccamacho has joined #openstack-infra16:09
clarkbI'm starting a root screen on puppetmaster for that16:09
fungii'm looking16:09
*** gyee has joined #openstack-infra16:09
fungithere's a backlog of github pushes hours old, but nothing for git.o.o in the queues16:10
fungier, for gitNN.o.o that is16:11
clarkbhttps://git.openstack.org/cgit/openstack-infra/project-config/log/ shows the two commits too16:11
clarkbready to proceed with playbook?16:11
fungii think so16:11
clarkbok will run it in root screen on puppetmaster16:11
clarkbit is running16:12
fungii see the shutdown in the gerrit log now16:13
*** melwitt is now known as jgwentworth16:14
clarkbmysql failed16:14
clarkbcouldn't connect16:14
mordredwell that's not great16:14
clarkbseems like it was expecting a local connection16:15
scaswasn't me (probably was me)16:15
mordredclarkb: you are running as root?16:15
clarkbmordred: yes16:15
fungiin a screen session16:15
clarkbits in screen16:15
clarkbthe issue is I think we don't have a mysql default conf for talking to the remote db16:16
mordredwe used to16:16
mordredwhere did it go?16:16
clarkbmordred: we made a new server guessing that as done by hand?16:16
fungiit's ~root/.gerrit_db.cnf16:16
clarkbmordred: ^ is used for db backups but not default16:16
mordredah. gotit. then yeah, we should update something to add that file16:16
clarkbmordred: in the interim can we copy that file to the default name and rerun the playbook?16:17
fungiyou'll need a copy of the playbook with the gerrit stop removed probably16:17
mordredclarkb: yes. ~/.my.cnf16:17
clarkbok first I'm copying  ~root/.gerrit_db.cnf to ~root/.my.conf16:17
clarkbdone16:18
clarkbnow will make copy of the playbook and remove the gerrit stop16:18
mordred++16:18
clarkbfungi: more if following along in the screen that look good to go now?16:19
*** fried_rice is now known as fried_rolls16:19
fungii can't see it all, but if you removed the gerrit stop that should be enough i think16:20
*** udesale has quit IRC16:20
clarkbya I did16:20
mordredyah16:20
fungilgtm16:20
clarkbrunning now16:20
fungiwatching gerrit error_log now16:21
clarkbgerrit is starting back up now16:21
fungithere it goes16:21
fungiGerrit Code Review 2.13.9-4-g2a605d5 ready16:22
clarkboh wow it didn' tfail16:22
clarkblet me double check that ssh and http work before itting enter16:22
corvus    raise BadHostKeyException(hostname, server_key, our_key)16:22
corvusparamiko.ssh_exception.BadHostKeyException: ('review.openstack.org', <paramiko.rsakey.RSAKey object at 0x7f76a19076d8>, <paramiko.rsakey.RSAKey object at 0x7f76bb63dfd0>)16:22
corvuszuul is unhappy16:22
clarkbya local ls-projects over ssh doesn't seem to work either16:23
fungii wonder if mina-sshd was returning garbage for a bit at start16:23
clarkbfungi: anything in the error log?16:23
clarkboh there it goes16:23
fungiprobably the sshd_log, checking16:23
clarkbthough still haven't receied complete output yet16:24
clarkband now it is done /me tries again16:24
clarkbmuch quicker now. I think ssh must take a while to startup beyond gerrit saying "ready"16:24
clarkbcorvus: is zuul looking happier?16:24
fungithird-party ci systems spam that ssh api really, really hard too, looking at the log16:25
fungistill trying to find the error16:25
corvusclarkb: no the host key is wrong16:25
clarkbcorvus: hrm I didn't have to reconfirm it here16:25
mordredI didn't have to reconfirm it here either16:25
clarkbdid we configure the wrong key in zuul via puppet?16:25
mordredssh -p 29418 review.openstack.org gerrit ls-projects | wc -l ... worked for me16:26
corvusgerrit-code-review@review-dev.openstack.org16:26
corvusclarkb: i believe we did16:26
fungiwhups!16:26
clarkbugh16:26
corvusi will manually fix the known_hosts file on zuul16:26
clarkbcorvus: thank you16:26
fungiand i guess this is the first time the scheduler had to reconnect after that file got updated16:26
clarkbI'll wait for zuul to be confirmed happy before I hit enter in ansible16:27
clarkbunless you all think we should just continue?16:27
fungiis it only the scheduler which uses ssh access to gerrit, or do we need to worry about mergers?16:27
fungi[2018-08-03 16:24:26,165] [SSH gerrit ls-projects (mordred)] ERROR com.google.gerrit.sshd.BaseCommand : Internal server error (user mordred account 2) during gerrit ls-projects16:28
fungistream is already closed16:28
fungii guess he disconnected before it completed16:28
corvusfungi: only the scheduler16:28
corvusi think zuul is gtg now16:28
clarkbok I'm hitting enter in the screen now to continue with group renames and reindexing16:28
mordredfungi: I just tried again - but without the | wc -l16:28
mordredclarkb: ++16:29
fungiyeah, i just saw zuul report a build succeeded according to the gerrit logs16:29
clarkband that is the playbook done16:29
*** psachin has joined #openstack-infra16:29
clarkbfungi: want to handle storyboard? I'm making openstack/openstack-chef active now16:30
fungiyep16:30
corvusi manually re-enqueued the tripleo changes which failed to report16:30
clarkband I am off to do the github renaming as soon as I get my 2fa token16:30
fungiRows matched: 1  Changed: 1  Warnings: 016:31
fungisb project-group renamed16:31
*** jpich has quit IRC16:32
fungithe online reindexing seems to have gone extremely quickly16:33
clarkbfor the ownership transfer the new organization name is 'openstack' right?16:33
clarkbI guess it doesn't take a full url just the name /me goe sfor it16:34
fungiyeah, just the short org name16:34
fungias long as you're an admin in both it should work16:34
clarkbyup seems to be there and even asked me if I wanted to give gerrit perms on it16:35
clarkbI think we are to the point where we can reenable puppet, please check me on that16:35
fungiyeah, seems we're safe to go ahead16:38
clarkblooks like cgit isn't serving the new repos yet. I believe because we need manage projects there to update the index lists16:38
clarkbenabling puppet should take care of that16:38
clarkbif I bypass cgit and clone directly the repos seem to be there16:38
clarkbalright reenabling puppet cron now16:39
clarkbdone16:39
clarkbshould start running in ~6 minutes16:39
fungigerrit started at around 15k queued tasks and is now under 10k16:41
clarkbjava 8 so fast16:41
mordredyay java 816:41
fungiso so fast, the java 816:41
clarkbthe two todo items are fix zuul ssh known hosts and give gerrit a copy of its .my.cnf?16:42
corvusthat's my recollection16:42
fungiand maybe add functionality to support project-group renames16:42
mordredwe could also just add --defaults-file to the mysql invocation in the playbook16:43
clarkbmordred: actually that might be better16:43
fungithe playbook could in theory add a parameter for the mysql config file name and then16:43
fungiyeah16:43
clarkbsince its explicit16:43
fungiexactly what i was thinking too16:43
fungishould do the same for the storyboard mysqlclient config16:43
clarkbcorvus: I'm looking at system-config/manifests/site.pp and it isn't immediately clear to me what was wrong with the ssh host key16:43
fungifor consistency16:43
* mordred on mysql change16:43
*** jpena is now known as jpena|off16:44
fungipuppet waking up in 15 seconds16:44
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Add .gerrit_db.cfg to project rename playbook  https://review.openstack.org/58862016:44
fungido we want to tail that log?16:44
clarkbfungi: I'm tailing it outside the screen16:45
clarkbalso tailing manage projects log on review.o.o16:45
fungicool16:45
clarkbfungi: I dind't want puppet log to push the ansible stuff off the screen buffer16:46
fungiyep, good call16:46
fungi(though could just ctrl-a,c and add a new window in it)16:46
clarkbas an aside github didn't actually end up asking for 2fa even though it asked for a password when transfering repo owernship. I think because i was already auth'd with 2fa odd that they wouldn't enforce it anytime they ask for a password though16:48
clarkbgit* done now16:48
clarkbon to review16:48
clarkbcgit looks good16:49
*** openstackgerrit has quit IRC16:49
clarkbmanage projects is running now16:49
fungiyeah, makes sense that gh would just consider you already authed16:50
*** gfidente has quit IRC16:50
*** agopi_ has joined #openstack-infra16:51
clarkbmanage projects ran against both api-sig and openstack-chef and seems to have been happy with them16:53
*** agopi has quit IRC16:54
*** agopi_ is now known as agopi16:54
toskymordred: now that the gerrit crysis is over, if you have some time for my question... basically https://review.openstack.org/#/c/588515/ does not apply the variant as it should16:55
toskysame question for corvus16:56
corvusclarkb, mordred: i think the fix for the ssh key thing is just in hiera.  what do i need to do?  edit the same files on both puppetmaster and bridge?16:56
* fungi questions the use of "crisis" to refer to a scheduled maintenance16:56
corvustosky: gimme 5 mins and i can take a look :)16:56
clarkbcorvus: yes I think so. I don't see it in the in repo hiera16:56
toskyfungi: ups, sorry, you are right; it was an hyperbolic way to express the busy time on the channel16:56
clarkbcorvus: fyi Aug  3 16:54:02 zuul01 puppet-user[32202]: (/Stage[main]/Zuul::Scheduler/Exec[zuul-reload]) Triggered 'refresh' from 1 events16:57
clarkbI believe that we have cross the point of zuul concern16:57
fungiexcellent16:57
fungii need to spend a few minutes cooking lunch since maintenance has concluded. gerrit queue is down to ~3k tasks now so should clear out in a few more minutes16:57
corvusalso "gerrit_zuul_user_ssh_key_contents" is a... weird... name for the public gerrit server host key.16:58
mordredcorvus: I agree - and yes to also editing on bridge16:58
corvusi guess i should make the commit on puppetmaster and push the commit to bridge?16:59
corvusthat way the git repos stay in sync16:59
clarkbcorvus: ya16:59
mordredcorvus: there is an extra commit on bridge16:59
corvusmordred: can that commit be pushed to puppetmaster?16:59
mordredno - it removes the production subdirectory17:00
corvusokay, so we're forking the repos, and puppetmaster is a dead-end17:00
mordredyah. I'd just make it both places and we'll work to finish killing puppetmaster as quickly as we can17:00
*** derekh is now known as derekh_afk17:00
mordredcorvus: also - we should maybe put on the todo list pulling that out of secrets and into in-tree variables17:02
fungii had taken a stab at putting the gerrit ssh public host key in public hiera already17:02
corvusmordred: there's a bunch of renamed files on puppetmaster... should i just avoid committing there?17:02
mordredcorvus: hrm. lemme look?17:03
fungibut i guess not all uses of the host key got switched over to use the centralized copy17:03
mordredcorvus: oh  that's the rename - that's totally safe to commit, sorry, I should have committed it yesterday17:04
mordredcorvus: we're live with the change to consume that rename17:04
corvusmordred: ok... making them two separate commits at this point will be tricky, but i'll give it a shot17:05
corvusmordred, clarkb, fungi: okay, everything committed on both hosts17:07
mordred\o/17:07
corvuswe might want to send a note to openstack-infra describing the interim procedure for other roots17:07
mordredI shall do that17:07
corvuscool, i will look at tosky's thing17:08
mordredcool17:08
clarkbI'm waiting for puppet run all to finish before giving the all clear on the rename but I think we are all clear on the rename at this point17:08
clarkbok puppet is done17:12
*** calebb has quit IRC17:13
mordredcorvus, clarkb, fungi: https://etherpad.openstack.org/p/FTD8VBMWfw read ok?17:14
mordred(to send to openstack-infra@17:14
fungigerrit task queue has stabilized around 450 tasks17:15
fungilooks like it's still indexing nova, neutron and openstack-manuals17:15
fungithe rest are done17:16
clarkbmordred: I'm not quite sure I understand what you mean about removing the production subdir17:16
fungioh, actually the task queue count is still falling, that's good17:16
clarkbbut I think that is necessary for puppet 4?17:16
mordredclarkb: it's necessary on the remote hosts - it is not useful on bridge itself, as "environments" are a puppet concept17:17
mordredclarkb: the syncing code has to put data into two different paths depending on 3 or 4 already anyway17:18
clarkbya I guess we don't need it source side17:18
mordredso the directory on the puppetmaster/bridge host is not used for anything17:18
mordredyah17:18
*** dtantsur is now known as dtantsur|afk17:20
clarkbI'm not sure the current code copies it where we need it htough?17:21
clarkber for puppet 4 specifically17:21
corvustosky: left comment on https://review.openstack.org/58178117:22
logan-looks like limestone is stuck deleting in nodepool again17:22
*** dmellado has quit IRC17:22
*** stevebaker has quit IRC17:23
*** gouthamr has quit IRC17:23
toskycorvus: oh, thanks a lot; a PEBKAC17:23
mordredclarkb: it's in ansible-role-puppet in tasks/main.yaml - look for Set management server hieradata var17:23
toskyI will go for the branches syntax, it's more clear17:23
clarkblogan-: we'll have t osee if we have logs explaining what happened this time17:24
corvustosky: if you set this attribute on a project-pipeline in a change where you can't figure out why zuul didn't apply a job (ie, include it in 588515 in this case), zuul will report back with a complete list of all the jobs and variants it considered applying: https://zuul-ci.org/docs/zuul/user/config.html#attr-project.%3Cpipeline%3E.debug17:24
mordredclarkb: ah - however, there is definitely a deficiency there that we'll need to fix before we start running puppet from bridge17:24
corvustosky: that can be useful to find out why zuul didn't run a job17:24
mordredfixing now17:25
toskycorvus: I see, thanks17:25
corvustosky: you'd get a line like: Variant <Job sahara-tests-tempest branches: {MatchAny:{BranchMatcher:^(stable/(ocata|pike|queens)).$}} source: openstack/sahara-tests/.zuul.yaml@master#58> did not match <Change 0x7f76ba962f98 588515,1>17:25
clarkbI guess we should send a followup its done notice17:26
toskycorvus: did you manually build that line, or did you check the server logs?17:26
*** gouthamr has joined #openstack-infra17:26
clarkbhow about #statuc notice Project renames and review.openstack.org downtime are complete without any major issue.17:26
clarkbcorvus: mordred fungi ^17:26
mordred++17:26
clarkb#status notice Project renames and review.openstack.org downtime are complete without any major issue.17:27
openstackstatusclarkb: sending notice17:27
corvustosky: that's in the debug logs.  we log a lot of stuff so that if something weird happens, we can reconstruct it and fix the bug.  but the debug:True bit makes it self-service for any user17:27
-openstackstatus- NOTICE: Project renames and review.openstack.org downtime are complete without any major issue.17:28
toskyyeah, makes sense17:28
mnaserclarkb: when you have a second, can you comment/see https://review.openstack.org/#/c/588598/ with openstack infra ptl hat?17:28
clarkbmnaser: ya17:29
clarkbinfra-root please review https://review.openstack.org/#/c/588620/1 so that we don't forget it for next time17:29
*** psachin has quit IRC17:29
openstackstatusclarkb: finished sending notice17:30
*** openstackgerrit has joined #openstack-infra17:30
openstackgerritMonty Taylor proposed openstack-infra/ansible-role-puppet master: Allow explicit override for mgmt_hieradata  https://review.openstack.org/58862617:30
openstackgerritMonty Taylor proposed openstack-infra/system-config master: Set mgmt_hieradata variable for bridge.openstack.org  https://review.openstack.org/58862717:31
clarkbmnaser: done17:31
mordredclarkb: ^^ those two should fix that - good catch17:31
clarkblogan-: ok, you are next on my list then mordreds stack of bridge changes17:31
mordred(we haven't run puppet remotely from bridge yet)17:32
mnaserclarkb: cool, thanks!  AJaeger hopefully concerns here are addressed :) https://review.openstack.org/#/c/588598/17:32
*** gouthamr has quit IRC17:33
clarkblogan-: pabelanger hrm not seeing much in the logs. mordred can you look at nl02.openstack.org:/var/log/nodepool/launcher-debug.log and see if the sahde logs are in there as expected?17:35
clarkbmordred: if I grep for shade and sdk I see nothing17:36
mordredlooking17:36
clarkbI thinkwe may not have set up the logging properly like we thought we did17:36
mordredclarkb: what sort of thing are yu looking to see?17:36
mordredclarkb: I see things like 2018-08-03 17:36:28,450 DEBUG nodepool.TaskManager: Manager packethost-us-west-1 running task ComputeGetFlavorsDetail (queue 0)17:37
clarkbmordred: limestone hsa completely stopped processing requests, see grep limestone /var/log/nodepool/launcher-debug.log.2018-08-03_0117:37
clarkbmordred: no data to the cloud and no new node requests managed17:37
clarkbI think the hope was to see if we sent data to the cloud that caused us to go out to lunch17:38
clarkb2018-08-03 05:19:51,810 INFO nodepool.PoolWorker.limestone-regionone-main: Assigning node request <NodeRequest {'nodes': [], 'id': '200-0005370277', 'state_time': 1533273591.6037893, 'requestor': 'zuul01.openstack.org', 'reuse': True, 'declined_by': [], 'stat': ZnodeStat(czxid=3971832287, mzxid=3971832287, ctime=1533273591606, mtime=1533273591606, version=0, cversion=0, aversion=0,17:38
clarkbephemeralOwner=99428362472261746, dataLength=161, numChildren=0, pzxid=3971832287), 'state': 'requested', 'node_types': ['ubuntu-xenial']}> is the last real thing it seems to try to do17:38
clarkbafter that its just "deleting nodes" for nodes that are used17:38
clarkbI'm going to try a thread dump if nodepool suppots that17:38
mordred2018-08-03 05:12:09,447 DEBUG nodepool.TaskManager: Manager limestone-regionone ran task ComputeGetServersDetail in 68.9221122264862s17:38
mordredis the last thing I see where nodepool tried to do somethign via openstacksdk17:39
clarkbI don't see a sigusr2 in nodepool, we should add that (I would do it but parents are here and I'm likely to need to pop out soon and do some family stuff)17:39
*** hemna_ has quit IRC17:41
clarkb2018-08-03 05:19:41,785 DEBUG nodepool.PoolWorker.limestone-regionone-main: Active requests: ['100-0005370268'] seems to be the active thing it was trying to do17:41
corvusclarkb: it should support sigusr217:41
mnaserdoes infra have/use a role that maintains clouds.yaml file via ansible?17:42
clarkbcorvus: oh its in the base class17:42
clarkbbut sigusr1 is in the child class17:42
clarkbok running sigusr2 against nl02 launcher17:43
mordredmnaser: not yet17:44
mnasermordred: when the time comes, i'd love to work together on making https://github.com/openstack/openstack-ansible-openstack_openrc more generic17:44
mnaserit does 1 cloud only right now but yeah17:44
* mnaser is in the process of ansible-izing everything here and sharing stuff with infra would be good17:44
mordredmnaser: we will soon - although I wasn't really planning on making a generic role though - since you'd basically need to store the clouds.yaml contents in the ansible variables to get it to work17:45
mnaseryeah that's what bothers me a tad17:45
mordredmnaser: that said - if we can come up with a way to make one that's sensible, I'd be all for it17:45
mnaser++17:45
clarkbmordred: corvus http://paste.openstack.org/show/727270/ its blocking on that compute detail call finishing I think17:47
openstackgerritMerged openstack-infra/project-config master: Add cloud launcher role to infra. channel  https://review.openstack.org/58859817:48
corvusmordred, clarkb: does openstacksdk/requests timeout http requests eventually?17:49
corvusi think requests sessions have a timeout option17:50
clarkbcorvus: based on 2018-08-03 05:12:09,447 DEBUG nodepool.TaskManager: Manager limestone-regionone ran task ComputeGetServersDetail in 68.9221122264862s it seems like maybe it completed? possibly it timed out and then the cleanup code doesn't release the lock?17:50
openstackgerritMerged openstack-infra/system-config master: Add .gerrit_db.cfg to project rename playbook  https://review.openstack.org/58862017:50
corvusclarkb: oh, you're saying that the get limits call is waiting to start?17:51
corvuswe need to find out what the task manager thread is doing17:52
mordredyeah. there should be a running and a ran line at the top and bottom of each Task17:52
clarkbcorvus: the pool worker thread I pasted is waiting for the openstacksdk/shade task to complete. That is signaled by bumping the waiter aiui. I think the log message there shows the task did complete but took a long time (68 seconds), possibly it never bumped the waiter in handling the a timeout if there was one?17:53
clarkblet me see if I can grep for another get server detail17:53
corvusclarkb: that's a get limits call, not get servers detail17:53
corvusclarkb: the stacktrace you posted is the pool worker waiting on a get_compute_limits() call, so the complete ComputeGetServersDetail task is something else17:54
clarkbah17:54
clarkbok I don't see a running for get limits17:55
clarkbso maybe it is blocking on another task17:55
mnasercan i get very quick eyes on https://review.openstack.org/#/c/588599/1 ?17:56
clarkbcorvus: mordred everything else with limestone in the thread names is a node deleter17:56
clarkbthese requests all go through the same task manager to serialize them right? possibly one of the deletes is hanging17:56
clarkbcorvus: mordred: if that is the case then adding a timeout would probably be a good idea17:56
mordredthey do - althugh I'd still expect to see a "running" line17:57
mordredwhich the task manager should log before it actually attempts to execute the sdk operation17:57
fungilooks like the gerrit task queue is caught up except for still indexing nova (no surprise)17:58
clarkbmordred: corvus looking at the logs for tasks that ran they were all very fast (subsecond) then we have that slow 68second one last17:58
mordred(adding a timeout is probably a good idea ... I'm just bothered by not seeing breadcrumbs leading me to believe there's a hung http task)17:58
corvusmordred: what are the openstacksdk taskmanager threads named?17:58
corvusor does nodepool create those?17:59
mordredcorvus: I believe all of that is still in nodepool17:59
corvusyeah, so they should just be named with the name of the provider17:59
corvuslike just "limestone" i think17:59
corvusthere is no limestone task manager thread18:00
corvuslauncher-debug.log.2018-08-03_09:Thread: citycloud-la1 (140679866984192)18:00
corvusthat's what a task_manager thread looks like18:00
corvuslauncher-debug.log.2018-08-03_01:2018-08-03 05:12:09,450 ERROR nodepool.TaskManager: Task manager died.18:01
corvussigh. that's not the way we do error handling.18:01
*** rlandy|mtg is now known as rlandy|brb18:02
fungion par with using assert ;)18:02
corvusyeah18:02
corvushttp://paste.openstack.org/show/727274/18:02
corvusso something *did* timeout, and apparently we thought the right way to handle that was to die18:02
corvuspatch incoming18:03
*** amoralej is now known as amoralej|off18:03
mordredcorvus: well, at least some fraction of things 'worked'18:03
corvusactually, this fix is going to take me a bit.  i have to do a bunch of research.18:04
*** ccamacho has quit IRC18:04
*** ccamacho has joined #openstack-infra18:05
*** jmorgan1 has quit IRC18:05
*** electrofelix has quit IRC18:06
*** ccamacho has quit IRC18:09
corvusactually i need to afk for a bit; expect the nodepool fix sometime after (my) lunch18:10
mordredcorvus: let me know if I can be helpful18:10
mordredcorvus: kk18:10
corvusmordred: will do, thx18:11
*** jmorgan1 has joined #openstack-infra18:11
*** jmorgan1 has joined #openstack-infra18:12
corvusif folks want to restart that launcher it's ok with me, i don't need any more debug info18:13
clarkbmy quick read of it is that openstack task manager rereaises any exceptions that tasks caught. So if we have an exception that makes it allt hte way to our task manager. We likely just need to catch, log, and then continue rather than die? I think part of the behavior expectation is that shade only raises exceptions in fatal situations though18:13
clarkbshade returns real data for success, falsy data for failures and exceptions for fatal errors. A timeout is probably the exception to this if we only want to catch timeout exceptions and handle those18:13
clarkbI will restart nl02 now18:14
clarkbdone, the sha1 reported by pbr freeze has not changed since the lsat restart18:14
*** e0ne has joined #openstack-infra18:15
*** openstackgerrit has quit IRC18:19
*** derekh_afk has quit IRC18:21
clarkbok things look good I'm going to afk, now. mordred I'll try to get to your changes post lunch18:23
*** rlandy|brb is now known as rlandy18:34
*** openstackgerrit has joined #openstack-infra18:35
openstackgerritMerged openstack-infra/zuul-jobs master: Swift logs: don't allow links outside of the supplied path  https://review.openstack.org/58758018:35
openstackgerritMohammed Naser proposed openstack/ansible-role-cloud-launcher master: Add functional tests with DevStack  https://review.openstack.org/58859418:38
openstackgerritMohammed Naser proposed openstack/ansible-role-cloud-launcher master: Add extra_specs to flavor  https://review.openstack.org/58833218:38
openstackgerritMohammed Naser proposed openstack/ansible-role-cloud-launcher master: Add os_project_access  https://review.openstack.org/58833518:38
mnaserif anyone has a quick second -- https://review.openstack.org/#/c/588599/18:39
mnaserthat'll help make my stack pass for review18:40
mordredclarkb: ossum18:40
funginova changes still being reindexed18:43
*** hemna_ has joined #openstack-infra18:48
openstackgerritMohammed Naser proposed openstack/ansible-role-cloud-launcher master: Add functional tests with DevStack  https://review.openstack.org/58859418:49
openstackgerritMohammed Naser proposed openstack/ansible-role-cloud-launcher master: Add extra_specs to flavor  https://review.openstack.org/58833218:49
openstackgerritMohammed Naser proposed openstack/ansible-role-cloud-launcher master: Add os_project_access  https://review.openstack.org/58833518:49
*** jtomasek has quit IRC18:51
*** bobh has joined #openstack-infra18:57
*** bobh has quit IRC18:58
openstackgerritK Jonathan Harker proposed openstack-infra/puppet-elasticsearch master: Add support for setting the bind address  https://review.openstack.org/58864318:59
*** gouthamr has joined #openstack-infra19:04
*** derekh has joined #openstack-infra19:13
*** kgiusti has left #openstack-infra19:15
*** derekh has quit IRC19:15
*** fried_rolls is now known as fried_rice19:18
*** jcoufal has quit IRC19:26
openstackgerritMohammed Naser proposed openstack/ansible-role-cloud-launcher master: Add functional tests with DevStack  https://review.openstack.org/58859419:28
openstackgerritMohammed Naser proposed openstack/ansible-role-cloud-launcher master: Add extra_specs to flavor  https://review.openstack.org/58833219:28
openstackgerritMohammed Naser proposed openstack/ansible-role-cloud-launcher master: Add os_project_access  https://review.openstack.org/58833519:28
*** jcoufal has joined #openstack-infra19:29
mnaserahem does anyone know how to locally clone gerrit changes, i'm trying to use this in an ansible requirements.yml19:39
clarkbmnaser: git review -d change number?19:39
mnaserhttp://paste.openstack.org/show/727279/ (change: https://review.openstack.org/#/c/588335/10 ) seems to fail19:39
mnaseransible seems to complain about `command git checkout 760da4a77bcd4ffd60eed15556c96b9b99728f01 failed in directory /Users/mnaser/.ansible/tmp/ansible-local-25557CD47wd/tmpTrWvNz/openstack.cloud-launcher`19:39
jlvillalmnaser, What clarkb said. Or if you want to get complicated: git pull origin refs/changes/20/1220/219:40
mnaserbut i guess i can look at what git review does and put the right values19:40
clarkbYa I would exec git or git review19:40
mnaseri tried putting version: refs/changes/35/588335/10 to no avail either19:41
jlvillalmnaser, If you look at: https://review.openstack.org/#/c/588335/  And on the right are the download links.  git fetch https://git.openstack.org/openstack/ansible-role-cloud-launcher refs/changes/35/588335/1019:42
jlvillalmnaser, It might need to be a two-step process. Checkout the repo and then do the fetch.19:44
mnaserjlvillal: yeah it doesnt look like i'd be able to do it through ansible requirements.yml19:53
*** mdrabe has quit IRC19:54
cgoncalvesvery often I get "Cannot retrieve metalink for repository: epel/x86_64. Please verify its path and try again" when building a CentOS 7 DIB image. anyone has an idea?19:55
cgoncalveshttp://logs.openstack.org/14/587414/5/check/octavia-v2-dsvm-scenario-centos.7/b7ce6d4/job-output.txt.gz#_2018-08-03_19_04_34_40037619:55
*** dmellado has joined #openstack-infra19:56
logan-mnaser: you need to use a refspec.. let me find you an example19:59
logan-mnaser: https://github.com/logan2211/openstack-ansible-overlay/blob/cb9d8397f1419986178aace282dd2fd25c134410/overlay/env/ansible-role-requirements.yml#L11-L1520:00
*** gouthamr has quit IRC20:00
openstackgerritMohammed Naser proposed openstack/ansible-role-cloud-launcher master: Add functional tests with DevStack  https://review.openstack.org/58859420:09
openstackgerritMohammed Naser proposed openstack/ansible-role-cloud-launcher master: Add extra_specs to flavor  https://review.openstack.org/58833220:09
openstackgerritMohammed Naser proposed openstack/ansible-role-cloud-launcher master: Add os_project_access  https://review.openstack.org/58833520:09
mnaserlogan-: awesome!  thank you20:09
AJaegerany infra-root has time for infra-manual reviews, please? https://review.openstack.org/587620 , https://review.openstack.org/586549 and https://review.openstack.org/#/c/570498/ are open reviews20:10
AJaegerinfra-root, ianw has done a couple of changes for accessbot, please review the stack at https://review.openstack.org/#/c/58810620:11
*** stevebaker has joined #openstack-infra20:14
*** harlowja has joined #openstack-infra20:21
fungi[2018-08-03 19:33:55,596] [Reindex v32-v32] INFO  com.google.gerrit.server.index.OnlineReindexer : Reindex to version 32 complete20:21
fungi~3 hours 5 minutes to reindex20:22
*** jcoufal has quit IRC20:29
*** e0ne has quit IRC20:40
*** e0ne has joined #openstack-infra20:41
*** agopi has quit IRC20:43
*** e0ne has quit IRC20:45
*** beekneemech is now known as bnemec-pto20:55
pabelangerdmsimard: mind a +3 on https://review.openstack.org/588368/ for ara, moves to fedora-latest20:56
pabelangerin this case fedora-2820:56
corvusclarkb, mordred: remote:   https://review.openstack.org/588656 Don't wait for task in submit_task20:56
openstackgerritMerged openstack-infra/infra-manual master: fix URL markup for PEP 440  https://review.openstack.org/58762020:57
corvusclarkb, mordred: i'm pushing that up at the moment mostly as a question.  that appears to me to be the flaw, but if that's the case, i don't understand how things have worked at all -- this should mean that *any* exception from the cloud via openstacksdk should kill nodepool task managers...20:57
openstackgerritMerged openstack-infra/infra-manual master: Update Zuul Status Page to correct URL  https://review.openstack.org/57049820:58
corvusclarkb, mordred: i'm expecting someone to tell my why that's wrong, and then we can proceed from there :)20:58
clarkbcorvus: yes I think any exception to kill the managers, I think it works becaus shade doesn't raise exceptions unless there is a fatal flaw, it returns falsy values for failure20:58
corvusclarkb: wow, so we just haven't been getting exceptions?  it looks like this is hitting the "retry network errors once" code path, so i guess like you said earlier, that's the exception to the rule.  so we get exceptions if the network is bad.... and somehow, for the past few months, the Internet has been *working*?21:00
clarkbcorvus: ya I think so21:00
clarkbmordred should confirm but that is my guess at the moment21:00
corvusi feel like i missed my chance to buy a lottery ticket21:00
*** rfolco|ruck is now known as rfolco|off21:01
pabelangercorvus: clarkb: Oh, thanks for pointing this out21:02
pabelanger2018-08-02 01:25:27,956 ERROR nodepool.TaskManager: Task manager died.21:02
pabelangerwe also get it in rdoproject21:02
pabelangerbeen trying to figure out for the last few days why our nodepool stopped working there too21:03
corvuspabelanger: strangely, that's reassuring.  i figured lack of errors suggested my analysis was wrong, but this improve my confidence.  thanks :)21:03
pabelangeryah, in this case, rdocloud goes doen for reasons not related to nodepool, but nodepool never recovers from it.,21:04
pabelangertaskmanager dying now explains why that it21:04
pabelangers/doen/down21:04
clarkbAJaeger: thanks and done21:05
clarkbor at least half done21:05
openstackgerritMerged openstack-infra/infra-manual master: Remove Zuul v2 content  https://review.openstack.org/58654921:07
clarkbcorvus: did you want to review the accessbot changes? https://review.openstack.org/#/c/588134/1 and its parent21:12
clarkbcorvus: ianw In any case I +2'd but didn't approve to give corvus a chance to review since I think corvus had indicated interest in this the other day21:17
corvusclarkb: ya, i'll look now21:18
clarkbmordred: https://review.openstack.org/#/c/587540/9 responses to that change make sense to me should we approve it?21:20
mordredclarkb: yes!21:20
mordredalso - reading scrollback about shade things above21:21
clarkbmordred: and what about child changes jus approve as things look good?21:21
openstackgerritMerged openstack-infra/puppet-accessbot master: accessbot logs : add timestamp and rotate  https://review.openstack.org/58810621:22
mordredclarkb: yup. there should be notning in that stack until the one marked WIP that should impact current production at all21:22
mordredclarkb, corvus: I agree with clarkb, we've been lucky because shade/sdk mostly doesn't throw exceptions21:23
corvusmordred: ok, you think my change is the thing to do?21:24
mordredfor network issues, there is actually a retry baked in somewhere that will cause a certainly class of exception to get auto-retried21:24
mordredcorvus: I haven't digest the change yet - mostly responding to the writeup and discussion21:24
corvusmordred: yeah, there's a retry-once in the task manager21:24
corvusmordred: ok, while you continue to digest, i'll go look into what might be involved in adding a test21:25
mordredcorvus: but yes - I believe that is correct - pending that it passes the sdk tests and the nodepool tests21:25
corvusclarkb, ianw: accessbot lgtm, thanks.  i'm not in the right frame of mind to approve those right now.21:25
mordredcorvus: we should also potentially (not this afternoon necessarily) finish https://review.openstack.org/#/c/574285/ and then get some good explicit testing of both task managers21:26
clarkbmordred: I got as far as https://review.openstack.org/#/c/587985/7 before running out of necessary +2's to approve fwiw21:27
mordredclarkb: awesome. the one 2 after that: https://review.openstack.org/#/c/58839421:27
mordredis the one I need landed most to be able to continue on bridge itself (need the install_modules change to actually land :) )21:28
corvusmordred: in 587985 why is git-server set to python2?21:28
corvuslike, isn't that the default?21:28
corvusoh you were talking about inverting that21:29
corvusin which case, why is bridge set to python3? :)21:29
mordrednot if ansible is installed with python3 - that changes the default remote python it looks for21:29
pabelangerre: 587985 shouldn't we consider a virtualenv for ansible, given recent pip10 issues we had with sudo and os packages?21:29
clarkbpabelanger: possibly21:29
clarkbI mostly wasn't going to require that to start since we don't venv today21:30
mordredcorvus: so, I figured go ahead and be explicit with the centos boxen, since they don't have 3 anyway21:30
corvusanother option is using ansible from packages.  bionic has 2.5.1.21:30
mordredwe can invert the other pretty easily- either by putting an ansible_python_interpreter in group_vars/all.yaml or by reinstsalling ansible with python221:30
corvusmordred: it's confusing to me to have both of those21:31
corvusmordred: maybe we can drop the one for bridge?21:31
clarkbcorvus: re openstacksdk patch nodepool task manager overrides submit_task and does wait on the task there21:31
mordredclarkb: both pythons?21:31
mordredgah21:31
mordredcorvus: both pythons?21:31
corvusmordred: yeah.  like, if one of those is the default, then one of those shouldn't be needed, right?21:31
mordredcorvus: yes. there is currently no python2 installed on brige21:32
mordredbridge21:32
* mordred cannot type21:32
corvusit's spelled "bilge"21:32
clarkbcorvus: oh but then it calls into _run_task21:32
mordredzomg. we should have a bilge21:32
clarkbso ya I think that may make it happier21:32
* fungi is not volunteering to man the bilge pump21:32
corvus"captain in the bilge!"21:32
clarkbso ya I think nodepool's code did/does the right thing but then that behavior change in sdk caught nodepool by surprise21:33
clarkbmordred: fungi bilge.pump.openstack.org can be the bug tracker21:34
*** apetrich has quit IRC21:34
pabelangerlolz21:37
*** pcaruana has quit IRC21:38
*** boden has quit IRC21:40
openstackgerritMerged openstack-infra/system-config master: Add base playbooks and roles to bootstrap a new server  https://review.openstack.org/58754021:43
openstackgerritMerged openstack-infra/system-config master: Add sudoers file and groups  https://review.openstack.org/58785421:43
openstackgerritMerged openstack-infra/system-config master: Add bridge.openstack.org to trusted ssh list  https://review.openstack.org/58785521:43
openstackgerritMerged openstack-infra/system-config master: Add emacs and vim to base-server packages  https://review.openstack.org/58798321:44
openstackgerritMerged openstack-infra/system-config master: Add pip and virtualenv to bridge.openstack.org  https://review.openstack.org/58798421:46
dhellmannis there a zuul API to tell me how many active jobs are running? I would like to watch for a low point before submitting a bunch of the zuul setting migration patches for Oslo.21:47
dhellmannI've just been looking at the web page and eyeballing the check queue as a proxy for business21:47
*** njohnston has quit IRC21:47
*** gouthamr has joined #openstack-infra21:47
clarkbdhellmann: the status.json file has that info (which is what the dashboard uses to render things). you can also use grafana21:48
clarkbdhellmann: http://grafana.openstack.org/d/T6vSHcSik/zuul-status?orgId=1 and something like https://zuul.openstack.org/api/status21:48
dhellmannok, I was thinking I might write a little script to fetch it and tell me when it dips below a threshold, but now that I'm thinking out loud I don't know what that value might be21:48
dhellmannit's probably just as easy for me to check it periodically during off-peak times21:49
clarkbdhellmann: I personally like to use the test nodes graph on graphana for this sort of thing because it shows you how much capacity we have free21:49
clarkbright now almost 50% of our capacity is unused21:49
dhellmannaha21:50
dhellmannthat's better than what I was doing21:50
dhellmannso we have ~1000 nodes?21:50
clarkbdhellmann: yes21:50
clarkband actually that reminds me I was going to increase packethost back to 100 if it looked happy today21:50
dhellmannok. I have somewhere around 150 patches to submit here, so I think I'll see if that number dips lower as the day moves on21:50
dhellmannwhile my script generates those, I'm going to go run an errand21:51
openstackgerritClark Boylan proposed openstack-infra/project-config master: Bump packethost back to 100 max-servers  https://review.openstack.org/58866921:53
clarkbinfra-root ^ I think we may be good to do that based on current observations21:53
*** rh-jelabarre has quit IRC21:56
clarkbfungi: pabelanger ^ if still around second revie won that would be much appreciated22:12
fungiyup, looking now22:14
openstackgerritAdam Coldrick proposed openstack-infra/storyboard-webclient master: Update fontawesome to version 5  https://review.openstack.org/54567622:19
openstackgerritAdam Coldrick proposed openstack-infra/storyboard-webclient master: Make board and worklist icons unique  https://review.openstack.org/54567722:19
openstackgerritAdam Coldrick proposed openstack-infra/storyboard-webclient master: Convert less to scss  https://review.openstack.org/37959522:19
openstackgerritAdam Coldrick proposed openstack-infra/storyboard-webclient master: Redesign the sidebar to neaten code and improve contrast  https://review.openstack.org/54901022:19
openstackgerritAdam Coldrick proposed openstack-infra/storyboard-webclient master: Reduce the number of items in the sidebar  https://review.openstack.org/54905922:19
openstackgerritAdam Coldrick proposed openstack-infra/storyboard-webclient master: Make the page background less bright  https://review.openstack.org/54921022:19
openstackgerritAdam Coldrick proposed openstack-infra/storyboard-webclient master: Add a drop shadow beneath the header navbar  https://review.openstack.org/54921122:19
openstackgerritAdam Coldrick proposed openstack-infra/storyboard-webclient master: Add shadow to board lanes  https://review.openstack.org/54921222:19
openstackgerritAdam Coldrick proposed openstack-infra/storyboard-webclient master: Redesign dashboard to reduce clutter  https://review.openstack.org/54933322:19
openstackgerritAdam Coldrick proposed openstack-infra/storyboard-webclient master: Improve appearance of search bars  https://review.openstack.org/58867522:19
*** hongbin has quit IRC22:19
openstackgerritMerged openstack-infra/project-config master: Bump packethost back to 100 max-servers  https://review.openstack.org/58866922:20
clarkbfungi: thanks!22:20
corvusclarkb, mordred: i updated remote:   https://review.openstack.org/588656 Don't wait for task in submit_task22:24
corvusclarkb, mordred: i added a test and comments22:24
corvusmordred: i agree that adding https://review.openstack.org/574285 will help.  we'll still need a test like that with that change.  we can incorporate the test i wrote into that later.22:26
openstackgerritCarlos Goncalves proposed openstack/diskimage-builder master: Install ca-certificate with redhat-common  https://review.openstack.org/58867622:32
*** rlandy has quit IRC22:42
*** derekh has joined #openstack-infra22:43
*** nicolasbock has quit IRC22:43
corvusclarkb: when you have a sec, can you take a look at https://review.openstack.org/588383 ?22:46
openstackgerritJames E. Blair proposed openstack-infra/zuul-jobs master: Un-wip upload-logs-swift  https://review.openstack.org/58867722:48
clarkbcorvus: yup22:48
*** agopi has joined #openstack-infra22:51
*** tpsilva has quit IRC22:54
ianwcorvus: thanks for reviews.  i can watch it on my monday when it's quiet.  it actually didn't fully run last time, as it took longer than the puppet timeout and got killed.  i'll run it manually and see what's up with timing etc22:55
ianwaccessbot i mean22:55
clarkbcorvus: the change looks functional. I'm a little concerned about some of the performance impact ( we would still filter with crm114 ) left more details inline22:58
clarkbcorvus: I think given the existing code it would be a bit more cmplicated to implement my suggestion :/22:58
clarkbcorvus: maybe we still go through the filters but don't run f.process if a flag is set22:59
corvusclarkb: oh, good point.  i'll ponder that.23:02
corvusi was also thinking of having a filter, erm, actually *filter* things, like, be able to reject a line.  that'd be a change in how it's currently set up, but maybe worth doing based on that concern.23:03
clarkbya23:03
*** mschuppert has quit IRC23:03
*** tosky has quit IRC23:04
*** larainema has quit IRC23:07
*** derekh has quit IRC23:08
openstackgerritMerged openstack-infra/zuul-jobs master: Add HTMLify logs role  https://review.openstack.org/58810523:09
openstackgerritDoug Hellmann proposed openstack-dev/pbr master: import zuul job settings from project-config  https://review.openstack.org/58870023:14
openstackgerritDoug Hellmann proposed openstack-dev/pbr master: switch documentation job to new PTI  https://review.openstack.org/58870123:14
*** rosmaita has quit IRC23:17
dhellmanndoes git-review have a mode where it won't prompt when there are multiple commits to propose? like a -y option?23:18
dhellmannah, yes, it does23:18
*** xarses has joined #openstack-infra23:24
openstackgerritDoug Hellmann proposed openstack-infra/project-config master: remove job settings for Oslo repositories  https://review.openstack.org/58884223:26
*** neiloy has quit IRC23:27
*** rpioso is now known as rpioso|afk23:45
*** sthussey has quit IRC23:47
*** harlowja has quit IRC23:48
openstackgerritDoug Hellmann proposed openstack-infra/project-config master: remove job settings for Oslo repositories  https://review.openstack.org/58884223:51
mordredcorvus: patch looks great. I approved it, and also cherry-picked it back to stable/rocky and added a release note on that branch remote:   https://review.openstack.org/588845 Don't wait for task in submit_task23:51
mordredcorvus: once that lands I'll get a bugfix release cut23:52
openstackgerritDoug Hellmann proposed openstack-infra/project-config master: remove job settings for Oslo repositories  https://review.openstack.org/58884223:54
openstackgerritDoug Hellmann proposed openstack-infra/project-config master: fix api-wg/sig project settings  https://review.openstack.org/58884623:54
dhellmanncorvus , clarkb : I have submitted a bunch of patches to move the zuul job settings from project-config to the oslo repos. See https://review.openstack.org/#/q/topic:python3-first23:56
dhellmannand https://review.openstack.org/588842 is the related project-config change to delete them23:56
dhellmannI would appreciate it if you would look over that project-config change when you have some time, and maybe spot-check some of the other ones23:56

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!