Wednesday, 2014-01-22

*** hashar has joined #openstack-infra00:01
*** jgrimm has quit IRC00:02
*** CaptTofu has joined #openstack-infra00:02
*** markmcclain has joined #openstack-infra00:02
fungiwow, there's an odd failure to see in the gate... https://jenkins03.openstack.org/job/gate-glance-python27/142/consoleText00:02
*** vipul-away is now known as vipul00:02
*** slong has joined #openstack-infra00:03
morganfainbergfungi, py27 failure?00:03
*** gokrokve has quit IRC00:04
morganfainbergfungi, oh oh in gate?!00:04
*** slong_ has quit IRC00:04
*** gokrokve has joined #openstack-infra00:04
*** pcrews has quit IRC00:05
fungimorganfainberg: yeah, with what looks like something that would be very hard to blame on infrastructure... could indicate a bug in glance or the glance unit tests, i suppose00:05
*** markmcclain has quit IRC00:05
morganfainbergfungi, yeah, passed a couple days ago.  maybe something merged in that is changing the result of those tests now00:06
anteayadone, those patches are out of the gate00:07
fungior it's a bug which doesn't crop up on every test run00:07
anteayalet me know if you see any others, I will remove them00:07
morganfainbergfungi, possible as well.00:07
*** sarob has joined #openstack-infra00:07
fungianteaya: thanks for the info. i'll let you know if/when i spot others00:07
*** ok_delta__ has quit IRC00:08
*** ok_delta has quit IRC00:08
morganfainbergfungi, wonder if that patch needs to be pulled out of the gate.  just saw a reset at the top and it's running again00:08
anteayafungi: thanks00:09
*** gokrokve has quit IRC00:09
fungimorganfainberg: it's a judgement call for the glance devs. if you don't think that change is likely to be tyhe cause, then i'm not sure pulling it out will be uch help00:09
morganfainbergfungi, not a glance dev, so... :P00:10
*** sarob has quit IRC00:10
fungimorganfainberg: th "reset" was the pending promote of the nova fix we've been trying to get in all day00:10
morganfainbergfungi, though as a dev for openstack and waiting for lots of I2 patches, i'd say if it's failing again it might be worth pulling.00:10
morganfainbergfungi, aye, i saw that00:10
*** hashar has quit IRC00:11
morganfainbergnot complaining about the reset ;)00:11
*** sarob has joined #openstack-infra00:11
lifelessjeblair: I could hook that in quite easily00:11
anteayathe zuul status page still shows them in the gate queue00:11
fungiwhen i started the promote, there weren't any changes being tested i the gate, but the ref recalculation lag made it take long enough stuff started testing before it took effect00:12
lifelessjeblair: also I was considering moving to bulk operations - do one list of current floating ips, servers, keypairs, figure out what to delete, then issue the deletes00:12
morganfainberganteaya, likely waiting on events to process00:12
anteayamorganfainberg: I hope00:12
morganfainberganteaya, 1555 events in queue00:12
morganfainberganteaya, my guess is yes00:12
lifelessjeblair: so that we don't keep re-querying the same data - I guess it depends on the rate limit implementation of the provider whether we'd be penalised for that00:12
*** pballand has quit IRC00:12
fungianteaya: yeah, the new patchsets will be gerrit events. it'll take a while for them to land (may even be after the changes make it to the top of the gate and fall out anyway)00:13
anteayamorganfainberg: ummm, I'm talking about 2 patches I just sniped from the gate queue, I'm still seeing them in the gate queue00:13
lifelessjeblair: but since *we* have a naive seconds-since-last-op implementation, it would increase how many deletes we can send through00:13
morganfainberganteaya, ^00:13
anteayafungi: boo00:13
anteayais there something I can do that is faster?00:13
*** oubiwann_ has quit IRC00:13
openstackgerritClark Boylan proposed a change to openstack-infra/zuul: Add rate limiting to dependent pipeline queues  https://review.openstack.org/6821900:13
clarkbjeblair: ^ I am actually fairly happy with that00:13
clarkbI am now open to opinions on removing the _type and _factor stuff00:14
* fungi sees if he can thumb his nose at the internet gods long enough to review that00:14
clarkbsince that isn't strongly tested I think it should probably be on the chopping block00:14
jeblairlifeless: ok, i think with the addition of cleanup-per-provider, i'm sold on your approach;  the bulk operations change makes sense and sounds good.00:14
*** DennyZhang has joined #openstack-infra00:14
*** sarob has quit IRC00:15
fungiclarkb: a doc patch corresponding to this new feature/behavior would also make a good todo item for soon, though doesn't need to be part of that change of course00:16
*** bauzas has quit IRC00:17
clarkbfungi: definitely, was hoping to have something concrete before I documented say window_increase_factor00:17
fungiabsolutely00:17
*** senk has quit IRC00:18
fungithe test does a great job of exercising all the effects, i think00:19
*** wenlock has quit IRC00:19
clarkbfungi: yeah, I had it go through step by step. the reset and going from 2 to 1 is a nice thing :)00:20
clarkbfungi: once thing that doesn't test is the affect on dependent changes. not sure if thatneeds to be explicitly tested00:20
fungiwith window and floor i think we now need walls, a ceiling and door (the fire marshall would probably insist)00:20
jeblairfungi: we have a gate00:20
openstackgerritKhai Do proposed a change to openstack-infra/jenkins-job-builder: make scm test as the example  https://review.openstack.org/6518600:20
fungiwe do!00:20
*** vipul is now known as vipul-away00:21
*** yassine has quit IRC00:21
*** ryanpetrello has joined #openstack-infra00:22
*** fifieldt has quit IRC00:24
fungiclarkb: the only other potential misbehavior i worry about is if we have multiple changes failing in reverse order caused by one change ahead of them (through happenstance of varying provider performance impacting relative job ru-time)00:25
fungirun-time00:25
anteayaI have to get some sleep, the good thing is I will probably be awake early in the morning anyway00:25
fungiclarkb: but i think that's likely to be rare enough in practice so as to deal with it when it happens00:26
anteayaleave messages in channel if there is anything I can do when I return00:26
*** mrodden has quit IRC00:26
*** sarob has joined #openstack-infra00:27
*** markmcclain has joined #openstack-infra00:27
*** esker has joined #openstack-infra00:27
*** dangers is now known as dangers_away00:27
*** CaptTofu has quit IRC00:28
*** CaptTofu has joined #openstack-infra00:28
*** reed has quit IRC00:28
fungigah, 68147,3 failed its py27 unit tests00:29
jorisrooversfungi, I figured it out, thanks for the help :-)00:30
jog0AssertionError: False is not true. such a descriptive error00:30
*** dangers_away has quit IRC00:31
*** jasondotstar has quit IRC00:31
fungijorisroovers: sorry i couldn't be more helpful00:31
fungijorisroovers: how did you end up accomplishing it?00:31
*** sandywalsh has quit IRC00:31
jorisrooversfungi, no worries. It turned out the be a file that was removed in master and that I had moved in my patch00:31
*** carl_baldwin has quit IRC00:32
jorisrooversI just did a rebase, then during conflict removed that file and commited/reviewed00:32
*** dangers_away has joined #openstack-infra00:32
fungijog0 it passed the same job in the check pipeline too00:32
fungijorisroovers: yep, that doesn't sound so bad as what i was expecting then. thanks for the report00:33
jorisrooversfungi, yeah. I just took me a while as started over completely after messing up my local copies00:33
fungiokay, dropping offline again for a while in hopes of catching my next flight00:33
jorisrooversfungi, good luck with that. As always, your help was VERY much appreciated!00:34
*** ArxCruz has joined #openstack-infra00:34
jog0fungi: that was then, this is now maybe nova changed underneath00:34
*** zul has joined #openstack-infra00:37
clarkbfungi: that would harshly shrink the window size, but I am not sure how we could handle that00:38
clarkbfungi: actually no that won't00:38
clarkbwe only adjust the window when we report00:38
clarkbfungi: I think I will just add some dependent changes locally and bump the test count up :)00:40
clarkbdoes anyone know if the stable branches are happy now?00:42
*** flaper87 is now known as flaper87|afk00:42
jeblairclarkb: i think grizzly is but not havana00:42
clarkbjeblair: thanks00:42
*** reed has joined #openstack-infra00:44
*** reed has quit IRC00:48
*** kgriffs_afk is now known as kgriffs00:50
openstackgerritClark Boylan proposed a change to openstack-infra/zuul: Add rate limiting to dependent pipeline queues  https://review.openstack.org/6821900:50
*** kgriffs has left #openstack-infra00:50
clarkbjeblair: fungi ^ now with a bit more testing. I am fairly confident in the change now. Looking for feedback on all of the toggles00:50
clarkbI am going to context switch into putting the new SCP plugin build on jenkins0400:51
*** CaptTofu has quit IRC00:54
*** thuc has joined #openstack-infra00:56
*** dcramer__ has joined #openstack-infra00:56
*** morganfainberg is now known as morganfainberg|z01:00
russellbfungi: saw that >_<   unrelated failure01:01
clarkb68147,3 failed python27 tests01:01
russellbyeah :(01:02
clarkboh fungi noticed earlier01:02
russellbsorry ...01:02
russellbthat fail is https://bugs.launchpad.net/nova/+bug/127065401:02
clarkbjenkins04 will be idle shortly, I am going to restart it to pick up the new scp plugin01:02
clarkboh man that test01:03
russellbi'll approve again after it fails i guess01:03
clarkbrussellb: pretty sure that test was really broken before testr'ing and it only sort of got fixed after testr'ing01:03
russellbi'm bummed this failed ... took all day to get it this far01:03
russellb318 hits in the last 12 hours01:03
russellbmay be worth ninja-merging if that's possible (assuming the rest passes)01:04
russellbcheck passed fine01:04
clarkbrussellb: I will let jeblair make a call on that01:04
russellbok01:05
lifelessfungi: so, about getting ci-overcloud enabled for reals ?01:05
locke105is that a new project ?01:07
fungilifeless: we need a nodepool restart for that. it seems not worth delaying the gate with additional resource starvation right now, but maybe when i'm not sitting in an airport and zuul's got new throttling in place01:07
fungiwhich sounds very soon01:07
*** hashar has joined #openstack-infra01:08
fungiwe'll see how exhausted i am when in finally find a hotel room01:08
*** blamar has joined #openstack-infra01:08
lifelessfungi: ok; we're very very very very keen about this, as you can tell01:08
*** DennyZhang has quit IRC01:09
locke105is ci-overcloud the hyperv CI thing?01:10
fungilocke105: it's the bare metal ci thing01:10
locke105not related to this https://github.com/cloudbase/ci-overcloud-init-scripts ?01:11
fungilifeless: i'm very keen on it too, and hope i don't come across otherwise ;)01:11
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Include check in fake.yaml.  https://review.openstack.org/6829501:12
lifelesslocke105: ci-overcloud.tripleo.org01:12
lifelesslocke105: no, not related01:12
lifelesslocke105: MS have said they are going to contribute hardware to tripleo's test cloud which would get hyperv check or possibly even gate testing eventually01:12
locke105i c01:13
lifelessbut that looks like cloudbase doing third-party testing01:13
locke105yeah01:13
lifelesswhich can't (on current policy anyhow) ever become check-or-gate01:13
*** senk1 has joined #openstack-infra01:13
locke105ci-overcloud.tripleo.org doesn't seem to resolve to anything?01:14
lifelesslocke105: your DNS may be broken01:14
locke105figures01:14
lifelesslocke105: there's no web page there though, if you used a browser to test ;)01:15
fungici-overcloud.tripleo.org has address 138.35.77.1601:15
locke105comes up on my other machine yeah01:15
locke105weird01:15
fungiclarkb: i feel silly for asking, but any particular reason why the two tests have the same docstring? just two scenarios to exercise it, or is there something more subtle i'm missing between them besides the function names?01:16
fungialso, boarding. back in a bit once i'm on the plane01:16
clarkbfungi: oh, because I copy pasta'd01:16
clarkbI will fix that01:16
*** UtahDave has quit IRC01:17
openstackgerritClark Boylan proposed a change to openstack-infra/zuul: Add rate limiting to dependent pipeline queues  https://review.openstack.org/6821901:17
locke105mm copy pasta01:17
*** mestery has quit IRC01:18
*** oubiwann_ has joined #openstack-infra01:19
jeblairclarkb, fungi: i need to check out for the day (still sick); i don't feel i'm in a position to do serious code review or make good judgement calls at this point01:20
clarkbjeblair: thats fine, we can let that stew overnight then make big changes tomorrow01:20
jeblairk01:20
clarkbjeblair: I will be restarting jenkins04 shortly though01:20
russellbjeblair: hope you feel better soon01:20
russellbhealth always more important IMO01:20
clarkbto pick up the newer scp plugin, zaro tested locally and on jenkins-dev so I am fairly confident in it01:21
*** hogepodge has quit IRC01:23
sdaguefungi: I support ninja merging russellb's patch01:25
sdaguefor what it's worth, it should reduce our reset rate01:26
clarkbunless it makes that unittest super unreliable01:26
fungijeblair: speedy recovery01:26
*** pcrews has joined #openstack-infra01:26
*** mrodden has joined #openstack-infra01:26
*** smurugesan has quit IRC01:26
russellbsdague: should be completely unrelated to the unit test failure (it was a libvirt unit test)01:27
sdaguerussellb: agreed01:27
fungirussellb: sdague: nova devs are fairly certain that change is unlikely to tickle that test i a way that makes it fail more often? i can cram it in if so01:28
*** mrodden1 has joined #openstack-infra01:28
* fungi prepares01:28
sdaguefungi: yes, we're fairly sure01:28
russellbyes01:28
* russellb isn't going to bed any time soon01:28
russellband will not go to bed until i clean up a mess i made if by chance it blows up01:29
russellb:)01:29
sdagueheh01:29
* russellb just starting a pot of chili, wooo01:29
sdagueI, on the other hand, am running away from computers for the night.01:29
russellbsdague: enjoy :)01:29
sdagueooo chili :)01:29
sdaguenight all01:29
clarkbI need to do that soon, but will get jenkins04's scp plugin updated first! jenkins and I love doing battle01:30
*** praneshp has quit IRC01:30
clarkbsdague: tomorrow morning your review on my zuul change would be appreciated01:30
clarkbsdague: you may have an opinion on all o fthe toggles I added01:30
openstackgerritJoe Gordon proposed a change to openstack-infra/elastic-recheck: Add fingerprint for bug 1270654  https://review.openstack.org/6829601:31
russellbfungi: if you'd rather wait until tomorrow so that *you* don't feel like you have to sit around, that's cool too, totally understand01:31
*** mrodden has quit IRC01:31
fungirussellb: i plan to sit around. i'm waaay too behind on my work anyway01:31
fungii'm glued to a plane seat for at least another 1.5 hours as well01:31
openstackgerritMichael Krotscheck proposed a change to openstack-infra/storyboard-webclient: Storyboard API Interface and basic project management  https://review.openstack.org/6758201:31
*** gothicmindfood has quit IRC01:32
russellbfungi: oh, plane wifi, neat :)01:33
fungii have no idea how civilization managed to get any work done before in-flight network access01:33
*** harlowja has joined #openstack-infra01:33
clarkbfungi: they made people work in the same office01:33
russellbha01:33
russellbtrue story01:33
fungibarbaric01:33
fungithey had to smell one another and everything01:34
StevenKfungi: How is that different from a plane seat, then?01:34
*** jorisroovers has quit IRC01:34
fungiStevenK: airplane seats are so uncomfortable you completely forget about the smell01:34
StevenKHaha01:35
fungimy elbows are hating me01:35
*** nosnos has joined #openstack-infra01:35
*** krotscheck has quit IRC01:36
openstackgerritA change was merged to openstack-infra/askbot-theme: removed a broken line from the script  https://review.openstack.org/6818301:37
StevenKfungi: I find its easier if you leave your elbows and knees in the overhead locker01:37
lifelessoffices are terrible for productivity01:38
fungii'm surprised my typos don't make it obvious that i type with only my elbows and knees01:38
lifelessand mental health01:38
StevenKlifeless: And actual health during flu season01:39
clarkb04 is idle, restarting it now01:41
openstackgerritJoshua Hesketh proposed a change to openstack-infra/zuul: Send swift upload instructions to workers  https://review.openstack.org/6829701:42
*** dcramer__ has quit IRC01:43
clarkbits up, will watch that scp uploads are happy01:43
jheskethjeblair: ^ the start of the log storing stuff01:43
*** jasondotstar has joined #openstack-infra01:44
*** thuc has quit IRC01:44
*** yaguang has joined #openstack-infra01:49
clarkbscp on jenkins04 looks happy, a docs job with tons of files uploaded ok01:49
clarkbwe can roll that out to the remaining masters over the week and upgrade the unupgraded 201:50
clarkbthis assumes I will have time >_>01:50
*** hashar has quit IRC01:55
dimsnice to hear clarkb01:56
*** zhiwei has joined #openstack-infra02:00
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Teach periodicCleanup how to do one provider.  https://review.openstack.org/6829902:01
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Move cron loading below provider loading.  https://review.openstack.org/6830002:01
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Move cron definition out of the inner loop.  https://review.openstack.org/6830102:01
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Decouple cron names from config file names.  https://review.openstack.org/6830202:01
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Run per-provider cleanup threads.  https://review.openstack.org/6830302:01
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Include check in fake.yaml.  https://review.openstack.org/6829502:01
lifelessjeblair: ^02:01
clarkbfungi: why does the welcome message thing need a service account?02:01
lifelessjeblair: I can move that below the other refactoring if you'd prefer02:01
openstackgerritJoe Gordon proposed a change to openstack-infra/devstack-gate: Add support to run nova-api-metadata as separate binary  https://review.openstack.org/6830402:04
fungiclarkb: so that it can post a comment to a change02:06
clarkbfungi: don't we have accounts that can do that already, or are they named in such a way that would be confusing02:06
*** markmcclain has quit IRC02:07
*** sarob has quit IRC02:08
fungiclarkb: i think the hope was that the display name of the account would serve as a visual clue to reviewers (would appear as something like "Welcome New Contributor!" in bold even in collapsed comment view that way)02:08
*** starmer has joined #openstack-infra02:08
clarkbgotcha02:08
openstackgerritJoe Gordon proposed a change to openstack-infra/config: Run tempest-dsvm-postgres-full with nova-api-metadata binary  https://review.openstack.org/6830502:08
*** sarob has joined #openstack-infra02:08
fungiclarkb: gerrit suexec api calls would allow us to do it without authenticating as the account (trivial-rebase does this) but still requires a skeleton account in the db02:09
*** dims has quit IRC02:11
*** sarob has quit IRC02:13
*** gyee has quit IRC02:13
*** miqui has quit IRC02:16
jog0https://github.com/openstack/openstack/graphs/commit-activity says 35 commits to o/o yesterday (I think thats UTC)02:17
jog0which is pretty good02:17
clarkbfungi: I am going to walk home and will try to decompress as tomorrow will be a very busy day02:18
fungiclarkb: good call02:18
clarkbI have a thing at hp early (for me) then I plan on digging into my zuul change and the jenkins upgrades02:19
*** CaptTofu has joined #openstack-infra02:19
fungijog0: put a nickel in the github jar02:20
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Teach periodicCleanup how to do one provider.  https://review.openstack.org/6829902:21
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Move cron loading below provider loading.  https://review.openstack.org/6830002:21
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Move cron definition out of the inner loop.  https://review.openstack.org/6830102:21
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Decouple cron names from config file names.  https://review.openstack.org/6830202:21
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Run per-provider cleanup threads.  https://review.openstack.org/6830302:21
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Use the nonblocking cleanupServer.  https://review.openstack.org/6800402:21
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Include check in fake.yaml.  https://review.openstack.org/6829502:21
fungiwhen that jar is full, we'll use it to buy more git servers02:21
jog0fungi: heh, they actually have something really nice that we don't yet02:21
jog0for once02:21
lifelesswhats that?02:21
jog0lifeless: https://github.com/openstack/openstack/graphs/commit-activity pretty pictures02:21
*** michchap has quit IRC02:24
*** michchap has joined #openstack-infra02:25
*** senk1 has quit IRC02:25
*** markwash has quit IRC02:25
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Run per-provider cleanup threads.  https://review.openstack.org/6830302:28
openstackgerritlifeless proposed a change to openstack-infra/nodepool: Include check in fake.yaml.  https://review.openstack.org/6829502:28
*** miqui has joined #openstack-infra02:28
*** starmer has quit IRC02:29
*** andrew_plunk has joined #openstack-infra02:29
*** vkozhukalov has joined #openstack-infra02:30
*** dpyzhov has joined #openstack-infra02:31
*** coolsvap_away has quit IRC02:33
*** dcramer__ has joined #openstack-infra02:40
andrew_plunkhello everyone. I was wondering if anyone could point me to the code used to link launchpad blueprints to gerrit reviews. I am interested in using that information for presenting verbose changelogs between heat builds02:40
*** yamahata has quit IRC02:42
clarkbit is in the openstack-infra/jeepyb project02:44
*** AaronGr is now known as aarongr_away02:47
*** CaptTofu has quit IRC02:48
andrew_plunkawesome clarkb thank you so much02:49
andrew_plunkyeah when I looked at the huge page of openstack-infra repos I did not know where to start02:50
*** changbl has quit IRC02:50
*** melwitt1 has quit IRC02:51
andrew_plunkdang it seems to be using gerrit's database a lot02:52
andrew_plunkI was hoping for rest api or json over ssh02:52
andrew_plunkthe launchpad code is very helpful though02:53
*** changbl has joined #openstack-infra02:54
*** starmer has joined #openstack-infra02:55
*** andrew_plunk has quit IRC03:00
*** slong_ has joined #openstack-infra03:01
*** slong has quit IRC03:01
*** david-lyle_ has joined #openstack-infra03:03
openstackgerritJoshua Hesketh proposed a change to openstack-infra/zuul: Send swift upload instructions to workers  https://review.openstack.org/6829703:04
*** changbl has quit IRC03:07
*** changbl has joined #openstack-infra03:09
*** jerryz_ has joined #openstack-infra03:13
*** changbl has quit IRC03:15
*** vipul-away is now known as vipul03:17
*** dstanek has quit IRC03:17
*** changbl has joined #openstack-infra03:18
*** dstanek has joined #openstack-infra03:18
*** thuc has joined #openstack-infra03:21
clarkbanteaya: the gerrit db use should be limited to adding projects iirc03:22
clarkboh they left...03:22
*** jerryz_ has quit IRC03:25
*** rnirmal has joined #openstack-infra03:26
portanteclarkb: regarding swift gate job timeouts03:27
portante125 minutes seems kinda long, so perhaps 200s instead?03:27
clarkb200s is far too short03:28
clarkbit takes longer to install devstack03:28
*** ArxCruz has quit IRC03:28
*** vogxn has joined #openstack-infra03:29
portantethe average run time for the functional tests in the last 14 days is always less than 200s, really less than 105s03:29
portante1050s03:29
portante150s03:29
portantesorry03:29
portanteso add 200s to whatever it typically takes to install devstack for your cap03:29
portanteso maybe 25 minutes, 30 minutes at the most?03:31
jog0so no neutron patch has merged since the 18th https://review.openstack.org/#/q/status:merged+project:openstack/neutron,n,z and neutron patch 53609,10  just caused a massive failure03:31
portanteclarkb03:31
clarkbrandom change sample https://review.openstack.org/#/c/67905/ took over 13 minutes03:31
jog0should someone snipe it out of gate?03:31
clarkbI would do 25 at the low end 30 is probably safer since the test resources are so variable03:31
portantemuch better than 125. :)03:32
clarkbya03:32
jog0clarkb: ^03:32
clarkbjog0: fine with me03:33
jog0clarkb: you want to do it, I am not sure what snipe etiquette is03:33
notmynameportante: I seem to have missed something. why does lowering the timeout make make things better?03:34
clarkbi am dinnering03:34
clarkbi just leave a comment explaining the noop patch03:34
jog0clarkb: looks like anita already did, but zuul is very far behind03:35
jog0she uploaded revision 11 at 4:05PM03:35
jog0doh 1350 events in zuul03:35
jog0that explains that :/03:36
portantenotmyname: because it lanquished for 125 minutes before the devstack environment killed it03:38
*** CaptTofu has joined #openstack-infra03:38
*** CaptTofu has quit IRC03:38
*** CaptTofu has joined #openstack-infra03:39
portanteclarkb: I am done with that Jenkins instance we had for that investigation03:39
portantethanks03:39
clarkbportante: ok, I will try to remember to delete it tomorrow03:40
clarkbthough it should cleanup on its own after 24 hours03:40
portanteas long as you it is not held up thinking we still need it for investigations, I'm cool03:41
clarkbk03:42
*** slong has joined #openstack-infra03:43
*** slong_ has quit IRC03:43
notmynameportante: did the tests not run?03:44
notmynameportante: sorry, I feel I'm missing some context03:44
*** jasondotstar has quit IRC03:44
*** vogxn has left #openstack-infra03:45
clarkbnotmyname: the tests ran then hung for 2 hours because they blocked on something03:48
clarkbit took a long time for them to report back03:48
*** hub_cap has joined #openstack-infra03:48
hub_caphey krusty krew, if i have a review in the pipeline thats failed for a known reason, can i kill it somehow? ive put reverify bug XXX on it already, is that enough?03:49
clarkbhub_cap: no thats not enough, in this case you can wait or push a new patchset03:50
notmynameclarkb: ok, thanks03:50
notmynameportante: clarkb: any idea what they blocked on? was it an issue with the tests or an issue with the infrastructure?03:50
hub_capah its not my patchset (dont want to take over authorship from git's perspective), and its 3/4 done, so maybe i wait... thx clarkb for the fast answer <303:51
*** praneshp has joined #openstack-infra03:51
*** miqui has quit IRC03:52
*** miqui has joined #openstack-infra03:53
*** harlowja is now known as harlowja_away03:54
*** gokrokve has joined #openstack-infra03:55
*** Hefeweizen has joined #openstack-infra03:55
*** mriedem has quit IRC03:56
*** emagana has quit IRC03:57
notmynameclarkb: I want to see the console output from a job that doesn't seem to be on jenkins02 anymore. possible?04:00
*** jerryz_ has joined #openstack-infra04:02
*** jamielennox is now known as jamielennox|away04:05
*** coolsvap has joined #openstack-infra04:08
*** jerryz_ has quit IRC04:12
portantenotmyname: torgomatic and I discussed it in -swift04:13
portantewe don't know exactly what happened04:14
notmynamekk04:14
notmynamelooking at the scrollback04:14
*** CaptTofu has quit IRC04:15
lifelessnotmyname: the console output wasn't archived?04:18
notmynamelifeless: maybe? it's not on the jenkins box. if it goes somewhere else, I don't know about that04:18
lifelessnotmyname: was it a special thing, or a regular gerrit driven test?04:19
notmynamenormal thing04:19
lifelesswhat review #?04:19
notmynamelifeless: https://jenkins02.openstack.org/job/gate-swift-python26/3660/console04:19
lifelessnotmyname: do you know the gerrit review # ?04:20
notmynameno, sorry04:20
notmynamelifeless: there is a swift unittest error that very rarely shows up when the test system is under very heavy load. that's the last jenkins job I know that had it, and I wanted to see the error message more clearly and see how hard it would be to fix04:21
lifelesssure04:21
lifelessjenkins is bad at archive though, so we delete everything from it fairly rapidly04:22
notmynameya, makes sense04:22
notmynameturns out the oldest thing I saw on that box was job 3665, so I just missed it :-)04:22
lifelessso if you look at (say) https://review.openstack.org/#/c/63326/04:22
lifelesswe archive everything from all jobs to http://logs.openstack.org/26/63326/7/04:23
lifelessthe 26 is the last digits of the gerrit id04:23
lifeless7 is the patch set04:23
lifelessthen under http://logs.openstack.org/26/63326/7/check/ we have all the jobs04:23
lifelessand http://logs.openstack.org/26/63326/7/check/gate-swift-python26/ as you'd expect04:24
notmynamehmm..ok. thanks (is that on the wiki anywhere?)04:24
lifelessthen http://logs.openstack.org/26/63326/7/check/gate-swift-python26/fb4125a/console.html has the console log from jenkins for that job04:24
lifelessnote that there's no way to figure this uot from jenkins job #, because thats transient04:24
lifelessso the primary key, if you will, is gerrit04:25
lifelessno idea if its on the wiki04:25
lifelessI'm fairly sure most of it is described in the CI docs at ci.openstack.org04:25
notmynamelifeless: thanks04:25
*** rnirmal has quit IRC04:26
*** gokrokve has quit IRC04:28
*** miqui has quit IRC04:29
*** gokrokve has joined #openstack-infra04:29
*** DennyZhang has joined #openstack-infra04:37
*** vipul is now known as vipul-away04:40
*** dpyzhov has quit IRC04:41
*** vipul-away is now known as vipul04:44
*** nicedice has quit IRC04:51
*** nicedice has joined #openstack-infra04:52
*** gokrokve has quit IRC04:53
openstackgerritA change was merged to openstack-infra/elastic-recheck: Remove remaining cases of '@message'  https://review.openstack.org/6775404:58
*** DinaBelova_ is now known as DinaBelova04:58
openstackgerritA change was merged to openstack-infra/storyboard: Fixed doc build  https://review.openstack.org/6737604:59
*** thuc has quit IRC05:06
*** thuc has joined #openstack-infra05:06
hub_capis storyboard taking off again? does anyone know whats up w/ that? (mordred?)05:08
openstackgerritJoshua Hesketh proposed a change to openstack-infra/zuul: Send swift upload instructions to workers  https://review.openstack.org/6829705:08
*** emagana has joined #openstack-infra05:08
hub_capi was considering writing an interface to lp's disgusting ui to quell my fits of insanity in dealing w/ it05:08
*** chandankumar_ has joined #openstack-infra05:08
*** thuc has quit IRC05:11
*** thuc has joined #openstack-infra05:13
*** yamahata has joined #openstack-infra05:13
*** vipul is now known as vipul-away05:15
*** sarob has joined #openstack-infra05:15
*** emagana has quit IRC05:15
*** nicedice has quit IRC05:21
*** coolsvap is now known as coolsvap_away05:23
*** vogxn has joined #openstack-infra05:24
*** gokrokve has joined #openstack-infra05:25
*** oubiwann_ has quit IRC05:27
*** vipul-away is now known as vipul05:32
*** thuc_ has joined #openstack-infra05:35
*** pcrews has quit IRC05:35
*** thuc has quit IRC05:38
*** vipul is now known as vipul-away05:47
*** thuc_ has quit IRC06:01
*** thuc has joined #openstack-infra06:01
*** DinaBelova is now known as DinaBelova_06:04
*** starmer_ has joined #openstack-infra06:06
*** thuc has quit IRC06:06
*** starmer has quit IRC06:08
*** blamar has quit IRC06:08
*** vipul-away is now known as vipul06:10
openstackgerritJoshua Hesketh proposed a change to openstack-infra/zuul: Send swift upload instructions to workers  https://review.openstack.org/6829706:12
*** CaptTofu has joined #openstack-infra06:16
*** coolsvap_away is now known as coolsvap06:19
*** kraman has quit IRC06:21
*** CaptTofu has quit IRC06:21
*** blamar has joined #openstack-infra06:23
*** sarob has quit IRC06:30
*** sarob has joined #openstack-infra06:31
*** sarob has quit IRC06:35
*** mrda has quit IRC06:35
*** afazekas has joined #openstack-infra06:35
*** andreaf has joined #openstack-infra06:39
*** NikitaKonovalov_ is now known as NikitaKonovalov06:41
*** xBsd has joined #openstack-infra06:43
*** nosnos_ has joined #openstack-infra06:43
*** nosnos has quit IRC06:43
*** DennyZhang has quit IRC06:44
*** NikitaKonovalov is now known as NikitaKonovalov_06:50
*** mrda__ is now known as mrda_away06:51
*** kraman has joined #openstack-infra06:52
*** vkozhukalov has quit IRC06:52
*** kraman has quit IRC06:56
*** sarob has joined #openstack-infra07:01
*** emagana has joined #openstack-infra07:07
*** sarob has quit IRC07:09
*** sarob has joined #openstack-infra07:17
*** yolanda has joined #openstack-infra07:18
*** NikitaKonovalov_ is now known as NikitaKonovalov07:18
*** sarob_ has joined #openstack-infra07:19
*** afazekas has quit IRC07:19
*** vipul is now known as vipul-away07:21
*** sarob has quit IRC07:21
*** kraman has joined #openstack-infra07:22
*** sarob_ has quit IRC07:23
*** vipul-away is now known as vipul07:25
*** odyssey4me has joined #openstack-infra07:25
david-lyle_anyone available that can promote this https://review.openstack.org/#/c/68268/ ?  licensing issue that needs to make i-207:27
*** kraman has quit IRC07:27
*** afazekas has joined #openstack-infra07:27
*** vogxn has quit IRC07:33
*** lyxus has joined #openstack-infra07:36
*** yamahata has quit IRC07:38
*** vipul is now known as vipul-away07:45
clarkbdavid-lyle: the machine with keys is off. I canpromote in the morning if fungi/jeblair dont brat me to it07:50
*** obondarev_ has joined #openstack-infra07:53
*** vipul-away is now known as vipul07:54
*** vogxn has joined #openstack-infra07:56
openstackgerritElizabeth Krumbach Joseph proposed a change to openstack-infra/config: Configure automatic formatting of README files  https://review.openstack.org/6037508:02
*** flaper87|afk is now known as flaper8708:03
*** mrmartin has joined #openstack-infra08:04
*** mancdaz_away is now known as mancdaz08:08
*** yanghe has joined #openstack-infra08:09
*** xBsd has quit IRC08:12
*** jcoufal has joined #openstack-infra08:13
*** DinaBelova_ is now known as DinaBelova08:15
*** CaptTofu has joined #openstack-infra08:17
*** sarob has joined #openstack-infra08:17
*** yanghe has left #openstack-infra08:17
*** sarob has quit IRC08:22
*** CaptTofu has quit IRC08:22
*** kraman has joined #openstack-infra08:23
*** SergeyLukjanov_ is now known as SergeyLukjanov08:23
*** luqas has joined #openstack-infra08:23
*** kraman has quit IRC08:27
*** vkozhukalov has joined #openstack-infra08:30
*** pblaho has joined #openstack-infra08:34
*** fbo_away is now known as fbo08:34
*** saschpe has quit IRC08:41
*** saschpe has joined #openstack-infra08:41
*** luqas has quit IRC08:55
*** praneshp has quit IRC09:00
*** derekh has joined #openstack-infra09:03
*** yassine has joined #openstack-infra09:06
*** zhiwei has quit IRC09:09
*** jpich has joined #openstack-infra09:10
*** BobBallAWay is now known as BobBall09:11
*** San_D has joined #openstack-infra09:11
openstackgerritDerek Higgins proposed a change to openstack-infra/config: Add some dependencies required by toci  https://review.openstack.org/6768509:14
openstackgerritDerek Higgins proposed a change to openstack-infra/config: Enable precise-backports on tripleo test nodes  https://review.openstack.org/6795809:16
*** San_D has quit IRC09:17
*** sarob has joined #openstack-infra09:17
*** derekh has quit IRC09:18
openstackgerritNikita Konovalov proposed a change to openstack-infra/storyboard: API tests for rest  https://review.openstack.org/6744709:19
*** sarob has quit IRC09:22
*** kraman has joined #openstack-infra09:23
*** kraman has quit IRC09:25
*** kraman1 has joined #openstack-infra09:25
*** luqas has joined #openstack-infra09:27
*** SergeyLukjanov is now known as SergeyLukjanov_a09:27
*** SergeyLukjanov_a is now known as SergeyLukjanov_09:28
*** kraman1 has quit IRC09:30
openstackgerritPavel Sedlák proposed a change to openstack-infra/jenkins-job-builder: Add support for Test Stability with Junit  https://review.openstack.org/6815209:32
*** bauzas has joined #openstack-infra09:36
*** zhiwei has joined #openstack-infra09:36
*** markmc has joined #openstack-infra09:38
*** jooools has joined #openstack-infra09:39
*** mugsie has quit IRC09:39
openstackgerritZang MingJie proposed a change to openstack-infra/zuul: Use ssh to fetch packs instead of HTTP  https://review.openstack.org/6785809:39
*** mugsie has joined #openstack-infra09:40
*** mugsie has quit IRC09:40
*** mugsie has joined #openstack-infra09:40
*** matrohon has quit IRC09:40
*** matrohon has joined #openstack-infra09:40
*** SergeyLukjanov_ is now known as SergeyLukjanov09:40
*** jp_at_hp has joined #openstack-infra09:43
*** SergeyLukjanov is now known as SergeyLukjanov_09:47
*** NikitaKonovalov is now known as NikitaKonovalov_09:47
*** mancdaz is now known as mancdaz_away09:52
*** johnthetubaguy has joined #openstack-infra09:52
*** lyxus has quit IRC09:53
*** alexpilotti has joined #openstack-infra09:58
*** DinaBelova is now known as DinaBelova_09:58
*** jasondotstar has joined #openstack-infra10:00
*** boris-42 has quit IRC10:02
*** boris-42 has joined #openstack-infra10:04
*** vogxn has quit IRC10:07
*** pblaho has quit IRC10:12
anteayaI have been operating under the belief that reverify had been removed, I just sniped two neutronclient patches https://review.openstack.org/#/c/63986/ and https://review.openstack.org/#/c/63328/ that got back in with reverify bug <bug number>10:13
*** sarob has joined #openstack-infra10:13
*** sarob_ has joined #openstack-infra10:17
*** CaptTofu has joined #openstack-infra10:18
*** lyxus has joined #openstack-infra10:18
*** sarob has quit IRC10:18
*** vogxn has joined #openstack-infra10:20
*** sarob_ has quit IRC10:22
*** CaptTofu has quit IRC10:22
*** jhesketh_ has quit IRC10:23
*** kraman has joined #openstack-infra10:23
anteaya0 events, 0 results, 112 in the gate, 159 in check10:24
anteaya44.5 hours for the oldest gate patches10:25
*** kraman has quit IRC10:27
AJaegeranteaya, https://review.openstack.org/#/c/67708/ - this was not approved yet.10:34
AJaegeranteaya, the above is the patch you had in mind with reverify removal10:34
*** jesusaurus has quit IRC10:35
*** derekh has joined #openstack-infra10:35
AJaegeranteaya, interesting way to annotate the commit message with those two ;)10:36
*** jesusaurus has joined #openstack-infra10:36
*** jroovers has joined #openstack-infra10:38
anteayaAJaeger: thanks for pointing me to 6770810:39
anteayaI had hoped I could retire the big stick - getting tired of doing the cop routine10:39
anteayanot sure what else to do10:39
*** max_lobur_afk is now known as max_lobur10:39
AJaegerBlog about it as reference? Write another email pointing to the blog...10:41
AJaegerStill, too many will not read it, so 67708 seems the best way to do it for now10:42
AJaeger0 events, 0 results is great - compared to the over 1000 earlier...10:43
*** pelix has joined #openstack-infra10:56
*** pblaho has joined #openstack-infra10:57
anteayayes10:57
anteayaI am so tired, I haven't blogged in a long time10:57
anteayaI am a few blog posts behind10:58
*** yassine_ has joined #openstack-infra10:58
*** yassine has quit IRC10:59
*** jasondotstar has quit IRC11:00
*** mancdaz_away is now known as mancdaz11:01
*** dkranz has quit IRC11:01
pelixclarkb: wondering if you're happy with the update to https://review.openstack.org/#/c/63579 ?11:02
*** dkranz has joined #openstack-infra11:02
AJaegeranteaya, that's sad, hope you find some time to recover soon11:02
*** michchap has quit IRC11:06
anteayaAJaeger: thanks me too, it is not like I am the only one though11:06
*** michchap has joined #openstack-infra11:06
AJaegeryeah - not sure how much sleep fungi got the last week ;(11:07
*** dizquierdo has joined #openstack-infra11:08
*** jroovers has quit IRC11:10
*** gokrokve has quit IRC11:11
*** mrmartin has quit IRC11:14
anteayaAJaeger: yeah, not much11:15
*** sarob has joined #openstack-infra11:17
*** pelix has left #openstack-infra11:17
*** sarob has quit IRC11:22
*** kraman has joined #openstack-infra11:23
*** lcestari has quit IRC11:26
*** kraman has quit IRC11:27
*** derekh has quit IRC11:29
*** lcestari has joined #openstack-infra11:29
*** afazekas_ has joined #openstack-infra11:41
*** jasondotstar has joined #openstack-infra11:41
*** yaguang has quit IRC11:41
*** jroovers has joined #openstack-infra11:50
*** dpyzhov has joined #openstack-infra11:51
*** dstanek has quit IRC11:52
*** boris-42 has quit IRC11:53
openstackgerritMark McLoughlin proposed a change to openstack/requirements: Allow use of oslo.messaging 1.3.0a4 from pypi  https://review.openstack.org/6804011:54
openstackgerritA change was merged to openstack-infra/elastic-recheck: Add fingerprint for bug 1270654  https://review.openstack.org/6829611:54
openstackgerritA change was merged to openstack-infra/elastic-recheck: Add fingerprint for bug 1097592  https://review.openstack.org/6828211:54
openstackgerritA change was merged to openstack-infra/elastic-recheck: Add fingerprint for bug 1270382  https://review.openstack.org/6828011:54
*** yassine_ has quit IRC11:56
openstackgerritA change was merged to openstack-infra/elastic-recheck: Sort uncategorized fails by time  https://review.openstack.org/6776111:56
*** boris-42 has joined #openstack-infra11:56
*** yassine has joined #openstack-infra11:57
openstackgerritA change was merged to openstack-infra/elastic-recheck: Add fingerprint for bug 1271331  https://review.openstack.org/6827011:57
openstackgerritA change was merged to openstack-infra/elastic-recheck: Add query for bug 1270710  https://review.openstack.org/6776411:57
*** mancdaz is now known as mancdaz_away11:58
*** ArxCruz has joined #openstack-infra11:58
*** mancdaz_away is now known as mancdaz11:59
*** vogxn has quit IRC12:00
*** gokrokve has joined #openstack-infra12:06
*** thuc has joined #openstack-infra12:09
*** b3nt_pin has joined #openstack-infra12:11
*** b3nt_pin is now known as beagles12:11
*** gokrokve has quit IRC12:12
*** thuc has quit IRC12:13
*** jcoufal has quit IRC12:14
*** dims has joined #openstack-infra12:14
*** jcoufal has joined #openstack-infra12:15
*** dpyzhov has quit IRC12:16
*** sarob has joined #openstack-infra12:17
*** CaptTofu has joined #openstack-infra12:19
*** dpyzhov has joined #openstack-infra12:19
*** dstanek has joined #openstack-infra12:21
*** sarob has quit IRC12:22
*** alexpilotti has quit IRC12:22
*** kraman has joined #openstack-infra12:23
*** CaptTofu has quit IRC12:23
*** dpyzhov has quit IRC12:26
*** kraman has quit IRC12:27
*** dstanek has quit IRC12:29
*** julim has joined #openstack-infra12:32
openstackgerritDavanum Srinivas (dims) proposed a change to openstack-infra/devstack-gate: Temporary HACK : Enable UCA  https://review.openstack.org/6756412:33
*** dpyzhov has joined #openstack-infra12:37
*** vogxn has joined #openstack-infra12:45
*** smarcet has joined #openstack-infra12:46
*** ociuhandu has quit IRC12:49
*** CaptTofu has joined #openstack-infra12:50
*** emagana has quit IRC12:52
*** CaptTofu has quit IRC12:55
*** mriedem has joined #openstack-infra12:57
*** gokrokve has joined #openstack-infra13:01
*** mancdaz is now known as mancdaz_away13:02
*** coolsvap has quit IRC13:03
*** mancdaz_away is now known as mancdaz13:05
*** dcramer__ has quit IRC13:06
*** gokrokve has quit IRC13:06
*** luqas has quit IRC13:08
*** heyongli has joined #openstack-infra13:09
*** ociuhandu has joined #openstack-infra13:10
*** david-lyle_ has quit IRC13:11
*** ociuhandu has quit IRC13:15
*** sarob has joined #openstack-infra13:17
openstackgerritSean Dague proposed a change to openstack-infra/elastic-recheck: update web ui for better sorting  https://review.openstack.org/6837413:18
*** xchu has joined #openstack-infra13:18
matelAjaeger: Hi, I don't quite understand your comment here: https://review.openstack.org/6836313:20
*** amotoki has joined #openstack-infra13:20
AJaegermatel, wrong review - I didn't comment on that one13:21
*** jasondotstar has quit IRC13:21
matelAjaeger: Oh, yeah, I meant this: https://review.openstack.org/6818113:22
AJaegermatel, do you mean https://review.openstack.org/#/c/68181/ ?13:22
AJaegerAh, you do ;)13:22
*** sarob has quit IRC13:22
*** kraman has joined #openstack-infra13:23
AJaegerAs part of which repository testing do you need this?13:23
AJaegerIs that repo already in projects.txt?13:23
matelSo it's a package to be synced from pip, and it's a runtime dependency for nova.13:24
matelWe usually install it with devstack: https://github.com/openstack-dev/devstack/blob/master/tools/xen/prepare_guest.sh#L2613:25
AJaegermatel, ok, then my comment is wrong, wasn't clear to me.13:25
*** dizquierdo has quit IRC13:26
*** kraman has quit IRC13:27
matelOkay, thanks, I will put this info to as a comment.13:27
matelAh, you already did it, thanks.13:28
AJaegermatel, I've added a comment as well ;)13:28
*** rfolco has quit IRC13:31
*** oubiwann_ has joined #openstack-infra13:31
*** rfolco has joined #openstack-infra13:33
*** derekh has joined #openstack-infra13:35
*** jasondotstar has joined #openstack-infra13:37
*** luqas has joined #openstack-infra13:38
openstackgerritA change was merged to openstack-infra/elastic-recheck: update web ui for better sorting  https://review.openstack.org/6837413:38
*** eharney has joined #openstack-infra13:38
*** thomasem has joined #openstack-infra13:43
*** mancdaz is now known as mancdaz_away13:44
*** mancdaz_away is now known as mancdaz13:47
*** zhiwei has quit IRC13:49
openstackgerritSean Dague proposed a change to openstack-infra/elastic-recheck: put the fails24 in the right place  https://review.openstack.org/6838513:52
*** thuc has joined #openstack-infra13:53
*** prad has joined #openstack-infra13:54
*** yamahata has joined #openstack-infra13:55
*** mestery has joined #openstack-infra13:57
*** mfer has joined #openstack-infra13:59
*** heyongli has quit IRC14:01
*** dprince has joined #openstack-infra14:01
*** gokrokve has joined #openstack-infra14:02
openstackgerritA change was merged to openstack-infra/elastic-recheck: put the fails24 in the right place  https://review.openstack.org/6838514:02
*** yamahata has quit IRC14:05
portantesdague: should I see a change on the elastic recheck page yet?14:05
sdagueI forget how often puppet triggers that update14:06
*** boris-42 has quit IRC14:06
*** CaptTofu has joined #openstack-infra14:06
*** gokrokve has quit IRC14:06
portantek thanks, I assume that fix was to address the "undefined" text on that page?14:06
sdagueyep14:06
portantethx14:06
sdagueI'm sorting the bugs by fails in last 24 hrs now14:07
sdagueand wanted to provide actual numbers14:07
sdagueinstead of just the graphs14:07
portantethat sounds like a good idea14:07
portantewe can then play wack-a-mole easier14:07
sdagueyep14:07
*** dpyzhov has quit IRC14:07
sdagueand not be distracted by things which are mostly fixed14:07
sdagueyou can see that russellb managed to fully nail - 127068014:08
sdaguewhich is great14:08
portantethe right data helps one steer the ship away from the icebergs14:08
sdaguei like it when stuff flatlines14:08
sdagueyep14:08
* portante looking14:08
russellbsdague: yar14:09
*** boris-42 has joined #openstack-infra14:10
*** dpyzhov has joined #openstack-infra14:10
portantenice work russellb14:10
anteayayes thanks for taking out 127068014:11
russellbmore to whack though14:11
russellbsdague: if you see another nova one sinking the ship ping me, otherwise i'm playing in zuul14:11
*** mriedem has quit IRC14:12
russellbthere's a libvirt unit test we need to stab ...14:12
*** matsuhashi has joined #openstack-infra14:14
*** yassine has quit IRC14:15
*** yassine has joined #openstack-infra14:15
sdagueyep, I'm working on the update email right now, it looks like the SSH bug is back on top14:16
anteayavery true14:17
anteayadang14:17
russellbsdague: ACK thanks14:17
*** sarob has joined #openstack-infra14:17
anteayaI think the ssh bug is the isolated job failure14:20
anteayaI think14:20
*** vogxn has quit IRC14:20
*** coolsvap has joined #openstack-infra14:22
*** sarob has quit IRC14:22
*** kraman has joined #openstack-infra14:23
sdaguerussellb: Bug 1270654 - test_different_fname_concurrency flakey fail is also something probably worth trying to get someone to look at14:24
sdagueit's a nova unit test race14:24
russellbsdague: yeah that's the one i was referring to ...14:24
sdagueok gotach14:24
russellbguess i should jump on it14:24
russellbasking people nicely to work on these bugs didn't really work for a long time :)14:25
sdagueit's 5th on the list of non infra fingerprints14:25
russellbOK, worth the time then i thnk14:25
sdagueso not huge, but 15 fails in last 24 hrs14:25
sdagueacross all queues14:25
*** jaypipes has quit IRC14:27
*** jaypipes has joined #openstack-infra14:28
*** xchu has quit IRC14:29
*** thuc has quit IRC14:30
*** thuc has joined #openstack-infra14:30
*** alexpilotti has joined #openstack-infra14:31
*** kraman has quit IRC14:32
anteayattx do share when you cut i214:32
*** mriedem has joined #openstack-infra14:33
*** thuc has quit IRC14:35
*** CaptTofu has quit IRC14:35
*** esker has quit IRC14:39
*** esker has joined #openstack-infra14:40
*** burt1 has joined #openstack-infra14:40
ttxanteaya: anytime now14:41
ttxbut won't do all of them at the same time, so I can do some ordering14:41
ttxbased on what's just at the top of the queue14:42
*** otherwiseguy has quit IRC14:42
*** dcramer__ has joined #openstack-infra14:42
anteayattx you can cut neutron anytime14:43
anteayaI have removed everything from the gate14:43
anteayasince everything is currently failing isolated jobs14:44
anteayaeverything == neutron in the above sentence14:44
*** esker has quit IRC14:45
*** dkliban is now known as dkliban_afk14:45
*** matsuhashi has quit IRC14:45
ttxanteaya: ok, neutron is first then14:45
*** jasondotstar has quit IRC14:46
sdagueanteaya: thanks14:48
*** yolanda has quit IRC14:49
*** miqui has joined #openstack-infra14:49
anteayasure14:50
*** sHellUx has joined #openstack-infra14:50
*** ogelbukh has joined #openstack-infra14:52
*** prad has quit IRC14:52
ttxanteaya: so I should just defer everything not implemented from https://launchpad.net/neutron/+milestone/icehouse-2, right ?14:53
anteayaah, that is a good question for markmcclain when he awakes14:53
*** prad has joined #openstack-infra14:53
anteayathat is a question I can't answer, I'm just trying to protect the integrity of the gate14:54
anteayaI have no say on the direction of neutron14:54
*** yolanda has joined #openstack-infra14:55
ttxwell, deferring -- we can al fix that if that was wrong for some14:56
anteayattx okay14:56
anteayaI'll get markmcclain to find you when he arrives unless you find him first14:56
*** nosnos_ has quit IRC14:57
*** dkliban_afk is now known as dkliban14:58
derekhfungi: hi ya, how would we go about enabling ci-overcloud in the production nodepool ? were pretty close to being able to test the tripleo-ci stuff14:59
*** oubiwann_ has quit IRC14:59
*** dstanek has joined #openstack-infra15:00
anteayaderekh: note fungi is in utah this week at a foundation thing, I'm uncertain of his online schedule - I haven't seen him so far this morning15:00
derekhanteaya: ok, thanks15:01
derekhanybody else know how to go about it ?^^15:01
*** alexpilotti has quit IRC15:01
*** gokrokve has joined #openstack-infra15:03
sdaguederekh: right now, with gate situation, a lot of things are blocked up15:03
*** odyssey4me has quit IRC15:03
sdagueespecially on infra team, so the best way to free up review time for things like that is help on some of the gate reseting bugs15:03
derekhsdague: ok, thanks, will come back when things are a bit calmer, and will see if I can pick on any of the bugs15:04
sdaguederekh: thanks!15:04
*** CaptTofu has joined #openstack-infra15:05
*** sandywalsh has joined #openstack-infra15:06
*** thuc has joined #openstack-infra15:06
*** dims has quit IRC15:08
*** gokrokve has quit IRC15:08
*** dims has joined #openstack-infra15:09
*** gokrokve has joined #openstack-infra15:09
*** rnirmal has joined #openstack-infra15:11
*** thuc has quit IRC15:13
*** thuc has joined #openstack-infra15:14
*** alexpilotti has joined #openstack-infra15:15
*** sarob has joined #openstack-infra15:17
sdaguewell that's a thing to see15:18
*** thuc has quit IRC15:18
sdaguecurrently all the gate fails triggering resets are unit tests15:18
sdague2 on nova, 1 on swift15:19
sdague2 on glance15:19
*** otherwiseguy has joined #openstack-infra15:19
anteayathat is odd15:20
anteayais there some similiarity in the unit test failures, I wonder15:21
anteayasince after isolated jobs, neutron needs to address unit test failures15:21
*** kraman has joined #openstack-infra15:21
sdaguenot really, russellb is looking at the nova one15:21
*** sarob has quit IRC15:22
*** dhellmann is now known as dhellmann_15:22
anteayak15:23
*** oubiwann_ has joined #openstack-infra15:25
*** DinaBelova_ is now known as DinaBelova15:25
*** jgrimm has joined #openstack-infra15:28
*** jroovers has quit IRC15:28
anteayaboth glance unit test failures are hitting test_index_with_sort_dir15:29
anteayaand one nova patch is hitting a pep8 which is taking out the nova patch behind it15:29
anteayabut the swift and nova unit test failures appear unique15:30
sdaguefungi: 66974 could use promotion when we get a reset15:30
sdagueanteaya: yeh, honestly, if we get a reset I might pull all the glance patches out of the queue, that unit test fail looks really regular15:30
*** dmsimard has joined #openstack-infra15:31
dmsimardWoah guys, what happened to jenkins ? It's so fast today :D15:31
anteayadmsimard: i2 cut off15:32
anteayathe rush to submit patches is off15:32
anteayasdague: yeah, if the tests are going to prevent them from merging anyway15:32
*** SergeyLukjanov_ is now known as SergeyLukjanov15:33
*** jasond` has joined #openstack-infra15:40
*** luqas has quit IRC15:41
*** senk has joined #openstack-infra15:44
*** starmer_ has quit IRC15:45
*** senk1 has joined #openstack-infra15:46
jasond`is there anyway to estimate how long it will be before an approved review gets merged?15:46
jasond`is there a queue i can view somewhere?15:47
dmsimardhttp://status.openstack.org/zuul/15:47
dmsimardjasond`: ^15:47
jasond`dmsimard: thank you15:48
*** senk has quit IRC15:48
*** emagana has joined #openstack-infra15:48
sdaguejust had a reset event about 6 deep, but I don't think we've got anyone around to do a promote after the stuff on top merges15:49
*** luqas has joined #openstack-infra15:50
anteayamordred?15:50
anteayahe is the only person I think might be around15:50
*** emagana has quit IRC15:53
*** senk1 has quit IRC15:54
*** senk has joined #openstack-infra15:54
anteaya12 in post, woohoo15:57
annegentlehey infra, just wanted to let you know that some intense Operations Guide updates are happening tomorrow and Friday15:57
anteayaexcept the swift patch is failing the post job15:57
anteayaawesome, thanks annegentle15:57
anteayado you expect increased load for zuul?15:57
anteayaas a result of the intense updates?15:57
annegentleI don't think it affects your day-to-day much, nor much for load... but how would I know?15:57
notmynameanteaya: something I need to look at in swift?15:58
anteayayeah look at the post queue15:58
anteayathe swift patch15:58
*** senk has quit IRC15:58
anteaya0000000 as an id15:58
*** chandankumar_ has quit IRC15:58
anteayaannegentle: okay thanks for the heads up15:58
annegentleanteaya: it's building in under 3 minutes so I think we're good15:59
anteayaawesome16:00
*** chandankumar_ has joined #openstack-infra16:01
sdaguenotmyname: there was also another swift unit test fail in the gate16:02
sdaguewhich I think just got reset over16:02
*** esker has joined #openstack-infra16:02
*** rcleere has joined #openstack-infra16:02
portantesdague: can you point me at it?16:02
portantenotmyname: I'll review16:02
notmynameportante: thanks16:02
sdagueportante: https://jenkins02.openstack.org/job/gate-swift-python27/3268/console16:03
portantesdague: looking ...16:03
*** DennyZhang has joined #openstack-infra16:04
*** kmartin has quit IRC16:05
*** gokrokve has quit IRC16:05
*** SergeyLukjanov is now known as SergeyLukjanov_a16:07
*** vkozhukalov has quit IRC16:07
*** CaptTofu has quit IRC16:08
openstackgerritSteve Martinelli proposed a change to openstack/requirements: Remove oauth2 requirement  https://review.openstack.org/6842216:08
*** SergeyLukjanov_a is now known as SergeyLukjanov_16:08
*** thouveng has joined #openstack-infra16:10
*** UtahDave has joined #openstack-infra16:10
portantesdague: how do I see the rest of the logs for the above?16:10
*** CaptTofu has joined #openstack-infra16:10
*** dmsimard has left #openstack-infra16:10
sdagueclick on the link towards the top16:10
sdaguethere should be a full log link16:11
portanteI meant like syslog and other things, sorry16:11
sdagueit's unit tests16:11
sdagueI don't think we collect syslog16:11
*** pcrews has joined #openstack-infra16:11
portantek16:11
sdaguehttp://logs.openstack.org/86/66986/3/gate/gate-swift-python27/1e37d7f/ is everything that's artifact collected16:12
fungiderekh: mostly we need to restart nodepool with those new patches and config to test it, which is questionable under recent gate resource starvation. we hope that if we get some changes applied to zuul today we'll be straining the current pool capacity a lot less16:12
*** nicedice has joined #openstack-infra16:12
sdaguemorning fungi16:12
sdaguefungi: 66974 could use promotion when we get a reset16:13
sdagueit's the fix horizon needs for i2 on licensing16:13
fungisdague: ttx also has a release-critical patch which needs to go i at the same time16:13
anteayahow was your flight?16:13
sdaguefungi: I think it's the same patch :)16:13
portantesdague: can I hop on that instance to see what else is running?16:13
fungisdague: oh, he said 6826816:13
ttxhhm16:13
sdaguefungi: oh, listen to ttx16:13
sdagueportante: no, the unit tests nodes are all rotated through16:13
sdagueportante: this is unit tests, why would it be going to syslog?16:14
portantek, thx16:14
anteayaportante: if the instance is still running fungi is the only one here who can grant access to a running vm16:14
ttxfungi: 68268 confirmed16:14
derekhfungi: ok, so basically we just need to wait for a good time to restart nodepool16:14
portantesdague, I don't know how, just trying to figure out what happened16:14
anteayaif it isn't still running, then the vm has been destroyed, or is being destroyed16:14
sdagueportante: ok16:14
*** amotoki has quit IRC16:14
fungisdague: derekh right, and cross our fingers and hope it's right, since debugging it if it's wrong means project-wide work stoppage16:14
fungier, not sdague16:14
*** senk has joined #openstack-infra16:14
openstackgerritZhiQiang Fan proposed a change to openstack/requirements: Upgrade six to 1.5.2  https://review.openstack.org/6842416:15
*** jcoufal has quit IRC16:16
derekhfungi: ok thanks16:16
fungisdague: now that i'm able to get the status page up, i can see we're in a reset anyway16:17
fungiso bumping it now16:17
openstackgerritMatthew Treinish proposed a change to openstack-infra/elastic-recheck: Add multi-project irc support to the bot  https://review.openstack.org/6754016:18
openstackgerritZhiQiang Fan proposed a change to openstack/requirements: Ignore egg-info directory  https://review.openstack.org/6842516:18
*** senk has quit IRC16:19
*** mrodden1 is now known as mrodden16:19
*** _ruhe is now known as ruhe16:20
*** branen has quit IRC16:20
*** thouveng has quit IRC16:21
*** andreaf has quit IRC16:24
*** wenlock has joined #openstack-infra16:24
*** jasondotstar has joined #openstack-infra16:27
*** SergeyLukjanov_ is now known as SergeyLukjanov16:28
afazekasSearching for reviewer for these changes: https://review.openstack.org/#/c/65145/ and https://review.openstack.org/#/c/65140/16:29
*** thuc has joined #openstack-infra16:29
*** herndon has joined #openstack-infra16:31
*** dangers_away is now known as dangers16:32
jeblairfungi, sdague, ttx: morning16:32
ttxjeblair: hi!16:33
*** portante has quit IRC16:33
*** gyee has joined #openstack-infra16:33
*** ladquin has joined #openstack-infra16:34
*** emagana has joined #openstack-infra16:35
*** thuc_ has joined #openstack-infra16:35
anteayajeblair: morning16:38
anteayaI hope you are feeling a bit better today16:38
jeblairanteaya: not particularly, but thanks.16:39
*** thuc has quit IRC16:39
anteaya:(16:39
*** jasondotstar has quit IRC16:42
*** portante has joined #openstack-infra16:45
*** jasondotstar has joined #openstack-infra16:46
*** fifieldt has joined #openstack-infra16:47
*** pballand has joined #openstack-infra16:47
sdaguemorning16:47
sdagueok, running to lunch16:47
*** gyee_ has joined #openstack-infra16:49
*** mrmartin has joined #openstack-infra16:51
*** gyee has quit IRC16:53
*** portante has quit IRC16:53
*** rakhmerov has joined #openstack-infra16:55
pvomordred: you guys doing better on capacity today?16:55
*** dpyzhov has quit IRC16:58
mordredpvo: I'm actually just about to start work on increasing the pool size (We have to add more jenkins masters to be able to handle more slaves - it's a vicious cycle)16:59
anteayamordred: hello there16:59
mordredmornign anteaya16:59
anteayadon't let me interrupt you16:59
*** resker has joined #openstack-infra17:00
jeblairmordred: steps: write a puppet change; create self-signed certs and put them in hiera; then talk to me about the undocumented process of setting up the initial config17:00
anteayacacti is showing 5 jenkins masters, do we have more than 5 now?17:01
ttxzuul busy reshuffling right now, most jobs queued17:01
anteayattx did you cut neutron?17:01
ttxanteaya: I did17:01
anteayathanks17:01
ttxpage still needs a bit of cleanup17:01
*** mrmartin has quit IRC17:02
anteayawhat page?17:02
anteayasorry I feel I should know and I don't17:02
ttxicehouse-2 milestone page17:02
*** esker has quit IRC17:02
ttxbut can't work on it right now17:02
ttxotherwise looks good17:02
mordredjeblair: I'm excited about that17:03
*** eharney_ has joined #openstack-infra17:03
mordredjeblair: the undocumented process part17:03
mordredjeblair: what's our current thinking on jenkins master to slave ratio?17:03
jeblairmordred: 100/117:03
mordredok. so we need 2 more masters ish17:04
*** rnirmal has quit IRC17:04
mordredbumping IAD from 60 to 192 and DFW from 2 to 100 (I believe we leave headroom in nodepool in dfw because of static slaves, yeah?)17:04
mordredso that's potentially 230 new slaves - perhaps I should just do three jenkins masters17:05
openstackgerritNadya Privalova proposed a change to openstack/requirements: Fix happybase version  https://review.openstack.org/6843517:05
jeblairmordred: i think our total quota is oto 1000, right?17:05
*** eharney__ has joined #openstack-infra17:05
*** senk1 has joined #openstack-infra17:05
*** herndon has quit IRC17:05
anteayattx okay thanks17:05
*** pballand has quit IRC17:05
*** jasondotstar has quit IRC17:06
*** eharney has quit IRC17:06
*** portante has joined #openstack-infra17:07
*** pballand has joined #openstack-infra17:07
*** jasondotstar has joined #openstack-infra17:07
*** eharney__ has quit IRC17:07
*** portante has quit IRC17:08
*** eharney_ has quit IRC17:08
mordredjeblair: I'm not sure17:09
mordredjeblair: also, do we have a "create a self-signed cert" script or a particular way we like to do that?17:09
openstackgerritMonty Taylor proposed a change to openstack-infra/config: Increase quota limits in RAX IAD and DFW  https://review.openstack.org/6843917:10
jeblairmordred: oh, no we're at 48 in hpcloud az1 and 3?17:10
*** SergeyLukjanov is now known as SergeyLukjanov_17:10
jeblairmordred: so that's 732 as the new total quota, so 3 masters would be good17:11
*** eharney has joined #openstack-infra17:11
*** branen has joined #openstack-infra17:11
Alex_GaynorTo what extent are more servers going to help? It was my impression that the biggest issue was frequent gate resets?17:12
anteayait can allow for a faster turn around on check tests17:13
jeblairmordred: root@ci-puppetmaster:~/certs might help; it's designed to create csr's but also creates self-signed certs17:13
mordredAlex_Gaynor: we're also seeing resource starvation because of the frequent resets of deep queue - so the check queue is starving17:13
Alex_Gaynormordred: ah, makes sense17:13
*** dizquierdo has joined #openstack-infra17:13
openstackgerritMonty Taylor proposed a change to openstack-infra/config: Add three new jenkins servers  https://review.openstack.org/6844217:13
Alex_Gaynormordred: is there anything I could be doing to help ya'll out with this?17:13
*** bauzas has quit IRC17:14
mordredAlex_Gaynor: I think the jenkins setup is on me - jeblair, anything Alex_Gaynor can do to help your end of things?17:14
*** eharney has quit IRC17:15
*** ociuhandu has joined #openstack-infra17:16
clarkbmorning, I am not actually here yet17:16
*** dpyzhov has joined #openstack-infra17:16
jeblairAlex_Gaynor: not that i can think of right now, thanks17:16
*** pblaho has quit IRC17:17
clarkbjeblair: re https://review.openstack.org/#/c/68219/7 do you want to address the comments while I am distracted and possibly rip out the extra toggles? I will get to it in about an hour and a half if not17:17
*** aarongr_away is now known as AaronGr17:17
*** senk1 has quit IRC17:17
*** senk has joined #openstack-infra17:17
*** rnirmal has joined #openstack-infra17:17
jeblairpvo: what are the rax api rate limits? i ran 'nova rate-limits' but get an empty response17:18
*** SergeyLukjanov_ is now known as SergeyLukjanov17:18
anteayamorning clarkb who isn't actually here17:18
clarkbjeblair: also, I realized that the way I set the window in the layout means the value will reset whenever the layout is reloaded which we probably don't want17:18
jeblairlifeless: hpcloud rate limits: http://paste.openstack.org/show/61688/17:19
clarkbjeblair: not sure if we care about tackling that in the first patch17:19
clarkbas reseting like that may be desireable as we use it initially17:19
pvojeblair: interesting. You should get something... let me check mine.17:19
pvojeblair: what region? Mine are listing from dfw.17:20
*** rakhmerov has quit IRC17:21
openstackgerritTom Fifield proposed a change to openstack-infra/config: Fix CLI args for welcome-message  https://review.openstack.org/6662317:22
fifieldtfungi, thanks for making the key :)17:23
jeblairpvo: all 3 of dfw/ord/iad17:23
jeblairpvo: let me try a new version of novaclient17:23
*** chandankumar_ has quit IRC17:24
*** eharney has joined #openstack-infra17:24
ArxCruzjeblair: hey, I would like to make a change in o-infra/config to make puppet.conf server configurable, today is hardcoded ci-openstack.openstack.org17:24
ArxCruzit will be necessary a lot of changes17:25
ArxCruzwhat's the best approach? several patches or one at once ?17:25
ArxCruzbasically all puppet recipes that uses openstack_project::base will have to be changed17:25
*** jp_at_hp has quit IRC17:26
jeblairpvo: still no joy with latest novaclient 2.15.0; all 3 regions for 'openstackjenkins' account17:27
anteayaArxCruz: right now jeblair and mordred are working on spinning up 3 new jenkinses17:27
*** pcrews has quit IRC17:27
ArxCruzanteaya: oh, okay I can wait :)17:27
anteayaso there might be a slight pause in service while that work takes place17:27
anteayaArxCruz: awesome, thank you17:27
jeblairArxCruz: probably several changes17:27
*** SergeyLukjanov is now known as SergeyLukjanov_17:28
clarkbjeblair: mordred: before yo uget too far along spinning up more jenkinses, I am not sure gaer will be able to handle when they all restart together17:29
clarkbwhen we moved zuul geard and the gearman plugin were having trouble when all 5 jenkinses attempted to register together17:29
clarkbwe had to start geard then start jenkinses individually before it would work17:29
jeblairclarkb: can you elaborate on 'trouble'?17:29
clarkbI submitted a bug to openstack-ci about it17:29
*** chandankumar_ has joined #openstack-infra17:29
clarkbjeblair: geard would throw an exception running status because some job key would be invalid17:30
*** marun has joined #openstack-infra17:30
clarkband that would kill geard17:30
jeblairclarkb: ah yeah, that's like a one line fix to geard17:30
*** yamahata has joined #openstack-infra17:31
clarkbok17:31
*** gothicmindfood has joined #openstack-infra17:32
*** browne has joined #openstack-infra17:32
*** pballand has quit IRC17:32
jasond`i keep seeing the gate tests under "openstack/heat 67971,2" go from SUCCESS to queued on the zuul status page.  is that supposed to happen?17:34
mordredjeblair, clarkb: you're saying I should launch them one at a time perhaps?17:34
jeblairmordred: no, we'll start them one at a time17:34
mordredjeblair: also, hiera changes are in17:34
mordredjeblair: which I believe means I should be able to land https://review.openstack.org/6844217:36
mordredand start launching nodes17:37
jeblairmordred: almost there17:37
anteayajasond`: yes, that means that something above it is causing a reset17:37
jasond`anteaya: oh ok, thanks17:37
anteayanp17:37
*** DennyZhang has quit IRC17:37
anteayamordred: you have a suggestion on how to expand your patch, do you want to do it, or shall I?17:38
mordredanteaya: aroo?17:39
anteayato include cacti and nodepool17:39
anteayaI can make the change if you need to focus17:39
mordredanteaya: I appreciate any and all help17:39
anteayaso I will make the change to your 68442 patch17:40
mordredanteaya: thank you17:40
*** jpich has quit IRC17:40
*** mancdaz is now known as mancdaz_away17:40
mordredoh wow. we have to parameterize that down in the manifests and not just in site.pp.17:40
*** blamar_ has joined #openstack-infra17:40
mordredanteaya: actually - perhaps we should add nodepool as a follow on patch17:41
jeblairdef17:41
anteayaokay just cacti.pp and jenkins-log-client.yaml added to 6842217:42
anteayacorrect?17:42
*** markwash has joined #openstack-infra17:43
mordredyeah17:43
*** blamar has quit IRC17:43
*** blamar_ is now known as blamar17:43
anteayak17:43
jeblairclarkb: comments/questions about tests in https://review.openstack.org/#/c/68219/17:43
fungimordred: if you're looking for another change to test the manage-projects script, i think https://review.openstack.org/61954 ready to go for this point (once the current scramble is settled, reading scrollback now to see what's broken)17:44
mordredfungi: okie. thanks17:44
*** luqas has quit IRC17:45
jeblairmordred: actually: https://review.openstack.org/#/c/65191/17:46
jeblairmordred: that lists all the places it's safe to add the new jenkins servers in the first change17:46
*** dizquierdo has quit IRC17:46
mordredanteaya: ^^ if you're updating it17:47
openstackgerritAnita Kuno proposed a change to openstack-infra/config: Add three new jenkins servers  https://review.openstack.org/6844217:47
anteayayeah, that is the one I followed, thanks to Roman for linking it17:48
*** jooools has quit IRC17:48
anteayajeblair: can I add the nodepool file then as well?17:48
anteayaclarkb ^17:48
jeblairanteaya: yes17:48
anteayaokay17:49
jeblairanteaya: but just the bits that were in that change17:49
mordredsigh. I seem to have left my power adapter at the office and my battery is now dying17:49
mordredI'm confused17:49
mordredI'll be back online in a little while17:49
*** markmcclain has joined #openstack-infra17:50
openstackgerritAnita Kuno proposed a change to openstack-infra/config: Add three new jenkins servers  https://review.openstack.org/6844217:50
*** rakhmerov has joined #openstack-infra17:51
anteayaI can't comment on my own patch17:51
anteayaplease folks make sure the syntax is consistent, even if you don't know puppet17:52
markwashI'm seeing the whole "externally hosted" problem again and again with pip install / tox, for packages psutils and pysendfile. seemingly related is a complete inability to install oslo.messaging into tox's venv. anybody got any tips for me?17:53
*** DennyZhang has joined #openstack-infra17:55
wenlockmarkwash: i wonder if this patch would help https://review.openstack.org/#/c/51425/17:55
*** melwitt has joined #openstack-infra17:56
markwashwenlock: hmm, sorry I'm actually seeing it locally rather than in the gate; it seems like that patch is more geared towards the gate? also maybe I'm in the wrong channel :-)17:56
*** hashar has joined #openstack-infra17:56
wenlockahhh :D17:57
*** mrodden has quit IRC17:58
markwashit seems like maybe if I could somehow constrain the version of pip that tox is using I could fix things, but googling did not lead me to any conclusions there17:59
*** fifieldt has quit IRC18:00
*** mriedem has quit IRC18:03
*** gyee_ has quit IRC18:04
*** vkozhukalov has joined #openstack-infra18:05
*** jasond` has quit IRC18:06
*** pballand has joined #openstack-infra18:06
*** CaptTofu has quit IRC18:10
*** praneshp has joined #openstack-infra18:11
*** mrodden has joined #openstack-infra18:11
*** otherwiseguy has quit IRC18:12
*** morganfainberg|z is now known as morganfainberg18:15
anteayaI have to jet for a meeting18:15
anteayago mordred18:15
anteayaback later18:15
*** derekh has quit IRC18:16
*** gokrokve has joined #openstack-infra18:17
*** CaptTofu has joined #openstack-infra18:17
*** gsamfira has quit IRC18:18
*** marun has quit IRC18:18
*** SergeyLukjanov_ is now known as SergeyLukjanov18:21
*** hashar has quit IRC18:21
*** SumitNaiksatam has joined #openstack-infra18:22
*** hashar has joined #openstack-infra18:23
pvojeblair: re: rate limits... ok. let me see what I can dig up.18:23
jeblairpvo: cool, thx, let me know if you need more info18:23
*** dpyzhov has quit IRC18:23
*** rakhmerov has quit IRC18:23
SumitNaiksatamhi...does any one here know how to reset the #openstack-meeting meet bot?18:23
SumitNaiksatami am trying to to end a meeting, but the meet bot is not picking up18:23
jeblairSumitNaiksatam: just try to start a meeting18:24
SumitNaiksatamjeblair: ah ok18:24
SumitNaiksatamjeblair: didn't work18:25
*** rakhmerov has joined #openstack-infra18:26
jeblairhrm i'll look18:26
SumitNaiksatamjeblair: it wouldn't let me start the new meeting, but its not letting me end either18:26
jeblairSumitNaiksatam: oh i see, your nick changed.18:27
SumitNaiksatamjeblair: yeah, i just noticed that as well18:27
jeblairSumitNaiksatam: try changing it back to 'SumitNaiksatam_' and then #endmeeting18:27
SumitNaiksatamok18:27
*** senk has quit IRC18:27
*** fbo is now known as fbo_away18:28
SumitNaiksatamjeblair: done, i think i got dc'ed and that created the problem, did not realize that the nick changed18:29
jeblairSumitNaiksatam: cool, glad that worked; if it didn't i or another infra core could have used super-user privs to end it, but it's good you could do it yourself18:29
SumitNaiksatamjeblair: thanks, was able to end it18:30
*** pcrews has joined #openstack-infra18:30
*** gothicmindfood has quit IRC18:31
*** portante has joined #openstack-infra18:32
*** gyee has joined #openstack-infra18:32
*** harlowja_away is now known as harlowja18:32
zarojeblair: i'm looking for ideas on how to  test gerritbot with review-dev.o.o18:33
*** krotscheck has joined #openstack-infra18:33
zarojeblair: would i need to run my own gerritbot that sends updates to my own channels?18:35
portantesdague: the swift test failure is very odd, still trying to figure out what caused it, have not been able to reproduce it locally yet18:35
jeblairzaro: yes, it shouldn't be too hard; i usually just have it join a test irc channel that only i have joined18:37
krotscheckHey everyone. We've got a bit of a patch backlog on storyboard, does anyone have time amidst the crazyness to look at some of these? 67520, 67729, 67731, 6501718:38
krotscheck(They're all infra patches18:38
krotscheck(I mean config)18:38
lifelessjeblair: ok, so now to see if we use PUT at all; if we do we need to limit to 1/6 seconds otherwise 4/6 seconds18:39
*** ruhe is now known as _ruhe18:39
*** praneshp has quit IRC18:40
zarokrotscheck: i can take a look, but only able to give +1.18:40
krotscheckzaro: More Eyeballs == better18:41
jeblairlifeless: actual rax ratelimits for our account are unknown; pvo is looking into it18:41
*** dpyzhov has joined #openstack-infra18:41
lifelessjeblair: ack, thanks18:42
jeblairlifeless: also, i would like to revert the logging change because i think it is verbose and the default rate change because i think 1/sec is a good default and config is the place for further tuning18:42
krotscheckzaro: Two of those can be summed up with "Hey let's just have tox run our build for us"18:42
mgagnejeblair: I want to approve this change but XML changed: https://review.openstack.org/#/c/64610/18:43
*** senk1 has joined #openstack-infra18:43
lifelessjeblair: the rate change - sure, but the logging change - I would really like to get diagnostics there at some stage; can you suggest a better mechanism?18:43
*** praneshp has joined #openstack-infra18:43
jeblairlifeless: well, by design we should pretty much always be hitting the rate limit (unless the api call itself takes longer than the interval)18:47
mordredjeblair: back18:47
jeblairlifeless: so i expect it should be emitting a constant stream of log lines, one per provider per interval18:47
jeblairlifeless: and i'm not sure what it tells you, other than (a) whether the program runs faster than the permitted rate, and (b) whether the api calls themselves take longer than the interval18:48
*** mriedem has joined #openstack-infra18:48
jeblairmgagne: that change looks harmless; zaro do you agree?18:50
zaromgagne, jeblair: yes, i agree.18:50
mgagnejeblair: side effect is that jenkins servers will be "hammered" by update requests18:50
jeblairmgagne: that should be okay.  as long as the jobs themselves don't break18:51
mgagnejeblair: although no changes should be made18:51
mgagnejeblair: ok, will approve18:52
jeblairmgagne: cool, thx18:52
*** jcoufal has joined #openstack-infra18:52
* mgagne puts on his cowboy hat18:52
jeblairyeeehaw!18:52
*** marun has joined #openstack-infra18:53
*** burt1 has quit IRC18:54
*** asadoughi has joined #openstack-infra18:54
*** julim has quit IRC18:54
openstackgerritMichael Krotscheck proposed a change to openstack-infra/storyboard: Add tests for Alembic migrations  https://review.openstack.org/6641418:55
*** julim has joined #openstack-infra18:57
*** sarob has joined #openstack-infra18:57
*** DinaBelova is now known as DinaBelova_18:57
*** sarob has quit IRC18:58
openstackgerritA change was merged to openstack-infra/config: Add three new jenkins servers  https://review.openstack.org/6844218:58
*** nati_ueno has joined #openstack-infra19:00
mordredjeblair: ok. I'm going to start launching jenkins servers19:00
*** markmcclain has quit IRC19:01
*** hashar has quit IRC19:01
*** markmcclain has joined #openstack-infra19:01
*** _ruhe is now known as ruhe19:01
*** rakhmerov has quit IRC19:03
*** markmcclain has quit IRC19:03
*** sarob has joined #openstack-infra19:03
*** markmcclain has joined #openstack-infra19:03
openstackgerritMatt Ray proposed a change to openstack-infra/config: Chef style testing enablement and minor speed cleanup starting with checks  https://review.openstack.org/6796419:05
*** browne has left #openstack-infra19:05
*** jroovers has joined #openstack-infra19:06
*** rakhmerov has joined #openstack-infra19:07
zaroclarkb, sdague : disabling gerrit drafts is a pending change.. https://gerrit-review.googlesource.com/#/c/5394719:08
clarkbzaro: I saw that :( also you don't want to prevent anonymous users from pushing drafts, they can't push drafts anyways19:09
clarkbzaro: you want to prevent registered users19:09
*** rakhmerov has quit IRC19:10
clarkbjeblair: ok I am around properly now19:11
clarkbjeblair: re double releases in the zuul tset, I think I was operating under the assumption that only the first job matching the regex would be released, I see that isn't true. I will fix that19:12
*** elasticio has joined #openstack-infra19:14
*** gsamfira has joined #openstack-infra19:14
jeblairclarkb: cool, thought that might be the case.  the *-merge releases are doubled in some cases because of dependent changes, where the merge for change B won't start until the merge for change A finishes (so there's a settle between them to allow that to happen)19:15
ttxfungi, jeblair: would be nice to get https://review.openstack.org/#/c/68135/ in icehouse-2 as well19:15
ttxfungi, jeblair: so if there is a way to bump it at the top at next reset... would be nice19:15
ttxfungi, jeblair: hmm, unless milestone-proposed changes go in a separate queue ? in which case we could propose the change there19:17
ttxbut IIRC that's not the case19:17
jeblairttx: is it a gate-fixing bug or a security fix?19:17
jeblairttx: no, not the case19:17
ttxjeblair: it's just a milestone-critical fix19:17
zaroclarkb: anonynmous covers all users, so seems ok to me.19:17
*** vipul is now known as vipul-away19:17
clarkbzaro: it covers logged in users too? weird19:18
zaroclarkb: yes, i believe so.19:18
*** max_lobur is now known as max_lobur_afk19:19
*** yassine has quit IRC19:19
jeblairttx: to prevent our becoming actual human gatekeepers, we adopted a policy of only promiting gate-fixing bugs or security fixes; do you feel strongly enough about this change to ask us to consider widening that policy?19:19
ttxjeblair: we made an exception to that rule already for the licensing issue in horizon (currently at top of queue)19:20
jeblairalas we are human19:20
ttxthat said, that should have been tested19:20
ttxso we can keep it in regular queue19:21
openstackgerritA change was merged to openstack-infra/jenkins-job-builder: Fix multibyte character problem  https://review.openstack.org/6461019:21
ttxjeblair: I would add legal issues to that policy above19:21
fungii'll apologize for promoting the licensing patch. it was legal19:21
fungiheh19:21
fungiit felt like a justifiable grey area19:22
jeblairi'll buy legal.  :)19:22
clarkbI think legal issues belong in that list19:22
jeblairdone.  gate-fixing bugs, security fixes, legal issues.  :)19:22
fungisince technically horizon was misrepresenting licensing of software in the published repository with out it19:22
*** vipul-away is now known as vipul19:23
fungiyay, we're back to being inhuman again then19:23
openstackgerritMichael Krotscheck proposed a change to openstack-infra/storyboard-webclient: Remove mock API interfaces from storyboard.  https://review.openstack.org/6846419:27
openstackgerritA change was merged to openstack-infra/storyboard: Add tests for Alembic migrations  https://review.openstack.org/6641419:27
fungispeaking of the horizon change... it looks like jenkins failed to run a couple of the jobs on it and zuul re-queued them... i don't suppose we have any easy way to get that change moving again until the next gate reset further down causes a bunch of check jobs to get new nodes before zuul will circle back around to getting nodes on those two jobs still lacking them?19:27
clarkbfungi: nope, we have no way of modifying the geard queues iirc19:28
jeblairtis true19:28
fungifigured19:28
*** johnthetubaguy has quit IRC19:28
fungijust wanted to be 100% sure before i told ttx to give up and eat lunch19:28
jeblairwhat we should do is have zuul re-enqueue those jobs with a high priority, but that is sadly non-trivial19:28
jeblairin the mean time, remind yourself (and ttx) to be happy that it didn't already kick it out because jenkins has in fact already failed once!19:29
*** sHellUx has quit IRC19:29
*** pballand has quit IRC19:30
fungioh, already mentioned that, yes19:30
fungicold comfort is better than none at all ;)19:30
ttxI feel so happy19:30
clarkbjeblair: fungi: I think we should decide on two things about my zuul change. First should I remove the _type and _factor flags? and second is it a problem that in the inital patch any config reload of the layout.yaml will resset the window value?19:31
ttxI just feel like the whole queue is going to wait for that now19:31
jesusaurusclarkb: sdague: reading the gate update email got me thinking about the (rare) need for ninja merges. whats the feasibility of modifying zuul's scheduler to use a priority queue: an "everyone" priority, and a "ninja" priority? that would allow certain changesets to be put at the front of the queue when zuul is recalculating the gate after a failure. im just not sure where/how to set the priority19:31
fungiclarkb: i think those are slight overengineering, but i can also see us wanting to experiment with them without needing a zuul restart19:32
fungiso i'm in favor of keeping them19:32
clarkbjesusaurus: jeblair hinted at that not too long ago when fungi asked about promoting specific changes. it is non trivial19:32
jeblairttx: fungi: you could re-promote it.  it would start everything over again but may be faster.19:32
clarkbjesusaurus: we do have gearman priority, but right now can only set that on a pipeline level not a change level19:33
ttxjeblair: yeah, I feel like the whole queue will be blocked for longer if we don't19:33
openstackgerritMatthew Treinish proposed a change to openstack-infra/elastic-recheck: Add multi-project irc support to the bot  https://review.openstack.org/6754019:33
fungijeblair: well, if we repromote it, everything currently in the check queue will still get all remaining nodes, and we're likely to have a reset behind that change fairly soon anyway (statistically speaking)19:34
jeblairfungi: (and as long as you're doing it you could probably accidentally promote the other m-p change behind it)19:34
fungithat's an option19:34
jeblairfungi: okay, i've not been paying enough attention to guage the odds on that19:34
jeblairfungi: your call :)19:34
*** sHellUx has joined #openstack-infra19:35
hub_capjeblair: bet on black19:35
*** beagles is now known as beagles_brb19:35
*** sHellUx has quit IRC19:35
*** pballand has joined #openstack-infra19:35
jeblairhub_cap: there are 3 blacks but they're too far down.  bad bet i think.19:35
sdaguejesusaurus: I don't know, I do think it would be interesting19:36
mgagnezaro: if you aren't too busy, would you mind rebasing your JJB changes against master?19:36
sdaguejesusaurus: though in this case a ninja merge is skip the gate entirely, and just merge directly19:36
*** marun has quit IRC19:36
jesusaurusim not familiar enough with zuuls scheduler to know how many moving pieces would be affected19:36
*** praneshp has quit IRC19:37
jeblairjesusaurus: having it wait for a reset was sufficiently hard that i punted on that when writing the manual promote script19:38
openstackgerritMichael Krotscheck proposed a change to openstack-infra/storyboard: Introducing basic REST API  https://review.openstack.org/6311819:38
*** sarob has quit IRC19:38
*** sarob has joined #openstack-infra19:38
jeblairclarkb: i'm fine keeping the extra knobs and the proviso that the window is reset on reload for now19:39
openstackgerritMatthew Treinish proposed a change to openstack-infra/elastic-recheck: Add multi-project irc support to the bot  https://review.openstack.org/6754019:39
*** sHellUx has joined #openstack-infra19:39
*** jcoufal has quit IRC19:41
*** thuc has joined #openstack-infra19:41
clarkbjeblair: ok, new patchset should arrive shortly19:41
*** hashar has joined #openstack-infra19:42
*** sarob has quit IRC19:43
*** sHellUx has joined #openstack-infra19:44
mordredjeblair: dammit. I made jenkins05 just fine. then I was still in rax-nova and not ci-rax-nova when I made 06. making 07 in the right place, will go back and fix 06 in a second19:44
*** thuc_ has quit IRC19:44
jeblairmordred: ok.  i use screen and one window per host and do them all at once when i'm doing multiple hosts19:45
openstackgerritClark Boylan proposed a change to openstack-infra/zuul: Add rate limiting to dependent pipeline queues  https://review.openstack.org/6821919:45
clarkbhow does that look?19:45
*** vkozhukalov has quit IRC19:45
*** thuc has quit IRC19:46
*** otherwiseguy has joined #openstack-infra19:46
mordredjeblair: I was actually doing one set so I could be methodical about it :)19:47
*** sHellUx has joined #openstack-infra19:47
mordredjeblair: what's the difference between rdns create and record-create/19:49
mordred?19:49
jeblairmordred: reverse and forward dns19:49
mordredoh. duh19:49
mordrednevermind19:49
mordredyup19:49
mordredsometimes asking the question is all you need to do19:50
*** gsamfira has quit IRC19:51
*** markmcclain has quit IRC19:53
*** rfolco has quit IRC19:53
*** DennyZha` has joined #openstack-infra19:54
*** sHellUx_ has joined #openstack-infra19:55
*** _david_ has joined #openstack-infra19:55
*** DennyZhang has quit IRC19:56
*** hogepodge has joined #openstack-infra19:57
*** NikitaKonovalov_ is now known as NikitaKonovalov19:57
*** sHellUx has quit IRC19:57
jeblairclarkb: i approved your change since there were only trivial changes since the last patchset.  if someone objects, i assume there's a little bit of time still left before it merges.  :)19:59
openstackgerritA change was merged to openstack-infra/storyboard: Introducing basic REST API  https://review.openstack.org/6311819:59
jeblairwhen it merges, i think we shut down and deploy19:59
anteayaback19:59
mordredjeblair: all three new jenkins servers created19:59
clarkbjeblair: ok, I will plan to be around for that19:59
mordredjeblair: I would like to read your literature on your super secret setup sauce20:00
*** DennyZha` has quit IRC20:00
mordredjeblair: at your convenience20:00
*** sHellUx_ has quit IRC20:00
jeblairclarkb: maybe lunch now then?  i'm getting ready to20:00
*** ladquin is now known as ladquin_afk20:00
clarkbjeblair: sure20:00
jeblairmordred: left out a step; need to restart iptables on the hosts that list the new jenkins servers in their firewalls20:00
jeblairmordred: (since they weren't in dns until now)20:01
mordredjeblair: ok. I'll go do that, then ping you back20:01
openstackgerritMichael Krotscheck proposed a change to openstack-infra/storyboard: Load projects from yaml file  https://review.openstack.org/6628020:02
*** obondarev_ has quit IRC20:02
_david_jeblair, mordred clarkb Can you consider giving a talk on Gerrit UC? https://groups.google.com/forum/#!topic/repo-discuss/5T0E-GG3Pag20:02
*** smurugesan has joined #openstack-infra20:03
*** DinaBelova_ is now known as DinaBelova20:04
*** MarkAtwood has joined #openstack-infra20:05
*** yamahata has quit IRC20:05
jeblairmordred: when you're ready see jenkins04:~corvus/README20:05
jeblair_david_: i bet one of us could manage that20:06
openstackgerritJames Slagle proposed a change to openstack-infra/release-tools: Added ignore to additional egg-info files  https://review.openstack.org/6847120:07
_david_jeblair, That would be really really, great. We are biggest (public) Gerrit installation site and we should make our voice in the community20:07
*** david-lyle_ has joined #openstack-infra20:08
_david_jeblair, And you and zaro gave a talk on JUC in the past, though, why not to explain Gerrit maintainer, that having 800 active contributors put some special requirements,20:08
anteayalike 8 jenkinses?20:09
jeblair_david_: yeah, i think we missed the cfp deadline in previous years20:09
jeblairso it's very good of you to remind us20:09
_david_yes, we should do that, we want to make upstream contribution process a bit easier, what ? ;-)20:10
*** derekh has joined #openstack-infra20:11
openstackgerritEric Guo proposed a change to openstack/requirements: Sort global-requirements  https://review.openstack.org/6494320:11
_david_I am giving a talk about decentralized CI infrastructure with LibreOffice-Gerrit-buildbot-plugin, so another reason to attend to meet you guys ;-)20:12
*** senk1 has quit IRC20:15
*** lcestari has quit IRC20:17
*** vipul is now known as vipul-away20:19
*** vipul-away is now known as vipul20:19
lifelessfungi: so, about getting ci-overcloud enabled ;)20:20
*** burt1 has joined #openstack-infra20:20
*** fbo_away is now known as fbo20:21
*** afazekas_ has quit IRC20:22
*** beagles_brb is now known as beagles20:24
*** CaptTofu has quit IRC20:26
*** markmcclain has joined #openstack-infra20:26
*** dizquierdo has joined #openstack-infra20:27
*** SergeyLukjanov is now known as SergeyLukjanov_20:28
*** elasticio has quit IRC20:28
*** sarob has joined #openstack-infra20:30
*** sarob has quit IRC20:31
*** markmc has quit IRC20:32
*** praneshp has joined #openstack-infra20:33
*** CaptTofu has joined #openstack-infra20:33
openstackgerritJoe Gordon proposed a change to openstack-infra/elastic-recheck: Add per job classification rate to uncategorized.html  https://review.openstack.org/6847820:34
*** vipul is now known as vipul-away20:34
*** mrda_away is now known as mrda20:35
sdagueoh, clarkb https://review.openstack.org/#/c/67591/20:35
sdagueto get the uncategorized hit list out there20:36
jog0sdague: ^^ shows per job stats20:36
jog0sdague: 99% for gate-tempest-dsvm-full20:36
sdaguejog0: yep, I just saw your new patch, haven't looked at it deeply20:36
jog0no problem20:36
jog0I am just impressed with some of the stats20:37
*** senk1 has joined #openstack-infra20:37
openstackgerritA change was merged to openstack-infra/zuul: Add rate limiting to dependent pipeline queues  https://review.openstack.org/6821920:37
jog0the jenkins interrupt a job and mark as failure bug is messing up those numbers20:37
*** DinaBelova is now known as DinaBelova_20:37
sdaguejog0: yeh, we need to get those not to be marked as fails in ES20:38
jog0sdague: yup20:38
jog0and grenade jobs need more then console.html20:38
sdaguejog0: yeh, can you work up the ES patch for that one?20:38
sdagueI know you added some other ES stuff20:38
anteayayay for 6821920:39
sdagueyep20:39
jog0sdague: not sure what best way to get the logs/new/screen-n into ES20:39
clarkbsdague: looking20:39
jog0I think we want it to be mapped to logs/screen-n*20:39
lyxusI already posted this on -dev but might be for infra actually. For tempest, I was wondering 1) Is tempest supposed to be passing 100% of the test from the trunk 2) Does the standard use the openvswitch plugin20:39
jog0so the queries all work without any changes20:39
*** senk1 has quit IRC20:40
*** senk has joined #openstack-infra20:40
sdaguejog0: we might need another piece of metadata then20:40
clarkblyxus: 1) yes, 2) no the 'standard' if there is one is nova network20:40
jog0sdague: such as?20:40
sdagueso we can tell new vs. old n-cpu20:40
sdaguebecause it might be important20:41
sdagueotherwise it will look like all of it is the same file to ES20:41
clarkbsdague: jog0: I think the filename value should have the logs/new/screen-n value20:41
clarkbthen you can glob the /new/ out when you do serach20:41
sdagueclarkb: so the problem with that is existing matches wont20:41
clarkbsdague: right you would need a glob20:41
sdagueclarkb: so what if we just made grenade_branch another piece of metadata20:42
sdagueso the filename stayed logs/screen-n20:42
clarkbsdague: we can do that too, it just complicates the mapping from file to indexed data20:42
sdaguebut we'd be able to facet20:42
sdagueclarkb: it does, but I think it simplifies use dramatically20:42
clarkbk20:42
*** NikitaKonovalov is now known as NikitaKonovalov_20:42
clarkbor, and this might be crazy20:43
clarkbwhat if we combine the logs in the jobs and just have a single log file20:43
clarkbthen there isn't old vs new just two chunks of data in a file20:43
sdaguehmmm...20:43
jeblairclarkb: ready for surgery?20:43
clarkbjeblair: give me a couple minutes20:44
sdaguethat might work20:44
clarkbsdague: I am not entirely sold on that idea yet (it just occured to me)20:44
sdagueit would at least be a start20:44
jeblairclarkb: np, back to my hacking hole20:44
sdagueif we decided that we hated it later we could change the indexer20:44
*** ociuhandu has quit IRC20:44
sdaguejog0: thoughts?20:44
lyxusclarkb, I want to test the impact of my plugin. So i should just git clone devstack and run tempest and it should be 100%20:45
*** rcleere has quit IRC20:46
jog0sdague: I like the idea of single log file20:47
*** NikitaKonovalov_ is now known as NikitaKonovalov20:47
clarkblyxus: yes tempest is expected to pass at least the tests marked gate or whateve rthe tag is20:47
jog0it wouldn't be to hard to figure out which is old nad new too20:48
jog0I think20:48
anteayamikal please don't recheck any neutron patches20:48
anteayathey won't pass check until isolated jobs are fixed20:48
clarkbjeblair: ok, I have caffeine, ping me when ready20:48
anteayaI am posting comments to neutron patches in check informing people to stop rechecking until isolated jobs are fixed20:48
anteayaand inviting them to help fix the issue20:49
jeblairclarkb: hi20:49
jog0yeah everytime a linenumber is printed we see the path (new vs old)20:49
jog0although right now we mainly care about the new logs20:50
jog0being thats what we run tempest against20:50
clarkbjeblair: ohai20:50
*** sarob has joined #openstack-infra20:50
jeblairzuul is updated.  i think it should just be a matter of stopping and starting.  probably a one-person job; i'll do it20:51
clarkbjeblair: ok20:51
*** vipul-away is now known as vipul20:51
clarkbI will tail the log and watch the window sie20:51
clarkb*window size20:51
*** SergeyLukjanov_ is now known as SergeyLukjanov20:51
jeblair#status alert Zuul is about to restart for an upgrade; changes will be re-enqueued20:51
openstackstatusNOTICE: Zuul is about to restart for an upgrade; changes will be re-enqueued20:51
*** ChanServ changes topic to "Zuul is about to restart for an upgrade; changes will be re-enqueued"20:51
jeblairstarting zuul20:54
*** burt1 has quit IRC20:55
jeblairclarkb: what's the default window behavior?20:55
*** sarob has quit IRC20:55
clarkbjeblair: 20 incrementing up exponential down to 3 iirc20:56
*** NikitaKonovalov is now known as NikitaKonovalov_20:57
clarkbyup20:57
clarkbin scheduler.py20:57
mtreinishclarkb: actually we don't have a tag for the gate tests it just runs the api, scenario, cli, and thirdparty test dirs20:57
*** DennyZhang has joined #openstack-infra20:59
*** sarob has joined #openstack-infra20:59
jeblairclarkb: i'm deleting all nodepool nodes that were marked used during the downtime21:00
clarkbok21:00
*** CaptTofu has quit IRC21:01
jeblairclarkb: fwiw it looks like #21 is not launching jobs21:01
clarkbcool :)21:01
jeblairand check jobs enqueued ahead of it seem to be21:02
*** SergeyLukjanov is now known as SergeyLukjanov_21:02
*** yolanda has quit IRC21:02
sdagueoh, right, enqueue times all reset on zuul restart21:03
clarkbsdague: ssshh! :)21:04
sdague:P21:04
anteayawhen does the gate queue limiter go into effect?21:04
*** smarcet has left #openstack-infra21:05
clarkbsdague: your puppet change lgtm for the es stuff21:05
clarkbanteaya: right now21:05
anteayaI'm seeing 106 in the gate queue21:05
sdagueclarkb: great21:05
clarkbanteaya: note things still queue up and show in status, but won't have jobs started for things outside the window21:05
jeblairanteaya: 67641,2 is the first change outside the window21:05
* anteaya looks again21:05
sdaguethe sooner we can get it in, the better, then we can farm out categorizing bugs easier21:06
jeblair(a ui indication of the window is a possible future enhancement)21:06
*** DennyZhang has quit IRC21:06
anteayaoh okay, I will wait until jobs start21:06
*** DennyZhang has joined #openstack-infra21:07
jeblairanteaya: wait for what?21:08
anteayaso right now, other than you telling me, I have no way of knowing which changes are inside the window and which aren't21:08
*** DennyZhang has quit IRC21:08
anteayasince some changes inside the window don't have jobs started21:08
anteayawaiting for jobs to start21:08
*** DennyZhang has joined #openstack-infra21:08
jeblairanteaya: all changes inside the window are running jobs21:08
*** hashar is now known as hasharMeeting21:09
anteayaokay, I have expanded them to see that21:09
jeblairanteaya: the first 20 changes are all running some number of jobs.  changes 21-106 are running no jobs21:09
*** senk has quit IRC21:09
anteayayes, expanding them shows me that21:10
anteayathanks21:10
anteayawell done21:10
*** senk has joined #openstack-infra21:10
jeblair#status ok21:11
*** ChanServ changes topic to "Discussion of OpenStack Project Infrastructure | Docs http://ci.openstack.org/ | Bugs https://launchpad.net/openstack-ci | Code https://git.openstack.org/cgit/openstack-infra/"21:11
*** senk1 has joined #openstack-infra21:13
*** mrmartin has joined #openstack-infra21:13
jeblairclarkb: check queue seems to be proceeding well past 2021:14
clarkbjeblair: perfect21:14
*** senk has quit IRC21:15
*** kraman has quit IRC21:16
*** CaptTofu has joined #openstack-infra21:19
*** dprince has quit IRC21:19
jeblairttx: your license change is 26 min out21:20
*** jooools has joined #openstack-infra21:21
*** kraman has joined #openstack-infra21:21
*** whoops has joined #openstack-infra21:22
*** thuc has joined #openstack-infra21:23
*** ruhe is now known as _ruhe21:23
*** thuc_ has joined #openstack-infra21:23
*** thuc has quit IRC21:26
portanteclarkb, sdague: do you guys know how this patch got reentered into the gate: https://review.openstack.org/6698621:27
portante?21:27
portanteI need to file a bug for the failure mode so we can track it21:27
portanteit is really weird21:27
*** jhesketh_ has joined #openstack-infra21:28
*** enqae has joined #openstack-infra21:28
*** enqae has quit IRC21:28
anteayait was possibly reenqueued on the zuul restart21:29
*** jroovers has quit IRC21:30
portanteanteaya: okay, thanks21:30
clarkbportante: yeah probably happeend when jeblair reenqueued things after the zuul restart21:33
clarkbwe just got a gate reset21:33
clarkbwoot and it was cheap. The js renders the subway graph oddly but I can live with that for now21:34
anteayayay for a cheap gate reset21:34
*** DennyZhang has quit IRC21:34
portantedid the ratelimit the gate jobs land yet?21:35
*** jasondotstar has quit IRC21:35
*** DennyZhang has joined #openstack-infra21:35
anteayaportante: yes21:35
lifelesssdague: so unit tests are dependent on other projects a lot of the time - e.g. client libraries21:35
portantenice21:35
lifelesssdague: I'm worried your slimmed down gate proposal will let lots of needle-threads through21:35
anteayaif you expand by default in the zuul status page, the gate patches in the limited window will have jobs running and those outside will not21:36
anteayaabout the first 20 patches right now21:36
anteayaI think the algorithm is flexible21:36
*** svarnau has joined #openstack-infra21:37
jheskethMorning21:37
anteayamorning jhesketh21:37
anteayaI haven't seen mattoliverau lately21:37
anteayadid we scare him away?21:37
jheskethHe's busy moving homes21:37
anteayaah21:37
jheskethI'm sure he'll be back in action soon :-)21:38
anteayathat's okay then21:38
anteayasure, from a happy new home21:38
jheskethyes, but also terrible internet ;-)21:38
anteayanoooo21:38
anteayaclose to a coffee shop with great internet?21:38
jheskethI think he's quite central so I'd be surprised if not21:38
anteayacool21:39
lifelessfungi: ping ?21:39
jheskethanything I can do to help you guys out?21:39
fungiokay, so refreshing myself on the current state of what i missed whilst abandoning you all morning... we have several additional jenkins masters leveraging the additional rackspace quota, and clarkb's dynamic zuul throttle mechanism is in place now?21:39
clarkbanteaya: you can more clearly see the window break in the UI now because the js isn't rendering it properly :)21:39
clarkbfungi: I don't think the new jenkinses are fully up21:39
fungilifeless: yes?21:39
anteayaclarkb: awesome21:39
lifelessfungi: hi! so - I understand derekh chatted w/you about enabled ci-overcloud21:40
lifelesss/enabled/enabling/21:40
clarkbbut dynamic throttling is in, you can see the window break in the subway graphs bad rendering21:40
anteayalast I heard mordred and jeblair were still working on configuring those21:40
fungilifeless: is about to ask me whether we're in a good state to take nodepool offline and play with it to see if the stack of new changes will stuff21:40
fungier, will break stuff21:40
clarkbI am reasonably happy with were we are right now21:41
lifelessfungi: no, just to enable ci-overcloud21:41
anteayajhesketh: ummm, nothing right atm, but do celebrate the configuring of 3 new jenkinses for additional rax nodes21:41
lifelessfungi: which the previously landed changes support21:41
anteayaplus rate limiting on the gate queue21:41
anteayayay21:41
jheskethanteaya: another 3? Are we up to 8?21:42
lifelessfungi: I would suggest landing your handle-flavor-lookup-errors toot21:42
lifelesss/toot/too/21:42
anteayajhesketh: as soon as they are configured we will be up to 821:42
jheskethnice stuff :-)21:42
clarkbjhesketh: ya that is about 100 slaves per master21:42
anteayait is exciting, yes21:42
fungilifeless: right. i need to see if i got recommendations on my horrible flavor list try/except patch21:43
* fungi checks21:43
lifelessyou did from me21:43
fungiexcellent--thank you21:43
*** mrmartin has quit IRC21:44
*** DennyZhang has quit IRC21:45
*** DennyZhang has joined #openstack-infra21:46
mordredclarkb: I'm still working on new jenkinses21:47
*** _david_ has quit IRC21:47
fungilifeless: we should probably also include https://review.openstack.org/6768421:48
*** rakhmerov has joined #openstack-infra21:49
fungithat one's just an outright bugfix of copy-paste errors21:49
lifelessyes; if we're including config changes then https://review.openstack.org/#/c/67958/2 and https://review.openstack.org/#/c/67685/ too please (though we can work around those in tripleo-ci)21:49
*** sarob has quit IRC21:50
fungialso, we have a separate issue in nodepool which i haven't even tracked down yet... i think we may need a longer ssh timeout/retry for image building--we're often struggling to build new images in hpcloud, particularly in az221:50
*** sarob has joined #openstack-infra21:50
jeblairfungi: oh again?  we raised it twice :(21:50
openstackgerritJoshua Hesketh proposed a change to openstack-infra/zuul: Send swift upload instructions to workers  https://review.openstack.org/6829721:51
fungiwe seem to be able to build servers, but maybe image building has a separate timeout?21:52
fungimy brief look through the image.log suggested that it was throwing paramiko exceptions between nova boot step and puppeting21:52
mattoliverauanteaya: I've been moving into a new house :)21:53
*** DennyZhang has quit IRC21:53
*** DennyZhang has joined #openstack-infra21:53
*** DennyZhang has quit IRC21:54
mattoliverauanteaya: Still am really, got boxes everywhere. but back at work today.21:54
*** DennyZhang has joined #openstack-infra21:54
fungiif you look at nodepool image-list, the devstack-precise image for hpcloud-az2 is 547 hours old, for example21:55
*** sarob has quit IRC21:55
lifelesspleia2: were you bringing up a fedora image defn patch?21:55
jeblairclarkb: i think we may have some ui issues to work through.  :)21:56
clarkbjeblair: yup :) but it shows you where the window break is21:56
jeblairclarkb: why is 67641,2 out of line?21:56
jeblairclarkb: did the window shrink?21:56
clarkbjeblair: no it did not shrink that was the old one on the outside of the window21:57
jeblairclarkb: oh, ok.  so the ui weirdness is just extra weird.  :)21:57
clarkbya I believe so21:57
jeblairclarkb: it might be good for changes outside the window to be disconnected.21:58
jeblairclarkb: (and probably get a different color dot)21:58
clarkblist of things we need to add to this new zuul feature: reporting in status.json, rpc command to set values, then we can prevent layout.yaml updates from changing the window on the fly, UI updates21:58
clarkbjeblair: wfm, I was thinking a pair of two differently shaded backgrounds, but disconnecting the usbway and changing station type fits into the existing ui well21:59
zarokrotscheck: question about 67731.  were does the version info go in the packaging?22:00
mordredjeblair: hey! I think that something is weird, because home for jenkins is /home/jenkins, not /var/lib/jenkins22:02
mordredclarkb: ^^ you know anything about that?22:02
jeblairmordred: i think we found that something changed in the jenkins packaging...22:02
clarkboh jenkins22:02
jeblairlike we expected it to create a user/homedir or something and it didn't22:02
jeblairi can't remember the way we fixed it22:02
krotscheckzaro: Nowhere, yet. I suspect that'll vary based on project, since some will be packaged as python modules and others will be packaged as tarballs.22:03
jeblairdid we remove the user and let puppet re-create it?22:03
jeblairclarkb: fungi: ^ ?22:03
clarkbjeblair: mordred: I think puppet creates the user for us22:03
clarkbperhaps the package needs to require the user?22:03
fungijeblair: yes, i believe we did22:03
* mordred tries taht22:03
jeblairmordred: that also reminds me that we probably want to manually install the deb22:03
jeblairthe jenkins debs for the same version we use on other hosts22:03
*** derekh has quit IRC22:03
clarkbyes dpkg -i the version you want, also grab the scp plugin from jenkins0422:04
mordredclarkb: which version do I want?22:04
*** ArxCruz has quit IRC22:05
jeblairlifeless: i've heard back and i think we can assume a very high number for our rax limit for the moment (exact numbers forthcoming); maybe we should just set it to the same as hpcloud for now22:05
mordredfungi: nope. I deleted the user, re-ran puppet, still ended up with user in wrong place22:05
lifelessjeblair: so hpcloud is actually lower than the default rax limit22:05
mordredperhaps delte user then dpkg -i ?22:05
lifelessjeblair: now i'm off of phones I can put up a patch for that, sec22:05
fungimordred: worth a try22:06
jeblairlifeless: what's the rax default again if you have it offhand?22:06
lifelesshttp://docs.rackspace.com/loadbalancers/api/v1.0/clb-devguide/content/Rate_Limits-d1e821.html22:06
*** svarnau has quit IRC22:06
*** david-lyle has quit IRC22:06
*** wenlock has quit IRC22:06
*** svarnau has joined #openstack-infra22:06
lifelessjeblair:22:06
*** david-lyle has joined #openstack-infra22:06
lifelessDELETE /v1.0/* ^/1.0/.* 50/minute is the lowest figure22:07
*** wenlock has joined #openstack-infra22:07
lifelessor 5/6 of a second22:07
*** sarob has joined #openstack-infra22:07
clarkbmordred: of jenkins 1.54322:07
clarkbmordred: of scp plugin the scp.jpi on jenkins0422:08
lifelessjeblair: compare to22:08
lifeless| PUT    | /{suburi} | 10    | 10     | MINUTE | 2014-01-22T17:16:52Z |22:08
zarokrotscheck: i created a script that extracts git version info to use if you like.  it would be good to append the git sha to the tarball or put it in a version file or something.  here's the script.. http://git.openstack.org/cgit/openstack-infra/config/tree/modules/jenkins/files/slave_scripts/maven-properties.sh22:09
lifeless1/5th the rate22:09
fungiso i think we're hitting ssh timeouts on hpcloud image builds semi-often, but the stale hpcloud-az2.devstack-precise is something else entirely. the image.log files dating back to the start of the month show that it only tried to build it once in january (on the 9th)22:10
krotscheckzaro: Funny- clarkb JUST pointed me at that.22:10
fungiand died with a ruby timeout22:10
*** ArxCruz has joined #openstack-infra22:10
jeblairlifeless: do we put?22:11
zarokrotscheck: the file is inappropriately named.  i should change it..22:11
lifelessjeblair: I was going to try and figure that out22:11
jeblairlifeless: also, slightly different table: http://docs.rackspace.com/servers/api/v2/cs-devguide/content/Rate_Limits-d1e862.html22:11
*** DennyZhang has quit IRC22:11
lifelessjeblair: I haven't yet; but since we only have one metric22:11
jeblairlifeless: we should be able to assume 1000+/min get/post for rax22:12
lifelessok22:12
clarkbjeblair: fungi: window was bumped to 22 then cut to 11. I was reading code and I think it may still be running in a loop using the 22 slice. will have to see where the split is. But overall seems to be happy22:12
jeblairlifeless: (i don't necessarily want to abuse that, which is why i was suggesting matching hp, but if if that ends up being _really_ slow, i guess we'll make up something reasonable sounding.  :)22:13
fungiclarkb: excellent22:13
clarkbhrm doesn't seem to have discarded builds for things in the queue yet22:13
krotscheckzaro: Cool. As soon as I get this integration suite done I'll add a patch to inject the version into the build env22:13
lifelessjeblair: btw how would you feel about s/rate/interval/ - the code is interval based not rate based22:13
*** jooools has quit IRC22:13
lifelessthe unit is seconds, not actions/second22:13
lifelessso inverted22:13
jeblairlifeless: fine by me22:13
lifelessjeblair: I'll do that when we're not in super busy mode22:14
clarkbjeblair: can you look at the gate queue and see if that looks funny to you?22:14
jeblairlifeless: k thx22:14
jeblairclarkb: it looks funny.22:14
clarkbjeblair: the first two changes seem to have done the correct thing22:14
*** david-lyle_ has quit IRC22:15
anteayamattoliverau: I hope the move went well22:15
anteayamattoliverau: ah the boxes lifestyle, I know that one22:16
openstackgerritlifeless proposed a change to openstack-infra/config: Set appropriate rate-limit for RAX clouds.  https://review.openstack.org/6850922:16
anteayamattoliverau: just glad we haven't scared you off22:16
jeblairclarkb: what do you mean by the first 2 changes?22:16
openstackgerritlifeless proposed a change to openstack-infra/config: Set a ratelimit for tripleo-test-cloud.  https://review.openstack.org/6851022:17
lifelessjeblair: how do you feel about assuming we don't do puts and basing our HPCS rate on 40/minute ?22:17
zaro_david_, jeblair : i've submitted and am on the schedule for a talk at gerrit UC.  it's a general talk about gerrit and OS CI.  I wouldn't mind submitting another one for multi-master jenkins and let someone else do the general CI talk.22:17
mattoliverauanteaya: not yet :) It takes alot to scare me off!22:17
* anteaya makes a note to try harder22:17
anteaya:D22:17
mattoliveraulol22:18
*** sarob has quit IRC22:18
jeblairlifeless: pretty good since we're assuming 60/min now and it's mostly working.  i'm pretty sure if we were hitting a 10/min limit we'd fail completely.22:18
mattoliverauI'm going down stairs to grab a coffee, brb22:18
anteayamattoliverau: k22:18
clarkbjeblair: 66986 and 6778822:18
clarkbjeblair: reading the logs I don't think it did the cancel behind failing item properly, I am now looking at code22:19
openstackgerritlifeless proposed a change to openstack-infra/config: Set HP cloud rate limits.  https://review.openstack.org/6851222:19
jeblairclarkb: what's unexpected to you?22:20
zaromgagne: will rebase my jjb changes shortly.22:20
clarkbjeblair: 66258 should have had its jobs that ran removed22:20
jeblairclarkb: https://jenkins02.openstack.org/job/gate-python-heatclient-pep8/1585/22:21
clarkbjeblair: and 57245 should not be red22:21
sdaguelifeless: unit tests don't install client libraries from git22:21
jeblairclarkb: according to jenkins it's tested on the 2 changes ahead of it22:21
jeblairclarkb: possible you missed the reset -- that's running on a static precise node, so it swooped in and ran the jobs quick22:21
lifelesssdague: right, which leads to firedrills when client libraries release and break things (like https://bugs.launchpad.net/heat/+bug/1271367)22:21
jeblairclarkb: (it's not tested based on anything not currently in the queue, so it looks right to me)22:22
clarkbjeblair: ok, I must have missed a reset then22:22
jeblairclarkb: double check, but that seems to hold for the cinderclient and tempest jobs below too22:22
*** burt1 has joined #openstack-infra22:22
jeblair(and the tempest change is not tested with the red cinderclient chaneg)22:22
clarkbhow did the heat pythonclient stuff run before the swift jobs?22:23
sdaguelifeless: that's fine, but it's currently not a feature we have. And in the move back to check land, that would redline heat in check. We'd have to fix it, but the rest would still flow22:23
fungiand that cinderclient pypy failure looks odd. like the slave did something unexpected or something's screwed up the workspace on it22:23
jeblairclarkb: static precise node vs bare-precise22:23
clarkbjeblair: gotcha, ok I feel much better about what I am reading now thanks22:24
*** miqui has quit IRC22:24
lifelesssdague: where do you see tripleo-ci deployments living? gate or just check ?22:25
mgagnezaro: https://review.openstack.org/#/c/68152/2 This change (introducing Test Stability plugin support) piggybacks the junit plugin publisher. What do you think should be the policy in that regard? Should a plugin be allowed to be configured through an other plugin section or not?22:26
*** sarob has joined #openstack-infra22:29
sdaguelifeless: check22:30
sdagueI think, honestly, having never seen one, I don't know22:30
sdagueI think we figure it out over time22:30
*** dizquierdo has quit IRC22:30
zaromgagne: so if it's not allowed what would be the alternative?  create a seperate jjb target, something like junit_stability?22:30
lifelesssdague: this plan seems to massively increase thread-the-needle events to me22:31
sdaguelifeless: I agree22:31
lifelesssdague: *and*, we have a range of tests that are not safe to run in check22:31
sdaguelifeless: well that needs to be addressed then22:31
lifelesssdague: specifically anything running on baremetal needs to be vetted before running to avoid run-malicious-code attacks22:31
sdaguebecause we can't run a test for the first time in gate22:31
sdaguefull stop22:31
*** jergerber has joined #openstack-infra22:31
mgagnezaro: tbh, I don't know. It's the first time (I'm aware of) someone introduces this kind of change22:32
pleia2lifeless: sorry, got pulled into a call - yeah, planning on writing the fedora def today, need to grab lunch first though22:33
lifelesspleia2: I'll do it now22:33
zaromgagne: so what do you think?22:33
zaromgagne: i'm ok with it because i don't see a better alternative.22:33
*** alexpilotti has quit IRC22:34
mgagnezaro: me neither I guess. Can this option be enabled without the Junit plugin or is it a dependency?22:34
pleia2lifeless: so I was thinking, in the definition will you just have it spin up 0 to start? it will still need a restart to add the appropriate amount when we're ready22:34
*** alexpilotti has joined #openstack-infra22:34
openstackgerritlifeless proposed a change to openstack-infra/config: Add a fedora image definition for tripleo-cloud  https://review.openstack.org/6851522:36
lifelesspleia2: ^22:36
mgagnezaro: ok, I give in. Test stability history is shown as a sub-option of the junit publisher =)22:36
mattoliverauSo looks like alot has happened, honestly guys I take 1 day off and you change everything :P22:36
zaromgagne: i'm guessing that installing test stability will auto install junit plugin.  but i think junit is a core plugin anyway.22:37
anteayamattoliverau: welcome to our world22:37
mgagnezaro: alright22:37
*** sdake is now known as sdake-ooo22:37
lifelesspleia2: uploading a fedora image to glance now22:37
openstackgerritEli Klein proposed a change to openstack-infra/jenkins-job-builder: Add local-branch option  https://review.openstack.org/6536922:37
pleia2lifeless: so I think with that change it will load up 4 images total - 2 precies and 2 fedora22:38
lifelessyes22:38
pleia2will that actually work with fedora?22:38
lifelesspleia2: we'll find out22:39
lifelesspleia2: we don't need to restart nodepool to iterate further though22:39
pleia2ok :)22:39
lifelessfungi: https://review.openstack.org/68515 too please22:39
pleia2I haven't tried any of the prepare_node* scripts with fedora22:39
pleia2see, I was going to test before writing the patch! anyway, I can test after lunch, it's late22:40
lifelesspleia2: yeah, shoo :)22:40
lifelesspleia2: principle of separated concerns thoug22:40
pleia2hehe22:40
*** alexpilotti has quit IRC22:41
*** dangers is now known as dangers_away22:42
openstackgerritEli Klein proposed a change to openstack-infra/jenkins-job-builder: Added rbenv-env wrapper  https://review.openstack.org/6535222:43
clarkbok I have done more digging in the zuul logs and have most of my confidence back :)22:44
clarkbwe are just being starved by the check queue which isn't super horrible because it should clear that massive list out relatively quickly22:45
russellbclarkb: nice work on the rate limiting patch22:45
clarkbrussellb: thanks, I keep second guessing it, but it appears to be doing the correct thing22:46
lifelesssdague: so what we need is jobs that run only on +A, before the integrated gate jobs are queued, then ?22:46
*** resker has quit IRC22:46
lifelesssdague: or possibly a check job that runs on +2 ?22:46
sdaguelifeless: I think you could modify zuul to run a set of jobs on first +222:47
fungilifeless: while the design requires discussion, we could probably have a separate independent pipeline for +2 events (but it would tend to get rerun on each +2)22:47
fungiright, to only have it run on the first +2 would probably need a zuul patch22:48
lifelessso what we have is virt emulation that can run in regular check22:48
*** thomasem has quit IRC22:48
lifelessand we have baremetal that must run before landing (because it's the actual verification)22:48
*** ivar-lazzaro has joined #openstack-infra22:49
lifelessbut as sdague says we don't want to trigger pipeline stalls at least until we get rid of many more bugs22:49
sdaguelifeless: I'd say right now what you probably actually want to get going is a sufficiency check in experimental22:50
sdagueso check experimental runs different jobs if it has a +2 than if not22:50
lifelesssdague: right now we're working up the testing stack22:50
lifelesssdague: we're about to have experimental actually doing shit; then nonvoting check22:51
sdaguethat would let you actually see how a job would run in the gate, and protect yuo22:51
sdaguelifeless: right, but anyone can trigger check experimental22:51
sdagueso that doesn't solve your security problem22:51
lifelesssdague: so experimental must be virt only then22:51
sdaguelifeless: which doesn't solve running real tests in any experimental way22:51
lifelesssdague: we are a ways off of having the virt stuff bedded down, and we'll get a lot of reliabilty from just that22:52
sdaguelifeless: ok, so then don't overengineer the future :)22:52
lifelesssdague: but yeah, Ironic really wants real baremetal soon22:52
lifelesssdague: so I'm just getting my head around having the design rug pulled out22:52
sdagueyour road to having real baremetal as part of the equation is a +2 experimental class22:52
sdagueclarkb: yeh, nice work on the zuul bits22:53
lifelesspleia2: fedora boots two-nics ok, mtu is wrong, eth1 is down, of course.22:53
sdagueit might be nice to put the "runable" part of the queue into the json22:53
sdagueso we could highlight the set of jobs that are in the run set22:54
lifelesspleia2: but - image is there in the ci cloud, so you can play with it if you get the nodepool user creds22:54
lifelesssdague: in the sense that experimental is the onramp for any new testing endeavour?22:54
sdaguelifeless: correct22:54
sdagueat this point the default place for a new test job to go is in experimental22:55
sdaguefor lots of good reasons22:55
fungisdague: yeah, adjusting the ui (and needing extra bits in the json to support that) has already come up, so it's presumably in the works22:58
fungii'm assuming all the changes beyond the window will appear visually disconnected, and probably with a separate colored dot22:59
lifelesshttps://bugs.launchpad.net/zuul/+bug/127176623:02
*** dcramer__ has quit IRC23:04
*** CaptTofu has quit IRC23:04
*** dims has quit IRC23:04
*** oubiwann_ has quit IRC23:04
*** senk1 has quit IRC23:06
*** sandywalsh has quit IRC23:06
*** alexpilotti has joined #openstack-infra23:08
openstackgerritJames E. Blair proposed a change to openstack-infra/zuul: Add require-approval to Gerrit trigger  https://review.openstack.org/6851623:09
jeblairsdague, clarkb, fungi, mordred: ^23:09
clarkbjeblair: cool23:10
jeblairsdague, clarkb, fungi, mordred: Um.  I'm particularly excited about the "approval with old jenkins vote causes automatic enqueue in check; then positive check result causes automatic enqueue into gate" behavior, which is actually shown in a test there.  :)23:10
ivar-lazzaroHello folks, I need some advice for configuring Jenkins+Gerrit filters... Specifically, I would like to run a Build whenever I get a specific comment from the stream23:10
jeblairsdague, clarkb, fungi, mordred: what could possibly go wrong with Zuul responding to its own events.  :)23:11
sdaguejeblair: heh23:11
sdagueit's turtles all the way down23:11
clarkbivar-lazzaro: we haven't used the gerrit trigger plugin in jenkins for a very long time. I am not personally aware of how to do that23:13
openstackgerritJames E. Blair proposed a change to openstack-infra/zuul: Add require-approval to Gerrit trigger  https://review.openstack.org/6851623:13
ivar-lazzaroclarkb: Thanks for your answer... hopefully someone around ever dealt with this problem23:14
*** eharney has quit IRC23:15
anteayaivar-lazzaro: if you lurk in #openstack-neutron and look for sukhdev he may be able to help you23:15
mikalanteaya: noted on the neutron rechecks, although my script isn't running t the moment23:15
ivar-lazzaroanteaya: thanks!23:16
*** MarkAtwood has quit IRC23:16
anteayamikal: thanks23:16
anteayaivar-lazzaro: np23:16
*** burt1 has quit IRC23:17
clarkbjeblair: in your check example pipeline, check tests will only run at most once every 48 hours?23:18
pleia2lifeless: the infra creds for nodepool?23:19
jeblairclarkb: yes; and only on changes that are producing events23:19
clarkbright, it doesn't trigger on that interval23:19
jeblairclarkb: so if no one cares about a change, it can sit there and not be updated.  if people are commenting on it, etc, it will get updated, and of course, if the only comment is an aprv and it's old, then it goes through the check->gate progression23:20
*** dims has joined #openstack-infra23:20
pleia2lifeless: and which image did you upload? fedora cloud image?23:20
*** jgrimm has quit IRC23:20
jeblairmikal: i believe what we are discussing will fill the need to have an automated system rechecking old changes23:21
openstackgerritAntoine Musso proposed a change to openstack-infra/zuul: webapp: set cache-control headers to prevent caching  https://review.openstack.org/6658323:21
jeblairmikal: (so in other words, i believe zuul is about to grow the ability to do this itself)23:21
openstackgerritKhai Do proposed a change to openstack-infra/jenkins-job-builder: make job creation consistent  https://review.openstack.org/6063323:21
mikaljeblair: yeah, I saw sdague's email bout zuul growing thus functionality, which I am fine with23:22
lifelesspleia2: Fedora 20 64-bit23:22
*** jergerber has quit IRC23:24
pleia2lifeless: I saw that much :) wasn't sure if there was a specific cloud image or something like ubuntu has23:24
lifelesspleia2: there is one23:25
*** sarob has quit IRC23:25
*** sarob has joined #openstack-infra23:25
ttxfungi: where are you hiding ?23:27
fungittx: working from my room23:27
ttxfungi: we are by the fire near the breakfast area if you want to join us (Heidi, tom)23:28
*** sarob has quit IRC23:29
fungicool, be right over23:30
lifelessfungi: hey, so hows nodepool :)23:30
*** rockyg has joined #openstack-infra23:31
sdagueclarkb: you watching -qa? we just lost a bunch of console logs23:38
clarkbsdague: ya, I think old jenkins is susceptible to that at a much lower rate than new jenkins with new scp plugin was23:39
lifelesssdague: replied to the gate thread23:39
*** mfer has quit IRC23:39
lifelesssdague: I'm fairly worried about the change now I've had time to think about it :(23:39
clarkbsdague: I am waiting for mordred's jenkinses then will do all of the others23:39
fungilifeless: not entirely sure what has caused it to decide not to do nightly builds of hpcloud-az2.devstack-precise (the logs don't show it even trying). wondering whether it will persist after we restart it23:40
lifelessfungi: /me starts chanting 'restart', 'restart', 'restart'23:40
fungiwell, it's in the middle of building ~150 nodes constantly to churn through the check pipeline23:41
lifelessfungi: does that make restarting it hard?23:42
fungioh, actually closer to 20023:42
fungilifeless: i believe restarting nodepool will abandon all of the currently building vms23:42
fungiso after a restart i will presumably need to manually delete any older than the start time23:43
lifelessfungi: yes; stop, list | grep BUILD | xards nodepool delete23:43
lifelessthen start23:43
lifelessor23:43
*** sarob has joined #openstack-infra23:43
lifelessstop; list > file; start; grep building < file | xargs -n1 nodepool delete23:43
*** whoops has quit IRC23:44
fungimmm, i haven't tried nodepool list/delete when nodepoold isn't running. i guess that works?23:44
lifelessyup23:44
jeblairyeah, that ^; if it's a lot you can parallelize it a bit23:44
lifelessIt might be an idea to make that queue things up to happen in the server, but at the moment its entirely client based23:44
fungiright, i'd split the list five ways like i'd been doing and run five delete loops in parallel. that's seemed to work well enough23:45
jeblair(which is occasionally pretty handy)23:45
jog0are there any plans to prevent the gate queue from getting stuck with the top change in queued mode23:46
jeblairjog0: nothing should ever be stuck.  can you elaborate?23:46
jog0stuck is probably the wrong word, if you look at http://status.openstack.org/zuul/23:47
*** kraman has quit IRC23:47
sdaguejog0: right, we don't have any d-g nodes23:47
jeblairjog0: node starvation due to check load23:48
sdaguethis is just starvation23:48
jog0the top gate queue patch 66986,3 is waiting for d-g nodes23:48
jeblairfungi: i also wonder how many of those building nodes are really building; can you look while you're there?23:48
fungiso, jeblair any input on https://review.openstack.org/66958 (if it's okay we should probably approve before a nodepoold restart). i'm pretty comfortable with self-approving https://review.openstack.org/6768423:48
jog0jeblair: right, what about propritizing the top n changes in gate queue?23:48
jog0for some low value of n23:48
sdaguejog0: so we did that before at one point, and it starved out the check queue entirely23:49
fungijeblair: i'm running a watch on what nodepool list reports in what states on what providers and am showing about 200 building across various providers currently23:49
jog0sdague: that was for prioritizing just the top n? what was n?23:49
fungiactually it's dropped to about 150 noe23:49
funginow23:49
jeblairfungi: aprvd23:49
sdaguejog0: no, but that's more complex logic that doesn't exist23:49
jeblairjog0: this situation is slightly abnormal23:50
jog0sdague: ahh thats what I thought.23:50
sdaguejog0: the real answer is not to be starved by a factor of 623:50
jeblairjog0: it's largely the result of an earlier zuul restart where all 100 changes were enqueued into check at once23:50
jog0jeblair: this may be abnormal now, but abnormal may become the new normal23:50
fungislammed it pretty hard23:50
sdaguewhich is basically where we stand, given the average number of nodes available23:50
jeblairjog0: i hope restarting zuul and enqueuing 100 changes at once is never normal.23:50
fungii hope manually reloading the zuul pipelines isn't about to become the new normal23:51
jog0jeblair: ahh,23:51
lifelessso here's a crazy question23:51
jog0although i have seen this before with a long check queue and a top of gate reset23:51
jeblairjog0: the point being that the gate queue is currently waiting for _all_ 100 changes to be serviced which is not normall, usually they only have to wait for a handful as they trickle in in real time.23:51
lifelesswhat about treating the middle nodes in the queue as an optimisation and running the *end* of the queue first23:51
lifelessonly if it fails do you need to the results from the ones in the middle23:51
jeblairjog0: the restart was to pick up the change clarkb wrote to only run jobs for the top N changes; once that really gets going we'll have a very different dynamic23:52
lifelessit would give up a current 'guarantee', that each commit merged is independently good23:52
openstackgerritA change was merged to openstack-infra/nodepool: Catch exceptions from nova flavor-list calls  https://review.openstack.org/6695823:52
*** markmcclain has quit IRC23:53
jeblairjog0: so i'd like to let this settle out before we tune again.  also, mordred is spinning up new jenkins masters to handle 250 more nodes23:53
jog0jeblair: ahh I see thanks for explaining23:53
jog0250 more nodes, nice23:53
jeblairlifeless: we never get to the end23:53
lifelessjeblair: I know, but I don't think that matters23:54
lifelessjeblair: the main point is not to cancel out a full stack of tests because one failed; it might be a spurious failure23:54
* StevenK blinks at the post horizon job23:55
sdaguelifeless: but you had to run the test anyway? or are you saying just squash the whole queue?23:55
lifelessjeblair: put another way, given Change C, Change C', C'', C''' etc, if any of these pass, either the predecessors had a transient bug (e.g. C' is broken but C'' fixes it), or it was a spurious failure (C' failed because of nondeterministic test) or it is a spurious pass23:56
jeblairlifeless: ah, this is similar to the 'batching changes' suggestion.  yes, it sacrifices bisectability.23:56
lifelesssdague: I'm saying, don't reset the gate queue, let it run and if a pass happens, land it23:56
sdaguelifeless: how do you land a pass?23:56
lifelesssdague: concurrently, build a parallel queue with the head ejected, and try that23:56
sdaguethat's 50 deep changes in heat requires, in which no heat tests ran?23:57
jeblairlifeless: okay, now that's the 'alternate branch' suggestion.  :)23:57
jeblairlifeless: which doesn't lose bisectability.23:57
jeblairbut uses extra resources23:57
lifelesssdague: if the original head lands, we discard the alternate; if none of them do, the alternate is the new main23:57
lifelessjeblair: actually they have very different latency characteristics, I think23:57
sdaguelifeless: ok, sure, but you did get the point that node starvation is one of our key issues right?23:58
lifelesssdague: I did, but its a key issue because we're assuming that C' failure means C'' must fail and so we throw away and restart all 50 changes23:58
clarkbso the problem with alternate branch that no one considers, is we have no resources :P23:58
lifelesssdague: which is a nontrivial exercise23:58
lifelessanyhow, just putting it out there23:59
lifelesssdague: I'm not sure what you mean by not heat tests ran23:59
sdagueso 50 deep in the queue23:59

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!