*** kriskend_ has quit IRC | 00:02 | |
openstackgerrit | Drew Thorstensen (thorst) proposed openstack/nova-powervm: Check for instance state fix https://review.openstack.org/416315 | 00:10 |
---|---|---|
*** thorst_ has quit IRC | 00:10 | |
*** jwcroppe has quit IRC | 00:27 | |
*** jwcroppe has joined #openstack-powervm | 00:40 | |
*** jwcroppe has quit IRC | 00:45 | |
*** thorst_ has joined #openstack-powervm | 00:54 | |
thorst_ | adreznec: new theory. It looks like there is a sync power state right after the spawn. It looks like we're returning powered off. | 01:32 |
thorst_ | sometimes. | 01:32 |
*** jwcroppe has joined #openstack-powervm | 01:47 | |
openstackgerrit | Drew Thorstensen (thorst) proposed openstack/nova-powervm: Check for instance state fix https://review.openstack.org/416315 | 02:03 |
*** jwcroppe has quit IRC | 02:14 | |
*** svenkat has joined #openstack-powervm | 02:58 | |
*** tonyb has quit IRC | 03:05 | |
*** tonyb has joined #openstack-powervm | 03:05 | |
*** svenkat has quit IRC | 03:08 | |
*** thorst_ has quit IRC | 03:14 | |
*** thorst_ has joined #openstack-powervm | 03:15 | |
*** thorst_ has quit IRC | 03:19 | |
*** thorst_ has joined #openstack-powervm | 03:25 | |
*** thorst_ has quit IRC | 03:25 | |
*** jwcroppe has joined #openstack-powervm | 03:54 | |
*** thorst_ has joined #openstack-powervm | 04:01 | |
*** thorst_ has quit IRC | 04:01 | |
*** jwcroppe has quit IRC | 04:05 | |
*** tlian has quit IRC | 05:01 | |
*** kotra03 has joined #openstack-powervm | 05:26 | |
*** jwcroppe has joined #openstack-powervm | 05:43 | |
*** jwcroppe has quit IRC | 05:54 | |
*** thorst_ has joined #openstack-powervm | 06:02 | |
*** thorst_ has quit IRC | 06:07 | |
*** thorst_ has joined #openstack-powervm | 06:28 | |
*** thorst_ has quit IRC | 06:33 | |
*** jwcroppe has joined #openstack-powervm | 07:33 | |
*** jwcroppe has quit IRC | 07:44 | |
*** openstackgerrit has quit IRC | 07:50 | |
*** jwcroppe has joined #openstack-powervm | 08:28 | |
*** thorst_ has joined #openstack-powervm | 08:29 | |
*** thorst_ has quit IRC | 08:34 | |
*** jwcroppe has quit IRC | 08:37 | |
*** jwcroppe has joined #openstack-powervm | 09:22 | |
*** jwcroppe has quit IRC | 09:33 | |
*** k0da has joined #openstack-powervm | 09:57 | |
*** openstackstatus has quit IRC | 10:13 | |
*** openstack has joined #openstack-powervm | 10:15 | |
*** jwcroppe has joined #openstack-powervm | 10:17 | |
*** jwcroppe has quit IRC | 10:27 | |
*** thorst_ has joined #openstack-powervm | 10:30 | |
*** thorst_ has quit IRC | 10:35 | |
*** kotra03 has quit IRC | 11:44 | |
*** smatzek has joined #openstack-powervm | 11:58 | |
*** jwcroppe has joined #openstack-powervm | 12:07 | |
*** jwcroppe has quit IRC | 12:15 | |
*** thorst_ has joined #openstack-powervm | 12:46 | |
*** openstackgerrit has joined #openstack-powervm | 13:01 | |
openstackgerrit | Drew Thorstensen (thorst) proposed openstack/nova-powervm: Switch to synchronous power on https://review.openstack.org/416969 | 13:01 |
*** edmondsw has joined #openstack-powervm | 13:05 | |
*** wangqwsh has joined #openstack-powervm | 13:28 | |
thorst_ | wangqwsh: I don't know that we'll have the meeting today...with esberlu out and efried's laptop overheated we'll be light on attendance | 13:34 |
thorst_ | adreznec: you around? | 13:34 |
*** svenkat has joined #openstack-powervm | 13:36 | |
*** mdrabe has joined #openstack-powervm | 13:36 | |
wangqwsh | thorst: yes, will we cancel meeting today? | 13:37 |
thorst_ | yeah, but I'm wondering if you've been able to turn off the publishing of the results to the patch sets for nova itself. | 13:38 |
thorst_ | since we're throwing in a lot of red until we get this fix | 13:38 |
thorst_ | (which hopefully I have today...) | 13:38 |
*** xia has joined #openstack-powervm | 13:39 | |
wangqwsh | i read your mail, i did not turn off it. | 13:39 |
wangqwsh | ok, let me try it | 13:39 |
thorst_ | we just want publishing of the results to the patch sets turned off - only for the nova project. We still want nova-powervm to do everything, and we still want the nova patches to actually run and push logs up to the log server | 13:40 |
thorst_ | that way we can continue debugging, but we're not flooding the nova team with red. | 13:40 |
xia | Hi everyone, I'm Yan Xia, wangqwsh's team member. | 13:41 |
thorst_ | Hey xia :-) | 13:41 |
thorst_ | pretty light group at the moment - others will be signing in after a few hours | 13:41 |
xia | Hi thorst_ good to see u | 13:42 |
wangqwsh | sorry, where can i query the status for nova? | 13:42 |
thorst_ | wangqwsh: private messaged you the link since its on the IBM network | 13:43 |
thorst_ | all jobs are red now :-) | 13:43 |
thorst_ | there is also this: http://ci-watch.tintri.com/project?project=nova | 13:43 |
wangqwsh | thx | 13:43 |
thorst_ | search for powervm in there...you can see how bad we're doing at the moment | 13:43 |
thorst_ | adreznec: I added the interfaces.template to https://ibm.ent.box.com/folder/15881322842 | 13:56 |
*** jwcroppe has joined #openstack-powervm | 13:56 | |
*** jpasqualetto has joined #openstack-powervm | 14:02 | |
mdrabe | thorst_: About https://review.openstack.org/#/c/416969/1, so async PowerOn updates the VM to "powered on" then sync power states comes through and sets it to off if the job hasn't completed yet? | 14:03 |
*** tblakes has joined #openstack-powervm | 14:03 | |
thorst_ | mdrabe: yep - and it appears to be destroying everything in CI now | 14:04 |
thorst_ | because we've had so many runs (think several thousand) without the novalinks being rebooted that its now slow enough of a job to happen | 14:04 |
mdrabe | That gonna be the same kinda thing for PowerOff? | 14:04 |
*** jwcroppe has quit IRC | 14:08 | |
thorst_ | hmm...probably | 14:09 |
thorst_ | my job just failed...so lets see why | 14:09 |
adreznec | thorst_: Crap, for some reason my phone didn't pop up the reminder for the CI meeting this morning and I totally spaced it. Reading on backscroll now, but ugh. Really surprised we hadn't hit this scenario before then | 14:10 |
*** efried has joined #openstack-powervm | 14:10 | |
thorst_ | adreznec efried mdrabe: so it looks like my change worked...but now its failing on 28 other tests (I think they're new) | 14:12 |
thorst_ | crap just keeps piling in :-) | 14:12 |
adreznec | link? | 14:12 |
adreznec | nm, got it | 14:12 |
*** kriskend has joined #openstack-powervm | 14:13 | |
adreznec | Those keystone ones are definitely not new | 14:13 |
thorst_ | http://184.172.12.213/69/416969/1/check/nova-powervm-pvm-dsvm-tempest-full/bafeecc/powervm_os_ci.html | 14:13 |
thorst_ | well, they're not specific to my change | 14:13 |
efried | thorst_, how's the run time? | 14:14 |
thorst_ | and my change at least fixes the big three that I spent all yesterday on | 14:14 |
thorst_ | it was fine...finished without any noticable difference | 14:14 |
adreznec | 1h 05m 10s | 14:14 |
efried | excellent - that's normal. | 14:14 |
thorst_ | and I had several take 1h 20m yesterday | 14:14 |
thorst_ | and some 50m | 14:14 |
thorst_ | so...not bad | 14:14 |
efried | course with failing tests, can't rely on that number | 14:14 |
thorst_ | can we shove this thing through? | 14:14 |
adreznec | Anything <90m is fine imo | 14:14 |
thorst_ | ahh, crap...we can't because jenkins -1'd it. | 14:15 |
adreznec | mhm | 14:15 |
thorst_ | and recheck (for some reason) doesn't pick up the change | 14:15 |
efried | Yeah, we're gating our own stuff on our CI, right? | 14:15 |
thorst_ | which is awesome. | 14:15 |
adreznec | recheck is broken? since when? | 14:15 |
thorst_ | I noticed it yesterday | 14:15 |
thorst_ | recheck powervm - your patch isn't applied. | 14:15 |
thorst_ | I'll add a comment or something silly | 14:16 |
openstackgerrit | Drew Thorstensen (thorst) proposed openstack/nova-powervm: Switch to synchronous power on https://review.openstack.org/416969 | 14:16 |
thorst_ | maybe we can push this through while our CI is running (is that awful? I just want to start getting some greens on future nova runs) | 14:17 |
efried | So wait, I'm confused. The change you're saying fixes the resize timing bug is this 'synchronous power on' one, not the "Check for instance state fix"? | 14:18 |
thorst_ | correct :-) | 14:18 |
efried | And where were you seeing recheck problems? And what were those problems? | 14:19 |
thorst_ | review.o.o is going real slow for me...trying to load it up to send you | 14:20 |
thorst_ | (stupid network) | 14:20 |
*** tlian has joined #openstack-powervm | 14:21 | |
adreznec | Hmm | 14:23 |
adreznec | Yeah I just tried doing a regular recheck on another patch and I'm not seeing anything pop up in zuul | 14:23 |
*** smatzek has quit IRC | 14:24 | |
adreznec | All right | 14:26 |
adreznec | So "recheck" is broken | 14:26 |
adreznec | But "recheck powervm" works | 14:26 |
adreznec | Ah | 14:27 |
adreznec | efried: thorst_ I remember this now, see 4659 | 14:27 |
thorst_ | recheck powervm was what I was running | 14:28 |
adreznec | Weird | 14:28 |
thorst_ | https://review.openstack.org/416315 | 14:28 |
thorst_ | this is how slow review.o.o is for me right now | 14:28 |
thorst_ | it took all that time to load | 14:28 |
adreznec | Worked for me on https://review.openstack.org/#/c/408758/2 | 14:28 |
adreznec | Running in Zuul right now | 14:28 |
thorst_ | it runs | 14:28 |
thorst_ | but it doesn't apply your patch | 14:28 |
thorst_ | see that patch set has 'thorst' comments on VERY common things | 14:29 |
thorst_ | and the logs have nothing about thorst in it | 14:29 |
adreznec | Hrm | 14:29 |
adreznec | That's disconcerting | 14:29 |
thorst_ | +2 | 14:29 |
*** wangqwsh_ has joined #openstack-powervm | 14:29 | |
adreznec | The problem I'm having is that I don't see why a recheck would be any different in that case... | 14:30 |
adreznec | It just re-executes the same job | 14:30 |
thorst_ | once wangqwsh gets the nova publishes turned off...maybe he can investigate that | 14:31 |
efried | thorst_, so you're saying the results we got yesterday on that change set don't necessarily indicate that your fix didn't work, because the CI runs weren't including your patches?? | 14:31 |
thorst_ | so if I publish a new version...it is fine | 14:31 |
thorst_ | if I recheck an existing one it doesn't pick up the change (at all) | 14:31 |
adreznec | thorst_: FYI your new changeset failed | 14:32 |
adreznec | Hit ReadTimeoutError: HTTPSConnectionPool(host='pypi.python.org', port=443): Read timed out | 14:32 |
efried | Doesn't pick up the most recent patch set, or doesn't pick up the whole change set? | 14:32 |
adreznec | Mostly likely network issues | 14:32 |
*** wangqwsh has quit IRC | 14:32 | |
*** wangqwsh_ is now known as wangqwsh | 14:32 | |
thorst_ | efried: whole change set. | 14:32 |
efried | That's bizarre. | 14:33 |
openstackgerrit | Drew Thorstensen (thorst) proposed openstack/nova-powervm: Switch to synchronous power on https://review.openstack.org/416969 | 14:33 |
thorst_ | yuh | 14:33 |
adreznec | I'm reading through the logs of the one I just rechecked | 14:33 |
adreznec | To see if that's the case there... | 14:33 |
thorst_ | super annoying when you're working at 10 PM on something breaking the entire CI | 14:33 |
adreznec | Well, I'm seeing it do the git fetch of the change... | 14:34 |
thorst_ | weird... | 14:34 |
thorst_ | check logs...maybe I'm an idiot and it worked | 14:34 |
thorst_ | (or was frustrated and tired) | 14:34 |
thorst_ | 10 PM is late for an old dude like me | 14:35 |
*** tblakes has quit IRC | 14:36 | |
thorst_ | adreznec: In other news (while we wait...) should we put the link to the images in the wiki (https://wiki.openstack.org/wiki/PowerVM) | 14:39 |
thorst_ | I can do that unless you see a problem with it | 14:40 |
adreznec | Nope, seems fine. | 14:40 |
*** efried has quit IRC | 14:41 | |
*** efried_ has joined #openstack-powervm | 14:41 | |
*** tjakobs has joined #openstack-powervm | 14:42 | |
adreznec | And yeah, not sure what's going on here... I'm definitely seeing it say things like | 14:43 |
adreznec | 11:38:39 2017-01-04 16:38:46.706 | +functions-common:git_clone:566 [m git show --oneline | 14:43 |
adreznec | 11:38:39 2017-01-04 16:38:46.708 | +functions-common:git_clone:566 [m head -1 | 14:43 |
adreznec | 11:38:39 2017-01-04 16:38:46.708 | 8160bc3 Check for instance state fix | 14:43 |
adreznec | In the log | 14:43 |
thorst_ | weird... | 14:44 |
thorst_ | I guess next I'll try a recheck | 14:44 |
adreznec | But I'm also not seeing any of the logs you added showing up anywhere | 14:44 |
thorst_ | well, you've got one going | 14:44 |
adreznec | In n-cpu | 14:44 |
thorst_ | so I'm not blind at least. | 14:44 |
wangqwsh | thorst: jenkins job status would be reflected on gerrit for CI. only 'success' or 'failure'. i think there is no the third status... | 14:47 |
wangqwsh | if we do not want the 'red' in nova and run nova patches, one option is set the ci job for nova project is always 'success'. | 14:47 |
thorst_ | wangqwsh: I don't think we want to do that. esberglu just recently turned that on...not sure how he did that | 14:49 |
thorst_ | adreznec may have more of an idea | 14:49 |
thorst_ | adreznec: FYI - https://wiki.openstack.org/wiki/PowerVM#Installation_Requirements | 14:49 |
thorst_ | bbiab | 14:49 |
*** tjakobs has quit IRC | 14:49 | |
adreznec | wangqwsh: There is another option, which is what we had before. Basically we don't vote/publish any results on the nova patches. We still run the jobs, but no comments get posted | 14:50 |
adreznec | That way we can still see the runs in our system | 14:50 |
thorst_ | yes please | 14:50 |
adreznec | But they don't get posted to gerrit | 14:50 |
*** thorst_ is now known as thorst_afk | 14:50 | |
wangqwsh | ok...let me try it | 14:51 |
*** jwcroppe has joined #openstack-powervm | 14:51 | |
*** efried_ has quit IRC | 14:52 | |
*** efried has joined #openstack-powervm | 14:52 | |
*** xia has quit IRC | 14:53 | |
*** xia has joined #openstack-powervm | 14:54 | |
*** apearson has joined #openstack-powervm | 14:55 | |
*** jwcroppe has quit IRC | 15:03 | |
thorst_afk | geez...pypi must be REALLY slow...it has taken 45 minutes for us to get a stack through on this node. | 15:12 |
efried | Lab network problem? Which could also explain timeouts in test runs? | 15:13 |
thorst_afk | well, timeouts in test runs would all be within a given VM? | 15:16 |
thorst_afk | the timeouts I hit weren't timeouts...those three resize timeouts were because of that state change issue | 15:16 |
thorst_afk | not sure about keystone or what not | 15:16 |
adreznec | Huh | 15:17 |
adreznec | we should test from another lab | 15:17 |
thorst_afk | I'll check the network in a bit too | 15:18 |
*** wangqwsh has quit IRC | 15:22 | |
*** jwcroppe has joined #openstack-powervm | 15:22 | |
*** smatzek has joined #openstack-powervm | 15:22 | |
thorst_afk | network doesn't seem too bad in POK...must be the backbone | 15:30 |
thorst_afk | do we have a file we can try to FTP down into POK? So we can prove a bad path? | 15:30 |
*** tjakobs has joined #openstack-powervm | 15:32 | |
adreznec | thorst_afk: Could just try and grab something from rchgsa | 15:35 |
thorst_afk | nah, want non-IBM path | 15:35 |
thorst_afk | trying ubuntu.com | 15:35 |
adreznec | Ah k | 15:35 |
*** kriskend_ has joined #openstack-powervm | 15:36 | |
thorst_afk | it's the DNS. | 15:36 |
thorst_afk | damnit all | 15:36 |
adreznec | Again? Ugh | 15:37 |
*** tblakes_ has joined #openstack-powervm | 15:38 | |
*** kriskend has quit IRC | 15:39 | |
thorst_afk | well, the backup DNS is taking 80ms (which we made primary) | 15:39 |
thorst_afk | the primary (which we abandoned) is taking .8 ms | 15:39 |
adreznec | We just can't win | 15:40 |
adreznec | Sucks that the lab infrastructure is so inconsistent | 15:40 |
thorst_afk | so it only matters for the VMs themselves. | 15:41 |
*** tblakes_ is now known as tblakes | 15:41 | |
thorst_afk | adreznec: can we run our own DNS? | 15:45 |
thorst_afk | that backs to another...so we have a single server to update when we want all our CI slaves need a change/ | 15:45 |
adreznec | thorst_afk: Yeah, I suppose I could set up a dns cache if I had a VM to do it on | 15:49 |
thorst_afk | adreznec: get me an IP and I'll get it set up? | 15:49 |
thorst_afk | 9.47.x.x something or other | 15:50 |
thorst_afk | how do we even change the IPs in the base image? | 15:53 |
thorst_afk | I mean the DNS IP... | 15:53 |
*** thorst_afk is now known as thorst_ | 15:53 | |
adreznec | thorst_: Well, we could recapture with the new IP (probably not ideal) or we could just do it in the capture script for the base image | 15:55 |
adreznec | The one nodepool runs on nightly capture | 15:55 |
*** xia has quit IRC | 15:55 | |
thorst_ | adreznec: we could for an image rebuild tho | 15:55 |
*** tlian has quit IRC | 17:00 | |
*** tlian has joined #openstack-powervm | 17:20 | |
thorst_ | adreznec efried: Image built...its just saving to glance now | 17:32 |
*** tblakes has quit IRC | 17:43 | |
*** kriskend__ has joined #openstack-powervm | 17:54 | |
*** kriskend_ has quit IRC | 17:54 | |
thorst_ | adreznec efried: ready nodes are respawning now. | 18:17 |
thorst_ | jobs are running... | 18:41 |
adreznec | Hmm | 18:47 |
adreznec | OK | 18:47 |
thorst_ | stacking isn't exactly flying but...its not awful | 19:10 |
thorst_ | adreznec efried: 27 fails in my run | 20:20 |
adreznec | thorst_: Which run | 20:26 |
adreznec | I don't see anything new on 416989 | 20:26 |
thorst_ | it's uploading | 20:26 |
thorst_ | check now | 20:27 |
thorst_ | all keystone failures | 20:27 |
adreznec | Hmm | 20:27 |
thorst_ | so we've got at least that going for us | 20:27 |
adreznec | Yep, same keystone ones | 20:27 |
thorst_ | adreznec: you able to dig in on that? | 20:32 |
thorst_ | I'm fried on CI crap atm | 20:32 |
adreznec | Looking at a bit of OSA stuff quick, I can try and hop over to it after that | 20:34 |
thorst_ | wonder if we need to update a policy file... | 20:34 |
thorst_ | I bet that's it. | 20:34 |
thorst_ | I bet a new policy was added and we're using newton's devstack? | 20:34 |
adreznec | We should be using whatever devstack branch corresponds to the patch branch... aren't we? | 20:35 |
adreznec | e.g. nova master patch = devstack master branch, nova stable/newton = devstack stable/newton | 20:35 |
thorst_ | I thought esberglu pegged us to newton? | 20:36 |
thorst_ | not sure... | 20:36 |
adreznec | I thought that was just for the undercloud | 20:36 |
adreznec | Hmm | 20:36 |
thorst_ | but it looks like a policy.json issue | 20:36 |
thorst_ | ooo, maybe | 20:36 |
thorst_ | well, its not a new permission... | 20:40 |
*** tblakes has joined #openstack-powervm | 20:41 | |
efried | test | 21:23 |
efried | test | 21:23 |
*** smatzek has quit IRC | 21:29 | |
thorst_ | efried: testing 1 2 | 21:36 |
thorst_ | adreznec: think this could be it? | 21:38 |
thorst_ | https://github.com/openstack-dev/devstack/commit/80b1d0ae7db263dada7fdc4d9d8190d0518b8f6c | 21:38 |
*** svenkat has quit IRC | 22:03 | |
*** thorst_ has quit IRC | 22:04 | |
*** apearson has quit IRC | 22:15 | |
*** kriskend_ has joined #openstack-powervm | 22:23 | |
*** kriskend__ has quit IRC | 22:23 | |
*** dwayne__ has quit IRC | 22:45 | |
*** edmondsw has quit IRC | 22:51 | |
*** tjakobs has quit IRC | 23:30 | |
*** tblakes has quit IRC | 23:31 | |
*** svenkat has joined #openstack-powervm | 23:38 | |
*** mdrabe has quit IRC | 23:50 | |
*** jwcroppe has quit IRC | 23:58 | |
*** jwcroppe has joined #openstack-powervm | 23:59 |
Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!