Friday, 2014-05-23

*** TravT has quit IRC00:10
openstackgerritGregory Haynes proposed a change to openstack/tripleo-incubator: Fix cirros MD5 fails  https://review.openstack.org/9276400:11
*** nati_ueno has joined #tripleo00:12
* tchaypo goes to gate for the 5th time this week00:12
*** michchap has quit IRC00:21
*** michchap has joined #tripleo00:21
*** matsuhashi has joined #tripleo00:23
*** killer_prince has quit IRC00:25
*** nati_ueno has quit IRC00:29
*** chuckC has quit IRC00:29
*** michchap_ has joined #tripleo00:34
*** michchap has quit IRC00:36
*** nati_ueno has joined #tripleo00:37
*** Penick has joined #tripleo00:38
*** andreaf_ has joined #tripleo00:38
*** saurabhs has quit IRC00:40
*** zackf has joined #tripleo00:41
*** andreaf has quit IRC00:41
*** Penick has quit IRC00:43
*** xuhaiwei has joined #tripleo00:48
*** andreaf_ has quit IRC00:50
*** nati_ueno has quit IRC01:02
*** nati_ueno has joined #tripleo01:03
StevenKYay, expenses ...01:07
greghaynesaw, crap01:08
greghaynesneed to do that01:08
clarkb:P I did them on tuesday01:08
StevenKAll of my amex transactions have finally hit eem01:08
clarkbgreghaynes: you are now a favorite in my expense reporting01:08
StevenKHaha01:08
greghaynesclarkb: dawwww01:09
greghaynesIll remember for next summit01:09
clarkbuh oh01:09
* StevenK has a dinner receipt with greghaynes on it too01:10
greghaynesYep. This is how you win at expenses01:10
StevenKBy having every one of your workmates do them?01:10
greghaynesPrecisely01:11
greghaynesThe only winning move is not to play01:11
StevenKSo far, nothing wants a receipt attached01:14
adam_gso01:14
*** nati_ueno has quit IRC01:14
adam_ghttps://launchpad.net/bugs/1316475 looks like a cloud-init issue01:14
uvirtbotLaunchpad bug 1316475 in tripleo "trusty hang on first boot post deploy" [Critical,Triaged]01:14
*** petertoft has joined #tripleo01:19
*** blamar_ has joined #tripleo01:30
*** blamar has quit IRC01:31
*** blamar_ is now known as blamar01:31
*** lazy_prince has joined #tripleo01:32
*** lazy_prince is now known as killer_prince01:32
StevenKNow to scan in receipts01:34
*** nosnos has joined #tripleo01:35
*** nati_ueno has joined #tripleo01:45
StevenKgreghaynes: "Your" expense claim has been submitted01:48
*** dkehn_ has joined #tripleo01:49
*** nati_ueno has quit IRC01:49
clarkbStevenK: when we travel internationally we tend to have to send in physical receipts. Do you have to do that when it's international for you or is it all US centric?01:51
StevenKclarkb: They've all said "Digital preferred" for me01:51
*** nati_ueno has joined #tripleo01:51
StevenKSo far I've only attached digital copies and filed the paper ones here, and haven't had any problems01:52
StevenKlifeless: I also had no problems attaching files to expenses with Firefox 2901:53
clarkbStevenK: same here, worked fine01:54
*** nati_ueno has quit IRC01:54
*** petertoft has quit IRC01:54
*** lazy_prince has joined #tripleo01:58
*** killer_prince has quit IRC01:58
*** dkehnx has quit IRC01:58
*** jpeeler has quit IRC01:58
*** lazy_prince is now known as killer_prince01:58
*** jpeeler has joined #tripleo02:00
*** dkehn_ is now known as dkehnx02:02
openstackgerritEthan Lynn proposed a change to openstack/diskimage-builder: Add Ubuntu local image support  https://review.openstack.org/9088702:12
*** nati_ueno has joined #tripleo02:14
lifelessStevenK: they may well have fixed it by now ;)02:18
StevenKHeh, that's true.02:19
* StevenK waits for the check-tripleo queue to die down02:19
lifelessStevenK: 'two things' - subprocess to API, and no wait to waiting02:20
StevenKI did wonder if that's what you meant02:21
StevenKWhich is a bit of a pain to split up, but I can do that02:21
StevenKBut should be far better than reordering bzr pipelines02:25
StevenK(Where I used to get so annoyed, that I would re-make the pipeline from scratch to save frustration)02:27
*** nati_ueno has quit IRC02:32
*** julim has quit IRC02:32
*** killer_prince has quit IRC02:44
*** killer_prince has joined #tripleo02:48
mordredclarkb: for international travel, the need for receipts varies by target country02:50
mordredfor instance...02:50
* mordred apologizes to non-hp people in channel02:50
mordredHP wants receipts for a ₤3 coffee in england, but I can buy a €100 dinner in spain with no receipt02:51
pleia2mordred: can we get an apology too?02:56
pleia2;D02:56
pleia2crazy receipt policies02:56
mordredpleia2: ;)02:57
*** untriaged-bot has joined #tripleo03:00
untriaged-botUntriaged bugs so far:03:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/132194303:00
uvirtbotLaunchpad bug 1321943 in tripleo "Ceilometer Swift polling on overcloud control node fails with a 403 forbidden error" [Undecided,In progress]03:00
untriaged-bothttps://bugs.launchpad.net/diskimage-builder/+bug/132156303:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/131876703:00
uvirtbotLaunchpad bug 1321563 in diskimage-builder "Add support in ramdisk element to fetch IP from dhcp" [Undecided,New]03:00
uvirtbotLaunchpad bug 1318767 in tripleo "apache element SSL cert check fails" [Undecided,New]03:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/131497803:00
uvirtbotLaunchpad bug 1314978 in tripleo "Cloud vm not pingable after overcloud upgrade " [Undecided,New]03:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/131535503:00
uvirtbotLaunchpad bug 1315355 in tripleo "Upgrade of overcloud failed with "Connection to neutron failed: Maximum attempts reached"" [Undecided,New]03:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/131667503:00
uvirtbotLaunchpad bug 1316675 in tripleo "Saving a devtest VM results in error" [Undecided,In progress]03:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/132009003:00
uvirtbotLaunchpad bug 1320090 in tripleo "pxe_ephemeral_format': u'ext4 left on nodes after deploys deleted" [Undecided,New]03:00
*** untriaged-bot has quit IRC03:00
*** lazy_prince has joined #tripleo03:02
*** killer_prince has quit IRC03:02
*** lazy_prince is now known as killer_prince03:02
*** nosnos has quit IRC03:03
openstackgerritJames Polley proposed a change to openstack/tripleo-incubator: Simplify creation/cleanup of testenvs  https://review.openstack.org/9273403:04
lifelesshttp://bazaar.launchpad.net/~cloud-init-dev/cloud-init/trunk/view/head:/cloudinit/cs_utils.py#L19 is confidence inducing03:14
*** morazi has joined #tripleo03:15
openstackgerritBen Nemec proposed a change to openstack/tripleo-specs: TripleO on OpenStack  https://review.openstack.org/9264203:17
openstackgerritJames Polley proposed a change to openstack/tripleo-incubator: Simplify creation/cleanup of testenvs  https://review.openstack.org/9273403:27
*** matsuhashi has quit IRC03:29
*** eghobo has joined #tripleo03:32
*** matsuhashi has joined #tripleo03:36
StevenKlifeless: Now I have to review my lunch03:40
*** matsuhashi has quit IRC03:43
*** matsuhashi has joined #tripleo03:43
*** matsuhas_ has joined #tripleo03:45
*** matsuhashi has quit IRC03:45
*** nosnos has joined #tripleo03:45
*** csd has quit IRC03:55
*** akuznetsov has joined #tripleo03:59
*** OldCrowEW has joined #tripleo03:59
*** killer_prince has quit IRC04:04
*** lazy_prince has joined #tripleo04:05
*** lazy_prince is now known as killer_prince04:05
*** killer_prince has quit IRC04:14
*** nati_ueno has joined #tripleo04:28
*** edmund has quit IRC04:28
*** eghobo has quit IRC04:28
*** edmund has joined #tripleo04:28
*** matsuhas_ has quit IRC04:34
*** marun has quit IRC04:40
*** lazy_prince has joined #tripleo04:40
*** lazy_prince is now known as killer_prince04:40
*** morganfainberg is now known as morganfainberg_Z04:42
*** marun has joined #tripleo04:42
*** matsuhashi has joined #tripleo04:43
*** marun has quit IRC04:44
*** eghobo has joined #tripleo04:44
*** akuznetsov has quit IRC05:01
*** lazy_prince3 has joined #tripleo05:08
greghaynesYes, I was just reading that05:09
*** dshulyak_ has joined #tripleo05:09
openstackgerritSteve Kowalik proposed a change to openstack/os-cloud-config: Switch from subprocess to ironicclient  https://review.openstack.org/9396405:10
*** OldCrowEW has quit IRC05:10
greghaynesDoes it just assume that if you have a ttyS1 theres that type of service connected on the other end?05:10
openstackgerritSteve Kowalik proposed a change to openstack/os-cloud-config: Loop for 600 seconds rather than checking ironic  https://review.openstack.org/9506605:10
openstackgerritSteve Kowalik proposed a change to openstack/os-cloud-config: Switch from subprocess to novaclient  https://review.openstack.org/9477905:10
openstackgerritSteve Kowalik proposed a change to openstack/os-cloud-config: Loop for 600 seconds rather than checking nova-bm  https://review.openstack.org/9506705:11
openstackgerritSteve Kowalik proposed a change to openstack/os-cloud-config: Switch from subprocess to keystoneclient  https://review.openstack.org/9478205:11
*** edmund has quit IRC05:11
*** edmund has joined #tripleo05:11
tchaypookay.05:13
*** eghobo has quit IRC05:15
*** eguz has joined #tripleo05:15
*** eghobo has joined #tripleo05:15
*** eghobo has quit IRC05:15
* StevenK waits for the flood of gerrit mail to cease05:15
*** eguz has quit IRC05:20
*** edmund1 has joined #tripleo05:21
*** edmund has quit IRC05:23
greghayneswow, I think it does05:24
StevenKgreghaynes: I keep punching the 'set -o pipefail' comments because bnemec has been trying to land his branch to enforce that for tie for some time, and I'd like to make his job easier.05:26
greghaynesStevenK: Yep, and I agree on anything new, but I just am not sure I should change that for an already existing file with just a one line change - seems like an easy way to introduce a bug from something undocumented in commit msg05:27
greghaynesThat shouldnt conflict with his change either since that file already exists05:28
StevenKThe file is short enough that git might mark it as needing conflict resolution05:28
StevenKBut I see your point05:28
greghaynes:( if it does id be up for basing off him05:29
greghaynesbecause that totally is annoying05:29
greghaynesmaybe we should just land his change05:29
*** shakayumi has joined #tripleo05:30
greghaynesooo https://review.openstack.org/#/c/83929/05:30
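(Editor's sketch, not part of the conversation.) The 'set -o pipefail' point above is easier to weigh with the behaviour in front of you. Without it, a bash pipeline reports only the exit status of its last command, so a failure earlier in the pipe goes unnoticed; with it, the pipeline fails if any stage fails. That is also why turning it on in an existing script can change behaviour. The helper name `pipeline_status` is made up for this illustration, and the sketch assumes `bash` is on the PATH.

```python
import subprocess


def pipeline_status(pipeline: str, pipefail: bool) -> int:
    """Run a shell pipeline under bash and return its exit status.

    Hypothetical helper for illustration only; it is not taken from any
    TripleO script.
    """
    # Prepend the option so it applies to the pipeline that follows.
    prefix = "set -o pipefail; " if pipefail else ""
    result = subprocess.run(["bash", "-c", prefix + pipeline])
    return result.returncode


if __name__ == "__main__":
    # 'false' fails, 'true' succeeds; only the last stage counts by default.
    print(pipeline_status("false | true", pipefail=False))
    # With pipefail, the failing first stage makes the whole pipeline fail.
    print(pipeline_status("false | true", pipefail=True))
```

A script that was quietly relying on a failing early stage would start exiting non-zero once the option is enabled, which is the kind of undocumented change greghaynes is wary of in a one-line patch.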
*** ramishra has joined #tripleo05:31
tchaypoonce again on the plane, they told me that "transmitting devices" must not be switched on, but "please feel free to join our in-flight wifi"05:32
StevenKHeh heh05:33
greghaynesIts a trap!05:33
StevenKtchaypo: Did they also ask passengers to not congregate in any common areas, like the toilets, or their seats ... ? :-P05:33
*** jprovazn has joined #tripleo05:34
tchayponot for today's domestic flight05:34
tchaypothey did on the way back from atlanta05:34
tchaypothen they keep the seatbelt sign on for so long that as soon as it goes off half the plane is congregating around the toilet door05:34
StevenKHeh, it's only announced on Qantas for flights to the US, not out of05:35
greghaynesMy flight from atlanta had an awesome announcement of "We cannot give you permission to go to use the bathroom if the seatbelt sign is on. If you get up and do it I won't tackle you, but I can't give you permission to do it. Find the grey area."05:35
StevenKIs that when you point out the carpets are grey?05:36
greghayneshaha05:36
*** eghobo has joined #tripleo05:42
*** tzumainn has quit IRC05:42
*** lazy_prince3 has quit IRC05:51
*** edmund has joined #tripleo05:54
*** edmund1 has quit IRC05:54
*** akuznetsov has joined #tripleo05:56
*** ramishra has quit IRC05:57
greghaynesok, just replicated that trusty hang thing in a vm05:58
greghaynesthis is amazing05:58
*** eghobo has quit IRC05:58
greghaynesYou just cannot have a second serial port unless its hooked up to CloudSigma05:58
greghaynesor it hangs05:58
greghaynesAs a launchpad noob - is the normal etiquette to open a new bug on https://launchpad.net/cloud-init or add cloud-init as an affected project?05:59
*** chuckC has joined #tripleo06:04
*** akuznetsov has quit IRC06:07
*** nati_ueno has quit IRC06:08
greghaynesoh, nvm, was already added06:09
StevenKgreghaynes: Usually, you add another bugtask, unless doing so will cause timeouts :-P06:10
StevenK(Which is like >8 or so)06:10
clarkb StevenK and in those cases you figure out how to do it via email06:12
greghaynesthats webscale06:13
*** akuznetsov has joined #tripleo06:13
StevenKclarkb: affects foo -- but that may cause random timeouts when changing any bugtasks state06:14
StevenKBecause notifying in request is such awesomeness06:14
*** jtomasek has joined #tripleo06:19
*** jtomasek has quit IRC06:26
*** mrunge has joined #tripleo06:27
*** dshulyak_ has quit IRC06:27
*** edmund has quit IRC06:29
tchaypoexpense report almost complete06:32
tchaypowoobles~06:33
StevenKHaha06:33
*** nati_ueno has joined #tripleo06:41
*** lazy_prince3 has joined #tripleo06:48
*** e0ne has joined #tripleo06:57
*** jcoufal has joined #tripleo07:00
*** lazy_prince3 is now known as lazy_prince07:02
*** e0ne has quit IRC07:09
*** killer_prince has quit IRC07:12
*** e0ne has joined #tripleo07:12
openstackgerritDmitry Shulyak proposed a change to openstack/tripleo-specs: Haproxy configuration options  https://review.openstack.org/9490707:16
*** killer_prince has joined #tripleo07:21
*** e0ne_ has joined #tripleo07:22
*** e0ne has quit IRC07:25
*** jtomasek has joined #tripleo07:28
*** ifarkas has joined #tripleo07:29
*** gcha has joined #tripleo07:35
*** nati_ueno has quit IRC07:39
*** zackf has quit IRC07:39
*** jistr has joined #tripleo07:43
openstackgerritDmitry Shulyak proposed a change to openstack/tripleo-image-elements: Allow multiple binds per service in haproxy  https://review.openstack.org/8992507:49
openstackgerritDmitry Shulyak proposed a change to openstack/tripleo-image-elements: Prevent duplication of net_binds per service  https://review.openstack.org/9509707:49
openstackgerritLadislav Smola proposed a change to openstack/tripleo-incubator: Adding Undercloud Ceilometer config element  https://review.openstack.org/9463707:49
openstackgerritLadislav Smola proposed a change to openstack/tripleo-incubator: Generating of password for SNMPd  https://review.openstack.org/9483807:49
*** pelix has joined #tripleo07:50
*** rwsu has quit IRC07:58
*** derekh_ has joined #tripleo08:00
derekh_ci still in trouble :-( http://goodsquishy.com/downloads/tripleo-jobs.html08:08
*** akuznetsov has quit IRC08:09
derekh_SpamapS: if you're still there, want me to take over on anything?08:12
openstackgerritJan Provaznik proposed a change to openstack/tripleo-incubator: Generate overcloud keystone keys/certs  https://review.openstack.org/9510108:12
*** yamahata has joined #tripleo08:13
derekh_looks like we are only running about 8 jobs at a time most of which are failing :-(08:13
derekh_lifeless: SpamapS R1 ran 177 jobs in the last 10 hours, only 5 passed08:15
derekh_brb08:15
*** akuznetsov has joined #tripleo08:27
*** akrivoka has joined #tripleo08:27
*** ifarkas has quit IRC08:27
*** jp_at_hp has joined #tripleo08:32
lifelessderekh_: :(08:35
derekh_lifeless: trying to dig into it at the moment, the jobs that passed were all a few hours ago, none passed since then08:36
*** CLOUDOUTAGE has joined #tripleo08:37
CLOUDOUTAGElifeless devananda Ng SpamapS jog0 GheRivero derekh dprince slagle  -- Jobs failing on R108:37
*** CLOUDOUTAGE has quit IRC08:37
*** CLOUDOUTAGE has joined #tripleo08:39
CLOUDOUTAGElifeless devananda Ng SpamapS jog0 GheRivero derekh dprince slagle  -- Jobs failing on R1 https://etherpad.openstack.org/p/cloud-outage08:39
*** CLOUDOUTAGE has quit IRC08:39
lifelessderekh_: thanks08:40
lifelessderekh_: I've been crook for two days, just coming out of it now08:40
derekh_lifeless: :-(   , no prob08:41
openstackgerritRoman Podoliaka proposed a change to openstack/tuskar: Don't display passwords when listing overclouds  https://review.openstack.org/9464808:43
*** lucasagomes has joined #tripleo08:45
*** matsuhashi has quit IRC08:48
*** akrivoka has quit IRC08:56
*** akrivoka has joined #tripleo08:56
*** pblaho has joined #tripleo08:58
*** untriaged-bot has joined #tripleo09:00
untriaged-botUntriaged bugs so far:09:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/132194309:00
uvirtbotLaunchpad bug 1321943 in tripleo "Ceilometer Swift polling on overcloud control node fails with a 403 forbidden error" [Undecided,In progress]09:00
untriaged-bothttps://bugs.launchpad.net/diskimage-builder/+bug/132156309:00
uvirtbotLaunchpad bug 1321563 in diskimage-builder "Add support in ramdisk element to fetch IP from dhcp" [Undecided,New]09:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/131876709:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/131497809:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/131535509:00
uvirtbotLaunchpad bug 1318767 in tripleo "apache element SSL cert check fails" [Undecided,New]09:00
uvirtbotLaunchpad bug 1314978 in tripleo "Cloud vm not pingable after overcloud upgrade " [Undecided,New]09:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/131667509:00
uvirtbotLaunchpad bug 1315355 in tripleo "Upgrade of overcloud failed with "Connection to neutron failed: Maximum attempts reached"" [Undecided,New]09:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/132009009:00
uvirtbotLaunchpad bug 1316675 in tripleo "Saving a devtest VM results in error" [Undecided,In progress]09:00
uvirtbotLaunchpad bug 1320090 in tripleo "pxe_ephemeral_format': u'ext4 left on nodes after deploys deleted" [Undecided,New]09:00
*** untriaged-bot has quit IRC09:00
*** shakayumi has quit IRC09:01
openstackgerritJon-Paul Sullivan (jp_at_hp) proposed a change to openstack/tripleo-incubator: Add the select-cloud script  https://review.openstack.org/8280709:01
derekh_ssh to overcloud controller is very jumpy, sometimes stalling for over a minute before becoming responsive again09:04
rcarrill`haven't played with devtest for some time, it now complains about TRIPLEO_ROOT pointing to a location that does not contain a tripleo checkout09:06
rcarrill`so now devtest does not download tripleo-incubator itself to .cache?09:06
*** rcarrill` is now known as rcarrillocruz09:06
rcarrillocruz(morning)09:06
*** e0ne_ has quit IRC09:08
*** e0ne has joined #tripleo09:09
*** CLOUDOUTAGE has joined #tripleo09:10
CLOUDOUTAGElifeless devananda Ng SpamapS jog0 GheRivero derekh dprince slagle  -- Jobs failing on R1 https://etherpad.openstack.org/p/cloud-outage09:10
*** CLOUDOUTAGE has quit IRC09:10
*** e0ne_ has joined #tripleo09:11
Kiall_Anyone able to comment on https://review.openstack.org/#/c/93031/ ? Tiny patch with multiple +2's, and a seemingly spurious f20 failure :(09:11
*** Kiall_ is now known as Kiall09:12
*** athomas has joined #tripleo09:13
*** giulivo has joined #tripleo09:13
*** e0ne has quit IRC09:15
*** shardy_afk is now known as shardy09:15
*** matsuhashi has joined #tripleo09:16
derekh_Kiall: that f20 failure is because a regression was committed that caused the f20 job to fail pretty much all of yesterday. That's fixed now, but one of the ci racks is having other problems, so don't recheck until it's fixed09:20
derekh_Ng SpamapS lifeless so ssh to overcloud controller is freezing on and off for around a minute each time, would the mlx driver need to be reloaded on it ?09:21
derekh_if it does, what are the commands? modprobe -r mlx4_en ; modprobe mlx4_en09:21
*** jcoufal has quit IRC09:23
*** jcoufal has joined #tripleo09:23
lifelessderekh_: something like that; its probably in history. I thought we'd upgraded it to trusty though :(09:31
lifelessderekh_: which shouldn't need such shenanigans09:31
derekh_lifeless: VERSION="13.10, Saucy Salamander"09:32
lifelessohhh gnar09:37
*** CLOUDOUTAGE has joined #tripleo09:41
CLOUDOUTAGElifeless devananda Ng SpamapS jog0 GheRivero derekh dprince slagle  -- Jobs failing on R1 https://etherpad.openstack.org/p/cloud-outage09:41
*** CLOUDOUTAGE has quit IRC09:41
*** nosnos has quit IRC09:45
lsmoladerekh_: hello, have you seen this? http://pastebin.com/H2Ze849g09:46
lsmoladerekh_: seems like this is failing https://github.com/openstack/tripleo-incubator/blob/master/scripts/register-nodes#L3409:47
derekh_lsmola: it doesn't look familiar probably worth looking at the nova logs09:48
lsmoladerekh_: because there is a certificate passed as pm_password, is that alright?09:48
lsmoladerekh_: it works when I just delete pm_user and pm_password09:48
lsmoladerekh_: could it be there is a wrong character in the certificate?09:49
derekh_lsmola: off the top of my head I'm not sure what's supposed to be passed in, I don't have a running devtest at the moment to check but will start one now09:50
lsmoladerekh_: ok, cool09:50
*** ccorrigan has quit IRC09:55
*** athomas has quit IRC09:57
lsmoladerekh_: seems like there should be this rsa key, so only thing that comes to my mind is that there has to be some bad character http://pastebin.com/MQXFuSbw09:59
lsmoladerekh_: so that nova baremetal returns 50009:59
lsmoladerekh_: weird, seems like this is there for a long time and I never had problems with that :-) I have fresh f20 maybe that is the cause10:01
*** ifarkas has joined #tripleo10:01
*** athomas has joined #tripleo10:04
*** akuznetsov has quit IRC10:05
*** rlandy has joined #tripleo10:07
*** matsuhashi has quit IRC10:12
*** CLOUDOUTAGE has joined #tripleo10:12
CLOUDOUTAGElifeless devananda Ng SpamapS jog0 GheRivero derekh dprince slagle  -- Jobs failing on R1 https://etherpad.openstack.org/p/cloud-outage10:12
*** CLOUDOUTAGE has quit IRC10:12
derekh_lsmola: yup, my pm_password is a private key10:12
lsmoladerekh_: do you see the failure?10:12
derekh_lsmola: nope mine is ticking along , I'll kill it and get uptodate everything now10:13
lsmoladerekh_: I have fresh f20 from the beaker, if that helps with anything :-)10:14
derekh_lsmola: ahh ok, i don't have that, can't update mine at the moment either10:15
lsmoladerekh_: btw. could you try it with the private key I posted?10:15
*** matsuhas_ has joined #tripleo10:15
derekh_lsmola: ok, will do10:15
lsmoladerekh_: ok, that should rule out whether it's just a bad char10:16
derekh_lsmola: it should be in id_rsa_virt_power right?10:16
lsmoladerekh_: yep https://github.com/openstack/tripleo-incubator/blame/fd1d619f2fc94f221650c02f158427a9d2944157/scripts/devtest_testenv.sh#10:17
openstackgerritAlexis Lee proposed a change to openstack/tripleo-incubator: Promote HEAT_ENV over env vars  https://review.openstack.org/9513010:17
lsmoladerekh_: eh bad link :-) https://github.com/openstack/tripleo-incubator/blob/fd1d619f2fc94f221650c02f158427a9d2944157/scripts/devtest_testenv.sh#L19310:18
*** chuckC has quit IRC10:19
derekh_lsmola: is " seed 1 2048 60" at the end of that key file, thats not right is it ?10:19
lsmoladerekh_: nope that are the other parameters10:19
derekh_lsmola: actually its problem just being output10:19
*** openstackstatus has quit IRC10:20
*** openstack has joined #tripleo10:21
*** openstackstatus has joined #tripleo10:22
*** ChanServ sets mode: +v openstackstatus10:22
*** lathiat has quit IRC10:23
lsmoladerekh_: ok let me know if it fails, it should be like in 4 minutes10:24
*** lathiat has joined #tripleo10:24
derekh_lsmola: will do, its building the new seed now10:24
lsmoladerekh_: ok10:24
*** andreaf has joined #tripleo10:26
openstackgerritDmitry Shulyak proposed a change to openstack/tripleo-specs: Haproxy configuration options  https://review.openstack.org/9490710:28
*** ccorrigan has joined #tripleo10:34
*** akuznetsov has joined #tripleo10:36
openstackgerritGiulio Fidente proposed a change to openstack/tripleo-incubator: Move user's group membership setup in separate script  https://review.openstack.org/9395410:38
*** wfoster is now known as wfoster_afk10:42
*** CLOUDOUTAGE has joined #tripleo10:43
CLOUDOUTAGElifeless devananda Ng SpamapS jog0 GheRivero derekh dprince slagle  -- Jobs failing on R1 https://etherpad.openstack.org/p/cloud-outage10:43
*** CLOUDOUTAGE has quit IRC10:43
openstackgerritGiulio Fidente proposed a change to openstack/tripleo-incubator: Avoid recreating the VENV in setup-clienttools if one exists  https://review.openstack.org/9395710:44
derekh_lsmola: ERROR: The server has either erred or is incapable of performing the requested operation. (HTTP 500) (Request-ID: req-b8c24ddc-6ab9-4c0d-a38f-ae059b8760d2)10:48
derekh_lsmola: gonna try with a different key now10:48
lsmoladerekh_: hm, ok10:49
lsmoladerekh_: seems like the key encoded in json gets some weird characters http://pastebin.com/7xVAhP0v10:50
lsmoladerekh_: like \10:50
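(Editor's sketch, not part of the conversation.) The backslashes seen in the JSON-encoded key are expected rather than a sign of corruption: JSON strings cannot contain raw newlines, so an encoder writes each line break in a multi-line key as the two characters `\n`, and a decoder turns them back. The sketch below shows the round trip with Python's standard `json` module. The key text is a made-up placeholder, and the `pm_password` dictionary key is used only to mirror the field discussed here; this is not the actual register-nodes code.

```python
import json

# Hypothetical stand-in for a PEM-formatted private key (not a real key).
fake_key = (
    "-----BEGIN RSA PRIVATE KEY-----\n"
    "AAAA\n"
    "BBBB\n"
    "-----END RSA PRIVATE KEY-----\n"
)

# Encoding escapes every newline as a backslash followed by 'n'.
encoded = json.dumps({"pm_password": fake_key})

# Decoding restores the original multi-line string exactly.
decoded = json.loads(encoded)["pm_password"]

if __name__ == "__main__":
    print(encoded)
    print(decoded == fake_key)
```

If the value survives the round trip unchanged, as it does here, the escaping itself is unlikely to be what makes the request fail.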
derekh_lsmola: does it work for you if you use a different key ?10:53
lsmoladerekh_: I am now trying it without the pm_password and user params, i don't need them for bm_poseur right?10:54
lsmoladerekh_: so possibly yes10:54
lsmoladerekh_: does it work for you?10:54
derekh_lsmola: it got that error ^^ with your key, now trying with a new one10:54
lsmoladerekh_: seems like a bad char that leads to some crazy parameters being sent to nova-baremetal cli10:55
*** matsuhas_ has quit IRC10:55
*** matsuhashi has joined #tripleo10:57
*** matsuhashi has quit IRC10:58
*** matsuhashi has joined #tripleo10:58
xuhaiwei+Ng:can i ask you a question now?10:58
*** matsuhashi has quit IRC10:58
*** matsuhashi has joined #tripleo10:59
*** akrivoka has quit IRC10:59
*** akrivoka has joined #tripleo11:00
xuhaiwei I am doing devtest, and i am confused about the nova hypervisor-stats, when the 'count' is 0, it means no baremetal node is registered?11:07
*** tzumainn has joined #tripleo11:07
*** CLOUDOUTAGE has joined #tripleo11:14
CLOUDOUTAGElifeless devananda Ng SpamapS jog0 GheRivero derekh dprince slagle  -- Jobs failing on R1 https://etherpad.openstack.org/p/cloud-outage11:14
*** CLOUDOUTAGE has quit IRC11:14
*** ccorrigan has quit IRC11:23
*** akrivoka has quit IRC11:23
derekh_lsmola: same error with new key, must be some other problem, sorry not getting time to look into it properly11:32
giulivoderekh_, I'm seeing that same error now , reproduced it locally11:33
lsmoladerekh_: hm weird, though you didn't see the error with your original key right?11:33
derekh_lsmola: that was before I updated all the repositories11:34
lsmoladerekh_: ah, ok11:34
giulivolsmola, derekh_ here is what I see in the logs: (DataError) (1406, "Data too long for column 'pm_password' at row 1")11:36
lsmolagiulivo: cool11:37
lsmolagiulivo: that explains it, seems like new constraint in nova bm?11:37
giulivobut I'm not sure how/where to fix... maybe I can try updating the column size manually first?11:37
giulivoalso I wonder if the pm_password length is really supposed to change depending on the key?11:38
giulivo pm_password        | varchar(255) | YES  |     | NULL    |                |11:38
*** andreaf_ has joined #tripleo11:39
*** lucasagomes is now known as lucas-hungry11:39
lsmolagiulivo: I think it's caused by constraint in nova11:40
lsmolagiulivo: afaik Mysql will just cut the data and doesn't raise any error :-)11:40
derekh_lsmola: giulivo: any chance this commit changed the behaviour https://review.openstack.org/#/c/89778/ ?11:42
*** CLOUDOUTAGE has joined #tripleo11:45
CLOUDOUTAGElifeless devananda Ng SpamapS jog0 GheRivero derekh dprince slagle  -- Jobs failing on R1 https://etherpad.openstack.org/p/cloud-outage11:45
*** CLOUDOUTAGE has quit IRC11:45
openstackgerritJan Provaznik proposed a change to openstack/os-cloud-config: Allow setup services endpoints  https://review.openstack.org/9238311:45
openstackgerritOm Kumar proposed a change to openstack/tripleo-image-elements: Adds local storage Boot support on PXE failures.  https://review.openstack.org/7928911:45
*** mrunge has quit IRC11:47
openstackgerritOm Kumar proposed a change to openstack/diskimage-builder: Refactor code to select boot kernel  https://review.openstack.org/7987311:49
lsmolagiulivo: weird it should be text https://github.com/openstack/nova/blob/2efd3faa3e07fdf16c2d91c16462e7e1e3f33e17/nova/virt/baremetal/db/sqlalchemy/models.py#L4611:51
lsmoladerekh_: not sure, I don't see anything there with a direct impact11:51
giulivolsmola, address user password all are varchar for me11:52
lsmolagiulivo: https://github.com/openstack/nova/blob/2efd3faa3e07fdf16c2d91c16462e7e1e3f33e17/nova/virt/baremetal/db/sqlalchemy/migrate_repo/versions/001_init.py#L3511:53
lsmolagiulivo: hmm11:53
giulivolsmola, yah that is what I get indeed11:54
lsmolagiulivo: eh, so on model they set it as text, but on migration there is varchar, fuu11:54
giulivobut looks like its always been like that from 3rd of feb11:54
lsmolagiulivo: yeah, it must be that just the validation is new11:55
*** wfoster_afk is now known as wfoster11:55
lsmolagiulivo: previously it was probably just ignored11:55
*** andreaf_ has quit IRC11:55
giulivolsmola, also my virt_power key is 1679 chars, how can it ever possibly fit in 255?11:57
giulivoI mean even if mysql was just truncating, then ssh to virt host should have failed11:59
lsmolagiulivo: hm these are just IPMI credentials, which are not used for bm_poseur11:59
lsmolagiulivo: but on real hardware yeah, i would say it should fail for them12:00
giulivoah yeah libvirt driver has an option which points to the keyfile12:01
lsmolagiulivo: hmm, unless the migrations were broken before12:01
lsmolagiulivo: and there was actually text12:02
giulivoso if I'm reading it right, given the libvirt virtual power driver gets the key from the config file12:03
*** akrivoka has joined #tripleo12:03
giulivowe just shouldn't paste the rsa key into bm_password12:04
*** akrivoka has joined #tripleo12:04
lsmolagiulivo: now it takes it from here  https://github.com/openstack/tripleo-incubator/blob/fd1d619f2fc94f221650c02f158427a9d2944157/scripts/devtest_testenv.sh#L19312:06
lsmolagiulivo: so I guess we should just generate some smaller password for IPMI12:06
lsmolagiulivo: did you create critical bug for this?12:06
giulivonope I'm still in the process of gathering some infos12:07
lsmolagiulivo: will you or should I?12:08
giulivoso if I get it right, what you suggest to do is not cat the key into ssh-key12:08
lsmolagiulivo: well, I am not sure if we should land a patch for nova-baremetal12:08
giulivobut nova-bm seems to be doing okay12:09
giulivoprobably ipmi drivers don't even support finding an rsa key in the password field12:09
lsmolagiulivo: the validation error comes from nova bm right?12:09
giulivoyeah but validation looks fine to me12:09
giulivoit seems it is us who are using the pm_password field improperly12:10
lsmolagiulivo: it could be12:10
*** e0ne_ has quit IRC12:10
lsmolagiulivo: let me check Ironic DB schema12:10
giulivolsmola, I think you're right when saying that we shouldn't cat the key contents in ssh-key12:12
lsmolagiulivo: hm so Ironic stores it here https://github.com/openstack/ironic/blob/master/ironic/db/sqlalchemy/models.py#L15512:14
lsmolagiulivo: which i assume will be text or bigger12:15
lsmolagiulivo: so we need possibly just a fix for nova bm12:15
lsmolagiulivo: I guess putting there smaller password on our side should be fine12:15
*** CLOUDOUTAGE has joined #tripleo12:16
CLOUDOUTAGElifeless devananda Ng SpamapS jog0 GheRivero derekh dprince slagle  -- Jobs failing on R1 https://etherpad.openstack.org/p/cloud-outage12:16
*** CLOUDOUTAGE has quit IRC12:16
lsmolagiulivo: will you create the bug once you will gather all the info?12:16
giulivoyes I can open the bug12:16
giulivoI'm not sure about the fix instead12:16
lsmolagiulivo: cool, thank you12:16
giulivoI'll link that to you here12:16
lsmolagiulivo: ok12:17
giulivoI'm still unsure about why we should paste the contents of the key into pm_password12:17
giulivoare the power drivers shared amongst novabm and ironic?12:17
lsmolagiulivo: no you either deploy with nova-bm or ironic12:19
*** matsuhashi has quit IRC12:20
lsmolagiulivo: I am not really sure, if they are trying that on real Baremetals with IPMI, they need to fill there real IPMI credentials12:20
giulivoso the nova virtual_power_driver gets the key from a file not from pm_password12:21
*** OldCrowEW has joined #tripleo12:21
*** matsuhashi has joined #tripleo12:21
giulivoagreed so we only populate pm_password when running on real hardware, in which case the user will have to provide the password somehow12:21
lsmolagiulivo: uh, not sure I understand? :-)12:21
giulivoso we still don't want to paste the contents of the private key in pm_password12:21
lsmolagiulivo: possibly not; the quickest fix would also be to just generate some smaller password and put it there12:23
*** shakayumi has joined #tripleo12:23
lsmolagiulivo: hopefully somebody will comment on the bug with the correct solution :-)12:23
*** dprince has joined #tripleo12:24
*** andreaf has quit IRC12:25
giulivohere it is https://bugs.launchpad.net/tripleo/+bug/132259912:26
uvirtbotLaunchpad bug 1322599 in tripleo "nova baremetal-node-create fails with HTTP 500" [Undecided,New]12:26
*** matsuhashi has quit IRC12:26
giulivoI can't triage though12:26
giulivoderekh_, ^^12:26
derekh_giulivo: triaged as critical since 3 of us hit it now12:27
*** lazy_prince has quit IRC12:32
openstackgerritGhe Rivero proposed a change to openstack/tripleo-image-elements: Don't install openvswitch dkms modules in trusty  https://review.openstack.org/9515112:37
*** jdob has joined #tripleo12:37
*** andreaf_ has joined #tripleo12:37
*** matsuhashi has joined #tripleo12:37
*** matsuhashi has quit IRC12:38
*** matsuhashi has joined #tripleo12:38
*** matsuhashi has quit IRC12:38
*** matsuhashi has joined #tripleo12:39
*** casanch1 has joined #tripleo12:40
*** e0ne has joined #tripleo12:41
*** matsuhashi has quit IRC12:43
*** lucas-hungry is now known as lucasagomes12:43
giulivolsmola, I think derekh_ is right, look here, traditional mode was False by default before https://review.openstack.org/#/c/89778/12:48
giulivoand with traditional mode to False it was silently truncating12:49
derekh_giulivo: try reverting and rerunning devtest12:49
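The behaviour change discussed above can be illustrated with a small simulation. This is only a sketch, not MySQL or oslo code: `store_varchar`, `DataTooLongError` and `COLUMN_LIMIT` are hypothetical names, and the 255-character limit and the 1600-character stand-in key are assumptions chosen for illustration. It shows how a non-strict mode would silently cut an oversized value, while a strict (TRADITIONAL-like) mode would reject it.

```python
# Simulation only: mimics how a length-limited text column might treat an
# oversized value with and without a strict (TRADITIONAL-like) SQL mode.
COLUMN_LIMIT = 255  # assumed column width, for illustration


class DataTooLongError(ValueError):
    """Raised in strict mode when a value does not fit the column."""


def store_varchar(value, strict):
    """Return the value as it would be stored, or raise in strict mode."""
    if len(value) <= COLUMN_LIMIT:
        return value
    if strict:
        raise DataTooLongError(
            "value of length %d exceeds column limit %d"
            % (len(value), COLUMN_LIMIT))
    # Non-strict mode: the value is silently cut to fit.
    return value[:COLUMN_LIMIT]


# A stand-in for a private key pasted into a password field.
fake_key = "k" * 1600

stored = store_varchar(fake_key, strict=False)
print("non-strict stored length:", len(stored))

try:
    store_varchar(fake_key, strict=True)
except DataTooLongError as exc:
    print("strict mode rejected the value:", exc)
```

In this simplified picture, the strict path is the one that would surface as an error to the caller instead of storing a truncated, unusable key.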
*** jcoufal has quit IRC12:53
*** jcoufal has joined #tripleo12:54
Ngmorning12:57
*** athomas has quit IRC12:57
*** yamahata has quit IRC12:59
*** jcoufal has quit IRC13:00
*** jcoufal has joined #tripleo13:01
TheJuliagood morning :)13:02
openstackgerritJon-Paul Sullivan (jp_at_hp) proposed a change to openstack/diskimage-builder: Add support for source-repos gerrit refs  https://review.openstack.org/9089013:02
Ngadam_g: nice work tracking down the trusty hang bug!13:05
*** athomas has joined #tripleo13:05
*** gcha has quit IRC13:06
*** gcha has joined #tripleo13:07
*** julim has joined #tripleo13:09
openstackgerritDmitry Shulyak proposed a change to openstack/tripleo-image-elements: Change horizon binding address to local-ipv4 in haproxy case  https://review.openstack.org/9108913:11
*** jrist has quit IRC13:28
*** jcoufal has quit IRC13:32
*** ddieterly has joined #tripleo13:34
*** jrist has joined #tripleo13:39
giulivoping dprince13:56
dprincegiulivo: hi13:59
giulivodprince, I think I need some help figuring out what the real purpose of ssh-key in our TE_DATAFILE is13:59
giulivoI'm working on https://bugs.launchpad.net/tripleo/+bug/132259913:59
uvirtbotLaunchpad bug 1322599 in tripleo "nova baremetal-node-create fails with HTTP 500" [Critical,Triaged]13:59
giulivoI noticed you did some work with that in devtest_seed14:00
giulivoI'm not sure though why we keep passing the actual contents of the rsa private key around , from $TE_DATAFILE to both the seed and the undercloud (via heat)14:00
dprincegiulivo: the ssh-key is ultimately used by the virtual power driver14:01
dprincegiulivo: it allows the seed VM to issue "power" commands for other fake VMs14:01
giulivoyeah I understand what it does but it is not getting the rsa key from there14:02
giulivoit is just reading the keyfile configured as virtual_power_host_key14:02
dprincegiulivo: looking at the ticket...14:03
giulivothanks :)14:04
giulivoI think I'm missing what is the real "user" of our ssh-key , I just wanted to keep that intact while passing an empty string as pm_password when we use nova-bm14:05
dprincegiulivo: I think we should revert that Nova commit first (before making any changes)14:07
*** shakayumi has quit IRC14:07
dprincegiulivo: The TRADITIONAL change seems to me to be something that services (Nova) should opt into14:07
dprincegiulivo: not have it forced upon them by oslo....14:07
rpodolyaka1hmm, virtual power driver doesn't seem to be using pm_password14:08
*** rwsu has joined #tripleo14:08
dprincegiulivo: we can still rework things on our side too but I'm not the biggest fan of that DB change14:08
giulivorpodolyaka1, indeed I think it is not... while it would be nice if it actually could, but currently it doesn't14:09
rpodolyaka1dprince: actually, it used to...14:09
rpodolyaka1dprince: but later it was decided that it's better to default to TRADITIONAL mode here so that we'd catch such errors as early in the development cycle as possible14:09
*** vinsh has joined #tripleo14:10
rpodolyaka1dprince: I think it's similar to how we enforce mysql innodb storage engine to be used14:10
bnemecPeople get upset when you silently corrupt their data, people get upset if you log a warning that you might silently corrupt their data, and people get upset when you refuse to corrupt their data.14:11
bnemecIt's really a no-win situation. :-)14:11
*** rpodolyaka1 is now known as rpodolyaka14:11
dprincebnemec: right, so why change it? This is very similar to the Nova v2 on v3 discussion with regards to validating requests.14:12
bnemecdprince: See "silent data corruption".  This wasn't "working" before either.14:13
dprincebnemec: v2 doesn't currently validate all requests, but the port to make it run on top of the v3 code would add those validations. Useful, sure. Breaking for some people, almost certainly.14:13
rpodolyakaaren't we just passing to nova the data that it's not going to use?14:14
dprinceIf we don't need the key then by all means let's remove it14:14
*** athomas has quit IRC14:15
mordredrpodolyaka: ++14:15
bnemecTeam call time.  biab14:15
mordredlike, we can set traditional in the gate, so that we can catch bad data error places - and if someone wants to deploy not that way - well, I mean, weird, but whatevs14:15
* rpodolyaka is not surprised column type is Text() in models definitions and VARCHAR(255) in the migration script...14:16
*** athomas has joined #tripleo14:16
giulivorpodolyaka, because of the init script it seems https://github.com/openstack/nova/blob/2efd3faa3e07fdf16c2d91c16462e7e1e3f33e17/nova/virt/baremetal/db/sqlalchemy/migrate_repo/versions/001_init.py#L3514:16
dprincemordred: my point is that just like some of the API proposals this may cause unexpected changes.14:16
mordreddprince: oh, totally. you can't just switch to traditional in an existing codebase without actually auditing and verifying that you're not already broken14:17
dprincemordred: I do like the idea of stepping towards this14:17
mordredand figuring out a plan to fix it14:17
mordredit would be interesting to spin up a non-voting job that ran mysql with traditional on and see what, if anything, breaks hard14:17
rpodolyakagiulivo: yeah, unfortunately we still have such bugs in our code. I hope my test for detecting this will land to oslo.db soon :P14:18
bnemecAlso, it is possible to disable traditional mode still.14:18
bnemecI'd rather fix the bug, but as a temporary workaround that would be an option.14:18
bnemechttps://github.com/openstack/oslo-incubator/blob/master/openstack/common/db/options.py#L4214:19
dprincebnemec: right, previously it was opt-in. Now it's opt-out.14:19
rpodolyakaand we can just stop passing pm_password values when virtual power driver is used (as nova doesn't use it)14:20
dprincerpodolyaka: we can do that too14:20
giulivorpodolyaka, bnemec dprince ^^ this is what I would try to do actually14:20
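The approach agreed on here, not passing a pm_password value when the virtual power driver is in use, could look roughly like the sketch below. It is not the code from the proposed change: `node_credentials` is a hypothetical helper, and detecting the virtual driver by looking for "virtual_power_driver" in the power manager path is an assumption made for illustration.

```python
# Hypothetical helper; the field names mirror those mentioned in the
# discussion (pm_user, pm_password), but the function is illustrative only.
def node_credentials(pm_user, pm_password, power_manager):
    """Return the power-management fields to register for a node."""
    creds = {"pm_user": pm_user}
    # Assumption: a power manager path containing "virtual_power_driver"
    # means the virtual power driver is in use, so no password is passed.
    if "virtual_power_driver" not in power_manager:
        creds["pm_password"] = pm_password
    return creds


# Example power manager paths, used here only as sample input strings.
virtual = "nova.virt.baremetal.virtual_power_driver.VirtualPowerManager"
real_hw = "nova.virt.baremetal.ipmi.IPMI"

print(node_credentials("stack", "contents-of-a-key", virtual))
print(node_credentials("admin", "ipmi-secret", real_hw))
```

With this shape, real hardware still receives the user-supplied password, while virtual setups never send the key contents at all.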
rpodolyakaand as for using TRADITIONAL mode in general, as bnemec said it's a no-win situation, but IMO, we'd better default to having it enabled and fix bugs early in the development cycle, even if it can be painful for us...14:22
bnemecdprince: It was opt-in for a long time...and nobody did.14:22
bnemecAnd what rpodolyaka said. :-)14:22
openstackgerritJay Dobies proposed a change to openstack/tripleo-specs: Initial spec for the Tuskar Juno REST API changes  https://review.openstack.org/9472014:22
*** edmund has joined #tripleo14:23
openstackgerritGerry Drudy proposed a change to openstack/tripleo-image-elements: Enable rsync daemon on swift-storage  https://review.openstack.org/9517314:25
derekh_rlandy: If you're interested, some of the CI work I think we need to work on: https://review.openstack.org/#/c/95026/   (It may change as people add comments/ideas)14:27
rlandyderekh_: thanks ... reading14:28
giulivorpodolyaka, just to make sure, ironic doesn't use pm_password because it has its own db, right?14:37
openstackgerritLorcan Browne proposed a change to openstack/tripleo-image-elements: Add swiftclient library to swift  https://review.openstack.org/9517614:37
*** zackf has joined #tripleo14:37
*** chuckC has joined #tripleo14:40
*** chuckC has quit IRC14:41
*** akrivoka has quit IRC14:42
*** jprovazn is now known as jprovazn_afk14:46
openstackgerritjan grant proposed a change to openstack/tripleo-image-elements: Explicitly add the parted package.  https://review.openstack.org/9517714:47
*** hashar has joined #tripleo14:50
openstackgerritGerry Drudy proposed a change to openstack/tripleo-image-elements: Store swift account, container & object data on /mnt filesystem  https://review.openstack.org/8984714:52
*** yamahata has joined #tripleo14:52
openstackgerritLorcan Browne proposed a change to openstack/tripleo-image-elements: Add check_mk swift proxy diagnostic  https://review.openstack.org/9460714:53
openstackgerritGiulio Fidente proposed a change to openstack/tripleo-incubator: Avoid pasting ssh-key into pm_password  https://review.openstack.org/9518114:56
*** untriaged-bot has joined #tripleo15:00
untriaged-botUntriaged bugs so far:15:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/132194315:00
uvirtbotLaunchpad bug 1321943 in tripleo "Ceilometer Swift polling on overcloud control node fails with a 403 forbidden error" [Undecided,In progress]15:00
untriaged-bothttps://bugs.launchpad.net/diskimage-builder/+bug/132156315:00
uvirtbotLaunchpad bug 1321563 in diskimage-builder "Add support in ramdisk element to fetch IP from dhcp" [Undecided,New]15:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/131876715:00
uvirtbotLaunchpad bug 1318767 in tripleo "apache element SSL cert check fails" [Undecided,New]15:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/131497815:00
openstackgerritJon-Paul Sullivan (jp_at_hp) proposed a change to openstack/diskimage-builder: Add support for source-repos gerrit refs  https://review.openstack.org/9089015:00
uvirtbotLaunchpad bug 1314978 in tripleo "Cloud vm not pingable after overcloud upgrade " [Undecided,New]15:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/131535515:00
uvirtbotLaunchpad bug 1315355 in tripleo "Upgrade of overcloud failed with "Connection to neutron failed: Maximum attempts reached"" [Undecided,New]15:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/131667515:00
uvirtbotLaunchpad bug 1316675 in tripleo "Saving a devtest VM results in error" [Undecided,In progress]15:00
untriaged-bothttps://bugs.launchpad.net/tripleo/+bug/132009015:00
uvirtbotLaunchpad bug 1320090 in tripleo "pxe_ephemeral_format': u'ext4 left on nodes after deploys deleted" [Undecided,New]15:00
*** untriaged-bot has quit IRC15:00
openstackgerritGiulio Fidente proposed a change to openstack/tripleo-incubator: Avoid pasting ssh-key into pm_password  https://review.openstack.org/9518115:00
derekh_I seem to remember us trying to remove the DIB dependency on busybox at one stage, did I imagine that?15:11
*** ifarkas has quit IRC15:17
*** rlandy has quit IRC15:19
*** saurabhs has joined #tripleo15:21
*** CLOUDOUTAGE has joined #tripleo15:21
CLOUDOUTAGElifeless devananda Ng SpamapS jog0 GheRivero derekh dprince slagle  -- Jobs failing on R1 https://etherpad.openstack.org/p/cloud-outage15:21
*** CLOUDOUTAGE has quit IRC15:21
*** chuckC has joined #tripleo15:27
greghaynesoh look, our old friend cloudouta15:27
openstackgerritBen Nemec proposed a change to openstack/tripleo-incubator: Avoid pasting ssh-key into pm_password  https://review.openstack.org/9518115:29
*** TravT has joined #tripleo15:35
*** hashar has quit IRC15:37
*** hashar has joined #tripleo15:42
openstackgerritFlint Calvin proposed a change to openstack/tripleo-incubator: Making the necessary changes so that the value of metering_secret gets set appropriately in /etc/ceilometer/ceilometer.conf.  https://review.openstack.org/9420115:51
*** eghobo has joined #tripleo15:51
giulivoactually, the virtual power driver module doesn't even care about pm_user and pm_host15:53
giulivobut I don't think there would be a point in updating it cause the development happens in ironic right?15:54
openstackgerritBen Nemec proposed a change to openstack/diskimage-builder: Check for set -o pipefail  https://review.openstack.org/8392915:55
openstackgerritBen Nemec proposed a change to openstack/diskimage-builder: Set -o pipefail new scripts  https://review.openstack.org/9520315:55
SpamapShey what a surprise15:57
SpamapSR1 down15:57
SpamapSagain15:57
*** gcha has quit IRC16:01
SpamapSderekh_: grrrr ... so ci-overcloud's controller has the newer mellanox driver.. supposedly16:02
SpamapSderekh_: that does not bode well16:03
derekh_SpamapS: it was still on saucy, didn't look at driver version; it's back up now, everything I did is in the etherpad16:03
derekh_SpamapS: was hoping nodepool would just start using it16:04
derekh_SpamapS: but that doesn't seem to have happened16:04
derekh_I guess it's at quota...16:05
*** jistr has quit IRC16:05
derekh_SpamapS: exactly 40 instances tripleo-* in various states16:05
SpamapSderekh_: I'm thinking we need to write a dashboard for nodepool :)16:11
SpamapSderekh_: so we know what ones we can delete16:11
SpamapSderekh_: yesterday there were 27 h1.large's that nodepool didn't know about16:11
SpamapSderekh_: I can't get to the controller right now16:11
*** morganfainberg_Z is now known as morganfainberg16:12
*** marun has joined #tripleo16:12
derekh_SpamapS: weird, I'm ssh'd in but ssh response stalls occasionally16:12
derekh_SpamapS: ya a dashboard would be great16:13
SpamapSderekh_: that is classic mellanox fail behavior.16:13
*** e0ne has quit IRC16:14
derekh_SpamapS: new ssh connections from the undercloud controller seem to be working16:14
*** asparks has joined #tripleo16:25
*** cwolferh has quit IRC16:26
*** akuznetsov has quit IRC16:31
SpamapSderekh_: this may be something else. ILO is dropping connections too16:31
derekh_SpamapS: ahh yes, it was for me too, sorry I forgot to mention it16:32
*** jp_at_hp has quit IRC16:33
SpamapSderekh_: that's a different hardware port entirely, but likely the same switch fabric..16:33
derekh_SpamapS: so possibly whatever is causing that is also causing connections to drop to VMs16:34
SpamapSaye16:34
derekh_SpamapS: gotta run in a couple of minutes; if jobs start running on R1 again you can look at http://goodsquishy.com/downloads/s_tripleo-jobs.html to see pass rates (gives you a rough idea if things are going ok)16:35
SpamapSderekh_: I think I'm going to take it to the NOC16:36
SpamapSderekh_: yesterday things were flying through16:36
*** cwolferh has joined #tripleo16:36
derekh_SpamapS: they were all failing though16:36
*** dkehn_ has joined #tripleo16:39
SpamapSderekh_: I didn't say they were passing, they were flying through ;)16:39
derekh_:-)16:39
SpamapSI'm honestly about to quit this job as CI cloud admin16:39
SpamapSIt is awful.16:39
* SpamapS -> Heat16:39
derekh_I'm starting to think we should drop all but a single overcloud job and only run on R2 , should give us some breathing room to redeploy and properly debug R116:39
dprincederekh_: not a horrible idea considering16:39
SpamapS+116:39
SpamapSI want to burn R116:39
SpamapSthough I will backup the db16:39
SpamapSIn fact I was thinking of just doing nova rebuilds and keeping it alive like that16:40
derekh_we can maybe thrash out some ideas on tuesday meeting16:40
*** dkehnx has quit IRC16:40
*** dkehn has quit IRC16:40
SpamapSI will be in UTC+1230 on tuesday...16:41
*** dkehn has joined #tripleo16:41
derekh_plenty of coffee will get you through it :-)16:41
derekh_ok gotta run16:42
*** derekh_ has quit IRC16:42
SpamapSif R1 is failing jobs and not running anything16:42
SpamapSactually16:42
SpamapSlet's stop16:42
SpamapSI don't think this is absolutely mellanox failure16:42
SpamapSI mean, we need to update anyway16:43
SpamapSbut I wonder if this is just good old fashioned routing fail.. seems like 10.10 <-> 10.10 is fine16:43
*** lucasagomes is now known as lucas-afk16:44
*** asparks has quit IRC16:44
*** dkehn_ is now known as dkehnx16:45
*** bnemec is now known as beekneemech16:49
SpamapShm and now SSH to ci-overcloud.tripleo.org is working again16:50
*** nati_ueno has joined #tripleo16:50
*** akuznetsov has joined #tripleo16:57
*** nati_uen_ has joined #tripleo17:03
*** pblaho has quit IRC17:04
*** nati_ueno has quit IRC17:06
*** marun has quit IRC17:06
devanandahi guys! how do you feel about gating disk-image-builder on ironic?17:07
devanandai realized a few days ago it's asymmetric right now17:07
*** nati_uen_ has quit IRC17:07
devanandaas in, ironic is implicitly using dib (to build the deploy ramdisk) but changes in dib aren't checking whether they break ironic17:08
greghaynesSpamapS: working again as in magically started working or did something change?17:08
*** nati_ueno has joined #tripleo17:08
*** marun has joined #tripleo17:08
*** akuznetsov has quit IRC17:10
beekneemechdevananda: Makes sense to me.17:11
SpamapSgreghaynes: magic17:11
SpamapSgreghaynes: and now it's down again17:11
greghaynes:_(17:14
devanandahttps://review.openstack.org/95220   <-- adding ironic test to dib's check/gate17:18
SpamapSopening a JIRA ticket with the hpcloud NOC.. something is quite broken17:20
SpamapSI think it's an arp thing17:23
SpamapSif I ping the external IP from inside the rack, external connections start flowing17:24
greghaynestwo hosts with same IP?17:24
SpamapSoh very well could be17:24
SpamapSor an accidental bridge17:24
greghaynesyea, sporadic packet blackholing and then coming back is standard arp cache battling17:25
SpamapShm17:25
*** jprovazn_afk has quit IRC17:30
*** jang1 has joined #tripleo17:31
openstackgerritFabio Giannetti proposed a change to openstack/tripleo-image-elements: Ceilometer Service Update/Upgrade in TripleO  https://review.openstack.org/9450017:32
*** nati_ueno has quit IRC17:35
SpamapSgreghaynes: so it's not a duplicate host.. removing the ip and arping for it does not reveal a second box17:35
SpamapSmust be a bridging loop17:35
greghaynesaye17:35
SpamapSoohhh hmm...17:36
SpamapSI see 10.10.16 traffic on vlan2517:36
*** athomas has quit IRC17:40
*** john3213 has joined #tripleo17:47
*** e0ne has joined #tripleo17:49
*** e0ne has quit IRC17:52
*** e0ne has joined #tripleo17:52
*** john3213 has left #tripleo17:52
*** e0ne has quit IRC17:55
SpamapSI wonder if they stole our IP range or something18:00
*** rcarrill` has joined #tripleo18:05
*** rcarrillocruz has quit IRC18:06
*** marun has quit IRC18:08
*** e0ne has joined #tripleo18:11
*** OldCrowEW has quit IRC18:14
*** OldCrowEW has joined #tripleo18:16
*** dprince has quit IRC18:16
Ngwe've had bridging loops at least twice before, and scratched our heads and gone to them18:26
Ngthe most recent bonkers thing, they were seeing the same mac on two ports, but upgrading the switch firmware and rebooting it, made it go away, so we chalked that up to a switch firmware bug18:26
*** OldCrowEW has quit IRC18:30
*** e0ne has quit IRC18:30
*** e0ne has joined #tripleo18:31
*** e0ne has quit IRC18:35
*** OldCrowEW has joined #tripleo18:37
SpamapSNg: this is suspiciously like that18:38
SpamapSNg: I'm getting nowhere with my JIRA ticket18:38
SpamapSBut we're definitely completely down as connections drop in and out constantly18:39
NgSpamapS: oh?18:39
NgSpamapS: as in, they're not helping you on the ticket?18:39
SpamapSNg: yeah, been ignored by NOC and Network18:39
SpamapSticket is sitting (NET-4002)18:39
*** akuznetsov has joined #tripleo18:43
openstackgerritPhil Neal proposed a change to openstack/tripleo-incubator: Add Swift roles required to poll for Swift usage  https://review.openstack.org/9473018:48
*** andreaf_ has quit IRC18:48
*** jogo is now known as flashgordon18:52
*** pelix has quit IRC18:59
*** jdob has quit IRC19:08
*** akuznetsov has quit IRC19:14
*** dshulyak_ has joined #tripleo19:19
*** lsmola has quit IRC19:20
*** chuckC has quit IRC19:28
*** e0ne has joined #tripleo19:40
* SpamapS goes on meatspace errands for a while19:40
*** nati_ueno has joined #tripleo19:54
*** dshulyak_ has quit IRC20:20
lifelessdevananda: we'd be happy to have symmetric gating20:22
*** dshulyak_ has joined #tripleo20:22
devanandalifeless: great :)20:22
*** dshulyak_ has quit IRC20:23
lifelessthere is some impl complexity which we're going through with heat as well related to whether you consume and test releases or not20:23
lifelesssince dib is a thing that itself consumes things20:23
*** dshulyak_ has joined #tripleo20:23
lifelessbut I suspect you just need the baseline dib, not tripleo-image-elements, right?20:23
devanandaright20:24
*** OldCrowEW has quit IRC20:26
*** chuckC has joined #tripleo20:26
*** OldCrowEW has joined #tripleo20:26
*** dshulyak_ has quit IRC20:28
-openstackstatus- NOTICE: Gerrit will be offline for about 20 minutes in order to rename some projects starting at 21:00 UTC.20:34
*** nati_uen_ has joined #tripleo20:36
*** nati_ueno has quit IRC20:39
*** nati_ueno has joined #tripleo20:43
*** nati_uen_ has quit IRC20:45
*** casanch1_ has joined #tripleo20:46
*** nati_uen_ has joined #tripleo20:46
*** nati_ueno has quit IRC20:48
*** casanch1 has quit IRC20:49
*** casanch1_ has quit IRC20:51
*** nati_uen_ has quit IRC20:53
*** nati_uen_ has joined #tripleo20:55
stevebakerlifeless, I spent yesterday looking at building local pip mirrors for dib devstack gating. I think there will be push-back on building a comprehensive mirror. Pip installing from pypi.openstack.org has worked plenty fast for devstack gating so they'll want to do the same for image building20:57
*** nati_ueno has joined #tripleo20:58
*** hashar has joined #tripleo20:59
stevebakerlifeless, I tried using the pypi element to have a local mirror only of os-*-config git-built packages, but the resulting image still installed from released versions.20:59
stevebakerlifeless, tl;dr I *really* prefer my original approach ;) https://review.openstack.org/#/c/92035/21:00
*** AaronGr has left #tripleo21:00
*** nati_uen_ has quit IRC21:00
*** jtomasek has quit IRC21:11
*** dshulyak has quit IRC21:14
*** bogdando has quit IRC21:15
*** dshulyak has joined #tripleo21:16
*** bogdando has joined #tripleo21:16
*** zackf has quit IRC21:19
*** edmund has quit IRC21:21
lifelessstevebaker: the local mirror will only have 2-3 packages in it21:31
lifelessstevebaker: it's not a mirror really, it's a repo21:31
*** hashar has quit IRC21:33
*** hashar has joined #tripleo21:34
-openstackstatus- NOTICE: Gerrit is offline in order to rename some projects. ETA: 22:00.21:36
*** ChanServ changes topic to "Gerrit is offline in order to rename some projects. ETA: 22:00."21:36
*** nati_ueno has quit IRC21:37
stevebakerlifeless, I did build a local repo which appeared to be valid, but the elements still installed from downloads - not sure what is going on there21:38
Lotus907efianyone who can tell me where the data that is supposed to be available at http://169.254.169.254/2009-04-04/meta-data/instance-id should come from?21:39
Lotus907efiI have a group of overcloud systems where the cloud-init runs are not successful because although that IP is on the systems, the data that is supposed to be there is not there21:39
Lotus907efiI am trying to diagnose why the heat create of the overcloud failed21:40
*** giulivo has quit IRC21:42
*** Penick has joined #tripleo21:45
*** weshay has quit IRC21:51
*** hashar has quit IRC21:51
*** jang1 has quit IRC21:59
*** yamahata has quit IRC22:10
*** lucas-afk has quit IRC22:12
*** lucas-afk has joined #tripleo22:13
Lotus907efiFrom the keystone.log file on the undercloud:  ERROR keystone.common.wsgi [-] object of type 'NoneType' has no len()22:13
*** openstackgerrit has quit IRC22:14
*** yamahata has joined #tripleo22:14
*** openstackgerrit has joined #tripleo22:15
*** OldCrowEW has quit IRC22:16
*** openstackstatus has quit IRC22:18
*** openstack has joined #tripleo22:19
*** openstackstatus has joined #tripleo22:20
*** ChanServ sets mode: +v openstackstatus22:20
*** nati_ueno has joined #tripleo22:23
*** nati_ueno has quit IRC22:26
*** ChanServ changes topic to "https://etherpad.openstack.org/p/tripleo-ci-r1-trusty | tripleo-cd running preserve-ephemeral WIP patches and https://review.openstack.org/#/c/62042/ | Using OpenStack to deploy OpenStack;meetings Tuesday 1900 UTC in #openstack-meeting-alt"22:29
*** ddieterly has quit IRC22:34
*** akuznetsov has joined #tripleo22:37
*** akuznetsov has quit IRC22:47
*** TravT has quit IRC22:56
*** morganfainberg is now known as morganfainberg_Z23:01
*** eguz has joined #tripleo23:02
*** eghobo has quit IRC23:06
*** nati_ueno has joined #tripleo23:09
*** nati_ueno has quit IRC23:13
*** ewindisch has quit IRC23:24
*** nati_ueno has joined #tripleo23:26
*** tzumainn has quit IRC23:49
*** e0ne has quit IRC23:57
SpamapSLotus907efi: that IP will be answered by neutron-metadata-agent23:57
SpamapSLotus907efi: which will look up the source IP, and map that to the currently attached nova instance.23:58
Lotus907efiwell, apparently that is not running23:58
Lotus907efiGreg says I have a keystone problem on the undercloud23:58
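The lookup SpamapS describes, where the metadata service identifies the caller by source IP and maps it to the attached instance, can be sketched as follows. This is a toy model, not neutron code: `handle_metadata_request` and the `ip_to_instance` table are hypothetical, and a real agent resolves the mapping through other services rather than a local dictionary.

```python
# Toy model of a metadata lookup keyed on the caller's source IP.
# All names and data here are illustrative assumptions.
def handle_metadata_request(source_ip, path, ip_to_instance):
    """Return an (HTTP status, body) pair for a metadata request."""
    instance_id = ip_to_instance.get(source_ip)
    if instance_id is None:
        # No instance is known for this address, so nothing can be served.
        return 404, ""
    if path.endswith("/meta-data/instance-id"):
        return 200, instance_id
    return 404, ""


# Hypothetical mapping of fixed IPs to instance IDs.
table = {"192.0.2.10": "i-00000001"}

print(handle_metadata_request(
    "192.0.2.10", "/2009-04-04/meta-data/instance-id", table))
print(handle_metadata_request(
    "192.0.2.99", "/2009-04-04/meta-data/instance-id", table))
```

In this model, a request from an address with no known instance gets no data back, which is one way the instance-id could appear to be missing even though the metadata IP itself responds.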
