Friday, 2014-02-14

derekhlifeless: so before you redeploy a new testenv, shall we00:00
derekhfor p in $(neutron port-list | grep te_testenv | awk '{print $2}') ; do neutron port-delete $p ; done00:00
lifelessderekh: we need to make this less manual - can you add that to the TODO list on the etherpad and perhaps file a bug/trello card?00:02
derekhlifeless: yup, will do,00:02
SpamapSlifeless: You know...00:03
SpamapSlifeless: there's this thing that orchestrates APIs..00:03
derekhlifeless: and running command00:03
lifelessderekh: I'm on it00:03
derekhlifeless: k00:03
* SpamapS runs off to fetch the rugrats00:03
lifelessderekh: spurious use of grep ftw btw00:03
lifelessSpamapS: yeah, and when you get involved in this stuff you can throw stones ;)00:03
lifelessSpamapS: (please do!)00:04
lifelessderekh: btw:00:05
lifelessneutron port-list | awk '/te_testenv/ {print $2}' | xargs -n1 neutron port-delete00:05
lifelessderekh: more compact00:05
derekhlifeless: thanks00:05
lifelessdeploying00:11
lifelessgrabbing food while it churns00:12
*** e0ne has joined #tripleo00:14
ccrouchrwsu: did i get confused here: https://review.openstack.org/#/c/72524/3/scripts/devtest_variables.sh ?00:15
lifelessccrouch: I think so :)00:18
*** e0ne has quit IRC00:19
ccrouchdoh :-(00:23
*** cadenzajon has quit IRC00:24
ccrouchlifeless: but given he took out the sed of overcloud.yaml https://review.openstack.org/#/c/72524/2/scripts/devtest_overcloud.sh00:24
ccrouchhow will the heat template know to use rabbit on fedora?00:25
openstackgerritA change was merged to openstack/tripleo-image-elements: Seed specific Fedora network configuration  https://review.openstack.org/7199800:29
ccrouchlifeless: the exports aren't deleted, they are just moved down. So MESSAGING_BACKEND can be inserted00:30
openstackgerritA change was merged to openstack/os-refresh-config: Move migration to run after post-configure  https://review.openstack.org/7197700:31
*** ramishra has joined #tripleo00:32
*** ramishra has quit IRC00:37
openstackgerritDerek Higgins proposed a change to openstack/tripleo-incubator: Restore generation of ssh keypair  https://review.openstack.org/7345800:39
lifelessccrouch: heat shouldn't need to know. if the parameters are really different then you need to either a) expose them both or b) have make create an appropriately specialised heat template00:39
derekhlifeless: ^^ did you mean to remove that here https://review.openstack.org/#/c/73089/7/scripts/install-dependencies00:39
lifelessderekh: I did00:40
lifelessderekh: its testenv specific. look in devtest_testenv.sh00:40
lifelessok | 3a54ea68-4921-4a33-9b19-dc189a2279d7 | testenv      | CREATE_COMPLETE | 2014-02-14T00:11:22Z |00:41
lifelessso, we should be good to ci something now, in principle00:42
derekhlifeless: ahh ok, so maybe I'm not understanding something, shouldn't we have a ~/.ssh/id_rsa and place that into authorized_keys for people running devtest locally? so the combination of local-config/stackuser works00:43
lifelessderekh: +    cat ~/.ssh/id_rsa_virt_power.pub >> ~/.ssh/authorized_keys00:43
lifelessderekh: no, we inject the ssh key to use via the testenv ssh-key key.00:43
*** CaptTofu has quit IRC00:43
derekhlifeless: yes I see that but we also need a non virt_power keypair (I'm talking about non ci runs)00:44
derekhlifeless: will try a local fresh run and see what happen and abandon if all is ok00:44
lifelessderekh: huh why?00:44
derekhlifeless: so authorized_keys is place in ~stack/.ssh/authorized_keys on images , people will have to start using ssh stack@192.0.2.2 -i ~/.ssh/id_rsa_virt_power to ssh in as stack00:46
derekhanyways will check and report back00:46
lifelessderekh: i think we have crossed wires. OTP for a minute00:48
*** ccrouch has quit IRC00:55
lifelessderekh: back00:55
lifelessso looking now...00:55
lifelessdprince: hey00:59
lifelessdprince: so I keep poking - can you remove your -1 on https://review.openstack.org/#/c/71706/. Or ask a followup question.00:59
*** edmund has quit IRC01:00
jog0lifeless: ConnectionFailed: Connection to neutron failed: Maximum attempts reached01:01
jog0in n-apu01:01
jog0n-api01:01
openstackgerritlifeless proposed a change to openstack/tripleo-incubator: Make the overcloud-user config identify itself  https://review.openstack.org/7346201:02
openstackgerritlifeless proposed a change to openstack/tripleo-incubator: Permit specifying ovs bridge mappings too.  https://review.openstack.org/7170601:02
openstackgerritlifeless proposed a change to openstack/tripleo-incubator: Permit setting a /etc/hosts entry for the API.  https://review.openstack.org/7172701:02
lifelessjog0: ah so yes, neutron01:02
derekhhmm, I can't ssh to any of the testenvs01:02
*** ccrouch has joined #tripleo01:02
lifelessderekh: wups01:02
*** tzumainn has quit IRC01:02
lifelessderekh: I can't ping the first few I've tried01:02
lifelessderekh: but maybe ...01:03
derekhan none of them have registered with neutron on the overcloud01:03
lifelessnuts01:03
lifelessI was just about to look01:03
lifelessconsole time I guess01:03
lifelessderekh: /opt/stack/diskimage-builder/bin/disk-image-create -a amd64 -o testenv-worker stackuser ubuntu testenv-worker dhcp-all-interfaces hosts -p git -p build-essential -p python-dev -p libssl-dev -p python-pip01:03
lifelessderekh: is how I built the image01:03
jog0lifeless: neutron api doesn't have any stacktraces :/01:04
jog0lifeless: and its getting traffic01:05
lifelessjog0: yeah, great eh :)01:05
derekhlifeless: When I last did it I did ./diskimage-builder/bin/disk-image-create -a amd64 -o testenv-worker ubuntu testenv-worker dhcp-all-interfaces hosts01:05
lifelessderekh: right, I just bake in the packages01:05
lifelessderekh: since it doesn't have networking01:05
lifelessderekh: erm, no, confused robert. I think I was working around unlanded bugs or something01:06
jog0lifeless: unfortunately I have to go AFK in a few minutes (dinner time)01:06
lifelessjog0: ciao01:06
lifelessderekh: I'll look at a console in a minute (or you can - the creds are all in the nova bm db ...01:06
*** ccrouch has quit IRC01:06
derekhlifeless: will look, have never accessed console before01:07
lifelessderekh: https://review.openstack.org/#/c/71727/ should be trivially +A able - it had 2x+2 before I rebased it to the bottom of the stack01:07
openstackgerritlifeless proposed a change to openstack/tripleo-incubator: Stop trying to copy out SSH keys from notcompute.  https://review.openstack.org/7346501:09
openstackgerritlifeless proposed a change to openstack/tripleo-incubator: Make the overcloud-user config identify itself  https://review.openstack.org/7346201:09
openstackgerritlifeless proposed a change to openstack/tripleo-incubator: Permit specifying ovs bridge mappings too.  https://review.openstack.org/7170601:09
lifelessderekh: ok so https://review.openstack.org/73465 is what I think is needed - I missed that before01:10
*** ccrouch has joined #tripleo01:10
*** ccrouch has quit IRC01:10
*** ccrouch has joined #tripleo01:10
*** epim has quit IRC01:10
*** taps has quit IRC01:13
*** e0ne has joined #tripleo01:13
lifelessderekh: did you try pinging /all/ the hosts ?01:14
derekhlifeless: I tried ssh to about 5 or 6 of them01:15
*** ccrouch has quit IRC01:15
lifelessI betok, in the console on 10.10.16.14801:15
derekhlifeless: +2'd both01:16
lifelessderekh: the users *own* key should let them ssh in01:16
lifelessderekh: we certainly don't want to be copying their keys around01:16
lifeless148 is meant to be a3edd6e1-dad4-4924-b1b6-49955ec41cda | testenv-testenv7-bd2uzs6jl22n01:16
lifelessbut the virtual console shows01:17
lifelessUbuntu 12.10 compute-test tty101:17
lifeless01:17
lifelesscompute-test login: [  255.895626] block nbd15: Unexpected reply (ffff8817d249dc01:17
lifelessodd01:17
*** e0ne has quit IRC01:18
lifelessoh nup.got it to reset. weird01:18
derekhlifeless: talking about the patch to generate id_rsa ? agreed, we shouldn't be copying it around but we should generate it if it doesn't exist01:18
lifelessstack:stack fails01:19
lifelessprompt is ubuntu01:19
lifelessI'm thinking metadata server hookup fail01:19
lifelessderekh: mmm, I see. I'm not sure I agree :)01:19
lifelessresetting the box, seeing what happens01:20
lifelessderekh: we probably need a waitcondition on the testenv heat template to avoid silent failures like this01:20
derekhlifeless: that is what we allways did, it needs to be generated so that the public key finds its way into authporized_keys for the stack user01:20
derekhlifeless: agreed01:20
derekhlifeless: will add to trello01:20
lifelessk01:21
lifelessrestarted n-d-a just in case we have that lease problem too01:21
*** morazi has joined #tripleo01:24
lifelessCloud-init v. 0.7.3 running 'init-local' at Thu, 13 Feb 2014 21:05:21 +0000. Up01:26
lifeless22.83 seconds.01:26
lifelesscloud-init-nonet[24.33]: waiting 10 seconds for network device01:26
lifelesscloud-init-nonet[34.37]: waiting 120 seconds for network device01:26
lifelesswhich is a good start01:26
lifelessgot a kernel and all01:26
rwsulifeless: question about https://review.openstack.org/#/c/72524/3/scripts/devtest_variables.sh01:26
lifelessrwsu: shoot01:26
openstackgerritlifeless proposed a change to openstack/tripleo-image-elements: Look for local.json in preference to config.json.  https://review.openstack.org/7340101:27
rwsulifeless: can I go back and use the $MESSAGING_BACKEND variable like I have it in patch 2?01:27
lifelessrwsu: ah so when I said 'just put in DIB_COMMON_ELEMENTS' I meant within that file only01:29
lifelessrwsu: since that file is seedrc specific01:29
lifelessrwsu: I was wrong in https://review.openstack.org/#/c/72524/2/scripts/devtest_overcloud.sh01:29
rwsuok, put it in DIB_COMMON_ELEMENTS for seed and undercloud, but for overcloud I'll have to use $MESSAGING_BACKEND01:30
lifelessderekh: also 2014-02-13 21:09:40,240 - util.py[WARNING]: Failed: growpart /dev/sda 3       might be relevant01:30
rwsulifeless: is that ok?01:30
derekhlifeless: so on my clean run of devtest, devtest can't ssh to the seed "Permission denied (publickey,gssapi-keyex,gssapi-with-mic)." keeps looping01:31
derekhlifeless: http://paste.openstack.org/show/65240/01:31
lifelessSpamapS: so for seed and undercloud, I suggest putting it in https://review.openstack.org/#/c/73032/01:31
derekhlifeless: which is why I think we need that patch01:31
lifelessrwsu: bah, SEED_DIB_EXTRA_ARGS and UNDERCLOUD_DIB_EXTRA_ARGS01:31
lifelessrwsu: for overcloud we need a new feature to let you add elements to just the control plane01:31
lifelessrwsu: which I'd make a separate patch01:31
derekhlifeless: yup, growpart looks like the problem01:32
lifelessrwsu: so start with 73032, add a patch on that to finesse overcloud images, and then your actual thing on top01:32
lifelessderekh: you don't have an ssh key ?01:32
lifelessderekh: hangon, I'm super confused here.01:32
lifelessderekh: too much context switching. Lets start with ssh.01:33
*** ramishra has joined #tripleo01:33
derekhlifeless: I removed it, the point is we specificaly used to generate it if it wasn;t there and place the public key into to hosts authorized keys for exactly this reason01:33
derekhbecause that authorized_keys file is what ends up getting placed into authorized_keys for both root and stack in the image01:34
lifelessderekh: this isn't making sense to me.01:35
lifelessderekh: we have two situations we run in01:35
lifelessa) people, b) ci.01:35
lifelesswhich one are you simulating ?01:35
rwsulifeless: ok I can do that, but I find using *_EXTRA_ARGS a bit confusing, isn't the message broker required?01:35
lifelessrwsu: it is,so you should be appending it to the args, no ?01:36
derekhlifeless: a) people , specifically so that this will work https://github.com/openstack/diskimage-builder/blob/master/elements/local-config/install.d/62-ssh-key#L901:36
lifelessderekh: so people *have* keys already. I don't think deleting the key is a good test.01:37
lifelessderekh: or we can halt and tell them to get their key on their, but adding a new key without warning is pretty odd.01:37
derekhlifeless: even if they have a key, we need to place the public part of the pair into authorized_keys on the host01:37
*** ramishra has quit IRC01:37
lifelessderekh: why?01:38
lifeless(I'm not trying to be difficult, there are multiple things we use ssh for, I want to be unambiguous)01:38
derekhlifeless: because that authorized_keys file is what get placed onto the image of the seed and ends up allowing us to ssh in as root01:38
lifelessok, so my concern there is that adding an entry to authorized_keys lets other people ssh into *their* machine.01:39
lifelessderekh: the failure is dib - dprince's dhcp-all-interfaces landed and broke01:39
*** nosnos has joined #tripleo01:39
lifelessyup01:40
derekhlifeless: yes it does, but they would have to have access to the private key that is already on the host and its what we always did and removing that code broke devtest01:40
lifelessderekh: context switch for a secnod please01:42
derekhlifeless: k01:42
openstackgerritlifeless proposed a change to openstack/diskimage-builder: Revert "Support adding DHCP interfaces one at a time. "  https://review.openstack.org/7347201:43
openstackgerritlifeless proposed a change to openstack/diskimage-builder: Revert "Add Fedora DHCP interfaces via udev rules."  https://review.openstack.org/7347301:43
openstackgerritlifeless proposed a change to openstack/diskimage-builder: Better apt-sources docs.  https://review.openstack.org/7303501:43
lifelessderekh: these ^ need to land to unbreak dhcpa-ll-interfaces01:43
lifelessthe work isn't baked yet01:43
lifelesshttps://review.openstack.org/73472 and https://review.openstack.org/73473 specifically01:43
rwsulifeless: I don't understand why in boot-seed-vm, we wouldn't put $MESSAGING_BACKEND in ALWAYS_ELEMENT but in SEED_DIB_EXTRA_ARGS, I guess i'm missing something01:44
*** peoplemerge has quit IRC01:45
lifelessrwsu: I'm sorry, I can't focus on this right now - its super late for derekh and CI is down.01:45
lifelessrwsu: do what works and looks good to you.01:45
lifelessrwsu: the key issue is that we can't put it on the hypervisors01:45
rwsuok, np01:45
lifelessthats the proble,m in the current patch,a nd sorry for steering you badly before01:46
*** wusWWW has quit IRC01:46
derekhlifeless: looking, as a matter of interest any idea what the problem was on ubuntu? its hardly causing the growparts problem is it01:47
lifelessderekh: growparts isn't a problem01:47
lifeless df -h01:47
lifelessFilesystem      Size  Used Avail Use% Mounted on01:47
lifeless/dev/sda3       469G  1.7G  448G   1% /01:47
lifelessI think its coplaining because it has already grown.01:48
lifelessits odd that /mnt isn't there01:48
lifelessI have NFI what was up with that, but the fundamental problem is that dhcp-all-interfaces is broken01:48
lifelessderekh: I don't think we should go down the rabbit hole - revert to known good, give dan time to get it fixed.01:49
derekhlifeless: ok, fair enough if your sure its what the problem is01:49
lifelessderekh: totally sure - I watched the boot on the console fail01:50
lifelessderekh: logged in on console, no network, eth2 down01:50
lifelessthats the problem d-a-i is meant to solve01:50
derekhlifeless: +A'd both01:51
lifelessthanks01:51
derekhbrb01:52
lifelessderekh: ok01:52
lifelessderekh: ssh. so I see your point.01:52
lifelesspragmatic side - if we put generation back, will it work? we'll have default ssh into a node work, which we need to run init-keystone01:52
lifelesswe can file a bug about the abuse of the users authorized_keys files01:53
lifelessderekh: so ok, lets put it back, but can we put some prose around it ?01:53
lifelesswow, what you want to see aon a server image build01:55
lifeless'regenerating fonts cache'01:55
StevenKHaha01:55
openstackgerritA change was merged to openstack/diskimage-builder: Revert "Support adding DHCP interfaces one at a time. "  https://review.openstack.org/7347201:56
openstackgerritA change was merged to openstack/diskimage-builder: Revert "Add Fedora DHCP interfaces via udev rules."  https://review.openstack.org/7347301:56
derekhlifeless: I know it doesn't work without it, I'll try now with it back to verify01:57
openstackgerritA change was merged to openstack/tripleo-incubator: Permit setting a /etc/hosts entry for the API.  https://review.openstack.org/7172701:57
openstackgerritRichard Su proposed a change to openstack/tripleo-heat-templates: Add username parameter for rabbit  https://review.openstack.org/7143001:57
lifelessah heat stack-delete, one machine at a time.02:01
lifelessStevenK: seen the remove i386 from foreign arch on server install thread ?02:02
lifelessI was considering whether we had that enabled just the other day.02:02
StevenKlifeless: I've seen it02:02
StevenKI'm currently ambilivant about it on my own servers02:03
openstackgerritDerek Higgins proposed a change to openstack/tripleo-incubator: Restore generation of ssh keypair  https://review.openstack.org/7345802:03
StevenKRead as: I have a local mirror, so apt-get update is already fast02:03
derekhlifeless: added a note to find an alternative, is that what you meant by "add prose" ?02:04
lifelesstestenv deploy running02:04
StevenKlifeless: However, it probably makes sense for us to kill it02:04
derekhok, time I went to bed02:05
lifelessderekh: will tweak and push02:05
lifelessgnight02:05
derekhfingers crossed all comes up this time :-)02:05
lifelessderekh: thanks for your help!02:06
derekhlifeless: np02:06
StevenKlifeless: chroot'd into an amd64 ubuntu image created about 80 minutes ago:02:06
*** derekh has quit IRC02:06
StevenKroot@undermined:/# dpkg --print-foreign-architectures02:06
StevenKroot@undermined:/#02:06
lifelessStevenK: ah cool02:06
StevenKlifeless: But I agree with you. It makes no sense for cloud images.02:07
* StevenK tries to teach his muscle memory to use pkill rather than skill02:07
openstackgerritlifeless proposed a change to openstack/tripleo-incubator: Restore generation of ssh keypair  https://review.openstack.org/7345802:12
*** e0ne has joined #tripleo02:13
*** dprince has quit IRC02:15
*** dprince has joined #tripleo02:17
*** e0ne has quit IRC02:18
lifelessyay te coming online02:19
*** CaptTofu has joined #tripleo02:20
*** ramishra has joined #tripleo02:30
*** dprince has quit IRC02:32
lifelessnuts, after all that we *still* had 2 te machines fail to deploy02:33
lifelessstabbity stab stba02:33
lifeless31 test environments; little low :(02:39
*** ccrouch has joined #tripleo02:52
openstackgerritA change was merged to openstack/tripleo-image-elements: Parameterize geard broker and test env public interface  https://review.openstack.org/7230502:57
lifelessSpamapS: Ithink some of the fragility is more bad BMC interactions02:59
lifelessSpamapS: very slow crotchety embedded machines02:59
*** newell has quit IRC03:01
*** e0ne has joined #tripleo03:13
*** whayutin_ has quit IRC03:13
lifelessSpamapS: also remember to restart n-d-a between each deploy :(03:14
*** vkdrao has joined #tripleo03:17
*** e0ne has quit IRC03:18
*** cwolferh has quit IRC03:19
*** jcooley_ has joined #tripleo03:26
*** cwolferh has joined #tripleo03:30
StevenKHm03:39
*** ramishra has quit IRC03:39
StevenKChanges to toci writing the datafile isn't escaped03:39
openstackgerritSteve Kowalik proposed a change to openstack-infra/tripleo-ci: Add more escaping the JSON writing  https://review.openstack.org/7349103:44
openstackgerritSteve Kowalik proposed a change to openstack-infra/tripleo-ci: Add cpu-{node,mem,disk} to the testenv JSON  https://review.openstack.org/7349203:46
*** ramishra has joined #tripleo03:47
StevenKlifeless, SpamapS: 73491 is likely quite important, since it may lead to invalid JSON or bash getting confused03:48
*** vkdrao has quit IRC03:51
*** edmund has joined #tripleo03:59
*** ramishra has quit IRC04:12
*** ramishra has joined #tripleo04:12
*** e0ne has joined #tripleo04:13
*** CaptTofu has quit IRC04:16
*** e0ne has quit IRC04:18
*** chrisfoster has quit IRC04:23
*** UtahDave has joined #tripleo04:23
*** chrisfoster has joined #tripleo04:24
*** jcooley_ has quit IRC04:24
*** saju_m has joined #tripleo04:39
openstackgerritSteve Kowalik proposed a change to openstack-infra/tripleo-ci: Add more escaping when writing the JSON datafile  https://review.openstack.org/7349104:42
openstackgerritSteve Kowalik proposed a change to openstack-infra/tripleo-ci: Add cpu-{node,mem,disk} to the testenv JSON  https://review.openstack.org/7349204:42
StevenKSpamapS: ^ Fixed, sorry04:42
lifelessStevenK: and thats why we used python :)04:45
lifelessnuts undercloudcloud fail04:45
lifeless9 failed testenvs04:45
lifelessSpamapS: are yo around ?04:46
lifelessSpamapS: I'm  betting keystone fail04:46
StevenKlifeless: Could it be due to 73491 ?04:46
lifelessStevenK: no04:46
StevenKAw04:46
lifelessStevenK: | c2a3c40e-ced6-4695-91be-8ca8524d9c5b | testenv-testenv3-xrgjgoycffwa                   | ERROR  | None       | NOSTATE     |                       |04:46
lifelesswrong layer04:46
StevenKRight04:46
lifelesshmm, 87K tokens, thats not in OMG territory yet.04:48
* lifeless truncates anyhow04:48
*** epim has joined #tripleo04:55
SpamapSlifeless: eh?04:58
SpamapSlifeless: we should be keeping tokens under control now that the purge thing actually works04:58
SpamapSlifeless: 87k is 1.32 million less than we had back when it was zomg fail ;)04:58
lifelessSpamapS: yeah, but I had 9 failed instances :(05:03
lifelessSpamapS: I'm flailing around for a cause, since all 10 of thsoe machines deploy ed ok before05:05
lifelessSpamapS: (and then totally failed to be useful05:05
SpamapSlifeless: are they failing after having pxe booted once into deploy ramdisk?05:07
SpamapSlifeless: I saw that a few times but couldn't keep my attention on ILO enough to figure out why.05:08
lifelessdunno, I just kicked it all back to start05:08
lifeless9 in a row exceeded my per-machine credularity05:08
SpamapSYeah I'm wondering if it is a tftp fail or something silly like that.05:08
*** ramishra has quit IRC05:11
*** ramishra has joined #tripleo05:12
*** jcooley_ has joined #tripleo05:13
*** e0ne has joined #tripleo05:13
lifelessone spawned05:17
lifeless9 pending05:17
lifeless3.6G in /tftpboot05:18
lifelessbut only kernels and ramdisks go there05:18
*** e0ne has quit IRC05:18
*** akuznetsov has joined #tripleo05:24
lifelessno requests in the deploy helper05:24
lifelessmachine seems stuck on boot-from-hard-drive05:31
*** epim_ has joined #tripleo05:53
*** epim has quit IRC05:54
*** epim_ is now known as epim05:54
*** saju_m has quit IRC05:58
*** al has quit IRC06:01
*** rpodolyaka1 has joined #tripleo06:02
lifelessahha06:03
lifelessStderr: 'Error in open session response message : insufficient resources for session\n\r\nError: Unable to establish06:03
lifeless IPMI v2 / RMCP+ session\r\nUnable to get Chassis Power Status\r\n'06:03
lifelessSpamapS: ^ le failure06:03
lifelessActive Session Count  : 5906:06
* lifeless sees another night of fixing ahead06:08
lifelessdevananda: ^ this may interest you06:08
*** sballe has quit IRC06:11
*** e0ne has joined #tripleo06:13
*** rcarrillocruz1 has joined #tripleo06:15
*** CaptTofu has joined #tripleo06:17
*** akuznetsov has quit IRC06:17
*** rcarrillocruz has quit IRC06:17
lifelessthese machines have 4 slots06:18
lifelessone for the screen sessions06:18
*** e0ne has quit IRC06:18
lifelessone for periodic checks06:18
lifelessone for deploy actions06:18
lifelessone for sol - we're at capacity06:18
*** vkozhukalov has joined #tripleo06:21
*** CaptTofu has quit IRC06:21
*** jcooley_ has quit IRC06:23
*** akuznetsov has joined #tripleo06:26
openstackgerritRichard Su proposed a change to openstack/tripleo-incubator: Messaging backend selection  https://review.openstack.org/7252406:27
*** mrunge has joined #tripleo06:28
*** rcarrillocruz has joined #tripleo06:33
*** tzumainn has joined #tripleo06:33
*** rcarrillocruz1 has quit IRC06:34
*** lazy_prince has joined #tripleo07:01
lifeless24310 ?        S      0:00 /usr/bin/expect -f /usr/lib/conman/exec/ipmitool.exp 10.10.16.2507:02
lifelessok, kill -9'd em all07:03
lifelesstrying a deploy07:04
lifelessI wonder, is it possible that these machines share BMCs or something?07:04
lifelessthat would explain 'deploy N at once' => overload BMC ?07:04
lifelesskilled the screen session with ilos too07:08
*** akuznetsov has quit IRC07:09
*** akuznetsov has joined #tripleo07:09
*** rcarrillocruz1 has joined #tripleo07:14
*** rcarrillocruz has quit IRC07:17
*** sballe has joined #tripleo07:24
*** saju_m has joined #tripleo07:25
*** tserong has quit IRC07:29
*** vkozhukalov has quit IRC07:33
*** jprovazn has joined #tripleo07:41
openstackgerritA change was merged to openstack/tuskar: Rename Openstack to OpenStack  https://review.openstack.org/7321207:46
*** rpodolyaka1 has quit IRC07:50
openstackgerritA change was merged to openstack/tuskar: Fix delete cascade for attribute and counts  https://review.openstack.org/7246407:50
*** epim has quit IRC07:51
jprovaznSpamapS, Hi, you still around?07:52
*** jistr has joined #tripleo08:06
*** dko has joined #tripleo08:09
*** CaptTofu has joined #tripleo08:17
*** shardy_afk is now known as shardy08:21
*** bauzas has joined #tripleo08:21
*** CaptTofu has quit IRC08:22
*** viktors has joined #tripleo08:24
*** jtomasek has joined #tripleo08:32
rpodolyakamorning tripleo08:34
*** gcha has joined #tripleo08:37
shadowerhey08:38
*** dtantsur has joined #tripleo08:39
openstackgerritDougal Matthews proposed a change to openstack/tuskar: Use TripleO Heat Merge to manage the stack  https://review.openstack.org/5204508:41
*** tserong has joined #tripleo08:47
*** rlandy has joined #tripleo08:50
*** dtantsur has quit IRC08:54
*** dtantsur has joined #tripleo08:54
*** markmc has joined #tripleo08:56
openstackgerritDougal Matthews proposed a change to openstack/tuskar: Use TripleO Heat Merge to manage the stack  https://review.openstack.org/5204508:59
d0ugallsmola: ^08:59
d0ugalThat should make Jenkins happier.08:59
lsmolad0ugal: cool08:59
*** ifarkas has joined #tripleo09:01
*** vkozhukalov has joined #tripleo09:05
openstackgerritLadislav Smola proposed a change to openstack/tuskar-ui: Adding real deleting and creating of overcloud  https://review.openstack.org/7334309:07
d0ugaldtantsur: Looks like that warning in Python 2.7 only09:07
*** rdopieralski has joined #tripleo09:07
d0ugaldtantsur: So, not sure we can do much about it without effecting 2.6 support.09:08
d0ugallsmola: ^09:08
lifelesswe're not even hitting _cache_tftp_images09:08
*** e0ne has joined #tripleo09:09
*** rcarrillocruz has joined #tripleo09:09
*** al_ has joined #tripleo09:10
*** al_ is now known as Guest3648909:10
*** rcarrillocruz1 has quit IRC09:10
*** derekh has joined #tripleo09:13
*** Guest36489 has quit IRC09:15
*** al0 has joined #tripleo09:16
*** marun has joined #tripleo09:16
*** al0 has quit IRC09:18
*** hashar has joined #tripleo09:18
*** al has joined #tripleo09:18
*** al has quit IRC09:20
*** al has joined #tripleo09:20
*** marun has quit IRC09:21
*** ramishra has quit IRC09:22
openstackgerritRadomir Dopieralski proposed a change to openstack/tuskar-ui: Add a deployment scaling dialog  https://review.openstack.org/7355109:27
*** lucasagomes has joined #tripleo09:28
*** ramishra has joined #tripleo09:28
derekhlifeless: test env deploy didn't go smoothly ?09:31
*** dtantsur has quit IRC09:32
lifelessderekh: it did09:32
*** dtantsur has joined #tripleo09:32
lifelessderekh: wrong broker address09:32
ekarlsohmm, do I need to do something special with bridges in order to get things to work or ?09:33
lifelessderekh: redeploy died09:33
lifelessdebugging09:33
*** UtahDave has quit IRC09:33
openstackgerritRadomir Dopieralski proposed a change to openstack/tuskar-ui: Update the deployment design page  https://review.openstack.org/7323809:33
ekarlsohad first a problem with ovs + briding, so removed the bridge module and I deleted the default network now it fails to run devtest_seed.sh due to default network missing p09:33
derekhlifeless: ahh ok, bugger, if you want me to look at anything just shout.09:34
*** athomas has joined #tripleo09:35
lifeless2014-02-14 09:32:12,684.684 26712 TRACE nova.network.neutronv2.api [instance: 3ab70541-a728-4f06-a556-659413821638] NeutronClientException: 409-{u'NeutronError': {u'message': u'Unable to complete operation for network e85b44c7-1136-4217-954e-cdf0acdddfe1. The mac address 78:e7:d1:23:62:95 is in use.', u'type': u'MacAddressInUse', u'detail': u''}}09:35
lifelessmight be related09:35
lifelessthough WTF that is TRACE?!??!??! is beyond me09:35
derekhgood questions, seems it should be warning at the very least09:35
lifelesswe've had BMC's run out of sessions09:37
lifelessbunch of ZOMG09:37
lsmolalifeless: hola09:41
lsmolalifeless: is stack-update working for you? https://bugs.launchpad.net/tripleo/+bug/127901109:41
lifelesslsmola: things may be all broken right now, and we can't really tell - was intending to have CI up about 10 hours ago ... but things happened09:42
lsmolalifeless: talked with heat guys, the template seems to be fine, so I have no idea :-)09:42
lifelesslsmola: what makes this critical?09:43
lsmolalifeless: ok, cool09:43
lifelesslsmola: I mean - is stack-update broken, or is scaling up not working. These are two different things09:43
lsmolalifeless: well I figured that stack-create and update are kind of most important operations we have09:43
lsmolalifeless: stack update is broken when you increase a number of computes09:44
lifelesslsmola: is it broken when you don't do that ?09:44
lsmolalifeless: other cases seems to be working09:44
lifelessok, so its not a regression09:44
lifelessits a use case that hasn't been implemented09:44
lsmolalifeless: well not sure, critical should be, when there is no workaround, right?09:45
lsmolalifeless: oh, it never worked?09:45
lifelessnoone tried it that I know of09:45
lsmolalifeless: hehe ok09:45
lsmolalifeless: so should I keep it critical? it is still 1 of the 2 most important features09:46
lifelesslsmola: no, criticals are reserved for firedrills, regressions and the like09:46
lsmolalifeless: ok, back to high09:47
lifelesslsmola: done already09:47
lsmolalifeless: ok cool09:47
lsmolalifeless: thank you for the feedback09:47
lifelessyw09:47
lifeless09:49
lifelessNo filename or root path specified09:49
lifelessNo more network devices09:49
lifelessderekh: ^ bad pxe data from neutron. wtfmoment09:49
lifelessand yet09:50
lifelesstag:9bbc56c0-60f1-4f8e-a5c7-d008686cbb6e,option:bootfile-name,pxelinux.009:50
lifelesstag:9bbc56c0-60f1-4f8e-a5c7-d008686cbb6e,option:server-ip-address,10.10.16.16909:50
lifelessvisible in neutron09:50
lifelessaddress handed out is 192.168.1.157/09:51
lifelessAHHA09:51
lifelessderekh: I know whats happening09:51
*** saju_m has quit IRC09:51
derekhhope so, I wasn't looking forward to digging into neutron :-)09:52
lifelessderekh: bug 127280309:52
lifelessderekh: just added a comment to the description. Read and weep09:53
*** vkozhukalov has quit IRC09:53
* derekh reading09:53
derekhok, I can see how this would be random on each instance, but if an entirer stack succeeds and then the entirer stack passes, seem a bit unrandom09:56
lifelessderekh: its entirely explainable09:58
lifelessderekh: I forgot to delete the port before redeploying09:58
derekhand why would the overcloud be responding to macs of hosts on the undercloud, we have removed the port allocations, suppose it mightn't have deleted the fully09:58
lifelessport/ports/09:58
lifelessderekh: if one forgets, *boom*09:59
derekhlifeless: ok, makes sense09:59
derekhlifeless: so have you deleted them since then, I don't see them now09:59
lifelessonce I realised09:59
lifelessI took action :)09:59
derekhlifeless: ok, I buy the explaination, so redeploy and we should be golden10:00
lifelessthats whats happening right now10:01
lifelessfirst box is just about to pxe boot10:01
lifelessthere it goes10:02
lifelessdeploy disk10:02
lifelesswoo10:02
derekhwoot10:02
lifelesswhat a brain-f*ck10:02
lifelessthe first deploy 9/10 failed, one worked10:02
lifelessso I didn't clue in. - like it wasn't obviously 'this doesn't work'10:02
derekhsometimes you just need to stare at something for a few hours :-(10:03
lifelessno10:03
lifelessone never *needs* that10:03
lifelessright, one active10:03
derekhok, need is the wrong  word10:04
derekhovercloud controller may need more memory : Feb 14 09:48:49 overcloud-notcompute0-t3bybsd652xw kernel: Out of memory: Kill process 5074 (glance-api) score 208 or sacrifice child10:04
derekhin user run devtest10:04
lifelessreally? lol10:04
lifelessI think we should file bugs10:04
lifelesscause, 2G should be -tonnes- for non-cache pages10:04
*** vkozhukalov has joined #tripleo10:05
derekh             total       used       free     shared    buffers     cached10:05
derekhMem:       2051192    1649940     401252      16708       7484      6107210:05
derekhglance-api got killed whill uploading the user image10:06
derekhand again, I wonder has it started storing images in memory all of a suddon10:08
derekhwill take a look, and see what needs to be filed10:08
derekhbtw, this worked https://review.openstack.org/#/c/73458/ can you remove the -210:08
derekhlifeless: ^10:08
lifelessheh, it survived the -2er - me - uploading a fixed patch10:10
lifelessubuntu@te-broker:~$ echo 'status' | nc localhost 4730 -q 1 | grep lockenv10:15
lifelesslockenv 5       0       810:15
lifelessok, *now* we're gassing with cookery10:15
lifelessecho 'status' | nc localhost 4730 -q 1 | grep lockenv10:16
lifelesslockenv 2       2       1110:16
derekhtestenvs 1 4 and 7 have registered with overcloud neutron10:16
*** max_lobur_afk is now known as max_lobur10:17
*** CaptTofu has joined #tripleo10:18
lifelessderekh:10:20
lifeless2014-02-14 10:18:33.335 | [2014/02/14 10:18:33 AM] [ERROR] Could not parse metadata file: /tmp/tmpXuYNcJ10:20
lifelessfrom https://jenkins02.openstack.org/job/check-tripleo-seed-precise/306/console10:21
lifelessderekh: did you see StevenK10:21
lifeless's fix?10:21
*** nosnos has quit IRC10:21
derekhlifeless: looking10:22
*** CaptTofu has quit IRC10:23
*** hashar has quit IRC10:23
derekhlifeless: crap, I hadn't seen it, +2'd10:24
*** hashar has joined #tripleo10:25
derekhlifeless: if you want I can redeploy it again, once merged, its probably late there10:26
*** ccorrigan has joined #tripleo10:27
lifelessit is a little10:27
lifelessderekh: please do10:29
lifeless#./deploy-testenv 10.10.16.135 192.168.1.4810:29
lifelessis how I deployed10:29
lifelessbut10:29
derekhlifeless: ok, can you +A the patch https://review.openstack.org/#/c/73491/10:29
lifelessactually, nvm , full deploy is needed anyhow10:29
openstackgerritA change was merged to openstack-infra/tripleo-ci: Add more escaping when writing the JSON datafile  https://review.openstack.org/7349110:30
derekhtestenv stack deleted / ports deleted and building new image10:36
derekhhmm, so would the fact that ports weren't been cleaned up explain a lot of the  issues we've been seeing in general with instances not booting10:41
*** boris-42_ has quit IRC10:43
derekh| f220235a-3f03-470e-a469-f9d7bb76ef67 | testenv      | CREATE_IN_PROGRESS | 2014-02-14T10:45:38Z |10:45
lifelessderekh: yes, that bug really is critical :)10:46
lifelessgnight10:50
derekhgnight, have a good weekend10:51
*** ramishra has quit IRC10:55
*** ramishra has joined #tripleo11:05
openstackgerritDougal Matthews proposed a change to openstack/tuskar: Use TripleO Heat Merge to manage the stack  https://review.openstack.org/5204511:05
*** gcha has quit IRC11:06
*** CaptTofu has joined #tripleo11:17
*** CaptTofu has quit IRC11:18
*** CaptTofu has joined #tripleo11:19
*** CaptTofu has quit IRC11:20
*** CaptTofu has joined #tripleo11:20
*** CaptTofu has quit IRC11:21
*** CaptTofu has joined #tripleo11:21
*** CaptTofu has quit IRC11:24
openstackgerritDougal Matthews proposed a change to openstack/tuskar: Use TripleO Heat Merge to manage the stack  https://review.openstack.org/5204511:26
*** CaptTofu has joined #tripleo11:27
*** CaptTofu has quit IRC11:32
*** CaptTofu has joined #tripleo11:32
openstackgerritDmitry Tantsur proposed a change to openstack/tuskar-ui: Add CRUD operations for node profiles  https://review.openstack.org/7357611:35
*** lazy_prince has left #tripleo11:35
*** al has quit IRC11:37
*** al has joined #tripleo11:39
*** d0ugal has quit IRC11:41
*** d0ugal has joined #tripleo11:41
*** al has quit IRC11:43
*** al has joined #tripleo11:44
openstackgerritLadislav Smola proposed a change to openstack/tuskar-ui: Adding real index, deleting and creating of overcloud  https://review.openstack.org/7334311:49
dtantsurHi everyone. I have a question about architecture field in node profile UI (tuskar-ui). Do we have a fixed list of possible architectures or we'll allow user to input everything there?11:55
*** boris-42_ has joined #tripleo11:56
openstackgerritRadomir Dopieralski proposed a change to openstack/tuskar-ui: Update the deployment design page  https://review.openstack.org/7323812:04
ekarlsolifeless: here boss ? You got a clue on my issue12:06
dtantsurquestion #2: does it really makes sense to make ID for node profile settable by user/editable?12:06
openstackgerritLadislav Smola proposed a change to openstack/tuskar-ui: Adding real index, deleting and creating of overcloud  https://review.openstack.org/7334312:10
openstackgerritA change was merged to openstack/tripleo-incubator: Rename Openstack to OpenStack  https://review.openstack.org/7320812:12
openstackgerritA change was merged to openstack/tripleo-image-elements: Rename Openstack to OpenStack  https://review.openstack.org/7320612:12
*** hashar has quit IRC12:12
ekarlsoI'm unable to get the devtest stuff going12:12
openstackgerritA change was merged to openstack/diskimage-builder: Rename Openstack to OpenStack  https://review.openstack.org/7290212:12
openstackgerritDmitry Tantsur proposed a change to openstack/tuskar-ui: Add CRUD operations for node profiles  https://review.openstack.org/7357612:16
openstackgerritA change was merged to openstack/tuskar: Use TripleO Heat Merge to manage the stack  https://review.openstack.org/5204512:16
*** ifarkas has quit IRC12:18
*** ifarkas has joined #tripleo12:20
openstackgerritA change was merged to openstack/diskimage-builder: Teach cache-url to handle file:// URIs.  https://review.openstack.org/7296412:23
ifarkasdtantsur, you might want to check the #tuskar channel. tuskar folks doesn't read #tripleo that often12:33
openstackgerritA change was merged to openstack/tripleo-incubator: Rename EXTRA_ELEMENTS to SEED_DIB_EXTRA_ARGS  https://review.openstack.org/7303212:33
*** weshay has joined #tripleo12:34
*** mrunge has quit IRC12:35
derekhanybody else having devtest crash when uploading the user qcow image ?12:39
derekhhttps://bugs.launchpad.net/tripleo/+bug/128027512:40
ekarlsoanyone here running a devtest in a cloud ?12:42
*** gcha has joined #tripleo12:42
markmcderekh, this ? https://bugs.launchpad.net/openstack-ci/+bug/128007212:44
markmcderekh, https://review.openstack.org/7358512:44
derekhmarkmc: ahh, I hadn't spotted that, looking12:45
derekhmarkmc: seems to be different (but related to the new swift client release), my problem is extra memory usage, this one is a curruption problem, I'll add a link to the memory bug and try that patch12:48
markmcderekh, roger12:48
*** hashar has joined #tripleo12:55
lsmolajprovazn: have a minute?12:55
jprovaznlsmola, sure12:55
lsmolaWaiting for SSH hostkey.12:55
lsmolaWarning: Permanently added '192.168.122.191' (ECDSA) to the list of known hosts.12:55
lsmolaPermission denied (publickey,gssapi-keyex,gssapi-with-mic).12:55
lsmolajprovazn: after i boot seed, this repeats till timeout12:55
lsmolajprovazn: seed is fine, but it cant shh to it?12:56
lsmolajprovazn: http://paste.openstack.org/show/65415/12:56
jprovaznlsmola, seems you don't use the right key when sshing to the seed machine12:56
lsmolajprovazn: ok12:57
lsmolajprovazn: any idea how to fix it? this is devtest.sh12:57
jprovaznlsmola, can you ssh as stackuser?13:02
lsmolajprovazn: I have updated tripleo and runnign again, will tell you in 5min13:03
jprovaznif yes, then you can put your ssh key to root's authorized_keys13:03
lsmolajprovazn: ok13:03
lsmolajprovazn: what script should put it there?13:03
openstackgerritRadomir Dopieralski proposed a change to openstack/tuskar-ui: Add a deployment scaling dialog  https://review.openstack.org/7355113:05
derekhlsmola: try ssh root@192.168.122.XXX -i ~/.ssh/id_rsa_virt_power13:05
*** lucasagomes is now known as lucas-hungry13:06
lsmoladerekh: ok, thanks13:06
derekhlsmola: and if that works add a +1 to https://review.openstack.org/#/c/73458/13:06
derekhor 2 :-)13:06
lsmoladerekh: ok, great13:06
lsmoladerekh: ok, I will rather try with your patch applied :-)13:08
derekhlsmola: okie dokie13:08
jistrlsmola: i hit the same thing as you described to jprovazn13:16
jprovaznjistr, derekh has already patch for the issue ^13:17
jistrderekh++13:17
jistrit works13:17
jistradding +213:17
lsmolajistr: cool13:17
jistrafter i read it :)13:18
lsmolajistr, derekh : I will try it and approve it13:18
*** howleyt has joined #tripleo13:20
*** jdob has joined #tripleo13:21
howleytHi, I'm trying to get devtest up and running and it's failing at some point in the provisioning of the noncompute node in the overcloud:13:22
howleytroot@undercloud-undercloud-b2ue6a46w5ag:~# heat stack-list13:22
howleyt+--------------------------------------+------------+---------------+----------------------+13:22
howleyt| id                                   | stack_name | stack_status  | creation_time        |13:22
howleyt+--------------------------------------+------------+---------------+----------------------+13:22
howleyt| 82eab752-5be7-4f44-8956-3c3c9307c757 | overcloud  | CREATE_FAILED | 2014-02-11T17:50:01Z |13:22
howleyt+--------------------------------------+------------+---------------+----------------------+13:22
howleytIt fails specifically on this resource:13:23
howleytroot@undercloud-undercloud-b2ue6a46w5ag:~# heat resource-list overcloud | grep FAILED13:23
howleyt| notCompute0CompletionCondition  | AWS::CloudFormation::WaitCondition       | CREATE_FAILED   | 2014-02-11T18:38:50Z |13:23
howleytAny ideas of what is going wrong?13:23
openstackgerritDmitry Tantsur proposed a change to openstack/tuskar-ui: Add CRUD operations for node profiles  https://review.openstack.org/7357613:24
*** e0ne has quit IRC13:24
*** e0ne has joined #tripleo13:25
openstackgerritRadomir Dopieralski proposed a change to openstack/tuskar-ui: Add a deployment scaling dialog  https://review.openstack.org/7355113:28
*** vkozhukalov has quit IRC13:34
howleytI also have errors in the os-collect-config.log  on my undercloud:13:35
howleytroot@undercloud-undercloud-b2ue6a46w5ag:/var/log/upstart# grep -i error os-collect-config.log13:35
howleyterror.log13:35
howleyt[2014/02/11 04:23:37 PM] [ERROR] key neutron does not exist in ['/var/lib/os-collect-config/heat_local.json', '/var/lib/os-collect-config/ec2.json']13:35
howleyt[2014/02/11 04:23:38 PM] [ERROR] key db-password does not exist in ['/var/lib/os-collect-config/heat_local.json', '/var/lib/os-collect-config/ec2.json']13:35
howleyt[2014-02-11 16:23:38,013] (os-refresh-config) [ERROR] during configure phase. [Command '['dib-run-parts', '/opt/stack/os-config-refresh/configure.d']' returned non-zero exit status 1]13:35
howleyt[2014-02-11 16:23:38,013] (os-refresh-config) [ERROR] Aborting...13:35
howleyt2014-02-11 16:23:38.017 1198 ERROR os-collect-config [-] Command failed, will not cache new data. Command 'os-refresh-config' returned non-zero exit status 113:35
howleytHide full text13:35
*** e0ne has quit IRC13:38
Ngmorning13:45
rpodolyakao/13:45
*** vkozhukalov has joined #tripleo13:47
slaglemorning13:49
openstackgerritJan Provaznik proposed a change to openstack/tripleo-heat-templates: Add rabbit.cookie attribute  https://review.openstack.org/7023814:01
openstackgerritA change was merged to openstack-infra/tripleo-ci: Add cpu-{node,mem,disk} to the testenv JSON  https://review.openstack.org/7349214:03
openstackgerritJan Provaznik proposed a change to openstack/tripleo-heat-templates: Add rabbit.cookie attribute  https://review.openstack.org/7023814:09
*** lucas-hungry is now known as lucasagomes14:10
*** rlandy has quit IRC14:10
*** tzumainn has quit IRC14:14
*** tzumainn has joined #tripleo14:15
howleytHi, anyone free to help me trying to get devtest up and running? It's failing in the provisioning of the noncompute node in my overcloud.14:23
* Ng runs out to a dr appt14:27
*** matty_dubs|gone is now known as matty_dubs14:31
openstackgerritRalf Haferkamp proposed a change to openstack/tripleo-image-elements: Enable iscsid on openSUSE  https://review.openstack.org/7360514:33
*** lblanchard has joined #tripleo14:34
*** dprince has joined #tripleo14:35
jprovaznhowleyt, Hi, can you check if /var/run/os-collect-config//cfn.json exists on your overcloud node?14:35
openstackgerritA change was merged to openstack/diskimage-builder: Remove tox locale overrides  https://review.openstack.org/7220814:36
howleytHi jprovazn, I certainly. I'm seeing this error for two missing keys:14:37
howleytroot@undercloud-undercloud-b2ue6a46w5ag:/var/log/upstart# grep -i error os-collect-config.log14:37
howleyterror.log14:37
howleyt[2014/02/11 04:23:37 PM] [ERROR] key neutron does not exist in ['/var/lib/os-collect-config/heat_local.json', '/var/lib/os-collect-config/ec2.json']14:37
howleyt[2014/02/11 04:23:38 PM] [ERROR] key db-password does not exist in ['/var/lib/os-collect-config/heat_local.json', '/var/lib/os-collect-config/ec2.json']14:37
howleyt[2014-02-11 16:23:38,013] (os-refresh-config) [ERROR] during configure phase. [Command '['dib-run-parts', '/opt/stack/os-config-refresh/configure.d']' returned non-zero exit status 1]14:37
howleyt[2014-02-11 16:23:38,013] (os-refresh-config) [ERROR] Aborting...14:37
howleyt2014-02-11 16:23:38.017 1198 ERROR os-collect-config [-] Command failed, will not cache new data. Command 'os-refresh-config' returned non-zero exit status 114:37
howleytHowever, those keys do exist in cfn.json14:37
*** e0ne has joined #tripleo14:38
jprovaznhowleyt, well, the grepped errors don't have to be critical - os-collect-config runs before cfn.json is fetched from heat metadata server, if you open os-collect-config.log you should see if next iterations of os-collect-config passed14:40
jprovaznhowleyt, something like "Completed phase post-configure"14:41
jprovaznhowleyt, if it finished successfully, you might check on undercloud machine: 1) if all stack's intances are active (nova list), 2) heat/nove logs - there should be a log with reason why stack was marked as failed14:44
howleytjprovazn, Well, /var/log/upstart/os-collect-config.log  ends with: [2014-02-13 14:10:49,299] (os-refresh-config) [INFO] Completed phase post-configure14:44
*** dtantsur has quit IRC14:45
howleytAll three instances are active alright.14:45
jprovaznhowleyt, ok, could you please check heat/nova logs on undercloud?14:45
howleytI can ssh onto the two compute nodes, but can't ssh onto noncompute node.14:45
howleytsure14:45
jprovaznhowleyt, ah, mmt, derekh mentioned ssh key issue earlier today14:47
jprovaznthough it was related to seed node14:48
howleytActually, I lie. I can no longer ssh onto any of the overcloud instances now ( I have run devtest once since).14:48
howleytI'll check the logs14:48
howleytJust remembered I had a cut-down version of my heat-engine log in pastebin yesterday: http://pastebin.com/nptHXCdh14:50
howleytLots of errors like this:14:51
howleyt2014-02-13 14:29:30.305 2785 ERROR heat.openstack.common.rpc.common [req-d10cb384-0109-497f-9813-7da370f3900c None] Returning exception The Resource (NovaCompute0Config) is not available. to caller14:51
howleyt2014-02-13 14:29:30.306 2785 ERROR heat.openstack.common.rpc.common [req-d10cb384-0109-497f-9813-7da370f3900c None] ['Traceback (most recent call last):\n', '  File "/opt/stack/venvs/heat/local/lib/python2.714:51
howleyt172, in dispatch\n    result = getattr(proxyobj, method)(ctxt, **kwargs)\n', '  File "/opt/stack/venvs/heat/local/lib/python2.7/site-packages/heat/engine/service.py", line 62, in wrapped\n    return func(sel14:51
howleytf, ctx, *args, **kwargs)\n', '  File "/opt/stack/venvs/heat/local/lib/python2.7/site-packages/heat/engine/service.py", line 749, in describe_stack_resource\n    raise exception.ResourceNotAvailable(resource_14:51
howleytname=resource_name)\n', 'ResourceNotAvailable: The Resource (NovaCompute0Config) is not available.\n']14:51
howleytjprovazn /var/log/upstart/nova-compute.log on my undercloud also shows an error:14:56
howleyt2014-02-13 14:12:38.595 3322 ERROR nova.compute.manager [req-e49ea18f-b82f-47ca-adae-594d06733aa0 3fda0ffd31f04b2691cf74274be4e51a 5f2fe40d1d8149489b1b84e9b65ad806] [instance: f8da7e08-8110-4939-9855-6e66823ecff3] Error: Insufficient compute resources.14:56
howleyt2014-02-13 14:12:38.595 3322 TRACE nova.compute.manager [instance: f8da7e08-8110-4939-9855-6e66823ecff3] Traceback (most recent call last):14:56
howleyt2014-02-13 14:12:38.595 3322 TRACE nova.compute.manager [instance: f8da7e08-8110-4939-9855-6e66823ecff3]   File "/opt/stack/venvs/nova/local/lib/python2.7/site-packages/nova/compute/manager.py", line 1044, in _build_instance14:56
howleyt2014-02-13 14:12:38.595 3322 TRACE nova.compute.manager [instance: f8da7e08-8110-4939-9855-6e66823ecff3]     with rt.instance_claim(context, instance, limits):14:56
howleyt2014-02-13 14:12:38.595 3322 TRACE nova.compute.manager [instance: f8da7e08-8110-4939-9855-6e66823ecff3]   File "/opt/stack/venvs/nova/local/lib/python2.7/site-packages/nova/openstack/common/lockutils.py", line 249, in inner14:56
howleyt2014-02-13 14:12:38.595 3322 TRACE nova.compute.manager [instance: f8da7e08-8110-4939-9855-6e66823ecff3]     return f(*args, **kwargs)14:56
howleyt2014-02-13 14:12:38.595 3322 TRACE nova.compute.manager [instance: f8da7e08-8110-4939-9855-6e66823ecff3]   File "/opt/stack/venvs/nova/local/lib/python2.7/site-packages/nova/compute/resource_tracker.py", line 136, in instance_claim14:56
howleyt2014-02-13 14:12:38.595 3322 TRACE nova.compute.manager [instance: f8da7e08-8110-4939-9855-6e66823ecff3]     raise exception.ComputeResourcesUnavailable()14:56
howleyt2014-02-13 14:12:38.595 3322 TRACE nova.compute.manager [instance: f8da7e08-8110-4939-9855-6e66823ecff3] ComputeResourcesUnavailable: Insufficient compute resources.14:56
howleyt2014-02-13 14:12:38.595 3322 TRACE nova.compute.manager [instance: f8da7e08-8110-4939-9855-6e66823ecff3]14:56
howleyt2014-02-13 14:15:32.680 3322 WARNING nova.openstack.common.loopingcall [-] task run outlasted interval by 0.512338 sec14:56
howleyt2014-02-13 14:20:48.697 3322 WARNING nova.compute.manager [-] Bandwidth usage not supported by hypervisor.14:56
*** hashar has quit IRC14:57
*** edmund has quit IRC14:58
openstackgerritA change was merged to openstack/tripleo-incubator: Restore generation of ssh keypair  https://review.openstack.org/7345814:59
jprovaznhowleyt, hm, "Insufficient compute resources" sounds like a node for an instance couldn't be allocated or there was a node with required HW params15:01
jprovaznhowleyt, what "nova list" and "nova baremetal-node-list" prints?15:02
jprovazn(on undercloud)15:02
derekhhowleyt: just wondering are you running ./scripts/devtest.sh --trash-my-machine with or without a -c ?15:05
*** jprovazn is now known as jprovazn_afk15:07
derekhwoot, first successful ci undercloud job https://jenkins02.openstack.org/job/check-tripleo-undercloud-precise/319/15:08
derekhnow to figure out the overcloud15:09
*** hashar has joined #tripleo15:10
*** ifarkas has quit IRC15:17
*** boris-42_ has quit IRC15:17
mordredderekh: awesome!15:18
*** ifarkas has joined #tripleo15:19
Ngnice15:19
openstackgerritJiri Tomasek proposed a change to openstack/tuskar-ui: Fixing the nodes table  https://review.openstack.org/7264815:21
*** jdob has quit IRC15:23
*** jdob has joined #tripleo15:23
howleytderekh: I have run devtest twice, the second time with -c15:25
howleytjprovazn:root@undercloud-undercloud-57spmb6tdxn7:~# nova list15:26
howleyt+--------------------------------------+-------------------------------------+--------+------------+-------------+--------------------+15:26
howleyt| ID                                   | Name                                | Status | Task State | Power State | Networks           |15:26
howleyt+--------------------------------------+-------------------------------------+--------+------------+-------------+--------------------+15:26
howleyt| c6d3d751-d195-4754-9595-135804e7d53c | overcloud-NovaCompute0-4djus5fwnmui | ACTIVE | -          | Running     | ctlplane=192.0.2.6 |15:26
howleyt| 14e954ce-cdbd-419b-b670-cc1505cfd0e3 | overcloud-NovaCompute1-bc2lx4xxhsui | ACTIVE | -          | Running     | ctlplane=192.0.2.7 |15:26
howleyt| f8da7e08-8110-4939-9855-6e66823ecff3 | overcloud-notCompute0-65jwa3sikh62  | ACTIVE | -          | Running     | ctlplane=192.0.2.8 |15:26
howleyt+--------------------------------------+-------------------------------------+--------+------------+-------------+--------------------+15:26
derekhhowleyt: I *think* the -c runs to fast, so the scheduler doesn't know yet about baremetal nodes by the time the heat stack-create runs, which might explain the Insufficient resource problem15:27
derekhhowleyt: if I'm correct this might be a workaround for it http://paste.fedoraproject.org/77284/13923916/15:27
* derekh has been meaning to look into this but hasn't gotten around to it15:28
derekhhas anybody else seen this behaviour ^ ?15:28
*** dtantsur has joined #tripleo15:33
*** hashar has quit IRC15:34
*** hashar has joined #tripleo15:34
*** jdob has quit IRC15:36
*** jdob has joined #tripleo15:36
howleytThanks derekh, I'll try that out. I seemed to have been in a better position in terms ssh'ing onto overcloud compute nodes before running with -c15:36
openstackgerritDirk Mueller proposed a change to openstack/tripleo-image-elements: Install syslinux also in nova-baremetal  https://review.openstack.org/7362315:39
openstackgerritIvan Kolodyazhny proposed a change to openstack/python-tuskarclient: Using tox instead of tuskar_with_venv.sh script  https://review.openstack.org/7338315:41
*** dko has quit IRC15:46
openstackgerritRadomir Dopieralski proposed a change to openstack/tuskar-ui: Add a deployment scaling dialog  https://review.openstack.org/7355115:48
howleytderekh I've kicked off devtest.sh -c run with those sleeps inserted. I probably won't get a result before I today, but we'll see.  Thanks15:50
derekhhowleyt: no prob15:50
howleytbefore I "leave" today, that is.15:51
openstackgerritDmitry Tantsur proposed a change to openstack/tuskar-ui: Add list/created/delete operations for node profiles  https://review.openstack.org/7357615:52
*** gcha has quit IRC15:53
openstackgerritRalf Haferkamp proposed a change to openstack/tripleo-image-elements: Remove no longer needed generation of the virtual-power-key  https://review.openstack.org/7363015:53
openstackgerritDmitry Tantsur proposed a change to openstack/tuskar-ui: Add list/created/delete operations for node profiles  https://review.openstack.org/7357616:00
*** edmund has joined #tripleo16:00
*** e0ne_ has joined #tripleo16:02
*** e0ne has quit IRC16:06
derekhSpamapS: is it ok to zap one of the findbroken-* instances on the undercloud ? wanna being up a test test env and it looks like we're out of hosts16:06
ccrouchso we've got 10 RH folks lined up for tripleo16:06
ccrouchwaiting for hotel information16:06
ccrouchtripleo meetup in sunnyvale16:06
*** rdopieralski has quit IRC16:07
howleytderekh: I'm still waiting for overcloud stack to be ready, but I'm already seeing this error in the undercloud:16:07
howleytroot@undercloud-undercloud-xcxtyoq4p3c4:~# sudo grep -i error /var/log/upstart/nova-compute.log16:07
howleyt2014-02-14 16:00:10.178 3317 ERROR nova.compute.manager [req-ee311015-4ab3-4c1b-b002-e3fbc6d7c9b1 3d71b5ddbf46445b8676ae0196c9201f d538acdabd22435c9aff9c8565ee2041] [instance: 1510a2e2-39e7-49b0-9b94-ff418dbbe487] Error: Insufficient compute resources.16:07
*** lblanchard has quit IRC16:07
ccrouchcody-somerville: any news on hotels ^ ?16:07
*** morazi has quit IRC16:07
derekhhowleyt: doesn't look good so, you can source ~/root/stackrc on the undercloud and run a nova list to confirm16:08
derekhhowleyt: I meant ~root/stackrc16:09
*** morazi has joined #tripleo16:10
*** lblanchard has joined #tripleo16:11
openstackgerritDmitry Tantsur proposed a change to openstack/tuskar-ui: Add list/created/delete operations for node profiles  https://review.openstack.org/7357616:16
howleytderekh: root@undercloud-undercloud-xcxtyoq4p3c4:~# nova list16:17
howleyt+--------------------------------------+-------------------------------------+--------+------------+-------------+--------------------+16:17
howleyt| ID                                   | Name                                | Status | Task State | Power State | Networks           |16:17
howleyt+--------------------------------------+-------------------------------------+--------+------------+-------------+--------------------+16:17
howleyt| fd57db8d-c69a-40d6-92ab-c1ab450333ee | overcloud-NovaCompute0-h3v3fvasuuiw | ACTIVE | -          | Running     | ctlplane=192.0.2.6 |16:17
howleyt| 1510a2e2-39e7-49b0-9b94-ff418dbbe487 | overcloud-NovaCompute1-whydnad4soiy | ACTIVE | -          | Running     | ctlplane=192.0.2.8 |16:17
howleyt| c2b4574f-685c-49af-85dd-7dab878c39ae | overcloud-notCompute0-q23hiaous33x  | ACTIVE | -          | Running     | ctlplane=192.0.2.5 |16:17
howleyt+--------------------------------------+-------------------------------------+--------+------------+-------------+--------------------+16:17
cody-somervilleccrouch: Hey there. Working on contract. We're working to drive the rate down which is right now artificially inflated due to March break/Sprint break. Will announce details probably early next week.16:17
openstackgerritDerek Higgins proposed a change to openstack-infra/tripleo-ci: Get logs from undercloud  https://review.openstack.org/7363516:18
openstackgerritDmitry Tantsur proposed a change to openstack/tuskar-ui: Add list/create/delete operations for node profiles  https://review.openstack.org/7357616:18
derekhhowleyt: maybe heat retried and was successful the second time, I think it does that, what does heat list say?16:19
*** hashar has quit IRC16:20
*** e0ne_ has quit IRC16:22
*** robsparker has joined #tripleo16:23
*** dtantsur has quit IRC16:25
*** e0ne has joined #tripleo16:25
howleytderekh: It's still in progress, but I don't have much hope for it:16:26
howleytroot@undercloud-undercloud-xcxtyoq4p3c4:~# heat stack-list16:26
howleyt+--------------------------------------+------------+--------------------+----------------------+16:26
howleyt| id                                   | stack_name | stack_status       | creation_time        |16:26
howleyt+--------------------------------------+------------+--------------------+----------------------+16:26
howleyt| 4dfe1a9b-e9a7-4731-97ec-c180d24c1a6e | overcloud  | CREATE_IN_PROGRESS | 2014-02-14T16:00:01Z |16:26
howleyt+--------------------------------------+------------+--------------------+----------------------+16:26
*** vkozhukalov has quit IRC16:26
derekhhowleyt: if you want you can ssh to the notcompute node and see what os-collect-config is doing, i.e. see if it has failed16:27
*** bauzas has quit IRC16:27
ccrouchthanks cody-somerville16:28
*** morazi_ has joined #tripleo16:30
*** morazi has quit IRC16:31
*** bnemec is now known as beekneemech16:32
howleytderekh: unfortunately, I've never reached the point where I could ssh to notcompute node, currently, I get: ssh: connect to host 192.0.2.5 port 22: Connection refused16:32
derekhhowleyt: not good, the only advice I have from there is to try and look at the console with vnc or something and see if you can see any errors that way16:33
*** peoplemerge has joined #tripleo16:34
*** rcarrillocruz has quit IRC16:41
*** rcarrillocruz has joined #tripleo16:42
*** e0ne has quit IRC16:42
*** e0ne has joined #tripleo16:44
rbrady-afkhowleyt: were you able to run "heat event-list overcloud"?16:52
openstackgerritJames Slagle proposed a change to openstack/tripleo-image-elements: Fix typo  https://review.openstack.org/7365516:53
openstackgerritJames Slagle proposed a change to openstack/tripleo-image-elements: Install lvm2 package for cinder  https://review.openstack.org/7365616:53
openstackgerritJames Slagle proposed a change to openstack/tripleo-image-elements: Common cinder install code  https://review.openstack.org/7365716:53
openstackgerritJames Slagle proposed a change to openstack/tripleo-image-elements: Use os-svc-restart for cinder-api  https://review.openstack.org/7365816:53
openstackgerritJames Slagle proposed a change to openstack/tripleo-image-elements: Remove unneeded chown of /var/run/nova  https://review.openstack.org/7365916:53
openstackgerritJames Slagle proposed a change to openstack/tripleo-image-elements: Add create-dir service for neutron ovs agent  https://review.openstack.org/7366016:53
*** saju_m has joined #tripleo16:54
howleytderekh: heat event-list: http://pastebin.com/KbeqA0KF16:55
howleytalso got final confirmation from heat that overcloud stack create failed.16:55
*** vkozhukalov has joined #tripleo16:56
*** viktors has quit IRC16:57
derekhhowleyt: so os-refresh-config on notComputes is supposed to send out a notify to heat to say its ready, in your case that didn't happen, your probably going to have to vnc into the undercloud vm to see what we wrong16:58
openstackgerritA change was merged to openstack/tripleo-image-elements: Look for local.json in preference to config.json.  https://review.openstack.org/7340116:58
derekhhowleyt: you should be looking in the os-collect-config logs, (os-collect-config calls os-refresh-config)16:59
*** ifarkas has quit IRC16:59
*** e0ne has quit IRC17:00
howleytderekh: ok, I have to run. I'll look into it next week. Thanks for your help, have a good weekend!17:04
*** bauzas has joined #tripleo17:06
*** rpodolyaka has left #tripleo17:06
derekhhowleyt: okie dokie, you too17:08
*** boris-42_ has joined #tripleo17:08
*** markmc has quit IRC17:15
*** matty_dubs is now known as matty_dubs|lunch17:16
*** vkozhukalov has quit IRC17:17
SpamapSderekh: Yes you can zap findbrokens. Those are as much to hold on to good machines as they are to find broken ones. :)17:19
*** pblaho has joined #tripleo17:20
derekhSpamapS: btw on broken machine, its possible some of them may have been a result of https://bugs.launchpad.net/tripleo/+bug/1272803 , as neutron ports on the overcloud may have been left behind by old test environments17:22
derekhSpamapS: just a hunch but if you saw nodes fail to come up it could be because the got dhcp from the wrong place17:22
cody-somervilleWow. upstart is dead!17:24
cody-somervilleUbuntu is switching to systemd.17:24
*** athomas has quit IRC17:25
SpamapSderekh: Yeah I agree.17:27
SpamapScody-somerville: yeah... <sniff>17:27
SpamapScody-somerville: best for everyone.. but still.. :(17:27
SpamapSthe trolls in Debian have been.. entertaining17:28
SpamapSif not horrifying17:28
SpamapSsuggesting forking Debian and/or physically beating Lennart.17:28
*** shardy is now known as shardy_afk17:32
Ngat least it will help us simplify17:37
openstackgerritDan Prince proposed a change to openstack/tripleo-incubator: Use $TRIPLEO_ROOT/seed.qcow2 as the cache.  https://review.openstack.org/7367417:38
Ngwell, eventually, I imagine 14.04 will have users who care about tripleo for some time to come :/17:38
SpamapSNg: meh17:45
SpamapSNg: Nobody ever cared about upstart in Ubuntu server except me. ;)17:45
*** ramishra has quit IRC17:45
NgSpamapS: I mean in the sense that we'll need to support upstart for some period of time, so 14.04 users can actually use tripleo :)17:45
SpamapSmeh meh .. who wants 14.04 ?17:47
SpamapS14.10 will be so much better17:47
SpamapSand you'll have _9_ whole months to realize that betterness17:47
*** jistr has quit IRC17:51
openstackgerritA change was merged to openstack/tuskar-ui: Use git.openstack.org instead of github.com  https://review.openstack.org/7337117:54
openstackgerritA change was merged to openstack/tuskar-ui: Switch over to oslosphinx  https://review.openstack.org/7335217:54
openstackgerritA change was merged to openstack/tuskar-ui: Rename Openstack to OpenStack  https://review.openstack.org/7321317:54
*** epim has joined #tripleo17:55
openstackgerritDougal Matthews proposed a change to openstack/tuskar: Enable gating on E123  https://review.openstack.org/7367817:58
openstackgerritDougal Matthews proposed a change to openstack/tuskar: Enable gating on E124  https://review.openstack.org/7367917:58
openstackgerritDougal Matthews proposed a change to openstack/tuskar: Enable gating on E126  https://review.openstack.org/7368017:58
openstackgerritDougal Matthews proposed a change to openstack/tuskar: Enable gating on E127  https://review.openstack.org/7368117:58
openstackgerritDougal Matthews proposed a change to openstack/tuskar: Enable gating on E128  https://review.openstack.org/7368217:58
openstackgerritDougal Matthews proposed a change to openstack/tuskar: Enable gating on E122  https://review.openstack.org/7340017:58
d0ugaljdob: Tidying for this week done. ^17:58
*** cadenzajon has joined #tripleo18:01
*** newell has joined #tripleo18:02
openstackgerritA change was merged to openstack/tripleo-image-elements: Move cinder-volume creation to post-configure  https://review.openstack.org/7197518:03
*** derekh has quit IRC18:04
*** morazi_ is now known as morazi18:04
*** pblaho has quit IRC18:05
openstackgerritA change was merged to openstack/tuskar-ui: Add missing semicolons in js  https://review.openstack.org/7318218:06
devanandalifeless: interesting. which machines // what was going on at the time you ran out of sessions?18:08
*** jistr has joined #tripleo18:09
*** jistr has quit IRC18:09
*** w_ has joined #tripleo18:09
*** rbrady-afk is now known as rbrady18:10
*** peoplemerge has quit IRC18:11
*** olaph has quit IRC18:11
*** epim has quit IRC18:15
*** matty_dubs|lunch is now known as matty_dubs18:16
*** ramishra has joined #tripleo18:16
openstackgerritBen Nemec proposed a change to openstack/tripleo-incubator: Don't include unsupported distro check in docs  https://review.openstack.org/7368318:16
*** max_lobur is now known as max_lobur_afk18:20
*** ramishra has quit IRC18:21
*** lucasagomes has quit IRC18:28
*** panda_ has joined #tripleo18:30
*** rpodolyaka1 has joined #tripleo18:32
*** panda has quit IRC18:33
*** peoplemerge has joined #tripleo18:38
openstackgerritBen Nemec proposed a change to openstack/tripleo-incubator: Fix devtest_variables doc formatting  https://review.openstack.org/7368918:40
*** rcarrillocruz1 has joined #tripleo18:42
*** rcarrillocruz has quit IRC18:43
*** wusWWW has joined #tripleo18:51
*** rcarrillocruz has joined #tripleo18:57
*** rcarrillocruz1 has quit IRC19:00
*** rcarrillocruz2 has joined #tripleo19:00
*** rcarrillocruz has quit IRC19:01
slagleis there a way to force fail a heat stack-create?19:01
slaglee.g., i know it's going to fail, and I don't want to have to wait for stack-create to timeout before i can delete it19:02
openstackgerritA change was merged to openstack/tuskar-ui: Adding real index, deleting and creating of overcloud  https://review.openstack.org/7334319:02
Shrewsslagle: how much snow did you get?19:03
*** chrisfoster has quit IRC19:03
slagleShrews: 10" with another 3-6" coming according to Ray19:04
Shrewsgotta love Ray19:04
Shrewsthinking of doing some boarding up there this Monday19:04
*** rcarrillocruz has joined #tripleo19:04
slaglenice, supposed to be warmer and sunny next week19:05
slaglegood time to hit the slopes19:05
openstackgerritDan Prince proposed a change to openstack/tripleo-incubator: Add NeutronPublicInterface arg for the undercloud  https://review.openstack.org/7209419:05
*** Kim-Chi-San has joined #tripleo19:05
*** Kim-Chi-San is now known as tburnes19:05
lifelessmorning19:06
lifelessdevananda: not entirely sure19:06
*** rcarrillocruz2 has quit IRC19:06
lifelessdevananda: see hours later though when I tracked the real cause: https://bugs.launchpad.net/tripleo/+bug/127280319:06
*** e0ne has joined #tripleo19:06
lifelessekarlso: hi, whats up (and no, not here @1am :)19:07
*** e0ne has quit IRC19:10
lifelessoh yeah, seed *and* undercloud job success19:11
lifelessbooyah19:11
*** saju_m has quit IRC19:11
*** rcarrillocruz1 has joined #tripleo19:14
*** rcarrillocruz has quit IRC19:16
*** ramishra has joined #tripleo19:16
*** e0ne has joined #tripleo19:19
*** jprovazn_afk is now known as jprovazn19:20
jprovaznSpamapS, Hi19:21
*** ramishra has quit IRC19:21
*** sgrasley has quit IRC19:24
*** rcarrillocruz has joined #tripleo19:26
SpamapSjprovazn: howdy19:26
*** rcarrillocruz1 has quit IRC19:27
jprovaznSpamapS, I wanted to discuss this: https://review.openstack.org/#/c/68392/4/elements/rabbitmq-server/install.d/20-rabbitmq-server19:28
SpamapSjprovazn: Cool!19:29
jprovaznSpamapS, the reason I put symlink there is because /var/lib/rabbitmq/.erlang.cookie is now template, I didn't fine a way how to convince rabbitmq to change path for erlang cookie (though it might be I missed something)19:30
jprovaznSpamapS, and based on this I didn't like the fact that then there will be both /var/lib/rabbitmq and /mnt/state/var/lib/rabbitmq19:31
jprovaznand both of them will be used19:31
SpamapSjprovazn: it is a bug if you can't change the path to the cookie, I think the RabbitMQ devs will agree there.19:32
SpamapSjprovazn: I'd rather see just the symlink for just the file that is the problem19:32
SpamapSjprovazn: because then it becomes obvious when to remove that symlink.19:32
SpamapS(when the bug is fixed ;)19:32
lifelessjprovazn: we need to generate precious files on /mnt though19:32
lifelessjprovazn: so e.g. /mnt/state/var/lib/rabbitmq/.erland.cookie19:33
jprovaznlifeless, SpamapS sure, erlang.cookie doesn\t ahve to be saved - it will be set on each os-apply-config run19:33
*** CaptTofu_ has joined #tripleo19:33
*** CaptTofu has quit IRC19:34
jprovaznum, so if I'll not be lucky with changing cookie path, are you ok with creating symlink just for .erlang.cookie, right?19:34
SpamapSjprovazn: I would insist on that, yes. :)19:34
*** rcarrillocruz1 has joined #tripleo19:35
SpamapSjprovazn: doing the whole dir is more confusing to me, because I ask myself then "why can't this just be on the state dir?"19:35
openstackgerritBen Nemec proposed a change to openstack/tripleo-incubator: Stop trying to copy out SSH keys from notcompute.  https://review.openstack.org/7346519:35
jprovaznSpamapS, ok19:35
hewbroccaahh the dreaded erlang.cookie19:35
*** rcarrillocruz has quit IRC19:35
jprovaznSpamapS, and to answer this question: https://review.openstack.org/#/c/68392/4/elements/rabbitmq-server/os-config-applier/etc/rabbitmq/rabbitmq-env.conf19:37
lifelessjprovazn: uhm then it should be in /var/run19:37
lifelessjprovazn: we certainly don't want it changing from run to run though, that would cause cluster partitions during deploys.19:37
SpamapSBTW I went ahead and started tripleo-cd again19:39
jprovaznlifeless, I thought that when you reboot one instance of stack (or boot different node instead of the first one), then same heat provides to it same metadata -> same rabbit.cookie value19:40
*** cd-undercloud has joined #tripleo19:40
cd-undercloud************** overcloud complete status=1 ************19:40
*** cd-undercloud has quit IRC19:40
jprovazns/then same heat/then heat/19:40
lifelessjprovazn: I'm not sure about OS::Heat::RandomString for that19:41
*** rcarrillocruz has joined #tripleo19:42
lifelessjprovazn: we need to make sure its a resource that won't get recreated by mistake19:42
lifelessjprovazn: I suspect on -create => new string19:42
lifelessupdates can recreate resources19:42
jprovaznpfff19:42
*** rcarrillocruz1 has quit IRC19:43
jprovaznlifeless, then using RandomString looks little bit suboptimal in this case19:44
jprovaznbecause it would be rewritten anyway19:45
*** morazi has quit IRC19:45
*** rcarrillocruz1 has joined #tripleo19:46
*** rcarrillocruz has quit IRC19:46
pleia2lifeless:19:47
pleia2Feb 14 19:44:23 localhost sshd[577]: Invalid user fedora from 74.95.15.4119:47
pleia2Feb 14 19:44:23 localhost sshd[577]: input_userauth_request: invalid user fedora [preauth]19:47
pleia2Feb 14 19:44:23 localhost sshd[577]: Connection closed by 74.95.15.41 [preauth]19:47
pleia2Feb 14 19:44:41 localhost useradd[635]: new group: name=fedora, GID=100019:47
pleia2Feb 14 19:44:41 localhost useradd[635]: new user: name=fedora, UID=1000, GID=1000, home=/home/fedora, shell=/bin/bash19:47
pleia2AAAHHHHAHA19:47
pleia2lifeless: but now I can fix it :)19:47
*** jtomasek has quit IRC19:48
SpamapSlifeless: OS::Heat::RandomString stays stable for the life of the stack.19:48
SpamapSlifeless: unless you change the 'salt' parameter.19:48
SpamapSjprovazn: ^^19:48
jprovaznSpamapS, cool, thanks19:48
lifelesspleia2: race condition ?19:49
*** cd-undercloud has joined #tripleo19:50
cd-undercloud************** overcloud complete status=1 ************19:50
*** cd-undercloud has quit IRC19:50
jprovaznSpamapS, about this https://review.openstack.org/#/c/68392/4/elements/rabbitmq-server/os-config-applier/etc/rabbitmq/rabbitmq-env.conf - the reason why it's not hostname is that I need a simple way how to get IP and hostname pairs for all nodes on each node in cluster19:50
pleia2lifeless: yeah19:50
pleia2lifeless: not sure about best solution yet, but I'll investigate a few19:51
jprovaznSpamapS, and also nodename in cluster should not change during lifetime19:51
SpamapSjprovazn: Right, can't you just use Fn::GetAtt [ OtherServer, instance_name ]  the same way you get the network address?19:51
SpamapSjprovazn: that instance_name will stay stable as long as the instance is never replaced... which is fine because it will lose its' state if it is replaced anyway.19:52
lifelesspleia2: whinge upstream? :)19:52
jprovaznSpamapS, ok, can try19:53
lifelessjprovazn: btw there is a Gn::Join in heat19:53
lifelessjprovazn: it can do the string manioulation to ,join lists for you19:53
pleia2lifeless: hah, perhaps19:54
jprovaznlifeless, yes, this would be handy for places where just hostnames delimited by ',' are used, but I need IP too for /etc/hostnames, so hostnames list in heat template would be duplicit19:58
*** spzala has joined #tripleo19:58
*** morazi has joined #tripleo19:59
*** jprovazn is now known as jprovazn_afk19:59
lifelessjprovazn: there's no rule against duplicating data in the meatadata dict given machines20:00
*** cd-undercloud has joined #tripleo20:00
cd-undercloud************** overcloud complete status=1 ************20:00
*** cd-undercloud has quit IRC20:00
lifelessjprovazn_afk: as long as the source is single-location, heat can format it for nice consumption by instances20:00
lifeless2014-02-14 19:05:26.833 | Removing /opt/stack/new//user.qcow2-new20:00
lifeless2014-02-14 19:05:26.880 | Waiting for the overcloud stack to be ready20:00
lifeless2014-02-14 19:43:22.747 | Build timed out (after 125 minutes). Marking the build as failed.20:00
lifeless2014-02-14 19:43:22.921 | /opt/stack/new//tripleo-incubator/scripts/wait_for: line 55:  6421 Terminated              sleep $SLEEPTIME20:00
lifeless2014-02-14 19:43:22.926 | + get_state_from_host seed root@192.168.1.17320:00
*** vkozhukalov has joined #tripleo20:02
*** AaronGr is now known as AaronGr_Zzz20:05
*** AaronGr_Zzz is now known as AaronGr20:07
*** cd-undercloud has joined #tripleo20:10
cd-undercloud************** overcloud complete status=1 ************20:10
*** cd-undercloud has quit IRC20:10
SpamapS+ devtest_overcloud.sh vlan25 138.35.77.4/25 eth2 138.35.77.1 138.35.77.131 138.35.77.254 138.35.77.128/25 /opt/stack/tr20:11
SpamapSipleo-incubator/tripleo-cloud/tripleo-cd-admins /opt/stack/tripleo-incubator/tripleo-cloud/tripleo-cd-users20:11
SpamapS[2014/02/14 07:59:40 PM] [ERROR] key arch does not exist in ['/opt/stack/testenv.json']20:11
lifelessSpamapS: our testenv definition will need to be  updated on tripleo20:12
lifeless-undercloud20:12
SpamapSnot sure I understand.. but I'll look into it a bit later20:12
SpamapShave to leave on long errands now :-P20:12
lifelessSpamapS: look at the schema dox in devtest_setpu.sh20:12
SpamapSYeah, so I didn't realize we were even writing a "testenv.json"20:13
SpamapSsince this is not a test env20:13
SpamapSit is a real env20:13
lifelessnot sure tere is a difference :)20:13
SpamapSif they're intended to be the same we should call them just "environments" :-P20:14
SpamapSmy impression of testenv is "thing to run lots of these on one box"20:14
lifelessit describes the environment  devtest sripts run in20:14
SpamapSwe don't have one for the cd cloud20:15
lifelessso yes, its an environment, and devtest_testenv creates a test environment20:15
lifelessright, we need one know, to capture the arch20:15
SpamapSNODE_ARCH doesn't push it in?20:16
SpamapSgah.. out of time20:16
lifelessI've created one20:16
SpamapS-> long errands :-P20:16
SpamapSbbl20:17
*** ramishra has joined #tripleo20:17
ekarlsolifeless:20:19
ekarlso10:33 < ekarlso> hmm, do I need to do something special with bridges in order to get things to work or ?20:19
ekarlso10:33 < ekarlso> had first a problem with ovs + briding, so removed the bridge module and I deleted the default network now it fails to run devtest_seed.sh due to default network missing p20:19
*** rpodolyaka1 has quit IRC20:19
lifelessekarlso: running setup-network might fix that20:20
*** morazi has quit IRC20:20
lifelessekarlso: I'm persuming you're running devtest on a lcaol machine20:20
*** morazi has joined #tripleo20:21
*** ramishra has quit IRC20:22
*** akuznetsov has quit IRC20:22
*** akuznetsov has joined #tripleo20:23
*** cadenzajon has quit IRC20:29
*** e0ne has quit IRC20:35
rwsulifeless: about https://review.openstack.org/#/c/72524/, so I see that you want to put the MESSAGING_BACKEND into EXTRA_ARGS. I find using EXTRA_ARGS confusing because that element is required for the image to work, not extra20:36
rwsumaybe I'm not understanding your intention20:36
lifelessrwsu: everything in EXTRA_ARGS is needed for the image to work too20:36
lifelessrwsu: but like I said yesterday, do whatever makes sense to you, but fix the thing where you were installing it into the hypervisor, and I'll do a review later.20:37
lifelessrwsu: my suggestion was to permit hypervisor vs ctl plane specific extra args.20:37
lifelessrwsu: I don't think we should model every variation with new variables.20:37
lifelessrwsu: it just becomes a maintenance headache20:38
openstackgerritlifeless proposed a change to openstack/tripleo-image-elements: Fixup tripleo-cd fallout from the CI push.  https://review.openstack.org/7370820:38
rwsulifeless: ok, i'm hunged up on the name EXTRA.. I thought it would be where stackuser would eventually go, but i'll push another update to the patch with your suggestion, thanks20:40
openstackgerritlifeless proposed a change to openstack/tripleo-incubator: Make the overcloud-user config identify itself  https://review.openstack.org/7346220:48
openstackgerritlifeless proposed a change to openstack/tripleo-incubator: Permit specifying ovs bridge mappings too.  https://review.openstack.org/7170620:48
openstackgerritlifeless proposed a change to openstack/tripleo-incubator: Stop trying to copy out SSH keys from notcompute.  https://review.openstack.org/7346520:48
lifelessrwsu: I suggest there need to be two patches; one that addresses the ability to add elements to ctlplane only, and one that does the messaging choice stuff.20:49
lifelessok, I have to run - ciao!20:49
*** akuznetsov has quit IRC20:52
rwsulater, have a good weekend20:53
*** dprince has quit IRC21:03
*** shardy_afk is now known as shardy21:03
openstackgerritDevananda van der Veen proposed a change to openstack/tripleo-incubator: Add Ironic bits to _undercloud  https://review.openstack.org/7296921:03
*** cadenzajon has joined #tripleo21:11
*** rcarrillocruz has joined #tripleo21:12
*** rcarrillocruz1 has quit IRC21:14
*** jprovazn_afk has quit IRC21:17
*** ramishra has joined #tripleo21:18
*** ramishra has quit IRC21:23
*** peoplemerge has quit IRC21:35
*** cd-undercloud has joined #tripleo21:41
cd-undercloud************** overcloud complete status=1 ************21:41
*** cd-undercloud has quit IRC21:41
openstackgerritA change was merged to openstack/tripleo-image-elements: Remove no longer needed generation of the virtual-power-key  https://review.openstack.org/7363021:44
*** peoplemerge has joined #tripleo21:44
*** rcarrillocruz1 has joined #tripleo21:47
*** rcarrillocruz has quit IRC21:49
*** bauzas has quit IRC21:52
*** ftcjeff has joined #tripleo21:53
*** edmund1 has joined #tripleo21:58
*** edmund has quit IRC22:00
*** jdob has quit IRC22:03
*** rcarrillocruz has joined #tripleo22:03
*** rcarrillocruz1 has quit IRC22:05
*** rcarrillocruz1 has joined #tripleo22:07
*** rcarrillocruz has quit IRC22:09
*** matty_dubs is now known as matty_dubs|gone22:09
*** panda has joined #tripleo22:11
*** panda_ has quit IRC22:15
*** edmund1 has quit IRC22:16
*** ramishra has joined #tripleo22:19
*** lblanchard has quit IRC22:19
*** ramishra has quit IRC22:23
*** CaptTofu_ has quit IRC22:31
*** rwsu has quit IRC22:34
*** rcarrillocruz has joined #tripleo22:42
*** rcarrillocruz1 has quit IRC22:43
ekarlsolifeless: it's a cloud instance22:50
*** ftcjeff has quit IRC23:02
*** cd-undercloud has joined #tripleo23:03
cd-undercloud************** overcloud complete status=1 ************23:03
*** cd-undercloud has quit IRC23:03
*** rwsu has joined #tripleo23:04
*** vkozhukalov has quit IRC23:04
*** ramishra has joined #tripleo23:20
*** rwsu has quit IRC23:22
*** ramishra has quit IRC23:24
*** chrisfoster has joined #tripleo23:25
*** rbrady has quit IRC23:25
*** chrisfoster has quit IRC23:27
*** rcarrillocruz1 has joined #tripleo23:33
*** rcarrillocruz has quit IRC23:34
*** rcarrillocruz has joined #tripleo23:41
*** rcarrillocruz1 has quit IRC23:43
*** lazy_prince has joined #tripleo23:51
*** killer_prince has quit IRC23:52
*** lazy_prince is now known as killer_prince23:52
*** peoplemerge has quit IRC23:56
*** peoplemerge has joined #tripleo23:58
*** killer_prince has quit IRC23:59

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!