Monday, 2016-06-27

*** oneswig has joined #tripleo00:04
*** oneswig has quit IRC00:09
*** ooolpbot has joined #tripleo00:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION00:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159473200:10
*** ooolpbot has quit IRC00:10
openstackLaunchpad bug 1594732 in tripleo "CI: No connected gearman servers" [Critical,Confirmed] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)00:10
*** rook has joined #tripleo00:26
*** limao_ has joined #tripleo00:38
*** akshai has joined #tripleo00:45
*** akshai has quit IRC01:06
*** ooolpbot has joined #tripleo01:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION01:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159473201:10
*** ooolpbot has quit IRC01:10
openstackLaunchpad bug 1594732 in tripleo "CI: No connected gearman servers" [Critical,Confirmed] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)01:10
*** akshai has joined #tripleo01:22
*** akshai_ has joined #tripleo01:25
*** akshai has quit IRC01:28
*** dsariel has quit IRC01:29
*** akshai has joined #tripleo01:35
*** akshai_ has quit IRC01:38
*** hanchao has joined #tripleo01:56
*** dsariel has joined #tripleo01:56
*** ooolpbot has joined #tripleo02:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION02:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159639002:10
*** ooolpbot has quit IRC02:10
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]02:10
*** ebalduf has quit IRC02:47
*** ebalduf has joined #tripleo03:03
*** akshai has quit IRC03:05
*** ooolpbot has joined #tripleo03:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION03:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159639003:10
*** ooolpbot has quit IRC03:10
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]03:10
*** pleia2_ is now known as pleia203:15
*** coolsvap has joined #tripleo03:17
*** _milan_ has joined #tripleo03:19
*** akshai has joined #tripleo03:20
*** milan has quit IRC03:21
*** shadower has quit IRC03:25
*** shadower has joined #tripleo03:25
*** lynxman has quit IRC03:26
*** lynxman has joined #tripleo03:34
*** ramishra has joined #tripleo03:38
*** shivrao has joined #tripleo03:40
*** shivrao has quit IRC04:02
*** shivrao has joined #tripleo04:04
*** oneswig has joined #tripleo04:06
*** akshai has quit IRC04:09
*** dsariel has quit IRC04:09
*** ooolpbot has joined #tripleo04:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION04:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159639004:10
*** ooolpbot has quit IRC04:10
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]04:10
*** oneswig has quit IRC04:10
sshnaidmEmilienM, hi04:10
sshnaidmEmilienM, where do you see this error? Can you provide a link in bug? https://bugs.launchpad.net/tripleo/+bug/159639004:11
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]04:11
*** links has joined #tripleo04:15
*** andreas-f has joined #tripleo04:37
*** ramishra has quit IRC04:55
*** ramishra has joined #tripleo04:55
*** masco has joined #tripleo05:06
*** numans has joined #tripleo05:08
*** ooolpbot has joined #tripleo05:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION05:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159639005:10
*** ooolpbot has quit IRC05:10
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]05:10
*** numans has quit IRC05:12
*** numans has joined #tripleo05:12
*** skramaja has joined #tripleo05:17
*** yamahata has quit IRC05:17
*** yamahata has joined #tripleo05:17
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci: WIP: DON'T MERGE, TESTING  https://review.openstack.org/33428805:26
*** shivrao has quit IRC05:27
*** saneax_AFK is now known as saneax05:29
*** oshvartz has joined #tripleo05:35
*** rasca has joined #tripleo05:35
*** fragatina has joined #tripleo05:40
*** fragatina has quit IRC05:40
*** fragatina has joined #tripleo05:41
*** florianf has joined #tripleo05:42
openstackgerritMerged openstack/diskimage-builder: Updated from global requirements  https://review.openstack.org/33367305:48
*** dtantsur|afk is now known as dtantsur05:51
*** jcoufal has joined #tripleo05:57
*** jcoufal has quit IRC05:58
*** masco has left #tripleo06:02
*** bootsha has joined #tripleo06:05
*** ooolpbot has joined #tripleo06:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION06:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159639006:10
*** ooolpbot has quit IRC06:10
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]06:10
*** shivrao has joined #tripleo06:15
*** dsariel has joined #tripleo06:17
*** ramishra_ has joined #tripleo06:19
*** ramishra has quit IRC06:21
*** ramishra_ has quit IRC06:25
*** ramishra has joined #tripleo06:25
*** rcernin has joined #tripleo06:27
*** liverpooler has joined #tripleo06:28
openstackgerritMerged openstack/diskimage-builder: Handle locales install on Fedora 24  https://review.openstack.org/33311806:29
*** dixiaoli has joined #tripleo06:40
*** pcaruana has joined #tripleo06:41
*** sthillma has joined #tripleo06:44
*** sthillma has quit IRC06:48
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci: Fix JOB_NAME parameter  https://review.openstack.org/33430506:49
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci: Fix JOB_NAME parameter  https://review.openstack.org/33430506:53
*** ccamacho has joined #tripleo06:57
*** d0ugal has joined #tripleo06:58
*** d0ugal has quit IRC06:59
*** tesseract- has joined #tripleo06:59
*** d0ugal has joined #tripleo07:00
*** jprovazn has joined #tripleo07:04
*** yamahata has quit IRC07:06
*** ebalduf has quit IRC07:09
*** ooolpbot has joined #tripleo07:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION07:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159639007:10
*** ooolpbot has quit IRC07:10
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]07:10
*** jtomasek has joined #tripleo07:11
*** mcornea has joined #tripleo07:12
*** oneswig has joined #tripleo07:12
openstackgerritCarlos Camacho proposed openstack/puppet-tripleo: Enable TLS in the internal network for keystone  https://review.openstack.org/32702907:14
*** jpena|off is now known as jpena07:16
*** bvandenh has joined #tripleo07:19
*** anshul has joined #tripleo07:21
*** anshul is now known as Guest5996307:22
d0ugalhrm, looks like CI job are not beings started?07:22
*** ebarrera has joined #tripleo07:23
ccamachoHey d0ugal, yeahp :(07:26
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci: Fix JOB_NAME parameter  https://review.openstack.org/33430507:27
sshnaidmccamacho, d0ugal where do you look?07:27
*** limao_ has quit IRC07:28
d0ugalsshnaidm: on gerrit, the jobs are not starting :D07:28
d0ugalsshnaidm: and http://tripleo.org/cistatus.html07:28
*** ifarkas has joined #tripleo07:28
*** limao has joined #tripleo07:29
sshnaidmd0ugal, weird, just submitted to tripleo-ci and it worked07:30
d0ugalsshnaidm: It looks like it might only be working for tripleo-ci? not sure07:30
ccamachoIn my case, all my jobs were kiled by timeouts, no hosts available, also this was reported few hours ago https://bugs.launchpad.net/tripleo/+bug/159639007:30
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]07:30
sshnaidmccamacho, yeah, master jobs fail because of "no hosts"07:31
*** dmk0202 has joined #tripleo07:31
*** jpich has joined #tripleo07:31
sshnaidmccamacho, although I don't see where is "qemu-img: No space left on device" ?07:31
ccamachome neither, the bug don't have to many description, but still able to see other different errors...07:33
*** panda has quit IRC07:34
sshnaidmd0ugal, which patch do you submit that doesn't run jobs?07:34
*** numans has quit IRC07:35
*** panda has joined #tripleo07:35
d0ugalsshnaidm: https://review.openstack.org/#/c/332671/07:35
*** aufi has joined #tripleo07:36
sshnaidmd0ugal, half hour before?07:36
d0ugalsshnaidm: I just rechecked that one, because it should pass and I wanted to test CI07:36
*** zaneb has joined #tripleo07:36
d0ugalsshnaidm: Yeah, I also tried yesterday with one patch07:37
* d0ugal looks for it07:37
sshnaidmd0ugal, I see your patch running: http://status.openstack.org/zuul/07:38
sshnaidmd0ugal, remaining time: 2 hr 14 min07:38
d0ugalsshnaidm: oh, cool - maybe that problem is resolved then07:39
d0ugalI'll wait a bit and see.07:39
*** shivrao has quit IRC07:39
*** olap has joined #tripleo07:41
*** zoli_gone-proxy is now known as zoliXXL07:43
zoliXXLgood morning07:45
*** paramite has joined #tripleo07:45
*** shardy has joined #tripleo07:47
*** numans has joined #tripleo07:48
*** lucas-afk is now known as lucasagomes08:02
*** athomas has joined #tripleo08:03
*** ooolpbot has joined #tripleo08:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION08:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159639008:10
*** ooolpbot has quit IRC08:10
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]08:10
*** dixiaoli has quit IRC08:16
*** numans has quit IRC08:17
*** itamarl has joined #tripleo08:17
openstackgerritMerged openstack/instack-undercloud: Add net config override  https://review.openstack.org/30595708:20
*** andreas-f has quit IRC08:28
*** olap has quit IRC08:29
jistrmarios: morning :) do you think you could add those to your review queue? e.g. 2 of them only need another +2 and they have a green CI https://etherpad.openstack.org/p/tripleo-liberty-mitaka-upgrades08:30
mariosjistr: hey man, yeah on it (I am looking at keystone l..m now)08:30
marioslgtm so far08:30
jistrmarios: awesome, thanks! :)08:30
*** olap has joined #tripleo08:31
*** numans has joined #tripleo08:31
*** electrofelix has joined #tripleo08:32
*** shardy has quit IRC08:32
*** shardy has joined #tripleo08:34
*** derekh has joined #tripleo08:36
openstackgerritMerged openstack/tripleo-heat-templates: Keystone liberty mitaka upgrade step  https://review.openstack.org/30223508:42
openstackgerritDerek Higgins proposed openstack-infra/tripleo-ci: Nothing to see here  https://review.openstack.org/11101108:57
*** mgould|afk is now known as mgould09:04
openstackgerritMerged openstack/tripleo-heat-templates: Allow pacemaker ports in firewall  https://review.openstack.org/33402209:05
openstackgerritMerged openstack/tripleo-heat-templates: Allow sahara ports in firewall  https://review.openstack.org/33402309:08
openstackgerritMerged openstack/tripleo-heat-templates: Nova needs the proper volumes to use Cinder  https://review.openstack.org/30159209:08
*** ooolpbot has joined #tripleo09:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION09:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159639009:10
*** ooolpbot has quit IRC09:10
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]09:10
*** olap has quit IRC09:12
*** sambetts|afk is now known as sambetts09:13
*** olap has joined #tripleo09:14
*** chem has joined #tripleo09:17
matbujistr: hi, i'm looking at the https://review.openstack.org/#/c/325205/22/extraconfig/tasks/major_upgrade_controller_pacemaker_1.sh09:18
d0ugalsshnaidm: I stlll don't see tripleo gates for 332671?09:18
matbujistr: i'm wondering if we could make a general mariadb upgrade script09:18
matbu s/general/generic/09:19
jistrmatbu: i think that's the way bandini and dciabrin intended it to work09:19
matbujistr: which can be used by the UC upgrade too09:19
*** chem has quit IRC09:19
dciabrinmatbu jistr: hi guys :)09:20
*** chem has joined #tripleo09:20
matbudciabrin: hello :)09:20
openstackgerritDerek Higgins proposed openstack-infra/tripleo-ci: Nothing to see here  https://review.openstack.org/11101109:21
*** _milan_ has quit IRC09:22
dciabrinmatbu, we wanted to have it generic eventually, bandini though that unblocking the overcloud upgrade and documenting manual steps on the undercloud would be sufficient for a first step09:23
jistrmatbu, dciabrin: i like the idea, there are some practical challenges there (e.g. undercloud doesn't use pacemaker, which would change the procedure quite a bit i think, and there's no easy way currently to share that code between UC and OC in this sense)09:24
jistrso perhaps going step by step here would be good09:24
*** limao has quit IRC09:24
sshnaidmd0ugal, as I see the "upgrades" job failed, others runs, but it will take another 1-2 hours, not sure why the queue is so long..09:25
d0ugalsshnaidm: how did you find that? I can't see it?09:26
sshnaidmd0ugal, logs from upgrades job: http://logs.openstack.org/71/332671/1/check-tripleo/gate-tripleo-ci-centos-7-upgrades/4deea89/09:26
d0ugalsshnaidm: Thanks09:26
sshnaidmd0ugal, yeah, it's on http://status.openstack.org/zuul/09:26
*** limao has joined #tripleo09:27
sshnaidmd0ugal, you can find your patch by ctrl-F nad then click on jobs, you'll see either running log or saved logs.09:27
d0ugalsshnaidm: right, thanks :)09:27
*** sshnaidm is now known as sshnaidm|afk09:28
dciabrinjistr, i agree, we would benefit from a dedicated patch/review for UC upgrade, where we could focus on how to best share code.09:28
jistr+109:28
*** limao has quit IRC09:29
*** rasca has quit IRC09:29
*** limao has joined #tripleo09:29
matbudciabrin: ack09:30
openstackgerritmathieu bultel proposed openstack/tripleo-quickstart: Add extra-vars for enable pacamaker to upgrade ci script  https://review.openstack.org/33437109:37
*** bootsha has quit IRC09:47
*** tosky has joined #tripleo09:47
*** rasca has joined #tripleo09:48
*** osp has joined #tripleo09:57
*** rcernin has left #tripleo10:00
*** akrivoka has joined #tripleo10:00
openstackgerritDerek Higgins proposed openstack-infra/tripleo-ci: Free up more diskspace on jenkins slaves befor CI starts  https://review.openstack.org/33438310:01
*** limao has quit IRC10:06
*** ooolpbot has joined #tripleo10:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION10:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159639010:10
*** ooolpbot has quit IRC10:10
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]10:10
openstackgerritDerek Higgins proposed openstack-infra/tripleo-ci: Free up more diskspace on jenkins slaves befor CI starts  https://review.openstack.org/33438310:23
derekhshardy: RE^ , I'm not 100% sure what you meant, does my comment make sense or am I missing something ?10:24
openstackgerritMartin Mágr proposed openstack/tripleo-heat-templates: Availability monitoring support  https://review.openstack.org/25478810:25
*** bootsha has joined #tripleo10:26
shardyderekh: ah, I think I missed that opt/git is still removed, thanks10:26
derekhshardy: ok10:27
amoralejhi, it seems that https://github.com/openstack/tripleo-quickstart/commit/0d30b47966065ceabca54664cfdc1ad35f6d6944 has broken build-image quickstart ci jobs in RDO10:36
amoralejhttps://ci.centos.org/job/tripleo-quickstart-promote-master-delorean-build-images/365/console10:36
*** Goneri has joined #tripleo10:39
openstackgerritJulie Pichon proposed openstack/python-tripleoclient: Add 'openstack overcloud node provide' command  https://review.openstack.org/33441110:40
*** links has quit IRC10:43
*** gfidente has joined #tripleo10:47
*** fragatina has quit IRC10:48
*** fragatina has joined #tripleo10:49
jpichd0ugal: ^ When you have a few moments, I could use your wisdom!10:51
*** links has joined #tripleo10:55
*** Goneri has quit IRC10:58
*** _milan_ has joined #tripleo11:03
*** ccamacho is now known as ccamacho|lunch11:03
*** ooolpbot has joined #tripleo11:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION11:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159639011:10
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]11:10
*** ooolpbot has quit IRC11:10
*** Goneri has joined #tripleo11:11
*** Goneri has quit IRC11:18
*** zaneb has quit IRC11:20
openstackgerritCarlos Camacho proposed openstack/tripleo-heat-templates: Gnocchi composable roles  https://review.openstack.org/31841311:20
openstackgerritKeith Schincke proposed openstack/puppet-tripleo: Add RGW to the Ceph mon profile.  https://review.openstack.org/33408111:25
*** weshay_afk is now known as weshay11:26
openstackgerritCarlos Camacho proposed openstack/puppet-tripleo: Add gnocchi profiles  https://review.openstack.org/31552711:28
openstackgerritCarlos Camacho proposed openstack/tripleo-heat-templates: Gnocchi composable roles  https://review.openstack.org/31841311:29
*** sshnaidm|afk is now known as sshnaidm11:30
*** jefrite has quit IRC11:31
*** bfournie1 has quit IRC11:32
sshnaidmderekh, hi11:34
sshnaidmderekh, ping me please when you're available for bluejeans chat11:34
*** jefrite has joined #tripleo11:35
derekhNice, you can now telnet to running jenkins slaves to get the console log11:35
derekhsshnaidm: ready when you are11:35
derekhsshnaidm: sending you a lin11:36
derekhk11:36
sshnaidmderekh, ok11:36
*** lblanchard has joined #tripleo11:38
*** hjensas has joined #tripleo11:41
*** akshai has joined #tripleo11:46
EmilienMhello11:46
*** akshai has quit IRC11:48
EmilienMderekh, sshnaidm: how is CI going? did we track down space issue?11:48
EmilienMok I see 33438311:50
EmilienMeveryone: current status of CI is also broken because we need https://review.openstack.org/#/c/333511/ to land11:50
*** zoliXXL is now known as zoli|lunch11:53
*** fultonj has joined #tripleo11:54
openstackgerritwes hayutin proposed openstack/tripleo-quickstart: fix tripleo-roles after explicit teardown change  https://review.openstack.org/33443311:54
*** jpena is now known as jpena|lunch11:56
*** lucasagomes is now known as lucas-hungry11:59
*** rhallisey has joined #tripleo12:00
*** trown is now known as trown|outtypewww12:04
*** rodrigods has quit IRC12:06
*** rodrigods has joined #tripleo12:06
EmilienMderekh: [Controller]: CREATE_FAILED ResourceInError: resources.Controller: Went to status ERROR due to "Message: No valid host was found. There are not enough hosts available., Code: 500"12:08
EmilienMseen in zuul for 334383 (nonha job)12:08
sshnaidmEmilienM, this patch should solve "no hosts"?12:08
*** jcoufal has joined #tripleo12:08
EmilienMno12:08
EmilienMit's for space12:08
EmilienMthe thing that might help to fix "no hosts found" has been merged in Nova recently12:09
EmilienMbut I don't know how updated is our OpenStack that deploy tripleo jobs12:09
derekhEmilienM: and whats breaking that needs this fix? https://review.openstack.org/#/c/333511/12:10
*** ooolpbot has joined #tripleo12:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION12:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159639012:10
*** ooolpbot has quit IRC12:10
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]12:10
*** ccamacho|lunch is now known as ccamacho12:10
*** hewbrocca-afk is now known as hewbrocca12:10
*** amoralej is now known as amoralej|lunch12:11
EmilienMderekh: last week we had a major outage where delorean was not failing in our CI so we deployed THT from current packages and CI was not failing during 1 day, while it should have, on some patches that landed12:12
EmilienMderekh: so we had to revert some work12:12
EmilienMderekh: and this https://review.openstack.org/#/c/333511/ patch is fixing some issues on overcloud12:12
EmilienMderekh: https://review.openstack.org/#/c/334383/2/toci_gate_test.sh12:14
EmilienMwhy don't rm /opt/stack/cache/files/ubuntu-* ?12:14
EmilienMwe'll hit the problem probably with ubuntu 14.04 too12:14
derekhEmilienM: I only went for the files that we're there but ya we should future proof it a bit12:16
derekhEmilienM: once the CI run is finished I'll update it12:16
derekhEmilienM: so, are you saying that even with the disk space issue fixed, still nothing will pass until we merge https://review.openstack.org/#/c/333511/12:17
*** bfournie has joined #tripleo12:18
EmilienMderekh: maybe not, I've seen different failures over the last 3 days12:18
EmilienMhttp://logs.openstack.org/83/334383/2/check-tripleo/gate-tripleo-ci-centos-7-nonha/005becd/logs/postci.txt.gz#_2016-06-27_12_09_00_00012:19
EmilienMMessage: No valid host was found. There are not enough hosts available., Code: 50012:19
EmilienMderekh: this is on your tripleo-ci patch for free space ^12:19
EmilienMderekh: do we have another problem?12:19
derekhEmilienM: maybe, I havn't been keeping track lately, trying to have rh2 ready for next week12:20
*** pradk has joined #tripleo12:21
derekhEmilienM: want to just merge the disk space fix and then we'll see if anything starts passing?12:21
EmilienMderekh: I can +A it12:21
EmilienMI was waiting for at least one green jo12:21
EmilienMjob*12:21
derekhEmilienM: yup, we can wait either, the ha jobs is still running12:22
EmilienMderekh: yeah but I think it's fine, your patch was already executed and it worked12:23
*** jayg|g0n3 is now known as jayg12:23
EmilienMderekh: so I think we can go ahead. Ok?12:23
derekhEmilienM: ok, go for it12:23
*** bfournie has quit IRC12:23
openstackgerritMerged openstack/tripleo-heat-templates: L->M upgrades keystone change of path of the paste_deploy config_file  https://review.openstack.org/33404412:23
openstackgerritMerged openstack-infra/tripleo-ci: Free up more diskspace on jenkins slaves befor CI starts  https://review.openstack.org/33438312:23
EmilienMderekh: ok done ^12:24
*** bfournie has joined #tripleo12:24
derekhEmilienM: sshnaidm has put a patch on our geard server to solve the problems getting testenvs this seems to be working12:24
EmilienMlet's recheck 333511 now12:24
EmilienMand see if it pass CI12:24
EmilienMwe'll need it asap too12:24
derekhEmilienM: so we shouldn't have to keep restarting geard12:25
EmilienMderekh: link?12:25
derekhEmilienM: I don't think he has push it to gerrit yet, sshnaidm is the patch anywhere?12:25
EmilienMderekh: btw, did you see https://review.openstack.org/#/c/333419/ ? It should help to stop leaking floating ips12:25
sshnaidmderekh, EmilienM - no, just in server, I'd like to check it before preparing to submit12:26
derekhEmilienM: I think maybe that this may help the "not enough hosts available" problem, as the restarting may have caused testenvs to be used by two jobs simultaneously12:27
*** rbrady has quit IRC12:27
EmilienMderekh: mhh, why do we have it since recently?12:28
derekhEmilienM: yup, I saw it, thanks12:28
*** rbrady has joined #tripleo12:28
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: Drop unused VIP params to controller.yaml  https://review.openstack.org/33351112:28
derekhEmilienM: I have a theory its related to the switch  to ZUUL (from jenkins) , but its only a theory12:28
derekhEmilienM: this switch http://lists.openstack.org/pipermail/openstack-dev/2016-June/097584.html12:29
*** derekh is now known as derekh_afk12:29
*** jpena|lunch is now known as jpena12:29
sshnaidmderekh_afk, EmilienM : sort of https://review.openstack.org/#/c/33445212:30
*** rasca has quit IRC12:30
EmilienMsshnaidm: interesting12:30
*** rasca has joined #tripleo12:32
*** wfoster is now known as wfoster_afk12:32
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: Re-enable Ceilometer composable roles for controller  https://review.openstack.org/33349812:35
openstackgerritwes hayutin proposed openstack/tripleo-quickstart: fix tripleo-roles after explicit teardown change  https://review.openstack.org/33443312:35
*** snecklifter has joined #tripleo12:36
*** numans has quit IRC12:37
*** fultonj has quit IRC12:39
*** limao has joined #tripleo12:40
*** bootsha has quit IRC12:41
*** wfoster_afk is now known as wfoster12:41
openstackgerritMerged openstack/python-tripleoclient: Run post deploy config on force  https://review.openstack.org/33009612:42
*** tzumainn has joined #tripleo12:45
*** rlandy has joined #tripleo12:46
openstackgerritMartin Mágr proposed openstack/tripleo-puppet-elements: Install osops-tools-monitoring-oschecks package  https://review.openstack.org/32407512:47
pradkthx marios :)12:51
*** akshai has joined #tripleo12:51
*** bfournie has left #tripleo12:51
*** amoralej|lunch is now known as amoralej12:56
mariospradk: np thanks jistr :)12:57
mariospradk: (he hassles people for reviews in his spare time)12:58
jistryea that's my hobby12:58
*** bfournie has joined #tripleo12:59
*** jprovazn has quit IRC13:00
*** coolsvap has quit IRC13:00
*** derekh_afk is now known as derekh13:01
EmilienMthrash and other folks: no need to do 'recheck'13:04
openstackgerritJames Slagle proposed openstack-infra/tripleo-ci: Fix openstack-tripleo-common package name  https://review.openstack.org/33446713:04
EmilienMthe CI is currently broken again13:05
*** akshai has quit IRC13:05
EmilienMwe'll need 333511 probably13:05
openstackgerritJames Slagle proposed openstack-infra/tripleo-ci: Add mulitnode CI job support to tripleo-ci  https://review.openstack.org/32477713:06
*** akshai has joined #tripleo13:09
*** lucas-hungry is now known as lucasagomes13:09
openstackgerritAlfredo Moralejo proposed openstack/tripleo-quickstart: Fix build-images.yml after teardown change  https://review.openstack.org/33447213:10
*** ooolpbot has joined #tripleo13:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION13:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159639013:10
*** ooolpbot has quit IRC13:10
openstackLaunchpad bug 1596390 in tripleo "qemu-img: No space left on device" [Critical,Confirmed]13:10
EmilienMjistr, marios: can you quickly review https://review.openstack.org/#/c/333347/ ? (It's a Gem change, so don't look functional jobs)13:11
EmilienMderekh: removing alert on the bug ^13:11
derekhEmilienM: ok13:11
mariosEmilienM: looking13:12
*** sshnaidm has quit IRC13:12
*** trozet has joined #tripleo13:12
*** [1]cdearborn has joined #tripleo13:13
*** zaneb has joined #tripleo13:14
*** sshnaidm has joined #tripleo13:16
*** zoli|lunch is now known as zoli|wfh13:22
*** limao has quit IRC13:25
*** limao has joined #tripleo13:26
amoralejweshay, question about https://review.openstack.org/#/c/334472/13:26
amoralejwhy "role: tripleo-inventory" is now required?13:26
weshayamoralej, depends if the ansible_ssh_args are set13:27
weshayif ansible is expecting ssh.config.ansible then you'll need to run it, if it's not set.. it should work fine as is13:28
weshayamoralej, you can join #oooq13:28
*** skramaja has quit IRC13:29
*** myoung has joined #tripleo13:30
*** xinwu has joined #tripleo13:30
*** weshay is now known as weshay_mtg13:31
*** pradk has quit IRC13:31
*** egafford has joined #tripleo13:32
*** limao_ has joined #tripleo13:33
*** limao_ has quit IRC13:34
*** limao_ has joined #tripleo13:35
*** jprovazn has joined #tripleo13:35
*** limao has quit IRC13:37
*** xinwu has quit IRC13:37
*** ebalduf has joined #tripleo13:39
*** andreas-f has joined #tripleo13:40
*** skramaja has joined #tripleo13:43
*** limao has joined #tripleo13:44
openstackgerritPradeep Kilambi proposed openstack/python-tripleoclient: Run post deploy config on force  https://review.openstack.org/33448613:45
*** pradk has joined #tripleo13:45
openstackgerritPradeep Kilambi proposed openstack/python-tripleoclient: Run post deploy config on force  https://review.openstack.org/33448613:47
*** limao_ has quit IRC13:47
*** eggmaster has joined #tripleo13:47
pradkjistr, marios, backport to mitaka https://review.openstack.org/#/c/334486/13:47
*** r-mibu has quit IRC13:48
*** r-mibu has joined #tripleo13:48
*** skramaja has quit IRC13:55
*** tosky has quit IRC13:56
*** bootsha has joined #tripleo13:56
*** akshai has quit IRC13:59
chemhi, I tryied a quickstart --release liberty and it fails.  It looks like packages in the undercloud.qcows conflict with package installed by instack during undercloud step.  Does someone could help ?13:59
*** fultonj has joined #tripleo14:02
*** fultonj has quit IRC14:02
chempanda: this is the issue I've got.  So is there a way around it ?14:02
*** fultonj has joined #tripleo14:03
*** akshai has joined #tripleo14:03
shardychem: trown|outtypewww or larsks may be able to help14:03
chemshardy: ack, thanks14:04
*** liverpooler has quit IRC14:05
socialmatbu: solution is to have cleanup in undercloud puppet for keystone and mariadb14:07
*** skramaja has joined #tripleo14:10
matbusocial: yes i saw your review14:11
matbusocial: do you have a WIP review for that ?14:11
socialI just managed to reproduce the issue so, soon :)14:11
matbusocial: ack, you want to remove the "yum update" there also ?14:12
shardyAnyone seeing local delorean builds fail with Error: No Package found for $foo?14:12
shardyI've deleted ~/tripleo/delorean and re-run tripleo.sh --delorean-setup but clearly still have some thing stale or the spec/repo is broken14:13
shardyI'm trying to build openstack/heat14:13
*** bootsha has quit IRC14:13
shardyalternatively, anyone got a link to the location where delorean builds for under-review patches are created?14:13
shardybnemec: IIRC you linked it to me a few weeks ago, but I'm failing to find it atm14:13
trozetshardy: do you have any update on if composable services are done for neutron yet?14:14
openstackgerritAlfredo Moralejo proposed openstack/tripleo-quickstart: Fix build-images.yml after teardown change  https://review.openstack.org/33447214:16
bnemecIt's https://repos.fedorapeople.org/repos/openstack-m/jenkins/rpm-build/ but that job doesn't run for Heat atm.14:16
bnemecshardy: ^14:16
bnemecIt's also broken for a couple of projects because some of the packaging repos disappeared. :-/14:16
*** rajinir has joined #tripleo14:20
EmilienMtrozet: https://etherpad.openstack.org/p/tripleo-composable-services14:20
EmilienMtrozet: we made some progress on controller, but not on compute, it's WIP14:20
EmilienMtrozet: progress is really slow because of CI issues that we have every day14:20
trozetEmilienM: thanks for the link.  What do you mean compute, that's just openvswitch agent right?14:21
ccamachoHey EmilienM pradk, quick question, can you check https://review.openstack.org/#/c/318413/ ?? I found the problem in the deployment, is related to ::gnocchi::db::sync when using it the deployment doesn't work, not sure if can be related to https://review.openstack.org/#/c/295944/14:21
EmilienMccamacho: right now CI is broken so I'll look once we have valid failures in logs14:21
EmilienMtrozet: not only, plugin conf for some drivers that require it14:21
pradkccamacho, that patch went in a while back and it only skips storage sync.. so should cause issues14:22
pradkccamacho, any tracebacks in upgrade.log?14:22
pradkshouldnt*14:22
*** ayoung has joined #tripleo14:22
*** limao has quit IRC14:22
trozetEmilienM: I'm not following.  What drivers?14:23
*** limao has joined #tripleo14:23
trozetI see TODO: evacuate neutron bits from compute (WIP emilien)14:23
EmilienMtrozet: bigswitch, etc14:23
EmilienMtrozet: yeah, but there is a bit more, look in the code, there are still some neutron bit14:24
trozetEmilienM: ok will take a look, thanks14:24
ccamachoEmilienM sure, those error are from my local CI. pradk the deployment is not failing in the upgrade, it is failing when deploying the non-ha job locally. Here: https://review.openstack.org/#/c/315527/35/manifests/profile/base/gnocchi/api.pp I have commented the line to see the behavior and the deployment completes when not using it, but not sure, if needed or nor, or if it is an error in the puppet gnocchi manifest14:25
*** jschlueter has quit IRC14:27
*** limao_ has joined #tripleo14:27
*** adarazs has quit IRC14:29
*** aufi has quit IRC14:29
*** jmiu_ has quit IRC14:30
socialmatbu: /q matbu14:30
socialerr :)14:30
matbusocial: hehe14:31
*** limao has quit IRC14:31
*** rook_ has joined #tripleo14:32
*** jschlueter has joined #tripleo14:33
shardybnemec: ack, thanks - I'll actually bookmark it this time ;)14:33
EmilienMccamacho, pradk: about db-sync, do we need to patch puppet-gnocchi or?14:34
*** ebarrera has quit IRC14:34
*** adarazs has joined #tripleo14:34
pradkEmilienM, dont think so.. this is not an issue in packstack or in our tripleo ci.. i think the composable roles patch is failing on something else .. I'll need to look at the gnocchi-upgrade traceback to see what it is14:35
*** jmiu_ has joined #tripleo14:36
ccamachopradk do you need any log file from my CI?14:36
pradkccamacho, can you paste me /var/log/gnocchi/gnocchi-upgrade.log14:37
ccamachosure, just a sec14:37
*** weshay_mtg is now known as weshay14:39
ccamachopradk that file does not exists, no file in /var/log/gnocchi/* in the OC controller, this is the config file http://paste.openstack.org/show/523583/14:42
ccamachopradk is all commented, maybe that's the issue..14:43
pradkccamacho, yea thats weird, there is no db connection so db sync will obviously fail.. wonder if there were any other errors before this14:44
*** akshai has quit IRC14:46
*** limao_ has quit IRC14:47
*** ayoung_ has joined #tripleo14:48
*** limao has joined #tripleo14:48
ccamachopradk not really, but actually, we have those default parameters in the tht submission, so, let me see if they are not passed correctly to the puppet manifests.14:48
EmilienMderekh: why did you revert?14:48
EmilienMhttps://review.openstack.org/#/c/334516/14:48
*** jmiu_ has quit IRC14:49
derekhEmilienM: testing, I think it may be causing the current problems with deploying tripleo14:49
pradkccamacho, did you update the tht after my changes to split the gnocchi services further?14:49
pradkccamacho, thats possible14:49
*** adarazs has quit IRC14:49
*** karts is now known as karthiks14:49
EmilienMderekh: we don't deploy bigswitch, I don't see how14:49
*** jschlueter has quit IRC14:49
derekhEmilienM: I went comparing CI runs from today to those that worked from last week14:49
ccamacholet me check it as is quite possible that the parameters were empty when executing the puppet mainfest14:49
*** limao_ has joined #tripleo14:49
derekhEmilienM: the ones from today install python-networking-bigswitch on the undercloud14:50
EmilienMon undercloud?14:50
derekhEmilienM: tracback on neutron on the undercloud http://paste.openstack.org/show/523585/14:50
EmilienMpuppet doesn't manage bigswitch on underclout14:50
*** limao_ has quit IRC14:51
EmilienMit's not related to bigswitch14:51
*** akshai has joined #tripleo14:51
*** limao_ has joined #tripleo14:51
derekhEmilienM: actually maybe it isn't puppet, looking elsewhere14:51
*** adarazs has joined #tripleo14:53
pandado you have a link to tripleo meeting agenda ?14:53
shardyhttps://wiki.openstack.org/wiki/Meetings/TripleO14:55
shardypanda: ^^14:55
*** limao has quit IRC14:55
EmilienMderekh: do you have a link of the patch that fails?14:55
*** jschluet has joined #tripleo14:55
EmilienMderekh: I  have telnet open on 2 different jobs and I passed undercloud14:55
EmilienMderekh: overcloud is being deployed14:55
*** jschluet is now known as jschlueter14:55
derekhEmilienM: ya and the exception I'm seeing is during the overcloud deploy, in the undercloud neutron14:55
*** jschlueter has quit IRC14:55
*** jschlueter has joined #tripleo14:55
EmilienMmhh14:55
*** jmiu_ has joined #tripleo14:55
*** adarazs has quit IRC14:55
*** adarazs has joined #tripleo14:55
EmilienMderekh: any link?14:55
*** crinkle_ is now known as crinkle14:55
derekhEmilienM: http://logs.openstack.org/27/315527/35/check-tripleo/gate-tripleo-ci-centos-7-nonha/35b4723/logs/undercloud/var/log/neutron/server.txt.gz#_2016-06-27_13_16_06_49914:56
*** limao_ has quit IRC14:57
*** limao has joined #tripleo14:57
EmilienMderekh: is it blocking something?14:57
*** itamarl has quit IRC14:58
*** osp has quit IRC14:58
*** zaneb has quit IRC14:59
derekhEmilienM: So, I think something to do with the bigswitch package is causing the "Message: No valid host was found. There are not enough hosts available" problem14:59
derekhEmilienM: I just don't know yet what / how14:59
EmilienMCREATE_FAILED ResourceInError: resources.CephStorage: Went to status ERROR due to "Message: No valid host was found. There are not enough hosts available., Code: 500"15:00
EmilienMagain15:00
derekhEmilienM: exactly15:00
EmilienMI'm not sure why it's related to bigswitch15:00
EmilienMI don't see any bigswitch thing in logs15:01
derekhEmilienM: console, logs http://logs.openstack.org/27/315527/35/check-tripleo/gate-tripleo-ci-centos-7-nonha/35b4723/console.html#_2016-06-27_11_57_02_87966815:01
EmilienMjistr: you can safely merge https://review.openstack.org/#/c/333347/ please15:01
EmilienMjistr: it's a Gemfile patch, for lint & syntax jobs15:01
derekhEmilienM: its on the undercloud cached image,  somehow15:01
derekhEmilienM: I'm still looking into it15:01
EmilienMderekh: and?15:01
EmilienMderekh: a lot of many other pkgs are updated too15:01
*** jistr is now known as jistr|mtg15:01
*** limao has quit IRC15:02
bnemecderekh: It's a glance error. http://logs.openstack.org/79/329079/4/check-tripleo/gate-tripleo-ci-centos-7-nonha/0ec8e2d/logs/undercloud/var/log/glance/api.txt.gz#_2016-06-27_13_38_16_39515:02
bnemecLooks like the same thing you were seeing in rh2.15:03
EmilienMah it looks better15:03
derekhbnemec: so is swift getting OOM'd again?15:03
bnemecderekh: I don't see any ooms, but it's the same error on the Glance side so something is wrong there.15:04
derekhbnemec: Jun 27 15:12:06 instack.localdomain kernel: Out of memory: Kill process 380 (swift-proxy-ser) score 374 or sacrifice child15:04
derekhbnemec: in ther joulan log for the one you just linked15:05
bnemecOkay, apparently I fail at reading logs.15:05
*** krotscheck is now known as krotscheck_dcm15:05
bnemecOh, I didn't actually pull down the journal.15:05
bnemecFscking binary log formats.15:05
derekhbrb15:07
*** skramaja has quit IRC15:09
EmilienMbnemec: isn't something we had in March?15:09
EmilienMwe needed extra RAM iirc and we have it now15:09
EmilienMslagle was also mentionning high CPU usage, maybe should we reduce testenvs on each host15:09
*** ayoung has quit IRC15:09
*** ayoung_ is now known as ayoung15:09
*** sshnaidm is now known as sshnaidm|afk15:09
slagleEmilienM: that was a while ago15:10
slagleEmilienM: pre-upgrade15:10
slaglewe've added ram and ssd's since then15:11
EmilienMright, but it seems like we're having a similar situation15:11
d0ugaljpich: Commented on your patch15:11
bnemecThe problem derekh was seeing in rh2 is that swift was using absurd amounts of ram.  I don't think we could add enough memory to the CI nodes to get around that.15:11
jpichd0ugal: Cheers!15:12
EmilienMbnemec: maybe we have too much swift proxy workers15:13
*** osp has joined #tripleo15:13
*** ebarrera has joined #tripleo15:13
derekhsomething is causing swift-proxy to use more RAM then it should, we need to figure out what15:13
derekhEmilienM: workers = 215:13
EmilienM2 is not bad15:14
bnemecYeah, we're not talking about a little too much ram.  We're talking about swift eating 80 GB when it's only being asked to serve a single image.15:14
bnemec(this was in rh2, which may or may not be related, but probably is)15:15
openstackgerritCarlos Camacho proposed openstack/puppet-tripleo: Add gnocchi profiles  https://review.openstack.org/31552715:15
*** jistr|mtg is now known as jistr15:16
*** fragatina has quit IRC15:17
pradkccamacho, found the issue i assume? i see you uncommented sync15:17
*** fragatina has joined #tripleo15:17
*** dsariel has quit IRC15:19
ccamachopradk Yeahp, Im debugging the THT side, as the service is not configured at all.. from what we spoke before, the issue should be about the service config, now Im deploying it locally to see if how the params are passed.15:19
ccamachopradk ill let you know when getting the result15:19
pradkok cool15:19
*** bvandenh has quit IRC15:20
*** yamahata has joined #tripleo15:21
*** saneax is now known as saneax_AFK15:24
*** coolsvap has joined #tripleo15:24
*** weshay is now known as weshay_brb15:25
*** ebarrera has quit IRC15:28
*** ebarrera has joined #tripleo15:28
jistrEmilienM: re https://review.openstack.org/#/c/333347/ i'm not fan of merging all-red patches unless they are very important for some reason. BTW if we decide to really ignore the CI on that one, you can also +A yourself. I think it's ok to +A any patch (even own patch) if there are two +2s from someone else.15:28
derekhbnemec: EmilienM Jun 27 14:26:52 instack.localdomain kernel: Killed process 21018 (swift-proxy-ser) total-vm:3672584kB, anon-rss:3381984kB, file-rss:1124kB15:28
derekhEmilienM: bnemec if I'm reading that corrently swift is using >3GB of RAM15:29
bnemecYeah, that's ridiculous.15:29
*** dmk0202 has quit IRC15:30
EmilienMjistr: I'm not a fan either, but this patch has really no impact. As you probably already know, Gemfile is used when running 'bundle install' and 'rake exec'15:30
EmilienMjistr: +A myself15:30
derekhbnemec: EmilienM I'll dig at it a bit later, gotta go for a while, if anybody is looking into it and find anything out can ye email me15:31
dtantsurEmilienM++ it's strange to not merge patches not touching gate at all15:31
*** weshay_brb is now known as weshay15:31
bnemecGah, my laptop is so slow when it's applying updates.15:31
*** derekh is now known as derekh_afk15:31
dtantsurI remember a ridiculous number of rechecks on a doc-only patch once...15:31
EmilienMdtantsur: functional jobs are not in gate15:31
bnemecYeah, if it's a unit test-only change like that, I don't wait for tripleo-ci.15:32
EmilienMbnemec: it seems like we have a successful job here https://review.openstack.org/#/c/329504/215:32
EmilienMwhere all 2 jobs are working15:32
EmilienMerr, 315:32
*** noslzzp has joined #tripleo15:33
bnemecYeah, there are a handful of passes on the ci status page so _sometimes_ we're not hitting this.15:33
bnemecI think derek couldn't reproduce it in a local environment either.15:34
*** noslzzp has quit IRC15:34
*** panda has quit IRC15:34
bnemecBecause of course not.  Race bugs never reproduce in an easily debuggable environment. :-)15:34
*** panda has joined #tripleo15:35
*** noslzzp has joined #tripleo15:35
EmilienMI also see some:15:37
EmilienMResourceInError: resources.Compute.resources[0].resources.NovaCompute: Went to status ERROR due to "Message: Unknown, Code: Unknown"15:37
EmilienMin some other jobs15:37
*** shivrao has joined #tripleo15:37
*** shivrao has quit IRC15:37
openstackgerritMerged openstack/instack-undercloud: Revert "Pin puppet-lint-absolute_classname-check to 0.1.3"  https://review.openstack.org/33334715:38
*** pcaruana has quit IRC15:38
EmilienMand lot of glance api things15:39
EmilienMhttp://logs.openstack.org/21/327721/2/check-tripleo/gate-tripleo-ci-centos-7-nonha/3df6467/logs/undercloud/var/log/glance/api.txt.gz#_2016-06-27_13_17_43_11415:39
EmilienMbnemec: have you seen this one? ^ related to swift too15:39
EmilienMprobably same error as you had with proxy going oom15:39
*** tesseract- has quit IRC15:40
EmilienMOut of memory: Kill process 21030 (swift-proxy-ser) score 411 or sacrifice child15:42
EmilienMok same thing indeed15:42
*** bnemec has quit IRC15:43
*** bnemec has joined #tripleo15:43
*** bnemec has quit IRC15:50
*** osp has quit IRC15:51
*** bnemec has joined #tripleo15:52
EmilienMI don't see anything wrong in netstat -lpn on undercloud (eventual Swift proxy memory leak)15:52
*** links has quit IRC15:52
*** tosky has joined #tripleo15:52
EmilienMbnemec: not only proxy15:54
EmilienMalso object service15:54
EmilienMlook in top -n 1 -b -o RES15:54
EmilienMswift object is taking 1410380 in virtual size, x2 (2 processes)15:54
EmilienMand proxy takes 3088892 :-O15:54
*** shivrao has joined #tripleo15:55
*** shivrao_ has joined #tripleo15:56
*** cschwede has joined #tripleo15:57
EmilienMcschwede: hey15:57
cschwedeEmilienM: hi15:57
EmilienMcschwede: so we're seeing a lot of OOM on swift-proxy-server since super recently15:57
EmilienMwe're investigating why but for now, we don't have any idea15:58
EmilienMlet me show you some logs for example15:58
cschwedehow much memory do you have on the nodes?15:58
*** zoli|wfh is now known as zoli|gone15:58
EmilienMcschwede: 5.8G for undercloud15:59
EmilienMcschwede: if you want, you can download http://logs.openstack.org/21/327721/2/check-tripleo/gate-tripleo-ci-centos-7-nonha/3df6467/logs/undercloud.tar.xz15:59
*** zoli|gone is now known as zoli_gone-proxy15:59
*** osp has joined #tripleo15:59
*** shivrao has quit IRC15:59
*** shivrao_ is now known as shivrao15:59
EmilienMand see Jun 27 09:14:13 instack.localdomain kernel: Out of memory: Kill process 21030 (swift-proxy-ser) score 411 or sacrifice child16:00
EmilienMafter running journalctl --file var/log/journal/027a07d166557094b759a31ef1ce5c65/system.journal16:00
EmilienMcschwede: we noticed swift proxy takes insane amount of RAM16:00
*** aufi has joined #tripleo16:00
cschwedewhat do you mean by insane?16:00
*** jpena is now known as jpena|off16:01
EmilienMcschwede: > 3GB16:01
cschwedethat’s not much for a proxy16:02
cschwededoes that happen when up- or downloading?16:02
EmilienM 675   PID USER      PR  NI    VIRT    RES    SHR S  %CPU %MEM     TIME+ COMMAND16:02
EmilienM 676  1062 swift     20   0 3088892 2.692g   1824 R  18.8 46.2   0:07.05 swift-prox+16:02
EmilienMcschwede: let me look, I think it's downloading16:02
EmilienMbecause it's when deploying overcloud16:02
EmilienMbut let me look again16:03
EmilienMhttp://logs.openstack.org/21/327721/2/check-tripleo/gate-tripleo-ci-centos-7-nonha/3df6467/logs/undercloud/var/log/glance/api.txt.gz#_2016-06-27_14_03_07_62016:03
EmilienMeventlet.wsgi.server [req-f741afae-33a7-4fac-b8a5-dcebe03d36f2 5d98f4907aef4ecd974672479562ef5f a287a9f2e7e648739e6eb25560acb740 - - -] 192.0.2.1 - - [27/Jun/2016 14:03:07] "GET /v1/images/4907e1bd-b808-4af1-b6bf-51b18a27dc62 HTTP/1.1" 500 454 214.62302316:03
EmilienMdownloading Swift -> Glance16:03
cschwedehmm, i think it happens at „Jun 27 09:14:13“ ?16:06
cschwedelet me check16:06
cschwedethe object above looks too small to me16:06
EmilienMlot of 404 though: http://logs.openstack.org/21/327721/2/check-tripleo/gate-tripleo-ci-centos-7-nonha/3df6467/logs/undercloud/var/log/glance/api.txt.gz#_2016-06-27_13_12_19_55616:08
EmilienMRegistry client request GET /images/overcloud-full raised NotFound16:08
cschwedeis there no swap on the undercloud?16:09
cschwedethere is also a mysqld oom: Jun 27 09:14:13 instack.localdomain kernel: mysqld invoked oom-killer: gfp_mask=0x201da, order=0, oom_score_adj=016:09
EmilienMcschwede: 2GB16:09
*** yamahata has quit IRC16:10
cschwedethis is the request that triggers the oom:16:11
cschwedeJun 27 09:13:18 instack.localdomain object-server[20769]: 192.0.2.1 - - [27/Jun/2016:13:13:18 +0000] "GET /1/153411/AUTH_a287a9f2e7e648739e6eb25560acb      740/glance/4907e1bd-b808-4af1-b6bf-51b18a27dc62" 200 3707895808 "GET http://192.0.2.1:8080/v1/AUTH_a287a9f2e7e648739e6eb25560acb740/glance/4907e1bd-b8      08-4af1-b6bf-51b18a27dc62" "tx6c8f58a4e2d04aa2a7d05-005771266d" "proxy-server 21030" 0.0009 "-" 20769 016:11
cschwede3707895808 bytes ~ 3.5GB16:12
cschwedeso if the client doesn’t consume the bytes fast enough, it’s hold in memory16:12
cschwedei suggest to increase ram or swap16:12
*** hjensas has quit IRC16:13
shardywe are resource constrained in both ram and walltime in CI, so neither option is very attractive16:13
shardywe could switch glance to be file-backed for the CI tests instead of swift backed?16:13
EmilienMshardy: yes, in the meantime we figure it out16:15
EmilienMshardy: but it's a testing regression16:15
EmilienMI mean: don't do that forever16:15
EmilienMcschwede: the client == glance api?16:16
*** ayoung has quit IRC16:16
cschwedeEmilienM: i think so (whoever fetches that object)16:16
shardyEmilienM: Yup, although we might also consider if there is huge value in swift-backed glance when the swift is an all-in-one setup on the undercloud16:16
EmilienMshardy: do you want me to submit a patch to use file backend until we sort things out?16:17
*** ebarrera has quit IRC16:17
shardyEmilienM: yes that sounds like a good first step, thanks16:17
EmilienMk16:17
*** ebarrera has joined #tripleo16:20
openstackgerritEmilien Macchi proposed openstack/instack-undercloud: glance: disable swift backend  https://review.openstack.org/33455516:20
EmilienMcschwede: maybe there is a timeout parameter somewhere that we can use?16:21
EmilienMin glance config maybe16:21
*** mcornea has quit IRC16:21
EmilienMshardy: /me filing a bug and updating the patch so we don't loose track16:22
cschwedeEmilienM: i will think of possible options and get back to you tomorrow if that’s ok16:23
shardyEmilienM: thanks16:24
shardyEmilienM: we should perhaps start a ML thread to discuss if we should leave glance file backed16:24
shardyit seems like swift is just a really expensive way to write to local disk in this particular case16:24
EmilienMhttps://bugs.launchpad.net/tripleo/+bug/159660416:25
openstackLaunchpad bug 1596604 in tripleo "swift-proxy-server OOM" [Critical,Confirmed] - Assigned to Emilien Macchi (emilienm)16:25
EmilienMcschwede: can you read it and correct me if needed? ^16:25
shardythat may have some upgrade implications tho16:25
EmilienMshardy: ok, I'll start it in a few min16:25
openstackgerritEmilien Macchi proposed openstack/instack-undercloud: glance: disable swift backend  https://review.openstack.org/33455516:26
bnemecNote that swift memory usage running wild is causing us grief on the overcloud too.16:26
*** coolsvap has quit IRC16:26
EmilienMI didn't disable swift on the undercloud, just change the backend in Glance16:27
bnemecMaybe not in CI because of the tiny cirros image, but in real-world deployments it's OOMing.16:27
EmilienMAFIK some other bits also use swift on undercloud16:27
EmilienMbnemec: in our CI it's failing on the overcloud image download, so definitly a real world thing16:27
bnemecSee also https://bugs.launchpad.net/tripleo/+bug/159591616:28
openstackLaunchpad bug 1595916 in tripleo "Swift memory usage grows until it is killed" [High,New]16:28
EmilienMbnemec, shardy: I'll let you review https://review.openstack.org/#/c/334555/ as soon as you can (+2 for now, we'll +A if CI is passing)16:28
cschwedebnemec: that looks like a memory leak, 60-80GB should not happen16:28
bnemecThat's in a baremetal environment with lots of ram.16:28
*** coolsvap has joined #tripleo16:28
cschwedealright, i will have a look into this16:29
* EmilienM afk for lunch, back in 45 min16:30
*** ayoung has joined #tripleo16:31
*** jpich has quit IRC16:36
*** coolsvap has quit IRC16:37
tdasilvacschwede, bnemec: could it be related to this: https://bugs.launchpad.net/cloud-archive/+bug/149330316:37
openstackLaunchpad bug 1493303 in swift (Ubuntu Wily) "[OSSA 2016-004] Swift proxy memory leak on unfinished read (CVE-2016-0738)" [Undecided,Triaged]16:37
openstackgerritMerged openstack/tripleo-quickstart: fix tripleo-roles after explicit teardown change  https://review.openstack.org/33443316:39
*** shivrao has quit IRC16:43
bnemectdasilva: I don't think so.  It looks like that should be fixed, and we're running against a very recent build of Swift.16:43
*** athomas has quit IRC16:46
*** Guest59963 has quit IRC16:46
*** yamahata has joined #tripleo16:52
*** fultonj has quit IRC16:53
*** paramite has quit IRC17:02
*** aufi has quit IRC17:02
*** ayoung has quit IRC17:06
*** amoralej is now known as amoralej|off17:06
*** ebarrera has quit IRC17:09
*** ooolpbot has joined #tripleo17:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION17:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159660417:10
*** ooolpbot has quit IRC17:10
openstackLaunchpad bug 1596604 in tripleo "swift-proxy-server OOM" [Critical,In progress] - Assigned to Emilien Macchi (emilienm)17:10
*** stendulker has joined #tripleo17:11
*** lblanchard has quit IRC17:20
*** ayoung has joined #tripleo17:21
EmilienMbnemec: ack for your comment on https://review.openstack.org/#/c/334555/17:23
EmilienMbnemec: I'll update commit message once CI passed and before we land it ok?17:23
*** penick has joined #tripleo17:24
bnemecEmilienM: Yeah, that's what I had in mind.  I didn't want to re-trigger CI by editing it at the time.17:24
Slower_what are people doing to get around the image building repo breakage?17:25
*** Slower_ is now known as SLower17:25
*** SLower is now known as Slower17:25
shardySlower: hey, what issues are you seeing?17:26
shardySlower: I'm testing a change to tripleo.sh which modifies the delorean command after a discussion in #rdo17:26
shardysome packages were failing to build for me locally, and it seems it may have been due to the --build-env DELOREAN_DEV=1 here17:27
shardyhttps://github.com/openstack-infra/tripleo-ci/blob/master/scripts/tripleo.sh#L35917:27
Sloweropenstack overcloud image build --all17:27
shardySlower: ah, Ok, different issue17:28
*** akuznetsov has joined #tripleo17:28
SlowerI think the repo has a redirect or something on it17:28
SlowerFile contains no section headers.17:28
Slowerfile: file:///etc/yum.repos.d/delorean.repo, line: 117:28
Slower'<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">\n'17:28
SlowerI've been messing with different ways to give it the repo but haven't found the magic yet17:29
shardySlower: ah, you might try removing all the delorean repos.d entries, pulling latest tripleo-ci repo then re-running tripleo.sh --repo-setup17:29
shardythere was a recent change to tripleo.sh which makes it follow the redirect IIRC17:29
Slowerah ok17:30
rhalliseySlower, I tried that method recently had the same issue.  Doing manual steps worked17:30
Slowerthanks guys17:30
shardyhttps://github.com/openstack-infra/tripleo-ci/commit/8a461f7fdf4855c32ff510f11187c8727c45ee9317:30
*** olap has quit IRC17:33
*** penick_ has joined #tripleo17:33
*** penick has quit IRC17:35
*** penick_ is now known as penick17:35
Slowerdoesn't dib get them itself during image build though?17:35
Slowerthe ones on teh host are fine..17:35
shardySlower: DIB_YUM_REPO_CONF points to the local host's /etc/yum.repos.d by default17:36
shardyyou can override that by exporting a different REPO_PREFIX17:37
Slowerwell what the heck17:37
EmilienMbnemec: ok it fails, /me investigating17:37
Slowershardy: ok thx ;)17:37
shardySlower: it actually works quite well, as for example you can export REPO_PREFIX then re-run --repo-setup17:38
shardye.g if you want to build images for a stable version one a host configured to use trunk or whatever17:38
EmilienMbnemec, shardy: my patch to disable swift backend on undercloud is failing, http://logs.openstack.org/55/334555/2/check-tripleo/gate-tripleo-ci-centos-7-nonha/925831b/logs/postci.txt.gz#_2016-06-27_17_28_26_00017:40
EmilienMhave you hit this message before?17:40
Slowershardy: oh I'm not dissing that part, that's great.  I am wondering why it's not working for me17:41
Slowercause my repos locally are right17:41
bnemecEmilienM: Yes, but that isn't the problem: http://logs.openstack.org/55/334555/2/check-tripleo/gate-tripleo-ci-centos-7-nonha/925831b/console.html#_2016-06-27_17_28_13_19898817:42
EmilienMoh it's glance17:42
shardySlower: tried removing ~/.cache/image-create ?17:42
bnemecYeah17:42
Slowermaybe I set some wierd env variable at some point.. new shell seems to be working17:43
EmilienMok, digging17:43
Slowershardy: I'll try that next17:43
*** oshvartz has quit IRC17:45
*** lblanchard has joined #tripleo17:46
bnemecEmilienM: It looks like glance-api is constantly restarting. :-/17:46
openstackgerritEmilien Macchi proposed openstack/instack-undercloud: glance: disable swift backend  https://review.openstack.org/33455517:48
*** ifarkas has quit IRC17:48
*** ramishra has quit IRC17:49
*** fragatina has quit IRC17:49
EmilienMbnemec: yeah, trying to use "file" instead of long name17:50
*** ramishra has joined #tripleo17:51
EmilienM2016-06-27 17:11:11.517 25461 DEBUG glance_store.backend [-] Registering store glance.store.filesystem.Store with schemes ('file', 'filesystem') create_stores /usr/lib/python2.7/site-packages/glance_store/backend.py:19817:51
EmilienMfile is the right nae17:51
EmilienMname*17:51
*** hewbrocca is now known as hewbrocca-afk17:51
EmilienMsee http://logs.openstack.org/55/334555/2/check-tripleo/gate-tripleo-ci-centos-7-nonha/925831b/logs/undercloud/var/log/glance/api.txt.gz#_2016-06-27_17_06_32_41517:51
*** dsariel has joined #tripleo17:53
*** lucasagomes is now known as lucas-afk18:01
*** egafford has quit IRC18:01
*** dtantsur is now known as dtantsur|afk18:02
openstackgerritSteven Hardy proposed openstack/python-tripleoclient: Don't pass None via UpdateIdentifier  https://review.openstack.org/33459818:07
*** egafford has joined #tripleo18:07
*** _milan_ has quit IRC18:07
openstackgerritMatt Young proposed openstack/tripleo-quickstart: Fix gate failures due to change in provisioning behavior.  https://review.openstack.org/33460018:08
*** ooolpbot has joined #tripleo18:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION18:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159660418:10
openstackLaunchpad bug 1596604 in tripleo "swift-proxy-server OOM" [Critical,In progress] - Assigned to Emilien Macchi (emilienm)18:10
*** ooolpbot has quit IRC18:10
*** stendulker has quit IRC18:10
SlowerWARNING: map-services has been deprecated.  Please use the svc-map element.18:11
SlowerOperation failed: No such file or directory18:11
SlowerUnmount /var/tmp/dib_build.CK913eOV/mnt/tmp/yum18:11
*** amoralej|off is now known as amoralej18:15
Slowerhow do I even debug this thing?18:17
*** shardy is now known as shardy_afk18:17
openstackgerritMatt Young proposed openstack/tripleo-quickstart: Fix gate failures due to change in provisioning behavior.  https://review.openstack.org/33460018:18
openstackgerritayoung proposed openstack/tripleo-quickstart: Provision Identity VM  https://review.openstack.org/32833518:24
openstackgerritayoung proposed openstack/tripleo-quickstart: Allow for multiple undercloud nodes  https://review.openstack.org/31574918:24
openstackgerritayoung proposed openstack/tripleo-quickstart: Setup IPA server  https://review.openstack.org/32837318:24
openstackgerritayoung proposed openstack/tripleo-quickstart: added rhsso  https://review.openstack.org/33461218:24
*** ayoung has quit IRC18:27
*** rasca has quit IRC18:27
*** akuznetsov has quit IRC18:33
*** rwsu has quit IRC18:33
*** egafford has quit IRC18:34
openstackgerritAlfredo Moralejo proposed openstack/tripleo-quickstart: Fix provision/base after teardown change  https://review.openstack.org/33447218:38
*** amoralej is now known as amoralej|off18:40
*** chem` has joined #tripleo18:45
*** chem has quit IRC18:47
openstackgerritCarlos Camacho proposed openstack/puppet-tripleo: Add gnocchi profiles  https://review.openstack.org/31552718:51
*** egafford has joined #tripleo18:53
openstackgerritMerged openstack/diskimage-builder: Fix copyright in docs  https://review.openstack.org/33308418:53
*** pradk has quit IRC18:54
*** egafford has quit IRC18:54
*** egafford has joined #tripleo18:56
*** egafford has quit IRC18:56
*** egafford has joined #tripleo18:56
*** pradk has joined #tripleo18:56
openstackgerritPradeep Kilambi proposed openstack/tripleo-heat-templates: Re-enable Ceilometer composable roles for controller  https://review.openstack.org/33349818:57
*** xinwu has joined #tripleo19:02
openstackgerritRonelle Landy proposed openstack/tripleo-quickstart: Adds missing line continuation  https://review.openstack.org/33462419:03
openstackgerritAlfredo Moralejo proposed openstack/tripleo-quickstart: Fix provision/base after teardown change  https://review.openstack.org/33447219:04
*** akshai has quit IRC19:12
*** akshai has joined #tripleo19:12
*** jcoufal has quit IRC19:14
*** akrivoka has quit IRC19:14
openstackgerritEmilien Macchi proposed openstack/instack-undercloud: glance: disable swift backend  https://review.openstack.org/33455519:14
EmilienMok it should be better now, I tested it19:14
EmilienMI also changed the bug id^19:15
*** pradk has quit IRC19:16
*** akshai has quit IRC19:16
EmilienMbnemec: did you notice oom also on overclouds?19:17
EmilienMor was it only a supposition?19:17
bnemecEmilienM: I've seen OOM on an overcloud where I was trying to boot a large image, and derekh_afk has seen the same in the upcoming rh2 env on the Mitaka release.19:18
EmilienMok19:19
*** chem` has quit IRC19:20
openstackgerritCarlos Camacho proposed openstack/puppet-tripleo: Add gnocchi profiles  https://review.openstack.org/31552719:22
*** florianf has quit IRC19:23
*** pradk has joined #tripleo19:28
*** pradk has quit IRC19:29
derekh_afkEmilienM: bnemec so here is an interesting observation (maybe a coincidence), the handful of patches on master that have passed today have all been on repo's that don't use the cached image19:32
derekh_afkEmilienM: bnemec going to submit a patch to disable the cached instack image to see what happens, if that helps we can merge it and see if we can figure out the difference19:33
EmilienMderekh_afk: so you mean that a package that is in cache would be part of the reason of our OOM?19:33
EmilienMo19:33
EmilienMderekh_afk: ok thx19:33
derekh_afkEmilienM: in theory they should be the same but something could be going wrong19:33
EmilienMderekh_afk: I alsoo proposed https://review.openstack.org/#/c/334555/19:33
*** panda has quit IRC19:34
*** panda has joined #tripleo19:35
*** akshai has joined #tripleo19:36
openstackgerritDerek Higgins proposed openstack-infra/tripleo-ci: Temporarily disable the instack.qcow cached image  https://review.openstack.org/33463419:36
*** jefrite has quit IRC19:37
*** pradk has joined #tripleo19:39
derekh_afkEmilienM: ack +2 until we can figure it out19:39
EmilienMack19:39
*** ayoung has joined #tripleo19:42
*** rajinir has quit IRC19:44
*** derekh_afk is now known as derekh19:48
*** julim has joined #tripleo19:51
*** jayg is now known as jayg|g0n319:51
*** akshai_ has joined #tripleo19:54
*** sambetts is now known as sambetts|afk19:57
*** akshai has quit IRC19:58
*** gfidente has quit IRC19:59
*** paramite has joined #tripleo20:05
*** jprovazn has quit IRC20:21
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: Drop unused VIP params to controller.yaml  https://review.openstack.org/33351120:24
EmilienMbnemec: damn, I still see the "no valid host was found" with file backend20:25
EmilienMI don't have logs yet, I just saw it in zuul/telnet20:26
*** derekh has quit IRC20:33
*** xinwu has quit IRC20:33
EmilienMbnemec: nevermind, I think I was telneting the wrong job20:34
bnemecSadly I didn't even question the possibility that there was something else wrong too. :-)20:38
*** myoung is now known as myoung|brb20:39
*** yamahata has quit IRC20:46
*** yamahata has joined #tripleo20:46
*** yamahata has quit IRC20:48
*** yamahata has joined #tripleo20:48
*** dsariel has quit IRC20:48
EmilienMbnemec: ok pingtest is running :)20:53
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: Re-enable Ceilometer composable roles for controller  https://review.openstack.org/33349820:54
EmilienMbnemec: Job complete, result: SUCCESS :)21:03
*** milan has joined #tripleo21:03
bnemec\o/21:03
*** myoung|brb is now known as myoung|biab21:05
Slowerwoot!21:06
*** lucas-afk has quit IRC21:06
*** lblanchard has quit IRC21:09
openstackgerritBen Nemec proposed openstack/diskimage-builder: Generalize logic for skipping final image generation  https://review.openstack.org/33404221:12
*** lucasagomes has joined #tripleo21:13
*** lucasagomes has quit IRC21:19
*** rhallisey has quit IRC21:21
*** numans has joined #tripleo21:26
*** lucasagomes has joined #tripleo21:27
numansEmilienM, Hi, can you please add this into your review Q - https://review.openstack.org/#/c/320531/21:28
openstackgerritDan Sneddon proposed openstack/tripleo-heat-templates: Enable IPv4/IPv6 dual-stack Public API endpoints  https://review.openstack.org/28927921:35
openstackgerritDan Sneddon proposed openstack/tripleo-heat-templates: Enable IPv4/IPv6 dual-stack Public API endpoints  https://review.openstack.org/28927921:39
*** bfournie has quit IRC21:54
*** [1]cdearborn has quit IRC22:08
*** egafford has quit IRC22:13
*** oneswig has quit IRC22:13
*** onovy has quit IRC22:14
openstackgerritMerged openstack/instack-undercloud: glance: disable swift backend  https://review.openstack.org/33455522:16
*** akshai_ has quit IRC22:19
*** pradk has quit IRC22:19
openstackgerritGabriele Cerami proposed openstack/tripleo-quickstart: Fixes for dlrn gate repo injection while using devmode  https://review.openstack.org/33469122:31
openstackgerritwes hayutin proposed openstack/tripleo-quickstart: Additional note for usbkey users regarding the latest images  https://review.openstack.org/32641922:38
*** julim has quit IRC22:46
*** rajinir has joined #tripleo23:15
*** myoung|biab is now known as myoung23:18
openstackgerritIan Wienand proposed openstack/diskimage-builder: Release notes for 1.18  https://review.openstack.org/33471123:20
*** thrash is now known as thrash|g0ne23:33
*** onovy has joined #tripleo23:39
*** tosky has quit IRC23:51
*** dmacpher has quit IRC23:53

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!