Friday, 2018-09-28

openstackgerritRedHat RDO CI proposed openstack/tripleo-heat-templates master: GATE CHECK for TripleO  https://review.openstack.org/60429800:00
*** ooolpbot has joined #tripleo00:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION00:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537400:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256000:10
*** ooolpbot has quit IRC00:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)00:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)00:10
openstackgerritMerged openstack/tripleo-heat-templates master: Refactor openshift services for composable roles  https://review.openstack.org/59961800:14
*** lblanchard has joined #tripleo00:25
*** hamzy has joined #tripleo00:25
*** rlandy has quit IRC00:36
openstackgerritMerged openstack/tripleo-common master: Tag openshift images for Infra service  https://review.openstack.org/60305000:51
openstackgerritMerged openstack/tripleo-heat-templates master: Fix openshift new node detection  https://review.openstack.org/60001200:51
openstackgerritMerged openstack/tripleo-heat-templates stable/rocky: Add CephOSD service to roles/Standalone.yaml  https://review.openstack.org/60375800:51
openstackgerritMerged openstack/python-tripleoclient stable/rocky: Start websocket client before workflows  https://review.openstack.org/60549900:51
openstackgerritMerged openstack/instack-undercloud stable/rocky: Include missing config classes  https://review.openstack.org/60479900:51
openstackgerritMerged openstack/tripleo-heat-templates master: Tag step plays  https://review.openstack.org/59907200:54
openstackgerritMerged openstack/tripleo-heat-templates master: Remove "when failed" from debug task names  https://review.openstack.org/59822100:54
openstackgerritMerged openstack/tripleo-common master: Handle non-existant plan when getting deployment status  https://review.openstack.org/60275300:54
*** tzumainn has quit IRC00:55
openstackgerritMerged openstack/tripleo-common stable/rocky: Add override_ansible_cfg  https://review.openstack.org/60487900:57
*** phuongnh has joined #tripleo01:04
*** ooolpbot has joined #tripleo01:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION01:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537401:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256001:10
*** ooolpbot has quit IRC01:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)01:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)01:10
*** shardy has quit IRC01:10
*** shardy has joined #tripleo01:18
*** phuongnh has quit IRC01:30
*** dmacpher_ has joined #tripleo01:33
*** dmacpher has quit IRC01:35
*** itlinux has joined #tripleo01:44
*** itlinux has quit IRC01:44
*** zzzeek has quit IRC01:48
*** zzzeek has joined #tripleo01:49
*** mrsoul has quit IRC01:55
*** mschuppert has quit IRC01:56
*** mschuppert has joined #tripleo01:57
*** mburned is now known as mburned_out02:00
*** jamesdenton has joined #tripleo02:07
*** ooolpbot has joined #tripleo02:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION02:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537402:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)02:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256002:10
*** ooolpbot has quit IRC02:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)02:10
*** ykarel has joined #tripleo02:18
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Support podman when tagging container for Pacemaker  https://review.openstack.org/60418002:24
*** boazel has joined #tripleo02:26
EmilienMchkumar|off: http://logs.openstack.org/17/600517/35/check/tripleo-ci-centos-7-undercloud-containers/f15eb65/logs/undercloud/home/zuul/tempest.log.txt.gz#_2018-09-27_23_39_0002:27
EmilienM2018-09-27 23:39:00 | mkdir: cannot create directory '/home/zuul/tempest': Permission denied02:27
EmilienMchkumar|off: I think we're close02:27
*** ykarel has quit IRC02:32
*** jhebden has quit IRC02:44
*** jhebden has joined #tripleo02:51
*** skramaja has joined #tripleo02:53
*** phuongnh has joined #tripleo02:53
*** med_ has quit IRC02:57
*** ooolpbot has joined #tripleo03:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION03:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537403:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256003:10
*** ooolpbot has quit IRC03:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)03:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)03:10
*** ykarel has joined #tripleo03:13
openstackgerritMerged openstack-infra/tripleo-ci master: Remove toci_jobtype definition from v3 jobs  https://review.openstack.org/59386303:13
*** lblanchard has quit IRC03:28
*** psachin has joined #tripleo03:30
openstackgerritMerged openstack/tripleo-common master: Update swift_rings_backup workflow to also backup ceph fetch dir  https://review.openstack.org/59722103:31
openstackgerritMerged openstack/tripleo-validations master: Add new nova-event-callback validation  https://review.openstack.org/51333303:31
*** sanjayu_ has joined #tripleo03:48
*** iranzo has joined #tripleo03:51
openstackgerritMerged openstack/python-tripleoclient stable/rocky: Fix typo in upgrade playbook's name.  https://review.openstack.org/60468503:57
openstackgerritMerged openstack/tripleo-common master: Make ODL healthcheck IPv6 compatible  https://review.openstack.org/59698703:57
openstackgerritMerged openstack/tripleo-heat-templates stable/queens: Fix syntax for set_fact module.  https://review.openstack.org/60477403:57
openstackgerritMerged openstack/tripleo-heat-templates master: Expose IronicImageDownloadSource as a parameter  https://review.openstack.org/60379603:57
openstackgerritMerged openstack/tripleo-common master: Don't fail tripleo-bootstrap on package installs  https://review.openstack.org/60319603:57
*** jaganathan has joined #tripleo04:00
*** ooolpbot has joined #tripleo04:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION04:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537404:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256004:10
*** ooolpbot has quit IRC04:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)04:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)04:10
Tenguhello there :)04:47
*** ramishra has joined #tripleo05:09
*** udesale has joined #tripleo05:09
*** ooolpbot has joined #tripleo05:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION05:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537405:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)05:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256005:10
*** ooolpbot has quit IRC05:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)05:10
*** ykarel_ has joined #tripleo05:11
*** itlinux has joined #tripleo05:12
*** ykarel has quit IRC05:14
*** ykarel__ has joined #tripleo05:15
*** ykarel_ has quit IRC05:18
*** ykarel_ has joined #tripleo05:20
*** ykarel__ has quit IRC05:23
*** ykarel__ has joined #tripleo05:24
*** ykarel_ has quit IRC05:27
*** ykarel_ has joined #tripleo05:29
*** ykarel__ has quit IRC05:32
Tenguif anyone in here could add the missing cr+2 on that one it would be great :) https://review.openstack.org/#/c/600534/05:35
*** quiquell|off is now known as quiquell05:41
quiquellTengu: good morning05:45
*** chkumar|off is now known as chandankumar05:46
chandankumarquiquell: Tengu jaosorior \o/05:46
quiquellchandankumar: o/05:46
*** Petersingh has joined #tripleo05:47
Tenguhello quiquell, chandankumar and jaosorior :))05:49
Tengujaosorior: hey, I'm pretty sure you're in a good mind for some cr+2 :)  https://review.openstack.org/#/c/600534/  please? :)05:49
chandankumarTengu: just one question https://review.openstack.org/#/c/600534/ does this changes is not needed for tempest container?05:52
quiquellTengu, jaosorior, chandankumar: Proper handling of connection close with zaqar https://review.openstack.org/#/c/605387/05:53
Tenguchandankumar: good question - I didn't do anything with tempest testing on that. It's for the plain deploy itself in fact.05:53
quiquellGood for debugging ^05:53
chandankumarTengu: https://github.com/openstack/tripleo-heat-templates/blob/master/docker/services/tempest.yaml#L5505:53
Tenguchandankumar: hmm yep, needed.05:53
chandankumarTengu: let this patch get's merged05:54
chandankumarTengu: I need to update tempest.yaml also with some changes05:54
chandankumarI will take care in that05:54
Tenguchandankumar: ok, cool :)05:54
Tenguchandankumar: I will probably need a second pass on the whole t-h-t - I've mainly worked out the issues I got while deploying the undercloud with podman+selinux05:54
Tenguchandankumar: so if you can take care of that directory... also, take care of the creation with setype05:55
Tenguchandankumar: https://review.openstack.org/#/c/600534/12/docker/services/ironic-api.yaml@158  for example with a loop05:55
chandankumarTengu: yup, sure05:56
Tengugreat :)05:56
Tenguquiquell: reading your patch :). Debug is good05:56
quiquellTengu: yeah, you are sensible of this after a harsh rover session05:57
Tenguquiquell: no kidding ;)05:58
*** gfidente has joined #tripleo06:07
quiquellchandankumar, Tengu: Do you know why this review that has all is not merged ?06:08
quiquellhttps://review.openstack.org/#/c/594511/06:08
quiquellIt's like stuck at openstack/triple-docs06:09
Tenguquiquell: probably because of CI issues - someone needs to -w // w+106:09
jaosoriorchandankumar, quiquell: Seems we have a gigantic queue in zuul again06:10
jaosoriorand infra is starting to call us out on it06:10
jaosoriorany idea why?06:10
*** ooolpbot has joined #tripleo06:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION06:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537406:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256006:10
*** ooolpbot has quit IRC06:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)06:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)06:10
quiquelljaosorior: But I see stuff in wrong queues https://review.openstack.org/#/c/594511/06:10
jaosoriorAlso, they say we do spend too much time collecting logs...and asked if we can trim that time down06:10
quiquelljaosorior: hummm we introduces one stuff there, about gatering ARA (I was suspecting it will cost us)06:10
quiquelljaosorior: You mean collect logs or post actions ?06:11
*** pcaruana has joined #tripleo06:11
*** ksambor has joined #tripleo06:17
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-common master: Add httpd and mod_ssl packages to octavia api image  https://review.openstack.org/60322006:19
quiquellTengu: I have end up at one of your patches at tht https://github.com/openstack/tripleo-heat-templates/commit/623790385292acf4cb4f357a8d089e9d08d4d21206:22
quiquellTengu: we are exercising rocky->master containerized undercloud upgrade06:22
quiquellTengu: and looks like xinetd service does not exists06:22
*** dsneddon has quit IRC06:23
*** verdurin has quit IRC06:23
*** holser_ has joined #tripleo06:25
*** anande has joined #tripleo06:27
jaosoriorquiquell: any idea what's up with these http://status.openstack.org/elastic-recheck/data/others.html ?06:27
quiquelljaosorior: zuul post timedout06:29
quiquelljaosorior: but time difference is very small, something is broken at timeout config06:30
Tenguquiquell: xinetd was used previously for 1-2 services.06:30
Tenguquiquell: the goal there is to allow to remove that deprecated service06:30
Tengucare to explain your issue?06:30
quiquellTengu: last comment https://review.openstack.org/#/c/59077406:31
quiquellTengu: looks like xinted service is not present at rocky06:31
chandankumarPost queue has 92 hrs waiting time06:31
Tenguquiquell: that makes sense in fact.06:31
*** jfrancoa has joined #tripleo06:31
jaosoriorquiquell: is that timeout config something we set in tripleo?06:31
jaosorior* in tripleo-ci or quickstart06:31
Tenguquiquell: so the code I produced should be a bit different I guess so that it's kicking ONLY if we're <rocky ?06:32
quiquelljaosorior: Wait there is something I don't understand06:32
Tenguquiquell: unless we can do a "ignore_errors: true" in there?06:32
quiquellTengu: But the tht put's rocky in the top, is not enough ?06:32
* quiquell is a total noob on tht06:32
*** holser_ has quit IRC06:33
Tenguquiquell: not sure it's used for that in fact.06:33
TenguI think it's more a validation thing in order to ensure we use the right template version for the current deploy.06:33
quiquelljaosorior: Ahh wait... the minutes are the same the hours not... yepp is a clear timeout config is ok06:33
Tenguquiquell: answered your comment - guess the "ignore_errors" directive is the right thing.06:33
quiquellTengu: yep is kind of cleanup, if not there we are good too, that's it ?06:34
Tenguquiquell: exactly06:34
quiquellTengu: cool thanks, will try to fix06:34
quiquellTengu: maybe is better to check for existence06:35
Tenguquiquell: hmm yeah why not. using "systemd" ansible module to get state06:36
jaosoriorgfidente: Seen this issue before http://logs.openstack.org/58/601558/3/gate/tripleo-ci-centos-7-scenario004-multinode-oooq-container/f6a3788/logs/undercloud/var/log/mistral/ceph-install-workflow.log.txt.gz#_2018-09-28_06_25_08_448 ?06:36
Tenguquiquell: do you take care of that? or do you want me to fix it?06:36
jaosoriorgfidente: this commit targetted for queens just hit it https://review.openstack.org/#/c/601558/06:36
quiquelljaosorior: This could be the issue https://review.openstack.org/#/c/580238/06:37
quiquellTengu: going to try to take care, so I deep into tht06:37
quiquells/deep/dig/06:37
Tenguquiquell: fine :). you should just edit that xinetd service file06:37
Tenguit's plain ansible. no real magic ;)06:37
quiquellTengu: let's see what we find after fix this... like trains in the station06:38
Tenguhehe06:38
*** mrsoul has joined #tripleo06:38
Tenguquiquell: I know that - had that same kind of thing while working on podman+selinux integration :D06:38
Tenguwell, and STILL finding things.06:38
Tengulike the modprobe done within containers -.-06:38
quiquellTengu: now that you mention containers, I have question06:39
* Tengu hides06:39
quiquellTengu: is possible to store containers at images ?06:39
Tengugni?06:39
Tengudon't understand your question06:39
Tengudo you mean generate image from a running/deployed container?06:39
quiquellTengu: yep06:40
Tenguyep, we can06:40
Tenguat least with docker06:40
quiquellTengu: so it's like having RPM installed but instead of that we have containers "installed)06:40
Tenguwe probably can do the same with either podman or buildah06:40
quiquellTengu: will try some proto, maybe need your help06:40
Tenguquiquell: this in order to prevent the whole bootstrap of the containers?06:40
quiquellTengu: to try to reduce times06:41
jaosoriorquiquell: well, as far as I can tell from the failures here http://status.openstack.org/elastic-recheck/data/others.html , it's still timeouts... maybe we did increase the time by collecting more logs, but I think those are quite useful... do we have anything else that we could cut time on, or is there any other cause for the timeouts that we're still dealing with?06:41
Tenguquiquell: well, that will eat space, and when you fetch the images, it will take network bandwidth. A delicate balance.06:41
quiquellTengu: ... in case of non-containerize, this space is consume by installed RPMs06:42
Tenguyeah, but you might get mutliple containers with the same packages06:42
quiquellTengu: so have to be similar but with the overhead of docker registry (I can be missing a lot of stuff)06:42
quiquellTengu: don't agree, depends on the layers06:42
Tenguat least same package base06:42
quiquellTengu: layers are shared if they are the same06:42
Tenguneed to control that when you generate image from a running container - not sure if it works 100% the same.06:43
quiquellTengu: hummm that's right, is very difficult to make it right...06:43
Tenguquiquell: you might want to ping EmilienM when he's connected for some discussion. I think he knows a bit more than me about all that.06:44
quiquelljaosorior: Humm you are right... RUN times out too :-/06:44
*** jtomasek has joined #tripleo06:44
quiquellTengu: cool thanks06:44
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart-extras master: Add podman support to validate-tempest role  https://review.openstack.org/60535606:47
*** holser_ has joined #tripleo06:49
*** holser_ has quit IRC06:49
*** holser_ has joined #tripleo06:50
openstackgerritAndreas Jaeger proposed openstack-infra/tripleo-ci master: Enable featureset override  https://review.openstack.org/59451106:50
openstackgerritChandan Kumar proposed openstack/tripleo-heat-templates master: Set proper setype for tempest service directories  https://review.openstack.org/60598006:54
chandankumarTengu: ^^06:54
Tenguchandankumar: \o/06:55
openstackgerrithanish proposed openstack/puppet-tripleo master: Implements: liquidio-containerization  https://review.openstack.org/60598106:55
chandankumarI need to wait for Bodgan to come I need to make some more changes to tempest container06:55
chandankumarmandre: Hello06:56
chandankumarmandre: In tempest container, I want to have three volumes auto mounted from tempest kolla images directory06:57
chandankumarmandre: one is /var/log/tempest, tempest workspace and data directory and all these directory should be owned by tempest user06:57
chandankumarmandre: It can be handled on tht side but I donot want to do that, Is it possible to handle directly on kolla tempest dockerfile side?06:58
openstackgerrithanish proposed openstack/tripleo-heat-templates master: Implements: liquidio-containerization  https://review.openstack.org/60598207:01
*** florianf|afk has quit IRC07:01
*** shardy has quit IRC07:01
*** shardy has joined #tripleo07:02
Tenguchandankumar: you might want to use "mkdir -p {{ tempest_dir }}" in case it already exists or need to create a tree.07:03
Tenguchandankumar: (for https://review.openstack.org/#/c/605356/4/roles/validate-tempest/templates/configure-tempest.sh.j2 )07:04
*** quiquell is now known as quiquell|brb07:04
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart-extras master: Add podman support to validate-tempest role  https://review.openstack.org/60535607:04
openstackgerrithanish proposed openstack/tripleo-heat-templates master: Implements: liquidio-containerization  https://review.openstack.org/60598207:05
chandankumarTengu: thanks, I think I need to get rid of these dir, it can be handled on tht side or may be on kolla tempest dockerfile itself07:05
Tenguchandankumar: t-h-t might be the right place indeed. I don't really know tempest though, can't judge more.07:06
chandankumarTengu: yup07:07
Tenguanyway, replacing bash scripts by ansible is always a good move :)07:08
chandankumarTengu: yes long term place is to stub all these shell scripts here https://github.com/openstack/openstack-ansible-os_tempest07:08
chandankumarinto ansible07:09
*** dtrainor has quit IRC07:09
chandankumar*plan07:09
Tengu\o/07:10
*** ooolpbot has joined #tripleo07:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION07:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537407:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)07:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256007:10
*** ooolpbot has quit IRC07:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)07:10
*** f2 has joined #tripleo07:11
*** f2 is now known as florianf07:11
openstackgerritMerged openstack/tripleo-heat-templates stable/pike: Improve nova statedir ownership logic  https://review.openstack.org/58706607:11
*** rcernin has quit IRC07:12
*** rdopiera has joined #tripleo07:14
*** ssbarnea|bkp has quit IRC07:16
*** cylopez has joined #tripleo07:20
gfidentejaosorior ah no, looking into it now07:23
gfidenteI guess this will affect rocky and master cause we use the same version for all branches07:23
*** gkadam has joined #tripleo07:23
*** cylopez has left #tripleo07:25
*** amoralej|off is now known as amoralej07:26
gfidentejaosorior I don't get it though, it's not happening for all runs?07:26
openstackgerritArx Cruz proposed openstack/tripleo-quickstart-extras master: WIP - Fix stackviz  https://review.openstack.org/60541907:27
openstackgerritCarlos Goncalves proposed openstack/tripleo-quickstart master: FS038: enable tempest run  https://review.openstack.org/59917807:27
jaosoriorgfidente: not sure, I just saw it though07:28
jaosoriorgfidente: and it was for queens too07:28
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: Fix overcloud ARA data collection  https://review.openstack.org/60567807:29
*** Petersingh is now known as Petersingh|lunch07:32
*** quiquell|brb is now known as quiquell07:35
*** jpena|off is now known as jpena07:43
gfidentejaosorior so I am not sure, the copy module for that task is wrapped into a custom action so there could be issues with params mangling in the wrapper07:43
openstackgerritQuique Llorente proposed openstack/tripleo-heat-templates master: Ignore errors at xinetd stop/uninstall  https://review.openstack.org/60598907:43
openstackgerritMerged openstack/tripleo-heat-templates stable/rocky: Allow a containerized logrotate to access docker  https://review.openstack.org/60534907:43
gfidentebut then in http://tripleo.org/cistatus.html I see that both scenario001 and 004 are green and pretty stable07:43
*** phuongnh has quit IRC07:44
gfidenteit's pretty complicated to add a debug ling in the wrapper action because tripleo/ci won't consume it unless it's built into an rpm07:45
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Switch previous release of master from 'queens' to 'rocky'  https://review.openstack.org/59077407:45
quiquellTengu: ^07:46
*** AJaeger has joined #tripleo07:47
quiquelljaosorior: The timeouts are at rocky ?07:47
AJaegertripleo team, jaosorior, you have only *2* changes open for python3-first: both are stable changes for paunch, see https://review.openstack.org/597831 and https://review.openstack.org/597848 - and both fail ;(07:48
*** bogdando has joined #tripleo07:48
AJaegerWhat do you want to do to get those merged?07:48
AJaegeropenstack-tox-py27 is failing in both cases07:49
bandiniany takers for a simple cherry-pick ? https://review.openstack.org/#/c/601077/07:49
*** leanderthal has joined #tripleo07:50
*** tosky has joined #tripleo07:50
Tenguquiquell: hmm. I doubt this will avoid the xinetd thingy. well, let's see what ci spits.07:50
quiquellTengu: ack, ideally I want to check if the package is installed, but want to unblock at least to see next issues07:50
jaosoriorAJaeger: I'll check it out07:54
quiquelljaosorior: I don't see to much timeouts in the gates, also found that at the merge for master timeout is recent07:54
*** assassin has joined #tripleo07:54
quiquelljaosorior: at rocky07:54
*** quiquell is now known as quiquell|brb07:55
AJaegerthanks, jaosorior07:55
*** holser_ has quit IRC07:55
*** ratailor has joined #tripleo07:56
*** jpich has joined #tripleo07:58
*** holser_ has joined #tripleo07:58
openstackgerritRedHat RDO CI proposed openstack/tripleo-heat-templates stable/queens: GATE CHECK for TripleO  https://review.openstack.org/56722408:00
openstackgerritRedHat RDO CI proposed openstack/tripleo-heat-templates stable/pike: GATE CHECK for TripleO  https://review.openstack.org/60224808:00
openstackgerritRedHat RDO CI proposed openstack/tripleo-heat-templates master: GATE CHECK for TripleO  https://review.openstack.org/60429808:00
openstackgerritRedHat RDO CI proposed openstack/tripleo-heat-templates stable/rocky: GATE CHECK for TripleO  https://review.openstack.org/60429308:00
*** quiquell|brb is now known as quiquell08:05
*** ooolpbot has joined #tripleo08:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION08:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537408:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256008:10
*** ooolpbot has quit IRC08:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)08:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)08:10
*** anande has quit IRC08:17
quiquelljaosorior: I am starting to see POST_FAILURES with POST timeouts08:17
quiquellmarios ^08:18
*** sai_p has quit IRC08:26
mandrechandankumar: hi! I'm back08:26
mandrechandankumar: so you want to change ownership of some directories mounted in the container but do not want to do it in the script that starts your tempest container?08:27
chandankumarmandre: yes correct08:28
*** ykarel__ has joined #tripleo08:28
chandankumarmandre: since the same container can be used by other distribution08:28
thervequiquell: Do you know what's the failure with "gating_repo.tar.gz: No such file or directory" ?08:28
chandankumarmandre: with in that direction write and read permission should happen08:29
quiquelltherve: do you have a log around ?08:29
thervequiquell: https://ci.centos.org/artifacts/rdo/jenkins-tripleo-quickstart-extras-gate-master-delorean-full-featureset052-719/console.txt.gz08:30
*** Petersingh|lunch is now known as Petersingh08:30
mandrechandankumar: hmm, the rule we've been following so far is that the tool using the container should set the right perms on the directories it uses08:30
chandankumarmandre: for example https://github.com/openstack/kolla/blob/master/docker/tempest/extend_start.sh#L4 -> it should point to /var/log/tempest08:30
*** ykarel_ has quit IRC08:31
mandrechandankumar: I've always hated these mkdir in extend_start and proposed to remove them08:31
*** dtrainor has joined #tripleo08:31
mandrechandankumar: that's kolla-ansible specific and should be created in kolla-ansible08:32
mandrewe do not use this path in tripleo08:32
chandankumarmandre: I want to carry minimal stuff in tht and handle all stuff in kolla container08:32
mandrethe dir is there but we don't care about it08:32
chandankumarmandre: tempest is not currently consumed in kolla-ansible08:32
mandrechandankumar: one more reason to remove everything that's in https://github.com/openstack/kolla/blob/master/docker/tempest/extend_start.sh08:33
mandrechandankumar: what is the problem with setting the right perms in tht?08:33
chandankumarmandre: nothing08:34
chandankumarmandre: our long term plan with validate-tempest role is to replace validate-tempest role with ohttps://github.com/openstack/openstack-ansible-os_tempest and consume it in tripleo , openstack-ansible and kolla-ansible08:34
chandankumarthat's why I donot wanted to keep stuff in tht08:35
mandrechandankumar: well, in that case, fixing the perms should go in your openstack-ansible-os_tempest role, shouldn't it?08:36
*** anande has joined #tripleo08:36
chandankumarmandre: yup,08:36
mandrechandankumar: I suppose tempest is a special beast and we could make an exception08:36
mandrebut if it's possible to fix the perms where the container is used I much prefer it08:36
chandankumarmandre: let me see what can I do08:37
*** derekh has joined #tripleo08:37
quiquelltherve: It's missing checking compressed_gating_repo conditional08:38
quiquelltherve: looks like it's not generating any gating_repo08:38
*** ratailor has quit IRC08:40
*** anande has quit IRC08:41
*** ratailor has joined #tripleo08:41
thervequiquell: OK not sure what that means :). What generate this?08:42
quiquelltherve: To be able to test changes at projects that are installed through RPMs we have to create a special yum repos with new RPMs containing those changes.08:44
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Add OS::TripleO::Services::Rhsm to OpenShift roles  https://review.openstack.org/60599908:44
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Use Timesync service instead of Ntp  https://review.openstack.org/60600008:44
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Let openshift-ansible configure the firewall  https://review.openstack.org/60600108:44
quiquelltherve: the build-test-packages role do that inspecting the zuul/jenkins changes, and register it at a variable that have to be checked before use it.08:45
thervequiquell: So the variable is present, but why the file isn't?08:45
quiquelltherve: The variable is not present, we are not checking it at the failing task08:45
quiquelltherve: a 'whenÂ' is missing08:46
therveOK I trust you on this :)08:46
quiquelltherve: Then you are screw :-P08:46
therveI see that --extra-vars artg_compressed_gating_repo=/home/stack/gating_repo.tar.gz08:46
quiquelltherve: but don't know why this is failing now08:46
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Use glusterfs for registry when deploying with CNS  https://review.openstack.org/60582508:46
quiquelltherve: yep that's it08:47
quiquelltherve: do you have the review that trigger this job ?08:47
quiquelltherve: is weird that we don't have gating_repo, has to be over tq tqe08:48
thervequiquell: https://review.openstack.org/#/c/604979/08:48
quiquelltherve: a ok this is a change at tqe, tqe is not installed with RPM so no gating_repo is generated08:48
*** arxcruz is now known as arxcruz|doctor08:49
quiquelltherve: Then I don't know why the failing task is missing the when statment08:49
thervequiquell: Shouldn't that affect all changes then?08:49
thervetqe ones that is08:49
quiquelltherve: tqe, tq and possible tripleo-ci08:49
quiquelltherve: let me check if the task is new08:50
quiquelltherve: this is weird then when statement is here http://git.openstack.org/cgit/openstack/tripleo-quickstart/tree/roles/libvirt/setup/undercloud/tasks/main.yml#n5108:52
quiquelltherve: something is artifically generating the variable08:52
thervequiquell: Does the "artg_" prefix matter?08:52
*** shyamb has joined #tripleo08:52
quiquelltherve: where do you see  --extra-vars artg_compressed_gating_repo ?08:53
thervequiquell: The quickstart call in that file08:54
thervequiquell: bash quickstart.sh --working-dir /home/jenkins/workspace/tripleo-quickstart-extras-gate-master-delorean-full-featureset052/ --no-clone --bootstrap --extra-vars artg_compressed_gating_repo=/home/stack/gating_repo.tar.gz --playbook build-test-packages.yml --tags all --teardown all --release centosci/master 172.19.2.9908:54
quiquelltherve: that's wrong08:54
quiquelltherve: this is centos.org don't know where the script lives08:57
therve?08:58
quiquellykarel__: Do you know where is the script that do this https://ci.centos.org/artifacts/rdo/jenkins-tripleo-quickstart-extras-gate-master-delorean-full-featureset052-719/console.txt.gz08:58
thervequiquell: https://github.com/openstack/tripleo-quickstart/blob/master/ci-scripts/full-deploy.sh#L61-L71 ?08:59
ykarel__quiquell, see ci-config08:59
*** ykarel__ is now known as ykarel08:59
ykarelrdo-infra/ci-config08:59
quiquelltherve, ykarel: we are still using full-deploy.sh ?09:00
openstackgerritUdi Kalifon proposed openstack/tempest-tripleo-ui master: Selenium infra  https://review.openstack.org/60542409:00
ykarelquiquell, yes atleast in phase 1 it's used, not sure about other places09:00
therveMaybe not :)09:00
therveOh ok09:00
*** tosky has quit IRC09:02
quiquelltherve: this is standalone09:02
*** tosky has joined #tripleo09:03
therveOK I have no idea how all this works :D09:03
quiquelltherve: Looks like it have fails forever for standalone on tq, tqe changes...09:04
*** ykarel is now known as ykarel|lunch09:04
therveSounds worth fixing. Or to get rid of the job.09:05
quiquelltherve: let me verify where do we call the build-test-packages09:05
marios|roverquiquell: ack09:06
marios|roverquiquell: which jobs/info?09:06
marios|roverquiquell: nm i see one in grafana09:07
quiquellmarios|rover: http://dashboard-ci.tripleo.org/d/FEdraO0ik/jobs-exploration?orgId=1&var-influxdb_filter=result%7C%3D%7CPOST_FAILURE09:08
*** salmankhan has joined #tripleo09:08
quiquelltherve: the ansible fact is cached so no need to pass it over quickstart.sh calls I think09:08
quiquelltherve: we can totally remove it09:08
quiquelltherve: weird thing it's going to fail at tq/tqe changes at all calls09:08
*** ooolpbot has joined #tripleo09:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION09:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537409:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)09:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256009:10
*** ooolpbot has quit IRC09:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)09:10
thervequiquell: Last success for this build is 2 weeks ago...09:12
quiquelltherve: depends on the kind of change, if it's for example at THT it's going to work09:13
quiquelltherve: But it's at tq,tqe is not09:13
*** Petersingh is now known as Petersingh|afk09:13
therveSure talking about https://ci.centos.org/job/tripleo-quickstart-extras-gate-master-delorean-full-featureset052/09:14
openstackgerritQuique Llorente proposed openstack/tripleo-quickstart master: Remove standalone quickstart.sh gating_repo var  https://review.openstack.org/60601209:16
quiquelltherve: ^09:16
therveThanks!09:16
quiquelltherve: add a Depends-On to see if it works now09:16
quiquelltherve: worth checking also at tht dummy change for example, can you do that for me ?09:16
*** rdo has quit IRC09:17
therveSorry I don't know what you mean09:17
therveIt's non-voting right?09:18
therveI'd rather make another recheck if possible09:18
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-quickstart-extras master: Add standalone upgrade role and playbook.  https://review.openstack.org/60473609:22
*** Petersingh|afk is now known as Petersingh09:23
shyambHi09:23
shyambOvercloud deployment for queens is failing even if I don't change anything09:24
shyambIt's taking 2-3 retries to get a successful deployment09:24
shyambshardy: Tengu: jaosorior:09:24
shyambRHOSP10 was quite stable and consistent but RHOSP13 is not same09:25
shardyshyamb: fails how?09:25
shardyand on what platform?09:25
shyambhttp://paste.openstack.org/show/731080/09:25
shyambrhel7 platform09:26
shyambshardy: Error doesn't look consistent across deployments09:26
shardyshyamb: Ok, probably need more information to figure out why - openstack stack failures list overcloud --long as a start but probably you'll need to look at the logs to work out what's up with haproxy09:28
shyambshardy: ok09:31
shyambbut if I don't change anything in the command or overcloud nodes, things should work as it is09:32
shyambin that case if deployment fails, I am not getting motivation to go ahead and debug the issue09:32
*** akrivoka has joined #tripleo09:33
shardyshyamb: :\09:39
shardyIf you're not prepared to even try to debug it, why bother asking for help here?09:39
shardysigh09:39
*** Petersingh is now known as Petersingh|afk09:41
shyambshardy: I debugged my issues09:43
shyamband fixed many09:43
shyambbut if I don't change anything in the command or overcloud and it's working on next retry09:43
shyambwhy should I debug it09:44
shyambMy concern is why should it work on retry09:44
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: WIP: read job variables from deploy playbooks  https://review.openstack.org/60601709:45
*** ykarel|lunch is now known as ykarel09:48
jaosoriorgfidente: should I raise a bug? That affected a job in pike as well.09:48
shardyshyamb: my point is if you don't capture why it failed the first time, we have zero chance of fixing the underlying issue09:49
gfidentejaosorior I think an issue in github for ceph-ansible09:49
jaosoriorI see09:49
gfidentejaosorior can you paste me a link to the pike error? because in pike we're using a different version of ceph-ansible09:49
openstackgerritQuique Llorente proposed openstack/tripleo-quickstart master: DNM: To test job variables  https://review.openstack.org/60602009:50
shyambshardy: Next time, I will capture it09:50
shardyshyamb: thanks09:51
jaosoriorgfidente: oh, it was a failure; but a different(now that I'm digging into the ceph ansible logs)09:51
jaosoriorgfidente: http://logs.openstack.org/08/604708/1/gate/tripleo-ci-centos-7-scenario004-multinode-oooq-container/f07f96c/logs/undercloud/var/log/mistral/ceph-install-workflow.log.txt.gz#_2018-09-28_08_05_48_54509:51
jaosoriorand09:51
jaosoriorgfidente:http://logs.openstack.org/08/604708/1/gate/tripleo-ci-centos-7-scenario001-multinode-oooq-container/e0c9113/logs/undercloud/var/log/mistral/ceph-install-workflow.log.txt.gz#_2018-09-28_08_26_15_87009:51
*** jtomasek has quit IRC09:56
*** shyamb has quit IRC09:56
*** shyamb has joined #tripleo09:58
*** Petersingh|afk is now known as Petersingh10:03
openstackgerritMichele Baldessari proposed openstack/tripleo-heat-templates stable/pike: Do not disable ipv6 on loopback interface for epmd  https://review.openstack.org/60602610:07
*** ooolpbot has joined #tripleo10:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION10:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537410:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)10:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256010:10
*** ooolpbot has quit IRC10:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)10:10
*** AJaeger has left #tripleo10:14
*** Petersingh has quit IRC10:22
*** Petersingh_ has joined #tripleo10:22
*** Petersingh_ is now known as Petersingh|afk10:23
quiquellykarel: rings any bell ->  http://logs.openstack.org/74/590774/16/check/tripleo-ci-centos-7-undercloud-upgrades/4f31e68/logs/undercloud/home/zuul/undercloud_reinstall.log.txt.gz#_2018-09-28_09_23_52 ?10:25
ykarelquiquell, yes this is to do with the undercloud validation during reinstall10:26
ykareli remeber there are already patches for it10:26
ykarelto fix it10:26
quiquellykarel: I remember we were fixing something similar last ruck/rovering10:26
ykarelyes10:26
quiquellykarel: jpena maybe ?10:26
ykarelnope10:26
ykareljistr, jfrancoa10:27
*** shyamb has quit IRC10:27
quiquellykarel: Let's see if they heard the siren song10:27
quiquelljillr, jaosorior: Are you there guys ?10:28
ykarelquiquell, https://review.openstack.org/#/c/603523/10:28
jaosoriorquiquell: I'm around, what's up/10:28
ykareland rocky backport: https://review.openstack.org/#/c/605815/,not merged, quiquell in which release u saw that error10:28
quiquellykarel: master, but we need a promotion10:29
quiquelljaosorior: was calling jfrancoa sorry10:29
ykarelquiquell, ack10:29
quiquelljaosorior: my fingers are like "manojo de pollas" sometimes10:29
jaosoriorhahaha fair enough10:30
*** abishop has joined #tripleo10:30
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: Fix overcloud ARA data collection  https://review.openstack.org/60567810:30
quiquelljaosorior: btw, gates are good now ?10:33
jaosoriorstill big zuul queues. However, haven't noticed many timeouts10:33
jaosoriorso it's better10:33
quiquelljaosorior: ack10:34
jfrancoaquiquell: ykarel: right that's the patch to fix that issue. I proposed a different one but I abandoned it in favor of that one10:34
quiquelljfrancoa: I suppose we have to wait a promotion to have it10:34
quiquelljfrancoa: Or do we have already promoted ?10:35
quiquelljaosorior, jfrancoa: To have overcloud ARA correctly collected https://review.openstack.org/#/c/60567810:36
quiquellmuch needed to debug timeouts10:36
jfrancoaquiquell: no idea, I guess it's that according to the log.10:36
quiquelljfrancoa: you where near10:37
jaosoriorquiquell: ack, thanks10:37
*** sri_ has quit IRC10:50
quiquellTengu: tht change worked10:54
Tenguquiquell: what change?10:57
*** dtantsur|afk is now known as dtantsur10:57
Tengubeing thanked while I did nothing: feels weird :D10:57
quiquellTengu: https://review.openstack.org/#/c/605989/11:00
*** shyamb has joined #tripleo11:01
Tenguah, that one. ok :D11:01
*** Petersingh|afk is now known as Petersingh11:02
*** med_ has joined #tripleo11:02
openstackgerritQuique Llorente proposed openstack/tripleo-quickstart master: DNM: To test job variables  https://review.openstack.org/60602011:04
*** hjensas has joined #tripleo11:05
*** rfolco has quit IRC11:07
*** jpena is now known as jpena|lunch11:07
*** ssbarnea|bkp has joined #tripleo11:08
quiquellssbarnea|bkp: you there ?11:08
ssbarneaquiquell: yes.11:09
ssbarneaquiquell: can I help with something?11:09
*** Aelia has joined #tripleo11:09
quiquellssbarnea: #oooq11:10
*** ooolpbot has joined #tripleo11:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION11:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537411:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)11:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256011:10
*** ooolpbot has quit IRC11:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,Triaged] - Assigned to Jiří Stránský (jistr)11:10
*** jjoyce has quit IRC11:15
*** ratailor has quit IRC11:17
*** jjoyce has joined #tripleo11:17
Aeliahello11:22
openstackgerritCédric Jeanneret proposed openstack/tripleo-specs master: Validation Framework specifications  https://review.openstack.org/58916911:22
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart master: move tripleo-ci release files inside CentOS-7 folder  https://review.openstack.org/60564211:23
Tenguflorianf: care to re-check? I just addressed slagle comment regarding namespace (re: validation framework)11:23
AeliaI have something really weird happening after I deployed successfully a ceph node on pike containerized. The OSD containers fail because /dev/vde1 does not exist. I can see no reference to that in the ceph-install-workflow.log on undercloud ...11:24
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart-extras master: Add podman support to validate-tempest role  https://review.openstack.org/60535611:24
Tenguflorianf: guess we have now a spec that meets all the requirements and should make ppl happy :)11:24
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart master: Switch fs027 to deploy with podman  https://review.openstack.org/60051711:24
Tenguflorianf: sorry for taking so long addressing that - had "some" other things on the desk ^^'11:24
Tengugfidente: care to check with Aelia possible issue on ceph-ansible?11:25
Aeliaceph-ansible created a gpt partition table, but no partitions were created11:25
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart master: Create soft links for tripleo-ci release files  https://review.openstack.org/60564311:26
*** rfolco has joined #tripleo11:26
openstackgerritJames Slagle proposed openstack/tripleo-common stable/rocky: Handle non-existant plan when getting deployment status  https://review.openstack.org/60603911:29
fultonjAelia: did you clean your disks before depoying?11:29
fultonjAelia: which task did ceph-ansible fail on as per ceph-install-workflow.log?11:30
Aeliafultonj: qcow2 images created specifically for this test. the deployment succeded no error reported. (I am testing on VMs with vbmc for ironic)11:31
openstackgerritHarald Jensås proposed openstack/tripleo-heat-templates master: In process-templates script write output files to provided dir when using base path  https://review.openstack.org/60573611:31
*** udesale has quit IRC11:32
florianfTengu: Done11:32
florianfTengu: \o/11:32
Tenguflorianf: great, thanks!11:32
Tenguslagle: if you have a minute just to validate one last time? https://review.openstack.org/58916911:32
fultonjAelia: so you'll want to debug directly on the ceph containers as described in https://hub.docker.com/r/ceph/daemon/11:33
openstackgerritUdi Kalifon proposed openstack/tempest-tripleo-ui master: Selenium infra  https://review.openstack.org/60542411:33
florianfakrivoka: woudl you like to have a last look as well: https://review.openstack.org/#/c/589169/11:33
florianf*would11:33
Aeliafultonj: I have a cluster working, I scaled up with one new node.11:33
Aeliabut the OSDs containers on the new node are not starting.11:34
fultonjAelia: so you were adding a new node with N OSDs to an existing cluster11:34
fultonjdid all OSDs fail?11:34
fultonjon the new node11:34
Aeliafultonj: yes, and yes. the reason it fails, is that /dev/vd{b,c,d,e} do not have the partitions expected by the container11:35
openstackgerritHarald Jensås proposed openstack/tripleo-heat-templates master: Merge new params - nic-config templates  https://review.openstack.org/60580711:35
Aeliathis is the only thing logged by the container before terminating and being restarted by systemd11:35
Aeliafultonj: 2 lines of log only -> "2018-09-28 11:27:54  /entrypoint.sh: static: does not generate config" and "mount: special device /dev/vde1 does not exist"11:36
fultonjyou should be able to 'sgdisk -Z /dev/sdX' for each X and redo11:36
fultonjyou need to ensure the disk is clean11:36
fultonjsounds like somewhere in the middle something ahppened which is now getting in the way11:37
openstackgerritMerged openstack/tripleo-heat-templates master: Tag tasks in in common tasks  https://review.openstack.org/60325011:37
fultonjor see Zap a device11:37
fultonjunder https://hub.docker.com/r/ceph/daemon/11:37
fultonjAelia: you can have systemd stop restarting the container11:37
fultonjrestart=true --> false in the unit file11:37
*** Petersingh is now known as Petersingh|afk11:38
fultonjthere should be a prepare container run and then it activates them11:38
*** lblanchard has joined #tripleo11:38
fultonjif you look at that URL ^11:39
fultonjyou'll see "Deploy an OSD11:39
fultonj"11:39
Aeliafultonj: ok I have used the sgdisk -Z method. Will try to deploy again.11:39
fultonjdescribing what ceph-ansibel coordinates to get your OSD ready11:39
fultonjsomewhere in that process things went wrong11:39
fultonjyou can try to do it manually to find it or have ceph-ansible do it again11:40
*** agopi|brb is now known as agopi11:40
*** ssbarnea|bkp has quit IRC11:40
fultonjif it fails repeatedly you'll need to follow the aove in deatils and see ceph-ansible tasks to see what's going and where it's getting stuck11:40
openstackgerritSergii Golovatiuk proposed openstack/tripleo-heat-templates stable/pike: Always lowercase role name  https://review.openstack.org/59858811:40
*** shyamb has quit IRC11:41
*** shyamb has joined #tripleo11:41
*** panda|off is now known as panda11:42
*** Petersingh|afk has quit IRC11:42
openstackgerritSergii Golovatiuk proposed openstack/tripleo-heat-templates stable/ocata: Always lowercase role name  https://review.openstack.org/59858911:42
akrivokaflorianf: ack, looking11:42
Tenguakrivoka: thanks :)11:42
Aeliafultonj: but what I find strange is that in the ceph-ansible logs I have this -> 2018-09-28 11:26:16,237 p=21839 u=mistral |  ok: [10.27.100.12] => (item=/dev/vdb) => {"changed": false, "cmd": "parted --script /dev/vdb print | egrep -sq '^ 1.*ceph'", "delta": "0:00:00.036621", "end": "2018-09-28 09:26:16.071592", "failed_when_result": false, "item": "/dev/vdb", "msg": "non-zero return code", "rc": 1,11:44
Aelia"start": "2018-09-28 09:26:16.034971", "stderr": "Error: /dev/vdb: unrecognised disk label", "stderr_lines": ["Error: /dev/vdb: unrecognised disk label"], "stdout": "", "stdout_lines": []}11:44
Aeliaso apparently before the ceph-ansible run, there was no partition table on the disk, it was created by ceph-ansible ...11:44
*** yolanda has joined #tripleo11:44
fultonj"parted --script /dev/vdb print | egrep -sq '^ 1.*ceph'"11:44
openstackgerritBogdan Dobrelya proposed openstack/paunch master: Add support for --cap-add to add capabilities  https://review.openstack.org/60604211:46
Aeliafultonj: after that I have an action with "cmd": ["parted", "-s", "/dev/vdb", "mklabel", "gpt"] for vdb11:46
Aeliafultonj: but no action to create any partition on vdb11:46
fultonjare you colocating?11:46
fultonjthe journal or using a sep journal disk?11:47
fultonjhttps://github.com/ceph/ceph-ansible/blob/824ec6d256fc23794d69dd82f789fb05ef5c7bb6/roles/ceph-osd/tasks/check_gpt.yml#L1111:47
*** assassin has quit IRC11:47
akrivokaTengu: florianf: looks good!11:47
Tengubogdando: you create hobbit capacity? :D11:47
Tenguakrivoka: good news :)11:47
bogdandoTengu: :)11:47
Aeliafultonj: should be at least the others are. But for this one I am overriding variables using https://docs.openstack.org/tripleo-docs/latest/install/advanced_deployment/node_specific_hieradata.html11:48
fultonjAelia: cool11:48
Aeliathe only variable I override is the "devices" to set other block devices than the other nodes11:48
fultonjyep11:49
fultonjAelia: i really think your disks are not clean11:49
holser_Tengu, quiquell - my concern is line 5911:50
holser_5811:50
holser_where we need to put when: remove_xinetd_pkg|bool11:50
holser_that's all11:50
fultonjwhat does lsblk return?11:50
Tenguholser_: ah, that one. well. probably no need to ignore errors on the package itself.11:50
Aeliafultonj: well the deployment is ongoing now so I will keep you informed11:51
Tenguholser_: heee... nope, there we can ignore the error - it fails it the service isn't defined.11:51
Tenguholser_: the service won't be defined if the package is removed.11:51
Aeliafultonj: I already destroyed the partition tables with " sgdisk -Z "11:51
fultonjgood11:51
fultonji think that will help11:52
holser_Tengu agree11:52
quiquellTengu, holser_: So just at service is enough ?11:52
fultonjAelia: i basically clean my nodes w/ ironic between deployments11:52
Tenguholser_: so 2 ways to do things: either ignore_errors, or do a pre-detection using "systemd" module, that will provide the state (defined, running, and so on)11:52
Tenguquiquell: yep11:52
holser_Tengu we detect on upgrades11:52
holser_on systemd level11:52
holser_let me show the sample11:52
quiquellTengu: ack will do11:53
*** mrsoul has quit IRC11:54
*** mschuppert has quit IRC11:54
openstackgerritQuique Llorente proposed openstack/tripleo-heat-templates master: Ignore errors at xinetd stop/uninstall  https://review.openstack.org/60598911:55
quiquellholser_: ^11:55
Tenguholser_: more over, we actually want to deactivate that service whatever is the step - the removal is optional though11:56
*** assassin has joined #tripleo11:56
Tenguholser_: this is a cleanup step, and it's already well used for now containerized services.11:56
*** mburned_out is now known as mburned11:57
holser_well if we have error ... for instance it was not stopped11:57
holser_for some reason11:57
holser_playbook will continue rather than killing it for sure11:58
Tenguholser_: yeah, so this joins the other solution I proposed in the comment :).11:58
Tenguuse systemd in order to detect service presence, and do whatever is needed.11:58
openstackgerritNicolas Hicher proposed openstack-infra/tripleo-ci master: provider: Add vexxhost  https://review.openstack.org/59643211:58
Tenguthat said - xinetd is a simple service, and it's already empty - i.e. there isn't any custom service running in it, as rsync and the other one were already removed.11:58
Tengubut indeed, I could have done that in a more.... "now you shut the f**k up and die already" way :)11:59
holser_Tengu I love your last sentence ...11:59
holser_that's my point11:59
openstackgerritSorin Sbarnea proposed openstack-infra/tripleo-ci master: fedora28 standalone job definition  https://review.openstack.org/59537411:59
Tenguholser_: so, quiquell might use the systemd module, if service is present then stop/disable it with the current code, and drop the "ignore_errors".12:00
holser_https://github.com/openstack/tripleo-heat-templates/blob/master/docker/services/iscsid.yaml#L124-L14512:00
*** mmethot has joined #tripleo12:00
*** EmilienM is now known as EvilienM12:00
openstackgerritSorin Sbarnea proposed openstack-infra/tripleo-ci master: fedora28 standalone job definition  https://review.openstack.org/59537412:00
Tenguthat way: we trigger the stop/disable IFF the service is loaded, and it fails if systemd can't kill it properly12:00
holser_systemd will do the magic12:00
Tengueeewwww12:00
Tengudon't use "command" please X(12:00
*** mschuppert has joined #tripleo12:01
jaosoriorzzzeek: around?12:01
Tenguholser_: https://docs.ansible.com/ansible/latest/modules/systemd_module.html#systemd-module  so "systemd: name: xinetd" with a register, and you should get its status.12:02
jaosoriorzzzeek: I took a read at the latest version of the global galera spec. It looks good overall; I only left one request in the patch. If that's addressed, It's +2 from my side.12:02
*** shyamb has quit IRC12:02
*** shyamb has joined #tripleo12:03
EvilienMchandankumar: thanks for taking care of https://review.openstack.org/#/c/600517/12:03
EvilienMchandankumar: nice work on https://review.openstack.org/#/c/605356/12:03
EvilienMchandankumar: I haven't figured the issues that we saw yesterday, I couldn't reproduce and it seems to be random and environmental. We'll see.12:04
*** jpena|lunch is now known as jpena12:05
openstackgerritBogdan Dobrelya proposed openstack/tripleo-heat-templates master: Use cap sysadmin for Neutron/OVN agents  https://review.openstack.org/60604512:05
jaosoriorflorianf: could you check this out https://review.openstack.org/#/c/602007/ ?12:06
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart master: [WIP] Enable full tempest api and scenario tests for basic services  https://review.openstack.org/60604612:09
*** ooolpbot has joined #tripleo12:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION12:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537412:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)12:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256012:10
*** ooolpbot has quit IRC12:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,In progress] - Assigned to Jiří Stránský (jistr)12:10
openstackgerritJames Slagle proposed openstack/tripleo-common stable/rocky: Don't fail tripleo-bootstrap on package installs  https://review.openstack.org/60604712:11
openstackgerritJames Slagle proposed openstack/tripleo-common stable/rocky: Don't fail tripleo-bootstrap on package installs  https://review.openstack.org/60604712:12
quiquellEvilienM: was thinking about reducing jobs time, how feasible would be to have images with local docker container on them ?12:12
EvilienMquiquell: we talked about it 2 days ago, it takes a very big image that infra is unlikely willing to store12:13
openstackgerritBob Fournier proposed openstack/tripleo-heat-templates master: In process-templates script write output files to provided dir when using base path  https://review.openstack.org/60573612:13
quiquellEvilienM: even sharing layers ?12:13
openstackgerritJames Slagle proposed openstack/tripleo-heat-templates stable/rocky: Tag step plays  https://review.openstack.org/60604812:14
openstackgerritJames Slagle proposed openstack/tripleo-heat-templates stable/rocky: Remove "when failed" from debug task names  https://review.openstack.org/60604912:14
EvilienMquiquell: http://eavesdrop.openstack.org/irclogs/%23tripleo/%23tripleo.2018-09-26.log.html#t2018-09-26T20:41:1712:14
openstackgerritJohn Fulton proposed openstack/tripleo-common stable/rocky: Update swift_rings_backup workflow to also backup ceph fetch dir  https://review.openstack.org/60477312:14
*** dprince has joined #tripleo12:15
*** rh-jelabarre has joined #tripleo12:15
openstackgerritJames Slagle proposed openstack/tripleo-heat-templates stable/rocky: Remove "when failed" from debug task names  https://review.openstack.org/60604912:15
openstackgerritJohn Fulton proposed openstack/tripleo-heat-templates stable/rocky: Persist ceph-ansible fetch_directory using config-download  https://review.openstack.org/60477212:15
fultonjgfidente: do you mind voting on those two ^ ?12:16
fultonj(again)12:16
fultonjclean cherry picks12:16
gfidentefultonj ack done12:17
fultonjthanks12:17
*** rfolco has quit IRC12:18
openstackgerritMerged openstack/tripleo-specs master: Remove the redundant word  https://review.openstack.org/59479912:19
openstackgerritJames Slagle proposed openstack/tripleo-heat-templates stable/rocky: Tag tasks in in common tasks  https://review.openstack.org/60605112:19
florianfjaosorior: yes, taking a look12:21
Aeliafultonj: ceph-ansible has finished, and I am exactly in the same state as before ...12:21
openstackgerritSorin Sbarnea proposed openstack-infra/tripleo-ci master: fedora28 standalone job definition  https://review.openstack.org/59537412:21
fultonjAelia: lsblk12:21
Aeliafultonj: "vdb    252:16   0  50G  0 disk" no partition12:22
fultonjwant to put that in a pasteing?12:22
fultonjpastebin12:22
fultonjlsblk | curl -F 'f:1=<-' ix.io12:22
fultonjsend me output of ^12:22
*** weshay is now known as weshay_ruck12:23
Aeliafultonj: I used a gist -> https://gist.github.com/dabelenda/2ecf1b21a90a50ef8572763374f2e0e712:23
fultonjAelia: are you running with CephAnsibleVerbosity set to any value >0 ?12:24
*** rlandy has joined #tripleo12:24
Aeliafultonj: not overriden in my environment files12:26
*** skramaja has quit IRC12:26
mariosgfidente: ping when you're ready thanks12:26
* marios there12:26
jaosoriorflorianf: thanks12:27
fultonjAelia: https://docs.openstack.org/tripleo-docs/latest/install/advanced_deployment/ceph_config.html#override-ansible-run-options12:27
fultonjfor next time  you run it ^12:27
fultonj"the ceph-ansible parameters that are passed as overrides as described in this document, are stored on the undercloud in a directory that matches the pattern /tmp/ansible-mistral-action*"12:27
Aeliafultonj: ah -> CephAnsiblePlaybookVerbosity: 112:28
fultonjAelia: ok, good12:28
fultonjls /tmp/ansible-mistral-action*12:28
*** ykarel_ has joined #tripleo12:28
fultonjAelia: look in the ceph-ansible inventory and make sure the node override is passing the correct disk list12:29
fultonjAelia: ensure you're looking at the latest one (ls -lhtr)12:29
*** med_ has quit IRC12:31
openstackgerritMerged openstack/tripleo-specs master: Validation Framework specifications  https://review.openstack.org/58916912:31
*** ykarel has quit IRC12:31
Aeliafultonj: I updated the gist showing the override of devices is ok12:31
*** ykarel_ is now known as ykarel12:31
Tenguso, see you folks. Happy weekend, see you on Monday ;).12:32
fultonjAelia: ceph-ansible must have done something with OSD tasks on those disks12:32
fultonjin the run log, you should be able to trace the tasks12:32
fultonjrealabive to the OSD role12:33
EvilienMbogdando, bandini: for the After vs Wants thing, I think I can do it on a separated patch, maybe we can go ahead with https://review.openstack.org/#/c/600849/12:33
fultonjrelative*12:33
Aeliathe log containing vdb for example is really short I will put it into the gist too12:34
*** agopi is now known as agopi|brb12:34
*** jcoufal has joined #tripleo12:34
*** tzumainn has joined #tripleo12:34
Aeliafultonj: done12:34
bandiniEvilienM: I am fine with that12:35
EvilienMbandini: I will iterate on this code during the milestone12:35
EvilienMbandini: but as is, it worked for the undercloud12:35
EvilienMit worked (tm)12:35
bandini:D12:36
fultonjAelia: you want to see more context around that12:36
fultonjwhich task was doing this?12:36
*** Petersingh|afk has joined #tripleo12:36
fultonjless the log and look for 14:15:34,54712:36
fultonjthen compare that task via the ceph-ansible code for the version of it you're using12:37
fultonjwith the output12:37
openstackgerritMehdi Abaakouk (sileht) proposed openstack/tripleo-heat-templates stable/pike: Add a way to override base path when file driver is used  https://review.openstack.org/60128612:37
*** raildo has joined #tripleo12:38
*** agopi|brb has quit IRC12:39
Aeliafultonj: I changed a bit the grep command to add -B412:39
Aeliathis is sufficient for in this case to show the TASK names12:39
Aeliafultonj: gist updates12:39
Aelias/s/d/12:39
openstackgerritQuique Llorente proposed openstack/tripleo-heat-templates master: Add a fact checking xinetd service present  https://review.openstack.org/60598912:41
fultonjis Sdc on your other servers?12:41
quiquellholser_: ^ this is it ?12:41
fultonjwhile Vdc is on the new one?12:41
Aeliafultonj: yes all other servers are in "Sdc" and only the new one has "Vdc"12:42
*** trown|outtypewww is now known as trown12:43
fultonjAelia: http://paste.openstack.org/show/731095/12:43
*** Petersingh|afk is now known as Petersingh12:43
openstackgerritKamil Sambor proposed openstack/python-tripleoclient master: Add fixture to replace multiple mocks  https://review.openstack.org/60041512:43
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Support podman when tagging container for Pacemaker  https://review.openstack.org/60418012:43
*** artom has quit IRC12:44
fultonjAelia: http://ix.io/1nKP12:44
fultonjwith a little cleaning and jq12:44
Aeliafultonj: ok, but I am not sure what you want to show me there :D12:45
fultonjTASK [ceph-osd : systemd start osd container12:45
fultonjAelia: run the ExecStart of that on your system with the problem12:46
fultonj /usr/share/ceph-osd-run.sh vdb12:46
fultonjAelia: you need to debug on the container itself12:46
fultonjI need to get back to my patch now12:47
*** raildo has quit IRC12:47
fultonjbut you'll need to foucus on the container failing to do what it needs to do12:47
Aeliaok... but the container is starting, it fails immediately after though12:47
fultonjAelia: right, find out why12:48
fultonjdocker ps -a12:48
fultonjdid the prepare contianer finish correctly?12:48
Aeliait says /dev/vdb1 does not exist.12:48
openstackgerritQuique Llorente proposed openstack/tripleo-heat-templates master: Add a fact checking xinetd service present  https://review.openstack.org/60598912:48
fultonjwas the prepare container unable to make it?12:49
*** shyamb has quit IRC12:49
fultonjor why was it unable to12:49
fultonjAelia: as per Deploy an OSD12:49
fultonjfrom https://hub.docker.com/r/ceph/daemon/12:49
fultonjthere's a prepare option12:50
fultonjrun that manually to see what it's hitting12:50
holser_quiquell +!12:50
holser_+!12:50
fultonjAelia: disable restart always in systemd too12:50
fultonjit will make troubleshooting harder12:50
quiquellholser_: testing it here https://review.openstack.org/#/c/59077412:51
AeliaError response from daemon: No such container: expose_partitions_vdb12:52
Aeliafultonj: I updated the gist with the complete output of /usr/share/ceph-osd-run.sh vdb12:53
fultonj08:49  fultonj: Aelia: as per Deploy an OSD12:54
fultonj08:49  fultonj: from https://hub.docker.com/r/ceph/daemon/12:54
fultonj08:50  fultonj: there's a prepare option12:54
fultonj08:50  fultonj: run that manually to see what it's hitting12:54
holser_quiquell quick question ... will Depends-On: https://review.openstack.org/#/c/605989/ work?12:54
holser_I thought we need to put Change-ID12:54
fultonjnot the container option to start the OSD, but the container option to prepare it12:55
fultonjit should be making that partition, it seems to have failed so you need to find out why12:55
openstackgerritDaniel Alvarez proposed openstack/tripleo-heat-templates master: Configure http/https on OVN Metadata service to talk to Nova  https://review.openstack.org/60540612:55
bcafarelhttp://logs.openstack.org/75/596275/10/check/puppet-openstack-unit-4.8-centos-7/de27998/job-output.txt.gz#_2018-09-28_09_14_59_562301 cri installation failure (2.15.1 requires ruby 2.3), is that a known issue?12:57
bcafarelseen in https://review.openstack.org/#/c/596275/ gate issue, but quick launchpad search turns up empty12:58
bcafarelprevious checks passed but they were installing cri 2.6.112:58
jaosorioralright folks, I'm off. Have a good weekend everyone!12:59
marioshappy friday jaosorior12:59
quiquellholser_: It's better the full url, you can have more than one review with the same Change-Id13:00
*** jaosorior has quit IRC13:00
holser_good to know... thanks a lot13:00
*** raildo has joined #tripleo13:00
openstackgerritJames Slagle proposed openstack/python-tripleoclient master: Filter messages not from waiting execution  https://review.openstack.org/60552013:01
holser_indeed, sometimes we have same review but different branches13:01
holser_same change-id...13:01
slaglethrash: fyi, https://review.openstack.org/#/c/605520/ it seems to work now. but I had to fix ~100 tests13:01
slaglethrash: i'm not sure if that's good or bad :)13:01
slaglecuz all the tests assume all messages are going through, with no matching on the execution id13:02
quiquellholser_: more than welcome, we are supose to us it now, as infra guys suggets13:02
*** raildo_ has joined #tripleo13:03
*** boazel has quit IRC13:04
fultonjrunning standalone-edge.sh I hit this http://paste.openstack.org/show/73109813:04
*** raildo has quit IRC13:05
EvilienMquiquell: I guess if we wanted to do it, we would have to run the jobs in our cloud, to store the image in our side13:05
EvilienMquiquell: but again even with the container layers, it'll take a lot of space13:05
EvilienMquiquell: have you deployed an undercloud with a local registry? go do it and tell me how many GB container takes13:06
quiquellEvilienM: matbu is my here :-)13:06
quiquells/here/hero/13:06
weshay_rucklol13:06
*** rfolco has joined #tripleo13:06
quiquellEvilienM: Will check13:07
openstackgerritHarald Jensås proposed openstack/tripleo-heat-templates master: In process-templates script write output files to provided dir when using base path  https://review.openstack.org/60573613:07
openstackgerritHarald Jensås proposed openstack/tripleo-heat-templates master: Merge new params - nic-config templates  https://review.openstack.org/60580713:07
*** raildo_ has quit IRC13:08
*** raildo has joined #tripleo13:08
quiquellEvilienM: thanks for the irc snippet is gold13:08
*** ooolpbot has joined #tripleo13:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION13:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537413:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256013:10
*** ooolpbot has quit IRC13:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)13:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,In progress] - Assigned to Jiří Stránský (jistr)13:10
openstackgerritJames Slagle proposed openstack/tripleo-common master: Pass execution_id to tripleo.ansible-playbook.  https://review.openstack.org/60606413:11
openstackgerritJames Slagle proposed openstack/tripleo-common master: Fail multiple executions of config-download of the same plan  https://review.openstack.org/60606513:11
*** assassin has left #tripleo13:12
*** psachin has quit IRC13:13
*** agopi|brb has joined #tripleo13:14
*** agopi|brb is now known as agopi|afk13:14
*** chem has quit IRC13:15
*** chem has joined #tripleo13:15
Aeliafultonj: I managed to bootstrap correctly the OSD if I try to execute manually: /usr/bin/docker run -it --rm --net=host --privileged=true --pid=host --memory=3g --cpu-quota=100000 -v /dev:/dev -v /etc/localtime:/etc/localtime:ro -v /var/lib/ceph:/var/lib/ceph -v /etc/ceph:/etc/ceph -e OSD_TYPE=prepare -e OSD_FILESTORE=1 -e OSD_DMCRYPT=0 -e CLUSTER=ceph -e OSD_DEVICE=/dev/vdc  -e13:16
*** zul has quit IRC13:16
AeliaCEPH_DAEMON=OSD_CEPH_DISK_PREPARE --name=ceph-osd-overcloud-cephstorage-2-vdc docker.io/ceph/daemon:v3.0.7-stable-3.0-jewel-centos-7-x86_6413:16
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: WIP: extra volumes map  https://review.openstack.org/60272113:16
Aeliafultonj: and after that the container runs correctly with systemctl start $service13:16
fultonjAelia: nice13:17
*** zul has joined #tripleo13:17
dalvarezbeagles: mwhahaha can you guys please give some love to https://review.openstack.org/#/c/568858/  ?13:17
dalvarezthanks a lot13:17
fultonjAelia: so it's good you're up and running, i assume you can apply same process to other OSDs13:18
fultonji wonder why when ansible presumably did the same it didn't work out13:18
Aeliafultonj: something is weird -> # ceph status -> 2018-09-28 13:17:59.019785 7f52f4cdb700 -1 auth: unable to find a keyring on /etc/ceph/ceph.client.admin.keyring,/etc/ceph/ceph.keyring,/etc/ceph/keyring,/etc/ceph/keyring.bin: (2) No such file or directory13:18
Aeliaon the new ceph node, where as it works on older ceph nodes.13:18
fultonjthere's a ceph-ansible option to copy keys13:19
fultonjmight have had a default change13:20
fultonjyou can put keys there if you need to run 'ceph -s' on the osd13:20
fultonji normally only run that command from ceph mon13:20
Aeliaok13:20
*** ramishra has quit IRC13:22
*** amoralej is now known as amoralej|lunch13:23
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-quickstart-extras master: Add standalone upgrade role and playbook.  https://review.openstack.org/60473613:24
openstackgerritAthlan-Guyot sofer proposed openstack-infra/tripleo-ci master: New workflow for standalone upgrade.  https://review.openstack.org/60470613:25
*** derekh has quit IRC13:26
*** derekh has joined #tripleo13:26
*** arxcruz|doctor is now known as arxcruz13:27
openstackgerritAndreas Jaeger proposed openstack/ansible-role-container-registry master: Remove release-openstack-server  https://review.openstack.org/60607313:27
openstackgerritAndreas Jaeger proposed openstack/ansible-role-redhat-subscription master: Remove release-openstack-server  https://review.openstack.org/60607413:28
*** artom has joined #tripleo13:29
openstackgerritAthlan-Guyot sofer proposed openstack-infra/tripleo-ci master: New workflow for standalone upgrade.  https://review.openstack.org/60470613:29
openstackgerritQuique Llorente proposed openstack/tripleo-quickstart master: DNM: To test job variables  https://review.openstack.org/60602013:29
openstackgerritAndreas Jaeger proposed openstack/ansible-role-tripleo-cookiecutter master: Remove release-openstack-server  https://review.openstack.org/60607513:30
*** toure|gone is now known as toure13:30
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Switch Heat Launcher to use Podman instead of Docker when containerized  https://review.openstack.org/60607713:31
openstackgerritAndreas Jaeger proposed openstack/ansible-role-tripleo-modify-image master: Remove release-openstack-server  https://review.openstack.org/60607913:31
*** holser_ has quit IRC13:42
*** zzzeek has quit IRC13:43
*** artom has quit IRC13:43
*** artom has joined #tripleo13:44
*** zzzeek has joined #tripleo13:45
*** mcornea has joined #tripleo13:47
*** zzzeek has quit IRC13:48
*** amoralej|lunch is now known as amoralej13:49
fultonjEvilienM: before you ran standalone-edge.sh did you preconfigure the IP on your host or let TripleO do it for you? ; it's not doing it for me, so I think i need to preconfigure it13:49
*** zzzeek has joined #tripleo13:49
EvilienMfultonj: I didn't configure networking13:49
fultonjEvilienM: ok, so export IP=192.168.0.12 was configured by tripleo, thanks13:50
fultonji need to figure out why it's not happening forme13:50
*** artom has quit IRC13:52
*** Vorrtex has joined #tripleo13:52
EvilienMfultonj: wait, no I think the IP was configured when I deployed sorry13:52
fultonjEvilienM: np, i have snapshots :)13:53
fultonjthanks i'll go do that13:53
*** agopi|afk is now known as agopi13:53
weshay_ruckmarios, fyi.. the gate failures seem to be less timeouts and more failures13:54
weshay_ruckhttp://logs.openstack.org/57/595357/2/gate/tripleo-ci-centos-7-scenario004-multinode-oooq-container/ebbc2f9/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz13:54
weshay_ruckbased on that nodepool nodes13:54
mariosweshay_ruck: ack theres a mix. tried digging into some of the timeouts earlier but didn't find something its all during overcloud deploy afaics13:55
mariosweshay_ruck: looking13:55
mariosweshay_ruck: yesi was looking at this http://logs.openstack.org/24/567224/111/check/tripleo-ci-centos-7-scenario004-multinode-oooq-container/58802fa/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz same issue looke like13:56
thrashslagle: yeah... Not sure what to say about that one. lol13:57
*** ade_lee has joined #tripleo14:00
*** artom has joined #tripleo14:01
mwhahahaweshay_ruck, marios: that would indicate ceph-ansible problems or something14:02
mwhahahaweshay_ruck, marios: cause it's happening on scenario001/004. pretty sure that's screwed the whole gate14:03
mwhahahaEvilienM: -^ fyi14:03
openstackgerritmathieu bultel proposed openstack/tripleo-quickstart-extras master: Use subnodes groups for multinode roles and templates  https://review.openstack.org/60608714:03
mariosmwhahaha: ack yeah i recall the workflowtask are/were used for the ceph-ansible calls14:03
mariosmwhahaha: weshay_ruck i am finishing up some status and will try dig in a bit more there14:03
* fultonj looking at logs from ^14:03
fultonjhttp://logs.openstack.org/24/567224/111/check/tripleo-ci-centos-7-scenario004-multinode-oooq-container/58802fa/logs/undercloud/var/log/mistral/ceph-install-workflow.log.txt.gz#_2018-09-28_10_36_33_26514:04
slaglethrash: the API is difficult to use. as a consumer, what knowledge do I have that when I make an API call I need to open a zaqar websocket and start acting on messages?14:05
fultonjgfidente: ^ fyi14:05
slaglethrash: and likewise start ignoring messages not from the API call I made?14:05
openstackgerritJose Luis Franco proposed openstack/tripleo-heat-templates stable/pike: [Pike only] Pass DeployIdentifier in upgrade tasks.  https://review.openstack.org/60608914:05
*** Petersingh is now known as Petersingh|away14:06
openstackgerritJuan Badia Payno proposed openstack/tripleo-heat-templates master: WIP - Telemetry Framework  https://review.openstack.org/60572414:06
gfidentefultonj yeah I think there could be something wrong in the config_action module but it seems to be there unchanged from april14:06
*** Petersingh|away has quit IRC14:06
mwhahahadid we get a new version of ansible in rdo?14:06
fultonjansible-2.4.414:06
fultonjhttp://logs.openstack.org/24/567224/111/check/tripleo-ci-centos-7-scenario004-multinode-oooq-container/58802fa/logs/undercloud/var/log/yum.log.txt.gz14:06
mwhahahauh14:07
mwhahahathat's old14:07
mwhahahawtf14:07
gfidentemwhahaha yeah was thinking about some diffs in ansible itself14:07
mwhahahaoh that's queens14:07
mwhahahathat job is for queens14:07
gfidentemwhahaha still queens should be using 2.514:07
mwhahahano i'm pretty sure we were on 2.4 in queens14:08
gfidentemwhahaha right but I mean, we should be using 2.514:08
mwhahahanot necessarily14:08
cgoncalvescan we recheck changes for which verification failed for unit tests?14:08
gfidenteor I'll settle for 2.6 then14:08
mwhahahaso here's a previous successful run, http://logs.openstack.org/24/567224/110/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/264c07a/logs/undercloud/var/log/extra/rpm-list.txt.gz14:09
mwhahahawhich was on 2.4.414:09
mwhahahaso what changed14:09
*** ooolpbot has joined #tripleo14:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION14:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537414:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)14:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256014:10
*** ooolpbot has quit IRC14:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,In progress] - Assigned to Jiří Stránský (jistr)14:10
mwhahahasame version of ceph-ansible14:10
mwhahahaweird14:10
*** dxiri has quit IRC14:10
mwhahahait was likely a backport in tripleo-common14:12
thrashslagle: that's fair. Probably need some docs first of all.14:12
fultonjFWIW i have a queens undercloud deployed by oooq with ceph-ansible working ceph-ansible-3.1.0.0-0.rc21 using ansible 2.4.414:13
bogdandois it a known blocker From puppet-openstack-unit-4.8-centos-7:14:14
bogdando2018-09-27 20:49:15.971220 | centos-7 | Gem::InstallError: cri requires Ruby version ~> 2.3.14:14
bogdando2018-09-27 20:49:15.971352 | centos-7 | An error occurred while installing cri (2.15.1), and Bundler cannot continue.14:14
bogdando2018-09-27 20:49:15.971420 | centos-7 | Make sure that `gem install cri -v '2.15.1' --source 'https://rubygems.org/'`14:14
bogdando2018-09-27 20:49:15.971450 | centos-7 | succeeds before bundling. ?14:14
slaglethrash: i was thinking if we should go back to not using a global queue. at least it would eliminate this current issue. I'm not sure about UI implications though14:14
mwhahahai wonder if the mistral container has a different version of ansible14:14
bogdandohttps://review.openstack.org/#/c/596275/ has it for a while14:14
slaglethrash: in reality though, the UI shouldn't have to be relying on other API callers all having used the same queue14:14
fultonjmwhahaha: good thought, but i thought we said this was a queens job14:14
slaglethrash: the needed state/info should be within the API responses themselves14:15
fultonjthis is the master gate14:15
fultonj?14:15
mwhahahafultonj: it is, but i'm not sure how this changed in the last 2 days14:15
mwhahahafultonj: the logs i'm looking at are for a test ci job for queens14:15
mwhahahahttps://review.openstack.org/#/c/567224/11114:15
mwhahahait broke sometime on the 26th14:16
*** chem has quit IRC14:16
thrashslagle: There is no requirement, really. The CLI could pass whatever queue name it wants.14:16
*** chem has joined #tripleo14:16
mwhahahafultonj: and ansible/ceph-ansible are the same versions on the host so it makes me think a container thing maybe14:16
thrashslagle: but I agree... We shouldn't be relying on zaqar for the response. It should be in the output of the workflow itself.14:16
cgoncalvesanswering to my own question: no. pending merge of https://review.openstack.org/#/c/605350/14:17
openstackgerritBogdan Dobrelya proposed openstack/puppet-tripleo master: Fix wrapper containers for podman w/o sockets  https://review.openstack.org/60609514:17
mwhahahacgoncalves: yea puppet unit tests are still screwed14:17
fultonjhow do we look inside the mistral_executor container for that job then?14:18
mwhahahafultonj: i don't think we can, you could manually pull down the container and look ig uess14:19
mwhahahawe don't capture the contents of the containers14:19
cgoncalvesmwhahaha, the depends-on of ^ merged, but ^ is not queued at CI. recheck?14:19
fultonjdo we know the contianer version that's running on the UC?14:20
*** bnemec is now known as beekneemech14:20
mwhahahacgoncalves: ykarel cherry-picked the depends on which blocks it since those haven't merged14:20
mwhahahafultonj: oh this is queens, it's not containerized14:20
cgoncalvesnoooo :/14:20
fultonjthat's what was bending my mind14:20
fultonji figured i must not understand something14:20
*** bogdando has quit IRC14:20
*** holser_ has joined #tripleo14:21
therveslagle: What's the issue exactly?14:21
* mwhahaha sighs 14:22
mwhahahatoo many problems14:22
mwhahahai think we need to purge the gate and let ci settle, it's far too delayed14:22
gfidentethat original_basename seems to have transformed into _original_basename14:22
slagletherve: https://bugs.launchpad.net/tripleo/+bug/179427714:22
openstackLaunchpad bug 1794277 in tripleo "openstack overcloud failures|status sometimes shows incorrect output ( from deployment process) " [Medium,In progress] - Assigned to James Slagle (james-slagle)14:22
gfidentehttps://github.com/ceph/ceph-ansible/blob/v3.1.6/plugins/actions/_v2_config_template.py#L63814:23
slagletherve: we can't be adding new functionality to tripleoclient that make workflow calls that are intended to be used as other ongoing workflows14:23
cgoncalvespost queue has an astonishing 444 jobs pending14:23
slagletherve: due to the existing model we have where everything uses a single global "tripleo" zaqar queue14:23
therveslagle: Right. I think your filtering idea is good for that no?14:23
slagletherve: we could override that and go back to have each worfklow call generate a new queue with unique uuid.14:24
therveOh I didn't know that was the original design14:24
slagletherve: yea i think my fix is ok for tripleclient14:24
fultonjgfidente: ceph-ansible-3.1.6-1.el7.noarch against queens?14:24
fultonjyeah i gues that makes sense14:24
slagletherve: what i was pontificating about is more about the usablity of the API in general14:24
gfidentefultonj yes that is correct14:24
slagletherve: how is any consumer supposed to know how to use this outsdie of tripleoclient14:25
slagleand perhaps they aren't14:25
therveslagle: Well, it could be documented14:25
slagletherve: as a user, i make an API call by starting a workflow. now what?14:25
slaglei'd be lost14:25
slagleso that's why i feel odd about fixing this in tripleoclient14:26
therveOK I see14:26
slagletherve: and my patch breaks assumptions in about 100 unit tests :)14:26
slagleso was wondering if I had done the right thing or not :)14:26
mwhahahaweshay_ruck, EvilienM: i'm going to purge the gate unless there are any objections.  are there any patches i should leave in?14:26
slaglei also patched the code so that it would ignore the dummy ID value that unit tests use14:26
slaglefigured i'd get a -1 for that though :)14:27
EvilienMmwhahaha: I'm fine14:27
slagle*almost14:27
therveNot sure it broke assumptions or assumptions weren't right in the first place14:27
slagletherve: yes, exactly14:27
*** vkapalav has joined #tripleo14:28
therveslagle: And why did we switch to a single queue?14:28
gfidenteso there must be a problem with the ansible version because up to 2.5 the param was original_basename and from 2.6 it became _original_basename14:28
weshay_ruckmwhahaha, oh snap you never do that14:28
gfidentesomehow the version of ansible in the pike and queens jobs is bumped to include that change14:28
slagletherve: that's what i'm after as well. will need to check with jtomasek probably to see if it was a UI requirement14:28
weshay_ruckmwhahaha, marios what patches are you clearing it for?  I don't have any one my list14:28
weshay_ruckmarios, do you?14:28
mariosweshay_ruck: what are we clearing? /me readsback (but no i don't have a list of patches :) )14:29
mwhahahamarios: the gate, we need to understand the current state of things and land patches that'll stop the resets in the gate14:29
*** agopi is now known as agopi|afk14:29
mariosmwhahaha: ack i am just coming back here but was there a solution to that scen4 worfklow tasks thing14:30
mwhahahamarios: is that affecting master? or just queens?14:30
therveslagle: We could cheat and post to 2 queues as well14:30
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: WIP: read job variables at deploy playbooks  https://review.openstack.org/60601714:30
mwhahahaseems like it's also affecting pike14:31
openstackgerritQuique Llorente proposed openstack/tripleo-quickstart master: DNM: To test job variables  https://review.openstack.org/60602014:31
slagletherve: heh, yea, that may very well work14:31
*** quiquell is now known as quiquell|off14:31
gfidentemarios I think it's the version of ansible on the nodes14:31
gfidentemaking scenario4 failing in queens14:31
mariosmwhahaha: queens examples so far14:31
*** mjturek has joined #tripleo14:32
marioshttp://logs.openstack.org/24/567224/111/check/tripleo-ci-centos-7-scenario004-multinode-oooq-container/58802fa/logs/undercloud/var/log/extra/errors.txt.gz and http://logs.openstack.org/57/595357/2/gate/tripleo-ci-centos-7-scenario004-multinode-oooq-container/ebbc2f9/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz but let me also dig for master14:32
mwhahahamarios: i just saw a pike patch fail on scenario001/00414:32
*** cmurphy has left #tripleo14:32
marioshttps://review.openstack.org/#/c/603275/6 green here mwhahaha14:32
mwhahahamarios: https://review.openstack.org/#/c/604708/14:33
mariosack gfidente do we have a bug yet? weshay_ruck i'll file it if we dont14:34
mwhahahahttp://logs.openstack.org/08/604708/1/gate/tripleo-ci-centos-7-scenario001-multinode-oooq-container/e0c9113/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz#_2018-09-28_08_26_3114:34
gfidentemarios don't have a bug yet but I am pretty sure about the root cause14:34
gfidentemarios in 2.5 the copy module accepted original_basename parameter https://github.com/ansible/ansible/blob/stable-2.5/lib/ansible/modules/files/copy.py#L27214:34
gfidentemarios in 2.6 it doesn't anymore https://github.com/ansible/ansible/blob/stable-2.6/lib/ansible/modules/files/copy.py#L28614:34
mariosack mwhahaha so Q/P14:35
gfidentemwhahaha fultonj ^^14:35
mariosgfidente: does it make sense same root for P too?14:35
mwhahahamarios: i wonder if quickstart is installing a newer version?14:35
gfidentemarios for pike haven't checked, will do14:35
gfidenteso pike is showing a different problem14:36
gfidentehttp://logs.openstack.org/48/602248/4/check/tripleo-ci-centos-7-scenario004-multinode-oooq-container/52a00ae/logs/undercloud/var/log/mistral/ceph-install-workflow.log.txt.gz#_2018-09-28_11_46_31_84714:36
gfidentebut I think same root cause14:36
*** dxiri has joined #tripleo14:36
gfidentein pike we have slighly different workflow and ceph-ansible version14:37
weshay_ruckhttp://dashboard-ci.tripleo.org/d/cEEjGFFmz/cockpit?panelId=61&fullscreen&orgId=114:37
gfidentethough nothing changed recently in either14:37
gfidentein both14:37
gfidentein either?14:38
weshay_ruckmwhahaha, that is a lot canned air man.. there be dust bunny dragons?14:38
gfidentein neither?14:38
gfidentewhatever14:39
*** artom has quit IRC14:39
mwhahahaweshay_ruck: cats14:39
marioshttps://bugs.launchpad.net/tripleo/+bug/1795009 weshay_ruck gfidente fyi14:40
openstackLaunchpad bug 1795009 in tripleo "the scenario001/004 multinode-oooq-container job is failing for Queens and Pike for overcloud deployment error "overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:" " [Critical,Triaged] - Assigned to Giulio Fidente (gfidente)14:40
mariosgfidente: its yours btw congrats14:40
mariosyou sound like you know what you're doing14:40
gfidentemarios and why did you assign it to me!14:40
gfidenteWTF14:40
marios:)14:40
gfidenteI know why it's failing14:40
gfidentenot how to fix it14:41
mariosgfidente: ack re-assigning14:41
mariosgfidente: can you please add a comment there?14:41
gfidentemarios yes sorry with link14:41
*** ksambor has quit IRC14:43
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Allow to actually disable heat-native  https://review.openstack.org/60610014:43
mwhahahamarios, gfidente: is the ansible installed via quickstart accidently upgrading ansible on the undercloud or something?14:44
gfidentemarios so trying to be serious, there seems to be something updating further ansible at some point14:44
gfidenteyeah what mwhahaha said14:44
*** DirectorN00b has joined #tripleo14:45
DirectorN00bomg, i'm so happy I found this channel :-)14:45
gfidentewe're so happy to see you14:46
*** gfidente is now known as gfidenteN00b14:46
DirectorN00blol14:46
mariosmwhahaha: well the ansible version on the failing pike seems to be still 2.4  afaics ? http://logs.openstack.org/08/604708/1/gate/tripleo-ci-centos-7-scenario001-multinode-oooq-container/e0c9113/logs/undercloud/var/log/yum.log.txt.gz14:47
DirectorN00bAny of you guys use the "plan" aspect of director/triopleo? I am trying to figure out a few things. Even basic things like can I use multiple plans against my infrastructure to test out different template/env configs?14:47
mwhahahamarios: yea but doesn't quickstart pip install ansible?14:47
mwhahahamarios: http://logs.openstack.org/08/604708/1/gate/tripleo-ci-centos-7-scenario001-multinode-oooq-container/e0c9113/logs/undercloud/var/log/extra/pip.txt.gz14:47
mwhahahamarios: ansible 2.6.4 is pip installed14:48
mwhahahaglobally14:48
mwhahahaso did we break the venv in quickstart?14:48
weshay_ruckmwhahaha, qs should be doing it in a virtenv14:48
mwhahahaweshay_ruck: yes it *should* be :D but things never do what we think they do14:48
mariosmwhahaha: ack i see the 2.6 indeed14:48
mariosmwhahaha: looking for any recent commit (but why queens only not master that is strange)14:49
mwhahahamarios: it's likely a quickstart or quickstart-extras change in master that would affect this14:49
openstackgerritJohn Trowbridge proposed openstack/tripleo-common master: Add wrapper for openshift-ansible docker command  https://review.openstack.org/60539914:49
mwhahahamarios: actually i wonder if it's coming from the image14:50
mwhahahamarios: because quickstart is installing 2.5.x14:50
mwhahahai think we need to pip remove ansible14:52
mwhahahain prep14:52
DirectorN00bAlso, is rhel's "Director" just a red wrapper for tripleo? OR is is a fork? Or...14:52
mwhahahamarios, gfidenteN00b: http://logs.openstack.org/24/567224/109/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/8b476ee/logs/undercloud/var/log/extra/pip.txt.gz14:53
mwhahahaso we used to have 2.4.414:53
mwhahahaso something is upgrading it14:53
mariosmwhahaha: or pin version in requirements and let pip so its thing14:53
mwhahahawe shouldn't be using the pip version14:53
mariosmwhahaha: oh i see14:53
mwhahahaso something is pip installing ansible on the image14:54
DirectorN00bI'm trying to figure if it's behind and some bugs are still lingering, or whether it's me doing something.14:54
DirectorN00bI think this might be to blame, but not sure... https://review.openstack.org/#/c/530225/14:54
weshay_ruckhttp://logs.openstack.org/08/604708/1/gate/tripleo-ci-centos-7-scenario001-multinode-oooq-container/e0c9113/job-output.txt.gz#_2018-09-28_06_41_40_65887814:54
mwhahahaweshay_ruck: so i don't think it's quickstart because quickstart is install 2.5.714:55
weshay_ruckyup yup..14:55
weshay_ruckmwhahaha, it could be infra's ansible?14:55
mwhahahayes14:56
mwhahahaor even the images themselves14:56
mariosmwhahaha: weshay_ruck ack so we are already pinning in requrements14:56
mwhahahasince we're getting a version newer than we're expecting anywhere, i'd probably check the images first14:56
weshay_ruckmarios, yes.. we always have14:57
openstackgerritJuan Badia Payno proposed openstack/tripleo-heat-templates master: WIP - Telemetry Framework  https://review.openstack.org/60572414:57
weshay_ruckmwhahaha, what about updating the undercloud deployment ansible.cg14:58
weshay_ruckcfg14:58
weshay_ruckto use a specific known install of ansible14:58
mwhahahato do what?14:58
weshay_rucklike the overcloud14:58
openstackgerritUdi Kalifon proposed openstack/tempest-tripleo-ui master: Selenium infra  https://review.openstack.org/60542414:58
mwhahahawe need to figure out where this newer version is coming from14:58
mariosweshay_ruck: wondering where we can land some band-aid for now to pip remove it? like in https://github.com/openstack-infra/tripleo-ci/blob/master/playbooks/tripleo-ci/run-v3.yaml14:58
mwhahahamarios: yes that would be a stop gap14:59
openstackgerritMartin André proposed openstack/tripleo-common master: Add wrapper for openshift-ansible docker command  https://review.openstack.org/60539914:59
weshay_ruckmwhahaha, this is kind of a nightmare14:59
weshay_ruckwe also have osp shipping a different version of ansible14:59
mwhahahayes i am aware14:59
* weshay_ruck wonders if this is another reason to request a tripleo-centos image15:00
openstackgerritDavid Vallee Delisle proposed openstack/tripleo-heat-templates master: Validate that a detected ceph-disk is member of a cluster before considering that we need ceph-osd package  https://review.openstack.org/60610515:00
mwhahahahappy friday15:00
gfidenteN00bweshay_ruck funny thing ceph-ansible did wnt to upgrade ansible!15:01
gfidenteN00bweshay_ruck so we're serving great testing here15:01
weshay_ruckand then that15:01
weshay_ruckif only there was a company that could package linux binaries into a useful format15:01
weshay_ruckand sell support for that15:01
weshay_rucksome kind of package management15:02
*** Vorrtex has quit IRC15:02
* weshay_ruck installs an ansible flatpak15:02
gfidenteN00bit's a bit like running oooq in a container and install in the container image whatever version of ansible you want15:02
gfidenteN00bbut honestly15:02
gfidenteN00bthis is terrible on the ansible side15:02
weshay_ruckmwhahaha, marios if we start uninstalling infra's version of ansible then some infra tasks in post could fail15:03
gfidenteN00bbreaking compatibility at will15:03
gfidenteN00bevery minor update15:03
weshay_ruckmwhahaha, marios seems like this may require.. hrm... so kind of coordination15:03
mwhahahaweshay_ruck: they shouldn't be running ansible on the host itself, they run it from zuul15:03
mwhahahaweshay_ruck: so i don't think that's an issue15:04
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Allow to actually disable heat-native  https://review.openstack.org/60610015:05
openstackgerritMerged openstack/tripleo-quickstart-extras master: Use local ansible connection for libvirt repro  https://review.openstack.org/60501315:09
openstackgerritMerged openstack/tripleo-heat-templates master: Don't merge /etc/collectd.d  https://review.openstack.org/60312315:09
*** artom has joined #tripleo15:09
openstackgerritRussell Bryant proposed openstack/tripleo-docs master: Update standalone doc title.  https://review.openstack.org/60610915:09
*** ooolpbot has joined #tripleo15:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION15:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537415:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)15:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256015:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179500915:10
*** ooolpbot has quit IRC15:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,In progress] - Assigned to Jiří Stránský (jistr)15:10
openstackLaunchpad bug 1795009 in tripleo "the scenario001/004 multinode-oooq-container job is failing for Queens and Pike for overcloud deployment error "overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:" " [Critical,Triaged] - Assigned to Marios Andreou (marios-b)15:10
*** Vorrtex has joined #tripleo15:11
openstackgerritDaniel Alvarez proposed openstack/tripleo-heat-templates master: Configure http/https on OVN Metadata service to talk to Nova  https://review.openstack.org/60540615:12
*** bugzy has joined #tripleo15:14
*** mwhahaha changes topic to "Welcome to Rocky | CI Status: RED, DO NOT WORKFLOW OR RECHECK (unless explicitly for CI fixing) https://docs.openstack.org/tripleo-docs/latest/"15:15
*** chem has quit IRC15:16
*** chem has joined #tripleo15:16
*** bugzy_ has quit IRC15:17
*** dtrainor has quit IRC15:21
*** dtrainor has joined #tripleo15:21
*** iranzo has quit IRC15:23
openstackgerritMarios Andreou proposed openstack-infra/tripleo-ci master: WIP: test remove pip ansible as workaround for scen1/4  https://review.openstack.org/60611615:23
mariosweshay_ruck: incase you want to do that and cos i'm almost eod here15:24
marios^^^15:24
mariosbut not sure if that is too early15:24
mariosi.e. do we need to do that in toci_gate_test/quickstart?15:24
mariosnot sure where/when the pip install is happening15:24
*** Vorrtex has quit IRC15:27
*** Vorrtex has joined #tripleo15:28
*** leanderthal has quit IRC15:28
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Allow to actually disable heat-native  https://review.openstack.org/60610015:29
openstackgerritMarios Andreou proposed openstack/tripleo-heat-templates stable/queens: WIP testing the depends-on for +bug/1795009 workaround  https://review.openstack.org/60611815:32
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Switch Heat Launcher to use Podman instead of Docker when containerized  https://review.openstack.org/60607715:32
openstackgerritAlex Schultz proposed openstack/tripleo-quickstart-extras master: Fix quickstart undercloud selinux configuration  https://review.openstack.org/60270315:32
*** ykarel is now known as ykarel|away15:35
*** AJaeger has joined #tripleo15:36
*** chem has quit IRC15:37
AJaegerhttps://review.openstack.org/#/q/topic:update-zuul+projects:openstack/ansible-role are some reviews for a few ansible-role repos that have wrong Zuul setup, they use by error a release job in-repo. That one should be in project-config. Could you put them on your review list and merge once you unfreeze, please?15:37
*** chem has joined #tripleo15:39
*** zul has quit IRC15:41
*** boazel has joined #tripleo15:44
*** dxiri has quit IRC15:46
*** holser_ has quit IRC15:52
*** jfrancoa has quit IRC15:55
weshay_ruckmarios, mwhahaha ansible is built into the centos image in the python module path15:56
weshay_ruck9/20 centos image has __version__ = '2.4.2.0'15:57
* weshay_ruck will download the latest and see but cetainly it's at 2.615:57
EvilienM2.4.2.0 ?15:57
EvilienMmhh too old15:58
mariosweshay_ruck: tanks15:58
dtantsurhi folks! can someone please check this backport? https://review.openstack.org/#/c/601613/15:58
mariosthanks even weshay_ruck15:58
openstackgerritRedHat RDO CI proposed openstack/tripleo-quickstart-extras master: GATE CHECK for quickstart-extras  https://review.openstack.org/56044516:00
openstackgerritRedHat RDO CI proposed openstack/tripleo-heat-templates master: GATE CHECK for TripleO  https://review.openstack.org/60429816:00
weshay_ruckmwhahaha, I thought you were going to kill the queue16:00
mwhahahai did except for the stuff that would be useful in ci16:04
mwhahahalet me check what's left16:04
mwhahahai might have missed something16:04
mwhahahayea some stuff has snuck in afterwards16:05
openstackgerritmathieu bultel proposed openstack/python-tripleoclient master: Add 2h timeout when waiting for websocket messages on package_update  https://review.openstack.org/60432516:05
* mwhahaha slaps EvilienM's hand for approving stuff16:05
EvilienMah16:06
weshay_ruckha ha16:06
EvilienMthat's what happens when I try to be nice16:06
weshay_ruckhe's such a yes man16:06
openstackgerritEmilien Macchi proposed openstack/python-tripleoclient master: Allow to actually disable heat-native  https://review.openstack.org/60610016:07
* mwhahaha slaps gfidenteN00b's hand for approving stuff 16:07
*** ooolpbot has joined #tripleo16:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION16:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537416:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256016:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179500916:10
*** ooolpbot has quit IRC16:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)16:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,In progress] - Assigned to Jiří Stránský (jistr)16:10
openstackLaunchpad bug 1795009 in tripleo "the scenario001/004 multinode-oooq-container job is failing for Queens and Pike for overcloud deployment error "overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:" " [Critical,Triaged] - Assigned to Marios Andreou (marios-b)16:10
weshay_ruckmwhahaha, ignore me if you are in a mtg... which bug is address the ansible version, 1795009?16:12
gfidenteN00bmwhahaha that's bribe16:13
gfidenteN00bmwhahaha not just approving stuff16:13
weshay_ruckNOOOB16:14
openstackgerritMichele Baldessari proposed openstack/tripleo-heat-templates master: Remove artificial constrains around notification drivers  https://review.openstack.org/60612616:14
*** sanjayu_ has quit IRC16:15
mwhahahagfidenteN00b: https://bugs.launchpad.net/tripleo/+bug/179500916:16
openstackLaunchpad bug 1795009 in tripleo "the scenario001/004 multinode-oooq-container job is failing for Queens and Pike for overcloud deployment error "overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:" " [Critical,Triaged] - Assigned to Marios Andreou (marios-b)16:16
openstackgerritNicolas Hicher proposed openstack-infra/tripleo-ci master: provider: Add vexxhost  https://review.openstack.org/59643216:17
openstackgerritNicolas Hicher proposed openstack-infra/tripleo-ci master: provider: Add vexxhost  https://review.openstack.org/59643216:25
openstackgerritAthlan-Guyot sofer proposed openstack-infra/tripleo-ci master: New workflow for standalone upgrade.  https://review.openstack.org/60470616:25
*** zul has joined #tripleo16:27
*** ykarel_ has joined #tripleo16:29
*** akrivoka has quit IRC16:29
*** shyamb has joined #tripleo16:30
*** ykarel|away has quit IRC16:30
*** ykarel__ has joined #tripleo16:31
*** ykarel_ has quit IRC16:31
chandankumarEvilienM: sorry podman tempest need some more changes16:32
EvilienMchandankumar: no prob16:32
openstackgerritBen Nemec proposed openstack/os-collect-config master: Don't ignore SIGPIPE  https://review.openstack.org/60613316:34
chandankumarmwhahaha: EvilienM: https://review.openstack.org/#/c/605980/16:34
chandankumarsome fixes related to selinux part16:35
*** shyamb has quit IRC16:35
EvilienMchandankumar: lgtm16:35
*** shyamb has joined #tripleo16:35
openstackgerritRussell Bryant proposed openstack/python-tripleoclient master: Fix misspelling in deployment complete message.  https://review.openstack.org/60613416:36
*** dxiri has joined #tripleo16:36
chandankumarmwhahaha: EvilienM http://logs.openstack.org/46/606046/1/check/tripleo-ci-centos-7-standalone/968ace6/logs/undercloud/home/zuul/tempest.log.txt.gz16:38
chandankumaronly 18 failed tests on full tempest with standalone16:38
*** dxiri has quit IRC16:38
*** salmankhan has quit IRC16:38
chandankumarIf i make them passing it will be a good replacement for any job taking 1 hr 30 mins16:38
chandankumarEvilienM: https://review.openstack.org/60604616:39
*** dxiri has joined #tripleo16:39
openstackgerritHarald Jensås proposed openstack/tripleo-heat-templates master: Merge new params - nic-config templates  https://review.openstack.org/60580716:41
*** gkadam has quit IRC16:42
*** dtantsur is now known as dtantsur|afk16:45
*** rdopiera has quit IRC16:48
*** thrash is now known as thrash|f00dz16:49
DirectorN00bHi all. Not to sound repetative, but is redhat director just tripleo? I'm trying to get familiar with Director, but jut need to understand a bit of the basics.16:49
DirectorN00b(ie: overcloud plan methods)16:50
chemweshay_ruck: I've got an error during repo setup where https doesn't work because it point to an http scheme16:51
chemweshay_ruck: but first, hi :)16:51
weshay_ruck?16:51
weshay_ruckHI!16:51
weshay_ruckhachem16:51
weshay_ruckchem, link?16:51
*** jpich has quit IRC16:52
chemweshay_ruck: https://mirror.regionone.rdo-cloud.rdoproject.org:8080/rdo/centos7/0f/e2/0fe2e39140ff038ce66f43a478fc792e8a271fe2_b2d2686b/delorean.repo16:52
weshay_ruckovb job?16:52
chemweshay_ruck: no standalone-upgrade testing16:52
weshay_ruckwhy did it get the rdo mirror16:53
weshay_ruckthat's odd16:53
chemweshay_ruck: reproducer script in rdo16:53
weshay_ruckOH16:53
chemweshay_ruck: the thing is that curl http://... work fine16:53
chemweshay_ruck: no "s"16:54
weshay_ruckhrm16:54
weshay_ruckk16:54
weshay_ruckpokes16:54
* weshay_ruck pokes around16:54
*** derekh has quit IRC16:55
weshay_ruckchem, hrm..http://git.openstack.org/cgit/openstack/tripleo-quickstart-extras/tree/roles/nodepool-setup/templates/mirror_info.sh.j2#n6616:56
weshay_ruckchem, however our release files have https16:56
chemso we substitue but switch from http to https right ?16:57
chemweshay_ruck: ^16:57
*** jpena is now known as jpena|off16:57
weshay_ruckhttps://github.com/openstack/tripleo-quickstart/blob/master/config/release/tripleo-ci/master.yml#L4416:58
chemweshay_ruck: yeah that was I had in mind, so it happen there .. I'll try to hardcode http there and let you know16:59
weshay_ruckchem, I wonder if we exposed a bug by including only the tasks16:59
weshay_ruck chem ya.. I think the config may need to change from https to http16:59
weshay_ruckchem, is that working as expected btw?16:59
weshay_ruckthe include_role: task: foo.yml16:59
chemweshay_ruck: yeah it seems it point to master and all17:00
*** ykarel__ has quit IRC17:00
*** shyamb has quit IRC17:03
*** jaganathan has quit IRC17:06
chemweshay_ruck: well, it's confusing and I need to go, I'll look at the result in the ci later on17:08
weshay_ruckk17:08
*** psachin has joined #tripleo17:09
*** gfidenteN00b has quit IRC17:10
*** ooolpbot has joined #tripleo17:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION17:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537417:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)17:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256017:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179500917:10
*** ooolpbot has quit IRC17:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,In progress] - Assigned to Jiří Stránský (jistr)17:10
openstackLaunchpad bug 1795009 in tripleo "the scenario001/004 multinode-oooq-container job is failing for Queens and Pike for overcloud deployment error "overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:" " [Critical,Triaged] - Assigned to Marios Andreou (marios-b)17:10
*** trown is now known as trown|lunch17:11
DirectorN00bSo, "openstack overcloud plan delete this-plan" --> Cannot delete a plan that has an associated stack.17:19
EvilienMchandankumar: ack, I would show https://review.openstack.org/#/c/606046/ to mwhahaha as well17:19
DirectorN00bopenstack stack list shows a stack name associated with CREATE_FAILED17:19
DirectorN00bCan I just "stack delete" this and then remove the plan?17:20
slagleDirectorN00b: yes, although openstack overcloud delete will delete both17:20
DirectorN00bslagle: It's not a new overcloud, just a new plan. Can I mix plans with a single overcloud?17:21
slagleno17:21
DirectorN00b(or am I confusing terminologies)17:21
slagleopenstack overcloud delete <name> -- will delete both a stack and plan with that <name>17:22
DirectorN00bslagle: Oh. So, I cannot use one plan against an overcloud, and then when it fails, create a new plan and try to execute that against the overcloud with modified template information?17:22
DirectorN00bokay, I am executing the overcloud delete for that plan.17:23
slagleyou can also just re-run the same deployment command and will update both the existing plan, and then try and update the stack17:24
DirectorN00bThough that tends to suggest deleting of the nodes already deployed :-)17:24
*** jtomasek has joined #tripleo17:24
DirectorN00bslagle: I am struggling with that bit. I can create a plan with --template, but then when I ant to update those templates in the plan, I cannot see how to do this.17:24
DirectorN00bI mean, I can update env stuff easy (just re-include with parameters) but the templates once modified...17:25
slagle"openstack overcloud deploy" updates both the plan and stack17:25
slagleit will save the updated templates in the plan, then do a stack-update with Heat17:26
DirectorN00bI thiought I tried that, and when I exported it, the templates were the same17:26
DirectorN00bokay, that plan has gone now, thanks.17:27
DirectorN00bSo, if I go and change a template now (/usr/share/openstack-tripleo-heat-templates) and then tell the plan to export (container save) to a local folder here, then that should have the updated template details?17:28
DirectorN00b(Sorry if i'm getting confused. I'm still trying to wrap my head around what is going on)17:28
slagleno, you'd have to actually run openstack overcloud deploy after making the change for the plan to get updated17:29
slaglethen if you did export, you should see the change17:29
DirectorN00bAh, I See.17:30
DirectorN00bokay, let me have a tinker with that - very much apprieciated feedback!!17:30
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart-extras master: Add podman support to validate-tempest role  https://review.openstack.org/60535617:31
DirectorN00bslagle: Why do I have to deploy it before I can see the change it has made? on reflection that seems a little backward :-/17:37
slagleDirectorN00b: you can use openstack overcloud deploy with --update-plan-only if you only want to update the plan and skip the stack update17:42
DirectorN00bslagle: oooooh That's very good to know, thankyou :-)17:47
*** dsneddon has joined #tripleo17:47
*** dsneddon has quit IRC17:47
DirectorN00bSeems logical you'd want to see the effects of what you are changing before pressing the "push to all nodes as an update!" button :-)17:48
*** dsneddon has joined #tripleo17:49
*** agopi|afk is now known as agopi17:51
DirectorN00bERROR configuring gnocci.17:57
DirectorN00bI think this might be to blame, but not sure... https://review.openstack.org/#/c/530225/17:57
DirectorN00bSo I'll get rid of that, and then redeploy.17:57
EvilienMdon't blame me17:57
EvilienMgnocci doesn't exist, it's gnocchi17:58
openstackgerritMerged openstack/ansible-role-tripleo-modify-image master: Remove compare_host_packages strategy  https://review.openstack.org/60027317:59
DirectorN00bIt's been a long day :-)18:00
DirectorN00bAlso, no blame cultre here, I'm just trying to learn :-)18:00
* DirectorN00b points finger and scowles18:01
DirectorN00bIt fails on a few things. Not sure of it's what I have configured(/not configured) or whether it's a bug. Trial and error now.18:01
weshay_ruckmwhahaha, EvilienM seems like queens is installing ansible 2.6.418:02
weshay_ruckand master is installing 2.5.418:02
weshay_rucksweet18:02
EvilienMDirectorN00b: no worries :-)18:02
weshay_rucksorry.. 2.5.218:02
*** TheJulia is now known as needssleep18:02
*** raildo has quit IRC18:06
*** raildo has joined #tripleo18:06
*** jcoufal has quit IRC18:08
*** ooolpbot has joined #tripleo18:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION18:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537418:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256018:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179500918:10
*** ooolpbot has quit IRC18:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)18:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,In progress] - Assigned to Jiří Stránský (jistr)18:10
openstackLaunchpad bug 1795009 in tripleo "the scenario001/004 multinode-oooq-container job is failing for Queens and Pike for overcloud deployment error "overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:" " [Critical,Triaged] - Assigned to Marios Andreou (marios-b)18:10
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Set proper setype for tempest service directories  https://review.openstack.org/60598018:11
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart master: Switch fs027 to deploy with podman  https://review.openstack.org/60051718:12
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart-extras master: Add podman support to validate-tempest role  https://review.openstack.org/60535618:12
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart master: Switch fs027 to deploy with podman  https://review.openstack.org/60051718:12
EvilienMchandankumar: fixed order ^18:12
chandankumarEvilienM: thanks!18:14
*** med_ has joined #tripleo18:15
*** chem has quit IRC18:18
*** chem has joined #tripleo18:18
openstackgerritwes hayutin proposed openstack/tripleo-upgrade master: remove ansible from triplo-upgrade requirements  https://review.openstack.org/60615618:19
*** thrash|f00dz is now known as thrash18:19
openstackgerritwes hayutin proposed openstack/tripleo-upgrade stable/rocky: remove ansible from triplo-upgrade requirements  https://review.openstack.org/60615718:19
openstackgerritwes hayutin proposed openstack/tripleo-upgrade stable/queens: remove ansible from triplo-upgrade requirements  https://review.openstack.org/60615818:19
openstackgerritwes hayutin proposed openstack/tripleo-upgrade stable/pike: remove ansible from triplo-upgrade requirements  https://review.openstack.org/60615918:20
*** trown|lunch is now known as trown18:37
EvilienMthrash: do you think we could get +A on https://review.openstack.org/#/c/605633/ today?18:38
thrashEvilienM: Let me try18:38
*** med_ has quit IRC19:00
* mwhahaha sighs19:00
mwhahahano on reads their email19:00
mwhahahaanyway did we figure out where the newer ansible is coming from yet?19:01
AJaegertripleo cores: https://review.openstack.org/606075 and https://review.openstack.org/606074 don't use ansible - could you review those to help with Zuul job setup, please?19:02
AJaeger(https://review.openstack.org/606079 and https://review.openstack.org/606073 are the same change for repos that need ansible in testing, so won't ask for +A now;)19:02
AJaegerthanks, mwhahaha19:04
mwhahahanp i'll try and get the others later19:04
AJaegerthanks19:04
openstackgerritMerged openstack/ansible-role-redhat-subscription master: Remove release-openstack-server  https://review.openstack.org/60607419:10
*** ooolpbot has joined #tripleo19:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION19:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537419:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256019:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)19:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179500919:10
*** ooolpbot has quit IRC19:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,In progress] - Assigned to Jiří Stránský (jistr)19:10
openstackLaunchpad bug 1795009 in tripleo "the scenario001/004 multinode-oooq-container job is failing for Queens and Pike for overcloud deployment error "overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:" " [Critical,Triaged] - Assigned to Marios Andreou (marios-b)19:10
openstackgerritMerged openstack/ansible-role-tripleo-cookiecutter master: Remove release-openstack-server  https://review.openstack.org/60607519:13
*** florianf is now known as florianf|afk19:14
*** Chaserjim has joined #tripleo19:16
*** chem has quit IRC19:19
*** chem has joined #tripleo19:19
*** artom has quit IRC19:22
openstackgerritAlex Schultz proposed openstack/tripleo-quickstart-extras master: Allow pinning of ara in undercloud-setup  https://review.openstack.org/60617919:23
mwhahahaweshay_ruck, marios, EvilienM -^ fix for scenario001/00419:24
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Add sample designate environment for ha  https://review.openstack.org/58402619:25
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Split designate envs  https://review.openstack.org/58453219:25
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Add /v2 suffix to Designate uris  https://review.openstack.org/58588219:25
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Set correct project name for designate-neutron integration  https://review.openstack.org/58590219:25
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Don't configure BIND to listen on localhost  https://review.openstack.org/60618019:25
thrashEvilienM: No cores available...19:26
thrashI'll send an email to d0ugal and apetrich about it.19:26
openstackgerritBen Nemec proposed openstack/tripleo-quickstart master: Run Designate tempest test in scenario003  https://review.openstack.org/57132119:26
openstackgerritAlex Schultz proposed openstack/tripleo-quickstart master: Pin older versions of ara for pike/queens  https://review.openstack.org/60618119:28
openstackgerritAlex Schultz proposed openstack/tripleo-quickstart-extras master: Unpin quickstart undercloud ara version  https://review.openstack.org/60618219:29
*** artom has joined #tripleo19:30
openstackgerritCarlos Goncalves proposed openstack/tripleo-common master: Add scenario010 to the check queue  https://review.openstack.org/58701519:34
openstackgerritCarlos Goncalves proposed openstack/tripleo-common master: Fix skip of octavia-undercloud Ansible role  https://review.openstack.org/59141319:35
weshay_ruckah19:42
* weshay_ruck looks19:42
weshay_ruckmwhahaha, why would that lead to diff versions across branches?19:42
weshay_ruckstill we need to pin that, but I don't think that's it19:43
mwhahahaweshay_ruck: because we have a sufficient version in rocky+19:43
mwhahaha0.16.1 requires ansible 2.4.519:43
mwhahahawe have 2.4.4 in queens/pike19:43
mwhahahain rocky+ it's already there19:43
mwhahahaweshay_ruck: it's likely because we just switched it on recently19:44
mwhahahaweshay_ruck: because i fyou check 2 days agao, ara was not installed http://logs.openstack.org/24/567224/109/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/8b476ee/logs/undercloud/var/log/extra/pip.txt.gz19:44
mwhahahaweshay_ruck: but it is installed now http://logs.openstack.org/24/567224/111/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/448effe/logs/undercloud/var/log/extra/pip.txt.gz19:45
openstackgerritCarlos Goncalves proposed openstack/tripleo-common master: Download CentOS-based amphora image if not present  https://review.openstack.org/59199719:45
weshay_ruckok dumb question19:46
weshay_ruckwhy does ara install ansible? https://github.com/openstack/ara/blob/master/requirements.txt19:46
weshay_ruckI see19:46
weshay_ruckha19:46
mwhahahahttps://github.com/openstack/ara/blob/master/requirements.txt#L419:46
weshay_rucklooking right at it19:46
weshay_ruckdammit19:46
weshay_ruckthat is like a circular dependency19:47
mwhahahaweshay_ruck: so can you add a line item to get ara pacakged in rdo19:47
mwhahahadmsimard pasted the rpm specs in #rdo19:47
mwhahahacause we shouldn't be system pip installing anything19:47
mwhahahabecause as we all know, it breaks things19:47
*** dprince has quit IRC19:47
weshay_ruckaye19:48
* mwhahaha looks at everyone who touched that ansible role to install pip/setuptools/ara19:48
weshay_ruckmwhahaha, thanks19:50
weshay_ruckthat landed 4 months ago .. dang19:51
mwhahahabut we didn't turn it on until recently19:52
weshay_ruckhrm.. I don't think that's right19:52
mwhahahamaybe it wasn't working until recently?19:52
weshay_ruckwhat do you mean turn it on.. ara has been capturing the undercloud tasks for a while19:53
weshay_ruckactually I think the overcloud work broke the undercloud ara work19:53
weshay_rucktbh19:53
mwhahahaoh maybe it's because 0.16.0 was just published19:53
mwhahahawhen was that19:53
mwhahahacause 0.15.0 was fine, but that wouldn't explain http://logs.openstack.org/24/567224/109/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/8b476ee/logs/undercloud/var/log/extra/pip.txt.gz19:53
weshay_ruckmwhahaha, well you can't get too mad19:54
mwhahahahttp://logs.openstack.org/24/567224/109/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/8b476ee/job-output.txt.gz#_2018-09-26_23_00_55_08391219:54
mwhahahait was always failing19:54
mwhahahaignore_errors: true19:54
* mwhahaha sighs19:54
weshay_ruckit was working at one point19:54
weshay_ruckyou +219:55
weshay_ruck:)19:55
* weshay_ruck looks for rpm reviews19:55
weshay_ruckthere was duress re: timeouts19:56
weshay_ruckas usual19:56
dpeacockRight - I'm bailing - need to get my folks to the airport.  Have a good weekend folks. :-)20:02
openstackgerritAlex Schultz proposed openstack/tripleo-heat-templates stable/queens: DNM: ci test  https://review.openstack.org/60619120:04
mwhahahaweshay_ruck: -^ that should test the whole thing, we'll see20:04
fultonjEvilienM: earlier we were talking about how you preconfigured your IP on your edge cloud node20:04
weshay_ruckk20:05
weshay_ruckthanks mwhahaha20:05
fultonji'm hitting http://paste.openstack.org/show/731116 because my role IP map is empty20:05
fultonjhttp://paste.openstack.org/show/731120/20:05
DirectorN00bNotice: heira(): cannot load backend module_data: cannot load such file -- heira/backend/module_data_backend    <--- anyone know what this is alluding to?20:06
fultonjbut tripleo seems to know about it as my HostsEntry was correctly populated with the IP e exported20:06
fultonjhttp://paste.openstack.org/show/731121/20:07
fultonjs/tripleo/my second heat stack20:07
*** openstackgerrit has quit IRC20:07
DirectorN00bslagle: I have redeployed the overcloud, and then container save'd the config it's using, but the template does not have the modifications I made :-(20:08
fultonjslagle: ^ ?20:08
*** Chaserjim has quit IRC20:10
*** ooolpbot has joined #tripleo20:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION20:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537420:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256020:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)20:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179500920:10
*** ooolpbot has quit IRC20:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,In progress] - Assigned to Jiří Stránský (jistr)20:10
DirectorN00bFrom our discussion before (perhaps I misunderstood?) But I changed the heat-base.yaml template at /usr/share/opentack-triple-heat-templates, re-overcloud deployed20:10
openstackLaunchpad bug 1795009 in tripleo "the scenario001/004 multinode-oooq-container job is failing for Queens and Pike for overcloud deployment error "overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:" " [Critical,In progress] - Assigned to Alex Schultz (alex-schultz)20:10
*** psachin has quit IRC20:10
DirectorN00bI then saed out, as I thought you said after the deployment (or the --update-plan-only) it will modify the plan?20:10
DirectorN00bs/saed/saved20:11
DirectorN00bNow checking the templates in the rendered save output, I see the code still there that I commented out :-/20:11
*** tzumainn has quit IRC20:12
DirectorN00bAlso, seems --update-plan-only isn't a valid argument :-(20:13
mwhahahaDirectorN00b: what version? --update-plan-only on tripleo deployhas been a thing for like 2 years20:16
mwhahahahttps://github.com/openstack/python-tripleoclient/blame/master/tripleoclient/v1/overcloud_deploy.py#L65720:16
*** mjturek has quit IRC20:18
*** openstackgerrit has joined #tripleo20:20
openstackgerritwes hayutin proposed openstack/tripleo-quickstart master: break out release config by distro type  https://review.openstack.org/60238720:20
DirectorN00bMaybe i#m not using it correctly? Or maybe because this is a redhat director version it's different? "opentack overcloud deploy mycloud --update-plan-only" ?20:21
*** rlandy is now known as rlandy|brb20:27
DirectorN00bLooks like i'm using version 7.6.x or so.20:29
bandinimwhahaha: throw some cluebones my way on https://bugs.launchpad.net/tripleo/+bug/1795027 (see #4 and #5)?20:36
openstackLaunchpad bug 1795027 in tripleo "redis is installed by default in the containerized undercloud" [Medium,Triaged]20:36
*** agopi is now known as agopi|brb20:37
mwhahahabandini: you figured it out?20:37
mwhahahabandini: zaqar environment enables redis but we configure zaqar with swift20:37
bandinimwhahaha: yeah I know why it happens, not sure how to best fix it20:37
bandiniexactly20:37
mwhahahaoh, well20:37
mwhahahayea20:37
bandiniI could add an environments/service/zaqar-noredis.yaml and use that?20:38
bandiniseems a bit ugly though?20:38
mwhahahain the past we had an undercloud-<service>.yaml which is silly20:38
mwhahahabandini: go patch it out in undercloud_paramers.yaml20:38
mwhahahabandini: because that comes from python-tripleoclient20:38
* bandini looks20:39
mwhahahahrm maybe not20:39
mwhahahathat just seems to be parameters_defaults20:39
mwhahahabandini: do we have a disable-redis.yaml somewhere?20:40
mwhahahawe could add that to the end of the deploy command20:40
mwhahahathat would probably be a more explicit thing easier to follow thing20:40
bandinilet me see, I don't think we have it20:40
*** agopi|brb has quit IRC20:41
bandininope20:41
bandinimwhahaha: I am starting to feel that 'environments/services/undercloud-zaqar.yaml' is almost the less horrible option?20:42
mwhahahai really wanted to get rid of those undercloud-* ones20:42
bandiniI see20:43
mwhahahamaybe we just pull the redis out of zaqar.yaml and into a redis.yaml20:43
mwhahahaalternatively zaqar-swift-backend.yaml that doesn't have redis enabled20:43
* mwhahaha shakes his fist at redis and zaqar20:44
bandiniI think the latter is preferable as it does not break existing file users20:44
bandinilol20:44
* bandini tries20:44
mwhahahaso THT/environments/services/zaqar-swift-backend.yaml that enables zaqar but disables redis and then we just change out the file in tripleoclient20:45
bandiniright20:45
mwhahahaat least it doesn't have the undercloud- in the name :D20:45
bandini:)20:45
bandiniCLEAR WIN20:45
*** rlandy|brb is now known as rlandy20:49
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: turn off named prior to validation for scen003  https://review.openstack.org/60619820:52
openstackgerritMichele Baldessari proposed openstack/tripleo-heat-templates master: Add a zaqar-swift-backend environment file  https://review.openstack.org/60620020:53
openstackgerritMichele Baldessari proposed openstack/python-tripleoclient master: Zaqar on the containerized undercloud should not use Redis  https://review.openstack.org/60620120:53
weshay_ruckmwhahaha, ok.. so https://review.openstack.org/#/c/606180/ > https://review.openstack.org/60619820:53
weshay_ruck?20:53
*** artom has quit IRC20:54
mwhahahaweshay_ruck: named is used by designate20:54
mwhahahaweshay_ruck: so yea use ben's patch20:54
weshay_ruckk.. I briefly looked at the tempest tests.. wasn't sure if that was validated20:54
mwhahahait's not actually blocking anythign but is showing up in the openstack health reports (elastic search is triggering)20:55
weshay_ruckya.. just trying get the gate to be healthier20:56
DirectorN00bBah, I need to see what the version of this are. REdhat drives me insane.20:56
DirectorN00bSubscriptions and stuff, and blah blah.20:56
DirectorN00bThanks for input today. I will be back asking more dumb questions soon enough :-)20:57
*** panda has quit IRC20:57
*** panda has joined #tripleo20:58
*** shardy has quit IRC21:00
*** mmethot has quit IRC21:00
*** mmethot has joined #tripleo21:00
openstackgerritDavid Vallee Delisle proposed openstack/tripleo-heat-templates master: Validate that a detected ceph-disk is member of a cluster before considering that we need ceph-osd package  https://review.openstack.org/60610521:02
*** raildo has quit IRC21:04
*** mmethot has quit IRC21:05
*** Vorrtex has quit IRC21:07
*** ooolpbot has joined #tripleo21:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION21:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537421:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256021:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179500921:10
*** ooolpbot has quit IRC21:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)21:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,In progress] - Assigned to Jiří Stránský (jistr)21:10
openstackLaunchpad bug 1795009 in tripleo "the scenario001/004 multinode-oooq-container job is failing for Queens and Pike for overcloud deployment error "overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:" " [Critical,In progress] - Assigned to Alex Schultz (alex-schultz)21:10
*** rfolco has quit IRC21:18
openstackgerritMerged openstack/os-net-config stable/rocky: Restart ivs/nvfswitch after config file is updated  https://review.openstack.org/60566821:22
*** dtrainor_ has joined #tripleo21:22
*** dsneddon has quit IRC21:22
*** dsneddon has joined #tripleo21:23
openstackgerritAlex Schultz proposed openstack/tripleo-quickstart-extras master: Unpin quickstart undercloud ara version  https://review.openstack.org/60618221:24
*** dtrainor has quit IRC21:24
*** agopi|brb has joined #tripleo21:28
*** slaweq has quit IRC21:28
* mwhahaha flips tables over failure in the gate again21:41
mwhahahaargh it was my own fault for restoring changes too21:42
*** vkapalav has quit IRC21:42
*** ooolpbot has joined #tripleo22:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION22:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537422:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256022:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179500922:10
*** ooolpbot has quit IRC22:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)22:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,In progress] - Assigned to Jiří Stránský (jistr)22:10
openstackLaunchpad bug 1795009 in tripleo "the scenario001/004 multinode-oooq-container job is failing for Queens and Pike for overcloud deployment error "overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:" " [Critical,In progress] - Assigned to Alex Schultz (alex-schultz)22:10
*** mcornea has quit IRC22:12
*** artom has joined #tripleo22:13
*** EvilienM is now known as EmilienM22:20
*** toure is now known as toure|gone22:24
*** panda is now known as panda|off22:26
*** rlandy has quit IRC22:27
*** boazel has quit IRC22:29
*** tosky has quit IRC22:42
*** ooolpbot has joined #tripleo23:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION23:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/171537423:10
openstackLaunchpad bug 1715374 in tripleo "Reloading compute with SIGHUP prenvents instances to boot" [Critical,In progress] - Assigned to Bogdan Dobrelya (bogdando)23:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179256023:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/179500923:10
*** ooolpbot has quit IRC23:10
openstackLaunchpad bug 1792560 in tripleo "Upgrades in CI still using Q->master instead of R->master release" [Critical,In progress] - Assigned to Jiří Stránský (jistr)23:10
openstackLaunchpad bug 1795009 in tripleo "the scenario001/004 multinode-oooq-container job is failing for Queens and Pike for overcloud deployment error "overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:" " [Critical,In progress] - Assigned to Alex Schultz (alex-schultz)23:10
*** dtrainor__ has joined #tripleo23:17
*** dtrainor_ has quit IRC23:20
openstackgerritAlex Schultz proposed openstack/tripleo-heat-templates stable/queens: DNM: ci test  https://review.openstack.org/60619123:36
openstackgerritAlex Schultz proposed openstack/tripleo-quickstart master: Pin older versions of ara for pike/queens  https://review.openstack.org/60618123:40
openstackgerritAlex Schultz proposed openstack/tripleo-heat-templates stable/queens: DNM: ci test  https://review.openstack.org/60619123:42

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!