Saturday, 2017-01-21

sbezverk	sdake: done with cleanup	00:00
sbezverk	please address my comment here: https://review.openstack.org/#/c/423555/	00:00
sdake	working on it	00:00
* sdake n ot superman ;)		00:00
sdake	gerrit is sluggish at present - not accepting my change via git review	00:04
openstackgerrit	Steven Dake proposed openstack/kolla-kubernetes: Use environment variables to specify versions https://review.openstack.org/423555	00:04
openstackgerrit	Steven Dake proposed openstack/kolla-kubernetes: Use environment variables to specify versions https://review.openstack.org/423555	00:05
sdake	the reason gerrit is sluggish at present is daily backups at 00:00	00:06
sdake	sbezverk can you revisit https://review.openstack.org/#/c/423046/	00:07
sdake	kfox1111 - need all 5 of those patches reviewed now that i've made it through the sbezverk documentation guantlet :)	00:07
sbezverk	sdake: done	00:08
sdake	sbezverk thx	00:08
sdake	sbezverk gerrit is showing me 423046 as -1	00:08
sdake	wonder if that is thedaily backups or if you retained the -1 vote on it	00:08
sdake	oh there it goes	00:08
sdake	wow dailybackups make gerrit pokey	00:09
portdirect	sdake: not sure about mysql test chart	00:10
sdake	portdirect not sure abou twhat?	00:10
portdirect	it requires pvc support, so actually should really fail	00:10
sdake	portdirect - you added it in there not me :)	00:10
portdirect	lol	00:11
sdake	is pvc support not implemented in the dev env?	00:11
sdake	i thought that was fixed up	00:11
portdirect	not untill you install kolla...	00:11
portdirect	so its not there at that point	00:11
portdirect	we add it	00:11
sdake	ok - well i can address that in yet another followon patch	00:11
sdake	what would you recommend installing that doesn't require pvc?	00:11
sdake	fwiw, I ran the stable/mysql chart and it worked for me	00:12
sdake	although I didn't test mysql, just that helm placed it in the deploy state	00:12
portdirect	yeah - helm deploys it fine	00:12
portdirect	just the pods fail	00:12
portdirect	anyway +2ed	00:12
sdake	ok - well we shortly remove it after deploying it	00:12
sdake	thanks :)	00:12
sdake	I can change it to somehting else if someone has suggestions	00:12
portdirect	memcached	00:14
portdirect	tiny image too :)	00:14
sdake	portdirect cool	00:16
sdake	portdirect what was your fix for vagrant	00:16
sdake	I did a vagrant destroy -and that cratered tiller	00:16
*** sayantani01 has joined #openstack-kolla		00:16
sdake	portdirect and you were typing quickly so I missed the uri you used for tiller and what precisely ou did to workaround the problem	00:16
portdirect	np	00:16
portdirect	2 mins	00:16
sdake	I thought you did a docker pull port/tiller:2.1.3	00:16
sdake	to load tiller in the cache	00:16
portdirect	i did but thats not needed	00:17
sdake	k - i'll be h ere :)	00:17
portdirect	1:) kubectl delete --namespace=kube-system deploy tiller-deploy	00:17
portdirect	2:) helm init --tiller-image docker.io/port/tiller:v2.1.3	00:18
portdirect	that image is exactly the same as the gcr.io one	00:19
portdirect	https://github.com/kubernetes/helm/issues/1838	00:20
portdirect	looks like it may be spreading :(	00:20
sdake	portdirect what is your github id?	00:21
portdirect	intlabs	00:22
sdake	portdirect i couldnt' credit you properly - intlabs wasn't foudn by github	00:24
sdake	but anywya i did dad your irc nick in there ;)	00:24
portdirect	lol	00:24
sdake	brb, going to actually verify that works	00:24
portdirect	https://github.com/intlabs	00:24
*** sdake has quit IRC		00:25
*** sdake has joined #openstack-kolla		00:26
sdake	portdirect duno what to tell ya -when i type @intlabs in github issue tracker, it didn't pop up your name in the list	00:26
portdirect	yeah - I'm actually pretty hard to track down. I tend to work via https://github.com/portdirect, which is an organisation in github - and that really confuses the UI	00:28
portdirect	kinda nice, cause its virtually impossible to track where I am. KInda rubbish for exactly the same reason - I don't get credited for about 70% of my commits.	00:29
openstackgerrit	Steven Dake proposed openstack/kolla-kubernetes: Change mysql to memcached in docs https://review.openstack.org/423563	00:29
sdake	[sdake@minime-03 halcyon-vagrant-kubernetes]$ helm ls	00:31
sdake	Error: could not find a ready tiller pod	00:31
sdake	any ideas?	00:31
sdake	oh its workign now	00:31
sdake	just a little slow to respond	00:31
sbezverk	sdake: helm is not initialized	00:31
sbezverk	helm init --client-only	00:31
sdake	sbezverk helm is indeed initialized	00:31
*** goldyfruit has joined #openstack-kolla		00:32
sdake	sbezverk portdirect 's workaround works	00:32
portdirect	sounds like the image was still pulling	00:32
sdake	right	00:32
openstackgerrit	Steven Dake proposed openstack/kolla-kubernetes: Break apart helm init and helm ls oeprations https://review.openstack.org/423566	00:37
*** caowei has joined #openstack-kolla		00:38
openstackgerrit	Steven Dake proposed openstack/kolla-kubernetes: Break apart helm init and helm ls oeprations https://review.openstack.org/423566	00:38
openstackgerrit	Steven Dake proposed openstack/kolla-kubernetes: Change mysql to memcached in docs https://review.openstack.org/423563	00:38
openstackgerrit	Steven Dake proposed openstack/kolla-kubernetes: Add a workaround for docker-engine 1.13.0 https://review.openstack.org/423568	00:46
sdake	portdirect sbezverk srwilkers_ kfox1111 can one of you fellas ack: https://review.openstack.org/#/c/423067/	00:47
sdake	actually two of youths	00:48
sdake	thanks jascott1 :)	00:49
*** adrian_otto has quit IRC		00:51
*** sdake has quit IRC		00:52
*** adrian_otto has joined #openstack-kolla		00:52
*** sdake_ has joined #openstack-kolla		00:52
openstackgerrit	Steven Dake proposed openstack/kolla-kubernetes: Add a workaround for docker-engine 1.13.0 https://review.openstack.org/423568	00:55
openstackgerrit	Steven Dake proposed openstack/kolla-kubernetes: Break apart helm init and helm ls operations https://review.openstack.org/423566	00:55
openstackgerrit	Merged openstack/kolla-kubernetes: Add a simple test to see if horizon is running https://review.openstack.org/422932	00:55
openstackgerrit	Marcus Williams proposed openstack/kolla-ansible: [WIP] Split Openvswitch into own role https://review.openstack.org/416793	00:57
sdake_	portdirect quick q about storage providers	01:00
sdake_	(for kubernetes)	01:00
portdirect	shoot	01:00
sdake_	portdirect if kolla-kubernetes had a desire to ue a different storage provider	01:00
sdake_	what would that take in terms of implementation	01:01
sdake_	ue/use	01:01
sdake_	for example iscsi or nfs	01:01
portdirect	or gce/ec2	01:01
sdake_	right	01:01
sdake_	there are a whole bunch	01:01
portdirect	2 mins	01:01
sdake_	sure	01:02
portdirect	you would need to somehow feed the appropraite values into these: https://github.com/openstack/kolla-kubernetes/blob/master/helm/kolla-common/templates/_common_pv.yaml	01:03
portdirect	https://github.com/openstack/kolla-kubernetes/blob/master/helm/kolla-common/templates/_common_pvc.yaml	01:03
portdirect	though i dont know how to do that	01:03
portdirect	as I've not managed to keep ontop of the globals/values work that kfox/sbezverk have been doing	01:04
portdirect	I dont think that they have been tested since the jinja2 days	01:04
sdake_	looks like there is some gce work in there: https://github.com/openstack/kolla-kubernetes/blob/master/helm/kolla-common/templates/_common_pv.yaml#L25	01:05
portdirect	yeah - though those are both straight conversions of the jinja2 templates	01:05
sdake_	right - so hasn't been tested lately :(	01:05
sdake_	i wonder if it was ever tested	01:06
portdirect	the prefered way to do it now is to provide a storage class	01:06
sdake_	how is that different than what kolla-kubernetes has n ow?	01:06
portdirect	as those maifests taget the method used in k8s 1.2	01:06
portdirect	https://github.com/openstack/kolla-kubernetes/blob/master/helm/kolla-common/templates/_common_pvc.yaml#L8	01:06
portdirect	if we just used that, then users could brign whatever storage they wanted	01:07
portdirect	but I'm not sure how hard it would be unwind kolla-k8s from the way it uses storage currently	01:07
sdake_	ok - so storage as used today is base dupon a kubernetes 1.2 construct	01:08
sdake_	and in 1.5, there is a better way to do the job	01:08
portdirect	yes	01:08
sdake_	are there docs on the better way to do the job?	01:08
sbezverk	portdirect: we cannot use storage class, as kube controller has an not yet resolved issue	01:09
portdirect	though this only beacme possible for bare metal with 1.5	01:09
portdirect	sbezverk: what issue?	01:09
sbezverk	otherwise we would need to lock on att's kube controller	01:09
sbezverk	missing ceph binaries	01:09
portdirect	we have kube-controlelr in kolla	01:09
portdirect	i added it just for this	01:09
portdirect	sdake_: for context	01:10
*** sdake_ is now known as sdake		01:10
sbezverk	the actual controller kubernetes-controller-manager is missing ceph binaries	01:10
portdirect	ceph is so uncommon in k8s, that google do not provide rbd binary in the upstream image	01:10
portdirect	you have to add your own	01:10
*** sacharya has quit IRC		01:10
sdake	what delivable is kube-controller in?	01:11
portdirect	sbezverk: we can use kolla image?	01:11
*** sacharya has joined #openstack-kolla		01:11
sdake	and what is kube-controller in a little more detail?	01:11
portdirect	sbezverk: https://github.com/openstack/kolla/tree/master/docker/kube	01:11
* sdake groans at more curls in the codebase		01:13
* portdirect thinks its the only way :(		01:14
sdake	portdirect there is always a better way :)	01:14
sdake	portdirect although a short term curl hack works for now	01:14
sdake	portdirect the idea then is you deploy this container in the baremetal os?	01:14
portdirect	what? I'd like to use it	01:14
sdake	portdirect package it into rpm and deb	01:14
portdirect	lol	01:14
sdake	portdirect yup - thats the better way	01:14
sdake	portdirect then you get signed binaries from a trusted source	01:15
sdake	vs random curls from who knows where	01:15
portdirect	yeah mostly id agree with you	01:15
sdake	anyway back on topic :)	01:15
portdirect	ok - so sbezverk is right	01:16
portdirect	in that with a fesh vainila install of k8s via kubeadm you cannot use ceph stoage class	01:16
portdirect	but you could if you installed from rpms	01:16
portdirect	aor any method that had the binarys for k8s running on the hosts, not in containers	01:17
sdake	is there any way to get the righ tcontianer we need into kubeadm?	01:17
portdirect	yes	01:17
sdake	tell me more plz :)	01:17
portdirect	we cannot get ceph rbd into those images upstream	01:17
sdake	i understan	01:17
sdake	can kubeadm be overridden as is?	01:18
portdirect	there are two ways of doing it	01:18
sdake	to use different images?	01:18
portdirect	china has this problem in a BIG way	01:18
portdirect	the recommened advice there - its to prepull images to hosts, and then tag them a gcr.io images	01:19
portdirect	^^ WOW :/	01:19
portdirect	so kubeadm is being extended to allow you to specify images	01:19
sdake	nice	01:19
sdake	when is that landing?	01:19
portdirect	like we did to unblock you with tiller	01:19
portdirect	in master now	01:19
sdake	nice	01:19
portdirect	will need to check	01:20
*** unicell has quit IRC		01:20
portdirect	but we will still have issue	01:20
portdirect	as the images kolla makes will not be tagged correctly	01:20
portdirect	2 sec	01:20
kfox1111	nice.	01:20
portdirect	and using as a bad example:	01:21
portdirect	gcr.io/google_containers/kubernetes-dashboard-amd64:v1.5.0	01:21
portdirect	this is the naming format that kubadm will expect	01:21
portdirect	'-amd64:v1.5.0' is key here	01:21
kfox1111	ah. yeah. :/	01:21
portdirect	we need those images in kolla to have a similar suffix	01:22
kfox1111	we should try and patch that in trunk asap?	01:22
portdirect	with platform arch:k8s version	01:22
portdirect	kfox1111: we should ideally byt Jeffrey4l was very unkeen on the idea	01:22
sdake	portdirect i can fix that	01:23
sdake	portdirect i also -1'ed that review	01:23
sdake	i recall it	01:23
portdirect	they made me remove the '-amd64' beofre accepting the ps	01:23
* portdirect crys uncontrollably again		01:23
sdake	portdirect you didn't really provide a compelling reason as to why -amd64 was needed :)	01:23
portdirect	I rememeber sdake, i rememeber	01:24
sdake	portdirect iirc the review said "this is what kubernetes does"	01:24
kfox1111	sdake: yeah, but I did, but was shot down. ;)	01:24
portdirect	lets move forawrd though	01:24
kfox1111	+1	01:24
kfox1111	past is the past.	01:24
portdirect	that only gets us half way	01:24
portdirect	as we still need to sort the actuall tag at the end	01:25
portdirect	kolla-build would need to be mofified to do that	01:25
sdake	portdirect ther eis a --tag override	01:25
portdirect	or we would need docs covering it	01:25
sdake	docs would work	01:25
portdirect	sdake: docs the :)	01:25
portdirect	*then	01:25
sdake	plus the hub can provide the images	01:25
sdake	its too late to make massive changes to kolla-build.py at this time	01:25
sdake	have to wait for pike to open up	01:26
sdake	i'll cover the docs, if someone else can ocver the deliverable rename of the packages	01:26
sdake	the kolla deliverable that is	01:26
portdirect	are those images gonna be in ocata?	01:26
sdake	whatever is in master now is going to ocata yes	01:26
portdirect	cool	01:26
portdirect	they are on hub atm	01:26
sdake	and ocata is cut in about 10 days	01:26
portdirect	tagged master	01:26
sdake	portdirect can you submit a review to change the image names to what kubeadm expects	01:27
sdake	and link this chat log here:	01:27
sdake	http://eavesdrop.openstack.org/irclogs/%23openstack-kolla/%23openstack-kolla.2017-01-21.log.html#t2017-01-21T01:00:37	01:28
sdake	in the commit msg	01:28
sdake	portdirect if I submit the change, I can't +2 it or argue really with anyone about why it needs to be done :)	01:28
kfox1111	sdake: I have a ps up for revew to add two more packages. think it can still make it in for ocata?	01:29
*** sdake has quit IRC		01:29
portdirect	roger 30 mins - just finishing off something else	01:29
*** sdake has joined #openstack-kolla		01:29
kfox1111	sdake: https://review.openstack.org/#/c/422950/	01:29
sdake	portdirect i got dced	01:30
portdirect	roger 30 mins - just finishing off something else	01:30
sdake	portdirect my last msg was 18:27:48] <sdake>portdirect can you submit a review to change the image names to what kubeadm expects	01:30
sdake	[18:27:56] <sdake>and link this chat log here:	01:30
sdake	[18:28:10] <sdake>http://eavesdrop.openstack.org/irclogs/%23openstack-kolla/%23openstack-kolla.2017-01-21.log.html#t2017-01-21T01:00:37	01:30
sdake	[18:28:24] <sdake>in the commit msg	01:30
sdake	[18:28:44] <sdake>portdirect if I submit the change, I can't +2 it or argue really with anyone about why it needs to be done :)	01:30
sdake	portdirect cool thanks :)	01:30
kfox1111	20:29 < kfox1111> sdake: I have a ps up for revew to add two more packages. think it can still make it in for ocata?	01:30
sdake	kfox1111 you mean two more containers?	01:30
*** jascott1_ has quit IRC		01:30
sdake	kfox1111 I think so	01:30
kfox1111	sdake: yeah.	01:31
sdake	kfox1111 although the gate is imploding with your change	01:31
kfox1111	I think its just failing due to docker 1.13 issue?	01:31
kfox1111	it was submitted about that time the other day.	01:31
kfox1111	I just kicked off a recheck.	01:31
kfox1111	I've never submitted a container before, so I took a wild guess.	01:32
sdake	you need a recheck as that patch has docker-engine 1.13.0	01:32
kfox1111	please let me know if I did somethign wrong. :)	01:32
sdake	it looks ok to me	01:32
kfox1111	oh...	01:32
kfox1111	do I need a rebase?	01:32
sdake	http://logs.openstack.org/50/422950/1/check/gate-kolla-dsvm-build-centos-binary-centos-7-nv/e89613a/console.html#_2017-01-20_01_15_30_788162	01:32
sdake	kfox1111 moment	01:32
portdirect	kfox1111: you have logs of gate expolosion	01:33
sdake	kfox1111 your recheck will fix it up i think	01:33
portdirect	^^ that it?	01:33
sdake	portdirect yup thats the log	01:33
openstackgerrit	Kevin Fox proposed openstack/kolla: Add ceph fs containers https://review.openstack.org/422950	01:33
kfox1111	well, I just rebased just in case.	01:33
sdake	kfox1111 a recheck will do it - no need to rebase ;)	01:33
sdake	kfox1111 recheck checks against latest master	01:33
kfox1111	if the docker 1.13 fix came after that ps, it would probablly need a rebase.	01:33
sdake	no that is not the case	01:34
kfox1111	really? I'ave always had to rebase for kolla-kubernetes I think.	01:34
openstackgerrit	Jeffrey Zhang proposed openstack/kolla-ansible: Load murano dashboard dynamic https://review.openstack.org/402156	01:34
sdake	i asked this same question of infra over a year ago and they corredcted me	01:34
*** adrian_otto has quit IRC		01:34
sdake	feel free to ask anyone in openstack-infra :)	01:34
kfox1111	hmm... does the 'recheck' do an automatic rebase then?	01:34
sdake	SamYaple and I both individually chekced their statements as well	01:34
kfox1111	weird. ok.	01:35
sdake	and they were correct	01:35
sdake	ya its not intuitive	01:35
*** v1k0d3n has joined #openstack-kolla		01:35
sdake	a recheck will do it no need to rebase	01:35
sdake	Jeffrey4l__ was going that too as of a month or so ago	01:35
sdake	Jeffrey4l__ and I think he has stopped now that I explained i to to him	01:36
sdake	Jeffrey4l__ might check with him - he has alot of code output and runs into those typeso fproblems often	01:36
*** v1k0d3n has quit IRC		01:36
sdake	the only time a rebase is needed is if there is a merge conflict or gerrit is "stuck"	01:36
Jeffrey4l__	hrm what are u talking?	01:36
kfox1111	gotta head out. l8r all.	01:36
*** Jeffrey4l__ is now known as Jeffrey4l		01:36
openstackgerrit	Kevin Fox proposed openstack/kolla-kubernetes: WIP: test rebooting in the gate https://review.openstack.org/413193	01:37
sdake	later kfox1111	01:38
*** unicell has joined #openstack-kolla		01:42
*** salv-orlando has joined #openstack-kolla		01:43
*** unicell has quit IRC		01:45
*** zhangkun_ has joined #openstack-kolla		01:47
*** salv-orlando has quit IRC		01:47
openstackgerrit	Marcus Williams proposed openstack/kolla-ansible: [WIP] Split Openvswitch into own role https://review.openstack.org/416793	01:49
*** hfu has joined #openstack-kolla		02:07
*** hfu has quit IRC		02:08
*** vcn has joined #openstack-kolla		02:09
*** sacharya has quit IRC		02:21
*** sacharya has joined #openstack-kolla		02:22
*** hfu has joined #openstack-kolla		02:24
openstackgerrit	sayantani proposed openstack/kolla: Update Bifrost documentation https://review.openstack.org/417114	02:26
openstackgerrit	sayantani proposed openstack/kolla: [WIP]Update Bifrost documentation https://review.openstack.org/417114	02:27
*** pramodrj07 has quit IRC		02:33
*** MasterOfBugs has quit IRC		02:34
openstackgerrit	Jeffrey Zhang proposed openstack/kolla: Fix the gate failure https://review.openstack.org/423574	02:39
openstackgerrit	Wei Cao proposed openstack/kolla: Install karbor dashboard into horizon image https://review.openstack.org/423575	02:42
*** tonanhngo has joined #openstack-kolla		02:46
*** fguillot has quit IRC		02:46
*** pramodrj07 has joined #openstack-kolla		02:52
*** MasterOfBugs has joined #openstack-kolla		02:52
openstackgerrit	Wei Cao proposed openstack/kolla-ansible: Load karbor dashboard https://review.openstack.org/423576	02:52
*** aj_power has joined #openstack-kolla		03:08
*** rhallisey has quit IRC		03:30
*** zhubingbing has joined #openstack-kolla		03:32
*** vcn has quit IRC		03:55
*** adrian_otto has joined #openstack-kolla		04:07
*** kristian__ has joined #openstack-kolla		04:10
*** adrian_otto has quit IRC		04:14
*** kristian__ has quit IRC		04:15
openstackgerrit	Marcus Williams proposed openstack/kolla-ansible: [WIP] Split Openvswitch into own role https://review.openstack.org/416793	04:16
*** sdake_ has joined #openstack-kolla		04:22
*** sdake has quit IRC		04:22
sdake_	Jeffrey4l are you about	04:28
sdake_	Jeffrey4l earlier we were talking about how you don't need to rebase a patch prior to running recheck	04:30
sdake_	Jeffrey4l and also that neeed to rename some of the kolla images related to kubernetes to amd64 (to be loaded properly by kubeadm)	04:31
*** masuberu has joined #openstack-kolla		04:40
* sdake_ -> bed		04:41
sdake_	Jeffrey4l if you come about - could you ack this doc change: https://review.openstack.org/#/c/423067/	04:43
*** adrian_otto has joined #openstack-kolla		04:53
*** sdake_ has quit IRC		05:01
*** caowei has quit IRC		05:02
*** esmiurium has quit IRC		05:02
*** adrian_otto has quit IRC		05:18
*** aj_power has quit IRC		05:23
*** goldyfruit has quit IRC		05:38
*** tonanhngo_ has joined #openstack-kolla		05:41
*** tonanhngo has quit IRC		05:43
*** unicell has joined #openstack-kolla		05:44
*** sdake has joined #openstack-kolla		05:53
*** hfu has quit IRC		05:59
*** kristian__ has joined #openstack-kolla		06:00
*** kristian__ has quit IRC		06:01
*** kristian__ has joined #openstack-kolla		06:01
openstackgerrit	zhubingbing proposed openstack/kolla: Add vitrage images https://review.openstack.org/417593	06:01
*** zhubingbing has joined #openstack-kolla		06:01
zhubingbing	hello guys	06:01
*** kristian__ has quit IRC		06:01
*** hfu has joined #openstack-kolla		06:02
*** hfu has quit IRC		06:02
*** hfu has joined #openstack-kolla		06:03
*** tonanhngo_ has quit IRC		06:05
*** tonanhngo has joined #openstack-kolla		06:05
*** hfu has quit IRC		06:08
openstackgerrit	zhubingbing proposed openstack/kolla-ansible: Remove precheck.yml in baremetal https://review.openstack.org/423585	06:08
*** lamt has joined #openstack-kolla		06:09
*** caowei has joined #openstack-kolla		06:19
*** sdake_ has joined #openstack-kolla		06:21
*** sdake has quit IRC		06:24
openstackgerrit	caoyuan proposed openstack/kolla-ansible: Optimize reconfiguration for senlin https://review.openstack.org/423336	06:26
openstackgerrit	zhubingbing proposed openstack/kolla-ansible: Add bool enable_elasticsearch in searchlight.conf.j2 https://review.openstack.org/423586	06:30
*** zhubingbing has quit IRC		06:40
*** tonanhngo has joined #openstack-kolla		06:42
*** tonanhngo has quit IRC		06:47
*** sacharya has quit IRC		06:56
*** sacharya has joined #openstack-kolla		06:57
openstackgerrit	caoyuan proposed openstack/kolla-ansible: Optimize reconfiguration for searchlight https://review.openstack.org/423590	06:59
*** kristian__ has joined #openstack-kolla		07:00
*** unicell has quit IRC		07:05
*** sayantani01 has quit IRC		07:05
*** kristian__ has quit IRC		07:05
*** kristian__ has joined #openstack-kolla		07:26
*** zhurong has joined #openstack-kolla		07:27
*** kristian__ has quit IRC		07:30
*** bmace_ has quit IRC		07:46
*** bmace_ has joined #openstack-kolla		07:47
openstackgerrit	caoyuan proposed openstack/kolla-ansible: Optimize reconfiguration for searchlight https://review.openstack.org/423590	07:48
*** salv-orlando has joined #openstack-kolla		07:48
*** pramodrj07 has quit IRC		07:58
*** MasterOfBugs has quit IRC		08:00
*** zhurong has quit IRC		08:09
*** sdake_ has quit IRC		08:19
*** prameswar has joined #openstack-kolla		08:22
openstackgerrit	caoyuan proposed openstack/kolla-ansible: Optimize reconfiguration for senlin https://review.openstack.org/423336	08:27
openstackgerrit	Jeffrey Zhang proposed openstack/kolla-ansible: Add fluentd role https://review.openstack.org/407392	08:33
*** prameswar has quit IRC		08:39
*** kristian__ has joined #openstack-kolla		08:55
*** msimonin has joined #openstack-kolla		08:58
openstackgerrit	Jeffrey Zhang proposed openstack/kolla: Install all python packages in global-requirements.txt file https://review.openstack.org/394150	08:59
*** salv-orlando has quit IRC		09:12
openstackgerrit	jianyi proposed openstack/kolla-ansible: Add zun ansible role https://review.openstack.org/416179	09:17
*** tonanhngo has joined #openstack-kolla		09:19
*** tonanhngo has quit IRC		09:23
openstackgerrit	caoyuan proposed openstack/kolla-ansible: Optimize reconfiguration for ceilometer https://review.openstack.org/423606	09:23
*** kristian__ has quit IRC		09:25
openstackgerrit	Christian Berendt proposed openstack/kolla-ansible: Rename task in destroy role that removes images https://review.openstack.org/423609	09:34
openstackgerrit	Merged openstack/kolla: Fix ironic inspector https://review.openstack.org/380774	10:05
*** harish23r has joined #openstack-kolla		10:08
*** vcn has joined #openstack-kolla		10:12
*** harish23r has quit IRC		10:14
*** vcn has quit IRC		10:24
*** Masber_080 has joined #openstack-kolla		10:36
*** zhangkun_ has quit IRC		10:37
*** masuberu has quit IRC		10:38
openstackgerrit	jianyi proposed openstack/kolla-ansible: Add zun ansible role https://review.openstack.org/416179	10:41
*** salv-orlando has joined #openstack-kolla		10:43
*** salv-orlando has quit IRC		10:47
*** harish23r has joined #openstack-kolla		10:51
*** Syffs has joined #openstack-kolla		11:09
Syffs	hello, I have kolla deployed as an all-in-one install, but on my host machine: I have this in loop in my /var/log/messages	11:10
Syffs	Input 'mariadb_logstreamer_input' error: open /var/log/kolla/mariadb/mariadb.log: permission denied#015	11:10
Syffs	Input 'openstack_logstreamer_input' error: open /var/log/kolla/neutron/dnsmasq.log: permission denied#015	11:11
*** hfu has joined #openstack-kolla		11:22
*** prameswar has joined #openstack-kolla		11:26
*** sp___ has joined #openstack-kolla		11:27
*** caowei has quit IRC		11:31
*** salv-orlando has joined #openstack-kolla		11:44
*** salv-orlando has quit IRC		11:48
openstackgerrit	Christian Berendt proposed openstack/kolla-ansible: Sync tools/cleanup-images with kolla repository https://review.openstack.org/423618	11:50
*** hfu has quit IRC		11:51
sp___	Jeffrey4l: ping...	11:55
*** tonanhngo has joined #openstack-kolla		12:00
*** Masber_080 has quit IRC		12:04
*** tonanhngo has quit IRC		12:05
harish23r	hey guys. need some help here in Kolla deployment.	12:06
harish23r	I'm trying to deploy a single node kolla setup. using the Mitaka release	12:07
harish23r	I get the following error while "using kolla-deploy"	12:07
harish23r	ameswar (~prameswar@60.243.144.5) has joined	12:07
harish23r	* sp___ (~confisury@103.209.223.125) has joined	12:07
harish23r	* caowei has quit (Ping timeout: 245 seconds)	12:07
harish23r	* salv-orlando (~salv_orla@5.87.196.101) has joined	12:07
harish23r	* salv-orlando has quit (Ping timeout: 248 seconds)	12:07
harish23r	<openstackgerrit> Christian Berendt proposed openstack/kolla-ansible: Sync tools/cleanup-images with kolla repository https://review.openstack.org/423618	12:07
harish23r	* hfu has quit (Quit: My MacBook has gone to sleep. ZZZzzz…)	12:07
harish23r	<sp___> Jeffrey4l: ping...	12:07
harish23r	* tonanhngo (~tonanhngo@ip-64-134-216-234.public.wayport.net) has joined	12:07
harish23r	sry.	12:07
harish23r	This error:	12:07
harish23r	TASK: [common \| starting heka container] *************************************	12:07
harish23r	failed: [localhost] => {"changed": true, "failed": true}	12:07
harish23r	msg: APIError(HTTPError(u'500 Server Error: Internal Server Error for url: http+docker://localunixsocket/v1.25/conteiners/create?name=heka',),)	12:07
harish23r	FATAL: all hosts have failed -- aborting	12:07
harish23r	localhost: 0k=28 changed=0 unreachable=0 failed=1	12:07
*** prameswar has quit IRC		12:08
Jeffrey4l	sup sp___	12:09
sp___	Jeffrey4l: i have improved the coverage by upto 80% for kolla BP	12:10
sp___	https://review.openstack.org/#/c/422444/	12:10
sp___	Jeffrey4l: please review this	12:10
Jeffrey4l	cool.	12:10
Jeffrey4l	ok. will review it.	12:10
sp___	Jeffrey4l: thanks	12:10
harish23r	can anyone help me out!	12:19
harish23r	does it deal with a docker error? or I should configure a setting?	12:20
*** h16mara_ has quit IRC		12:23
*** saneax-_-\|AFK is now known as saneax		12:25
*** msimonin has quit IRC		12:32
openstackgerrit	Christian Berendt proposed openstack/kolla-ansible: Set docker_restart_policy_retry to 0 by default https://review.openstack.org/423623	12:32
*** salv-orlando has joined #openstack-kolla		12:45
*** salv-orlando has quit IRC		12:49
*** Masber_080 has joined #openstack-kolla		12:49
*** dave-mccowan has joined #openstack-kolla		13:06
*** harishr23 has joined #openstack-kolla		13:23
*** harish_23r has joined #openstack-kolla		13:25
*** harish23r has quit IRC		13:26
*** harishr23 has quit IRC		13:28
*** esmiurium has joined #openstack-kolla		13:29
*** harish_23r has quit IRC		13:29
*** dave-mccowan has quit IRC		13:30
*** dave-mccowan has joined #openstack-kolla		13:31
*** saneax is now known as saneax-_-\|AFK		13:33
*** sdake has joined #openstack-kolla		13:45
*** salv-orlando has joined #openstack-kolla		13:46
openstackgerrit	caoyuan proposed openstack/kolla-ansible: Optimize reconfigure action for cloudkitty https://review.openstack.org/422516	13:49
*** salv-orlando has quit IRC		13:49
sdake	morning	13:52
sdake	Jeffrey4l you about?	13:52
Jeffrey4l	sdake, yep.	13:52
sdake	would you mind acking a doc patch	13:52
sdake	in kolla-ubernetes	13:52
Jeffrey4l	np. link?	13:52
sdake	https://review.openstack.org/#/c/423067/	13:53
sdake	a core reviewer has already been first to review	13:53
sdake	that is familiar with kolla-kubernetes	13:54
sdake	i try not to sk you to review code in detail because you probably don't know how it works	13:54
sdake	although docs are sort of like code, this is simpel enough	13:54
Jeffrey4l	hrm. kolla-k8s better run on physical machine? but we run gate in vms ..	13:56
sdake	Jeffrey4l the dev env runs multipel vms	13:56
sdake	4 to be precise	13:56
sdake	the gate runs 1 vm	13:56
sdake	also this change if you wouldn't midn: https://review.openstack.org/#/c/423563/	13:57
sdake	running 4 vms in a vm is feasible, however, then your runnning virt (provided by vagrant) on vagrant	13:57
sdake	rather virt on virt	13:57
sdake	running virt (provided by vagrant) on virt (provided by your underlay virtual machine provider)	13:58
portdirect	Or you can just run the ansible that vagrant drives direct on some hardware.	13:58
sdake	morning portdirect	13:58
portdirect	But that's not documented yet ;)	13:58
portdirect	Hey guys	13:58
sdake	Jeffrey4l and finally this change: https://review.openstack.org/#/c/423566/ ;-)	13:59
sdake	all docs changes	13:59
Jeffrey4l	sdake, hrm. may other kolla-k8s core is better ;) i have no idea and context for there.	13:59
sdake	Jeffrey4l ok -well can you atleast ack the overview doc change	13:59
sdake	Jeffrey4l and ill find someone else to ack the other two reviews	14:00
Jeffrey4l	i am think why memcache rather than mysql ;(	14:00
sdake	i've like 6 patches lined up	14:00
sdake	backed up that is in the queue	14:00
*** schwicht has joined #openstack-kolla		14:00
sdake	Jeffrey4l roger on the other patches	14:00
portdirect	Jeffrey, memcache has no storage requirements	14:00
portdirect	Mysql does not install ootb	14:01
sdake	Jeffrey4l if you could just ack https://review.openstack.org/423067 - this chane is innocous enough and reviewed by several people failiar with kolla-kubernetes	14:01
sdake	I took jascott1 's text verbatum :)	14:01
portdirect	Oh dear :)	14:02
Jeffrey4l	ok. done.	14:02
sdake	thanks fine sir	14:02
sdake	Jeffrey4l i have another request	14:02
sdake	the kolla-kubernetes deliverable needs -amd64 for container images	14:02
sdake	both you and I -1'ed the review because it was out of conformity with the rest of hte containers	14:03
Jeffrey4l	sdake, why?	14:03
sdake	for just some container images	14:03
Jeffrey4l	any meaning for the suffix?	14:03
sdake	kubeadm depends on -amd64 being in the prefix to override image names	14:03
sdake	its only af ew containers	14:03
openstackgerrit	Merged openstack/kolla-kubernetes: Add an overview section to the documentation https://review.openstack.org/423067	14:03
sdake	Jeffrey4l portdirect mentioend he would have a patch ready today for it	14:03
openstackgerrit	Merged openstack/kolla-kubernetes: add a cd to halcyon-vagrant-kubernetes https://review.openstack.org/423039	14:04
openstackgerrit	Merged openstack/kolla-kubernetes: Install kubectl and helm on the host OS in docs https://review.openstack.org/423046	14:04
openstackgerrit	Merged openstack/kolla-kubernetes: Clean up mysql installed during test of helm https://review.openstack.org/423120	14:04
sdake	see what you did there Jeffrey4l ;)	14:04
Jeffrey4l	just curiosity, if there is no -amd64, it will break anything?	14:04
sdake	yes it wont work	14:04
Jeffrey4l	ok.	14:04
sdake	kubeadm --overide image=kube-controller	14:04
sdake	kubeadm is compiled for adm64	14:04
sdake	it automatically adds -amd64 to the suffix internally it expects	14:05
sdake	so if there is no -amd64 suffix, the image cannot be found	14:05
openstackgerrit	Merged openstack/kolla-kubernetes: Use environment variables to specify versions https://review.openstack.org/423555	14:05
sdake	which requires manual tagging	14:05
sdake	i didn't catch it either when i -1'ed it	14:05
sdake	and the submitter didn't push back hard enough on the reviewers :)	14:05
sdake	Jeffrey4l i am making up the --overide image=kube-controller syntax, I don't know the exact syntax, but I know there is one	14:06
Jeffrey4l	ok.	14:06
sdake	Jeffrey4l and I know kubeadm internally determines what platform its compiled for and launches the image it requires	14:06
sdake	Jeffrey4l did I explain it clearly?	14:08
Jeffrey4l	yep. fair enough.	14:08
sdake	Jeffrey4l rading back it looks a little unclear even to me and I wrote it :)	14:08
Jeffrey4l	for my understanding, kubeadm adds the suffix and we can not remove it..	14:09
*** sdake_ has joined #openstack-kolla		14:10
*** schwicht has quit IRC		14:13
*** sdake has quit IRC		14:13
*** sp___ has quit IRC		14:15
openstackgerrit	Pete Birley proposed openstack/kolla: Kubernetes: Rename Images to allow use with native deployment tools https://review.openstack.org/423633	14:19
*** sp___ has joined #openstack-kolla		14:27
*** salv-orlando has joined #openstack-kolla		14:29
openstackgerrit	caoyuan proposed openstack/kolla-ansible: Optimize reconfigure action for gnocchi https://review.openstack.org/423133	14:30
*** zhubingbing has joined #openstack-kolla		14:36
*** breitz has quit IRC		14:51
openstackgerrit	Jeffrey Zhang proposed openstack/kolla-ansible: Add chrony ansible role https://review.openstack.org/399355	14:52
*** breitz has joined #openstack-kolla		14:52
openstackgerrit	caoyuan proposed openstack/kolla-ansible: Optimize reconfigure action for cinder https://review.openstack.org/422384	14:59
*** pomac has quit IRC		15:00
kfox1111	morning	15:08
*** v1k0d3n has joined #openstack-kolla		15:14
*** zhubingbing has quit IRC		15:15
Syffs	Hello, I have lots of permission denied on mariadb and dnsmasq logs in my /var/log/messages: https://ask.openstack.org/en/question/101760/logs-permission-denied-mariadblog-and-dnsmasqlog/	15:17
Syffs	any idea?	15:17
kfox1111	selinux?	15:19
*** sdake has joined #openstack-kolla		15:20
openstackgerrit	caoyuan proposed openstack/kolla-ansible: Optimize reconfigure action for cinder https://review.openstack.org/422384	15:20
*** tonanhngo has joined #openstack-kolla		15:21
sdake	morning kfox1111	15:21
kfox1111	morning. :)	15:21
*** sdake_ has quit IRC		15:21
kfox1111	how goes the battle?	15:21
Syffs	Selinux is disabled	15:23
sdake	just spilled a soda all over my laptop groan	15:23
sdake	but it appears unharmed	15:23
sdake	kfox1111 can you hit the review queue and ack the doc changes	15:24
sdake	kfox1111 i am working really hard to get the docs into workable shape	15:24
sdake	running shape	15:24
sdake	whatever the right words are;)	15:24
sdake	remove the bit rot in essence	15:24
kfox1111	yeah. sec.	15:25
sdake	doesn't have to be now	15:25
sdake	just at some point - some of thes epatches sit around for a couple of days	15:25
sdake	which blocks work since its al on one file	15:25
sdake	i think the dev env is gtg now with the docker change that was -1'ed	15:25
sdake	haven't looked why it was -1'ed yet	15:26
sdake	;)	15:26
openstackgerrit	Steven Dake proposed openstack/kolla-kubernetes: Add a workaround for docker-engine 1.13.0 https://review.openstack.org/423568	15:33
sdake	yak shaving complete with that change and its children I think :)	15:33
sdake	off to visit my uncle in the hopsital, then have some tech support for my parents which involves staring at a backup screen for hours on end	15:34
sdake	during that, I'll take a crack at standing up kolla-kubernetes with ceph	15:35
*** sayantani01 has joined #openstack-kolla		15:37
*** sayantani01 has quit IRC		15:42
*** sdake_ has joined #openstack-kolla		15:45
sdake_	ok gott aroll	15:45
*** eaguilar has joined #openstack-kolla		15:47
*** sdake has quit IRC		15:48
*** Syfffs has joined #openstack-kolla		15:49
*** sdake_ has quit IRC		15:50
openstackgerrit	Kevin Fox proposed openstack/kolla-kubernetes: WIP: test rebooting in the gate https://review.openstack.org/413193	15:54
*** Syffs has quit IRC		15:54
*** eaguilar has quit IRC		15:55
*** tonanhngo has quit IRC		15:58
openstackgerrit	Merged openstack/kolla-kubernetes: Change mysql to memcached in docs https://review.openstack.org/423563	15:58
openstackgerrit	Merged openstack/kolla-kubernetes: Break apart helm init and helm ls operations https://review.openstack.org/423566	15:59
*** sayantani01 has joined #openstack-kolla		15:59
*** tonanhngo has joined #openstack-kolla		16:00
*** sdake has joined #openstack-kolla		16:06
openstackgerrit	caoyuan proposed openstack/kolla-ansible: Optimize reconfiguration for aodh https://review.openstack.org/423670	16:07
openstackgerrit	Kevin Fox proposed openstack/kolla-kubernetes: WIP: Support rolling upgrades of horizon https://review.openstack.org/333996	16:07
*** sdake has quit IRC		16:14
*** sacharya_ has joined #openstack-kolla		16:17
*** dave-mccowan has quit IRC		16:19
*** sacharya has quit IRC		16:20
*** sdake has joined #openstack-kolla		16:22
*** Syfffs is now known as Syffs		16:22
*** sacharya_ has quit IRC		16:22
*** salv-orlando has quit IRC		16:22
*** tonanhngo has quit IRC		16:26
*** sdake has quit IRC		16:29
*** adrian_otto has joined #openstack-kolla		16:31
*** Syffs has quit IRC		16:38
*** Syffs has joined #openstack-kolla		16:39
openstackgerrit	Kevin Fox proposed openstack/kolla-kubernetes: WIP: Support rolling upgrades of horizon https://review.openstack.org/333996	16:39
*** vcn has joined #openstack-kolla		16:56
*** vcn has quit IRC		17:01
*** Masber_080 has quit IRC		17:07
*** adrian_otto1 has joined #openstack-kolla		17:11
*** adrian_otto has quit IRC		17:14
*** salv-orlando has joined #openstack-kolla		17:23
openstackgerrit	Kevin Fox proposed openstack/kolla-kubernetes: WIP: test rebooting in the gate https://review.openstack.org/413193	17:23
*** sacharya has joined #openstack-kolla		17:26
*** salv-orlando has quit IRC		17:28
*** goldyfruit has joined #openstack-kolla		17:29
openstackgerrit	Kevin Fox proposed openstack/kolla-kubernetes: WIP: Support rolling upgrades of horizon https://review.openstack.org/333996	17:32
*** adrian_otto1 has quit IRC		17:49
*** adrian_otto has joined #openstack-kolla		17:57
*** masuberu has joined #openstack-kolla		17:59
*** sp___ has quit IRC		18:03
openstackgerrit	Kevin Fox proposed openstack/kolla-kubernetes: WIP: Support rolling upgrades of horizon https://review.openstack.org/333996	18:07
*** adrian_otto has quit IRC		18:09
*** adrian_otto has joined #openstack-kolla		18:10
kfox1111	sbezverk: you around?	18:19
*** salv-orlando has joined #openstack-kolla		18:25
*** goldyfruit has quit IRC		18:27
*** salv-orlando has quit IRC		18:29
openstackgerrit	Merged openstack/kolla-ansible: Fix docker restart policy issue on docker engine 1.13 https://review.openstack.org/423122	18:44
*** adrian_otto has quit IRC		18:50
SamYaple	kfox1111: does k8s mount a _single_ rbd? or an rbd per volume created?	18:58
*** n0isyn0i1e has joined #openstack-kolla		19:13
*** n0isyn0ise has quit IRC		19:16
openstackgerrit	Marcus Williams proposed openstack/kolla-ansible: [WIP] Split Openvswitch into own role https://review.openstack.org/416793	19:22
*** sacharya has quit IRC		19:24
*** sacharya has joined #openstack-kolla		19:25
*** salv-orlando has joined #openstack-kolla		19:25
Syffs	hello, I'm having permission denied in loops in my system logs: https://ask.openstack.org/en/question/101760/logs-permission-denied-mariadblog-and-dnsmasqlog/ ... any idea?	19:29
*** salv-orlando has quit IRC		19:30
*** sacharya has quit IRC		19:30
*** williamcaban has quit IRC		19:30
kfox1111	SamYaple: rbd per volume.	19:40
openstackgerrit	Marcus Williams proposed openstack/kolla-ansible: [WIP] Split Openvswitch into own role https://review.openstack.org/416793	19:48
*** williamcaban has joined #openstack-kolla		19:51
*** adrian_otto has joined #openstack-kolla		19:53
*** dave-mccowan has joined #openstack-kolla		20:07
*** sacharya has joined #openstack-kolla		20:26
*** salv-orlando has joined #openstack-kolla		20:27
*** salv-orlando has quit IRC		20:31
*** adrian_otto has quit IRC		20:31
*** sacharya has quit IRC		20:31
*** sdake has joined #openstack-kolla		20:35
*** sdake has quit IRC		20:40
*** sdake has joined #openstack-kolla		20:42
sdake	sup peeps	20:47
openstackgerrit	Marcus Williams proposed openstack/kolla-ansible: [WIP] Split Openvswitch into own role https://review.openstack.org/416793	20:52
kfox1111	hey	20:53
openstackgerrit	Steven Dake proposed openstack/kolla-kubernetes: Add a workaround for docker-engine 1.13.0 https://review.openstack.org/423568	20:57
sdake	sup kfox1111	20:57
sdake	kfox1111 if you could ack https://review.openstack.org/#/c/423568/	21:10
sdake	i'd like that to merge prior to monday if possible :)	21:10
sdake	sbezverk ^^	21:10
*** adrian_otto has joined #openstack-kolla		21:15
kfox1111	sdake: I can. is that the correct workaround though?	21:18
sdake	kfox1111 tested and works for me	21:18
kfox1111	seems like its really a bad docker version, and we should recommend around it.	21:18
sdake	portdirect suggested it	21:18
sdake	docker isn't changing to permit -'s in their organization names	21:18
kfox1111	if it really is a problem with docker not supporting '-'s in repo names.	21:18
kfox1111	that would be a docker bug.	21:19
sdake	i woudl bet 1 dollar against 100	21:19
kfox1111	they use to.	21:19
sdake	its going to be a permanant docker thing	21:19
kfox1111	all other versions of docker did.	21:19
sdake	in any regard, its a short term fix - i suspect helm can sort it out with the docker cats	21:19
sdake	the problem without it is nobody can get any work done	21:19
sdake	and if its not documented its not a thing	21:19
kfox1111	I expect docker to revert that change.	21:20
sdake	its part of docker hub	21:20
kfox1111	the other way to document it is with a workaround to avoid that version.	21:20
sdake	its not like we manually install docker in the instructions	21:20
kfox1111	there are lots of docker repos. quey, google's, artefacteries, gitlab	21:20
sdake	that happens via the dev env	21:20
kfox1111	if they use to supprot a -, but don't anymore, that will affect a lot of repo's.	21:20
sdake	possible - not our problem	21:21
sdake	its none of our business on docker's policies	21:21
sdake	if we make it our business, i predict badness wil loccur	21:21
kfox1111	pointing at a non official container seems more risky then telling users to avoid a buggy version of docker.	21:21
sdake	lets let helm sort it out	21:21
sdake	hsort term fix - that is why there is a date in the docs	21:21
sdake	the docs are updated continously	21:22
sdake	or shoudl be	21:22
sdake	they haven't	21:22
sdake	that is changing starting as of thursday of last week	21:22
* kfox1111 shrugs. you asked for my optinion. Thats my opinion.		21:22
sdake	i asked for an ack, not a dissertation on how docker is evi :)	21:22
kfox1111	working docks are preferable to non working docs.	21:22
kfox1111	no, just being consistent.	21:22
kfox1111	you didn't like how I ponted to a docker contiainer in my own docker hub namespace before.	21:23
sdake	ya - fwiw i did ask pete to pin docker-engine to an older version in the dev env	21:23
kfox1111	and I did agree with you at the time.	21:23
sdake	if he does that, we can just remove that warning	21:23
sdake	yup i dont like it	21:23
kfox1111	I don't htink this is different.	21:23
sdake	i expec tthis to be in the docs for max 1 week	21:23
sdake	a thundering herd of people are trying to get kolla-kubernetes rolling	21:23
sdake	and running into problem after p roblem	21:23
sdake	i say trying because they are failing	21:24
sdake	me included	21:24
sdake	if my answer is 'we can't figure out how to get the stuff fixed or worked around" they will be like "are u srs?"	21:24
sdake	in this case, we are doing two things	21:24
sdake	working around (by changing dev env that portdirect is taking on, and documenting a seocnd workaroudn until pete gets the docker-engine pinned)	21:25
sdake	the fixing part comes from helm upstream - I'd hope	21:25
sdake	the reality is we cna't wiat 2 months for a new docker-engine release to fix this problem	21:25
kfox1111	I don't think helm can work around how gogle lays out their repo's.	21:25
sdake	docker is on a 2 month release cadence	21:25
sdake	not expecint helm to work around it, i'm expecting them to work with docker to fix it as you requested	21:26
kfox1111	yeah. that.	21:26
sdake	this wil lbe a major emergency for them	21:26
sdake	it isn't a major emergency for us	21:26
kfox1111	yeah. I hope they take it seriously.	21:26
sdake	our major emergency is that the dev env as docuemtned doesn't work properly :)	21:27
kfox1111	agreed.	21:32
kfox1111	theres a second smaller emergency.	21:32
kfox1111	I've been working on the reboot testing gate.	21:32
kfox1111	found an issue with rbd support and the way k8s uses it.	21:32
kfox1111	so I don't think we can use ceph rbd to back k8s volumes.	21:33
*** salv-orlando has joined #openstack-kolla		21:33
*** l4yerffeJ has joined #openstack-kolla		21:34
*** Jeffrey4l_ has joined #openstack-kolla		21:35
*** l4yerffeJ_ has quit IRC		21:35
*** Jeffrey4l has quit IRC		21:35
*** goldyfruit has joined #openstack-kolla		21:36
*** Bigkol has joined #openstack-kolla		21:38
Bigkol	Where are 3.0.2 release notes?	21:41
*** Bigkol has quit IRC		21:45
sdake	portdirect if you could pin the docker-engine in the dev env, that owuld be helpful :)	21:46
* sdake groans		21:46
*** klindgren__ has joined #openstack-kolla		21:48
*** klindgren_ has quit IRC		21:49
sdake	Jeffrey4l_ you around?	21:50
sdake	Jeffrey4l_ something is odd about the kolla.yaml file	21:50
sdake	also kolla-ansible.yaml is not in the releases repo	21:50
* sdake groans		21:51
sdake	oh i see - i'm in the newton directory	21:52
sdake	Jeffrey4l_ I think basically the release-notes section needs to be added to the newton directory	21:52
sdake	Jeffrey4l_ as well as the ocata directory for kolla.yml and kolla-ansible.yaml	21:53
*** sdake has quit IRC		22:02
openstackgerrit	Kevin Fox proposed openstack/kolla-kubernetes: WIP: Support rolling upgrades of horizon https://review.openstack.org/333996	22:03
*** imcsk8_ has quit IRC		22:04
*** imcsk8 has joined #openstack-kolla		22:05
openstackgerrit	Marcus Williams proposed openstack/kolla-ansible: [WIP] Split Openvswitch into own role https://review.openstack.org/416793	22:17
portdirect	o/	22:19
* portdirect reading		22:19
portdirect	kfox1111: sup with k8s rbd?	22:21
kfox1111	was trying to get k8s to recover.	22:21
kfox1111	seeing mariadb not ever recover.	22:22
portdirect	you have logs?	22:22
kfox1111	dug in and saw the rbd cli stuck.	22:22
*** l4yerffeJ has quit IRC		22:22
*** l4yerffeJ has joined #openstack-kolla		22:23
kfox1111	talked to samyample, and he told me the kernel rbd stuff has a lot of issues.	22:23
kfox1111	in particular, there are a few scases on startup, and even if ceph goes offliene for a bit, where the kernel mount can lockup until reboot.	22:23
portdirect	hmmm	22:23
portdirect	logs?	22:23
kfox1111	logs here: http://logs.openstack.org/93/413193/66/experimental/gate-kolla-kubernetes-deploy-centos-binary-2-ceph-reboot-nv/391a2e1/	22:24
*** salv-orl_ has joined #openstack-kolla		22:24
kfox1111	for the ceph case, we're only using it to back mariadb/rabbitmq and helm-repo.	22:24
kfox1111	mariadb/rabbbit I was contimplating running with local storage anyway in a cluster. so could work around it that way.	22:25
kfox1111	helm-repo we could maybe just ditch the pv.	22:25
kfox1111	wasn't sold on the need anyway.	22:25
portdirect	i kinda wonder if we should pin helm-repo to kube master node(s)	22:25
portdirect	and then we could just use local storage	22:26
kfox1111	yeah. should be easy to do by just tweaking the selector_key	22:26
*** salv-orlando has quit IRC		22:27
portdirect	so looking at logs	22:27
portdirect	could you remind me of how things work atm?	22:27
portdirect	mariadb seems running: http://logs.openstack.org/93/413193/66/experimental/gate-kolla-kubernetes-deploy-centos-binary-2-ceph-reboot-nv/391a2e1/console.html#_2017-01-21_17_58_41_081954	22:27
kfox1111	seems that way, but not.	22:28
kfox1111	so, I pull the plug on the node to simulate a power failure:	22:28
kfox1111	https://review.openstack.org/#/c/413193/66/tests/bin/gate_reboot_master.sh line 84.	22:29
kfox1111	k8s comes back up, and things everything is still running.	22:29
kfox1111	takes it a bit to figure out it s not.	22:29
kfox1111	if you look under logs/pods/mariadb	22:30
kfox1111	you can see the state of the current mariadb. it complains there that it cant start due to volume not mounting.	22:30
kfox1111	http://logs.openstack.org/93/413193/66/experimental/gate-kolla-kubernetes-deploy-centos-binary-2-ceph-reboot-nv/391a2e1/logs/pods/kolla-mariadb-0.txt	22:30
portdirect	got it	22:31
kfox1111	if you look at the ceph logs, it looks like it running and recovered properly by that point.	22:31
portdirect	:(	22:31
kfox1111	I added a ps -eo etime,cmd to the logs,	22:31
kfox1111	to see whats running, and how long:	22:31
kfox1111	http://logs.openstack.org/93/413193/66/experimental/gate-kolla-kubernetes-deploy-centos-binary-2-ceph-reboot-nv/391a2e1/console.html#_2017-01-21_18_06_37_173757	22:32
kfox1111	you can see rbd commands stuck for 7 minutes, 21 seconds.	22:32
kfox1111	looking in logs/dmesg, you can see its not printing out anything ugly a bout ceph either.	22:32
kfox1111	so at that point, Iwent to ceph to ask if they've ever seen rbd lock up.	22:32
kfox1111	then SamYample said, its a known ugly with the kernel rbd stuff, and its rather broken.	22:33
portdirect	yeah - thats not looking too great	22:33
kfox1111	said the only way to recover is to reboot. :/	22:33
portdirect	moving to dynamic pvc would make it a tiny bit better i think	22:33
kfox1111	it fails a timeout sometimes and then wont ever try and recover.	22:34
kfox1111	no. would still fail in this case.	22:34
portdirect	as the controller manager would catch that rbd is busted	22:34
kfox1111	as the pvc won't be recreated on reboot.	22:34
portdirect	so the mariadb pod would not come up at all	22:34
portdirect	but that doesnt give us a working system :(	22:34
kfox1111	I asked if I could wrapper the rbd-fuse client. said it would have a big performance hit.	22:35
kfox1111	cephfs is an option too, but again, a performance hit.	22:35
portdirect	urgh	22:35
kfox1111	it may be good enough for a small cloud though. would have to benchmark it.	22:36
portdirect	gluster, lets use gluster	22:36
portdirect	:trollface:	22:36
kfox1111	heh.	22:36
kfox1111	I'd expect it to perform just like cephfs.	22:36
kfox1111	so, galera pinned to nodes is probably our best bet then.	22:36
portdirect	similar, running databases on gluster is stupid	22:36
kfox1111	which is desirable for ha anyway.	22:37
portdirect	i think that that may be ther right solution	22:37
portdirect	but need to think it through	22:37
kfox1111	yeah.	22:37
kfox1111	we were considering buying some ssd's to back one of our cloud's cluster databases anyway.	22:37
kfox1111	so would probably need that functionality anyway.	22:38
kfox1111	this may just expidite it.	22:38
portdirect	running galera on ceph as always seemed a bit overkill to me	22:38
kfox1111	it was a simple way not to have to cluster to get some level of ha.	22:38
portdirect	yup	22:38
kfox1111	so, had a bit of benifit. but if its unstable, then its not worht it.	22:39
portdirect	though, lets think thos through before we assume the sky is falling	22:39
kfox1111	sure.	22:39
portdirect	can we fix this without restarting a node?	22:40
kfox1111	I asked if delaying till ceph was stable would fix the issue.	22:40
kfox1111	said it would help, but if ceph goes down for any reason for a while, the kerenl could do the same thing.	22:40
kfox1111	I personally don't want to take that risk. :/	22:40
kfox1111	we've had ceph's need attention before. things always recovered after fixing so was just fine.	22:41
portdirect	did they have rommendations?	22:41
kfox1111	no. just, don't use kernel rbd.	22:41
portdirect	was this SamYaple or peeps at ceph?	22:41
kfox1111	said the most performant way was rbd in kvm in a vm. though we don't have that luxury.	22:41
* portdirect trusts both		22:41
kfox1111	SamYaple.	22:42
portdirect	lol - we actually could on most installs	22:42
kfox1111	yeah.	22:42
portdirect	but cant really go down that path :(	22:42
kfox1111	we'd still need a local mariadb at some point I think.	22:43
kfox1111	so the simplest solution would be required in either case I think. :/	22:43
portdirect	right - so (and plaing devils advocate)	22:43
portdirect	the issue with local storage is stat an operator needs to look after a node	22:43
kfox1111	right. or a set of nodes.	22:44
portdirect	vs ceph, where we have a nice blug amorphous blob to puth things in	22:44
kfox1111	right.	22:44
portdirect	local storage is a multiple of times faster than ceph	22:44
kfox1111	possibly.	22:45
kfox1111	in latency sensitive cases, probably.	22:45
portdirect	my gut stays go local	22:45
kfox1111	yeah. mine too.	22:45
portdirect	but this is too big for us to decide here	22:45
kfox1111	its in the relm of building blocks too though. so we can always provide different building block options and let the operators decide the best way for themselves.	22:46
portdirect	+1	22:46
portdirect	if you can work it in as an option simply then I think that that makes sense	22:46
kfox1111	yeah. I thinnk we can. just put a conditional around the pvc and add an attribute to pin it to a node.	22:47
kfox1111	though, I think there's one other unknown...	22:47
portdirect	just so you know (and remeber my scale has not been the same as yours)	22:47
kfox1111	currenty its implemented as a statefulset.	22:47
portdirect	i've typically run my dbs with local storage backed by simple raid	22:47
kfox1111	we could do it locally, as a deployment, a statefulset, or a daemonset.	22:47
kfox1111	yeah. we have too.	22:48
kfox1111	only contimplating ssd's on one of my clouds, as we've foudn the db load high enoguh,	22:48
kfox1111	that when we pull out/put back a member, it takes about an hour to fully sync.	22:48
kfox1111	performing a full rolling upgrade then takes 3. :/	22:49
portdirect	do we have a health check on mariadb pod?	22:49
kfox1111	doesn't look like it yet.	22:49
portdirect	we need that	22:49
kfox1111	yeah, we need a bp to do it everywhere.	22:49
portdirect	as it would have sorted the issue in the first logs you showed	22:50
kfox1111	no,	22:50
portdirect	not improved the situation, just the visibility of it	22:50
kfox1111	the very first issue was k8s showing data from etc right after reboot,	22:50
kfox1111	but before it talked to kubelet.	22:50
kfox1111	etcd\	22:50
portdirect	ah	22:50
portdirect	how fast does a node reboot?	22:50
kfox1111	pretty quick. 10's of seconds I think?	22:51
portdirect	if its under 30 then that is a problem (i think)	22:51
kfox1111	it took longer for kubelet to recover kube-apiserver.	22:51
kfox1111	and the controller manager.	22:51
portdirect	right - so thinking	22:53
portdirect	local stoarge = good	22:53
portdirect	need to work out reboot issues a bit more	22:54
portdirect	rebooting a AIO is not the same a s a DC power outage	22:54
portdirect	similar but not the same :/	22:54
kfox1111	yeah.	22:55
portdirect	soon as we can run outside of the gat this weill become a lot easier to work out	22:55
kfox1111	well, the gate is starting to test it now.	22:55
kfox1111	so we can test more frequently then is possible with dedicated hardware	22:55
portdirect	yeah but the gate is not interactive :/	22:56
kfox1111	true that.	22:56
kfox1111	anothther advantage of doing it with local, is it does simplify the install instructions.	22:57
kfox1111	as k8s no longer needs ceph support at all.	22:57
portdirect	also while cant run outside of gate - we don't really have a deverable, but a science project :)	22:57
kfox1111	true. we have building blocks for assembling a cloud.	22:57
kfox1111	but no instructions for how to do it.	22:58
portdirect	can you take that on?	22:58
kfox1111	building one outside, or instructions?	22:58
portdirect	al teast an set of rough notes that someone could massage	22:58
portdirect	either/both	22:59
kfox1111	next week or the weke after, I'm going to be deploying it on our test hardware.	22:59
kfox1111	sdake's working on fresh docs right now.	22:59
kfox1111	I offered to help him too, wherever he needs it.	22:59
kfox1111	so I think docs will show up soon.	22:59
portdirect	great :)	22:59
portdirect	thanks man	23:00
kfox1111	np. :)	23:00
kfox1111	sbezverk has it running on some of his hardware too.	23:00
kfox1111	so it does work outside the gate for sure.	23:00
kfox1111	I had it running on my laptop in a vm for a while too. but I let that bitrot as the gate was more important to prevent regressions.	23:01
portdirect	yeah - not trying to needle you , just its a bit of a black box atm	23:01
portdirect	and behing honest, I gave up	23:01
portdirect	and thats not good :/	23:01
kfox1111	yeah, no needling felt. yeah, its a black box currently. :/	23:01
kfox1111	agreed.	23:01
kfox1111	if you needed help though, you could have asked? or did I miss it?	23:02
kfox1111	happy to help	23:02
portdirect	cheers man :) I know you are - just had too much else on to devote time to it	23:02
kfox1111	k. yeah. I get that.	23:03
portdirect	will have a shot once sdakes got docs in progress	23:03
kfox1111	cool. sounds good.	23:03
portdirect	:)	23:03
portdirect	right I better get back to git merge hell	23:03
kfox1111	always good to have someone that hasn't tried the docs, or done a deploy test out the new docs. :)	23:03
kfox1111	fun. :/	23:03
portdirect	I miss gerrit for some of the stuff im working so much at the moment	23:03
kfox1111	oh...	23:03
kfox1111	so, back to the previous question real quick.	23:04
*** dave-mccowan has quit IRC		23:04
kfox1111	I can see 3 options for local storage mariadb.	23:04
portdirect	shoot	23:04
kfox1111	statefulset, daemonset, or deployment. all can be pinned to a set of nodes.	23:04
portdirect	satefulset	23:04
kfox1111	statefulset would work, but replica's woudl always have to be set to 1,	23:04
kfox1111	and then pinned to the node.	23:05
*** adrian_otto has quit IRC		23:05
kfox1111	if you wanted a cluster, you'd have to launch multiple of them.	23:05
kfox1111	same with deployment.	23:05
kfox1111	though deployment is a little ligher wieght for that use case.	23:05
portdirect	kinda	23:05
kfox1111	daemonset's interesting, in that you coudl just label the nodes as mariadb,	23:05
portdirect	so (and working this though as I type)	23:05
kfox1111	and set net=host	23:05
portdirect	you could do the same with staefulset	23:05
portdirect	thats what i was going to suggest	23:06
portdirect	but you would need to have same number of nodes as replicas	23:06
kfox1111	and instances too.	23:06
kfox1111	where with a daemonset, you just launch one. and it launches one on all labeled nodes.	23:06
portdirect	replicas=insatces	23:06
kfox1111	so a cluster might be easier.	23:06
portdirect	daemonset would work for sure	23:06
kfox1111	though without rolling suppor t yet,	23:07
portdirect	but k8s gods would not like you	23:07
kfox1111	no way to do a rolling cluster upgrade.	23:07
portdirect	yes	23:07
kfox1111	thats in ps review though.	23:07
portdirect	mirantis have a soltion for that as well	23:07
portdirect	https://github.com/Mirantis/k8s-daemonupgradecontroller	23:08
kfox1111	ah.	23:08
*** salv-orl_ has quit IRC		23:08
kfox1111	seems like they really like to add stuff outside of k8s. :/	23:08
kfox1111	coreos has a ps going to k8s itself based on the deployment upgrade code.	23:08
* portdirect wont say the things he wants too		23:09
* portdirect but maybe that comment just said it?		23:09
kfox1111	perhaps.	23:09
portdirect	yeah i like the coreos approach to this	23:09
kfox1111	you see their ps for self hosting kubelet?	23:10
kfox1111	its fasinating. :)	23:10
kfox1111	not sure it will work well, but they have a prototype working I hear.	23:10
kfox1111	they can then use the deployment upgrade code to upgrade all the kubelets.	23:10
kfox1111	pretty slick.	23:10
portdirect	yeah - I've been looking into k8s-on-k8s quite seriously	23:11
kfox1111	so, the tradeoff between deployments and statefulset, is a deployment is easily upgraded.	23:11
kfox1111	we wouldn't be usinjg any of the features of statefulset either.	23:11
kfox1111	so in a cluster, a deployment with local storage and net=host might be better?	23:12
portdirect	yeah - though the really nice thing about statefulsets for us is how they start up	23:12
portdirect	if you have livelyness checks	23:12
portdirect	they start in order	23:12
portdirect	and for galera this pretty sweet	23:12
kfox1111	yeah. but since replica's = 1 in this case always, it wouldn't help. :/	23:12
kfox1111	yeah.	23:12
portdirect	but whay run replicas=1?	23:13
portdirect	why not replicas=3?	23:13
kfox1111	case we need to pin the pod to the host for local storage to always be there.	23:13
kfox1111	since there is no automated way to do that yet. :/	23:13
portdirect	but is you set replcias=3, and then have only 3 nodes labeled mariadb	23:13
portdirect	that will do what you want	23:13
kfox1111	hmm...	23:14
kfox1111	ordered start wouldn't be garanteed then?	23:14
kfox1111	well...	23:14
portdirect	it would be	23:14
kfox1111	well, yes/no...	23:14
portdirect	you just would not be able to controll the order of nodes that the pod was started on	23:14
kfox1111	the names would always be 0-2,	23:14
portdirect	but therw ould always be ordeing in st startup	23:14
kfox1111	but which nodes got which pods would be random.	23:14
portdirect	yes	23:14
kfox1111	so the order of galera startup would be random.	23:15
portdirect	but thats fine	23:15
portdirect	but the first to start would always be 0	23:15
kfox1111	yeah.	23:15
kfox1111	that might be ok.	23:15
kfox1111	autorecovery there has always made me a little worried there.	23:15
kfox1111	the galera docs left it as an exercise to the operator to determine "the last db down",	23:16
kfox1111	and chosen to be the first up.	23:16
portdirect	"the last db down",none of what we have discussed solves that :/	23:16
kfox1111	yeah.	23:16
portdirect	and thats a whole other question :)	23:16
kfox1111	thats another issue. though, statefulsets may make it worse?	23:17
portdirect	no	23:17
kfox1111	one way I can think to solve it,	23:17
kfox1111	is to do a trigger file on the host via hostpath.	23:17
kfox1111	the operator touches a file on the host saying, this is the master to start first.	23:17
kfox1111	if all 3 pods came up at the same time, watched for the file to show, and only changed to be the master if they saw it,	23:18
kfox1111	it should then come up pretty smooth.	23:18
portdirect	but how would the operator know where to puth that file?	23:18
kfox1111	would have to be in the recovery docs.	23:18
portdirect	meaning manual?	23:18
kfox1111	yeah.	23:19
portdirect	ok	23:19
kfox1111	so that passes the buck the same way gallera passed the buck.	23:19
portdirect	i mean that works	23:19
kfox1111	as the end user, I have no idea most of the time how to tell, other then random.	23:19
kfox1111	:/	23:19
kfox1111	one of the problems with galera. :/	23:20
portdirect	thats not really true	23:20
portdirect	but its not easy	23:20
*** salv-orl_ has joined #openstack-kolla		23:20
kfox1111	yeah. there's some basic sanity checks.	23:20
mgkwill	Hiya, if anyone is hanging around from kolla-ansible, I'd love further review of 'Split Openvswitch into own role' https://review.openstack.org/#/c/416793	23:20
kfox1111	but if all 3 nodes basically died at the same time, and all seem to have data, its hard to guess which one is newest.	23:21
portdirect	you whould be able to work out which node went down last by looking at the state files on each node	23:21
kfox1111	pretty much every time I've had to recover the cluster it was due to power outage.	23:21
portdirect	but getting that automated is a proejct in itself :/	23:21
kfox1111	yeah.	23:21
portdirect	right back to merging hell :/ laters man	23:22
kfox1111	ok. l8r.	23:22
*** jistr has quit IRC		23:41
*** sdake has joined #openstack-kolla		23:43
*** jistr has joined #openstack-kolla		23:49
sdake	kfox1111 aren't ceph rbd volumes used now to back k8s volumes?	23:50
sdake	kfox1111 sorry missed your msg from earlier - was in the middle of getting rady for naptime :)	23:50
sdake	kfox1111 or does it have to do with what happens after a reboot?	23:50
*** dims has quit IRC		23:59

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!