16:01:14 #startmeeting openstack_ansible_meeting 16:01:15 Meeting started Tue Oct 10 16:01:14 2017 UTC and is due to finish in 60 minutes. The chair is evrardjp. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:01:16 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 16:01:20 The meeting name has been set to 'openstack_ansible_meeting' 16:01:22 evrardjp: ok, sure. 16:01:22 #topic rollcall 16:01:32 o/ 16:01:41 #info hwoarang 16:01:53 #info hwoarang 16:02:04 that's interesting. 16:02:06 o/ - here partly 16:02:20 ok we have quorum, let's start. 16:02:24 lol 16:02:26 :p 16:02:44 If anyone wants to handle bug triage in the future, that would be great. 16:03:08 #topic new bugs 16:03:15 #link https://bugs.launchpad.net/openstack-ansible/+bug/1722273 16:03:15 Launchpad bug 1722273 in openstack-ansible "haproxy_keepalived_vars_file not overriding keepalived configuration" [Undecided,New] 16:04:03 that sounds valid docs issue 16:04:22 and I am probably the one to blame. 16:04:29 but let's skip that part. 16:04:32 :D 16:04:54 confirmed, but what level? IMO we are setting wrong expectations, so I'd put it as high 16:05:12 ok 16:05:46 spotz: do you have time to look at this one? 16:06:17 next 16:06:19 #link https://bugs.launchpad.net/openstack-ansible/+bug/1722200 16:06:20 Launchpad bug 1722200 in openstack-ansible "percona-xtrabackup unmet dependencies" [Undecided,New] 16:07:21 andymccr: the docs for the upgrade is still manually updated, right? It's not coming from the upgrade script or anything 16:07:24 hmm 16:07:48 the docs for the upgrade? 16:08:04 they arent automatically updated anyway i know of at least 16:08:09 ok 16:08:21 maybe it's out of sync with the code 16:08:35 but also we don't test this in our periodics... 16:08:42 so we might have missed it 16:09:39 we dont do db upgrades in the os upgrade? 16:09:40 yeah 16:09:45 tbh i think those 2 things should be separate 16:09:58 if you want to update your DB go for it, we have a process for that, but it isn't specifically tied into openstack itself 16:10:37 oh you mean completely removing the db part of the maintenance, and do it separately? 16:10:54 i mean that would be up for debate 16:10:55 in our run_upgrade script we do implement the flag to upgrade the DB 16:10:56 but i personally think so 16:11:04 oh so we do an upgrade test during the upgrade scenarios? 16:11:08 in that case. sure 16:11:15 yes, one should do it seperately, but that's why we outline all the steps and recommend you do them manually 16:11:20 i mean i still think the 2 are separate 16:11:24 i wouldnt advise doing a db upgrade during your openstack upgrade 16:11:26 but thats just me :) 16:11:27 andymccr: yes we do test it, but not this one particularily because we don't test N to O 16:11:38 evrardjp: don't we? i thought we had an ocata upgrade scenario 16:11:41 we do test N to O 16:11:47 mmm 16:11:55 i think i checked last time and it was working too - but could be wrong 16:12:02 the periodic upgrade test for 'ocata' is an upgrade to Ocata 16:12:17 and yes, it's been reliably working for some time 16:12:27 it doesn't show up on health 16:12:38 the galera_server repo also does a cluster upgrade per commit for the newton/ocata branches IIRC 16:12:41 oh maybe we don't generate subunit for it? 16:12:52 no subunit for any branches older than pike 16:12:59 ok 16:13:03 that's it 16:13:19 http://logs.openstack.org/periodic/periodic-openstack-ansible-upgrade-aio-ocata-ubuntu-xenial/ 16:14:06 the latest one succeeded 16:15:01 this issue may be due to using the percona repo instead of the downloaded package? 16:15:13 not sure - the issue will need confirming 16:15:27 we know that the defaults aren't showing that failure though due to the periodics 16:15:35 yeah, so what concerns me there is the version 16:15:42 2.3 instead of 2.4 16:16:27 on the job you see it tries to install 2.4 from 2.3 and it works 16:16:41 I will check with armaan in more details tomorrow I guess 16:17:39 ok let's move on 16:17:43 #link https://bugs.launchpad.net/openstack-ansible/+bug/1721912 16:17:44 Launchpad bug 1721912 in openstack-ansible "delegate_facts does not work with container connection plugin" [Undecided,New] 16:18:32 a fun one :( 16:18:50 it looks pretty valid 16:18:55 ugh 16:19:01 we should probably have a test for that 16:19:12 that'd be at least high - because that will cause some weird outcomes 16:19:21 yup. 16:19:50 logan-: or jmccrory, did one of you got the chance to work on those? 16:20:15 we start to have too many "high" bugs :/ 16:20:31 ok let's continue 16:20:39 #link https://bugs.launchpad.net/openstack-ansible/+bug/1721878 16:20:40 Launchpad bug 1721878 in openstack-ansible "os_magnum: Pike: missing default_docker_volume_type" [Undecided,New] 16:21:11 I think Florian has a point 16:21:58 but it also makes sense to wire things up 16:23:26 ja, it probably makes sense to wire up if we can 16:23:28 yeah 16:23:43 regardless of whether magnum should be fixed, we'll have to carry a workaround 16:24:16 I do think that magnum should handle the issue more gracefully though 16:24:20 in the meantime, because we can workaround in user variables, I'd tend to mark it as medium 16:24:38 funny enough, handling missing deps gracefully is one of the 12FA principles :p 16:24:53 we can either wire it up, or add a reno with a known issue 16:25:27 odyssey4me: that's true 16:25:37 we still need work there 16:25:47 either docs for magnum, or wire code 16:25:54 medium, ok everyone? 16:26:05 maybe docs, similar to horizon here https://github.com/openstack/openstack-ansible-os_cinder/blob/master/doc/source/configure-cinder.rst#openstack-dashboard-horizon-configuration-for-cinder 16:26:17 ja, an established workaround is available 16:26:28 good suggestion there jmccrory 16:28:18 ok next 16:28:33 #link https://bugs.launchpad.net/openstack-ansible/+bug/1721554 16:28:34 Launchpad bug 1721554 in openstack-ansible "os_ceilometer fails without swift installed" [Undecided,New] 16:28:46 ceilometer... 16:29:08 I am working on the role maturity FYI, but in the meantime... 16:29:58 what should we do? 16:30:06 are we going ahead with pulling those out from master at least? 16:30:36 whatever happens, we're going to have to maintain up to pike anyway because we didn't pull it from pike 16:30:57 so I guess we may as well figure out how to make it work and fix it 16:31:39 ... 16:31:44 lovely 16:31:56 that bug will need confirmation, but it looks plausible 16:32:08 I'd mark it low, given that those roles are unmaintained 16:32:22 hmm, maybe medium actually because there's no workaround 16:32:34 well its a problem in that it doesnt work in pike im sure so we are basically going to say "we cant fix it cos tahts a feature, but we cant drop it" 16:32:48 seems sensible to still have a thing saying "this role isnt maintained properly since newton" or w/e the case may be 16:33:18 if we take that stance, then perhaps we should remove those roles from integration all the way back to newton and do a minor rev 16:33:24 with a reno obviously 16:33:43 maybe not newton if it works in N? 16:33:49 well i just think its misleading to keep them in for the sake of it 16:33:56 I agree there 16:33:59 maybe leave N 16:34:00 like they dont work, or nobody is using/maintaining them 16:34:06 i think N works 16:34:12 I think it's bad to set wrong expectation 16:34:17 expectations 16:34:17 but O onwards has been unmaintained 16:34:17 yeah ^ 16:34:24 ok 16:34:35 yup - we just need to move all the stuff out into the roles and have the 'extras' stuff like cloudkitty 16:34:37 we get a tonne of bug reports on those and its like "cool but we need your help to fix it" 16:35:18 yeah if I got a cent every metering stack bug report, I'd be iphone X world right now. 16:35:33 ok maybe a dollar instead. 16:35:36 anyway. 16:35:40 that is a strange task though. why do ceilometer hosts need a swift user? 16:35:45 Manuel Buil proposed openstack/openstack-ansible-os_neutron master: Make sure iptables is installed https://review.openstack.org/510935 16:35:55 https://github.com/openstack/openstack-ansible-os_ceilometer/blob/master/tasks/ceilometer_pre_install.yml#L32-L40 16:36:19 jmccrory because of gnocchi 16:36:33 if gnocchi uses swift, then it needs a swift user 16:36:57 hmm ok 16:37:29 In the meantime, let's not mark it as confirmed, I marked it as medium 16:37:50 if we continue with the role deprecation work, and validate it, I can send a mail to the ML asking for contributors 16:38:05 if no one comes over at next community meeting, we are done and we can deprecate it 16:38:18 let's move to some other bug, for andymccr's pleasure: 16:38:23 #link https://bugs.launchpad.net/openstack-ansible/+bug/1721356 16:38:24 Launchpad bug 1721356 in openstack-ansible "nova-placement-api haproxy health check is broken" [Undecided,New] 16:38:27 we should have pulled it out in ocata :/ 16:38:34 i thought i fixed this 16:39:22 andymccr: https://github.com/openstack/openstack-ansible/blob/master/group_vars/haproxy_all/haproxy.yml#L179-L180 ? 16:39:43 well apparently i didnt 16:39:44 in https://github.com/openstack/openstack-ansible/commit/c4105464e6e5c1f199e8e256ced17208e913935c ? 16:39:45 since it hasnt changed 16:40:07 my bug is from ocata 16:40:10 maybe it didn't get backported 16:40:11 ahh 16:40:16 so maybe we just need to backport that 16:40:18 although i think 16:40:20 in ocata we do it slightly diff 16:40:29 because in ocata we use nginx + uwsgi 16:40:35 in pike+ we use uwsgi direct 16:40:36 right 16:40:39 oh 16:40:42 that's it 16:40:59 so in ocata when uwsgi fails to start, nginx does the bad gateway response and haproxy leaves the endpoint up 16:41:07 logan-: ahh 16:41:17 on top of that haproxy config is different in O 16:41:26 maybe we need an expect there? 16:41:36 rather status expectation on haproxy 16:41:41 and to fix whatever is causing uwsgi not to start 16:41:57 the uwsgi failing to start was my fault, but it uncovered this :) 16:42:03 couldn't this be changed too https://github.com/openstack/openstack-ansible/blob/stable/ocata/playbooks/vars/configs/haproxy_config.yml#L153 ? 16:42:04 logan-: ahh ok fair enough 16:42:45 I'd agree with andymccr there, maybe expect not 5xx 16:42:50 yeah 16:42:56 makes sense to me 16:43:09 well 16:43:16 in ocata we could probably do an expect 20x or something 16:43:33 the reason its diff in pike+ is because removing nginx means it doesnt forward to like /placement - you just hit direct 16:43:37 and that causes some weirdness with responses 16:43:40 I thought you had all the 4xx or 3xx not really an issue 16:43:53 401 is because it doesnt find the endpoint directly, unless you request /placement 16:44:09 but if you are using nginx it forwards / --> /placement so it should exist and not give a 401 16:44:16 fair enough 16:44:18 but could be wrong 16:44:26 although doing not 5xx would bea. good start point 16:44:32 since if its actually broken it will 5xx 16:44:38 yeah 16:44:50 ok let's mark this as confirmed and ... ? 16:44:57 medium 16:45:00 k 16:45:02 its only a problem if things are broken already 16:45:08 and dont do a logan- and break placement service :P 16:45:11 agreed 16:45:12 :P 16:45:17 haha 16:45:49 haha 16:46:04 anyway, let's figure things out for last week's bugs not handled... 16:46:07 #link https://bugs.launchpad.net/openstack-ansible/+bug/1719517 16:46:08 Launchpad bug 1719517 in openstack-ansible "Ceilometer policy.json includes python unicode encoding" [Undecided,New] 16:46:12 lol 16:46:38 ;) 16:46:43 let's move on to 16:46:44 maybe this thing does work? 16:46:45 or at least 16:46:50 christian seems to be giving it a go at fixing it? 16:46:54 yeah 16:46:59 sure! 16:47:01 if we can convince somebody to manage it that'd be awesome 16:47:01 :p 16:47:14 otherwise my response is the same as the previous ceilometer bug 16:47:16 I will contact him directly 16:47:24 on top of the rest 16:47:26 it appears so - except it got caught in the jenkins/zuul fight 16:47:39 yeah i mean 16:47:42 that patch got shot down :P 16:47:50 well, maybe he could help maintain the roles ;) 16:48:05 ok I understand your stance :) 16:48:06 si si 16:48:11 let's move on a different bug 16:48:15 #link #link https://bugs.launchpad.net/openstack-ansible/+bug/1714597 16:48:16 Launchpad bug 1714597 in openstack-ansible "multi OS repo-build needs to be more intuitive " [Undecided,New] 16:48:18 darn 16:48:19 #link https://bugs.launchpad.net/openstack-ansible/+bug/1714597 16:48:54 no one reproduced it? 16:48:59 can we continue our way? 16:49:06 I think you may have fixed the actual bug there evrardjp - the ansible_local thing not being a string? 16:49:38 https://review.openstack.org/#/q/I63817bb3c58a2c44d3e948bdb81005dc2e734c21 16:49:55 yeah I remember that, but I am not sure it's linked to the first part of the bug 16:50:25 dict object has no attribute repo_build 16:50:45 so the bug itself is incorrect 16:50:46 maybe 16:50:59 I think we should close this one down then 16:51:07 or mark it incomplete 16:51:08 the stuff does detect and do the right things, but there have been reports of issues and unfortunately I have no test environment to work with on this 16:51:25 I can assign to me to try and replicate with your patch included 16:51:53 I intend to put some time into working on repo things on Friday this week. 16:51:55 ok I'll leave it as is 16:52:00 oh great 16:52:15 I just added your name into the assignee 16:52:18 let's move on 16:52:28 #link https://bugs.launchpad.net/openstack-ansible/+bug/1708360 16:52:29 Launchpad bug 1708360 in openstack-ansible "Openstack-Ansible Install Fails with OVS" [Undecided,New] 16:53:05 so here I think what would be great (in accordance with what odyssey4me and andymccr already said to me)... to have an ovs scenario 16:53:12 so question 16:53:22 hwoarang: what's your status there? :D 16:53:29 i see what you did there 16:53:30 yeah, that'd be nice - scenario and example config along with the others 16:53:46 andymccr: D 16:55:03 evrardjp: i will have to ask 16:55:06 ok 16:55:07 thanks 16:55:09 since i am not the one doing the work 16:55:23 Merged openstack/openstack-ansible-apt_package_pinning master: Add OpenStack-Ansible metadata https://review.openstack.org/510469 16:55:24 do you know who handles that? 16:55:29 Merged openstack/openstack-ansible-ceph_client master: Add OpenStack-Ansible metadata https://review.openstack.org/510470 16:55:38 oh we have precedent ^ 16:55:44 but that's for another topic 16:55:57 I suggest we close the bug triage conversation for today 16:56:04 +2 16:56:08 evrardjp: it's epalper 16:56:12 we can continue the OVS conversation outside the meeting 16:56:17 thanks everyone! 16:56:23 #endmeeting