16:01:02 #startmeeting Fuel 16:01:03 Meeting started Thu Mar 12 16:01:02 2015 UTC and is due to finish in 60 minutes. The chair is kozhukalov. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:01:05 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 16:01:08 The meeting name has been set to 'fuel' 16:01:16 #chair kozhukalov 16:01:17 Current chairs: kozhukalov 16:01:24 hey guys 16:01:28 o/ 16:01:34 is anyone here? 16:01:34 \o 16:01:39 hi 16:01:42 hello 16:01:59 just 4 people 16:02:03 hi 16:02:13 ok let's start 16:02:39 if anyone has any announcements, please share them right now 16:02:56 before we go through the agenda 16:03:21 hi 16:03:22 agenda as usual 16:03:29 merge status? 16:03:32 #link https://etherpad.openstack.org/p/fuel-weekly-meeting-agenda 16:04:05 xarses: feature by feature including merge status 16:04:15 hi 16:04:36 ok, looks like no one has any announcements 16:04:46 let's start discussing topics 16:04:55 #topic Separating MOS from upstream ubuntu (vparakhin) 16:05:09 hello 16:05:10 spec: https://review.openstack.org/#/c/148279/35/specs/6.1/separate-mos-from-linux.rst 16:05:11 brain461: could you please share the status 16:05:17 Code is ready for review: https://review.openstack.org/#/c/161222/ 16:05:21 We're testing both features (separation + external ubuntu) on a set of cloned jobs in product CI (iso + bvt), there's an iso with latest patches. 16:05:26 Currently, ddmitriev is working on implementing changes to system tests to consume external repos. 16:05:31 Bad news: we're blocked by issues in vrouter feature. 16:05:35 We're expecting to have vrouter fixes on review this evening. 16:06:15 vrouter fixes are in master? 16:06:22 mihgen: nope 16:06:26 they are on review 16:06:39 how comes you are blocked then? 16:06:52 mihgen: default gateway 16:06:57 ikalnitsky: please give a link for vrouter 16:07:14 currently, we have vrouter gateway on controllers, and that's an issue 16:07:32 #link https://review.openstack.org/#/c/163408/ 16:07:33 here's the fix 16:07:42 currently we don't have such a thing as vrouter, there is public gateway propagated from UI, is not it? 16:07:59 nope. 16:08:01 here we go, so it's in master 16:08:42 in master, we already have vrouter feature and it works with issues, since it sets default gateway to vip__management, not to br-ex 16:08:56 on controllers, which is wrong 16:09:11 controllers MUST have br-ex as default gateway 16:09:14 yeah I see. We might want to revert it if it blocks from split repo / ubuntu thing 16:09:47 we definetely should apply the fix above (which is revert, yeah) 16:10:01 it is better to fix this than to revert it at all 16:10:10 revert is a fix 16:10:14 it should be so 16:10:23 it was wrong that the patch was applied 16:10:41 ikalnitsky: why must controllers have a IP on br-ex:? 16:10:51 this is bad from security side 16:10:58 and I intend to have it removed 16:11:05 xarses: public gateway must be not from management 16:11:12 as I undestand it became now 16:11:18 mihgen: true 16:12:01 ok folks you better voice sync with xenolog / vrouter lead and make a change.. I really don't want your feature to be blocked 16:12:21 mihgen: we already do 16:12:21 we have to make local mirror script, etc. 16:12:51 mihgen: yes, I started to work on it 16:13:47 ok, and what we need to make it fixed? reviewers? another voice call? anything else? 16:14:47 ok, let's move on 16:15:01 #topic Consume external ubuntu mirrors (ikalnitsky) 16:15:15 today i managed to deploy cluster with external ubuntu and separate repos. 16:15:20 obviously, it was required to apply vrouter fixes manually. 16:15:25 ostf tests were passed. 16:15:32 i've built an iso with all patches (including vrouter ones) and i want to ask qa to test it. 16:15:59 so i think we should start merging some fixes one-by-one, in order to do not break master 16:16:05 can you post a link to the iso? 16:16:08 though, we don't test inird downloading 16:16:20 angdraug: yep, sure 16:16:35 #link http://jenkins-product.srt.mirantis.net:8080/view/Repo%20Separation/job/6.1.all.clone/30/ 16:16:41 can you share patch which downloads initrd pls? 16:17:01 mihgen: there couple of them on review 16:17:22 #link https://review.openstack.org/#/c/163599 script itself 16:17:25 yeah, it really hard to find them right now. 16:17:34 here's the link to working etherpad https://etherpad.openstack.org/p/consume-external-ubuntu 16:17:47 #link https://review.openstack.org/#/c/161640/ package for this script 16:18:08 my iso includes all patches (even inird), so i don't know how it works. just going to figure out it. 16:18:15 #link https://review.openstack.org/#/c/161208/ download task 16:18:36 thx 16:18:50 ikalnitsky: how close we are to merging pre-provision tasks ? 16:19:22 actually, i was thinking about merging it today.. but it didn't happen :( 16:19:30 i've built and iso with these three patches 16:19:39 but centos bvt has been failed 16:19:43 we have some issues 16:19:44 in master 16:19:48 not in our patches 16:20:03 but i want to pass them both bvts 16:20:17 but master is green, isn't it? 16:20:23 nope 16:20:48 oh, it's now green. 16:20:52 hi 16:20:54 few hours ago, it wasn't 16:21:55 thx ikalnitsky. Any blockers here? 16:22:05 what is left to be done? 16:22:17 so, we can build another iso with pre-provision tasks and check them again, right? 16:22:43 can't we just go shortcut here, take green ISO and inject your changes? 16:22:54 mihgen: i don't see any blockers. i think we should test the latest iso and start merging things. 16:22:55 verifiy if it works, and then merge? 16:23:09 mihgen: we have merging plan which is on this etherpad ^^^ 16:23:19 kozhukalov: that's right. i'll run it again. 16:23:30 mihgen: sorry, what do you mean? 16:23:46 first of all, we should merge pre-provision tasks 16:23:52 they shouldn't break master 16:24:04 they will just build provisioning images on master node 16:24:27 patches were reviewed and tested manually 16:24:37 we need just get bvt passed and merge them 16:24:54 ok, i think the details of the merging order is up to us, just need to keep master green 16:25:06 ok thanks guys 16:25:09 to get bvt passed we need patches to fuel-qa to be merged as well, don't we? 16:25:21 sure 16:25:40 moving on? 16:25:41 nope 16:25:44 not for this patches 16:25:46 bookwar: ^ 16:26:06 any of the bandit peeps around? 16:26:06 ikalnitsky: ok 16:26:08 bookwar: hi, do we have mirrors of ubuntu prepared in our infra in every location? 16:26:27 mihgen: yes, upstream mirrors are ready 16:26:37 cool 16:27:14 #topic IBP (agordeev) 16:27:41 hi 16:27:50 agordeev: hi 16:27:53 merge status of blueprints: 16:27:55 1 was landed https://blueprints.launchpad.net/fuel/+spec/ibp-build-ubuntu-images and the remaining 2 was not. 16:27:57 https://blueprints.launchpad.net/fuel/+spec/ibp-reconnect : 1 patch is waiting to be reviewed and merged https://review.openstack.org/157090 16:27:59 https://blueprints.launchpad.net/fuel/+spec/ibp-image-checksums : 2 patches are waiting to be reviewed and merged https://review.openstack.org/#/c/161318/ https://review.openstack.org/#/c/161759/ 16:28:01 I'll ask for getting FFE for 1d for them right after the meeting ends 16:28:03 bugs: 16:28:05 1 critical was filed https://bugs.launchpad.net/fuel/+bug/1428759 looks like it was just h/w issue 16:28:07 Launchpad bug 1428759 in Fuel for OpenStack "[IBP] degraded RAID prevents operating system from boot" [Critical,Confirmed] - Assigned to Fuel provisioning team (fuel-provisioning) 16:28:07 backports to 6.0: 16:28:09 I'm insisting on backporting of ibp-reconnect. Even IBP for 6.0 is in experimental status. Additionaly it closes on of customer-found bugs 16:28:11 https://review.openstack.org/#/c/161721/ 16:28:13 https://review.openstack.org/#/c/161722/ 16:28:15 other: 16:28:17 it's time to start re-implementing ibp-build-ubuntu-images as a pure python tool. 16:29:56 agordeev: thanks and yet another issue about ibp 16:30:02 thanks agordeev. anything prevents us from merging reconnects to master now? 16:30:44 agordeev: provision progress bar does not work and taking into account that we are going to spend ~10mins building images just before starting provisioning 16:30:45 mihgen: i think that nothing is blocking it. Just needs to be reviewed and merged. 16:31:02 about backport to stable/6.0: I think we need to take a closer look on it. I see dependency introduced (urllib3), so I'm trying to understand how risky it can be 16:31:09 we definitely need progress bar to be working 16:31:26 we generally don't backport such things of course, I'm just trying to see if we can make exception here 16:31:35 and if we really need to 16:32:43 guys, please take part in reviewing IBP patches if you have some time 16:32:55 patches are pretty small 16:33:18 mihgen: that dependecy is already installed into bootstrap image. As far as netchecker tool requires it. 16:33:21 we'd really appreciate this if you had a chance to take a look at them 16:34:19 ok, if there are no other q, moving on then 16:34:35 #topic Plugins (evgeniyl) 16:34:41 Hi guys. 16:34:48 evgeniyl___: hi 16:34:56 All plugins related features are merged, currently we are working on the docs. 16:35:06 Today we’ve done plugins migration guide. 16:35:12 #link https://wiki.openstack.org/wiki/Fuel/Plugins#How_to_migrate_plugins_from_1.0.0_to_2.0.0_package_version 16:35:43 Also I’ve just had a discussion with guys who develop melanox plugin, they really need plugable bootstrap image, we’ve figured out a solution, I’m not sure if it will be able to implemented it in the current release. 16:36:17 Q: about an issue with 2+ plugins, what is the order nailgun chooses to install those? 16:36:19 Another story is ability to define plugins execution order, I need approve to start working on it, if we are going to do it as a feature freeze exception. 16:36:41 evgeniyl___: yep, this is my question 16:37:16 mihgen: are you asking about execution order? Afaik currently it depends on the name. 16:37:30 evgeniyl___: name of what? 16:37:35 mihgen: of plugin 16:37:54 mihgen: we don't have proper mechanism to define plugins execution order. 16:37:58 so there is a workaround, right ;) 16:38:28 mihgen: Oh, I don't think that we should call it a workaround. 16:38:42 where is the issue comes from? 16:38:53 contrail + zabbix = wrong order? 16:39:12 according to you, zabbix would be installed after contrail, so it should be just fine 16:39:14 mihgen: it's not general solution 16:39:28 mihgen: if we want to solve specific case, yes we can use workaround 16:39:51 I get that. So what's your estimate and approach to possibly resolve it in 6.1. ? 16:39:52 mihgen: yeah, we should check it, but it should work this way 16:40:03 solving it would require metadata format change (exec priority field) 16:40:19 wouldn't it be better to make such change early rather than later worry about backwards compatibility? 16:40:24 mihgen: one week including spec, fpb fixes, nailgun fixes, and merging process with all of the reviews. 16:41:30 this is pretty serious gap, actually, frankly speaking… 16:42:06 if we don't want people to name their roles like "A_Zabbix", "B_Contrail", etc., we need to fix it.. 16:42:06 evgeniyl___: if you have managed to find a solution on pluggable bootstrap, can you share you opinion about this? how is it going to look like? 16:42:10 It's my current velocity, if we take into account that the most of the day I spend on side tasks, reviewes, docs, meetings 16:43:01 are you the only one who can do this? 16:43:10 bus factor alert... 16:43:38 kozhukalov: plugins can share some hooks, and when node is booted, it can iterate through plugins hooks and execute them. In one of the hook guys from mellanox can add kernel module loading. 16:43:56 angdraug: I'm not the only person who can do that 16:44:14 agordeev: for example Vladimir Sharshov in the current release implemented reboot tasks from end to end. 16:44:36 evgeniyl___: mihgen https://bugs.launchpad.net/fuel/+bug/1428427 16:44:38 Launchpad bug 1428427 in Fuel for OpenStack "Priority of plugins execution can't be defined" [Medium,Invalid] - Assigned to Fuel Python Team (fuel-python) 16:44:56 thanks evgeniyl___. if you are available after this meeting, I'd like to do a short call, together with dpyzhov_ 16:45:08 mihgen: np 16:45:17 ok 16:45:23 it was marked invalid 16:45:33 xarses: yes, because it's not a bug. 16:45:38 sort of 16:45:39 xarses: it's a feature request 16:45:42 it's a defect 16:45:51 xarses: it's known constraint 16:45:53 a flaw 16:46:26 evgeniyl___: thanks a lot 16:46:34 kozhukalov: same meaning effectively 16:46:35 moving on to open discussion 16:46:45 xarses: sure 16:47:25 xarses: i just started to think we are going to list synonyms to the word bug ) 16:47:42 #topic Open discussion 16:47:50 I have something 16:47:53 Small update for 200 nodes bp 16:47:53 there are two patches not merged yet. 16:47:53 One is about allowing to fail nodes during provision. I'm refactoring it after evgeniyl___ comments. 16:47:53 Second one is network verification. There is some dissagreement about backward compatibility https://review.openstack.org/#/c/138760/6 :( I think that we should just add warrning that it's not backward compatible and desribe in docs what and how to upgrade to fix it. 16:48:49 salmon_: I agree to merge it, but lets create a bug, and start the discussion in ML to solve the problem, that we can break it in the future releases 16:49:13 salmon_: just curious to know, have your guys tried 200 nodes against IBP ? 16:49:16 salmon_: also "Roman said that backward compatibility is not a issue for him" doesn't look like a valid argument 16:49:22 why cant we fix it in a backward way? 16:49:25 salmon_: user care about it 16:49:26 agordeev: no, only on 100 16:49:57 salmon_: I'm curios if we can make it backward compatible by 'small blood' ? 16:49:59 evgeniyl___: well, the qustion is if user will ever notice it 16:50:05 salmon_: and I didn't say that I don't want to merge it just because, I provided a technical solution how to solve the problem 16:50:33 evgeniyl___: yeah, this is why I gave a link to patch 16:50:35 salmon_: I think user can easily notice that network checker is broken for old releases 16:50:47 mihgen: if user updates some packeges it will work 16:51:03 mihgen: but he should manually get required packages 16:51:10 yup :/ 16:51:12 mihgen: or we should backport it to all previous releases 16:51:48 folks, technically we can make everything backward compatible. But if it takes from us SO many resources to get done, I'm inclined to think that we should just document it / come up with any other workaround 16:51:49 mihgen: which is not the case I think, because it brakes the previous interface, and it doesn't look like a good candidate for backport 16:52:09 so here, if it's easy to make backward compatible, why not to do so? 16:52:37 mihgen: guys, we implement backward compatibility on almost every layer now, I don't see problems to add the same stuff into MCagents. 16:52:44 it takes some time and make everything look uglier. Updating package is simpler for me :) 16:53:28 I changed mcollective-agent so user has to updata package on each node 16:53:54 frankly UX doesn't look like as great one. Let's say you had Fuel master with N nodes in bootstrap 16:54:08 then upgraded fuel master, and added more bootstraps 16:54:26 mihgen: if you have bootsrap nodes, just reboot them 16:54:36 and now your net-verify fails for some of the nodes, and you are lost, as you get unclear error messages 16:55:48 evgeniyl___: can we get agent version? If yes it's easy to make it bacward compatible 16:55:50 i think salmon_ is right, if you have bootstrap nodes, just reboot them. why can't we reboot those bootstrap nodes just after upgrade? 16:56:09 good idea 16:56:28 ot just add a notice that is good idea to reboot 16:56:32 *or 16:56:47 problem is that no one reads all those long release notes. . 16:57:00 salmon_: I don't know, we should check it and look at the api. But anyway as I said, there is another workaround, just implement "version" method and if it's not there, it means that it's old version. 16:57:09 so you plan on making a feature to reboot un-used discovered nodes when we update the master, or the base image? 16:57:14 mihgen: it can be just above error message after faile verification 16:57:17 3 minutes 16:57:43 salmon_: really if it's easy to make, we need backward compatibility layers anyway, please consider it 16:58:05 if you see it takes too long .. then just release notes .. 16:58:06 ok, I will start discussion on ML 16:58:10 thx 16:58:30 salmon_: thanks 16:58:36 ok, guys, thanx everyone for attending 16:58:42 time to end 16:58:48 thx 16:59:02 #endmeeting