16:00:21 #startmeeting fuel 16:00:23 #chair xarses 16:00:23 Meeting started Thu Aug 6 16:00:21 2015 UTC and is due to finish in 60 minutes. The chair is xarses. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:00:24 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 16:00:25 Todays Agenda: 16:00:26 The meeting name has been set to 'fuel' 16:00:27 #link https://etherpad.openstack.org/p/fuel-weekly-meeting-agenda 16:00:28 Current chairs: xarses 16:00:29 Who's here? 16:00:31 hi! 16:01:14 \~/ 16:01:19 o/ 16:01:20 hi 16:01:25 hi 16:01:43 hi 16:01:46 hi 16:01:48 hi 16:02:02 ok, lets get started 16:02:05 #topic SCF Status (angdraug) 16:02:11 7.0 soft code freeze will be in effect at 12AM tonight 16:02:11 today is the last day to merge anything that is not a fix for High or Critical bug 16:02:11 we're still in the red zone with the number of bugs 16:02:20 dixi :) 16:02:50 questions? moving on? 16:02:54 angdraug: rationale is to focus on high and critical now, right? 16:03:00 yes 16:03:03 question: 16:03:04 so to converge by 3rd of Sep 16:03:26 we will not merge anything related to Med bugs after SCF,right ? 16:03:32 yes 16:03:35 angdraug: and I think if there is minor work to be completed in some low / medium bugs in progress, those can still be merged after SCF 16:03:43 let's say for another week 16:03:52 ok 16:04:07 our review stat might affected 16:04:13 if there is medium bug or lower priority related work still pending, now is the time to speak up and ask for merge or exception 16:04:29 mihgen: how will we accomidate that? SCF exceptions? 16:04:50 reviews affected by code freeze should be marked workflow-1 until HCF when we create stable/7.0 branches 16:05:05 what if there is a review request which closes 1 high and 2 medium bugs? 16:05:22 vkramskikh: that doesn't need an exception 16:05:34 as mihgen said, the motivation is to focus on fixing high bugs 16:05:35 this is not going to work.. we will launch auto abandon script which will abandon reviews with -1 16:05:59 amaksimov: its not supposed to hit workflow -1 16:06:27 If some one sees it do so, please speak up so we can fix the script 16:06:33 sure? I think we need to check it with devops. 16:07:05 tiran ? 16:07:09 So I would say there are two major things with SCF: 1) to focus on High/ Criticals to meet deadlines; 2) to lower the risk of introducing regressions by fixes of low/medium. The less code change is, the lesser risk is 16:07:13 #link https://review.fuel-infra.org/#/c/10080/3/requests-abandon/abandon_old_reviews.sh 16:07:13 teran: ^^ 16:07:33 amaksimov: script should not touch workflow -1, if it does - then it has to be fixed 16:07:39 it only abandons reviews with -2 16:08:06 mihgen: angdraug AFAIR it works already such way 16:08:11 we could check it out 16:08:21 yes, that's my impression as well 16:08:38 amaksimov: does that answer your question? 16:08:42 yes 16:08:43 some mediums just need to be checked. what to do with them? 16:08:55 akasatkin: like what? 16:09:12 https://bugs.launchpad.net/fuel/+bug/1399004 16:09:12 Launchpad bug 1399004 in Fuel for OpenStack "Running network verification on running env may break network" [Medium,Confirmed] - Assigned to Fuel Python Team (fuel-python) 16:09:14 Launchpad bug 1399004 in fuel "Running network verification on running env may break network" [Medium,Confirmed] 16:09:15 Launchpad bug 1399004 in fuel "Running network verification on running env may break network" [Medium,Confirmed] https://launchpad.net/bugs/1399004 16:09:33 this sounds rather High 16:09:45 okay 16:09:45 +1 16:10:15 ya, thats high 16:10:19 break of deployment in general sounds critical issue, if it can be reproduced only in certain cases - then it's high 16:11:05 it doesn't break deployment 16:11:09 just verification 16:11:12 sometimes 16:11:14 to reiterate, the reason we're announcing soft code freeze earlier is to spend more time on high and critical bugs 16:11:49 amaksimov: nope according to the bug, you break working env 16:11:53 you load kernel module 16:11:56 and ovs breaks 16:12:10 moving on? 16:12:11 ok, lets move along 16:12:15 #topic puppet-librarian (angdraug) 16:12:50 puppet-librarian commits are being merged according to plan: 16:12:50 #link http://lists.openstack.org/pipermail/openstack-dev/2015-August/071347.html 16:12:50 to honor soft code freeze, I propose to merge the firewall module one day early 16:12:50 cinder module will need a rebase over https://review.openstack.org/209945 16:12:50 we should discuss an SCF exception for it next week 16:13:07 thoughts? objections? 16:13:29 I think its minor, and we need to move forward with it 16:13:30 +1 16:13:52 #link https://wiki.openstack.org/wiki/Fuel/Library_and_Upstream_Modules 16:13:55 We had 1 custom patch to firewall module for docker stuff, but that isn't relevant today 16:14:12 actually... now that I think about it, it was just a backport of a more modern version 16:14:24 carry on, please continue with firewall module 16:14:29 thanks! 16:14:57 apache is already merged, this means we'll also merge apt and firewall today, and leave cinder until next week 16:15:17 that's it from me 16:15:20 thanks 16:15:24 angdraug: cinder until HCF 16:15:36 Hi Fuelers... 16:15:57 mihgen: most likely, unless there's strong consensus in favor of SCF exception 16:16:01 #topic Open discussion (bugs, reviews that need attention) 16:16:06 If we enter SCF I'm not sure if we want to continue doing exceptions of any kind 16:16:28 if we want to release on time - no exceptions please 16:16:33 fair enough 16:16:40 xarses, now that we're trying to focus on high priority bugs, we should make an effort to not "hog" all the bugs 16:16:41 amaksimov: +1 16:17:02 If you hold a high priority bug and you're not working on it, pass it back to fuel-{python,library,etc} 16:17:08 one that needs to be fixed for 7.0 16:17:12 that way others can pick them up 16:17:14 mattymo_: are we having problems with bugs assigned and not being worked again? 16:17:21 yes 16:17:25 +1 mattymo_ 16:17:37 our statistics are bad, but the # in fuel-library queue are not so high 16:17:45 mattymo_: +1 16:18:09 can we get a report on number of assigned open bugs per person? 16:18:12 we need to do daily triage passes over all bugs left 16:18:16 https://bugs.launchpad.net/fuel/+bug/1365368 is tricky. I've been trying to fix it but I'm facing the problems with syncronization. 16:18:16 Launchpad bug 1365368 in Fuel for OpenStack 7.0.x "Cannot update ip_ranges for management and storage networks" [High,In progress] - Assigned to Ivan Kliuk (ivankliuk) 16:18:16 Launchpad bug 1365368 in fuel/7.0.x "Cannot update ip_ranges for management and storage networks" [High,In progress] 16:18:17 Launchpad bug 1365368 in fuel/7.0.x "Cannot update ip_ranges for management and storage networks" [High,In progress] https://launchpad.net/bugs/1365368 16:18:46 mihgen, triage of all bugs owned by individuals? that's a tricky challenge. better to solve it in a bug squashing session where we can get everyone together 16:18:53 angdraug in our KPI report 16:19:11 mattymo_: I mean get everyone together, and go over all bugs 16:19:16 +1 16:19:32 IvanKliuk: wha't ca we do to help? 16:20:01 It seemed to work fine for networks of type ip_range, doesn't work for type 'cidr' 16:20:15 xarses: No help needed, thanks 16:20:20 mihgen: "going over" 100 bugs is going to take all day, doing it with the whole team will take up all day of the whole team, not sure about that 16:20:48 angdraug, we manage to do it every release, usually twice :) 16:20:53 I just cannot estimate when it's going to be fixed 16:21:02 ok 16:21:09 because it's not trivial fix 16:21:29 any one else have CR or bugs to discuss? 16:21:35 I managed it before in this way: I was going over bugs, and doing ping-pong of folks working in the area 16:21:42 I'll let you know about the results by the end of the day via email 16:21:51 to whether provide update, ask if they work on a bug or not, etc. 16:22:23 I like that approach better than "get everyone together":) 16:22:35 however one or two passes by whole team are still needed 16:22:51 to see the whole picture and hear concerns from everyone 16:22:55 amaksimov: found it, thanks. we can use that to identify who's hogging bugs and make sure they don't just sit on any of them 16:22:59 around certain bugs 16:23:30 sounds like a good agenda for the next IRC meeting 16:23:41 how are we doing with upgrades? 16:23:55 I hear last bits were merged yesterday 16:24:01 yes 16:24:09 I talked with ogelbukh today 16:24:18 all ISO related stuff were merged 16:24:42 but scripts still not ready 16:24:54 but they are not part of ISO 16:25:22 ok.. 16:25:38 at least we merged all ISO-related code 16:25:58 yes 16:26:00 what about centos 6.6 for master node support? 16:26:08 not yet merged 16:26:30 staging was blocked earlier today 16:26:38 as soon as it's green we can merge it 16:27:02 why was it red? 16:27:03 the cinder commit I mentioned earlier is expected to fix staging bvt: 16:27:17 #link https://review.openstack.org/209945 16:27:31 a regression introduced by a cinder package 16:28:05 ok 16:28:26 so do we have deadline set for 6.6 ? 16:28:58 today 16:29:15 btw shouldn't we discuss the necessity to merge centos 6.6 ? 16:29:33 amaksimov: didn't we already have enough discussions about it? 16:29:33 we had several conversations inside 16:29:46 maybe we also should discuss it here? 16:30:04 it's not a fuel change, it's a mos change 16:30:22 it's Fuel relevant I'd say 16:30:31 Fuel itself has to support it 16:30:34 we will update kernel 16:30:57 do we update bootstrap as well? 16:31:13 yes 16:31:17 upgrade script will become more complicated 16:31:29 it will require reboot of master node 16:31:35 amaksimov: who will be changing upgrade script? 16:32:07 this is additional task which we didn't plan 16:32:13 that's why I am against it 16:32:14 do we have patchsets on review for it? 16:32:21 nope 16:32:36 do we have anyone here who is working on it? 16:32:36 this is not just bugfix 16:32:39 this is feature 16:32:49 this is feature; I agree 16:32:59 so I was against it 16:33:06 because it is too late 16:33:12 for such big changes 16:33:25 angdraug: do you know if vkozhukalov/dpyzhov was consulting about upgrade scripts 16:33:27 and the fact that we are removing resource from fuel 7.0 16:33:31 amaksimov: mihgen: please invite people actually working on centos6.6 if you want to have a meaningful discussion of it here 16:33:41 why can't we just tell the user to reboot after? 16:34:40 well, we can.. but again, we need to think about UX etc 16:35:08 this should be tracked as feature. not just like an ordinary bug 16:35:09 good form would have been to add it to agenda yesterday and invite the feature lead 16:35:10 ok guys we can't discuss anything here since relevant people are not here. Let's follow up on this 16:35:38 angdraug: good form would be to bring yourself to the meeting if you are hacking features in fuel 16:35:51 amnk: please comment on centos 6.6 impact on upgrade script? 16:36:15 angdraug: it has no impact to my knowledge 16:36:43 amaksimov: what makes you think there will be an impact? 16:36:45 how we can upgrade kernel w/o reboot amnk ? 16:36:49 amnk: could you please update your name in IRC settings? 16:36:49 I'm not sure if upgrade script should restart the nodes, it's dangerous action and we agreed that it should be user's conscientious decision. 16:37:04 evgenyl: +1 16:37:10 evgenyl: +1 16:37:25 amnk is Aleksandr Mogylchenko, feature lead for centos 6.6 16:37:40 amaksimov: well, it is your decision to reboot node or not. It will work in any case 16:38:15 if we are not going to amend upgrade script to reboot the node, and I agree that we shouldn't, what other impact is there? 16:38:26 amnk but you will have to notify user about it 16:38:32 amaksimov: why? 16:38:37 evgenyl, +1 to user initiated reboot 16:39:46 currently upgrade script tell user to run 'yum update; docker destroy all; docker start all' 16:40:17 angdraug: as you've been in the discussion, do we have committment to be on the hook to fix bugs / support upgrade procedure if needed? 16:40:17 adding 'reboot' here doesn't seem to add too much manual action :) 16:40:35 amnk has committed to that, yes 16:40:51 ok that's good 16:40:56 amnk: right? 16:41:03 yes 16:41:06 amnk: did you guys check scale up story after upgrade? 16:41:30 let's say you have env with openstack, then you upgrade fuel, and now you want to add couple of nodes to old env 16:41:56 I'm about centos-based env specifically 16:42:24 mihgen: this is for 7.0 - there are no centos-based envs there. 16:42:49 amnk: mihgen is talking about a 6.1 env deployed by a 6.1 fuel before upgrade to 7.0 16:44:09 as I said in the email already - I could not get upgrade feature to work on stock 6.1. And others confirmed that it is not stable. So I do not have reliable test case to rely on 16:44:41 upgrade of Fuel? 16:44:49 yes. 16:44:54 nurla: ^^ ? 16:45:19 can you please comment on stability of upgrade procedure? I've not heard about major issues for 6.0 -> 6.1 16:45:39 regardless, it doesn't mean that we just drop this from our testing matrix 16:45:45 and declare that we don't support it 16:45:46 has anyone tested 6.1-7.0 yet? 16:46:11 if we are about to deprecate a feature, it has to be done properly 16:46:18 I just said that upgrade feature can not be used as reliable test case. 16:46:31 I'm not saying we should deprecate it 16:46:40 amnk: doesn't have to be reliable, you only have to prove that you can get it to work 16:46:49 this is not a test case itself. angdraug +1 16:47:03 it 16:47:30 user use case is simple, I had 6.1 with centos; now I've heard 7.0 is cool and has kilo support 16:47:42 so I upgrade my fuel keeping old env on centos 16:47:53 and now I try to scale up old centos-based env 16:48:03 or do any other operations on that one 16:48:42 well, if that user is able to get it working there will be no problems after we merge our custom build of kernel. 16:48:46 If we can't make it as such, then we would better not to even recommend upgrade to 7.0, and instead provide manual precedure 16:49:37 amnk: that's exactly what mihgen wants you to confirm 16:49:50 angdraug: I confirm 16:50:20 amnk: the problem can be in cobbler snippets for example 16:50:53 if you update any of those as part of 6.6 work, ensure that it stays backward compatible 16:51:07 mihgen: if there is a problem - I'll fix it. 16:52:08 I'm concerned that no one answered my question about testing 6.1-7.0 upgrade 16:52:27 "I could not get upgrade feature to work on stock 6.1" - we have upgrade tests which test ubuntu deployments after 6.1 to 7.0 upgrade, and they are green. Afaik Centos-based are work in progress by qa team 16:52:45 thanks 16:53:33 nurla: can you update on status of testing centos envs after upgrade to 7.0? 16:54:21 amnk: if the auto test of upgrades to 7.0 is green, you should be able to test it manually 16:54:59 we're running out of time, lets sum up 16:55:23 1) there's no impact on Fuel, including upgrade script, but its message should be updated to recommend Fuel node reboot 16:56:08 2) upgrades from 6.1 to 7.0 are verified by swarm and are green, so amnk is not blocked with testing 6.1 centos env scale up post-upgrade 16:56:24 any other blockers? 16:57:05 just common sense - big change after FF 16:57:09 swarm doesn't check customized environments or environments with plugins 16:58:06 this is a big change, and risks have to be taken seriously 16:58:10 please stop calling it a "big change". It is not. 16:58:23 sgolovatiuk: that's generic issue, for any kind of upgrade 16:58:38 amnk: it's a big change 16:58:41 yes it is - there are many things which can be affected 16:58:58 we already have centos 6.6 in the fuel docker containers 16:59:03 and we don't have any thorough test matrix for it 16:59:04 sgolovatiuk: it was done silently inside docker containers, and nobody even noticed. 16:59:40 amnk: no one would notice change centos -> debian in containers 16:59:41 but not here 16:59:50 biggest part of it all is kernel upgrade 16:59:52 Do we test such scenarios? Do we test upgrades on production (live) environments? Do we have any --noop run? 16:59:53 when you change bootstrap as well, and other things 17:00:04 are affected, such as upgrade of fuel 17:00:14 o/ 17:00:16 o/ 17:00:18 but if a kernel upgrade to a different patch level within the same major version is a feature for us we have a bigger problem 17:00:23 time folks 17:00:28 xarses: closing 17:00:36 bye! thanks! 17:00:49 #endmeeting