17:00:32 #startmeeting ironic-qa 17:00:33 Meeting started Wed Nov 18 17:00:32 2015 UTC and is due to finish in 60 minutes. The chair is krtaylor. Information about MeetBot at http://wiki.debian.org/MeetBot. 17:00:35 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 17:00:37 The meeting name has been set to 'ironic_qa' 17:00:48 anyone here for ironic-qa? 17:00:54 o/ 17:00:54 o/ 17:00:57 o/ 17:00:59 o/ 17:01:46 hi everyone 17:02:19 hey 17:02:21 I'll chair again this week since jlvillal is busy 17:02:34 here is the agenda for the meeting: 17:02:38 #link https://wiki.openstack.org/wiki/Meetings/Ironic-QA 17:03:18 I'll start with actions, I had one to talk to lucas about the spec 17:03:24 and that was done 17:03:37 any quick announcements? 17:03:49 o/ 17:04:14 ok, well, then let's jump in 17:04:19 so I assume that the spec will land soon? 17:04:25 hi 17:04:32 #topic grenade testing 17:04:50 sinval, hope so, lets talk about that in the CI section 17:04:56 o/ 17:05:19 o/ 17:05:20 I don't have anything advancements or status here, anyone? 17:05:24 o/ 17:06:21 we'll have to wait until jlvillal gets back for a grenade update I guess 17:06:33 sorry guys, little late 17:06:52 hi liliars, no worries 17:06:58 #topic functional testing 17:07:10 same here for me, nothing to report, anyone? 17:07:55 more like related to the third party ci things, but I talked to people over here and we are good to publish our functional tests 17:08:18 well, for oneview driver, we created something like a "functional test suite", there is a bunch of code that was developed and could be reused for other CI 17:08:19 good, that needs a nudge to get started 17:08:20 I'm hoping someday next week *possibly* 17:08:41 I thought that maybe we could work on it and make something like a piece of code that would be good to reuse in other CI, even for Ironic functional testing, what do you guys think about that? 17:08:58 * jroll apologizes for being late 17:09:33 sinval, I'd like to understand it, could you present the basic ideas next week? 17:09:41 is there functional testing for oneview that doesn't apply to other things? 17:09:47 I'd rather just have those in the main tree 17:09:59 krtaylor, sure 17:10:00 yeah that's what we talked about last week 17:10:12 (for a few seconds) 17:10:18 Actually, I think that the basic functional test suite for each driver could be totally reusable, something like: "if you are developing a driver CI: here is a lib for functional testing in Ironic that your CI can reuse to test you driver, and in this lib there is a bunch o test cases already implemented that can be configured and you can reuse and take care of more specific test cases of your driver", maybe this lib can be useful in Ironic func 17:10:19 tional testing module 17:10:40 yeah, I'd rather that just go up into ironic 17:10:53 nice 17:11:01 cool 17:11:04 +1000 17:11:08 we'll plan on it 17:11:59 sinval, liliars - please add to the agenda for next week 17:12:04 will do 17:12:23 We'll formalize that on something like a RFC for you guys 17:12:36 well 17:12:44 is it just api commands to play with power control and such? 17:13:00 yep, kind of 17:13:10 plus some management stuff 17:13:17 deploy is on its way 17:13:18 but we did some tricky test scenarios, and setup things to test drivers 17:13:22 yeah, I'd just propose the code honestly 17:13:36 sounds great 17:13:44 maybe a quick etherpad if you want to lay out what it does first 17:13:45 yes, that will be good to understand 17:13:52 krtaylor: ^ you good with that? 17:14:00 jroll, ok 17:14:27 I also think it's more reasonable, most of the tests are pretty easy to understand anyway 17:14:30 +1, and an explanation if needed next week should get everyone on the same page 17:14:35 (I hope hah) 17:15:15 cool, anything else on functional testing? 17:15:18 cool :) 17:15:40 #topic 3rd party CI 17:15:48 so, the spec is coming along 17:15:52 good comments 17:16:09 awesome 17:16:23 I think the rest of the details can be documented in documentation patches 17:16:27 ++ 17:16:27 and refined there 17:16:49 main thing is to get all the drivers contacted and aware of the schedule 17:16:56 is there a link to the spec? sorry I'm new to the meeting. What spec are we talking about 17:16:56 agreed 17:17:16 docs can have more details, but I understand folks concerns over there 17:17:30 Spec: https://review.openstack.org/#/c/241294 ? 17:17:31 then get the driver teams in the infra CI meetings, understanding zuul/jenkins etc 17:17:53 yeah rajinir 17:18:05 krtaylor, +1 17:18:15 yes, thats it 17:18:30 rloo, jroll: please take a look, is it right way to add notes https://review.openstack.org/#/c/226201/ 17:19:08 missed 17:19:16 aarefiev, can we move review requests to #openstack-ironic? 17:19:30 krtaylor: sorry 17:19:38 aarefiev, np 17:20:19 so, anything else on CI? everybody gearing up? :) 17:20:58 we (IBM PowerKVM) should turn on our ironic ci testing soon 17:21:12 yep! :) 17:21:14 then we can help document what we did 17:21:24 Is everyone following asselin's docs to setup the CI? 17:21:24 o/ 17:21:32 I was hoping to get the doc structure out soon 17:21:53 That would be great, evn though our driver is not (yet) in tree I am planning on getting CI going as soon as I can 17:21:55 We (Cisco) have a plan :) 17:21:58 https://review.openstack.org/#/c/227584/9/contrib/README.md ? 17:22:09 * jroll gets excited 17:22:15 rajinir, a better way to ask might be is everyone using upstream infra for their test services? 17:22:19 krtaylor: cool, feel free to ping me when you decide to \o/ 17:22:53 liliars, will do 17:23:24 krtaylor: ok,sure (this is Dell, we just getting started) 17:23:47 also, are we ready to talk about the definition of a reliable ci? 17:24:22 rajinir, get involved in the infra CI meeting, a great place for getting statred help 17:24:24 last meeting we kinda talked about it but no conclusions 17:24:32 * krtaylor runs away... 17:24:37 hahahah 17:24:56 wait!! 17:24:57 well, I finally did get everyone to agree on a CI dashboard 17:24:59 hah 17:25:08 krtaylor: sure, thx 17:25:19 Thats something that we'd like to know too, as well how many events we should be expected so we know how to scale our CI 17:25:21 so we have one solution for that now, but there is so much more 17:25:43 sambetts, we have been using satckalytics and guessing 17:25:45 I have some thinking about that, but maybe is better to have this kind of discussion during the doc patches... 17:25:58 sambetts +1 for scaling 17:26:36 first I thought we were going to consider every ironic patch but I saw that changed in the spec 17:26:40 * jroll tries to get real numbers 17:26:49 sambetts, sinval - agreed, a recommendation of what is required and a rough average on patch counts would be good for planning 17:26:51 wait, that changed? 17:26:55 it should still be every patch 17:27:04 no, didnt change 17:27:10 patch = change 17:27:15 1 sec 17:27:40 liliars, every change = every patch 17:27:50 it was a wording change 17:28:03 We are currently looking at ordering hardware for our CI, so knowing how much parralisation we need is important :) 17:28:19 from the spec: *..to test the driver against every code change proposed for Ironic, unless that change is to code or documentation that is irrelevant to the driver.* 17:28:42 thanks liliars 17:28:55 sambetts, it was an evolving sizing for us, we are still getting it right after 2+ years 17:29:06 dont't know if I misunderstood that 17:29:30 liliars, change = patch, should I make that more clear? 17:29:32 "irrelevant" for me here is OneView CI should not test iLO implementation changes 17:29:43 exactly 17:29:49 liliars: every change to ironic could cause a breakage, so every change is relevant 17:30:11 every core change, that is what that clarified 17:30:13 well, I think we need to start with all changes and narrow down from there 17:30:29 krtaylor, agree 17:30:43 I agree, iLo could do something weird that for some reason breaks my code, and I'd want to test against that 17:31:20 have you guys decided about the time to test things? 17:31:38 yeah I get it 17:31:40 but if that driver is not enabled, generally speaking, isn't that a no-op anyway? 17:31:42 we are facing ~40 mins to run a deploy interface test case... 17:32:04 I think I misunderstood the meaning of "irrelevant" here 17:32:23 thanks for the clarifications guys krtaylor sambetts 17:32:23 in order to run all the core patches we would need a few more machines for the CI (I'm from OneView Team) 17:32:45 liliars, let's refine that wording in the requirements documentation :) 17:32:57 and with what we have thing would stack and one patch could take very long to start running 17:33:00 sinval: the gate also takes ~40 minutes to run the pxe_ipa job 17:33:05 krtaylor: yeah! will make a note on that :) 17:33:31 sinval, that is about right time-wise 17:34:00 jroll, krtaylor nice :) 17:34:10 I havn't got any data on ours yet, but I know that the boot time on our servers is not fast 17:34:32 40 minutes + setup/teardown * daily patch average = machines needed for peak load, ymmv 17:34:33 sambetts, feel you pain :( 17:34:42 Our SPARC servers are also quite slow to boot 17:36:23 let's table this for now, we can continue to share timing/sizing estimates in future meetings 17:36:47 cool :) 17:36:50 #topics General QA topics 17:37:10 all drivers case - testing conflicts 17:37:23 does someone want to lead that discussion? 17:37:31 krtaylor: I don't think the topic changed 17:37:39 oops 17:37:47 #topic General QA topics 17:37:52 much better 17:38:52 anyone? 17:39:07 Ok, so in the ironic meeting on Monday, it was brought up, should we be testing the case of enabling all the drivers to check for 17:39:12 conflicts 17:39:18 ? 17:39:41 * krtaylor is trying to come up with a use case 17:39:45 so I thought more about that and want to defer doing things with this until pavlo files the bug he promised 17:39:47 yeah that one, was trying to remember the context, thanks sam 17:39:58 and it's actually about installing dependencies, not necessarily enabling the drivers 17:40:08 packaging is a good use case for this 17:40:13 what were the drivers involved on this problem? IRMC and iLO? 17:40:38 is it a pre-req versioning test? or a namespace test? 17:41:49 it's a "make sure services run with all the things installed" 17:42:22 jroll, but would that ever happen in real life? 17:42:43 krtaylor: a distro may ship with all the driver deps installed, yes 17:42:52 I believe their bug was they installed a dependecy and it blows up the conductor, which should be covered by a third party CI one right? 17:43:06 jroll, ah, thats the use case I needed, thanks 17:43:24 yeah, third party CI should also cover this 17:43:34 I'm not inclined to make this a priority, at least until someone files a bug 17:44:19 so, the test is to install all packages and see if the install worked, seems easy enough 17:44:53 right, it would need to stay non-voting so packages not in g-r can't explode the gate 17:44:55 but yeah 17:45:11 ok, well we can defer this until the need arises 17:45:18 not all drivers reqs are installable via PyPi, so we'd have to have a bunch of special sauce too 17:45:26 thanks for the clarification, I understand the need now 17:46:10 anything else on this? 17:46:25 #topic Open Discussion 17:46:31 I have a thing here 17:46:41 http://bit.ly/1OQD1mv <- graph of ironic patches + rechecks per day 17:46:55 * krtaylor look 17:47:03 so this seems roughly like what a third party CI needs to support 17:47:38 max:30 17:47:46 ok, and other systems handle this within a set time frame 17:47:56 so you don't need a system that can run 30 at once 17:48:10 other projects say within 4 hours 17:48:15 nova for instance 17:48:15 what is the interval ? 17:48:24 a second thing: I'm in talks with some folks, it tentatively appears that OSIC can provide some hardware for ipmitool third party testing. I can't promise that I can maintain the third party CI for that, but I can probably gather some help from either other rackspace people or the community 17:48:25 and any predictions on the growth rate? 17:49:00 better graph, covers the past year: http://graphite.openstack.org/render/?width=586&height=308&_salt=1447868663.13&lineMode=connected&from=00:00_20141115&until=23:59_20151118&target=summarize%28stats_counts.zuul.pipeline.check.openstack.ironic.total_changes,%20%271d%27%29 17:49:24 krtaylor: so 4 hours * 30 / how long it takes to run your test = how many machines we'll need 17:49:26 jroll, I know there are other discussions for ipmi too 17:49:40 cool! thanks jroll, thats helpful 17:50:23 krtaylor: cool, I've got a soft yes on it - they want to see what we need from a network perspective 17:50:31 sambetts, we haven't set a timeframe for ironic testing requirements, but if we used nova's for example, it would need to report results within 4 hours after a patch is submitted 17:50:34 so we/I need to write something up there 17:50:36 jroll:Thanks 17:51:07 krtaylor: Makes sense :) thats something we'll need to define it the whole "what is a reliable CI?" convo then 17:51:18 +1 17:51:18 yeah 17:51:23 sambetts, exactly 17:51:33 and of course we can aim to improve over time, start looser and tighten as we go 17:51:40 or get some CIs up and see what's reasonable 17:51:55 what about logs? how much time to keep them? 17:52:00 1 month? 17:52:01 cinder started with a much larger number at first, it makes the sizing estimates easier 17:52:27 sinval, the infra requirement was 6 months, now down to a month 17:52:35 yeah also makes sense, and give us some time to adjust 17:52:47 krtaylor: larger number of expected patch sets? or time to post comment? 17:52:52 krtaylor, cool 17:53:06 sambetts, time to post CI result comment 17:53:29 krtaylor: Ok that makes sense :) 17:53:35 so start with 8 hours for N, then 4 hours for O, etc 17:54:05 as teams get running CI systems, they will be better able to see what they need to hit a 4 hour window 17:54:13 ++ 17:54:21 ++ 17:54:51 our (powerkvm) goal has always been to beat jenkins :) 17:55:04 and we do often 17:55:05 heh 17:55:17 gamify it :D 17:55:28 * jroll 5 minute warning 17:55:30 YES 17:55:45 ok, are we winding down here? 17:56:02 last call for topics 17:56:26 ok, thanks everyone, another good meeting! 17:56:32 Thanks krtaylor o/ 17:56:34 \o/ thanks for leading, krtaylor 17:56:45 #endmeeting