#openstack-meeting log

17:00:19 <sdague> #startmeeting qa
17:00:20 <openstack> Meeting started Thu Oct 10 17:00:19 2013 UTC and is due to finish in 60 minutes.  The chair is sdague. Information about MeetBot at http://wiki.debian.org/MeetBot.
17:00:21 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
17:00:23 <openstack> The meeting name has been set to 'qa'
17:00:27 <sdague> ok, how's around for the QA meeting?
17:00:38 <ravikumar_hp> hi
17:00:44 <mtreinish> o/
17:00:44 <Anju> hi
17:00:48 <sdague> #link https://wiki.openstack.org/wiki/Meetings/QATeamMeeting#Proposed_Agenda_for_October_10_2013
17:01:24 <sdague> ok, first off, I still need to do the old blueprint purge, a few other things got in the way of that with things like jury duty this week
17:01:38 <sdague> #topic Blueprints (sdague)
17:01:53 <sdague> so I'll work on doing that before next meeting
17:02:04 <sdague> anyone else have blueprint updates to share?
17:02:23 <mkoderer> hi
17:03:06 <dkranz> Here
17:03:55 <giulivo> hi
17:03:58 <sdague> ok, moving on from bluprints
17:04:11 <sdague> dkranz: why don't we jump to your topic
17:04:17 <sdague> #topic Lot's of new negative tests. Do we want that? (dkranz)
17:04:23 <dkranz> sdague: ok
17:04:30 <mkoderer> thats a good one
17:04:46 <dkranz> I am concerned that we have seen a lot of new negative tests.
17:05:01 <dkranz> Last time this happened it was because they are easy low-hanging fruit for new people
17:05:24 <dkranz> This is fine but we should have a clearer line between what it in tempest and what is in unit tests
17:05:39 <giulivo> I added a few "sub-topics" trying to discern which are good and how they should be tagged
17:06:04 <mkoderer> yes for me there are some that are typical unit tests
17:06:14 <dkranz> In an ideal world we would have abstractions where we describe the call to be made and it could be done either as unit or tempest
17:06:32 <dkranz> And there was the idea of "fuzz" testing as well which never went anywhere
17:06:48 <mtreinish> dkranz: I'm fine with negative tests in tempest as long as they're testing api responses.
17:06:51 <sdague> I remember marun actually had this idea at the last summit of tests in the neutron tree that tempest could also run in a real env
17:07:03 <marun> *sigh* yes
17:07:07 <sdague> but I don't think that got any progress
17:07:09 <mtreinish> it goes back to the reason we have api tests in general
17:07:09 <dkranz> sdague: Yes, I have discussed it with others as well
17:07:12 <marun> I did a proof of concept in may but haven't gotten farther
17:07:24 <marun> I should probably post the proof of concept so others can take over
17:07:51 <marun> i'll make sure to have something to show by the summit
17:08:01 <dkranz> I think we would want some kind of yaml or something that describes the actual test data
17:08:30 <mkoderer> marun: could you paste a link to that poc?
17:08:48 <dkranz> The review team could spend all day reviewing negative tests and it might be better to work on making this easier to create and review.
17:08:49 <marun> mkoderer: I never polished it for public consumption, I'm afraid.
17:08:58 <mkoderer> marun: ok I see
17:09:06 <sdague> marun you going to be in HK?
17:09:08 <marun> mkoderer: I'll have to dig it up and get it working again.
17:09:10 <marun> sdague: yes
17:09:22 <sdague> I think this would be a great summit session especially if you had that poc ready
17:09:37 <dkranz> sdague: I think I already added it to proposed on the etherpad
17:09:42 <sdague> dkranz: great
17:09:52 <sdague> honestly, right now, I'm torn on negative testing
17:10:04 <sdague> on the one hand, it's adding coverage
17:10:10 <dkranz> sdague: Well, I think we should have it, but it needs to be easier to write and review
17:10:17 <sdague> but it definitely could be done close to the projects
17:10:39 <dkranz> sdague: The projects could consider tempest to be "closer" than they currently do
17:10:49 <dkranz> sdague: Some are actually making movements to be closer
17:10:50 <sdague> dkranz: so in the short term, I'd propose that we continue to let folks propose it
17:11:03 <sdague> but that we ensure all the negative tests are in seperate files
17:11:12 <sdague> so it would be easier to purge out later
17:11:18 <dkranz> sdague: That seems reasonable
17:11:29 <mkoderer> and they should use the "negative" attr type
17:11:30 <sdague> then we can make final decisions at summit
17:11:33 <sdague> mkoderer: yep
17:11:36 <mtreinish> sdague: sounds like another hacking file update
17:11:40 <dkranz> sdague: But we should also suggest that no one concentrate on that
17:11:40 <sdague> people have been pretty good about it
17:11:51 <sdague> dkranz: sure
17:11:58 <sdague> ok, volunteers for the hacking update?
17:12:02 <dkranz> Once they are up to speed on tempest they should focus on higher-value tests
17:12:18 <afazekas> several negative test are not just simple malformed request, or a  reference to a non existing resource
17:12:31 <giulivo> yeah I still have concerns about that, for instance
17:12:39 <mtreinish> sdague: I guess I'll do it since I brought it up
17:12:41 <giulivo> - some are "validating strings", like passing long strings or checking for invalid vs. non existent uuids
17:12:47 <giulivo> shall we accept both tests?
17:12:51 <sdague> #action mtreinish to update hacking
17:12:58 <giulivo> - some are "traversing" the components, like ensuring we can't attach resources to non existent servers
17:13:03 <dkranz> giulivo: Those cases are malformed requests
17:13:04 <giulivo> shall we accept such a kind of negative?
17:13:05 <sdague> #action mtreinish to update hacking to describe negative tests in dedicated files
17:13:25 <mkoderer> giulivo: yes that a point.. I saw all of them in one review
17:13:31 <giulivo> sorry dkranz why malformed?
17:13:43 <dkranz> giulivo: string too long
17:13:47 <sdague> giulivo: negative is typically more fuzz testing of the API, throwing it garbage
17:14:19 <sdague> afazekas: do you have a good more complicated example?
17:14:22 <giulivo> so if I get it right, non of those is acceptable
17:14:23 <dkranz> I'm fine with those if we don't have to write them long-hand
17:14:30 <dkranz> and review them
17:14:46 <afazekas> Do we want to remove the 'negative' flag from test cases which not just simple fuzz ? Like the auth related ones ?
17:14:48 <sdague> dkranz: right, so this is about some kind of automation on fuzzing
17:14:57 <sdague> afazekas: code example of that?
17:15:00 <mkoderer> https://review.openstack.org/#/c/50217/5
17:15:04 <mkoderer> maybe this one?
17:15:23 <dkranz> sdague: Yes but it is two parts:  declarative test spec, and parameterized randomization
17:15:39 <dkranz> sdague: Not all cases have to be randomly generated
17:15:40 <afazekas> https://review.openstack.org/#/c/50122/ rejected
17:16:03 <dkranz> afazekas: That was rejected for bad spelling, not content
17:16:11 <afazekas> owner_accept
17:16:29 <afazekas> test_image_share_v2_owner_accept
17:17:00 <afazekas> dkranz: the 'rejected' test is not 'negative' sorry
17:17:02 <sdague> dkranz: ok, so that sounds like actually another good topic, and slightly different from marun's one
17:17:28 <dkranz> sdague: Yes. A simple negative test could require setup that is non-trivial
17:17:34 <sdague> yep
17:18:03 <dkranz> So for now perhaps we can just say that we will accept negative tests but not make them a priority
17:18:14 <dkranz> ANd come up with a better plan at the summit
17:18:30 <mkoderer> good plan
17:18:38 <ravikumar_hp> dkranz: yes. and already we have good coverage
17:18:46 <mkoderer> we should create a blueprint after the summit
17:19:08 <mtreinish> mkoderer: did you get your travel approval for hk?
17:19:13 <dkranz> It would be good to have a ml discussion about this prior to the summit to make it more effective
17:19:16 <dkranz> This is a big topic
17:19:20 <afazekas> https://github.com/openstack/tempest/blob/master/tempest/api/compute/test_authorization.py looks like these does not have the negative flag
17:19:31 <sdague> dkranz: yes, agreed
17:19:38 <sdague> dkranz: you want to kick that thread off?
17:19:49 <dkranz> sdague: OK. I will discuss it with marun
17:19:53 <mkoderer> mtreinish: most probably yes
17:19:54 <sdague> great
17:20:25 <sdague> #action dkranz to kick off the thread on openstack-dev list about approaches on negative and shared testing with projects
17:21:06 <dkranz> sdague: Were you thinking tagged with [qa] or not?
17:21:36 <dkranz> sdague: I was going to tag it
17:21:37 <afazekas> https://github.com/openstack/tempest/blob/master/tempest/api/compute/admin/test_flavors_access.py
17:21:48 <sdague> yeh, it should be tagged
17:22:16 <dkranz> sdague: I think that's it for that topic
17:23:19 <sdague> ok, great
17:23:37 <sdague> mtreinish: how about you next
17:23:41 <sdague> #topic Elastic Recheck Top issues (mtreinish)
17:23:53 <mtreinish> oh, this is a weekly thing now
17:24:01 <mtreinish> #link http://status.openstack.org/elastic-recheck/
17:24:10 <mtreinish> so we're actually doing really good since er went live
17:24:20 <mtreinish> most of the big bugs have been fixed
17:24:27 <mtreinish> most of the remaining issues are on the neutron side
17:24:48 <mtreinish> there are some new ones popping up that we need to classify still
17:25:13 <sdague> yeh, ER is already a pretty awesome tool
17:25:15 <mtreinish> so if you find a nondeterministic failure and can identify it with a logstash query please submit a review to add it to the query list
17:26:00 <sdague> #link https://github.com/openstack-infra/elastic-recheck
17:26:07 <sdague> for people that haven't seen the code yet
17:26:37 <dkranz> Very cool
17:26:48 <mkoderer> great work
17:26:54 <sdague> ok, lets talk about a related issue, dkranz on whitelisting errors
17:27:06 <sdague> #topic Rollout plan for failing builds that pass tempest but with spurious log ERRORs (dkranz)
17:27:16 <dkranz> sdague: So the patch is up https://review.openstack.org/#/c/50795/
17:27:46 <dkranz> Many of the non-neutron runs are showing no non-whitelisted errors
17:27:59 <dkranz> I think the next steps are to
17:28:17 <dkranz> 1. Use elastic recheck to track non-deterministic bogus error messages
17:28:26 <dkranz> 2. Communicate all this to the dev list
17:28:35 <dkranz> 3. Start failing builds
17:28:56 <dkranz> 4. Build some kind of momentum to actually get people fixing the bugs
17:28:57 <sdague> that sounds pretty reasonable
17:29:19 <dkranz> I have looked at a lot of errors in the past day and many of them look like real bugs to me
17:29:24 <dkranz> Not just bogus errors
17:29:28 <mtreinish> dkranz: for step 1 you'll need to open bugs for all the non-deterministic bogus error messages you hit
17:29:34 <dkranz> But on one pays attention to the logs when a build succeeds
17:30:01 <dkranz> mtreinish: I was thinking of doing it in a slightly different way
17:30:07 <sdague> dkranz: so something like http://status.openstack.org/elastic-recheck/ for showing the occurance of these might be helpful
17:30:27 <dkranz> Based on what appears in the tempest log
17:30:28 <sdague> just those graphs seemed to make a difference to people in understanding how bad the problems were, and being able to see them go away
17:30:34 <dkranz> sdague: Agreed
17:31:10 <dkranz> So please comment on that patch when you get a chance
17:32:07 <dkranz> sdague: Ideally each group would have some one paying attention to their log errors
17:32:46 <dkranz> The ignoring of these errors is a huge weakness in the tempest methodology
17:33:02 <sdague> yeh, but sun light will help
17:33:05 <dkranz> A test can pass but the system is screwed up afterword
17:33:10 <dkranz> sdague: +1
17:33:24 <sdague> ok, I actually got to run away, dkranz can you take over the rest of the meeting?
17:33:33 <dkranz> sdague: Sure
17:33:37 <dkranz> sdague: Just a sec
17:34:07 <dkranz> We done with this?
17:34:20 <sdague> I think so
17:34:23 <dkranz> #topic Stable Branch Timing
17:34:56 <dkranz> What happened to the topic bot?
17:35:00 <mtreinish> dkranz: I think you need to be sdague to do that
17:35:07 <mtreinish> since he started the meeting
17:35:14 <dkranz> Oh well.
17:35:19 <dkranz> ANyway, next topic
17:35:33 <dkranz> I had proposed branching as late as possible
17:35:44 <dkranz> Any other theories?
17:35:56 <mtreinish> dkranz: I'm fine with that
17:36:16 <dkranz> mtreinish: So when would that be? Release day?
17:36:20 <mtreinish> I don't think we should do it the same time as the other projects
17:36:27 <mtreinish> so next week
17:36:36 <dkranz> mtreinish: RIght, haven't some already done it?
17:36:53 <mtreinish> they've rc'd but I don't think anything is done yet
17:37:19 <dkranz> mtreinish: OK, so the release day is the 17th
17:37:24 <mkoderer> I think branching it as late as possible is the easiest way
17:37:29 <dkranz> We could just do it then
17:37:52 <mtreinish> do you want to do a milestone proposed branch or just go straight to a havana branch?
17:38:12 <giulivo> mtreinish, I was going to propose that, a milestone-proposed branch
17:38:30 <giulivo> but I'd also branch the same day as other projects then, why wait one week more?
17:38:31 <dkranz> mtreinish: Well, if we are waiting until release day I'm not sure why we need milestone-proposed
17:38:48 <mtreinish> yeah that's how I feel too
17:38:59 <giulivo> dkehn, my idea is that people wanting to propose stuff for the stable branch will have to use milestone-proposed
17:39:05 <dkranz> giulivo: Because we are still writing havana tests they will have to be backported once we branch
17:39:05 <giulivo> dkranz, ^^
17:39:06 <mtreinish> ttx: do you have any thoughts on this topic?
17:40:15 <giulivo> dkranz, yeah to solve that very same issue, I'd use a milestone-proposed branch but branch the 17th
17:40:35 <giulivo> (branch our stable, on the 17th)
17:40:55 <mtreinish> giulivo: there's no difference at that point from just doing a stable branch
17:41:01 <dkranz> giulivo: I don't see how that changes anything
17:41:14 <mtreinish> we did a milestone proposed last cycle because we broke something for the rcs when we added a test (I think)
17:41:15 <dkranz> giulivo: The back port rule is our policy
17:41:15 <giulivo> so I think it's important branching the same day as other projects
17:41:28 <giulivo> and the backport rule remains in place for the future additions
17:41:30 <dkranz> giulivo: We can't because they don't all do it on the same day
17:42:25 <giulivo> but tests submitted closer to the branch date, if relevant for the release, can be posted to the milestone and avoid being skipped
17:42:40 <dkranz> I think there is little risk. The 17th is one week from now
17:42:47 <mtreinish> dkranz: lets just say we'll cut the stable/havana right before next week's meeting
17:42:54 <dkranz> mtreinish: Works for me
17:43:43 <dkranz> mtreinish: I can't remember who actually does that. Infra?
17:43:43 <mtreinish> #info stable/havana branch to be cut right before next meeting
17:43:51 <mtreinish> dkranz: sdague will know
17:43:59 <dkranz> OK, any more comments on that?
17:44:54 <dkranz> #topic Design Summit Initial Planning
17:45:17 <dkranz> https://etherpad.openstack.org/icehouse-qa-session-planning
17:45:46 <dkranz> So we have 7 proposals and 13 slots
17:46:04 <dkranz> We don't have to use all of them, but please add to the etherpad if you have ideas.
17:46:34 <ravikumar_hp> dranz: is this the place to add under proposed ?
17:46:55 <dkranz> ravikumar_hp: Yes, the etherpad I pasted above
17:47:06 <ravikumar_hp> http://summit.openstack.org/   ?
17:47:22 <mkoderer> no use the etherpad and put it under "Proposed"
17:47:29 <ravikumar_hp> ok
17:47:39 <dkranz> ravikumar_hp: I think the plan was to do it in the etherpad because it is better for communication
17:48:16 <dkranz> ravikumar_hp: Of course any one who is not participating in these meetings can propose a session on summit.openstack.org
17:48:46 <dkranz> ravikumar_hp: That site is a clumsy tool for collaboration
17:49:09 <dkranz> Any other comments on this topic?
17:49:20 <mkoderer> btw it's my first summit, so if someone want to give me some word how a talk should look like
17:49:24 <ravikumar_hp> dkranz: thanks. i got it
17:49:27 <mkoderer> would be great
17:49:30 <dkranz> mkoderer: No talks!
17:49:48 <mkoderer> I mean I need to prepare something right?
17:49:51 <dkranz> mkoderer: The summit sessions are discussions. YOu can have a few intro slides, but that's it
17:50:08 <mkoderer> dkranz: ok just a few slides good
17:50:13 <dkranz> mkoderer: And try to have enough discussion beforehand so it can be effective
17:50:25 <mkoderer> dkranz: ok cool
17:50:26 <mtreinish> mkoderer: just having an etherpad with an outline of discussion points is normally all that is done up front
17:50:57 <mkoderer> ok fine
17:51:32 <dkranz> OK, any critical reviews before open discussion?
17:51:44 <dkranz> Other than my error log check :)
17:51:56 <mlavalle> dkranz: aren't we going to talk about Neutron? I see it in the agenda
17:52:18 <dkranz> mlavalle: Yes! That had sean's name but please tell
17:52:49 <mlavalle> dkranz: I've been working on this bug https://bugs.launchpad.net/swift/+bug/1224001
17:52:52 <uvirtbot> Launchpad bug 1224001 in neutron "test_network_basic_ops fails waiting for network to become available" [Critical,Fix released]
17:53:02 <dkranz> mlavalle: I saw a number of things in the neutron log files after succesful runs that looked like real bugs
17:53:21 <mlavalle> dkranz: I've been doing this with marun and salv-orlando
17:53:25 <mtreinish> dkranz: for reviews both: https://review.openstack.org/#/c/49559/ and https://review.openstack.org/#/c/49562/
17:53:39 <mtreinish> to split out the unit tests from the jenkins runs
17:54:20 <dkranz> mtreinish: doesn't everything run in jenkins?
17:54:38 <mlavalle> dkranz: salv-orlando proposed a fix to neutron that should help with test_network_basic_ops fails
17:54:49 <mtreinish> dkranz: I should have worded it more clearly. make the unit tests a separate jenkins job only for tempest
17:54:57 <dkranz> mtreinish: Oh, ok
17:55:00 <mtreinish> instead of running it with everything else
17:55:23 <dkranz> mlavalle: That's great
17:55:37 <marun> we definitely need to address errors in the log on otherwise successful runs
17:55:37 <mtreinish> mlavalle: looking at the elastic-recheck graph 1224001 is still happening
17:55:40 <mlavalle> dkranz: I'll test tonight or tomorrow
17:55:52 <marun> it's confusing things when we have failures - hard to see the forest for the trees.
17:56:01 <mtreinish> or do you think it's just an overzealous query
17:56:29 <mlavalle> mtreinish: I don't know. I need to look at it carefully
17:57:04 <mtreinish> mlavalle: http://status.openstack.org/elastic-recheck/
17:57:08 <dkranz> mlavalle: ok
17:57:15 <marun> i've been seeing failures that gerrit attributes to 124001 that are not due to a neutron test failure.
17:57:20 <mlavalle> mtreinish: thanks i'll look at it
17:57:21 <dkranz> We only have a few minutes left
17:57:33 <mlavalle> dkranz: thanks
17:57:42 <mtreinish> marun: it looks for 'tempest.scenario.test_network_basic_ops AssertionError: Timed out waiting for' in the test output
17:57:50 <dkranz> mlavalle, marun : anything else on neutron?
17:57:59 <mlavalle> dkranz: i'm done
17:58:03 <marun> i'm done
17:58:14 <marun> mtreinish: i'll follow up offline about the bug detection
17:58:20 <mtreinish> marun: ok
17:58:22 <dkranz> ok, anything else from any one? We had a full agenda today.
17:58:46 <Anju> dkranz:  can we renamed the Duplicate exception to Conflict ?
17:58:59 <Anju> dkranz:  can we rename the Duplicate exception to Conflict ?
17:59:28 <mkoderer> let's discuss it after the meeting?
17:59:29 <Anju> as 409 depicts Conflict
17:59:40 <dkranz> Yes, we are out of time.
17:59:45 <Anju> sure
17:59:53 <dkranz> #endmeeting
18:00:13 <mtreinish> fungi, clarkb, jeblair: ^^^
18:00:13 <dkranz> mtreinish: Can sdague end the meeting now or is he really gone?
18:00:22 <mtreinish> sdague is away how do we end the meeting
18:00:28 <clarkb> there is a timeout in meetbot
18:00:38 <clarkb> it is 60 minutes, so anyone can end it after that timeout
18:00:50 <clarkb> try it again
18:00:57 <dkranz> #endmeeting