Thursday, 2015-04-23

*** mtanino has quit IRC00:00
*** tsekiyam_ has quit IRC00:01
*** suneelb has quit IRC00:02
openstackrecheckopenstack/nova change: https://review.openstack.org/175739 failed gate-tempest-dsvm-full, gate-tempest-dsvm-postgres-full, gate-tempest-dsvm-neutron-full, gate-devstack-dsvm-cells, gate-tempest-dsvm-nova-v21-full, gate-tempest-dsvm-full-ceph in the gate queue with an unrecognized error00:10
*** davideagnello has quit IRC00:14
*** mlavalle has quit IRC00:15
*** yamamoto has quit IRC00:16
openstackgerritMatthew Treinish proposed openstack/tempest: Add unit test to ensure sane test class subclassing  https://review.openstack.org/17653900:18
openstackrecheckopenstack/glance change: https://review.openstack.org/174405 failed gate-tempest-dsvm-full, gate-tempest-dsvm-postgres-full, gate-tempest-dsvm-neutron-full, gate-devstack-dsvm-cells, gate-tempest-dsvm-full-ceph in the gate queue with an unrecognized error00:23
*** marun has quit IRC00:26
*** zz_dimtruck is now known as dimtruck00:27
openstackrecheckopenstack/glance change: https://review.openstack.org/171022 failed gate-tempest-dsvm-full, gate-tempest-dsvm-postgres-full, gate-tempest-dsvm-neutron-full, gate-devstack-dsvm-cells, gate-tempest-dsvm-full-ceph in the gate queue with an unrecognized error00:40
*** bknudson has quit IRC00:45
*** annashen has joined #openstack-qa00:45
openstackrecheckopenstack/horizon change: https://review.openstack.org/175122 failed gate-horizon-python27, gate-horizon-python27-django14, gate-horizon-selenium, gate-tempest-dsvm-full in the gate queue with an unrecognized error00:50
*** pc_m has quit IRC00:53
*** amotoki has joined #openstack-qa00:55
openstackrecheckopenstack/glance change: https://review.openstack.org/169813 failed gate-tempest-dsvm-full, gate-tempest-dsvm-postgres-full, gate-tempest-dsvm-neutron-full, gate-devstack-dsvm-cells, gate-tempest-dsvm-full-ceph in the gate queue with an unrecognized error00:57
*** mtanino has joined #openstack-qa00:58
*** annashen has quit IRC01:01
*** amotoki has quit IRC01:04
*** annashen has joined #openstack-qa01:06
openstackgerritIan Wienand proposed openstack-dev/devstack: Select Libreswan driver for openswan in Fedora  https://review.openstack.org/17619201:06
*** annashen has quit IRC01:06
*** amotoki has joined #openstack-qa01:09
openstackrecheckConsole logs not available after 13:21s for check-devstack-dsvm-cells 176518,1,1e4f25501:14
*** dboik has joined #openstack-qa01:15
*** dboik has quit IRC01:19
*** bknudson has joined #openstack-qa01:20
*** shashankhegde has quit IRC01:21
*** wuhg has joined #openstack-qa01:21
*** achanda has quit IRC01:22
*** marun has joined #openstack-qa01:26
*** mlavalle has joined #openstack-qa01:28
*** haigang has joined #openstack-qa01:41
*** Longgeek has joined #openstack-qa01:46
openstackrecheckConsole logs not available after 13:22s for gate-horizon-python27 139026,5,af61ad701:49
*** shashankhegde has joined #openstack-qa01:51
*** Longgeek has quit IRC01:56
*** Longgeek has joined #openstack-qa01:56
*** david-lyle has quit IRC01:59
openstackgerritSam Wan proposed openstack-dev/devstack: Fixes multiple global_filter created in /etc/lvm/lvm.conf by set_lvm_filter.  https://review.openstack.org/17622202:00
openstackrecheckConsole logs not available after 13:23s for gate-nova-python26 149371,3,4b27a8102:02
*** davideagnello has joined #openstack-qa02:02
*** davideagnello has quit IRC02:07
*** annashen has joined #openstack-qa02:07
*** suneelb has joined #openstack-qa02:10
*** annashen has quit IRC02:12
*** haigang has quit IRC02:12
*** harlowja is now known as harlowja_away02:12
*** ajmiller has joined #openstack-qa02:13
*** ajo has joined #openstack-qa02:13
openstackrecheckConsole logs not available after 13:22s for gate-horizon-python27-django14 173552,1,a52711902:15
*** patrickeast has quit IRC02:18
*** yamamoto has joined #openstack-qa02:22
*** marun has quit IRC02:23
*** rfolco has quit IRC02:28
openstackrecheckConsole logs not available after 13:22s for check-tempest-dsvm-full 176526,2,f4de36502:29
*** suneelb has quit IRC02:33
*** haigang has joined #openstack-qa02:39
openstackrecheckConsole logs not available after 13:22s for check-tempest-dsvm-full 175567,3,d7de63d02:42
*** achanda has joined #openstack-qa02:53
openstackrecheckConsole logs not available after 13:22s for check-tempest-dsvm-full 176457,1,606290a02:56
*** achanda has quit IRC02:59
*** yamamoto has quit IRC02:59
*** yamamoto has joined #openstack-qa03:05
*** dimtruck is now known as zz_dimtruck03:08
openstackrecheckConsole logs not available after 13:24s for gate-horizon-python27-django14 176532,1,4c6a87203:09
*** mmedvede has quit IRC03:10
openstackgerritMerged openstack-infra/devstack-gate: Make aiopcpu subnode directory more deterministic  https://review.openstack.org/16944903:15
*** mmedvede has joined #openstack-qa03:18
*** annashen has joined #openstack-qa03:30
*** yamamoto has quit IRC03:31
*** shashankhegde has quit IRC03:42
*** Madan has joined #openstack-qa03:43
openstackgerritMerged openstack-dev/grenade: support apache based upgrades for keystone  https://review.openstack.org/17583103:46
openstackgerritMerged openstack-dev/grenade: add process debugging when things fail  https://review.openstack.org/17593503:47
openstackgerritMerged openstack-infra/devstack-gate: add support for stable/kilo  https://review.openstack.org/17539103:49
openstackgerritMerged openstack/tempest: remove swift cli tests  https://review.openstack.org/17544703:49
*** oomichi has joined #openstack-qa03:52
*** yamamoto has joined #openstack-qa03:56
*** annashen has quit IRC04:04
openstackgerritMerged openstack-dev/devstack: made several changes to guides to comply to doc conventions  https://review.openstack.org/17602904:05
*** mlavalle_ has joined #openstack-qa04:05
*** mlavalle has quit IRC04:05
*** davideagnello has joined #openstack-qa04:07
*** krtaylor has quit IRC04:08
*** mlavalle_ has quit IRC04:11
*** krtaylor has joined #openstack-qa04:11
*** Sukhdev has joined #openstack-qa04:17
*** coolsvap|afk is now known as coolsvap04:31
*** coolsvap is now known as coolsvap|afk04:32
*** annashen has joined #openstack-qa04:35
*** annashen has quit IRC04:48
*** ajmiller has quit IRC04:54
*** david-lyle has joined #openstack-qa04:59
*** mtanino has quit IRC05:01
*** sks has joined #openstack-qa05:05
*** Poornima has joined #openstack-qa05:08
*** mmedvede has quit IRC05:08
openstackgerritIan Wienand proposed openstack-dev/devstack: Fix negated services with common prefix  https://review.openstack.org/17359305:10
*** krtaylor has quit IRC05:19
*** Triveni has joined #openstack-qa05:19
*** annashen has joined #openstack-qa05:20
*** marrusl has quit IRC05:21
*** mmedvede has joined #openstack-qa05:21
*** rwsu has quit IRC05:21
openstackgerritSam Wan proposed openstack-dev/devstack: Fixes multiple global_filter created in /etc/lvm/lvm.conf by set_lvm_filter.  https://review.openstack.org/17622205:26
*** shashankhegde has joined #openstack-qa05:26
*** davideagnello has quit IRC05:30
*** armax has quit IRC05:32
*** achanda has joined #openstack-qa05:33
*** krtaylor has joined #openstack-qa05:33
*** marrusl has joined #openstack-qa05:34
*** k4n0 has joined #openstack-qa05:35
*** Longgeek has quit IRC05:44
*** mmedvede has quit IRC05:46
*** luqas has joined #openstack-qa05:51
*** luqas has quit IRC05:55
*** Longgeek has joined #openstack-qa05:57
*** mmedvede has joined #openstack-qa06:04
*** BharatK has joined #openstack-qa06:04
*** pkoniszewski has joined #openstack-qa06:04
*** ibiris_away is now known as ibiris06:06
*** afazekas_ has joined #openstack-qa06:11
*** yog__ has joined #openstack-qa06:14
*** Longgeek has quit IRC06:16
*** Sukhdev has quit IRC06:17
*** anshul has quit IRC06:20
*** Longgeek has joined #openstack-qa06:26
*** anshul has joined #openstack-qa06:33
*** woodster_ has quit IRC06:40
*** Longgeek_ has joined #openstack-qa06:43
*** Longgeek has quit IRC06:46
*** annashen has quit IRC06:46
*** sileht has quit IRC06:48
*** yfried|afk has quit IRC06:56
*** mpaolino has joined #openstack-qa06:57
*** neeti has joined #openstack-qa07:00
*** sputnik13 has joined #openstack-qa07:02
openstackgerritRamakrishnan G proposed openstack-dev/devstack: Add support for ilo drivers in lib/ironic  https://review.openstack.org/17473307:03
openstackgerritRamakrishnan G proposed openstack-dev/devstack: Add support for ilo drivers in lib/ironic  https://review.openstack.org/17473307:05
openstackgerritSam Wan proposed openstack-dev/devstack: Fixes multiple global_filter created in /etc/lvm/lvm.conf by set_lvm_filter.  https://review.openstack.org/17622207:07
*** achanda has quit IRC07:07
*** pkoniszewski has quit IRC07:07
*** pkoniszewski has joined #openstack-qa07:07
*** Longgeek_ has quit IRC07:09
*** shashankhegde has quit IRC07:09
*** shashankhegde has joined #openstack-qa07:10
*** sileht has joined #openstack-qa07:10
*** Longgeek has joined #openstack-qa07:11
*** luqas has joined #openstack-qa07:11
*** pkoniszewski has quit IRC07:12
*** rushiagr_away is now known as rushiagr07:12
*** jyuso has joined #openstack-qa07:13
*** heyongli has quit IRC07:15
*** alex_xu has quit IRC07:15
*** alex_xu_ has joined #openstack-qa07:15
*** jyuso1 has quit IRC07:15
*** weshay has joined #openstack-qa07:17
*** heyongli has joined #openstack-qa07:17
*** shashankhegde has quit IRC07:18
*** davideagnello has joined #openstack-qa07:18
*** tkammer|not_here has joined #openstack-qa07:23
*** davideagnello has quit IRC07:23
*** bkopilov has quit IRC07:26
*** yfried|afk has joined #openstack-qa07:26
*** SridharG has joined #openstack-qa07:29
*** jlanoux has joined #openstack-qa07:40
*** e0ne has joined #openstack-qa07:41
*** bkopilov has joined #openstack-qa07:44
*** coolsvap|afk is now known as coolsvap07:52
*** coolsvap is now known as coolsvap|afk07:53
*** e0ne is now known as e0ne_07:53
*** safchain has joined #openstack-qa07:56
*** yfried|afk has quit IRC08:01
*** gfidente has joined #openstack-qa08:03
*** gfidente has quit IRC08:03
*** gfidente has joined #openstack-qa08:03
*** apevec has joined #openstack-qa08:08
*** apevec has quit IRC08:08
*** apevec has joined #openstack-qa08:08
*** zz_johnthetubagu is now known as johnthetubaguy08:08
*** jyuso has quit IRC08:10
*** jordanP has joined #openstack-qa08:11
*** e0ne_ is now known as e0ne08:14
*** liusheng has left #openstack-qa08:15
*** bkopilov has quit IRC08:19
*** jordanP has quit IRC08:19
*** e0ne is now known as e0ne_08:25
*** yfried|afk has joined #openstack-qa08:25
*** e0ne_ is now known as e0ne08:26
*** andreykurilin__ has joined #openstack-qa08:27
*** ihrachyshka has joined #openstack-qa08:27
*** Longgeek has quit IRC08:31
*** Longgeek has joined #openstack-qa08:31
*** e0ne has quit IRC08:32
*** jordanP has joined #openstack-qa08:43
*** hichihara has quit IRC08:46
*** sputnik1_ has joined #openstack-qa08:55
*** sputnik13 has quit IRC08:55
openstackrecheckopenstack/horizon change: https://review.openstack.org/175869 failed gate-horizon-python27, gate-horizon-python27-django14, gate-horizon-selenium in the gate queue with an unrecognized error08:55
*** Matt2 has quit IRC08:56
*** sputnik13 has joined #openstack-qa08:58
*** sputnik1_ has quit IRC09:01
*** davideagnello has joined #openstack-qa09:08
*** davideagnello has quit IRC09:12
*** e0ne has joined #openstack-qa09:13
*** shardy has joined #openstack-qa09:18
*** e0ne is now known as e0ne_09:31
*** haigang has quit IRC09:31
*** anshul has quit IRC09:32
*** aix has joined #openstack-qa09:35
*** e0ne_ is now known as e0ne09:35
*** mmedvede has quit IRC09:43
*** tosky has joined #openstack-qa09:43
*** anshul has joined #openstack-qa09:44
*** andreykurilin__ has quit IRC09:44
*** yamamoto has quit IRC09:45
*** annashen has joined #openstack-qa09:47
*** annashen has quit IRC09:51
*** e0ne is now known as e0ne_09:53
*** Longgeek has quit IRC09:57
*** Longgeek has joined #openstack-qa09:59
*** pcrews has quit IRC09:59
*** cody-somerville has joined #openstack-qa09:59
*** cody-somerville has quit IRC09:59
*** cody-somerville has joined #openstack-qa09:59
*** mmedvede has joined #openstack-qa10:00
*** e0ne_ has quit IRC10:03
*** yamamoto has joined #openstack-qa10:07
*** e0ne has joined #openstack-qa10:09
openstackgerritnayna-patel proposed openstack/tempest: CRUD operations for Glance Namespaces  https://review.openstack.org/17668610:09
openstackrecheckopenstack/ceilometer change: https://review.openstack.org/174320 failed gate-tempest-dsvm-neutron-full in the gate queue with an unrecognized error10:12
*** afazekas_ has quit IRC10:14
*** amotoki_ has joined #openstack-qa10:15
*** samueldmq has joined #openstack-qa10:21
*** mpaolino has quit IRC10:26
*** k4n0 has quit IRC10:28
*** cdent has joined #openstack-qa10:32
*** afazekas has joined #openstack-qa10:32
*** Triveni has quit IRC10:32
*** k4n0 has joined #openstack-qa10:34
*** yamamoto has quit IRC10:39
*** rushiagr is now known as rushiagr_away10:40
openstackgerritAbhishek G M proposed openstack/tempest: Test Creation of server in a shared_network  https://review.openstack.org/16891010:45
cdentIs there a preferred way to turn on USE_VENV in a devstack-based gate functional test? gnocchi doesn't follow global requirements and is ahead of keystone on oslo.db but keystone-manage barfs, thinking perhaps venv will work around that10:47
*** sputnik13 has quit IRC10:48
*** pkoniszewski has joined #openstack-qa10:48
openstackgerritAbhishek G M proposed openstack/tempest: Test Create of VM with multiple Nics  https://review.openstack.org/16804010:50
*** davideagnello has joined #openstack-qa10:56
*** SridharG1 has joined #openstack-qa11:00
*** davideagnello has quit IRC11:01
*** SridharG has quit IRC11:03
openstackrecheckopenstack/ceilometer change: https://review.openstack.org/176572 failed because of: gate-tempest-dsvm-postgres-full: https://bugs.launchpad.net/bugs/142915111:04
openstackLaunchpad bug 1429151 in Cinder "SimpleReadOnlyCinderClientTest.test_cinder_encryption_type_list fails randomly" [High,Confirmed]11:04
*** yamamoto has joined #openstack-qa11:07
nithyag_ping @afazekas11:08
*** aix has quit IRC11:22
*** yamamoto has quit IRC11:23
*** david-lyle has quit IRC11:29
openstackgerritSean Dague proposed openstack-dev/grenade: add convenience tools  https://review.openstack.org/17411111:33
openstackgerritSean Dague proposed openstack-dev/grenade: allow DEVSTACK_DIRS to be overridden  https://review.openstack.org/17670011:33
*** alex_xu_ has quit IRC11:38
*** weshay has quit IRC11:41
*** alex_xu has joined #openstack-qa11:41
*** weshay has joined #openstack-qa11:46
*** yog__ has quit IRC11:51
*** bknudson has quit IRC11:57
*** BharatK has quit IRC11:58
*** aix has joined #openstack-qa12:00
*** aix has quit IRC12:00
*** aix has joined #openstack-qa12:00
*** jhenner has quit IRC12:03
*** Triveni has joined #openstack-qa12:03
*** e0ne is now known as e0ne_12:04
*** david-lyle has joined #openstack-qa12:05
*** cdent has quit IRC12:13
*** cdent has joined #openstack-qa12:14
*** jgrimm is now known as zz_jgrimm12:15
afazekasnithyag_, pong12:16
*** luqas has quit IRC12:24
*** e0ne_ is now known as e0ne12:24
*** k4n0 has quit IRC12:27
*** woodster_ has joined #openstack-qa12:29
*** david-lyle has quit IRC12:31
openstackgerritRamakrishnan G proposed openstack-dev/devstack: Add support for ilo drivers in lib/ironic  https://review.openstack.org/17473312:35
*** jhenner has joined #openstack-qa12:35
*** SridharG1 has quit IRC12:37
*** neeti_ has joined #openstack-qa12:37
*** neeti has quit IRC12:38
*** mpaolino has joined #openstack-qa12:40
mtreinishcdent: we just use tox to run tests for tempest in a dsvm env. It takes care of the venv separation for running the tests12:41
openstackgerritSean Dague proposed openstack-dev/grenade: fix neutron grenade due to kilo rename of lib/neutron  https://review.openstack.org/17672312:41
mtreinishcdent: or are you trying to install gnocchi in a venv and run tests from a different place?12:41
cdentmtreinish: I'm not entirely certain what the problem or scope really is12:42
cdentBasically when keystone-manage runs (during the devstack build) it conflicts with an oslo_db that gnocchi wants to install.12:42
sdaguecdent: correct, gnocchi needs to stay within global-requirements to be runnable in a devstack12:43
cdentIf I add USE_VENV to the localrc that is getting generated via the gate_hook things work12:43
cdentsdague: we're thinking about ways to work around that because it has proved too limiting12:43
sdagueUSE_VENV is all kind of experimental at this point12:43
cdentwhat we will probably do is make gnocchi _always_ run in a venv12:44
*** jhenner has quit IRC12:44
* cdent finds some links12:44
cdenthttps://review.openstack.org/#/c/176503/12:44
sdaguecdent: so, basically, that probably will mean the ops community rejects it entirely12:44
*** davideagnello has joined #openstack-qa12:45
cdentI think the hope is that it is a temporary patch while waiting for the rest of the world to catch up, but I don't actually know12:45
cdentI was just trying to see what was possible12:45
cdentthe original review that introduced the problem was: https://review.openstack.org/#/c/175872/12:46
*** amotoki_ has quit IRC12:46
cdents/introduced/uncovered/12:46
sdagueit's not actually a problem, it's the way the system is designed to work12:47
*** Longgeek has quit IRC12:47
cdentI'm increasingly feeling like the purpose of the gate in my mind is not the same as the purpose of the gate in the general problem.12:47
*** erlon has joined #openstack-qa12:47
cdentIn my mind its point should be to discover and fix breakage as fast as possible.12:47
cdentWhereas it seems like its point is to keep things stable.12:47
cdentWhich has the knock-on effect of slowing down the fixing of breakage.12:48
sdagueright, it's pre-commit ci12:48
*** Longgeek has joined #openstack-qa12:48
sdagueso if you put things into pre-commit ci the point is the tree is never broken12:48
sdagueso that all 1000 developers get to start from known good points12:48
sdagueand aren't constantly questioning if their code is broken because of someone else's commit12:49
sdagueyou can also do post-commit ci in the system, but people don't very much12:49
sdaguewhich would be it's ok to break things, and we'll fix them up later12:50
*** davideagnello has quit IRC12:50
cdentOkay, so going back to the practical issue:12:50
cdentIf gnocchi wants to be decoupled from the release cycle (which has requirements freezes and the like on the cycle) but still wants to play well with others and make use of the ci, what are some options for accomplishing that?12:52
cdentThe two that I can think of at this point are: a) turn on venvs throughout devstack b) make gnocchi use its own venv12:52
cdentsdague: are there others?12:52
sdaguethose are probably it, however it doesn't actually "play well with others" in either of those cases12:53
*** Longgeek_ has joined #openstack-qa12:55
cdentHow are you defining "play well with others"? Uses the same requirements?12:56
*** SridharG has joined #openstack-qa12:56
sdagueworks as a system level install12:56
sdaguewith the other components12:56
cdentK, just wanted to clarify12:56
sdagueit basically makes it unpackagable by distros12:56
*** Longgeek has quit IRC12:56
cdentI don't have a particular iron in this fire, I'm just trying to gather as much data as possible for having a conversation with jd and mehdi as we decide what to do12:57
sdaguesure12:57
sdagueso a current standing tenet is that the system level services should all be coinstallable at a system level12:57
cdentthere's a next issue which is that keystone-manage and keystone-all are easy_install scripts and thus use pkg_resources to create a WorkingSet (which is the thing that blows up when the wrong oslo_db is present). Is that a common style or is the general pattern these days to use console_scripts entry points?12:59
*** bknudson has joined #openstack-qa13:00
sdaguecdent: that's a style that exists, it'll be hit by openstack client after that point if you get past that one13:00
sdagueor on the swift / ceilometermiddleware start13:01
* cdent nods13:01
cdenthmmm13:01
*** dkranz has joined #openstack-qa13:02
*** zz_dimtruck is now known as dimtruck13:03
*** jhenner has joined #openstack-qa13:07
*** rahmu has quit IRC13:08
*** rahmu has joined #openstack-qa13:09
*** Triveni has quit IRC13:09
openstackrecheckopenstack/nova change: https://review.openstack.org/168778 failed gate-tempest-dsvm-nova-v21-full in the gate queue with an unrecognized error13:10
*** yamamoto has joined #openstack-qa13:11
*** jecarey has joined #openstack-qa13:13
*** yamamoto has quit IRC13:16
*** mriedem_away has quit IRC13:21
*** dustins has joined #openstack-qa13:21
*** luqas has joined #openstack-qa13:25
*** e0ne has quit IRC13:25
*** sks has quit IRC13:25
cdentthanks for the info sdague that gives me enough to chew on and think about how to move things along13:27
*** neeti_ has quit IRC13:27
*** e0ne has joined #openstack-qa13:27
*** dimtruck is now known as zz_dimtruck13:28
openstackgerritMatthew Treinish proposed openstack/tempest-lib: Switch from using custom urlparse import logic to six  https://review.openstack.org/17673113:28
*** rfolco has joined #openstack-qa13:28
sdaguecoolio13:29
*** mpaolino has quit IRC13:29
*** luqas has quit IRC13:30
openstackgerritBaodong (Robert) Li proposed openstack-dev/devstack: Add /usr/local/bin to exec_dirs in rootwrap.conf  https://review.openstack.org/17604113:35
*** dboik has joined #openstack-qa13:39
*** wuhg has quit IRC13:39
*** SridharGaddam has quit IRC13:42
*** mriedem has joined #openstack-qa13:47
*** shardy_ has joined #openstack-qa13:57
*** shardy has quit IRC13:59
*** tochi has quit IRC13:59
openstackgerritSean Dague proposed openstack-dev/grenade: fix neutron grenade due to kilo rename of lib/neutron  https://review.openstack.org/17672314:01
openstackgerritSean Dague proposed openstack-dev/grenade: make the ss debug include all services  https://review.openstack.org/17674814:01
sdaguedansmith: so.... I think nova might have the same stop problem that keystone had14:01
sdaguethere is an oslo fix for that14:02
dansmithsdague: the same "stop problem" ?14:02
*** shardy_ has quit IRC14:02
sdaguethe old oslo service code ignores signals if there are open connections14:02
dansmithhmm, really?14:03
sdagueapparently14:03
sdaguebknudson figured out that was the root issue14:03
*** shardy has joined #openstack-qa14:03
sdagueit ignores the term signal at least14:03
dansmithI just did something to nova the other day that was only happening when we received the term signal14:03
dansmithrelated to deleting all events14:03
bknudsondoesn't ignore it... allows the term signal to shut down the process14:03
dansmithso that we don't hang forever14:03
*** yamamoto has joined #openstack-qa14:04
bknudsonto recreate, all I did was open a connection to the process using nc14:04
sdaguebknudson: ok, you explain the issue then, because I don't have it fully in my head14:04
dansmithsdague: https://review.openstack.org/#/c/169056/14:04
dansmithsdague: so roman at least seems to be able to make it do the term thing: https://bugs.launchpad.net/nova/+bug/143818314:05
openstackLaunchpad bug 1438183 in OpenStack Compute (nova) "Graceful shutdown of nova-compute service fails" [Medium,Fix released] - Assigned to Dan Smith (danms)14:05
sdaguedansmith: did that end up in stable/kilo?14:05
dansmithsdague: targeted for rc114:05
sdaguehttps://bugs.launchpad.net/oslo-incubator/+bug/144658314:05
openstackLaunchpad bug 1446583 in oslo-incubator "services no longer reliably stop in stable/kilo" [Critical,Fix committed] - Assigned to Julien Danjou (jdanjou)14:05
bknudsonthe server would fork off a bunch of child processes... these child processes were handling the SIGTERM, by setting a flag to exit...14:05
dansmithsdague: without that fix, and instances mid-boot, you might perceive it as ignoring the term signal14:05
bknudsonbut for some reason, maybe related to eventlet, the child wouldn't actually process the SIGTERM if it was reading from the socket.14:05
bknudsonby default, a SIGTERM will just kill the process anyways, so there was no reason for the child to actually try to catch it and handle it if all it was going to do was shut down anyway14:06
*** paul_glass has joined #openstack-qa14:06
bknudsonso the fix was to change the child process from trying to handle SIGTERM and just let SIGTERM kill the process.14:06
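
A minimal sketch of the two child behaviours bknudson describes, assuming a plain os.fork() worker on a POSIX system rather than the real oslo-incubator service code. The function name run_child_and_terminate is hypothetical. It does not reproduce the eventlet read hang; it only shows how the parent's view of the child differs between "catch SIGTERM and set a flag" and "let SIGTERM kill the process", using the same wording as the service.py log messages quoted in this channel.

```python
import os
import signal
import time


def run_child_and_terminate(use_handler):
    """Fork a child, send it SIGTERM, and report how the child ended."""
    read_fd, write_fd = os.pipe()
    pid = os.fork()
    if pid == 0:
        # Child process.
        os.close(read_fd)
        stop = {"requested": False}

        def on_sigterm(signum, frame):
            # Old style: only set a flag and rely on the loop noticing it.
            stop["requested"] = True

        if use_handler:
            signal.signal(signal.SIGTERM, on_sigterm)
        else:
            # New style: default disposition, so SIGTERM ends the process.
            signal.signal(signal.SIGTERM, signal.SIG_DFL)

        # Tell the parent the signal disposition is now in place.
        os.write(write_fd, b"x")
        while not stop["requested"]:
            time.sleep(0.05)
        os._exit(0)

    # Parent process: wait until the child is ready, then terminate it.
    os.close(write_fd)
    os.read(read_fd, 1)
    os.close(read_fd)
    os.kill(pid, signal.SIGTERM)
    _, status = os.waitpid(pid, 0)
    if os.WIFSIGNALED(status):
        return "killed by signal %d" % os.WTERMSIG(status)
    return "exited with status %d" % os.WEXITSTATUS(status)
```

With use_handler=True the child only exits if its loop gets a chance to see the flag, which is the step that apparently never happened while the real child was blocked reading from a socket. With use_handler=False there is no such dependency.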
dansmiththat likely doesn't apply to nova14:07
sdaguedansmith: because?14:07
*** luqas has joined #openstack-qa14:08
dansmithsdague: well, I meant nova-compute, because it has no child processes14:08
dansmithand because we can't ignore it and achieve the same goal14:08
dansmithwe need to handle it14:08
bknudsonthis was definitely related to how the oslo-incubator code handled the multi-process model.14:08
sdaguesure, the reason this failed - http://logs.openstack.org/23/176723/1/check/check-grenade-dsvm-neutron/6307eb0/ however was because nova-api didn't shut down14:08
openstackrecheckopenstack-dev/devstack change: https://review.openstack.org/176012 failed gate-tempest-dsvm-full in the gate queue with an unrecognized error14:08
*** jhenner has quit IRC14:09
*** Poornima has quit IRC14:09
bknudsonI don't understand why eventlet seems to have an issue with handling a signal while it's reading.14:09
dansmithsdague: okay, nova-api would apply I guess because (a) child processes and (b) we don't need to do the same thing that nova-compute does on term14:09
bknudsontypically if you get a signal, even if it's handled, the read gets a ESIG or something.14:10
dansmithbknudson: http://logs.openstack.org/23/176723/1/check/check-grenade-dsvm-neutron/6307eb0/logs/old/screen-n-api.txt.gz14:10
dansmithbknudson: "child caught sigterm" -- is that the symptom?14:10
bknudsonEINTR14:10
sdaguedansmith: actually, yes, I think that's typically the issue14:10
dansmithokay14:10
sdaguethe symptom14:10
sdaguewe can look at the working run14:11
dansmithI *think* nova-api would be the only server affected in nova, based on your description14:11
bknudsonhttp://git.openstack.org/cgit/openstack/oslo-incubator/tree/openstack/common/service.py#n35014:11
bknudson-- that's the service code... so it's like "Child %(pid)d killed by signal %(sig)d" or "'Child %(pid)s exited with status %(code)d'"14:11
sdaguedansmith: so... I thought I saw it with nova-conductor in other runs14:12
dansmithbknudson: yeah, see the bottom of that log14:12
*** rbak has joined #openstack-qa14:12
dansmithsdague: ah, true, conductor does use the same worker model as api,14:12
bknudsony, that's exactly what it looked like.14:12
dansmithsdague: but it needs to handle it like compute does14:12
dansmithsdague: it can't just punt like api could14:12
sdaguedansmith: ok, so... that's a different patch?14:13
*** ChuckC has quit IRC14:13
bknudsonoh, so you don't want the children to just get killed? because then my patch breaks nova.14:13
dansmithI dunno, let me look14:13
dansmithbknudson: not for conductor, ideally14:13
sdaguebknudson: nova-compute doesn't have workers14:13
sdagueso it shouldn't affect that14:13
bknudsony, this is nova-api log I'm looking at.14:13
dansmithright14:14
sdagueyep14:14
dansmiththere are lots of patches in that bug, which one should I be looking at?14:14
sdaguethe oslo incubator one probably14:14
dansmith176151/14:14
dansmithright, yeah, we don't want to do that in nova-conductor if we can help it14:15
bknudsonyou want nova-conductor to keep running after signaled?14:15
dansmithit needs to finish things up, yeah14:15
bknudsonof course, if it gets a SIGKILL it's going away anyways.14:15
dansmithit could be instrumenting a migration between two computes14:16
dansmithbknudson: of course, but term is supposed to be "finish your work, get to a stopping point, and shut down"14:16
bknudsonfrom what I've seen of RHEL is that it gives your process a few seconds.14:16
dansmitheh?14:16
bknudsonit'll do a SIGTERM and then a few seconds later it tries another signal.14:17
dansmithon system shutdown you mean14:17
bknudsony14:17
dansmithright, but that's different14:17
openstackgerritMartin Pavlásek proposed openstack/tempest: Added test of default domain in Keystone  https://review.openstack.org/3178214:17
bknudsonthan service stop?14:17
dansmithsure14:17
dansmithand also, not everyone is on rhel, systemd, etc14:17
dansmithif you're rolling out new code,14:18
*** shashankhegde has joined #openstack-qa14:18
dansmithyou need to be able to sigterm the process, let it land gracefully, and then restart it with new code14:18
dansmithfor minimal disruption to the running deployment14:18
dansmiththat's the point of this stuff14:18
bknudsonhow long will it take?14:18
dansmithon compute it could be quite a while14:18
dansmithwe go to great pains to stop listening for new rpc requests, and let all running threads finish before shutting down14:19
dansmithelse instances and db state end up in all manner of broken when you restart14:19
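
A rough sketch of the "stop taking new requests, let running threads finish" pattern dansmith describes, assuming plain threads instead of nova's RPC server and eventlet. GracefulWorker and its methods are hypothetical names for illustration, not nova code.

```python
import signal
import threading


class GracefulWorker:
    """Hypothetical worker: refuses new jobs after SIGTERM, drains the rest."""

    def __init__(self):
        self._stopping = threading.Event()
        self._in_flight = []
        self.completed = []

    def handle_sigterm(self, signum, frame):
        # Only record the request; draining happens in wait().
        self._stopping.set()

    def submit(self, name, work):
        """Start a job unless shutdown has begun; return True if accepted."""
        if self._stopping.is_set():
            return False
        thread = threading.Thread(target=self._run, args=(name, work))
        self._in_flight.append(thread)
        thread.start()
        return True

    def _run(self, name, work):
        work()
        self.completed.append(name)

    def wait(self):
        """Block until every job accepted before shutdown has finished."""
        for thread in self._in_flight:
            thread.join()


if __name__ == "__main__":
    worker = GracefulWorker()
    # signal.signal() must be called from the main thread.
    signal.signal(signal.SIGTERM, worker.handle_sigterm)
    worker.submit("example-job", lambda: None)
    worker.wait()
```

The design point is that the handler does almost nothing; how long wait() takes depends entirely on the jobs already accepted, which is why a fixed timeout may be too short for conductor or compute.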
bknudsonoh, good.14:19
bknudsonat least you thought about it and it's not just random behavior... unlike keystone not stopping when there's a connection open.14:19
openstackgerritSean Dague proposed openstack-dev/grenade: allow DEVSTACK_DIRS to be overridden  https://review.openstack.org/17670014:20
openstackgerritSean Dague proposed openstack-dev/grenade: add convenience tools  https://review.openstack.org/17411114:20
dansmithbknudson: no, this is very calculated behavior14:20
sdaguedansmith: so .... why might API be hanging14:20
bknudsonok, so the question for the bug was only about nova-api.14:20
*** johnthetubaguy is now known as zz_johnthetubagu14:20
dansmithsdague: well, it sounds like bknudson suspects weird eventlet stuff?14:20
dansmithit certainly seems like it's doing the right thing in that log, with things tearing down, etc14:21
bknudsonthe nova-api log looks the same as the keystone log, so I expect it's the same problem... easy to recreate by just opening a connection to nova and trying to stop it.14:21
*** zz_johnthetubagu is now known as johnthetubaguy14:21
dansmithbknudson: but, both api and conductor have connections open to the mq all the time14:23
dansmithbknudson: maybe this is a bug in the wsgi stuff?14:23
sdagueso - http://logs.openstack.org/23/176723/1/check/check-grenade-dsvm-neutron/6307eb0/logs/old/screen-n-api.txt.gz#_2015-04-23_13_18_35_857 was an inbound connection 15 seconds prior to the kill14:23
bknudsonI didn't look at outbound connections at all.14:23
*** rwsu has joined #openstack-qa14:24
dansmithsdague: which seems like a lot of padding before kill time14:24
*** anshul has quit IRC14:24
sdagueis http keepalive on?14:24
bknudsonsome web clients use persistent connections.14:25
dansmithright, but all the clients here are commandline bits or tempest, right?14:25
bknudsonthe keystoneclient session code uses connection pools14:25
dansmithand those should be gone immediately I think14:25
dansmithI'm spinning up a devstack to poke at14:25
sdagueall the clients will be the python-*client libs or tempest14:26
dansmithright14:26
sdaguepython-*clients use requests14:26
dansmithnone of which should still be living by the time we run the kill, yes?14:26
dansmithI mean, none of the processes making those calls through the clients14:26
openstackgerritJens Rosenboom proposed openstack-dev/devstack: DO NOT MERGE - Update default cirros version  https://review.openstack.org/16677814:27
sdaguewell, in the neutron case, neutron is still up14:27
sdaguenova comes down first14:28
dansmithah, right14:28
dansmithI'm fine if we can blame this on neutron, yet again :P14:28
*** mtanino has joined #openstack-qa14:28
dansmithsdague: you said you saw this on conductor once?14:28
bknudsonI started devstack, was able to CTRL-C n-api, then I restarted it, in another window did 'nc localhost 8775', then tried to CTRL-C n-api again... it didn't shut down.14:29
bknudsonthis is the same problem keystone-all had.14:29
dansmithbknudson: and just to be clear, when you kill the nc, it doesn't finish shutting down, right?14:30
bknudsondansmith: correct, 2015-04-23 09:28:45.606 INFO nova.openstack.common.service [-] Child 32127 exited with status 114:30
bknudsonthat was the last line.14:30
sdaguedansmith: yeh, there is an ER query in the bug14:30
bknudsonthen I CTRL-C again and it goes away14:30
dansmithoh, hang on14:30
bknudsonor if I disconnect the client it goes away14:31
dansmiththe first batch of caffeine is just starting to metabolize...14:31
sdaguehttp://logstash.openstack.org/#eyJzZWFyY2giOiJtZXNzYWdlOlwiVGhlIGZvbGxvd2luZyBzZXJ2aWNlcyBhcmUgc3RpbGwgcnVubmluZ1wiIEFORCBtZXNzYWdlOlwiZGllXCIiLCJmaWVsZHMiOltdLCJvZmZzZXQiOjAsInRpbWVmcmFtZSI6IjYwNDgwMCIsImdyYXBobW9kZSI6ImNvdW50IiwidGltZSI6eyJ1c2VyX2ludGVydmFsIjowfSwic3RhbXAiOjE0Mjk3OTk0NTY0OTB914:31
dansmiththe change to incubator doesn't look like it would break anything in nova, because it's not like it's avoiding calling the thing we use in compute to finish processing14:31
dansmithso we must make another call to signal() ourselves14:31
bknudsonsdague: are those all failures? I wonder if sometimes they do eventually stop.14:32
sdaguebknudson: those are all failures14:33
sdaguethat's after the 10 second wait14:33
sdagueI can increase the wait time if you think they'll eventually exit14:33
bknudson10 seconds should be enough14:33
bknudsonat least for keystone-all... I don't know about the other nova- processes.14:33
sdaguethat's the message you get when it goes fatal (note the die)14:33
sdaguebased on what's in the request logs, I feel like at least for nova-api there shouldn't be any actual outstanding work14:34
bknudsonif you're saying that nova-conductor needs all the time it can get to eventually shut down, then 10 seconds might not be enough.14:34
*** achanda has joined #openstack-qa14:34
sdaguesure, agree, dansmith can weigh in on that14:34
dansmithwell,14:35
dansmithconductor and compute need all the time they can get to shutdown14:35
dansmithhowever,14:35
dansmithif this is a -infra timeout, and 10 seconds should be enough based on the workload, then that's fine14:35
sdaguewe're in a pretty constrained env here, so they should be idle14:35
sdagueyeh, it's a grenade timeout14:35
dansmithhowever, it doesn't sound like nova-api is going to eventually shut down14:35
bknudsonmight be interesting to see what those processes are up to and waiting on, if that's possible to dump out.14:35
*** BadCub_Hiding is now known as BadCub14:36
sdaguehttps://github.com/openstack-dev/grenade/blob/master/projects/60_nova/shutdown.sh#L3014:36
bknudsonI assumed for keystone-all that admin doesn't care what users might be up to.14:36
dansmithokay, so, I don't think this makes it any worse for nova14:36
*** artom has joined #openstack-qa14:36
dansmithand it probably means that conductor is not doing what we think it is/should today14:36
dansmithcompute works differently and it's the important one14:37
dansmithso if this fixes the problem, we should just sync it to nova14:37
sdagueyeh, let me find a conductor fail14:37
sdagueso we can look in more detail14:37
dansmithwell, my point is,14:37
*** anshul has joined #openstack-qa14:37
dansmithconductor using the process worker model isn't calling the term handler(s) now I don't think14:37
dansmithand this won't change it14:37
sdagueok14:37
dansmithconductor hasn't had long-running processes for long, so we probably just made an assumption that we were good, when we're not14:38
bknudsonis there a test for -- conductor is busy and got a shutdown so it waits?14:38
dansmithbknudson: no14:38
bknudsonthat would not be an easy test to write14:38
dansmithyeah14:38
dansmiththe actual mechanics of what happens is really wrapped up in the messaging code, etc14:39
dansmithwhich is why it has regressed in compute a few times14:39
*** jhenner has joined #openstack-qa14:39
*** zz_dimtruck is now known as dimtruck14:40
nithyag_afazekas Could you please have a look at the new patch https://review.openstack.org/#/c/14922914:40
*** ajmiller has joined #openstack-qa14:40
nithyag_afazekas I hope this patch addresses your comments. Please let me know if you see any gap14:41
nithyag_afazekas leaving now, will work on your suggestions if any on the new patch tomorrow. thanks14:44
*** achanda has quit IRC14:44
*** jhenner has quit IRC14:46
*** SridharGaddam has joined #openstack-qa14:48
*** ChuckC has joined #openstack-qa14:48
openstackgerritMatthew Treinish proposed openstack/tempest: Switch all uses of StringIO to use it from six  https://review.openstack.org/17678314:49
openstackgerritMatthew Treinish proposed openstack/tempest: Switch all uses of urlparse to import from six  https://review.openstack.org/17678414:49
openstackgerritMatthew Treinish proposed openstack/tempest: Switch all uses of ConfigParser to import from six  https://review.openstack.org/17678514:49
openstackgerritMatthew Treinish proposed openstack/tempest: Switch all uses of httplib to import from six  https://review.openstack.org/17678614:49
openstackgerritMatthew Treinish proposed openstack/tempest: Fix raise syntax in test.py for python3 compat  https://review.openstack.org/17678714:49
openstackgerritMatthew Treinish proposed openstack/tempest: Switch all uses of urllib and HTMLParser to import from six  https://review.openstack.org/17678814:49
*** nithyag_ has quit IRC14:50
*** Madan has quit IRC14:50
*** nithyag_ has joined #openstack-qa14:50
*** SridharGaddam has quit IRC14:51
*** SridharGaddam has joined #openstack-qa14:53
*** gamado has joined #openstack-qa14:54
*** jhenner has joined #openstack-qa14:55
sdaguedansmith: oh... mf14:55
sdaguehttp://logs.openstack.org/23/176723/2/check/check-grenade-dsvm-neutron/1011118/logs/grenade.sh.txt.gz14:56
sdagueso the entire socket dump is there14:56
sdagueI think it's ceilometer-collector14:56
sdaguethat's holding open things14:56
dansmithto nova-api?14:57
sdagueno, never mind, I think I'm wrong14:57
mtreinishsdague: I did just see a random commit fly by somewhere which said a new requests thing is hanging things14:57
mtreinishdug it up: https://review.openstack.org/#/c/176771/14:58
mtreinishdoubt that's really anything though14:59
sdagueyeh, unrelated14:59
*** mlavalle has joined #openstack-qa14:59
*** sabeen1 has joined #openstack-qa15:00
*** pcrews has joined #openstack-qa15:02
*** tkammer|not_here has quit IRC15:03
*** ihrachyshka has quit IRC15:03
*** SridharGaddam has quit IRC15:04
*** sabeen2 has joined #openstack-qa15:04
*** marun has joined #openstack-qa15:04
*** SridharGaddam has joined #openstack-qa15:05
*** sabeen1 has quit IRC15:07
*** apevec has quit IRC15:08
*** rdekel has joined #openstack-qa15:11
*** jlanoux has quit IRC15:16
*** tkammer|not_here has joined #openstack-qa15:17
*** dboik has quit IRC15:18
openstackgerritSean Dague proposed openstack-infra/devstack-gate: add liberty to feature grid  https://review.openstack.org/17549315:21
openstackgerritSean Dague proposed openstack-infra/devstack-gate: only force off KEYSTONE_USE_WSGI up until kilo  https://review.openstack.org/17553115:21
openstackgerritSean Dague proposed openstack-dev/grenade: make the ss debug include all services  https://review.openstack.org/17674815:22
openstackgerritMatthew Treinish proposed openstack/tempest: Switch all uses of urllib and HTMLParser to import from six  https://review.openstack.org/17678815:23
openstackgerritMatthew Treinish proposed openstack/tempest: Fix comprehension syntax error with python3  https://review.openstack.org/17681415:23
openstackgerritMatthew Treinish proposed openstack/tempest: Switch all uses of iteritems to use six instead  https://review.openstack.org/17681515:23
mtreinishman I really hate python3...15:23
mtreinishafter all those patches you can only do a testr list-tests and run the unit tests with 30 failures15:24
sdagueit's not python3, it's the 2 versions of python at once15:24
mtreinishyeah, that's a fair point15:25
openstackgerritOpenStack Proposal Bot proposed openstack-dev/hacking: Updated from global requirements  https://review.openstack.org/17682015:27
*** dboik has joined #openstack-qa15:29
*** david-lyle has joined #openstack-qa15:29
*** armax has joined #openstack-qa15:30
openstackgerritSean Dague proposed openstack-dev/grenade: fix neutron grenade due to kilo rename of lib/neutron  https://review.openstack.org/17672315:34
openstackgerritOpenStack Proposal Bot proposed openstack/tempest: Updated from global requirements  https://review.openstack.org/17685115:36
openstackgerritOpenStack Proposal Bot proposed openstack/tempest-lib: Updated from global requirements  https://review.openstack.org/17685215:36
*** davideagnello has joined #openstack-qa15:41
*** bkopilov has joined #openstack-qa15:46
openstackgerritMatthew Treinish proposed openstack/tempest: Switch use of hashing strings to unicode in accounts  https://review.openstack.org/17686215:46
*** tsekiyama has joined #openstack-qa15:52
*** shashankhegde has quit IRC15:53
dansmithsdague: bknudson: I can't repro locally15:53
dansmithif I establish a connection to nova-api, and then "pkill nova-api" I see all the shutdown messages as expected, but everything dies and the connection terminates15:53
dansmithno nova-api processes left afterwards15:53
bknudsonis that what grenade does? pkill?15:54
*** jlanoux has joined #openstack-qa15:55
sdaguedansmith: pkill -g process leader pid15:55
*** dboik has quit IRC15:56
*** david-lyle has quit IRC15:56
dansmithah, there we go15:57
dansmithleaves me with one child and the parent15:57
bknudson`pkill -g 4384` killed it, where bknudson  4384 28633 11 10:56 pts/10   00:00:02 /usr/bin/python /usr/local/bin/nova-api15:57
dansmithsdague: one more of those commands cleans it up, no need to disconnect15:57
*** mlavalle has quit IRC15:58
dansmithlet me try again to confirm15:58
sdaguedansmith: ok, cool15:58
bknudsonif you send another signal while it's waiting it shuts down.15:58
bknudsonthe oslo-incubator code disables its signal handler after the first signal15:58
dansmithhmm, now I can't get it to happen :)15:59
sdaguebknudson: so we could work around the tests with a double kill?15:59
sdaguedansmith: it's not all the time15:59
bknudsonsdague: yes, that's another workaround15:59
*** jlanoux is now known as jlanoux_away15:59
bknudsonkeep signalling it15:59
sdagueand I don't know why we get stretches where it happens a lot15:59
*** jlanoux_away is now known as jlanoux15:59
bknudsonI didn't think that was a fix though... I didn't want keystone to not shut down just because a client was connected15:59
*** amotoki_ has joined #openstack-qa15:59
bknudsonsomeone might consider that a DoS15:59
dansmithsdague: yeah, but like, worked first time, and hasn't ten in a row :)16:00
*** dboik has joined #openstack-qa16:00
sdaguebknudson: sure, I agree, having the fix is better16:00
openstackgerritMatthew Treinish proposed openstack/tempest: Switch all uses of urllib and HTMLParser to import from six  https://review.openstack.org/17678816:00
openstackgerritMatthew Treinish proposed openstack/tempest: Switch all uses of ConfigParser to import from six  https://review.openstack.org/17678516:00
openstackgerritMatthew Treinish proposed openstack/tempest: Switch all uses of urlparse to import from six  https://review.openstack.org/17678416:00
openstackgerritMatthew Treinish proposed openstack/tempest: Fix raise syntax in test.py for python3 compat  https://review.openstack.org/17678716:00
openstackgerritMatthew Treinish proposed openstack/tempest: Switch all uses of httplib to import from six  https://review.openstack.org/17678616:00
openstackgerritMatthew Treinish proposed openstack/tempest: Fix comprehension syntax error with python3  https://review.openstack.org/17681416:00
openstackgerritMatthew Treinish proposed openstack/tempest: Switch all uses of iteritems to use six instead  https://review.openstack.org/17681516:00
openstackgerritMatthew Treinish proposed openstack/tempest: Switch all uses of StringIO to use it from six  https://review.openstack.org/17678316:00
openstackgerritMatthew Treinish proposed openstack/tempest: Switch use of hashing strings to unicode in accounts  https://review.openstack.org/17686216:01
dansmithsdague: seriously, I wonder if it's something to do with the first time it's started from either devstack or screen or whatever16:01
dansmithsdague: let me restack and try again16:02
bknudsonit's very easily reproducible for me... I always just use the devstack screen & restart it there.16:02
dansmithbknudson: yeah, that's what I'm doing16:03
bknudsonso, just wanted to mention that there weren't times where it sometimes worked and sometimes didn't.16:03
bknudson(for me)16:03
dansmithyeah, that's what has me very curious about root cause16:03
*** ihrachyshka has joined #openstack-qa16:04
sdaguedansmith: honestly, the speed of the machine might matter16:05
dansmithsdague: mine is very fast16:05
dansmithbut16:05
dansmithI hit it the FIRST time I tried16:05
dansmithbut we'll see in a minute if that's related16:05
dansmithbut, a second kill seemed to fix it, which is excellent if so, I just want to see it again16:06
dansmith...like the money I sent to the IRS16:06
dansmithdamn, I could have earned respect from mriedem if the joke had been "like that money I sent to that TV preacher"16:07
*** hemnafk is now known as hemna16:07
mriedemi'm lost16:07
mriedemthe baker's put your money to good use16:08
mriedemyou don't need it back, you were paid in karma16:08
dansmithmriedem: https://www.phactual.com/16-christmas-vacation-quotes-we-all-know-and-love/16:09
mriedemdansmith: i know the summer vacation quotes better16:09
mriedemnothing to be proud of russ....50 yards...16:10
dansmithsdague: happened again on the first run from devstack16:10
mriedemhttps://www.youtube.com/watch?v=c1wcnvDpdVU16:10
dansmithsdague: second kill took it out16:10
sdagueok good to know16:10
dansmithsdague: second attempt at restarting in the screen doesn't hit it16:10
dansmithweird.16:10
*** jlanoux has quit IRC16:10
*** pkoniszewski has quit IRC16:11
openstackgerritDan Smith proposed openstack-dev/devstack: Remove the lvm.conf filter during cleanup  https://review.openstack.org/17688916:14
dansmithsdague: that's why my clean.sh wasn't working ^16:14
dansmithsdague: because of switching between ceph and lvm I think16:14
sdagueoh, yeh, good catch16:14
sdaguethe lvm filter isn't idempotent in its add either, right?16:15
dansmithI don't think it is, because I had a few in there16:16
openstackgerritRoman Podoliaka proposed openstack/tempest: Add a test case for rebuild of instances with volumes  https://review.openstack.org/17594916:16
dansmithso we could run this clean at the top of that function if you want16:16
sdagueany chance you want to throw a 2nd patch up to fix that16:16
sdagueno, I think the clean fix is fine16:16
sdagueit would also be nice to make the add idempotent16:16
dansmithI mean, we run clean before we do the filter16:16
dansmithwhich just nukes the line if it's there, before we try to regenerate/add it16:16
*** aix has quit IRC16:17
sdagueoh, hmmm... it seems safer to not constantly delete/add, but that would be a thing we could do16:17
sdagueyeh, either way16:17
dansmithwell,16:17
dansmithwe have to make sure we could match anything anyway, so this seems easier16:18
sdagueyeh, ok, that would work for me16:18
openstackgerritDan Smith proposed openstack-dev/devstack: Remove the lvm.conf filter during cleanup  https://review.openstack.org/17688916:18
dansmiththat ^16:18
dansmiththe problem for me was that it was different based on the ceph or lvm config,16:19
dansmithso just nuking any previous devstack lines avoids us guessing what it might look like16:19
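[annotation] The "nuke any previous devstack lines, then add" idea can be sketched as a pure text transformation. This is a hedged illustration only, not the devstack patch: the marker comment and the function name are assumptions, and the sketch simply appends the new line rather than placing it in the right section of a real lvm.conf.

```python
MARKER = "# from devstack"  # assumed tag; any unique trailing comment would do


def set_tagged_line(text, new_line, marker=MARKER):
    """Drop every line ending in `marker`, then append `new_line` with the tag.

    Because old tagged lines are removed whatever they contain, there is no
    need to guess what an earlier run (ceph or lvm flavoured) wrote.
    """
    kept = [ln for ln in text.splitlines() if not ln.rstrip().endswith(marker)]
    kept.append(f"{new_line}  {marker}")
    return "\n".join(kept) + "\n"
```

Running the function twice with the same `new_line` yields the same text as running it once, which is the idempotency being asked for in the discussion.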
*** tkammer|not_here has quit IRC16:19
*** jhenner has quit IRC16:21
openstackgerritSridhar Gaddam proposed openstack-dev/devstack: Devstack to support an IPv6 infra installation  https://review.openstack.org/17689816:22
*** weshay has quit IRC16:23
dansmithsdague: so should I put this service kill thing in the release notes under known issues with the workaround?16:23
*** ajo has quit IRC16:24
sdaguedansmith: so I think we're landing this for an rc3 because of the currently blocking neutron thing16:24
dansmithsdague: hmm, okay, where is that being discussed?16:24
*** SridharG has left #openstack-qa16:25
sdague  #openstack-relmgr-office16:25
sdaguethat's where ttx is making calls on such things16:25
dansmithah, okay16:25
*** mlavalle has joined #openstack-qa16:26
sdaguehere is the relevant conversation paste - http://paste.openstack.org/show/205334/16:26
*** Triveni has joined #openstack-qa16:26
dansmithI'm in there, so I have scrollback16:27
dansmithI just didn't realize16:27
sdagueok, sorry, yeh, there has been a lot of channel hopping16:27
dansmithsdague: so just to be clear, we *could* put a simple hack of "kill; sleep 1; kill" into our code to unblock this, yes?16:27
sdaguethat sounds possible16:28
dansmithif it were my call, which it isn't, I'd not cause another rc3 for nova for this just because our scripts hit it16:28
sdaguenow that you confirmed that it's always working on second kill16:28
sdaguefeel free to counter propose to ttx about it16:29
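[annotation] A minimal sketch of the "kill; sleep 1; kill" workaround, written against a process group as in the `pkill -g` case mentioned earlier. The function name and defaults are assumptions for illustration; the actual workaround proposed in the channel lives in devstack's shell functions, not in Python.

```python
import os
import signal
import time


def double_signal_group(pgid, delay=1.0, sig=signal.SIGTERM):
    """Send `sig` to process group `pgid` twice, `delay` seconds apart.

    Returns how many signals were actually sent (0, 1 or 2). A group that
    no longer exists is not an error: the loop simply stops.
    """
    sent = 0
    for attempt in range(2):
        try:
            os.killpg(pgid, sig)
        except ProcessLookupError:
            break  # nothing left to signal
        sent += 1
        if attempt == 0:
            time.sleep(delay)
    return sent
```

Sending the second signal blindly keeps the caller simple: it does not need to detect whether the first signal left a parent or child behind.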
*** patrickeast has joined #openstack-qa16:31
*** mwagner_lap has quit IRC16:31
dansmithsdague: where is the kill happening? I don't see pkill in all of grenade16:32
sdaguedevstack16:32
sdaguewe call into devstack functions for most of the work16:32
dansmithokay, so grenade calls unstack?16:32
sdagueno, it calls devstack functions16:32
sdagueprojects/60_nova/shutdown.sh16:32
sdaguestop_nova_rest16:33
dansmithgotcha16:33
sdaguethe shutdown functions source BASE_DEVSTACK...16:33
sdaguethat eventually gets you into functions-common:stop_process16:33
dansmithyeah16:33
*** Swami has joined #openstack-qa16:34
sdaguehttps://github.com/openstack-dev/devstack/blob/master/functions-common#L134616:34
dansmithsdague: so we could do this either in devstack or grenade I think16:34
sdaguewhat's your grenade proposed fix?16:35
dansmithsdague: devstack would make it possible to just hit nova-api, but just calling stop_nova_rest twice in grenade would be more likely to just work I think16:35
sdagueso stop_nova_rest twice isn't going to work16:35
sdaguebecause we rm the pid file16:35
sdagueso that we don't have stale pids16:36
*** e0ne has quit IRC16:36
dansmithah16:36
sdaguehttps://github.com/openstack-dev/devstack/blob/master/functions-common#L134716:36
sdaguethat's what I was starting to write... then discovered that16:36
*** zz_jgrimm is now known as jgrimm16:36
dansmithsdague: so the other grenade-only change could be just really hacky after stop_nova_rest: pkill nova-api16:37
sdaguewe could capture the pid files before the kill so we could rekill.16:37
sdaguedansmith: yeh, we'll still have the conductor racing every once in a while16:37
dansmithwell, we could do the same for conductor16:38
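[annotation] Since the stop function removes the pid file, a second kill needs the pids recorded beforehand. The sketch below shows that capture step only; the directory layout, the `.pid` suffix, and the function name are hypothetical and are not taken from devstack.

```python
from pathlib import Path


def capture_pids(pid_dir, suffix=".pid"):
    """Map service name -> pid for every well-formed pid file in `pid_dir`.

    Call this before the normal stop routine deletes the files, so the
    recorded pids can still be signalled a second time afterwards.
    """
    pids = {}
    for path in sorted(Path(pid_dir).glob("*" + suffix)):
        text = path.read_text().strip()
        if text.isdigit():  # skip empty or corrupt pid files
            pids[path.name[: -len(suffix)]] = int(text)
    return pids
```

The returned mapping stays valid after the files are removed, which is the property the rekill idea depends on.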
*** Triveni has quit IRC16:38
*** luqas has quit IRC16:38
*** Triveni has joined #openstack-qa16:38
dansmithI guess you want to avoid the obligatory double-kill of everything in devstack16:38
sdaguewe could do the double kill blindly16:39
sdaguehonestly, if we need to work around in test code, that would be my thought on the way to do16:39
sdaguescheduler fails sometimes as well - http://logs.openstack.org/41/176041/2/check/check-grenade-dsvm/763fe2c/logs/grenade.sh.txt.gz16:39
dansmithhmm16:39
dansmitheven more suspect :/16:39
sdagueI think this is a race we've had for a long time16:40
sdaguethe first oslo-incubator change also removed an eventlet sleep, which might make it more likely to expose16:40
sdaguedansmith: you want to propose the double kill into stable/kilo devstack?16:42
openstackgerritDan Smith proposed openstack-dev/devstack: Workaround potential failure to shutdown services  https://review.openstack.org/17690916:42
dansmithsdague: ^16:42
dansmiththat what you had in mind?16:42
sdagueyeh, but stable/kilo only16:42
dansmithokay16:42
dansmithprobably want a comment above it too16:43
sdagueand probably should have a comment referencing the bug16:43
sdagueyep16:43
*** ajmiller_ has joined #openstack-qa16:43
sdaguehopefully we can remove this after backports16:43
sdagueafter release16:43
sdagueonce you get that up, I'll adjust the grenade stack to be on top of it16:44
dansmithhttps://review.openstack.org/17691016:44
*** cody-somerville has quit IRC16:45
dansmithI'm going to use this bug to justify a third cup of coffee, just FYI16:46
openstackgerritSean Dague proposed openstack-dev/grenade: allow DEVSTACK_DIRS to be overridden  https://review.openstack.org/17670016:46
openstackgerritSean Dague proposed openstack-dev/grenade: add convenience tools  https://review.openstack.org/17411116:46
openstackgerritSean Dague proposed openstack-dev/grenade: make the ss debug include all services  https://review.openstack.org/17674816:46
openstackgerritSean Dague proposed openstack-dev/grenade: fix neutron grenade due to kilo rename of lib/neutron  https://review.openstack.org/17672316:46
sdaguedansmith: will you put whisky in it?16:46
dansmithno, waaay too early for that16:46
dansmithand also, I don't like whisky16:47
*** ajmiller has quit IRC16:47
*** shardy is now known as shardy_afk16:48
sdague:)16:48
sdaguemtreinish: ... I'm seeing a lot more failures on test_volume_boot_pattern in master16:48
sdagueis that a thing?16:48
dansmithsdague: failures with lvm?16:49
sdaguedansmith: yep16:49
*** dboik has quit IRC16:51
sdaguealso, seriously, wth keystone - http://logs.openstack.org/12/176012/2/gate/gate-tempest-dsvm-full/4f6c11f/logs/screen-n-api.txt.gz?level=TRACE#_2015-04-23_13_41_17_50016:51
*** apevec has joined #openstack-qa16:51
mtreinishsdague: is there a pattern? There have been a few generic nondescript ssh failures on that test16:52
mtreinishbut I haven't seen a major spike16:52
*** mpaolino has joined #openstack-qa16:52
sdaguemtreinish: it's killed a few of my patches today16:52
*** jordanP has quit IRC16:52
sdagueI'm looking deeper16:52
sdaguethough I need to stop for lunch soon16:52
*** miqui has joined #openstack-qa16:53
sdaguehttps://review.openstack.org/#/c/176012/ which is a noop patch effectively got whacked by it16:53
mtreinishsdague: yeah that was an ssh timeout, mriedem has hit a couple of those too16:54
*** dustins_ has joined #openstack-qa16:56
*** dboik has joined #openstack-qa16:57
openstackgerritJens Rosenboom proposed openstack-dev/devstack: DO NOT MERGE - Update default cirros version  https://review.openstack.org/16677816:58
*** safchain has quit IRC16:59
*** dustins has quit IRC16:59
*** nfedotov has joined #openstack-qa16:59
*** johnthetubaguy is now known as zz_johnthetubagu17:01
mtreinishsdague: so I used my fancy new subunit2sql tool for failure analysis and got: 99.1123% and 0.1024% over the past week for boot volume v217:01
mtreinishwith this nice graph: http://imgur.com/Bx8DYMp17:02
mtreinishwhere green is daily success count, red is skips and blue are failures17:02
mtreinishmriedem: ^^^17:02
mtreinishobviously the graphing side still needs some work :)17:02
*** cbader has quit IRC17:03
*** cbader has joined #openstack-qa17:03
mriedemheh17:03
mtreinishalso only gate17:04
*** suneelb has joined #openstack-qa17:05
*** rushiagr_away is now known as rushiagr17:07
*** Longgeek_ has quit IRC17:07
*** yamamoto has quit IRC17:08
*** harlowja_away is now known as harlowja17:13
*** marcusvrn has quit IRC17:20
*** marcusvrn has joined #openstack-qa17:20
*** achanda has joined #openstack-qa17:21
jogomriedem: we have a lot of failures http://status.openstack.org//elastic-recheck/data/uncategorized.html17:21
*** Triveni has quit IRC17:21
jogomriedem: based on http://jogo.github.io/gate/ it looks like it was fixed17:22
jogois there a bug opened for this? so I can add a e-r bug17:22
mriedemjogo: for this? http://logs.openstack.org/20/174320/3/gate/gate-tempest-dsvm-neutron-full/6ca2c9a/console.html#_2015-04-23_10_11_17_00217:26
*** marcusvrn1 has joined #openstack-qa17:26
jogoyeah17:27
*** marcusvrn has quit IRC17:27
jogoor was it http://logs.openstack.org/49/85249/18/check/check-tempest-dsvm-full/77170c5/logs/devstacklog.txt.gz#_2015-04-23_01_38_39_158 ?17:28
*** ihrachyshka has quit IRC17:29
jogoahh https://bugs.launchpad.net/devstack/+bug/144747617:30
openstackLaunchpad bug 1447476 in OpenStack Dashboard (Horizon) "No module named pathlib" [Critical,Fix released]17:30
*** marun has quit IRC17:34
openstackgerritJoe Gordon proposed openstack-infra/elastic-recheck: Add fingerprint for bug 1447476  https://review.openstack.org/17691917:35
openstackbug 1447476 in OpenStack Dashboard (Horizon) "No module named pathlib" [Critical,Fix released] https://launchpad.net/bugs/144747617:35
jogomriedem: ^17:35
*** patrickeast_ has joined #openstack-qa17:36
*** patrickeast has quit IRC17:37
*** patrickeast_ is now known as patrickeast17:37
mriedemjogo: k sec17:37
*** dboik has quit IRC17:38
*** dboik has joined #openstack-qa17:39
*** gamado has quit IRC17:41
*** sileht has quit IRC17:44
*** shashankhegde has joined #openstack-qa17:45
openstackgerritMerged openstack-dev/devstack: Fix issue with ml2 plugin on using provider network  https://review.openstack.org/17473117:45
*** e0ne has joined #openstack-qa17:53
*** sileht has joined #openstack-qa17:55
*** e0ne is now known as e0ne_18:01
openstackgerritSean M. Collins proposed openstack-dev/devstack: [WIP] Test VXLAN tunnel type with Linux Bridge  https://review.openstack.org/17692718:02
*** e0ne_ has quit IRC18:06
*** david-lyle has joined #openstack-qa18:07
*** tosky has quit IRC18:08
*** tosky_ has joined #openstack-qa18:08
*** tosky_ has quit IRC18:09
*** melwitt has joined #openstack-qa18:11
*** e0ne has joined #openstack-qa18:13
*** dboik_ has joined #openstack-qa18:15
*** dboik has quit IRC18:18
openstackgerritSean Dague proposed openstack-dev/grenade: allow DEVSTACK_DIRS to be overridden  https://review.openstack.org/17670018:19
openstackgerritSean Dague proposed openstack-dev/grenade: add convenience tools  https://review.openstack.org/17411118:19
openstackgerritSean Dague proposed openstack-dev/grenade: make the ss debug include all services  https://review.openstack.org/17674818:19
openstackgerritSean Dague proposed openstack-dev/grenade: fix neutron grenade due to kilo rename of lib/neutron  https://review.openstack.org/17672318:19
dansmithsdague: what happened?18:20
sdagueso... the whole stack was going to be blocked on the master patch you pushed18:20
sdaguewhich we didn't want to land18:20
dansmithwhy?18:21
sdagueso I had to push your stable/kilo with a different idempotent id, so this would only block on that18:21
sdaguewhy what?18:21
dansmithoh, you were using depends-on somewhere?18:21
sdagueyes18:21
dansmithI gotcha18:21
sdagueso that we could test it18:21
dansmiththe why was why blocked18:21
sdagueyeh18:21
sdagueso even if I abandoned the master patch, the depends-on blocks on abandoned things18:21
dansmithI didn't -x the cherry pick, but didn't think it'd matter about the change id18:22
dansmithyeah18:22
dansmithgot it now18:22
sdagueI only just figured that out, now things should be good to go18:22
dansmithokay18:22
sdaguebut still have to wait for test results18:22
openstackgerritMatthew Treinish proposed openstack/tempest: Fix tenant isolation and unit tests with py3  https://review.openstack.org/17693418:23
dansmithsdague: so this needs abandoning as well? https://review.openstack.org/#/c/176910/18:23
*** mpaolino has quit IRC18:23
sdagueyeh18:23
sdagueI was going to make sure to explain to you what was going on before I did that18:24
*** morganfainberg is now known as grebniafnagrom18:24
openstackgerritMerged openstack-infra/elastic-recheck: Add fingerprint for bug 1447476  https://review.openstack.org/17691918:24
openstackbug 1447476 in OpenStack Dashboard (Horizon) "No module named pathlib" [Critical,Fix released] https://launchpad.net/bugs/144747618:24
dansmithheh, it's cool18:24
*** grebniafnagrom is now known as morganfainberg18:24
*** cdent has quit IRC18:27
*** jlk has joined #openstack-qa18:27
jlkhogepodge: ping; Did you author https://etherpad.openstack.org/p/havana-capabilities-test-procedure ?18:28
*** e0ne is now known as e0ne_18:28
hogepodgejlk yes18:28
jlkquick question then, line 60, should that be --load-list rather than --load-tests ?18:29
hogepodgejlk probably18:29
*** e0ne_ is now known as e0ne18:29
hogepodgejlk yes, it's in error18:30
jlkalright cool. Just making sure I'm doing it right :)18:30
hogepodgejlk feel free to fix it in the etherpad. more official docs (which are also in error) are in the defcore repository now18:31
jlkgotcha18:31
jlkwant a PR for those?18:31
hogepodgeSure, always happy for that.18:31
hogepodgeThis is the file in question. https://github.com/openstack/defcore/blob/master/2015.04/procedure.rst18:32
hogepodgejlk: I can do it too. Don't want to be the "submit a pr" guy18:32
jlkheh, s'ok18:32
jlkchange up in Gerrit18:34
*** ArxCruz_ has joined #openstack-qa18:35
*** openstackrecheck has quit IRC18:36
*** openstackrecheck has joined #openstack-qa18:36
*** ArxCruz_ has quit IRC18:36
*** e0ne is now known as e0ne_18:40
openstackgerritClark Boylan proposed openstack-infra/devstack-gate: Make multinode first class with ansible  https://review.openstack.org/17261418:43
*** e0ne_ is now known as e0ne18:44
*** gfidente has quit IRC18:46
*** Swami has quit IRC18:49
*** dkranz has quit IRC18:50
*** yamamoto has joined #openstack-qa18:57
*** amotoki_ has quit IRC18:57
*** yamamoto has quit IRC19:01
*** mwagner_lap has joined #openstack-qa19:03
*** dkranz has joined #openstack-qa19:06
openstackrecheckopenstack/horizon change: https://review.openstack.org/176865 failed gate-grenade-dsvm in the gate queue with an unrecognized error19:07
*** shashankhegde has quit IRC19:08
mtreinishsdague: if you get a sec: https://review.openstack.org/#/c/172507/ should be pretty quick19:09
*** patrickeast has quit IRC19:10
*** rushiagr is now known as rushiagr_away19:14
dansmithsdague:  are we set -e here?19:15
*** marun has joined #openstack-qa19:15
sdagueyes19:15
sdaguewe try to set -e everywhere19:16
dansmithokay, I didn't realize, sorry19:16
sdagueno problem, I just was looking at zuul and saw the failure in progress, so reved it19:16
dansmithcool19:16
*** jhenner has joined #openstack-qa19:23
*** dkranz has quit IRC19:24
*** dkranz has joined #openstack-qa19:37
SergeyLukjanovmtreinish, sdague, I just saw a highlight from meeting - sahara definitely interested in grenade support19:39
*** jecarey has quit IRC19:42
*** Swami has joined #openstack-qa19:43
*** nfedotov has quit IRC19:47
mtreinishSergeyLukjanov: ok, good to know19:48
mtreinishI think we were discussing how we'd use sahara or ironic to highlight how to use the new modular grenade with an out of tree plugin19:48
SergeyLukjanovmtreinish, I think we could try to use it for sahara19:51
SergeyLukjanovmtreinish, I'm unfortunately not really aware of how updated grenade works...19:51
mtreinishSergeyLukjanov: fortunately it's much simpler to understand now19:52
mtreinishSergeyLukjanov: to add a project you just need to tell grenade about one of these dirs: http://git.openstack.org/cgit/openstack-dev/grenade/tree/projects/60_nova19:52
mtreinishand the files there all have a defined interface to work within19:53
*** amotoki_ has joined #openstack-qa19:57
*** annashen has joined #openstack-qa19:58
*** mtanino has quit IRC19:58
*** patrickeast has joined #openstack-qa19:59
*** Sukhdev has joined #openstack-qa19:59
*** ajo has joined #openstack-qa20:00
mtreinishdkranz: my open review list is getting a bit unwieldy. If you get a sec, could you take a look at: https://review.openstack.org/#/q/status:open+project:openstack/tempest+branch:master+topic:accounts_v3,n,z , https://review.openstack.org/174671 , and https://review.openstack.org/#/c/176731/20:03
mtreinishthey should all be pretty quick20:03
dkranzmtreinish: ok, working from home tomorrow so should definitely be able to then if not later today20:04
mtreinishdkranz: cool, thanks a lot20:04
*** annashen has quit IRC20:04
mtreinishI just took a look at my open review page and it was sitting at ~50 so figured I should push to get the simple ones landed at least :)20:06
*** annashen has joined #openstack-qa20:06
mtreinishgranted it's my own fault for trying to tackle py3 this morning20:06
*** annashen has quit IRC20:12
*** mtanino has joined #openstack-qa20:12
mtreinishsdague: thanks for the +A, a good chunk of my docs patches should be in the gate now20:13
sdagueyep20:14
*** tkammer|not_here has joined #openstack-qa20:15
jlkhrm, I seem to still be running into the problem with https://review.openstack.org/#/c/163549/ in my tempest setup, even though I'm using master20:15
jlknamely I'm getting oslo_config.cfg.RequiredOptError: value required for option: lock_path20:16
mtreinishjlk: is your venv stale?20:16
jlkI did a pip install -U -r requirements.txt in the .venv20:16
mtreinishjlk: hmm, that patch did change tempest to use the oslo lib for the lockutils instead of a local copy from the incubator20:16
mtreinishit did change the config file semantics a bit, but it should have been backwards compat20:17
mtreinishjlk: which group is your lock_path option in?20:18
*** cdent has joined #openstack-qa20:18
jlklet me find it.20:19
jlk[DEFAULT]20:19
jlkand it's commented out actually20:19
jlkso I have no lock_path set20:19
jlkhah, that would explain it20:19
openstackgerritSean Dague proposed openstack-dev/grenade: allow DEVSTACK_DIRS to be overridden  https://review.openstack.org/17670020:19
openstackgerritSean Dague proposed openstack-dev/grenade: add convenience tools  https://review.openstack.org/17411120:19
openstackgerritSean Dague proposed openstack-dev/grenade: make the ss debug include all services  https://review.openstack.org/17674820:19
openstackgerritSean Dague proposed openstack-dev/grenade: fix neutron grenade due to kilo rename of lib/neutron  https://review.openstack.org/17672320:19
mtreinishjlk: heh, yep :)20:19
sdaguedamn, I missed one of the lib/neutron references20:19
mtreinishjlk: oh, you know what, I could see prior to that patch we weren't enforcing that as a requirement and in certain configs it wasn't used20:19
sdaguebut it seemed to pass the shutdown part20:20
mtreinishjlk: but since we switched to using the lib it actually enforces the requirement now20:20
*** shashankhegde has joined #openstack-qa20:20
sdaguemtreinish: you want to +A https://review.openstack.org/#/c/176932 ?20:20
mtreinishjlk: also fwiw, the new location is in the oslo_concurrency group. It'll probably emit a deprecation warning if you use it in default20:21
jlkmtreinish: what's the lock path used?20:21
jlkmy old file doesn't even have that section :/20:21
*** marun has quit IRC20:21
* jlk looks in the new sample config20:21
mtreinishsdague: done20:22
jlkoh there is no default listed, so... what's a good reasonable choice for this? I don't really understand what this path will be used for20:22
mtreinishjlk: just something in /tmp is fine20:23
SergeyLukjanovmtreinish, I'll take a look on it tomorrow, it's near to midnight now20:23
mtreinishjlk: it's only used for writing files for external locks20:23
mtreinishjlk: tempest needs it because there are certain tests which can't run at the same time20:23
sdaguemtreinish: so, tempest could really overwrite that and give it a default20:23
mtreinishjlk: http://docs.openstack.org/developer/tempest/configuration.html#lock-path20:23
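Editor's sketch of the situation discussed above: lock_path is expected in the oslo_concurrency group, the older DEFAULT location is still accepted, and a commented-out line counts as unset. This is not tempest or oslo code. find_lock_path is a hypothetical helper, and it uses Python's standard configparser (not oslo.config) purely for illustration.

```python
import configparser


def find_lock_path(conf_text):
    """Return (group, value) for lock_path, or None if it is not set.

    Hypothetical helper for illustration only; oslo.config does its own
    parsing and deprecation handling.
    """
    # default_section is renamed so that [DEFAULT] is parsed as an ordinary
    # section instead of being merged into every other section.
    parser = configparser.ConfigParser(
        default_section="__unused__", interpolation=None
    )
    parser.read_string(conf_text)
    # Prefer the new group, then fall back to the old location.
    for group in ("oslo_concurrency", "DEFAULT"):
        if parser.has_option(group, "lock_path"):
            return group, parser.get(group, "lock_path")
    return None


if __name__ == "__main__":
    # A commented-out option, as in jlk's file, is the same as no option.
    print(find_lock_path("[DEFAULT]\n#lock_path = /tmp\n"))
    print(find_lock_path("[oslo_concurrency]\nlock_path = /tmp/tempest\n"))
```

Running the helper against a tempest.conf before a test run would flag the missing option up front rather than through a RequiredOptError.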
sdagueall the security concerns with services are pretty much irrelevant in the tempest case20:23
mtreinishsdague: I don't think there is an api for doing that in oslo_concurrency20:24
mtreinishand dhellmann will yell at me if I start set_default on config opts from a library20:24
sdagueyou can't just conf.set_override?20:24
mtreinishsdague: according to dhellmann opt names and groups aren't part of the stable api20:24
mtreinisherr public api20:24
mtreinishthey should hopefully be stable20:25
mtreinishsdague: we've hit this several times already in tempest after moving things to libs20:25
mtreinishsdague: the most recent being: https://review.openstack.org/#/c/173581/20:25
sdagueso, honestly, I think that's a good reason to look at removing oslo pieces in the process.20:26
*** jecarey has joined #openstack-qa20:26
*** e0ne is now known as e0ne_20:26
sdaguebecause if the answer is "you can't make reasonable application defaults", that's ... really problematic20:27
*** e0ne has joined #openstack-qa20:27
*** andreykurilin__ has joined #openstack-qa20:28
mtreinishsdague: oh, I just checked there is an api available to set the default lock_path20:29
sdagueI also don't actually understand why it's problematic to set options like that, there has to be a deprecation cycle in oslo, they can't just remove things20:30
sdagueand if you do that on a deprecated option, it should just warn20:30
mtreinishsdague: I've done that many times before, but dhellmann says I shouldn't20:31
sdagueright, I don't find that a reasonable position20:32
sdaguebecause you can still deprecate things20:33
sdagueyou just now have to warn when they are going away20:33
sdagueand applications adjust to them20:33
*** dboik_ has quit IRC20:36
*** artom has quit IRC20:36
*** apevec has quit IRC20:37
mtreinishsdague: actually how should I handle the multiprocess aspect of it? I can't just call tempfile like this: http://paste.openstack.org/show/205397/20:38
mtreinishbecause that would result in a different lock path for each worker20:38
*** marun has joined #openstack-qa20:39
sdagueso... if tempest was actually properly installed, it would be set to /var/lock/tempest/20:39
sdaguebarring that I'd force it to $TMP_DIR/tempest20:39
sdaguethere is a python call to get the system definition of where tmp is, right?20:40
mtreinishsdague: yeah tempfile.tempdir20:40
*** dustins_ has quit IRC20:40
*** achanda has quit IRC20:41
sdagueactually I think you want tempfile.gettempdir()20:41
*** annashen has joined #openstack-qa20:42
openstackgerritMatthew Treinish proposed openstack/tempest: Set a tempest lock_path default to a tmpdir  https://review.openstack.org/17697520:42
mtreinishsdague: ^^^20:42
mtreinishoh, maybe20:42
sdaguein reading the docs, I think you want the function20:42
mtreinishyep just tested it manually, you're right20:43
*** jgrimm is now known as zz_jgrimm20:43
openstackgerritMatthew Treinish proposed openstack/tempest: Set a tempest lock_path default to a tmpdir  https://review.openstack.org/17697520:43
mtreinishsdague: ^^^20:44
sdaguedo you have to make sure it exists?20:44
sdagueor will it make it for you?20:44
mtreinishsdague: that's a good question20:45
mtreinishI probably have to create it20:45
sdagueyou could write a unit test for it20:45
sdagueand find out20:45
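Editor's sketch of the idea in this exchange, assuming modern Python 3: build the default from tempfile.gettempdir() so every worker process computes the same path, and create the directory explicitly. tempfile.tempdir is a module variable that stays None until one of the tempfile functions initializes it, which is why the function call is the safer choice. default_lock_path and the "tempest" subdirectory name are illustrative, not what the patch under review does.

```python
import os
import tempfile


def default_lock_path(subdir="tempest"):
    """Return a lock directory that is identical for every worker process.

    Illustrative helper only. Unlike tempfile.mkdtemp(), the result does not
    contain a random component, so parallel workers agree on it.
    """
    # gettempdir() honours TMPDIR/TEMP/TMP and falls back to a platform default.
    path = os.path.join(tempfile.gettempdir(), subdir)
    # Create the directory if needed; exist_ok avoids a race between workers.
    os.makedirs(path, exist_ok=True)
    return path


if __name__ == "__main__":
    print(default_lock_path())
```

A unit test along the lines sdague suggests could call the helper twice and assert that both calls return the same existing directory.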
openstackgerritMerged openstack/tempest: Decouple Sahara templates test from vanilla plugin  https://review.openstack.org/16137020:46
sdagueok, calling it a night20:47
*** mlavalle has quit IRC20:47
*** e0ne has quit IRC20:47
*** samueldmq has quit IRC20:51
*** dboik has joined #openstack-qa20:55
openstackgerritAndrea Frittoli  proposed openstack/tempest: Add OS_TOP_LEVEL to testr conf  https://review.openstack.org/17697920:56
andreafmtreinish: let me know if you have any concerns about this: https://review.openstack.org/17697920:57
andreafoomichi, gmann, masayukig: I addressed your comments on https://review.openstack.org/#/c/153681 (I hope) :)21:01
mtreinishandreaf: the lock_path patch doesn't work :)21:02
andreafmtreinish: oh, too bad :(21:02
mtreinishI need to play with it a bit more, I'll add a unit test on the next rev to show it working21:02
andreafmtreinish: ok cool21:03
mtreinishandreaf: for the os_top_level thing I think we need to think about how we'll do that in the future21:03
mtreinishI think that interface will work, but when we have a real entry point/cli we'll want to just globally install tempest21:04
*** shardy_afk is now known as shardy21:05
*** zz_johnthetubagu is now known as johnthetubaguy21:05
andreafmtreinish: the problem is that at the moment .testr.conf is lost when tempest is installed - so what I do is install tempest and then pull .testr.conf from source21:06
openstackgerritMerged openstack/tempest: Start a tempest config guide networking section  https://review.openstack.org/17250721:06
openstackgerritMerged openstack/tempest: Add configuration guide sections for more required options  https://review.openstack.org/17404721:06
openstackgerritMerged openstack/tempest: Add config guide section on service feature enabled sections  https://review.openstack.org/17408621:07
andreafmtreinish: I guess if we want to have a proper CLI end point, the configuration needed for it will become part of the install21:07
openstackgerritMerged openstack/tempest: Add section to config guide for setting up services  https://review.openstack.org/17414921:07
mtreinishandreaf: yeah, that's what I was thinking21:07
mtreinishbut I'm not sure how to do that :)21:07
andreafmtreinish: neither am I - but that's something we should do in liberty I think21:09
mtreinishandreaf: yep21:10
mtreinishbut I guess in the short term adding another OS env var is fine21:10
mtreinishit's not like it hurts anything21:10
jlkCan any of you help me figure out what tempest is doing during setUpClass (tempest.api.compute.servers.test_create_server.ServersTestJSON) ? It's getting a 503 back, but I'm having a hard time finding the matching error on the openstack side21:11
jlkI think it's in the credentials setup21:11
jlkor not actually, let me post the trace21:12
mtreinishjlk: it should have more details in the tempest log file about the request and response21:12
jlkoh where does the log go?21:12
jlkah, log_file setting21:13
mtreinishjlk: yeah by default it tries stdout, but I think it gets lost because of the test runner. I have a patch up to switch that default value to tempest.log in your cwd21:14
*** tkammer|not_here has quit IRC21:15
openstackgerritMatthew Treinish proposed openstack/tempest: Set a tempest lock_path default to a tmpdir  https://review.openstack.org/17697521:16
mtreinishandreaf, sdague: ^^^21:16
*** ajo has quit IRC21:17
jlkhah, oops21:21
jlkso log_path isn't enough to get good logging, stupid file is empty21:21
*** cdent has quit IRC21:22
*** dboik has quit IRC21:22
mtreinishhmm, that's odd. normally all I need to do is set log_file and it should start logging as soon as the config file is loaded21:22
mtreinishwell if you have the traceback I can take a look at that and see if anything stands out21:23
mtreinishandreaf: heh, I ran unit tests locally but forgot pep8 :)21:24
jlkhttps://gist.github.com/j2sol/1d8ce608ad4baa397d8821:24
jlkso it's failing in create_network21:24
mtreinishjlk: yeah, that's all I can tell is that the 503 is being returned by neutron21:25
jlkso if I crank up debug and verbose, would I be able to get the actual details of the call, like what network it was trying to create with which details?21:25
*** cbader has quit IRC21:26
andreafmtreinish: oh, and if you have a sec to re-review this one: https://review.openstack.org/#/c/153681/ it has been around for 40 patchsets, I really hope it will be enough :D21:26
jlkactually21:26
mtreinishandreaf: heh, sure. But 40 is nothing21:26
jlkwhy is tempest trying to create a network?21:26
jlkand can I avoid doing that?21:27
mtreinishjlk: if you have tenant isolation enabled and neutron enabled tempest will try to make a network for each of the new tenants it creates21:27
mtreinishso it has an isolated network for testing with that user21:27
jlkhrm.21:27
mtreinishjlk: I just landed some docs around those config options21:28
jlkso maybe I have a bad setting in here for what CIDR to use for this network21:28
mtreinishjlk: but neutron shouldn't 503 if you give it a bad cidr21:28
jlkoh okay21:28
mtreinishor at least I would hope not, that should be a 400 or something21:28
*** marcusvrn1 has quit IRC21:29
*** marcusvrn has joined #openstack-qa21:29
mtreinishjlk: unfortunately it doesn't look like the doc changes have been pushed to the docs site yet21:29
mtreinishjlk: is there anything in the neutron logs?21:29
jlkthat's what I'm trying to find21:30
jlkI was hoping for some unique identifier from the tempest side I could use to search21:30
mtreinishjlk: that gets logged in the tempest log file :)21:30
mtreinishall the request ids21:30
jlk2015-04-23 21:26:55.927 20785 INFO tempest_lib.common.rest_client [req-4ec0f920-43eb-49fd-addd-4e286b36ec2b ] Request (ServersTestJSON:setUpClass): 503 POST https://bbg-staging-01.openstack.blueboxgrid.com:9797/v2.0/networks 0.390s21:30
jlkyeah21:30
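Editor's sketch for pulling the request id out of a tempest log line like the one jlk pasted, so it can be used as a search key on the service side. The regular expression is written against that single sample line and is an assumption about the format, and whether the same id appears in a given deployment's neutron logs is not guaranteed. parse_request is a hypothetical helper.

```python
import re

# Pattern derived from the one sample line in this log; other tempest
# versions may format the line differently.
REQ_LINE = re.compile(
    r"\[(req-[0-9a-f-]+)\s*\] Request \(([^)]*)\): (\d{3}) (\w+) (\S+)"
)


def parse_request(line):
    """Return request id, test, status, method and URL, or None."""
    match = REQ_LINE.search(line)
    if match is None:
        return None
    req_id, test, status, method, url = match.groups()
    return {
        "req_id": req_id,
        "test": test,
        "status": int(status),
        "method": method,
        "url": url,
    }


if __name__ == "__main__":
    sample = (
        "2015-04-23 21:26:55.927 20785 INFO tempest_lib.common.rest_client "
        "[req-4ec0f920-43eb-49fd-addd-4e286b36ec2b ] Request "
        "(ServersTestJSON:setUpClass): 503 POST "
        "https://bbg-staging-01.openstack.blueboxgrid.com:9797/v2.0/networks 0.390s"
    )
    print(parse_request(sample)["req_id"])
```

Filtering on status >= 500 gives a short list of ids worth grepping for in the service logs.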
openstackgerritMatthew Treinish proposed openstack/tempest: Set a tempest lock_path default to a tmpdir  https://review.openstack.org/17697521:31
andreafmtreinish: do you plan on adding a py34 unittest job to tempest?21:31
mtreinishandreaf: yep: https://review.openstack.org/17686921:32
mtreinishexperimental to start21:32
jlkso neutron says:21:32
jlk2015-04-23 21:23:25.610 18016 TRACE neutron.api.v2.resource NoNetworkAvailable: Unable to create the network. No tenant network is available for allocation.21:32
mtreinishand that returns a 50321:32
mtreinishmestery: ^^^ your stuff is weird again21:32
* mestery looks21:32
jlkwhat does that even mean? This is a pretty empty cluster, what resource isn't available?21:33
mtreinishI have no idea, I've not seen that before21:33
mtreinishjlk: heh, when the ptl is looking ask google: https://ask.openstack.org/en/question/29214/unable-to-create-the-network-no-tenant-network-is-available-for-allocation/ :)21:34
jlkmaybe I should just make a tenant and not use isolation21:34
mtreinishnot sure that works or not21:34
mtreinishjlk: sure that's allow_tenant_isolation in the auth section21:34
mtreinishbut make sure you run serially21:35
mtreinishotherwise the tests will fight with each other in parallel21:35
mtreinishjlk: while I'm waiting for the docs to get uploaded, from the review: http://docs-draft.openstack.org/26/176026/2/check/gate-tempest-docs/7e06adc//doc/build/html/configuration.html#networking21:36
jlkthanks21:37
mtreinishstill missing some details, but it was what I started with21:37
mtreinishjlk: the other thing is if network creations aren't working, you might have some issue running the neutron tests21:40
jlkyeah,21:40
jlkI'm going to try and create one via horizon and see what happens21:40
*** dkranz has quit IRC21:40
*** achanda has joined #openstack-qa21:41
jlkoh hrm21:41
jlkthis is a vlan setup21:41
jlkso maybe we can't make new vlan networks21:41
mtreinishoh, I could see that21:42
jlkso tenant_isolation always requires a tenant network?21:42
*** shashankhegde has quit IRC21:42
openstackrecheckopenstack/nova change: https://review.openstack.org/149371 failed gate-tempest-dsvm-neutron-full in the gate queue with an unrecognized error21:42
mtreinishjlk: that's how it's written now21:43
jlkalright I'll go without isolation21:43
jlkand figure out all the accounts I need to make21:43
mtreinishjlk: you could try using an accounts file if you want to run in parallel. That gives you more flexibility with setup21:43
mtreinishthere's a script up for review right now to automate creating one: https://review.openstack.org/#/c/172605/21:45
*** mpaolino has joined #openstack-qa21:45
jlklooks like using accounts has interesting limitations on the network stuff21:45
mtreinishalthough I haven't tried using it yet or reviewed the latest rev either21:45
mtreinishjlk: well if you use the static config options for providing accounts to tempest, they have to be on a shared network or have only 1 network available21:46
jlknod21:46
mtreinishusing an accounts.yaml file will let you tell tempest which network to use with which account21:46
mtreinishbut that code path is only like a week old21:48
mtreinishspecifying networks with accounts.yaml that is21:48
*** mpaolino has quit IRC21:49
openstackgerritSean Dague proposed openstack-dev/grenade: allow DEVSTACK_DIRS to be overridden  https://review.openstack.org/17670021:49
openstackgerritSean Dague proposed openstack-dev/grenade: add convenience tools  https://review.openstack.org/17411121:49
openstackgerritSean Dague proposed openstack-dev/grenade: make the ss debug include all services  https://review.openstack.org/17674821:49
openstackgerritSean Dague proposed openstack-dev/grenade: fix neutron grenade due to kilo rename of lib/neutron  https://review.openstack.org/17672321:49
mtreinishjlk: anyway I've got to run for now. I'll probably be back online later tonight at some point.21:50
jlkthanks21:51
mtreinishjlk: if you could file a bug for the tenant isolation w/ vlan provider networks that would be awesome21:52
jlkthanks21:53
jlker yeah21:53
mtreinishI think we can easily add a config option to disable the network autocreation in tenant isolation21:53
jlkoh that'd be sweet.21:53
mtreinishbut I don't want to forget about it :)21:53
*** Sukhdev has quit IRC21:59
jlkmtreinish: https://bugs.launchpad.net/tempest/+bug/144782922:01
openstackLaunchpad bug 1447829 in tempest "allow_tenant_isolation network creation fails with vlan networks" [Undecided,New]22:01
*** Sukhdev has joined #openstack-qa22:01
*** johnthetubaguy is now known as zz_johnthetubagu22:01
*** suneelb has quit IRC22:01
*** suneelb has joined #openstack-qa22:02
*** suneelb1 has joined #openstack-qa22:02
*** suneelb1 has quit IRC22:02
*** suneelb1 has joined #openstack-qa22:02
jlkgrr next up, something keeps looking for 'public' as a key in server['addresses']22:03
jlksomething is defaulting to 'public' as a network name22:03
*** Kennan has quit IRC22:03
*** Kennan has joined #openstack-qa22:03
*** mriedem is now known as mriedem_away22:04
*** andreykurilin__ has quit IRC22:04
*** suneelb2 has joined #openstack-qa22:05
*** suneelb3 has joined #openstack-qa22:06
*** suneelb2 has quit IRC22:06
*** suneelb1 has quit IRC22:06
*** dboik has joined #openstack-qa22:06
*** suneelb has quit IRC22:06
*** Kennan2 has joined #openstack-qa22:07
*** dboik_ has joined #openstack-qa22:08
*** Kennan has quit IRC22:08
*** openstackstatus has quit IRC22:09
*** dboik has quit IRC22:10
openstackgerritIan Wienand proposed openstack-dev/devstack: Remove the lvm.conf filter during cleanup  https://review.openstack.org/17688922:12
*** paul_glass has quit IRC22:15
*** greghaynes has quit IRC22:18
openstackgerritIan Wienand proposed openstack-infra/devstack-gate: Fix swapiness sysctl set  https://review.openstack.org/17700622:22
*** mlavalle has joined #openstack-qa22:22
openstackgerritIan Wienand proposed openstack-infra/devstack-gate: Add -e to gate wrap script  https://review.openstack.org/17701722:26
jlkokay, seems tempest doesn't like A) fixed_network definition, B) provided accounts, so not isolated, and C) needing floating IPs from a shared network to ssh in22:33
*** bknudson has quit IRC22:33
*** mlavalle has quit IRC22:39
*** mlavalle has joined #openstack-qa22:39
mtreinishjlk: thanks for filing the bug22:41
mtreinishjlk: all 3 of those are config options in tempest iirc22:41
mtreinishA should be fixed_network_name in the compute section22:41
jlkyeah22:41
jlkthey are22:41
mtreinishI thought hogepodge was actually running with that config in his testing22:42
jlkbut it seems that in some test setups, even if the use_floatingip_for_ssh is set to true, the tests still try to find the network_for_ssh network in the server info22:42
jlkwhich that defaults to 'public'22:42
jlkso it's trying to find a public network22:42
jlkeven if it doesn't use it, which causes a backtrace22:43
jlkI think I just managed to fool it by putting the fixed network name value in for network_for_ssh22:43
mtreinishjlk: there is a 'ssh_connect_method' option in the compute group22:44
mtreinishI think you can set that to fixed to stop it from trying to use floating ip22:44
jlkno, I need to use floating IP22:44
mtreinishoh22:44
mtreinishso it's just a case of a stupid default again...22:45
jlkwell22:45
jlkit's a case of trying to get that data when it's not needed22:45
jlklet me see if I can find the code22:45
mtreinishfwiw there is a patch series up to try and make all the ssh options a bit cleaner22:45
*** shashankhegde has joined #openstack-qa22:45
jlkhttps://github.com/openstack/tempest/blob/master/tempest/common/utils/linux/remote_client.py#L3722:46
jlkthat code block in there doesn't care if use_floatingip_for_ssh is set22:47
jlkso it always tries to find the address for network_for_ssh22:47
mtreinishah, ok yeah that's a bug22:47
mtreinishalso maybe a duplicate option too22:47
jlkI could start a fix for this, but I wouldn't be able to quickly create unit tests around it22:48
openstackrecheckopenstack/glance change: https://review.openstack.org/176956 failed gate-tempest-dsvm-full in the gate queue with an unrecognized error22:48
*** mvandijk has quit IRC22:48
mtreinishjlk: well there is a refactor up for a bunch of this here: https://review.openstack.org/#/q/status:open+project:openstack/tempest+branch:master+topic:bp/ssh-auth-strategy,n,z22:48
mtreinishI need to prioritize reviews on that series22:49
jlkah let me see if any of that touches this code22:49
jlkoh yes, yes it does22:49
*** mvandijk has joined #openstack-qa22:50
*** erlon has quit IRC22:51
jlkhrm, maybe it's that RemoteClient is supposed to be called with the IP address string when use_floatingip_for_ssh is set22:55
jlkthat makes more sense22:55
jlkand maybe some tests aren't doing that right22:55
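Editor's sketch of the calling convention jlk is guessing at: the caller either passes an address string it has already chosen (such as a floating IP), or passes the server body and lets the helper look up network_for_ssh. This is not tempest's RemoteClient. resolve_ssh_ip is hypothetical, and the only assumption taken from the compute API is the shape of the server "addresses" mapping (network name to a list of entries with "addr" and "version").

```python
def resolve_ssh_ip(server, network_for_ssh="public", ip_version=4):
    """Pick the address to ssh to. Hypothetical helper, not tempest code."""
    # A plain string is an address the caller already chose, for example a
    # floating IP, so the network lookup is skipped entirely.
    if isinstance(server, str):
        return server
    addresses = server["addresses"]
    if network_for_ssh not in addresses:
        # This is the failure mode described above: the default network
        # name does not exist on the server.
        raise KeyError(
            "server has no addresses on network %r" % network_for_ssh
        )
    for address in addresses[network_for_ssh]:
        if address["version"] == ip_version:
            return address["addr"]
    raise ValueError(
        "no IPv%d address on network %r" % (ip_version, network_for_ssh)
    )


if __name__ == "__main__":
    server = {"addresses": {"private": [{"addr": "10.0.0.2", "version": 4}]}}
    print(resolve_ssh_ip("203.0.113.10"))
    print(resolve_ssh_ip(server, network_for_ssh="private"))
```

With this shape, a test that allocates a floating IP passes the string and never depends on the network_for_ssh default.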
mtreinishjlk: do you have run_ssh option enabled? There are some tests which don't use it right22:56
jlkyes22:57
mtreinishwe don't gate on it so there are probably bugs with it22:57
jlkwell22:57
mtreinishwe probably should add that to the help for that option22:57
mtreinishthat flag isn't globally respected so some tests will still ssh in22:57
mtreinishjust not as many22:57
mtreinishthis whole section of tempest is a mess22:57
mtreinishit was all added ad hoc over time and no one really thought about it22:58
jlkokay I think I was overzealous in trying to get ssh working22:58
jlklet me remove that and see where we get22:58
jlkseems like every place that would do run_ssh needs to check for floating_ip stuff and create it if required22:59
jlkor move that code into the RemoteClient23:00
*** annashen has quit IRC23:00
*** annashen has joined #openstack-qa23:00
mtreinishjlk: yeah I'm pretty sure that's what the bp is addressing. It's going to break out all of those options into a separate config section and centrally check them23:01
*** suneelb3 has quit IRC23:01
mtreinishthen refactor all the create server calls to use them23:01
*** suneelb has joined #openstack-qa23:01
jlkalright23:01
*** suneelb has quit IRC23:01
*** suneelb has joined #openstack-qa23:02
jlkI've got run_ssh turned off this time, we'll see which ones fail this time around.23:02
*** melwitt has quit IRC23:02
*** rbak has quit IRC23:03
*** suneelb1 has joined #openstack-qa23:03
*** annashen has quit IRC23:05
jlkoooh ooh! I think it's working!23:06
*** suneelb has quit IRC23:06
*** hemna is now known as hemnafk23:07
mtreinishjlk: awesome23:07
mtreinishyeah, I think you were just hitting a broken test when run_ssh is enabled23:07
jlkyeah, really seems like it23:08
jlkI thought I needed ssh for one of these tests to work23:08
jlkRan 113 tests in 466.223s23:09
jlkFAILED (failures=1)23:09
jlkHUZZAH23:09
jlkthat one failure is a policy fix23:09
*** mlavalle has quit IRC23:09
mtreinishawesome23:11
hogepodgejlk ooh, send me your configuration23:11
jlkwell, a number of SKIPPED tests, which may mean not passing the defcore assertions. Gotta review those23:11
hogepodgeI'd like to see if I can reproduce ssh passing on my own cloud23:12
hogepodge(of course, redact any private stuff like passwords)23:12
jlkhogepodge: I just turned off run_ssh23:12
hogepodgeAh23:12
*** ajmiller_ is now known as ajmiller23:12
mtreinishhogepodge: well run_ssh is kinda in a weird state right now because it's not gated and not globally honored23:12
mtreinishso some tests do ssh, regardless of what that config flag is23:12
mtreinishand most of the ones which listen to it don't work when it's enabled23:13
jlkyeah, particularly when floatingips23:13
mtreinishthat bp is trying to fix it, I think the last time they ran a test patch on top of it with run_ssh enabled it worked23:13
hogepodgemtreinish: do you think this problem will be fixed in liberty? If not, we're going to have to pull tests that rely on it from defcore. It's not reasonable to expect tests we can't test.23:14
jlkhogepodge: so if I'm using the 2015.04.required.txt test list, do all of them have to be OK, non SKIP ?23:14
mtreinishhogepodge: yeah it'll definitely be fixed in liberty23:14
mtreinishthe reviews are up already23:14
hogepodgejlk everything has to be ok, except for those marked as flagged in the json file. We leave them in there because they're still capabilities we want. It's weird23:14
hogepodgeSo if tempest fixed them today, it's likely we would move our testing hash to a working version of tempest so we could remove flagged status.23:15
jlkokay looks like just 3 flagged23:16
jlkRan 113 tests in 299.370s23:16
jlkOK23:16
hogepodgejlk wow, impressive. there's one flagged swift test too23:16
hogepodge(maybe two)23:16
hogepodgefor swift it's best to use the accounts.yaml because you need to have roles assigned23:17
hogepodgeIt's documented in tempest.conf23:17
jlkokay23:17
mtreinishhogepodge: well you can use tenant isolation too :)23:17
jlkI'll be tackling swift next week23:17
openstackrecheckConsole logs not available after 13:26s for check-grenade-dsvm-neutron 176012,2,4deca5e23:17
mtreinishhogepodge: have you played with network support in accounts.yaml? I'm curious if that works for you guys23:18
hogepodgemtreinish: tenant isolation assumed admin credentials, though. So yes if you have them.23:18
mtreinishbecause my test for that code path was kinda synthetic23:18
mtreinishhogepodge: yeah it does. Eventually we want to be able to have domain admin support in a v3 only env23:18
mtreinishbut that's probably a long way off23:18
hogepodgemtreinish: not yet. I'm prepping a kilo demo right now, on top of handling incoming test requests.23:19
openstackgerritAdolfo Duarte proposed openstack/tempest: Tests to cover FIP in dvr multinode setup  https://review.openstack.org/17703323:20
mtreinishhogepodge: ok, no worries I was just curious23:23
*** wuhg has joined #openstack-qa23:28
*** sabeen2 has quit IRC23:28
*** marun has quit IRC23:34
*** dimtruck is now known as zz_dimtruck23:34
*** Swami has quit IRC23:35
*** hichihara has joined #openstack-qa23:36
openstackgerritAdolfo Duarte proposed openstack/tempest: Tests to cover FIP in dvr multinode setup  https://review.openstack.org/17703323:38
*** ajmiller has quit IRC23:41
*** tochi has joined #openstack-qa23:42
jlkhogepodge: this test, verify_created_server_vcpus relies on run_ssh, which is broken: https://github.com/openstack/tempest/blob/master/tempest/api/compute/servers/test_create_server.py#L98-L10723:48
jlkhogepodge: but this test isn't flagged.23:48
jlkhogepodge: same with test_attach_detach_volume https://github.com/openstack/tempest/blob/master/tempest/api/compute/volumes/test_attach_volume.py#L87-L9023:50
mtreinishhogepodge, jlk: fwiw I feel like that's a weird test to require. It's mostly testing libvirt (or whatever hypervisor) boots with the right number of cpus. That doesn't feel very interopy (for lack of a better word)23:51
jlkmtreinish: also it seems to rely upon flavor metadata for VCPU count23:51
jlkwhich isn't a standard thing?23:52
jlkoh wait23:52
jlkderp. sorry I was thinking about image metadata23:52
jlknot flavor, disregard me23:52
*** suneelb1 has quit IRC23:53
mtreinishjlk: there is at least a similar test which doesn't ssh to validate the attachment: https://github.com/openstack/tempest/blob/master/tempest/api/volume/test_volumes_actions.py#L6223:54
jlknod23:54
mtreinishjlk: also I don't think that test with ssh will be reliable at all, we have a similar test in scenario and it's been skipped for forever, because the device name is rarely what you expect it to be23:58
mtreinishhttp://git.openstack.org/cgit/openstack/tempest/tree/tempest/scenario/test_stamp_pattern.py23:58
jlkoh man23:58
jlkso when using a static user list23:58
jlkthere are some tests that seem to fail upon repeated runs23:59
jlkthings such as ssh key creation23:59
jlkand metadata setting23:59
mtreinishjlk: using accounts.yaml or the config options?23:59
jlkaccounts.yaml23:59
mtreinishbut either way it's clearly a bug23:59
