Friday, 2017-10-27

*** kumarmn has quit IRC00:08
*** kumarmn has joined #openstack-trove00:09
*** itlinux has joined #openstack-trove00:10
*** kumarmn has quit IRC00:13
*** kumarmn has joined #openstack-trove00:15
*** kumarmn has quit IRC00:18
*** kumarmn has joined #openstack-trove00:18
*** rcernin has joined #openstack-trove00:21
*** kumarmn has quit IRC00:23
*** kumarmn has joined #openstack-trove00:24
*** chhavi has joined #openstack-trove00:34
*** chhavi has quit IRC00:38
*** kumarmn has quit IRC00:39
*** kumarmn has joined #openstack-trove00:39
*** kumarmn has quit IRC00:43
*** gouthamr has joined #openstack-trove00:49
*** smatzek has joined #openstack-trove01:02
*** Kevin_Zheng has joined #openstack-trove01:21
*** fanzhang has quit IRC01:24
*** fanzhang_ has joined #openstack-trove01:24
smatzekhi fanzhang_. I have noted some of your reviews but haven't had a chance to give them a good review yet.  I'm still very focused on getting the master, stable and troveclient gates working so we can deliver code in general.01:25
smatzekwould you mind watching https://review.openstack.org/#/c/514710 during your day and doing rechecks if/when it fails? I just did a recheck so it will be another 3 hours or so until it succeeds or fails again.01:26
*** kumarmn has joined #openstack-trove01:26
smatzekI've found one case where some nodes that the jobs get scheduled on have older versions of qemu and libvirt and when that happens the nested DB VM fails to boot, nova sits forever in the build state and the tests fail.  I have a fix in mind which I'll try to write and test out tomorrow.01:27
fanzhang_morning smatzek01:29
fanzhang_I'll do some rechecks if necessary, just go to sleep and relax :)01:31
*** kumarmn has quit IRC01:36
*** kumarmn has joined #openstack-trove01:37
*** kumarmn has quit IRC01:41
*** rcernin has quit IRC01:43
*** rcernin has joined #openstack-trove01:44
*** fanzhang_ is now known as fanzhang01:44
*** smatzek has quit IRC01:50
openstackgerritjian.song proposed openstack/trove master: Fix requirepass problem with redis  https://review.openstack.org/51495502:13
*** zhaochao has quit IRC02:15
*** zhaochao has joined #openstack-trove02:16
*** gcb has joined #openstack-trove02:16
*** wong has joined #openstack-trove02:17
wongmorning02:17
fanzhangmorning:)02:34
openstackgerritOpenStack Proposal Bot proposed openstack/python-troveclient master: Updated from global requirements  https://review.openstack.org/50003303:18
*** chhavi has joined #openstack-trove04:11
*** kumarmn has joined #openstack-trove04:40
*** kumarmn has quit IRC04:51
*** tianhui has quit IRC06:10
openstackgerritDavid Rabel proposed openstack/trove stable/pike: Remove Mitaka reference in install/dashboard.rst  https://review.openstack.org/51560906:13
*** gouthamr has quit IRC06:21
*** spectr has joined #openstack-trove06:22
*** magicboiz has joined #openstack-trove06:27
*** magicboiz has quit IRC06:32
*** magicboiz has joined #openstack-trove06:32
*** kumarmn has joined #openstack-trove06:51
*** magicboiz has quit IRC06:53
*** magicboiz has joined #openstack-trove06:55
*** kumarmn has quit IRC06:55
*** magicboiz has quit IRC07:00
*** magicboiz has joined #openstack-trove07:07
openstackgerritOpenStack Proposal Bot proposed openstack/trove-dashboard stable/pike: Imported Translations from Zanata  https://review.openstack.org/49376107:07
*** tesseract has joined #openstack-trove07:22
*** itlinux has quit IRC07:43
*** rcernin has quit IRC07:58
*** gcb has quit IRC08:00
*** magicboiz has quit IRC08:00
*** gcb has joined #openstack-trove08:03
*** magicboiz has joined #openstack-trove08:03
*** magicboiz has quit IRC08:08
*** magicboiz has joined #openstack-trove08:15
*** dasTor has joined #openstack-trove08:37
*** tianhui has joined #openstack-trove09:10
*** tosky has joined #openstack-trove09:17
*** gcb has quit IRC09:37
*** tianhui has quit IRC09:38
*** tianhui_ has joined #openstack-trove09:38
*** links has quit IRC09:42
*** kumarmn has joined #openstack-trove09:51
*** maciejjozefczyk has joined #openstack-trove09:52
*** maciejjo1 has quit IRC09:55
*** kumarmn has quit IRC09:56
*** lijinhui has left #openstack-trove10:07
*** daidv has quit IRC10:12
*** robcresswell has quit IRC11:03
*** smatzek has joined #openstack-trove11:15
*** kumarmn has joined #openstack-trove11:52
*** kumarmn has quit IRC11:57
*** magicboiz has quit IRC12:16
openstackgerritSamuel Matzek proposed openstack/python-troveclient master: WIP: Fix gate / add tempest job  https://review.openstack.org/51539312:42
*** kei_yama has quit IRC13:09
openstackgerritOpenStack Proposal Bot proposed openstack/python-troveclient stable/pike: Updated from global requirements  https://review.openstack.org/50588613:46
openstackgerritOpenStack Proposal Bot proposed openstack/trove stable/pike: Updated from global requirements  https://review.openstack.org/49320313:49
*** spectr has quit IRC13:49
*** zhaochao has quit IRC13:53
*** kumarmn has joined #openstack-trove13:57
*** kumarmn has quit IRC14:00
*** kumarmn has joined #openstack-trove14:00
*** tianhui_ has quit IRC14:00
*** tianhui has joined #openstack-trove14:01
*** gouthamr has joined #openstack-trove14:01
*** Dinesh_Bhor has quit IRC14:04
*** tosky has quit IRC14:06
*** tosky has joined #openstack-trove14:08
*** spectr has joined #openstack-trove14:09
*** McClymontS has joined #openstack-trove14:16
*** McClymontS has quit IRC14:26
*** kumarmn has quit IRC14:38
*** kumarmn has joined #openstack-trove14:38
*** spectr has quit IRC14:39
*** kumarmn has quit IRC14:45
*** tosky has quit IRC15:12
*** links has joined #openstack-trove15:19
openstackgerritSamuel Matzek proposed openstack/trove master: WIP: Scope the use of kvm virt type in gate  https://review.openstack.org/51574115:26
*** McClymontS has joined #openstack-trove15:45
*** McClymontS has quit IRC15:47
*** dasTor has quit IRC15:49
*** itlinux has joined #openstack-trove15:57
*** robcresswell has joined #openstack-trove16:19
*** tosky has joined #openstack-trove16:19
*** links has quit IRC16:38
openstackgerritMerged openstack/trove master: Fix requirepass problem with redis  https://review.openstack.org/51495517:09
openstackgerritSamuel Matzek proposed openstack/python-troveclient master: Fix gate / add tempest job  https://review.openstack.org/51539317:18
*** itlinux has quit IRC17:28
smatzekgate status update:  master is working fairly well, we're able to merge, with a few rechecks.17:40
smatzekonce https://review.openstack.org/#/c/514769/ and https://review.openstack.org/#/c/515393/ merge, the python-troveclient gate should be unblocked17:41
smatzekstable/pike remains blocked waiting for https://review.openstack.org/#/c/514710/ to pass rechecks17:42
smatzekI've been investigating a timeout failure where the instance fails with a KVM error like this http://logs.openstack.org/10/514710/3/gate/legacy-trove-scenario-dsvm-mysql-multi/b05115e/logs/libvirt/qemu/instance-00000001.txt.gz17:43
smatzekThis only happens when Trove's devstack plugin reconfigures Nova to use kvm virt_type.  However, I don't want to disable that reconfigure as it greatly increases our gate job success ratio.17:44
smatzekIn my initial investigation, all the successes I saw were using later versions of libvirt and qemu.  This morning, however, I've found successes with the older versions of libvirt and qemu as well.17:45
smatzekThis failure is similar to this bug https://bugs.launchpad.net/ubuntu/+source/linux-lts-xenial/+bug/168207717:46
openstackLaunchpad bug 1682077 in linux-lts-xenial (Ubuntu) "nested KVM fails - KVM: entry failed, hardware error 0x0 " [High,Confirmed]17:46
smatzekthat bug points at the kernel of the libvirt host or the host of that VM if it is a VM.17:46
smatzekit may also have something to do with the processor model.  I have seen failures only with (Haswell, no TSX) processors, but we also have one success on that processor model.  This is leading me to believe the issue lies in the host of the VM the jobs are running in, probably in its kernel.17:48
smatzekThe next round of investigation is to gather the hosts from all the successes and fails I can find where we've set virt_type kvm to see if there is a pattern.  From the data set I have so far, the hostnames of the devstack VMs that have failed are *-ovh-gra1 and I have no successes with that hostname.17:49
smatzekI do have a success on  ubuntu-xenial-ovh-bhs117:50
smatzekone thing I missed above is that the kernel level of the devstack VM is the same between successes and failures.17:52
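The host comparison described above (collect the devstack VM hostnames of passing and failing runs, then look for a pattern per cloud) could be sketched roughly as follows. This is only an illustration: the helper names `provider_region` and `tally` are hypothetical, the sample rows are made-up placeholders rather than real gate results, and the assumption that a hostname ends in `<provider>-<region>` (optionally followed by a numeric id) is inferred from names like ubuntu-xenial-ovh-bhs1.

```python
from collections import Counter, defaultdict


def provider_region(hostname):
    """Extract '<provider>-<region>' from a node hostname.

    Assumption: hostnames look like 'ubuntu-xenial-ovh-bhs1', possibly
    followed by a purely numeric id, which is dropped.
    """
    parts = hostname.split("-")
    if parts and parts[-1].isdigit():
        parts = parts[:-1]
    return "-".join(parts[-2:])


def tally(runs):
    """Count outcomes per provider/region for (hostname, outcome) pairs."""
    counts = defaultdict(Counter)
    for hostname, outcome in runs:
        counts[provider_region(hostname)][outcome] += 1
    return {key: dict(value) for key, value in counts.items()}


# Placeholder rows for illustration only, not real job results.
runs = [
    ("ubuntu-xenial-ovh-gra1", "fail"),
    ("ubuntu-xenial-ovh-gra1", "fail"),
    ("ubuntu-xenial-ovh-bhs1", "success"),
]

for key, outcomes in sorted(tally(runs).items()):
    print(key, outcomes)
```

Grouping by the last two hostname components keeps the provider and region together, so a pattern tied to one cloud but not one region would still show up when the keys are read side by side.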
*** itlinux has joined #openstack-trove17:53
*** harlowja has quit IRC18:05
*** harlowja has joined #openstack-trove18:05
*** smatzek has quit IRC18:52
*** chhavi has quit IRC19:02
*** smatzek has joined #openstack-trove19:27
*** tesseract has quit IRC19:33
*** smatzek has quit IRC19:47
*** smatzek has joined #openstack-trove19:55
*** rcernin has joined #openstack-trove19:57
openstackgerritSamuel Matzek proposed openstack/trove master: Do not configure kvm virt_type in devstack  https://review.openstack.org/51581419:59
*** itlinux has quit IRC20:01
*** miqui has joined #openstack-trove20:04
*** smatzek has quit IRC20:05
*** McClymontS has joined #openstack-trove20:39
*** itlinux has joined #openstack-trove20:41
*** itlinux has quit IRC21:07
*** smatzek has joined #openstack-trove21:07
*** itlinux has joined #openstack-trove21:15
smatzekamrith, given the above ^, I've looked into whether it is host specific and whether there are bread crumbs in the logs to help.  What I've found so far is that all the failures I've found happen in the ovh cloud, but it's not tied to a specific region.  There are two runs, one that succeeded and one that failed, and both were in the bhs1 region of the ovh cloud.21:16
smatzekI'm not sure where to go from here. I could check host name and avoid setting virt_type kvm on ovh cloud instances, but putting nodepool specific info in the trovestack devstack plugin seems wrong.21:17
smatzekI could try to fire up a VM of some sort in the devstack plugin and see if it fails with kvm virt, but that seems like a lot of work and overkill.  The patch to fix the stable/pike gate has hit this particular problem several times in its multiple rechecks and, given that, we may actually have a higher overall success rate if we run with qemu all the time.21:19
smatzekI have a patch up against master to test this.  I have seen the tests succeed often enough with qemu.21:19
smatzekI don't really want to throw out the kvm virt_type runs since they do go faster, but I'm on the fence here.  Thoughts?21:20
smatzekthe two runs on the same region in ovh:21:25
smatzekhttp://logs.openstack.org/10/514710/3/gate/legacy-trove-scenario-dsvm-mysql-multi/7fbfa7021:25
smatzekhttp://logs.openstack.org/10/514710/3/gate/legacy-trove-scenario-dsvm-mysql-single/9ca522e21:26
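For reference, the hostname-based option weighed above (skip kvm on ovh nodes, use it elsewhere) would amount to something like the sketch below. It is a hypothetical illustration in Python, not code from the Trove devstack plugin; the function name `choose_virt_type`, the `-ovh-` marker, and the second hostname in the usage lines are all assumptions made for the example.

```python
def choose_virt_type(hostname, avoid_markers=("-ovh-",)):
    """Pick a Nova virt_type for a gate node based on its hostname.

    Assumption: nodes whose hostname contains one of avoid_markers are
    the ones where nested kvm has been seen to fail, so they fall back
    to plain qemu emulation. Every other node keeps kvm.
    """
    if any(marker in hostname for marker in avoid_markers):
        return "qemu"
    return "kvm"


# Hostname taken from the discussion, then a made-up non-ovh hostname.
print(choose_virt_type("ubuntu-xenial-ovh-bhs1"))
print(choose_virt_type("ubuntu-xenial-example-region1"))
```

The drawback raised in the discussion applies directly: the marker list is nodepool-specific knowledge that would have to be maintained inside the plugin.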
*** McClymontS has quit IRC21:27
*** rcernin has quit IRC21:28
*** itlinux has quit IRC21:44
openstackgerritOpenStack Proposal Bot proposed openstack/python-troveclient stable/pike: Updated from global requirements  https://review.openstack.org/50588622:03
openstackgerritOpenStack Proposal Bot proposed openstack/python-troveclient master: Updated from global requirements  https://review.openstack.org/50003322:05
openstackgerritOpenStack Proposal Bot proposed openstack/trove stable/pike: Updated from global requirements  https://review.openstack.org/49320322:06
openstackgerritOpenStack Proposal Bot proposed openstack/python-troveclient master: Updated from global requirements  https://review.openstack.org/50003322:07
*** smatzek has quit IRC22:09
*** smatzek has joined #openstack-trove22:10
*** smatzek has quit IRC22:11
*** smatzek has joined #openstack-trove22:11
*** smatzek has quit IRC22:16
*** itlinux has joined #openstack-trove22:33
*** itlinux has quit IRC22:46
*** kumarmn has joined #openstack-trove22:54
*** tosky has quit IRC23:00
*** kumarmn has quit IRC23:03
*** kumarmn has joined #openstack-trove23:04
*** kumarmn has quit IRC23:08
