Monday, 2020-09-07

*** tkajinam has joined #openstack-nova00:00
*** zhanglong has joined #openstack-nova00:46
*** Liang__ has joined #openstack-nova01:12
*** spatel has joined #openstack-nova01:26
*** spatel has quit IRC01:30
*** zzzeek has quit IRC01:42
openstackgerritWenping Song proposed openstack/nova-specs master: Support vGPU in nova and cyborg interaction  https://review.opendev.org/75011601:42
*** zzzeek has joined #openstack-nova01:45
*** zhanglong has quit IRC01:52
*** iurygregory has quit IRC02:03
*** suryasingh has joined #openstack-nova02:15
*** jamesdenton has quit IRC02:30
*** zzzeek has quit IRC02:30
*** zzzeek has joined #openstack-nova02:30
*** jamesdenton has joined #openstack-nova02:30
*** mkrai has joined #openstack-nova02:54
*** sapd1_x has joined #openstack-nova03:20
*** euclidsun has joined #openstack-nova03:24
*** mkrai has quit IRC03:26
*** mkrai has joined #openstack-nova03:27
*** psachin has joined #openstack-nova03:39
*** euclidsun has quit IRC03:46
*** spatel has joined #openstack-nova04:18
*** links has joined #openstack-nova04:21
*** euclidsun has joined #openstack-nova04:26
*** euclidsun has left #openstack-nova04:26
*** evrardjp has quit IRC04:33
*** evrardjp has joined #openstack-nova04:33
*** ratailor has joined #openstack-nova04:34
*** vishalmanchanda has joined #openstack-nova04:38
*** euclidsun has joined #openstack-nova05:24
*** euclidsun has quit IRC05:31
*** sapd1_x has quit IRC05:33
*** brinzhang has joined #openstack-nova05:36
*** jsuchome has joined #openstack-nova05:43
*** spatel has quit IRC05:56
*** songwenping_ has quit IRC06:05
*** swp20 has joined #openstack-nova06:05
*** zzzeek has quit IRC06:08
*** zzzeek has joined #openstack-nova06:09
*** mkrai has quit IRC06:13
*** mkrai_ has joined #openstack-nova06:13
*** ralonsoh has joined #openstack-nova06:26
*** zzzeek has quit IRC06:32
*** zzzeek has joined #openstack-nova06:35
*** ralonsoh_ has joined #openstack-nova06:55
*** ralonsoh has quit IRC06:57
*** Yumeng has joined #openstack-nova07:01
*** slaweq has joined #openstack-nova07:08
*** mkrai_ has quit IRC07:09
*** mkrai has joined #openstack-nova07:10
*** nightmare_unreal has joined #openstack-nova07:11
*** tesseract has joined #openstack-nova07:12
*** mkrai has quit IRC07:15
*** sapd1_x has joined #openstack-nova07:31
bauzasgood morning Nova07:35
gibibauzas: good morning07:37
*** rcernin has quit IRC07:37
*** owalsh has quit IRC07:48
*** sapd1_x has quit IRC07:49
*** sapd1_x has joined #openstack-nova07:49
*** owalsh has joined #openstack-nova07:52
*** iurygregory has joined #openstack-nova08:07
*** damien_r has quit IRC08:27
*** damien_r has joined #openstack-nova08:28
*** mkrai has joined #openstack-nova08:28
brinzhangbauzas, gibi: good morning08:29
brinzhangbauzas, gibi: two backport patches hope you can review, trivial changes https://review.opendev.org/#/c/749681/ , https://review.opendev.org/#/c/749701/08:30
*** derekh has joined #openstack-nova08:31
*** jaosorior has joined #openstack-nova08:44
*** k_mouza has joined #openstack-nova08:52
openstackgerritBalazs Gibizer proposed openstack/nova master: Use UUID as vif and network_id in vif tests  https://review.opendev.org/74872208:53
openstackgerritBalazs Gibizer proposed openstack/nova master: Support SRIOV interface attach and detach  https://review.opendev.org/74099508:53
*** zzzeek has quit IRC08:58
*** zzzeek has joined #openstack-nova08:59
openstackgerritLiang Fang proposed openstack/nova master: Add volume local cache support  https://review.opendev.org/66354209:02
elodgibi: if you will have time: https://review.opendev.org/#/c/750068/ :) (Stein release patch)09:04
*** Liang__ has quit IRC09:06
openstackgerritBalazs Gibizer proposed openstack/nova master: Make PCI claim NUMA aware during live migration  https://review.opendev.org/74845309:09
gibielod: done09:10
gibithanks09:10
*** stephenfin has joined #openstack-nova09:11
gibistephenfin: hi! the nova-multi-cell failure in the vtpm resize patch seems to be relevant09:12
stephenfinack, looking09:13
gibi https://zuul.opendev.org/t/openstack/build/7326bc59092346b18fe9858774606321/log/compute1/logs/screen-n-cpu.txt?severity=4#774109:13
*** jangutter_ has joined #openstack-nova09:15
*** jangutter has quit IRC09:18
elodgibi: thx! \o/09:18
luyaostephenfin: Hi, not sure whether you saw my message last week, just a kind remind for vpmem-enhencement https://review.opendev.org/#/q/topic:bp/vpmem-enhancement+(status:open+OR+status:merged) . I move the patch 'improve orphans tracking' to the last of the patch sequence (we don't have any big concern on the other 3 patches I think), I redefined those orphans  in updated patch since previous version involved the09:24
luyaobug #1879878, and I didn't notice you have fixed it.09:24
openstackbug 1879878 in OpenStack Compute (nova) "VM become Error after confirming resize with Error info CPUUnpinningInvalid on source node " [Medium,In progress] https://launchpad.net/bugs/1879878 - Assigned to Stephen Finucane (stephenfinucane)09:24
stephenfinluyao: ack, will take a look this afternoon09:25
luyaostephenfin: Thank you in advance. :)09:25
bauzasgosh, morning hell here09:34
*** ralonsoh_ is now known as ralonsoh09:45
openstackgerritWenping Song proposed openstack/nova-specs master: Support vGPU in nova and cyborg interaction  https://review.opendev.org/75011609:47
*** xek has joined #openstack-nova09:50
*** xek has quit IRC09:56
nightmare_unreallive-migration force node is not working with v2.67 , In v.268 force live migration was completely removed but it should work with v2.6709:59
* bauzas is about to cry...10:10
bauzaswhy don't we have good api reference for https://docs.openstack.org/api-ref/network/v2/index.html#subnets ?10:10
*** k_mouza has quit IRC10:11
noonedeadpunkhi everyone. any chance you know why nova-compute may fail on centos7 for master and ussuri that way? https://zuul.opendev.org/t/openstack/build/6add842202c34390959d5fd0bd6fc83b/log/logs/host/nova-compute.service.journal-15-37-20.log.txt#2729-276810:11
noonedeadpunkcentos8, debian,ubuntu feel ok with the same setup path and configs10:13
bauzasgibi: do you know how I can know the API reference for getting the subnets information ?10:13
bauzasgibi: looking at https://docs.openstack.org/neutron/latest/admin/config-routed-networks.html#example10:13
bauzasgibi: it looks to me you can get the related segment of a subnet10:14
bauzasopenstack subnet show my_subnet --c segment_id10:14
bauzasbut i guess it's a neutron extension10:14
bauzasahah, that's the reference which is confusing10:17
bauzashttps://docs.openstack.org/api-ref/network/v2/index.html?expanded=show-subnet-details-detail#show-subnet-details10:17
*** tobias-urdin has joined #openstack-nova10:25
*** psachin has quit IRC10:25
*** k_mouza has joined #openstack-nova10:26
*** zzzeek has quit IRC10:27
*** zzzeek has joined #openstack-nova10:30
*** tosky has joined #openstack-nova10:33
gibibauzas: sorry, I was afk10:33
*** kaisers1 has quit IRC10:37
*** dtantsur|afk is now known as dtantsur10:38
*** jawad_axd has joined #openstack-nova10:50
*** kaisers has joined #openstack-nova10:53
*** ratailor has quit IRC10:57
*** k_mouza has quit IRC11:05
*** rcernin has joined #openstack-nova11:06
*** k_mouza has joined #openstack-nova11:09
*** vishalmanchanda has quit IRC11:17
*** lee1 has joined #openstack-nova11:26
lee1kashyap: morning, random question, have you had to debug device detach issues before through libvirt and the guestOS and if so do you have any tips?11:27
*** lee1 is now known as lyarwood11:27
lyarwoodkashyap: context is https://bugs.launchpad.net/nova/+bug/188252111:27
openstackLaunchpad bug 1882521 in OpenStack Compute (nova) "Failing device detachments on Focal" [Critical,Confirmed] - Assigned to Lee Yarwood (lyarwood)11:27
kashyapAck; back here in a min :)11:27
lyarwoodkashyap: I can reproduce while running the full suite of tests and I'm pretty sure it's just an issue of the guestOS (cirros) not being able to process the request but I just want to prove it somehow11:28
lyarwoodack np11:28
kashyaplyarwood: lee1 was your nick too, I suppose?11:29
* kashyap is reading; and morning/afternoon11:29
kashyaplyarwood: I vaguely recall some triaging some device detach issues; but I forget the details.  Gimme a few11:32
kashyaplyarwood: Are you also implying that this is not reproducible with non-CirrOS guests?11:32
kashyapOkay, you say as much in #511:34
kashyap"Each time this has been hit however it appears that the Guest OS (cirros) isn't able to react to the ACPI request to detach the disk device. "11:34
lyarwoodkashyap: yeah that's my feeling at the moment, I'm looking to prove it now11:34
lyarwoodkashyap: just trying to work out how to capture the moment libvirt / QEMU signal the guestOS to detach the device11:35
kashyaplyarwood: Right, configure the debug log filters, it should definitely give us some clues11:35
lyarwoodkashyap: and then work out how to capture that in the guest, AFAICT dmesg doesn't list it11:35
kashyap`journalctl`?11:35
lyarwoodkashyap: cirros doesn't have systemd11:37
kashyapDarn, I keep forgetting11:37
*** k_mouza has quit IRC11:37
lyarwoodah it's using acpid11:37
*** k_mouza has joined #openstack-nova11:39
sean-k-mooney1i need to try and find time to test alpine as a cirros alternitive11:40
kashyaplyarwood: Fedora doesn't do it?11:40
kashyap(As in, 'acpid' daemon)11:40
sean-k-mooney1fedora uses systemd for udev im not cure if that will handel acpid too11:41
sean-k-mooney1*sure11:41
*** sean-k-mooney1 is now known as sean-k-mooney11:41
*** k_mouza has quit IRC11:44
kashyapsean-k-mooney: 'systemd' can handle some ACPI events; not all - https://wiki.archlinux.org/index.php/Power_management#ACPI_events11:45
kashyapOn my Fedora laptop I see:11:45
kashyap$> systemctl | grep -i acpi sys-devices-platform-thinkpad_acpi-leds-tpacpi::kbd_backlight.device                                  loaded active plugged   /sys/devices/platform/thinkpad_acpi/leds/tpacpi::kbd_backlight11:46
kashyapsystemd-backlight@leds:tpacpi::kbd_backlight.service                                                  loaded active exited    Load/Save Screen Backlight Brightness of leds:tpacpi::kbd_backlight11:46
sean-k-mooneylyarwood: so i have been suggesting we should look into useing alpine instead of cirros going forward in the gate. its still does not use systemd but its one of the lightest weight distros i know of and unlike cirros its still maintained regurally11:46
kashyap(So some Thinkpad-related ACPI events are handled)11:46
sean-k-mooneykashyap: that sound like a lenovo extention11:46
*** k_mouza has joined #openstack-nova11:46
sean-k-mooneyrather then generic support11:46
kashyaplyarwood: Back to your original question - yeah, we need to find the "event" (IIRC, DEVICE_DELETED - need to double-check) that libvirtsends to the guest OS11:47
lyarwoodkashyap: do you know what that actually maps to in terms of what the guestOS sees?11:47
lyarwoodkashyap: an ACPI event right but any idea what type etc?11:47
kashyaplyarwood: Not top off my head, perhaps Michal from libvirt might know; he worked on the 'udev' integration11:47
lyarwoodkashyap: could you ask and I'll work out a while of capturing that within the guestOS itself11:48
kashyaplyarwood: Yeah, just asked; he's AFK.  I'm checking w/ a couple of others11:48
lyarwoodsean-k-mooney: tbh it's a little silly that we are using it in CI and running nodes with such little resource as well tbh11:49
sean-k-mooneylyarwood: well we dont have enogh disk/ram to use something much hevier11:49
sean-k-mooneynot without reducing concurancy at least11:49
sean-k-mooneycirros made sense when it was activly maintained and updated11:50
*** k_mouza has quit IRC11:51
*** rcernin has quit IRC11:54
kashyaplyarwood: Do you have access to the guest?  If so - is this present in it: /sys/module/pci_hotplug?11:59
*** sapd1_x has quit IRC12:00
sean-k-mooneykashyap: cirrus uses a striped down ubuntu 18.04 kernel so it may not be12:01
lyarwoodkashyap: yeah that's there, I assume I can enable that12:01
lyarwoodkashyap: debug that is12:02
*** xek has joined #openstack-nova12:02
lyarwoodand yeah was just reading https://blog.chrishowie.com/2019/09/19/hot-swapping-virtio-disks-on-qemu/ so it's a PCI hot remove with virtio-blk that makes sense12:03
sean-k-mooneyyep it is12:03
sean-k-mooneythat why i was asserting that virtio-scsi or q35 might help12:03
kashyaplyarwood: So I learn that's the part (the /sys/module/pci_hotplug) which is responsible for hotplug/hotunplug events12:04
sean-k-mooneyvirtio-scsi woudl be the simplest thing to enable12:04
lyarwoodsean-k-mooney: well if it the guestOS can't process the request to detach I don't think changing the underlying bus is going to help tbh12:04
kashyaplyarwood: So I just chatted w/ a couple of QEMU devs; and it seems notoriously difficult to detect this.  Way too low-level ...12:05
sean-k-mooneylyarwood: well it wont be a pci hotplug  anymore12:05
sean-k-mooneylyarwood: it will be a scsi detach12:05
lyarwoodsean-k-mooney: true but the guest would still need to handle the SCSI command (?) to detach12:05
gibistephenfin: fyi, I have a question in https://review.opendev.org/#/c/746945/6/nova/tests/functional/libvirt/test_pci_sriov_servers.py@a37012:05
sean-k-mooneyyes proably but  i think that would be more relyable12:05
kashyaplyarwood: A snippet:12:06
kashyap<kashyap> Hiya, a ranodm question: on monitor command 'device_del' (for device detach), would you happen to know how exactly does it manifest in the guest?12:06
kashyapAnswer (from Igor): guest gets SCI interrupt, next thing it reads status from GPE block and calls appropriate AML handler (it's all done within guest  kernel)12:06
kashyapAnswer 2 (from DanPB): "you'll get <insert hand waving> an ACPI unplug event something in the guest needs to respond to this event for it to complete"12:06
*** xek has quit IRC12:08
*** rcernin has joined #openstack-nova12:10
jangutter_kashyap: on physical hw I've hotplugged and unplugged SATA/SCSI/USB devices for ages, but I've NEVER done so with a PCIe device.12:11
*** jangutter_ is now known as jangutter12:11
*** rcernin has quit IRC12:11
sean-k-mooneygibi: stephenfin can i get your eyes on this https://review.opendev.org/#/c/738432/12:11
*** rcernin has joined #openstack-nova12:11
sean-k-mooneyi want to get that bug fix merged before m3 if we can so we can backport it to train12:12
kashyapjangutter: Yeap, noted12:12
kashyaplyarwood: So Jiri from libvirt also suggests to get the communication w/ QEMU monitor12:12
sean-k-mooneygibi: stephenfin im also hoping to get https://review.opendev.org/#/q/topic:bug/1888395+(status:open+OR+status:merged) merged soon bug im going to adress artoms nits now12:13
lyarwoodkashyap: yeah tracking that, I see the DEVICE_DELETED events12:16
lyarwoodkashyap: I've used https://www.kernel.org/doc/html/latest/firmware-guide/acpi/debug.html to enable ACPI debug for the ACPI_PCI_COMPONENT12:17
lyarwoodkashyap: within the guestos12:17
lyarwoodkashyap: lets see if that helps12:17
kashyaplyarwood: So I just posted #912:17
kashyapTo copy/paste my point-1 from there:12:17
kashyap"- DEVICE_DELETED is the event that QEMU sends to libvirt, *once* the device was removed by the guest, so that libvirt can clean-up. So if we see DEVICE_DELETED that means the device was successfully detached from QEMU's point of view (therefore, from the guest's PoV, too)"12:17
lyarwoodkashyap: right sorry I'm just working out how to instrument things in CI at the moment12:18
kashyaplyarwood: Are you using a new kernel rebuilt with it?12:18
lyarwoodkashyap: detach works correctly in the env at the moment12:18
lyarwoodkashyap: I'm just trying to figure out what I need to capture during a run to show things are delayed in the guestos12:19
lyarwoodkashyap: and yeah 5.3.0-26-generic is the kernel12:19
*** k_mouza has joined #openstack-nova12:20
kashyaplyarwood: So, Igor (KVM/QEMU dev) says: "You'd could watch for udev events as indirect result of unplug events for specific device subsystem"12:20
*** rcernin has quit IRC12:21
lyarwoodkashyap: I don't think cirros is using udev tbh12:21
kashyaplyarwood: Nod; I've actually snipped out his first part where he admits he isn't familiar w/ 'acpid'12:22
lyarwoodhttps://git.busybox.net/busybox/tree/util-linux/acpid.c it's not even the old version I was used to tbh12:23
*** k_mouza has quit IRC12:24
kashyaplyarwood: I'm curious if your test with slightly "better resources" for the guest fixes it12:25
*** mkrai has quit IRC12:25
kashyaplyarwood: Also can you tell what's the buggy guest configuration?  If you don't mind posting the guest XML...12:25
lyarwoodkashyap: I still saw a few failures12:25
kashyapSo it's not the resources allocated to the guest12:26
lyarwoodkashyap: that was in reference to the host guest running openstack FWIW12:26
lyarwoodkashyap: correct12:26
lyarwoodkashyap: CI nodes run with 1 vCPU and 8GB of RAM at the moment12:26
lyarwoodkashyap: the instances have 1 vCPU and 128MB of RAM12:27
kashyaplyarwood: BTW, haven't we "proved" that it is the guest OS that is buggy when you can't reproduce it w/ other guest OSes? :)12:27
kashyap(Thx for the guest config)12:27
lyarwoodkashyap: I'd just like to capture the actual events to prove it12:27
kashyapNod.  Seems notoriously difficult so far from my interactions12:28
kashyaplyarwood: I guess your approach w/ this rebuilt kernel w/ ACPI debug is to reproduce the prob and watch for output in 'dmesg'?12:28
*** mkrai has joined #openstack-nova12:29
lyarwoodkashyap: yeah, I shouldn't need to rebuild the kernel12:30
lyarwoodkashyap: I just need to work out a way of providing command line args to the instances12:30
lyarwoodkashyap: and then capture their console logs on failure12:31
* kashyap bbiab; break12:31
*** jangutter_ has joined #openstack-nova12:36
*** jsuchome has quit IRC12:36
*** jangutter has quit IRC12:38
*** jangutter has joined #openstack-nova12:38
openstackgerritGhanshyam Mann proposed openstack/nova master: [Trivial] Replace ref of policy.json to policy.yaml  https://review.opendev.org/74982112:39
gmanndansmith: sean-k-mooney gibi policy file default change is ready now- https://review.opendev.org/#/c/748059/912:39
*** jangutter_ has quit IRC12:41
gibigmann: ack12:43
openstackgerritsean mooney proposed openstack/nova master: add functional regression test for bug #1888395  https://review.opendev.org/74745412:45
openstackbug 1888395 in OpenStack Compute (nova) "shared live migration of a vm with a vif is broken in train" [High,In progress] https://launchpad.net/bugs/1888395 - Assigned to sean mooney (sean-k-mooney)12:45
openstackgerritsean mooney proposed openstack/nova master: Set migrate_data.vifs only when using multiple port bindings  https://review.opendev.org/74218012:45
*** Luzi has joined #openstack-nova12:45
*** mkrai has quit IRC12:50
*** k_mouza has joined #openstack-nova12:57
sean-k-mooneystephenfin: another one for your review queue https://review.opendev.org/#/q/topic:bug/1860555+(status:open+OR+status:merged) althouhg that is still WIP so lower priority but that might be the cause of our downstream issue13:10
sean-k-mooneygmann: im just poping out to grab lunch but ill try and take a look at the polcy change when i get back. its not really my area but ill take a look in anycase13:21
gmannsure, thanks13:21
*** jangutter_ has joined #openstack-nova13:21
kashyaplyarwood: BTW, can you please link to the latest error logs from upstream?  I can't find them here - https://zuul.opendev.org/t/openstack/build/9290c83e18a741a5bdab4e28de5eedb7/log/13:22
kashyaplyarwood: I'm looking for the offending guest QEMU command-line and its guest kernel version13:22
gibigmann: only have a request in the reno https://review.opendev.org/#/c/748059 but overall looks good to me13:23
*** sapd1_x has joined #openstack-nova13:23
gmanngibi: thanks. updating.13:23
kashyaplyarwood: The reason for the above details is because one of the QEMU devs say "lack of CPU time doesn't make sense [as a potential cause], as hot[un]plug events should be porcessed sooner or later"13:23
*** jangutte_ has joined #openstack-nova13:24
*** jangutter has quit IRC13:25
kashyaplyarwood: I think I should find the logs here (for the latest failing -focal logs): https://review.opendev.org/#/c/734029/13:25
lyarwoodkashyap: https://zuul.opendev.org/t/openstack/build/eee0dc94780c4555b376f17c4f50c301 is a recent example13:26
openstackgerritGhanshyam Mann proposed openstack/nova master: Migrate default policy file from JSON to YAML  https://review.opendev.org/74805913:26
lyarwoodkashyap: https://zuul.opendev.org/t/openstack/build/eee0dc94780c4555b376f17c4f50c301/log/controller/logs/libvirt/qemu/instance-0000007a_log.txt is the QEMU log for an instance that hit this13:27
openstackgerritGhanshyam Mann proposed openstack/nova master: Migrate default policy file from JSON to YAML  https://review.opendev.org/74805913:27
lyarwoodkashyap: 1dec20ff-922e-4bed-a97f-1699f114e74b13:27
*** jangutter_ has quit IRC13:27
kashyaplyarwood: Thank you; do you have the guest kernel version?  (Or the CirrOS version - then I can figure out the kernel version)13:27
lyarwoodkashyap: pretty sure it's the same as the version I listed earlier13:27
lyarwood5.3.0-26-generic13:28
kashyapAh, okay; was about to guess as much.  Thank you13:28
gmanngibi: updated, added bug in cmt msg also13:28
*** xek has joined #openstack-nova13:28
lyarwoodkashyap: just modified the cirros image in my test env to use debug ACPI btw13:28
lyarwoodkashyap: just hacking tempest to dump the console log / dmesg on failure13:28
kashyapAh, cool13:29
*** jawad_axd has quit IRC13:29
*** jangutter has joined #openstack-nova13:30
*** jangutte_ has quit IRC13:30
*** k_mouza has quit IRC13:30
*** jangutter_ has joined #openstack-nova13:31
*** jangutter has quit IRC13:34
gibigmann: thanks, +213:35
openstackgerritGhanshyam Mann proposed openstack/nova master: [Trivial] Replace ref of policy.json to policy.yaml  https://review.opendev.org/74982113:35
gmanngibi: thanks. ^^ this is trivial one to replace the ref of policy.json in doc and test13:36
gibilooking13:37
*** xek has quit IRC13:38
gibisean-k-mooney: I have a question at https://review.opendev.org/#/c/742180/1113:45
*** priteau has joined #openstack-nova13:47
* bauzas just discovered today a new world with neutron13:48
*** priteau has quit IRC13:48
*** priteau has joined #openstack-nova13:49
bauzassean-k-mooney: sooooo, we build the VIFs once we are in the compute service, right?13:49
bauzaswell, answering myself13:51
bauzasright, only when we add the fixed IP to an instance13:51
bauzaswhich is called either after creating the instance in the compute, or when adding the fixed IP directly to an instance by the API...13:52
kashyaplyarwood: So, I just combed through the libvirtd log surrounding the QMP 'device_del' (which does the detach), and here's the little fragment: https://kashyapc.fedorapeople.org/CirrOS_device_detach_issues/libvirtd-log-surrounding-device_del.txt14:00
bauzasgibi: sean-k-mooney: question, should we look at the segments if someone asks the API to put a port to an existance ?14:00
bauzasif so...14:00
kashyaplyarwood: It all looks "clean" until here to me:14:01
kashyap2020-09-03 20:01:53.019+0000: 65328: debug : qemuMonitorJSONIOProcessEvent:205 : handle DEVICE_DELETED handler=0x7f0230572840 data=0x55d556edf3c014:01
kashyap2020-09-03 20:01:53.019+0000: 65328: debug : qemuMonitorJSONHandleDeviceDeleted:1287 : missing device in device deleted event14:01
gibiport will be bound and I guess neutron will fail the binding if there is no segment on the given host14:01
gibias far as I remember interface_attach is a call so the error will propagate back the user14:01
gibibauzas: ^^14:01
bauzasgibi: ok, so Neutron will check it ?14:02
bauzasif so, fine14:02
gibiI assume, yes14:02
bauzascool14:02
gibias neutron would need to assign an ip14:02
gibiduring the binding14:02
bauzasanyway, we could provide a caveat documentation if no14:02
bauzasanyway, today is the last day I'm trying to work on this14:03
bauzasgibi: sean-k-mooneyif you have changes you want to me to review, lemme know14:03
bauzasand then I'll review them tomorrow14:03
openstackgerritLee Yarwood proposed openstack/nova master: WIP/DNM libvirt: Start emitting DeviceRemovedEvent and DeviceRemovalFailedEvent events  https://review.opendev.org/74992914:03
*** sapd1_x has quit IRC14:03
gibibauzas: sriov attach is ready, sean-k-mooney is alread +1 on it and the bottom has +2s from stephenfin. series starts here https://review.opendev.org/#/c/74143614:04
bauzasgibi: ack, will look14:04
gibithanks!14:04
bauzasgibi: now that I work on some network features, I know better the related files ;)14:05
gibi:)14:05
lyarwoodkashyap: yeah that's long after tempest has stopped waiting for the volume to be detached14:07
lyarwoodkashyap: let me grab some logs in pastebin14:07
kashyaplyarwood: I've got some contextual stuff here: https://kashyapc.fedorapeople.org/CirrOS_device_detach_issues/14:07
*** sapd1_x has joined #openstack-nova14:07
lyarwoodkashyap: http://paste.openstack.org/show/797545/ - AFAICT n-cpu stops trying to detach the volume much earlier than the libvirtd logs you've posted14:16
* kashyap clicks14:17
kashyaplyarwood: Okay, I perhaps need to look further up; let me see if I can see this "Unable to detach" thing in the log14:18
kashyaplyarwood: I'm stumped - I don't see why that "Unable to detach ..." isn't captured here: https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_c3a/734029/2/check/devstack-platform-focal/c3ab542/controller/logs/libvirt/libvirtd_log.txt14:19
kashyap(Beaware the above log size: gzip-compressed - 8.2MB; uncompressed - 118MB)14:19
lyarwood721862 2020-09-03 19:58:35.443+0000: 65331: debug : qemuDomainDeleteDevice:128 : Detaching of device virtio-disk1 failed and no event arrived14:20
lyarwood^ kashyap  I think that's what we are after14:20
kashyaplyarwood: "Huzzah", that's right14:20
*** k_mouza has joined #openstack-nova14:29
openstackgerritBalazs Gibizer proposed openstack/nova master: Follow up for I67504a37b0fe2ae5da3cba2f3122d9d0e18b9481  https://review.opendev.org/75018414:33
*** Luzi has quit IRC14:39
openstackgerritStephen Finucane proposed openstack/nova master: Add support for resize and cold migration of emulated TPM files  https://review.opendev.org/63993414:50
openstackgerritStephen Finucane proposed openstack/nova master: Expand generic reproducer for bug #1879878  https://review.opendev.org/75018614:50
openstackbug 1879878 in OpenStack Compute (nova) "VM become Error after confirming resize with Error info CPUUnpinningInvalid on source node " [Medium,In progress] https://launchpad.net/bugs/1879878 - Assigned to Stephen Finucane (stephenfinucane)14:50
openstackgerritStephen Finucane proposed openstack/nova master: Set 'old_flavor', 'new_flavor' on source before resize  https://review.opendev.org/75018714:50
gmannstephenfin: are you planning the xenapi removal for Victoria release? if so i can review your tempest patch on priority (as that will block the nova side change) otherwise after Focal migration work.14:54
stephenfingmann: Yes, I was hoping to14:54
stephenfinI think it's in merge conflict though14:54
* stephenfin looks14:54
gmannyeah.14:55
stephenfinokay, resolved that. docstring conflict14:56
stephenfingibi: replied at https://review.opendev.org/#/c/746945/6/nova/tests/functional/libvirt/test_pci_sriov_servers.py@a37014:58
gibithanks, looking14:59
*** mkrai has joined #openstack-nova15:01
*** jangutter has joined #openstack-nova15:02
*** jangutter has quit IRC15:02
*** jangutter has joined #openstack-nova15:03
*** jangutter_ has quit IRC15:05
*** k_mouza has quit IRC15:07
*** jangutter has quit IRC15:07
*** jangutter has joined #openstack-nova15:08
*** k_mouza has joined #openstack-nova15:11
*** k_mouza has quit IRC15:21
*** k_mouza has joined #openstack-nova15:24
*** martinkennelly has joined #openstack-nova15:26
*** links has quit IRC15:34
*** k_mouza has quit IRC15:39
*** ircuser-1 has joined #openstack-nova15:42
*** k_mouza has joined #openstack-nova15:47
sean-k-mooneybauzas: technically yes but the port binding would fail15:52
sean-k-mooneyah gibi aready said that15:52
bauzasall cool then15:52
sean-k-mooneygibi so regarding https://review.opendev.org/#/c/742180/11/nova/tests/functional/regressions/test_bug_1888395.py i was thinking of using stephens seriese eventuly to enable the migration testing15:55
sean-k-mooneygibi: once the sriov migration fuctional test series merges tehn that regression test can be updated15:55
gibisean-k-mooney: yeah that would be nice15:55
gibiI read through stephenfin's series today and I'm +2 almost all the way15:56
sean-k-mooneyim not sure if i need all the patches by the way. i was hopeing to get this merged before his series to avoid conflicts on backport but im hoping both merged in victoria15:57
sean-k-mooneyall the patches in stephens series that is15:57
gibisean-k-mooney: my -1 on https://review.opendev.org/#/c/742180/ is about the question if we break SRIOV live migration if there is no multi portbinding15:58
sean-k-mooneyyes so sriov live migration requires multiple port bindings15:58
sean-k-mooneyit was only ment to work if the backend supported that15:58
gibithen the question will it fail cleanly?15:58
sean-k-mooneyyes it will15:59
*** mkrai has quit IRC15:59
sean-k-mooneywe check if multiple port bindigns is supproted i nthe conductor15:59
sean-k-mooneyand fail the migration if not15:59
sean-k-mooneygibi: https://github.com/openstack/nova/blob/master/nova/conductor/tasks/live_migrate.py#L250-L25516:00
gibicool16:00
gibiI'm droping my -1 then16:00
sean-k-mooney:) any other concerns?16:00
gibinope16:01
sean-k-mooneyshould we be worried about all those gate timeouts16:01
gibiI haven't checked the gate this afternoon16:01
sean-k-mooneyim seeing sqlalcamy errors in the unit tests which are unrelated16:02
gibiI'm leaving for today...16:02
gibio/16:03
sean-k-mooneyo/16:03
*** openstackgerrit has quit IRC16:11
*** k_mouza has quit IRC16:34
*** xek has joined #openstack-nova16:41
*** ralonsoh has quit IRC16:44
*** k_mouza has joined #openstack-nova16:45
*** martinkennelly has quit IRC16:46
*** sapd1_x has quit IRC16:57
*** derekh has quit IRC17:03
*** tesseract has quit IRC17:04
*** dtantsur is now known as dtantsur|afk17:18
*** k_mouza has quit IRC17:22
*** nightmare_unreal has quit IRC17:40
*** zzzeek has quit IRC18:20
*** zzzeek has joined #openstack-nova18:24
*** openstackgerrit has joined #openstack-nova18:30
openstackgerritLee Yarwood proposed openstack/nova master: fakelibvirt: Use versionutils to set min versions found in the driver  https://review.opendev.org/74970718:30
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Bump MIN_{LIBVIRT,QEMU}_VERSION and NEXT_MIN_{LIBVIRT,QEMU}_VERSION  https://review.opendev.org/74698118:30
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Remove MIN_LIBVIRT_FILE_BACKED_DISCARD_VERSION  https://review.opendev.org/74698218:30
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Remove MIN_{LIBVIRT,QEMU}_NATIVE_TLS_VERSION  https://review.opendev.org/74698318:30
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Remove MIN_LIBVIRT_BETTER_SIGKILL_HANDLING  https://review.opendev.org/74698418:30
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Remove MIN_LIBVIRT_VIDEO_MODEL_VERSIONS  https://review.opendev.org/74698518:30
openstackgerritLee Yarwood proposed openstack/nova master: libvirt: Remove MIN_{LIBVIRT,QEMU}_PMEM_SUPPORT  https://review.opendev.org/74698618:30
*** zzzeek has quit IRC18:50
*** zzzeek has joined #openstack-nova18:52
openstackgerritMerged openstack/nova stable/ussuri: Add note and daxio version to the vPMEM document  https://review.opendev.org/74968119:00
*** priteau has quit IRC19:04
*** gregwork has joined #openstack-nova19:27
*** ociuhandu has joined #openstack-nova19:37
*** ociuhandu_ has joined #openstack-nova19:42
*** ociuhandu has quit IRC19:45
*** ociuhandu_ has quit IRC20:19
openstackgerritsean mooney proposed openstack/nova master: add functional regression test for bug #1888395  https://review.opendev.org/74745420:22
openstackbug 1888395 in OpenStack Compute (nova) "shared live migration of a vm with a vif is broken in train" [High,In progress] https://launchpad.net/bugs/1888395 - Assigned to sean mooney (sean-k-mooney)20:22
openstackgerritsean mooney proposed openstack/nova master: Set migrate_data.vifs only when using multiple port bindings  https://review.opendev.org/74218020:22
*** ociuhandu has joined #openstack-nova20:35
openstackgerritMerged openstack/nova stable/train: Removed the host FQDN from the exception message  https://review.opendev.org/74960920:35
openstackgerritMerged openstack/nova stable/ussuri: resolve ResourceProviderSyncFailed issue  https://review.opendev.org/74966820:35
openstackgerritMerged openstack/nova stable/ussuri: Set different VirtualDevice.key  https://review.opendev.org/74941820:35
openstackgerritMerged openstack/nova stable/train: tests: Add reproducer for bug #1889633  https://review.opendev.org/74825420:35
openstackbug 1889633 in OpenStack Compute (nova) train "Pinned instance with thread policy can consume VCPU" [High,In progress] https://launchpad.net/bugs/1889633 - Assigned to Stephen Finucane (stephenfinucane)20:35
openstackgerritMerged openstack/nova stable/train: Add checks for volume status when rebuilding  https://review.opendev.org/74855820:35
*** ociuhandu has quit IRC20:39
*** zzzeek has quit IRC20:42
*** zzzeek has joined #openstack-nova20:44
*** k_mouza has joined #openstack-nova21:11
*** xek has quit IRC21:14
openstackgerritsean mooney proposed openstack/nova master: use os-brick connector fixture form ServersTestBase  https://review.opendev.org/75021521:16
*** slaweq has quit IRC21:19
*** slaweq has joined #openstack-nova21:23
*** slaweq has quit IRC21:27
openstackgerritsean mooney proposed openstack/nova master: add using_multiple_port_bindings property to livemigrate_data  https://review.opendev.org/75021721:42
openstackgerritsean mooney proposed openstack/nova master: add using_multiple_port_bindings property to LiveMigrateData  https://review.opendev.org/75021721:44
*** k_mouza has quit IRC21:48
*** hoonetorg has quit IRC22:15
*** hoonetorg has joined #openstack-nova22:28
openstackgerritGhanshyam Mann proposed openstack/nova master: Migrate default policy file from JSON to YAML  https://review.opendev.org/74805922:32
*** rcernin has joined #openstack-nova22:41
*** tosky has quit IRC23:16

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!