opendevreview | melanie witt proposed openstack/nova master: DNM test ephemeral encryption + resize: qcow2, raw, rbd https://review.opendev.org/c/openstack/nova/+/862416 | 00:09 |
---|---|---|
*** efried1 is now known as efried | 01:04 | |
opendevreview | melanie witt proposed openstack/nova master: DNM test ephemeral encryption + resize: qcow2, raw, rbd https://review.opendev.org/c/openstack/nova/+/862416 | 02:03 |
opendevreview | likui proposed openstack/nova master: Add py3.11 in tox file https://review.opendev.org/c/openstack/nova/+/901587 | 02:10 |
opendevreview | likui proposed openstack/nova master: Update python classifier in setup.cfg https://review.opendev.org/c/openstack/nova/+/901587 | 02:10 |
tonyb | Any chance I can get some eyes on: https://review.opendev.org/c/openstack/nova/+/905314 ? | 02:11 |
opendevreview | Amit Uniyal proposed openstack/nova master: Fixes: bfv vm reboot ends up in an error state. https://review.opendev.org/c/openstack/nova/+/904817 | 06:50 |
bauzas | tonyb: sent to the gate | 08:28 |
opendevreview | Merged openstack/nova master: Fix test failures with oslo.limit 2.3.0 https://review.opendev.org/c/openstack/nova/+/905314 | 09:21 |
tonyb | bauzas: thank you | 09:23 |
tonyb | melwitt: and thank you too for the quick review | 09:24 |
*** priteau_ is now known as priteau | 12:14 | |
opendevreview | Merged openstack/nova master: Updates glance fixture for create image https://review.opendev.org/c/openstack/nova/+/905684 | 14:37 |
opendevreview | Sylvain Bauza proposed openstack/nova master: Add a functest for verifying multiple VGPU allocations https://review.opendev.org/c/openstack/nova/+/845747 | 15:16 |
opendevreview | Sylvain Bauza proposed openstack/nova master: Support multiple allocations for vGPUs https://review.opendev.org/c/openstack/nova/+/845757 | 15:16 |
opendevreview | Merged openstack/nova master: Fixes: bfv vm reboot ends up in an error state. https://review.opendev.org/c/openstack/nova/+/904817 | 16:40 |
*** tosky_ is now known as tosky | 16:41 | |
opendevreview | Doug Szumski proposed openstack/nova master: Use dedicated live migration network during pre-migration https://review.opendev.org/c/openstack/nova/+/906053 | 16:53 |
_colby | bauzas: thanks for your help yesterday. So from what you said do we need to manually create the mdevs now? I was confused on that part. I see all the VF and available_instances for them all. But mdevctl shows no devices and nova does not seem to create them any more. So is manual creation required now? | 17:16 |
bauzas | (on a meeting now but I'll reply later) | 17:17 |
opendevreview | Doug Szumski proposed openstack/nova master: Use dedicated live migration network during pre-migration https://review.opendev.org/c/openstack/nova/+/906053 | 17:17 |
bauzas | _colby: I'll need to disappear shortly, but please explain more: are you able to see mediated devices in /sys/bus/mdev? | 17:30 |
opendevreview | Doug Szumski proposed openstack/nova master: Use dedicated live migration network during pre-migration https://review.opendev.org/c/openstack/nova/+/906053 | 17:32 |
_colby | bauzas: yes I see 1 here for all the available VFs: cat /sys/class/mdev_bus/*/mdev_supported_types/nvidia-563/available_instances | 17:38 |
bauzas | so basically all GPU VFs are available | 17:39 |
bauzas | do you have instances on the host ? | 17:39 |
_colby | no this is a fresh host I'm testing with | 17:39 |
bauzas | and do you have placement allocations ? | 17:39 |
bauzas | _colby: re: the bug I mentioned yesterday, once you create enough mdevs equal to the max number of vGPUs the type can create, then all other VFs will start having 0 available instances but placement will continue to say 'sure, you can use them' | 17:40 |
_colby | yes the resource provider shows all the VF addresses as separate items and looking at inventory of one I see the expected output: VGPU, allocation_ratio=1.0,min_unit=1,max_unit=1,reserved=0,step_size=1,total=1,used=0 | 17:41 |
bauzas | that's why I'm recommending to precreate the mediated devices on the VFs you want and then start nova-compute | 17:41 |
bauzas | oh, you already started nova-compute | 17:41 |
bauzas | then, | 17:41 |
bauzas | 1/ create the mdevs by hand (or by mdevctl) | 17:41 |
bauzas | 2/ stop nova-compute | 17:41 |
bauzas | 3/ delete the resource providers matching the unused VFs | 17:42 |
bauzas | 4/ restart nova-compute | 17:42 |
bauzas | you should then have placement reflecting the correct resources | 17:42 |
bauzas | I need to leave now | 17:42 |
_colby | ok I can do that. Seems like a bit of heavy admin work every time we want to reconfig the vGPU slices. Nova does not want to create them automatically any more? | 17:43 |
bauzas | that's the bug report I mentioned yesterday | 17:43 |
_colby | but if that will get it working then we can do that | 17:44 |
bauzas | nova can continue to create the mediated devices | 17:44 |
bauzas | but as soon as you create the Nth mdev from a GPU (where N equals the max number of vGPUs you can create from a type), then you'll have inconsistencies between placement and sysfs | 17:45 |
_colby | I see | 17:45 |
_colby | but in our case Nova does not seem to be creating any mdevs | 17:45 |
bauzas | a fix is there, we're going to review it this cycle | 17:45 |
bauzas | I need to go but shouldn't be | 17:45 |
_colby | no problem thanks | 17:46 |
bauzas | I can't remember when we added more logs to _allocate_mdevs() but yoga seems recent enough to have those | 17:46 |
bauzas | https://github.com/openstack/nova/commit/6feb3350b048606297068841e3feba110bb0b0ab | 17:48 |
bauzas | nah, antelope | 17:48 |
_colby | I can manually patch our system to add these logs for testing | 17:49 |
_colby | bauzas: whenever you get back I made more progress: https://pastebin.com/Xv2bvaf7 | 18:19 |
_colby | it's complaining about no available instances on that device but there is one: cat /sys/class/mdev_bus/0000\:17\:02.0/mdev_supported_types/nvidia-563/available_instances | 18:20 |
_colby | also that device is not listed in the config for that type: device_addresses=0000:17:00.4,0000:17:00.5,0000:17:00.6,0000:17:00.7 | 18:22 |
opendevreview | Merged openstack/nova master: Allow config to support virtiofs (driver) https://review.opendev.org/c/openstack/nova/+/886522 | 21:51 |
opendevreview | Merged openstack/nova master: Attach Manila shares via virtiofs (db) https://review.opendev.org/c/openstack/nova/+/831193 | 21:51 |
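
The checks discussed above (reading `available_instances` per VF, comparing against the configured `device_addresses`, and creating mdevs by hand) can be sketched in Python. This is a minimal illustration, not nova's implementation: the function names `available_instances`, `unconfigured` and `create_mdev` are hypothetical, and the `root` parameter exists only so the code can be pointed at a test directory instead of the real sysfs tree. It assumes the standard kernel mdev sysfs layout, where writing a UUID to a type's `create` file creates one mediated device, and it needs root privileges on a real host.

```python
import uuid
from pathlib import Path

# Standard kernel location for mdev-capable parent devices.
MDEV_BUS = Path("/sys/class/mdev_bus")


def available_instances(mdev_type, root=MDEV_BUS):
    """Map each parent device address to its available_instances count."""
    counts = {}
    pattern = f"*/mdev_supported_types/{mdev_type}/available_instances"
    for path in sorted(Path(root).glob(pattern)):
        # <root>/<address>/mdev_supported_types/<type>/available_instances
        address = path.parents[2].name
        counts[address] = int(path.read_text().strip())
    return counts


def unconfigured(counts, device_addresses):
    """Addresses that expose the type but are not in the configured list."""
    configured = set(device_addresses)
    return sorted(addr for addr in counts if addr not in configured)


def create_mdev(address, mdev_type, root=MDEV_BUS, mdev_uuid=None):
    """Create one mediated device by writing a UUID to the 'create' file."""
    mdev_uuid = mdev_uuid or str(uuid.uuid4())
    create = Path(root) / address / "mdev_supported_types" / mdev_type / "create"
    create.write_text(mdev_uuid)
    return mdev_uuid


if __name__ == "__main__":
    # Values taken from the conversation; adjust for the host being checked.
    configured = ["0000:17:00.4", "0000:17:00.5", "0000:17:00.6", "0000:17:00.7"]
    counts = available_instances("nvidia-563")
    for address, count in counts.items():
        print(address, count)
    print("not in device_addresses:", unconfigured(counts, configured))
```

Only step 1 of the workaround is covered here. Stopping nova-compute, deleting the resource providers for the unused VFs, and restarting the service are left to the operator's usual tooling.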