*** k_mouza has joined #openstack-nova | 00:27 | |
*** k_mouza has quit IRC | 00:32 | |
*** gyee has quit IRC | 00:32 | |
*** brinzhang0 is now known as brinzhang | 00:33 | |
brinzhang | gibi, gmann: the nova-grenade-multinode task always failed, is there a bug tracing? | 00:34 |
---|---|---|
brinzhang | https://zuul.opendev.org/t/openstack/builds?job_name=nova-grenade-multinode | 00:34 |
gmann | brinzhang: i see error 'Failed to start rtslib-fb-targetctl.service: Unit rtslib-fb-targetctl.service is not loaded properly: Exec format error.' | 00:52 |
gmann | but that is passing sometime | 00:52 |
gmann | not 100% failure | 00:52 |
gmann | clarkb: ^^ are you aware of this error - https://zuul.opendev.org/t/openstack/build/599cfa422a0648168c8b00a27fbd3114/log/logs/grenade.sh.txt#46891-46912 | 00:53 |
gmann | brinzhang: if you see, there are some passing job also on master gate https://zuul.opendev.org/t/openstack/builds?job_name=nova-grenade-multinode | 00:55 |
gmann | but yes, it is happening more frequently | 00:56 |
*** brinzhang_ has joined #openstack-nova | 01:05 | |
*** brinzhang has quit IRC | 01:08 | |
*** __ministry has joined #openstack-nova | 01:17 | |
brinzhang_ | gmann: yes, there are always failed in https://review.opendev.org/c/openstack/nova/+/764292 | 01:18 |
*** sapd1_x has joined #openstack-nova | 01:20 | |
gmann | brinzhang_: this one https://askubuntu.com/questions/1334619/failed-to-start-rtslib-fb-targetctl-service | 01:21 |
*** sapd1_x has quit IRC | 01:25 | |
brinzhang_ | gmann: ok, it seems there is no answer with this question | 01:26 |
*** hamalq has quit IRC | 01:34 | |
*** macz_ has joined #openstack-nova | 01:35 | |
*** macz_ has quit IRC | 01:40 | |
*** macz_ has joined #openstack-nova | 01:56 | |
*** swp20 has joined #openstack-nova | 01:59 | |
*** swp20 is now known as wenpingsong | 02:00 | |
*** macz_ has quit IRC | 02:00 | |
*** brinzhang_ is now known as brinzhang | 02:07 | |
*** sapd1_x has joined #openstack-nova | 02:12 | |
*** dasp_ has quit IRC | 02:31 | |
*** dasp has joined #openstack-nova | 02:35 | |
*** rcernin has quit IRC | 02:35 | |
*** rcernin has joined #openstack-nova | 02:42 | |
*** hemanth_n has joined #openstack-nova | 02:45 | |
*** _erlon_ has quit IRC | 03:21 | |
*** boxiang_ has quit IRC | 03:31 | |
*** sapd1_x has quit IRC | 03:34 | |
*** mkrai has joined #openstack-nova | 03:36 | |
*** sapd1_x has joined #openstack-nova | 03:41 | |
*** mkrai has quit IRC | 03:55 | |
*** macz_ has joined #openstack-nova | 03:55 | |
openstackgerrit | Xinran WANG proposed openstack/nova-specs master: Repropose smartnic support spec https://review.opendev.org/c/openstack/nova-specs/+/783632 | 03:56 |
*** rcernin has quit IRC | 03:59 | |
*** rcernin has joined #openstack-nova | 04:00 | |
*** macz_ has quit IRC | 04:00 | |
*** vishalmanchanda has joined #openstack-nova | 04:24 | |
*** terdei has quit IRC | 04:30 | |
*** terdei has joined #openstack-nova | 04:33 | |
*** aarents has quit IRC | 04:42 | |
*** ratailor has joined #openstack-nova | 04:52 | |
*** openstackgerrit has quit IRC | 05:01 | |
*** erbarr has joined #openstack-nova | 05:16 | |
erbarr | hi, is there an issue with configdrive and ussuri? | 05:22 |
*** sapd1_x has quit IRC | 05:44 | |
*** slaweq has joined #openstack-nova | 05:55 | |
*** macz_ has joined #openstack-nova | 05:56 | |
*** ratailor_ has joined #openstack-nova | 05:58 | |
*** ratailor has quit IRC | 05:59 | |
*** macz_ has quit IRC | 06:00 | |
*** ralonsoh has joined #openstack-nova | 06:25 | |
*** luksky has joined #openstack-nova | 06:39 | |
*** macz_ has joined #openstack-nova | 06:46 | |
*** aarents has joined #openstack-nova | 06:47 | |
*** macz_ has quit IRC | 06:52 | |
*** wenpingsong has quit IRC | 06:59 | |
*** rcernin has quit IRC | 07:04 | |
*** ociuhandu has joined #openstack-nova | 07:06 | |
*** ociuhandu has quit IRC | 07:07 | |
*** ociuhandu has joined #openstack-nova | 07:07 | |
gibi | gmann, brinzhang: https://review.opendev.org/c/openstack/devstack/+/788429 should have fixed the grenade issue | 07:10 |
gibi | gmann, brinzhang: and when this https://review.opendev.org/c/openstack/nova/+/778885 finally lands | 07:11 |
gibi | we don't need to pin ISCSI_HELPER any more | 07:11 |
*** andrewbonney has joined #openstack-nova | 07:19 | |
*** ociuhandu has quit IRC | 07:21 | |
*** ociuhandu has joined #openstack-nova | 07:22 | |
*** macz_ has joined #openstack-nova | 07:25 | |
*** ociuhandu has quit IRC | 07:27 | |
*** rpittau|afk is now known as rpittau | 07:27 | |
*** macz_ has quit IRC | 07:30 | |
*** gokhani has joined #openstack-nova | 07:32 | |
*** ociuhandu has joined #openstack-nova | 07:34 | |
*** boxiang has joined #openstack-nova | 07:35 | |
*** sapd1_x has joined #openstack-nova | 07:39 | |
*** dtantsur|afk is now known as dtantsur | 07:40 | |
*** sapd1_x has quit IRC | 07:43 | |
*** tosky has joined #openstack-nova | 07:44 | |
*** sapd1_x has joined #openstack-nova | 07:59 | |
*** martinkennelly has joined #openstack-nova | 08:02 | |
*** sapd1_x has quit IRC | 08:04 | |
*** lucasagomes has joined #openstack-nova | 08:05 | |
brinzhang | gibi: ack | 08:09 |
*** ociuhandu has quit IRC | 08:11 | |
*** ociuhandu has joined #openstack-nova | 08:12 | |
*** swp20 has joined #openstack-nova | 08:14 | |
*** dklyle has quit IRC | 08:18 | |
*** ociuhandu has quit IRC | 08:18 | |
*** sapd1_x has joined #openstack-nova | 08:18 | |
*** macz_ has joined #openstack-nova | 08:37 | |
*** macz_ has quit IRC | 08:41 | |
*** links has joined #openstack-nova | 08:43 | |
*** ociuhandu has joined #openstack-nova | 08:50 | |
*** ociuhandu has quit IRC | 08:56 | |
*** k_mouza has joined #openstack-nova | 09:01 | |
*** tesseract has joined #openstack-nova | 09:17 | |
*** ociuhandu has joined #openstack-nova | 09:19 | |
*** ociuhandu has quit IRC | 09:24 | |
*** bnemec has quit IRC | 09:27 | |
*** ociuhandu has joined #openstack-nova | 09:27 | |
*** bnemec has joined #openstack-nova | 09:28 | |
*** k_mouza has quit IRC | 09:32 | |
*** derekh has joined #openstack-nova | 09:32 | |
*** ociuhandu has quit IRC | 09:35 | |
*** ociuhandu_ has joined #openstack-nova | 09:35 | |
*** k_mouza has joined #openstack-nova | 09:39 | |
*** xinranwang has quit IRC | 09:40 | |
*** zzzeek has quit IRC | 09:42 | |
*** zzzeek has joined #openstack-nova | 09:43 | |
*** ociuhandu_ has quit IRC | 09:44 | |
*** k_mouza has quit IRC | 09:44 | |
*** k_mouza has joined #openstack-nova | 09:47 | |
*** tobberydberg has quit IRC | 09:47 | |
*** tobberydberg has joined #openstack-nova | 09:52 | |
*** openstackgerrit has joined #openstack-nova | 10:08 | |
openstackgerrit | Lee Yarwood proposed openstack/nova master: zuul: Replace grenade and nova-grenade-multinode with grenade-multinode https://review.opendev.org/c/openstack/nova/+/778885 | 10:08 |
openstackgerrit | Lee Yarwood proposed openstack/nova master: zuul: Remove nova-dsvm-multinode-base https://review.opendev.org/c/openstack/nova/+/778908 | 10:08 |
lyarwood | gibi: ^ that series was still using the older detach flow and seeing errors, I've rebased to pickup your new code and hopefully that should allow it to pass | 10:08 |
gibi | hm, I thought zuul rebases the patches | 10:09 |
gibi | before running the tests | 10:09 |
*** sapd1_x has quit IRC | 10:09 | |
*** macz_ has joined #openstack-nova | 10:10 | |
gibi | anyhow I +Ad them | 10:10 |
sean-k-mooney | gibi: zuul merges the patch with the head of master and if you use depends on in the same repor it merged with that too | 10:13 |
sean-k-mooney | gibi: so not a rebase but similar | 10:13 |
gibi | then it should have picked up my detach fixes | 10:13 |
lyarwood | AFAICT it didn't | 10:14 |
*** ociuhandu has joined #openstack-nova | 10:14 | |
lyarwood | this is just in the check queue btw | 10:14 |
lyarwood | it didn't make it to the gate this run | 10:14 |
sean-k-mooney | yep check should do the same | 10:14 |
sean-k-mooney | it will speculetlivly merge and then run on the result | 10:14 |
sean-k-mooney | its why sometimes test pass locally and fail in the gate | 10:15 |
*** macz_ has quit IRC | 10:15 | |
sean-k-mooney | and you have to do a local rebase to see the isssues locally | 10:15 |
sean-k-mooney | could it have just been a timing thing? | 10:16 |
sean-k-mooney | check jobs dont restart if something merges | 10:16 |
lyarwood | https://zuul.opendev.org/t/openstack/build/02d0c22ca9334a1094782df2f9570c35/log/compute1/logs/screen-n-cpu.txt and req-d017e833-5df2-4c5e-b348-f2625a2a608d is the old flow as it's using loopingcall | 10:16 |
sean-k-mooney | unlike gate | 10:16 |
* gibi clicks | 10:17 | |
gibi | yepp the logs are from the old code | 10:18 |
lyarwood | anyway it's rebased now | 10:18 |
lyarwood | so hopefully | 10:18 |
gibi | fingers crossed | 10:18 |
*** __ministry has quit IRC | 10:18 | |
gibi | lyarwood: it is a grenade run, so I guess before the upgrade it have to run the old code from Wallaby | 10:19 |
lyarwood | aaaaaaaah | 10:19 |
lyarwood | right | 10:19 |
lyarwood | I didn't even think | 10:19 |
lyarwood | so is this still v to w? | 10:19 |
gibi | I think it is w to master | 10:20 |
gibi | but we only merged the fix to master | 10:20 |
sean-k-mooney | oh ya | 10:20 |
lyarwood | we did talk about backporting your change to wallaby | 10:20 |
sean-k-mooney | well grenade shoudl be wallaby to xena | 10:21 |
* lyarwood needs to jump on a call brb | 10:21 | |
lyarwood | sean-k-mooney: yeah I think this is the live migration test between the two | 10:21 |
lyarwood | sean-k-mooney: with the compute still on w | 10:21 |
sean-k-mooney | back porting the change for libvirt events to wallaby would be nice form a downstrem perspecitve too | 10:22 |
sean-k-mooney | we talked about trying that downstream but while i like the idea of doing that i was not personally sure we wanted to diverge form upstream in this case | 10:23 |
gibi | lyarwood: the delta between w and current master should be small, so I can fire up the cherry-picks | 10:24 |
gibi | ahh there are already merge conflicts :/ | 10:29 |
*** zzzeek has quit IRC | 10:33 | |
*** zzzeek has joined #openstack-nova | 10:34 | |
lyarwood | yeah that might be my fault sorry | 10:36 |
lyarwood | actually yeah it is | 10:36 |
lyarwood | sean-k-mooney: yeah I did bring it up upstream as well as it's also a bugfix IMHO | 10:36 |
gibi | it is nobody's fault, we needed those fixes | 10:37 |
*** tbachman has quit IRC | 10:43 | |
*** tbachman has joined #openstack-nova | 10:43 | |
*** tbarron has joined #openstack-nova | 10:48 | |
*** whoami-rajat has joined #openstack-nova | 10:52 | |
*** ratailor__ has joined #openstack-nova | 10:53 | |
*** hemanth_n has quit IRC | 10:53 | |
*** iurygregory has quit IRC | 10:54 | |
*** ratailor_ has quit IRC | 10:56 | |
*** sapd1_x has joined #openstack-nova | 10:56 | |
*** iurygregory has joined #openstack-nova | 10:58 | |
*** damien_r has quit IRC | 11:02 | |
*** macz_ has joined #openstack-nova | 11:02 | |
*** damien_r has joined #openstack-nova | 11:03 | |
*** macz_ has quit IRC | 11:06 | |
*** tbachman has quit IRC | 11:07 | |
openstackgerrit | Vlad Gusev proposed openstack/nova master: libvirt: Abort live-migration job when monitoring fails https://review.opendev.org/c/openstack/nova/+/764435 | 11:08 |
openstackgerrit | Daniel Bengtsson proposed openstack/nova master: Use the new type HostDomainOpt. https://review.opendev.org/c/openstack/nova/+/788240 | 11:10 |
*** damien_r has quit IRC | 11:15 | |
*** k_mouza has quit IRC | 11:29 | |
*** k_mouza has joined #openstack-nova | 11:29 | |
*** gokhani has quit IRC | 11:38 | |
*** dave-mccowan has quit IRC | 11:40 | |
*** ociuhandu has quit IRC | 11:41 | |
*** ociuhandu has joined #openstack-nova | 11:42 | |
*** dave-mccowan has joined #openstack-nova | 11:42 | |
*** macz_ has joined #openstack-nova | 11:42 | |
*** ociuhandu has quit IRC | 11:47 | |
*** macz_ has quit IRC | 11:47 | |
*** ociuhandu has joined #openstack-nova | 11:48 | |
*** ociuhandu has quit IRC | 11:53 | |
*** macz_ has joined #openstack-nova | 12:03 | |
*** ratailor__ has quit IRC | 12:07 | |
*** macz_ has quit IRC | 12:08 | |
*** ratailor has joined #openstack-nova | 12:10 | |
*** ratailor has quit IRC | 12:10 | |
openstackgerrit | Balazs Gibizer proposed openstack/nova stable/wallaby: Replace blind retry with libvirt event waiting in detach https://review.opendev.org/c/openstack/nova/+/788720 | 12:13 |
openstackgerrit | Balazs Gibizer proposed openstack/nova stable/wallaby: Move the guest.get_disk test to test_guest https://review.opendev.org/c/openstack/nova/+/788721 | 12:13 |
openstackgerrit | Balazs Gibizer proposed openstack/nova stable/wallaby: Enable mypy on libvirt/guest.py https://review.opendev.org/c/openstack/nova/+/788722 | 12:13 |
openstackgerrit | Balazs Gibizer proposed openstack/nova stable/wallaby: Follow up type hints for a634103 https://review.opendev.org/c/openstack/nova/+/788723 | 12:13 |
openstackgerrit | Balazs Gibizer proposed openstack/nova stable/wallaby: libvirt: Remove dead error handling code https://review.opendev.org/c/openstack/nova/+/788724 | 12:13 |
openstackgerrit | Balazs Gibizer proposed openstack/nova stable/wallaby: Move instance power state check to _detach_with_retry https://review.opendev.org/c/openstack/nova/+/788725 | 12:13 |
*** ociuhandu has joined #openstack-nova | 12:14 | |
gibi | lyarwood, sean-k-mooney: ^^ here is the backport series. I'm not 100% confident about it as there was hairy conflicts in the tests | 12:14 |
openstackgerrit | Balazs Gibizer proposed openstack/nova stable/wallaby: Consolidate device detach error handling https://review.opendev.org/c/openstack/nova/+/788726 | 12:14 |
*** dave-mccowan has quit IRC | 12:15 | |
*** dave-mccowan has joined #openstack-nova | 12:17 | |
*** ociuhandu has quit IRC | 12:20 | |
gibi | lyarwood: fyi, there is a rescue + volume + RBD bug in our untriaged list https://bugs.launchpad.net/nova/+bug/1926601 | 12:26 |
openstack | Launchpad bug 1926601 in OpenStack Compute (nova) "Rescuing RBD volume-backed instance does not work" [Undecided,New] | 12:26 |
sean-k-mooney | this is hte rbd images backend | 12:29 |
sean-k-mooney | and a bfv guest | 12:29 |
lyarwood | gibi: ack yeah just on a call but I'll triage it today | 12:29 |
*** gokhani has joined #openstack-nova | 12:30 | |
gibi | thanks | 12:30 |
sean-k-mooney | it sound like we are just not creating the rescue disk in ceph but trying to use it | 12:31 |
sean-k-mooney | i wonder is that becaue of the fact the vm is bfv. | 12:32 |
*** ociuhandu has joined #openstack-nova | 12:33 | |
*** k_mouza has quit IRC | 12:34 | |
sean-k-mooney | they are on victoria so they should have https://specs.openstack.org/openstack/nova-specs/specs/ussuri/implemented/virt-bfv-instance-rescue.html | 12:34 |
sean-k-mooney | my guess is that we are creating the rescue disk as a cinder volumn but then trying to use it form teh vms pool since the rbd images backend is enabeld | 12:35 |
*** tbachman has joined #openstack-nova | 12:40 | |
*** damien_r has joined #openstack-nova | 12:41 | |
*** k_mouza has joined #openstack-nova | 12:43 | |
sean-k-mooney | so this is going to use the image_backed specified in the nova.conf so the disk it returns would be basedon the rbd image backend which will use the vms pool | 12:43 |
sean-k-mooney | https://github.com/openstack/nova/blob/eba9d596daa91d8f702b719afb88cb89f2d5bb32/nova/virt/libvirt/driver.py#L5170 | 12:44 |
*** ociuhandu has quit IRC | 12:44 | |
*** ociuhandu has joined #openstack-nova | 12:45 | |
*** damien_r has quit IRC | 12:45 | |
sean-k-mooney | _create_image is going too check if its a bfv guest here https://github.com/openstack/nova/blob/eba9d596daa91d8f702b719afb88cb89f2d5bb32/nova/virt/libvirt/driver.py#L4500 | 12:47 |
sean-k-mooney | which will be passed to _create_and_inject_local_root https://github.com/openstack/nova/blob/eba9d596daa91d8f702b719afb88cb89f2d5bb32/nova/virt/libvirt/driver.py#L4578 | 12:48 |
sean-k-mooney | which will not create teh image becuase bfv willl be true https://github.com/openstack/nova/blob/eba9d596daa91d8f702b719afb88cb89f2d5bb32/nova/virt/libvirt/driver.py#L4664 | 12:49 |
sean-k-mooney | so we will just take the else branch | 12:49 |
sean-k-mooney | lyarwood: gibi ^ pretty sure that is the root cause of https://bugs.launchpad.net/nova/+bug/1926601 | 12:50 |
openstack | Launchpad bug 1926601 in OpenStack Compute (nova) "Rescuing RBD volume-backed instance does not work" [Undecided,New] | 12:50 |
gibi | sean-k-mooney: nice analysis | 12:50 |
lyarwood | hmm I was sure we had tempest tests for this | 12:53 |
*** k_mouza has quit IRC | 12:53 | |
lyarwood | and I also verified it during the bfv rescue work a few cycles ago | 12:53 |
* lyarwood looks | 12:53 | |
*** mkrai has joined #openstack-nova | 12:55 | |
*** ociuhandu_ has joined #openstack-nova | 12:58 | |
*** ociuhandu has quit IRC | 13:01 | |
*** k_mouza has joined #openstack-nova | 13:01 | |
*** jraju__ has joined #openstack-nova | 13:13 | |
*** links has quit IRC | 13:13 | |
*** macz_ has joined #openstack-nova | 13:15 | |
*** macz_ has quit IRC | 13:20 | |
lyarwood | sean-k-mooney: thanks for the pointers, the real issue is that the request was even allowed as we don't support bfv rescue outside of also requesting a stable device rescue. | 13:26 |
lyarwood | appears I never encoded that in the compute API so it's accepted and passed down to the virt driver that was never changed to support this with legacy rescue attempts (rescue device first etc) | 13:27 |
lyarwood | stable device rescues of bfv instances works just fine and is tested in tempest | 13:27 |
lyarwood | I'll write up a regression test and add some logic in the compute API to avoid this | 13:28 |
lyarwood | https://docs.openstack.org/nova/latest/user/rescue.html#instance-rescue I did at least call it out in the docs | 13:29 |
*** vishalmanchanda has quit IRC | 13:30 | |
*** sapd1_x has quit IRC | 13:33 | |
*** sapd1_x has joined #openstack-nova | 13:34 | |
*** zzzeek has quit IRC | 13:34 | |
sean-k-mooney | lyarwood: ok so for the bug i guess we coudl do two things. one explain how to use stabel rescue | 13:35 |
*** tesseract has quit IRC | 13:35 | |
sean-k-mooney | and second update it to track blocking it in the api with a 400 | 13:35 |
*** tesseract has joined #openstack-nova | 13:35 | |
lyarwood | yup indeed, I'll sort both out shortly | 13:36 |
*** zzzeek has joined #openstack-nova | 13:37 | |
*** zzzeek has quit IRC | 13:40 | |
*** zzzeek has joined #openstack-nova | 13:41 | |
*** k_mouza has quit IRC | 13:42 | |
*** zzzeek has quit IRC | 13:42 | |
*** haleyb has quit IRC | 13:43 | |
*** zzzeek has joined #openstack-nova | 13:43 | |
*** haleyb has joined #openstack-nova | 13:47 | |
*** ociuhandu_ has quit IRC | 13:49 | |
*** ociuhandu has joined #openstack-nova | 13:50 | |
*** macz_ has joined #openstack-nova | 13:52 | |
*** dave-mccowan has quit IRC | 13:52 | |
*** dave-mccowan has joined #openstack-nova | 13:56 | |
*** macz_ has quit IRC | 13:57 | |
*** ociuhandu has quit IRC | 13:59 | |
*** ociuhandu has joined #openstack-nova | 14:01 | |
*** ociuhandu has quit IRC | 14:06 | |
*** mkrai has quit IRC | 14:07 | |
lyarwood | sean-k-mooney: https://bugs.launchpad.net/nova/+bug/1926375 seen this? | 14:07 |
openstack | Launchpad bug 1926375 in OpenStack Compute (nova) "nova-compute service failed to start up" [Undecided,New] | 14:07 |
*** ociuhandu has joined #openstack-nova | 14:09 | |
*** ociuhandu has quit IRC | 14:12 | |
*** ociuhandu has joined #openstack-nova | 14:13 | |
*** ociuhandu has quit IRC | 14:13 | |
*** ociuhandu has joined #openstack-nova | 14:14 | |
*** ociuhandu has quit IRC | 14:19 | |
sean-k-mooney | lyarwood: i can see why that would happen | 14:21 |
sean-k-mooney | if the binding profile in the neutorn port was currupted and the pci_slot key was removed that would happen | 14:22 |
gibi | sean-k-mooney: yeah, but we cannot really prove anyithing as the logs does not have an instance uuid or port uuid to map it back to the neutron state | 14:24 |
sean-k-mooney | well we can see plugin='ovs',port_profile=VIFPortProfileOpenVSwitch | 14:25 |
sean-k-mooney | and .plug_hw_veb | 14:25 |
sean-k-mooney | so this is hardware offloaded ovs which does not support trusted VF | 14:25 |
gibi | ohh, so we only support trusted VF with the sriov agent? | 14:28 |
gibi | I did not know that | 14:28 |
sean-k-mooney | if it works with anything else its by acident wew never extended support to anything else | 14:28 |
gibi | it seems they are using it :) | 14:30 |
*** ociuhandu has joined #openstack-nova | 14:33 | |
sean-k-mooney | well im pretty sure that it wont work the way they intend | 14:34 |
sean-k-mooney | it might allow them to change the mac adress but ovs wont make the port promisous and the security group rules will still be enforced | 14:35 |
*** belmoreira has joined #openstack-nova | 14:37 | |
*** dave-mccowan has quit IRC | 14:38 | |
*** ociuhandu has quit IRC | 14:38 | |
sean-k-mooney | gibi: ok i have marked it as incomplete for now https://bugs.launchpad.net/nova/+bug/1926375 | 14:40 |
openstack | Launchpad bug 1926375 in OpenStack Compute (nova) "nova-compute service failed to start up" [Undecided,Incomplete] | 14:40 |
gibi | sean-k-mooney: ack, thanks | 14:41 |
*** links has joined #openstack-nova | 14:44 | |
*** jraju__ has quit IRC | 14:44 | |
*** dave-mccowan has joined #openstack-nova | 14:44 | |
*** k_mouza has joined #openstack-nova | 14:44 | |
*** dklyle has joined #openstack-nova | 14:47 | |
*** macz_ has joined #openstack-nova | 14:54 | |
*** zzzeek has quit IRC | 15:09 | |
*** xek_ is now known as xek | 15:09 | |
*** zzzeek has joined #openstack-nova | 15:10 | |
*** ociuhandu has joined #openstack-nova | 15:21 | |
*** tesseract has quit IRC | 15:26 | |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Fix bond_mode enum 802.1ad -> 802.3ad https://review.opendev.org/c/openstack/nova/+/788790 | 15:49 |
gibi | sean-k-mooney: could you double check that I understood this right ^^ ? | 15:50 |
*** gyee has joined #openstack-nova | 15:52 | |
gibi | nova meeting starts in 8 minutes in #openstack-meeting-3 | 15:52 |
*** gokhani has quit IRC | 15:55 | |
gibi | dansmith: regarding yesterday service version check discussion. You said that you think the version alias is wrong. I've double checked and I don't see the problem. 54 was the first Wallaby service version. 52 is the first Victoria service version | 15:57 |
sean-k-mooney | gibi: am sure | 15:58 |
*** ociuhandu_ has joined #openstack-nova | 15:58 | |
openstackgerrit | Merged openstack/nova master: zuul: Replace grenade and nova-grenade-multinode with grenade-multinode https://review.opendev.org/c/openstack/nova/+/778885 | 15:58 |
openstackgerrit | Merged openstack/nova master: zuul: Remove nova-dsvm-multinode-base https://review.opendev.org/c/openstack/nova/+/778908 | 15:59 |
lyarwood | \o/ | 16:00 |
sean-k-mooney | gibi: ya 802.3ad is lacp bonding 802.1 is QinQ | 16:00 |
gibi | lyarwood: ++ | 16:00 |
gibi | sean-k-mooney: thanks | 16:00 |
*** ociuhandu has quit IRC | 16:02 | |
*** mlavalle has joined #openstack-nova | 16:02 | |
*** lucasagomes has quit IRC | 16:03 | |
*** ociuhandu_ has quit IRC | 16:03 | |
sean-k-mooney | it shows how frequetly people use the network jsone templating if we are only seeing this now | 16:04 |
sean-k-mooney | this is used to validate https://docs.openstack.org/nova/latest/configuration/config.html#DEFAULT.injected_network_template right | 16:05 |
gibi | sean-k-mooney: ooh, I did not know about that | 16:06 |
gibi | but I guess yes | 16:06 |
openstackgerrit | Ghanshyam proposed openstack/nova master: DNM: testing bionic drop https://review.opendev.org/c/openstack/nova/+/788791 | 16:07 |
sean-k-mooney | well im not sure what we might be using the network.json for other then the network metadta consumed by cloud init which i think is generated by that template | 16:07 |
gibi | it make sense | 16:10 |
gibi | btw sean-k-mooney you promised on the PTG to add periodic jobs for placement and then we can look at the results in regularly on the nova meeting | 16:11 |
gibi | let me know if you need help adding them | 16:12 |
sean-k-mooney | oh i should be in the meeting didn realise it started | 16:12 |
gibi | :) | 16:12 |
sean-k-mooney | https://review.opendev.org/c/openstack/placement/+/787508 | 16:13 |
sean-k-mooney | you looked at the patch already | 16:13 |
gibi | sean-k-mooney: lol, I totally forgot | 16:14 |
gibi | thanks again | 16:15 |
*** hamalq has joined #openstack-nova | 16:22 | |
*** ociuhandu has joined #openstack-nova | 16:23 | |
*** hamalq has quit IRC | 16:23 | |
*** hamalq has joined #openstack-nova | 16:24 | |
elod | lyarwood: I've checked the placement patch meanwhile and +2+W'd as it looks appropriate | 16:26 |
lyarwood | ack thanks | 16:26 |
gmann | elod: lyarwood its base patch too -https://review.opendev.org/c/openstack/placement/+/787525/2 | 16:26 |
*** ociuhandu has quit IRC | 16:27 | |
gibi | and if you are reviewing placement already could you please hit this too https://review.opendev.org/c/openstack/placement/+/787508 ? | 16:29 |
gibi | simple job addition | 16:29 |
elod | gmann: +2+W'd, too | 16:29 |
gmann | elod: thanks | 16:31 |
elod | np | 16:31 |
gmann | gibi: lgtm, +A | 16:33 |
gmann | gibi: sean-k-mooney that is good idea. I will try this for few of QA repos too where we do not have frequent changes and end up finding failing gate if there is any change. | 16:34 |
sean-k-mooney | gmann: the weekly pipeline give a nice cadence to them too | 16:37 |
*** zzzeek has quit IRC | 16:37 | |
kashyap | lyarwood: Hey | 16:38 |
sean-k-mooney | being realistinc peopel are not going to check them daily and if its a 2 second line item in the weekly meeting just to make sure the last build ran and it was green then i think its useful | 16:38 |
kashyap | lyarwood: We don't use "virtio-blk,scsi=on" as a SCSI passthrough, right? From my `grep`ing around, we don't... | 16:38 |
*** zzzeek has joined #openstack-nova | 16:39 | |
kashyap | lyarwood: I ask because, QEMU folks pinged me to tell that support for it was removed in Linux v5.6; and QEMU deprecated it in 5.0 | 16:39 |
gmann | sean-k-mooney: true, we check periodic jobs in QA office hours weekly and it will hell to add these type of jobs in less-active repo | 16:39 |
sean-k-mooney | kashyap: i dont belive we do | 16:39 |
kashyap | Cool; figured as much | 16:39 |
*** macz_ has quit IRC | 16:40 | |
sean-k-mooney | gmann: i have tought about doing it for os-vif in the past but i tened to test that trasitivly enough in my local devstack but its useful in any repo that does not ahve a patch proplsed at least once a week | 16:41 |
gmann | +1, what i encounter is trying to add some change but see gate is already failing for long and we did not knw about it and end up spending time on that in release time or so :) | 16:43 |
*** BLZbubba has joined #openstack-nova | 16:44 | |
BLZbubba | hi guys, when i create a uefi image with openstack image create and specify "os_secure_boot=disabled", this property is ignored and nova forces the uefi secboot bios instead. interestingly it is only ignored when I also specify "hw_firmware_type=uefi" so it must be happening at a higher level than just glance.... any idea how to stop nova from trying the secboot version of ovmf? | 16:47 |
BLZbubba | apart from the gross hack I've been using, which is rm /usr/share/OVMF/OVMF_CODE.secboot.fd | 16:48 |
BLZbubba | the glance people said to ask here | 16:49 |
*** dtantsur is now known as dtantsur|afk | 16:49 | |
sean-k-mooney | BLZbubba: so secure boot shoudl be disabled by default | 16:58 |
*** iurygregory has quit IRC | 16:58 | |
sean-k-mooney | if you jsut set hw_firmware_type=uefi it should use the non secure boot firmware | 16:58 |
*** rpittau is now known as rpittau|afk | 16:59 | |
sean-k-mooney | unless the recnelty added secure boot support feature regressed that behavior | 16:59 |
sean-k-mooney | stephenfin: kashyap ^ | 16:59 |
stephenfin | BLZbubba: what version of nova? | 17:00 |
*** derekh has quit IRC | 17:01 | |
stephenfin | sean-k-mooney: Prior to the secure boot feature, we defaulted the secure boot firmware https://github.com/openstack/nova/commit/363710b655434a15b6b85d9ca65343210b104e56 | 17:02 |
sean-k-mooney | stephenfin: but we did not enable secureboot in the xml so it was disabled in the geust | 17:02 |
stephenfin | though we didn't set the necessary flags, so it should be using the firmware but not enabling the feature | 17:02 |
stephenfin | yes | 17:02 |
sean-k-mooney | so the firmwar was capable fo secure boot but qemu woudl not try to use it | 17:03 |
sean-k-mooney | so we should still have the same behavior | 17:03 |
stephenfin | yes | 17:03 |
sean-k-mooney | BLZbubba: is this breaking you some how? | 17:03 |
sean-k-mooney | BLZbubba: os_secure_boot prior to the wallaby relesae was only suppoted by hyperv | 17:04 |
BLZbubba | I'm using ubunbu lts: 2:21.1.2-0ubuntu1 | 17:05 |
sean-k-mooney | so ussuri | 17:06 |
BLZbubba | yes | 17:06 |
sean-k-mooney | in that release the libvirt dirver does not support securebot or os_secure_boot | 17:06 |
sean-k-mooney | only the hyperv driver supported os_secure_boot at that time | 17:07 |
sean-k-mooney | BLZbubba: your vms are not actlly using secure boot in this case | 17:07 |
BLZbubba | ok thanks for the info. I just know that if OVMF_CODE.secboot.fd is in the libvirt xml (which it is by default) they fail to boot. but it sounds like it should work... i'll play around with the qemu options and see if I can figure this out. | 17:10 |
BLZbubba | thanks! | 17:10 |
sean-k-mooney | sound like there is a bug in the ovmf package that canonical is shiping in 20.04 | 17:11 |
sean-k-mooney | i think they recently rebased the version fo qemu they ship in the cloud archive | 17:12 |
sean-k-mooney | maybe that broke something | 17:12 |
*** k_mouza has quit IRC | 17:16 | |
*** k_mouza has joined #openstack-nova | 17:17 | |
*** andrewbonney has quit IRC | 17:19 | |
*** k_mouza has quit IRC | 17:22 | |
*** ralonsoh has quit IRC | 17:26 | |
erbarr | hello, what could be causing this, I stacked last night with FORCE_CONFIG_DRIVE on ussuri and ran into it. Didn't see it on other branches up to train https://usercontent.irccloud-cdn.com/file/p1kmwH0W/image.png | 17:28 |
sean-k-mooney | its a python 3 issue | 17:29 |
sean-k-mooney | in this case i thik its cause by libguestfs | 17:30 |
sean-k-mooney | lyarwood: gibi ^ is that the libguestfs issue ye were seeing in the gate | 17:30 |
lyarwood | that's ironic so I wouldn't think so | 17:31 |
lyarwood | and configdrive related | 17:31 |
*** iurygregory has joined #openstack-nova | 17:31 | |
*** iurygregory has quit IRC | 17:31 | |
sean-k-mooney | well maybe its form makeiso | 17:32 |
openstackgerrit | Merged openstack/placement stable/wallaby: Add a reproduction test for bug story/2008831 https://review.opendev.org/c/openstack/placement/+/787525 | 17:32 |
*** zzzeek has quit IRC | 17:32 | |
sean-k-mooney | its the same bytes vs str issue | 17:32 |
lyarwood | hard to tell from that trace tbh | 17:32 |
lyarwood | yeah | 17:32 |
lyarwood | it's that underlying issue | 17:32 |
sean-k-mooney | so that being raised form here https://github.com/openstack/nova/blob/stable/ussuri/nova/virt/ironic/driver.py#L544-L548 | 17:34 |
sean-k-mooney | but i feel like the excption is actully happening in ironic | 17:34 |
sean-k-mooney | and they are just propagating it up to us | 17:35 |
sean-k-mooney | erbarr: do you ahve any error in the ironic conductor for the deploy | 17:35 |
*** zzzeek has joined #openstack-nova | 17:36 | |
*** iurygregory has joined #openstack-nova | 17:38 | |
openstackgerrit | Merged openstack/placement stable/wallaby: Make sure the policy upgrade check get a valid config https://review.opendev.org/c/openstack/placement/+/787526 | 17:39 |
erbarr | sean-k-mooney, looks similar and that's the only errors there: https://usercontent.irccloud-cdn.com/file/aPWzwmpk/image.png | 17:43 |
sean-k-mooney | yep the node is the same | 17:45 |
sean-k-mooney | so its an ironic bug | 17:45 |
openstackgerrit | Merged openstack/placement master: Add weekly jobs https://review.opendev.org/c/openstack/placement/+/787508 | 17:45 |
erbarr | okay, thanks | 17:46 |
sean-k-mooney | https://opendev.org/openstack/ironic/src/branch/stable/ussuri/ironic/conductor/deployments.py#L351-L360 | 17:46 |
sean-k-mooney | the file is being opened in text mode | 17:46 |
sean-k-mooney | mode=wt | 17:47 |
sean-k-mooney | so im not sure why its saying it need a byte like obejct | 17:47 |
sean-k-mooney | oh there we go https://opendev.org/openstack/ironic/commit/0da73cdd30f6726548992090bb3626292e3518b0 | 17:47 |
sean-k-mooney | erbarr: your missing ^ | 17:47 |
sean-k-mooney | erbarr: do you have [deploy]configdrive_use_object_store=True set | 17:49 |
sean-k-mooney | that was only merge 2 months ago so you likely are just missing that in your deployment but that shoudl fix your issue | 17:50 |
erbarr | i stacked yesterday, so does that go in ironic.conf? | 17:51 |
erbarr | that is set to true :( | 17:52 |
sean-k-mooney | when you say stacked you mean devstack. unless you have RECLONE=True set it wont update your repos | 17:52 |
sean-k-mooney | so you might need to manually do a pull on the ironic one | 17:53 |
sean-k-mooney | becareful with RECLONE=ture if you do developemnt in the repos in /opt/stack | 17:53 |
sean-k-mooney | since it will delete them | 17:53 |
erbarr | it was a fresh vm | 17:54 |
sean-k-mooney | well not quite but it will erase any work you dont have commited so if you use it you shoudl keep the reposyou work on somehere else. | 17:54 |
sean-k-mooney | erbarr: ok well maybe the have regressed it | 17:54 |
erbarr | i'll check the code and play around with it, thanks! | 17:55 |
sean-k-mooney | erbarr: i would doble check the file to be sure and if you see mode=wt then ping the ironic channel and ask | 17:55 |
sean-k-mooney | no worries hopfully its something tirvial | 17:55 |
sean-k-mooney | erbarr: dtantsur|afk: wrote the previous fix so they will like know where to look to fix it if its still happening | 17:56 |
erbarr | cool, thanks! | 17:57 |
*** dave-mccowan has quit IRC | 18:06 | |
*** dave-mccowan has joined #openstack-nova | 18:07 | |
*** macz_ has joined #openstack-nova | 18:09 | |
*** macz_ has quit IRC | 18:14 | |
*** belmoreira has quit IRC | 18:16 | |
*** links has quit IRC | 18:24 | |
*** tbachman has quit IRC | 18:28 | |
*** zzzeek has quit IRC | 18:41 | |
*** zzzeek has joined #openstack-nova | 18:42 | |
*** k_mouza has joined #openstack-nova | 19:17 | |
*** k_mouza has quit IRC | 19:23 | |
*** macz_ has joined #openstack-nova | 19:25 | |
*** macz_ has quit IRC | 19:25 | |
*** macz_ has joined #openstack-nova | 19:26 | |
*** ociuhandu has joined #openstack-nova | 19:47 | |
*** ociuhandu has quit IRC | 19:54 | |
*** tbachman has joined #openstack-nova | 19:57 | |
*** jobewan has quit IRC | 20:03 | |
*** amodi has quit IRC | 20:04 | |
*** amodi has joined #openstack-nova | 20:15 | |
*** amodi has quit IRC | 20:17 | |
*** amodi has joined #openstack-nova | 20:24 | |
*** amodi has quit IRC | 20:26 | |
*** amodi has joined #openstack-nova | 20:32 | |
openstackgerrit | melanie witt proposed openstack/nova stable/queens: Use subqueryload() instead of joinedload() for (system_)metadata https://review.opendev.org/c/openstack/nova/+/761814 | 20:36 |
*** macz_ has quit IRC | 20:37 | |
*** slaweq has quit IRC | 21:21 | |
*** rcernin has joined #openstack-nova | 21:44 | |
*** martinkennelly has quit IRC | 21:51 | |
*** rcernin has quit IRC | 21:56 | |
*** rcernin has joined #openstack-nova | 22:02 | |
*** haleyb has quit IRC | 22:20 | |
*** haleyb has joined #openstack-nova | 22:22 | |
*** luksky has quit IRC | 22:24 | |
*** rcernin has quit IRC | 22:37 | |
*** martinkennelly has joined #openstack-nova | 22:49 | |
*** whoami-rajat has quit IRC | 23:13 | |
*** k_mouza has joined #openstack-nova | 23:18 | |
*** rcernin has joined #openstack-nova | 23:19 | |
*** k_mouza has quit IRC | 23:22 | |
*** tosky has quit IRC | 23:40 | |
*** martinkennelly has quit IRC | 23:48 | |
*** k_mouza has joined #openstack-nova | 23:57 |
Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!