opendevreview | Jun Chen proposed openstack/nova master: Catch an exception in power off procedure https://review.opendev.org/c/openstack/nova/+/817176 | 02:41 |
---|---|---|
*** ricolin_ is now known as ricolin | 04:18 | |
*** bauwser is now known as bauzas | 08:31 | |
bauzas | good morning Nova | 08:31 |
elodilles | good morning bauzas o/ i've replied to your comment here: https://review.opendev.org/c/openstack/releases/+/817606 | 08:57 |
elodilles | so if you'll have time... o:) | 08:58 |
giblet | morning | 09:00 |
*** giblet is now known as gibi | 09:02 | |
bauzas | elodilles: ack, saw your ping on Friday | 09:14 |
bauzas | elodilles: mmm, look at the last comment I just provided in https://review.opendev.org/c/openstack/releases/+/817606 | 09:16 |
opendevreview | Brin Zhang proposed openstack/nova master: WIP: Cyborg suspend/resume support https://review.opendev.org/c/openstack/nova/+/729945 | 10:25 |
kashyap | sean-k-mooney[m]: giblet: Hi, when you're about: can we move this forward? - https://review.opendev.org/c/openstack/nova/+/798680 (libvirt: Switch the default video model from 'cirrus' to 'virtio') | 10:29 |
songwenping_ | bauzas: i install nvidia-smi driver(470.82.00) for my vgpu ubuntu vm(20.04) and run nvidia-smi, but get the error NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. from the dmesg |grep -i nvidia, i found the error: probe of 0000:00:06.0 failed with error -1. | 10:30 |
songwenping_ | hava you met before and is there some solutions? | 10:30 |
kashyap | sean-k-mooney[m]: I have answered all the questions there. Please re-adjust your -1. I don't see any technical obstacles here | 10:31 |
gibi | kashyap: sure, I will re-read the patch | 10:37 |
gibi | kashyap: but you should get the bp approved https://blueprints.launchpad.net/nova/+spec/virtio-as-default-display-device | 10:37 |
kashyap | Thank you | 10:37 |
kashyap | Ohh, right | 10:37 |
opendevreview | Takashi Kajinami proposed openstack/nova master: Deprecate [wsgi] secure_proxy_ssl_header https://review.opendev.org/c/openstack/nova/+/817927 | 10:38 |
kashyap | gibi: It's a tiny patch; also see my summary of testing from 04 Aug. | 10:38 |
gibi | bauzas: ^^ | 10:38 |
kashyap | (I've got some real live migration tests done by Red Hat QE folks to have Windows + Linux guests - and it all works as expected.) | 10:39 |
opendevreview | Takashi Kajinami proposed openstack/nova master: Deprecate [wsgi] secure_proxy_ssl_header https://review.opendev.org/c/openstack/nova/+/817927 | 10:40 |
opendevreview | Takashi Kajinami proposed openstack/nova master: Deprecate [wsgi] secure_proxy_ssl_header https://review.opendev.org/c/openstack/nova/+/817927 | 10:42 |
kashyap | gibi: Unrelated: Oddly, this simple class addition is failing only in the CI, not locally here. You see what I'm missing here? - https://paste.opendev.org/show/810997/ | 10:45 |
bauzas | songwenping_: sorry I was afk | 10:46 |
bauzas | songwenping_: honestly, I don't know, it looks like a nvidia driver issue | 10:46 |
bauzas | gibi: what do you want me to look ? | 10:47 |
songwenping_ | bauzas:ok, thanks. | 10:47 |
gibi | bauzas: kashyap: would like to get a specless bp approved | 10:58 |
kashyap | bauzas: This one - https://blueprints.launchpad.net/nova/+spec/virtio-as-default-display-device | 10:59 |
bauzas | gibi: ok, let's discuss this during tomorrow's meeting | 10:59 |
bauzas | kashyap: add it to the meeting agenda | 10:59 |
bauzas | and please be around tomorrow during the meeting :) | 10:59 |
kashyap | Yes, I'll be around | 10:59 |
kashyap | Yea; thx | 10:59 |
gibi | ack, I will be on the meeting | 11:02 |
elodilles | bauzas: (sorry for the late reply) thanks, i've created the python-novaclient release: https://review.opendev.org/c/openstack/releases/+/817930 | 11:17 |
elodilles | and rebased the ussuri-em patch on top of this | 11:18 |
opendevreview | Takashi Kajinami proposed openstack/nova master: Deprecate [wsgi] secure_proxy_ssl_header https://review.opendev.org/c/openstack/nova/+/817927 | 11:43 |
sean-k-mooney | gibi: so technially the guest abi is mostly maintained when goign form cirrus to virtio as virtio has a vga compatiable interface that is compatiable with cirrus. it would be a guest visabel change | 12:22 |
sean-k-mooney | it apparently is not a harmful guest visable change | 12:22 |
sean-k-mooney | if we are ok with the change in default affectign existing instnace the kasahyap patch is workable if not then i would like him to adress the comments i left in the patch | 12:25 |
kashyap | Yep, this is a benign case. | 12:27 |
kashyap | sean-k-mooney[m]: What else comments? I addressed what you (incorrectly) thought was a problem | 12:27 |
kashyap | I don't see any outstanding questions there. | 12:27 |
kashyap | Also, recall: | 12:28 |
sean-k-mooney | you ignored all the other comment i made and you have not | 12:28 |
kashyap | In the patch comments, I've also outlined the integration tests (Windows + Linux) done by Red Hat QE as well | 12:28 |
sean-k-mooney | what i sated as a probelm is that the xml would change for existign instace after a hard reboot | 12:28 |
sean-k-mooney | it still will for vms create before the patch | 12:29 |
kashyap | I need to evaluate the merits of the other comments and see if they make sense. | 12:29 |
kashyap | Some of them maybe non-problems | 12:29 |
sean-k-mooney | https://review.opendev.org/c/openstack/nova/+/798680/3/nova/virt/libvirt/driver.py | 12:29 |
sean-k-mooney | kashyap: the disconenct is that when we discussed this with dan and other before we said you should follow the patteren we used for machine type and recorerd the current video model for all instnace and just change the behavior for new ones | 12:31 |
sean-k-mooney | now you have shown that changing the behaivor for new isntnace should be safe | 12:31 |
sean-k-mooney | that is great it does not change the fact that we had previously agreed not too | 12:31 |
kashyap | Correct | 12:31 |
kashyap | Well | 12:31 |
kashyap | We should revisit previous "agreements" if they don't make sense in light of new info | 12:32 |
sean-k-mooney | if people are ok with makeing the change we can but we should at least disucss that | 12:32 |
kashyap | Yes, but I hope it won't become a big drag. | 12:32 |
* gibi realized that his irc notification was broken, reading back | 12:33 | |
gibi | kashyap, sean-k-mooney: OK, so the cirros -> virtio change is considered harmless during a hard reboot, then I'm OK with the direction of the patch | 12:35 |
kashyap | Cool; yes. And that is proved by the tests. Thanks. | 12:35 |
kashyap | sean-k-mooney: Wait: what is still not correct in the broader direction? On hard-reboot, we recreate the XML, and yes, it *will* change from CirrOS to virtio - the QEMU folks confirmed that harmless ABI change is okay | 12:44 |
kashyap | sean-k-mooney: So what is still "not correct in this patch"? | 12:44 |
kashyap | Please keep the explanation short. | 12:44 |
* kashyap goes to comment on the patch | 12:44 | |
opendevreview | Dmitrii Shcherbakov proposed openstack/nova-specs master: Integration With Off-path Network Backends https://review.opendev.org/c/openstack/nova-specs/+/787458 | 12:50 |
sean-k-mooney | kashyap: gibi has just now indicated he is ok with the change in direction. other have not bar perhaps stephen. the patch as written assumed that change in direction would happen without disucssing it | 12:54 |
kashyap | sean-k-mooney: Okay, for that part - I'll explicitly state it in the commit message; and as a comment somewhere. | 12:54 |
sean-k-mooney | the other comment i left tell you how to modify the patch to confrom to what we had previosuly agreed. e.g. record the current used video model | 12:54 |
kashyap | sean-k-mooney: What is the use-case for still recording the used video model? | 12:55 |
kashyap | When we know that changing it even for existing instances on hard-reboot is benign and harmless? | 12:56 |
sean-k-mooney | consitency with the change fo default of machine type | 12:56 |
kashyap | Sorry, that's not a valid use-case. (While I agree in general to be consistent where it makes sense) | 12:57 |
sean-k-mooney | in this case you are instead following what we did for the rng which is ok | 12:57 |
kashyap | We should not add needless extra code for "consistency" if it is not solving any problem or improving maintainability | 12:58 |
kashyap | Yep; like for RNG indeed | 12:58 |
dmitriis | sean-k-mooney, gibi: updated the spec https://review.opendev.org/c/openstack/nova-specs/+/787458. This comment may need further discussion https://review.opendev.org/c/openstack/nova-specs/+/787458/comment/4337b974_a214c6f9/ during the implementation (I added code references and the actual log to give an impression of what I am talking about). I | 13:03 |
dmitriis | agree about the need to extend has_bind_time_event (which I added to the spec) but, from what I can see, this won't help with the actual problem of the virt driver code trying to wait for the `network-vif-plugged` event that gets discarded by Nova in the absence of a waiter. | 13:03 |
sean-k-mooney | dmitriis: the wait in the virt driver shoudl be filtered by the bindtyime/plugtime event fucntions | 13:05 |
sean-k-mooney | dmitriis: if we know that a backend only sends bind time event reliably its oke to ignore events sent at other times | 13:09 |
dmitriis | sean-k-mooney: so you are proposing that _get_neutron_events https://github.com/openstack/nova/blob/e28afc564700a1a35e3bf0269687d5734251b88a/nova/virt/libvirt/driver.py#L7191-L7198 calls get_bind_time_events https://github.com/openstack/nova/blob/e28afc564700a1a35e3bf0269687d5734251b88a/nova/network/model.py#L567-L572 for filtering, right? | 13:10 |
sean-k-mooney | dmitriis: if you modify has_bind_time_event to account for the new vnic type | 13:10 |
sean-k-mooney | https://github.com/openstack/nova/blob/e28afc564700a1a35e3bf0269687d5734251b88a/nova/virt/libvirt/driver.py#L7221-L7222 shoudl return an empty [] | 13:11 |
opendevreview | Kashyap Chamarthy proposed openstack/nova master: libvirt: Introduce config class for QEMU's "tb-cache" https://review.opendev.org/c/openstack/nova/+/816823 | 13:11 |
opendevreview | Takashi Kajinami proposed openstack/nova master: Deprecate [wsgi] secure_proxy_ssl_header https://review.opendev.org/c/openstack/nova/+/817927 | 13:12 |
sean-k-mooney | dmitriis: yes more or less | 13:12 |
opendevreview | Takashi Kajinami proposed openstack/nova master: Deprecate [wsgi] secure_proxy_ssl_header https://review.opendev.org/c/openstack/nova/+/817927 | 13:12 |
sean-k-mooney | i belive that shoudl eb using the plugtime event funciton | 13:12 |
sean-k-mooney | dmitriis: since we have already passed binding the port | 13:12 |
sean-k-mooney | dmitriis: currently it is https://github.com/openstack/nova/blob/e28afc564700a1a35e3bf0269687d5734251b88a/nova/virt/libvirt/driver.py#L7191-L7198 | 13:13 |
sean-k-mooney | dmitriis: effectvly it s implementaiton shoudl be like this https://github.com/openstack/nova/blob/0e0196d979cf1b8e63b9656358116a36f1f09ede/nova/network/model.py#L581-L586 | 13:15 |
sean-k-mooney | i suspect that is a latent bug actully | 13:15 |
opendevreview | Stephen Finucane proposed openstack/nova master: db: Remove use of 'bind' arguments https://review.opendev.org/c/openstack/nova/+/817747 | 13:16 |
opendevreview | Stephen Finucane proposed openstack/nova master: db: Remove unnecessary warning filters https://review.opendev.org/c/openstack/nova/+/817748 | 13:16 |
sean-k-mooney | actully dmitriis :) this might be the cause of our revert resize issue | 13:17 |
dmitriis | sean-k-mooney: the VIF object itself has has_bind_time_event actually https://github.com/openstack/nova/blob/e28afc564700a1a35e3bf0269687d5734251b88a/nova/network/model.py#L488-L497 . So I can extend it to accept `None` for the migration object and add handling for VNIC_TYPE since that's available in the VIF itself. | 13:17 |
dmitriis | sean-k-mooney: oh well, good thing we looked at this code then :^) | 13:17 |
sean-k-mooney | yes i would extend the vif class | 13:18 |
sean-k-mooney | artom has a repoducer patch up for the resize bug i think | 13:20 |
sean-k-mooney | so we might fix that issue for you in a seperate patch so we can backport it | 13:20 |
sean-k-mooney | this is where we repoduced our downstream bug https://review.opendev.org/c/openstack/nova/+/817303 | 13:22 |
sean-k-mooney | the revert resize test fail in a very specific situation whic i think is basially the same thing you are seeing | 13:23 |
sean-k-mooney | https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_f60/817303/2/check/nova-ovs-hybrid-plug/f60d54c/testr_results.html | 13:23 |
dmitriis | sean-k-mooney: ok, I'll fix it in my WIP spec implementation proposal https://review.opendev.org/c/openstack/nova/+/812111 and try it out on the actual test environment. We can have a separate patch to fix the issue and that will leave my patch with an extension to has_bind_time_event to handle VNIC_TYPE_SMARTNIC. | 13:24 |
sean-k-mooney | yep that makes sense to me | 13:25 |
sean-k-mooney | hum actully for the revert case we pass in the filtered events list | 13:29 |
sean-k-mooney | https://github.com/openstack/nova/blob/e28afc564700a1a35e3bf0269687d5734251b88a/nova/virt/libvirt/driver.py#L11204-L11210 | 13:29 |
opendevreview | Kashyap Chamarthy proposed openstack/nova master: Deprecate support for floppy drives https://review.opendev.org/c/openstack/nova/+/802022 | 13:30 |
sean-k-mooney | dmitriis: but this is still broken for your usecase | 13:30 |
sean-k-mooney | so we should still fix it | 13:30 |
dmitriis | sean-k-mooney: yes, it looks like the "active" state check might be there for that case but it doesn't help in my case | 13:31 |
sean-k-mooney | the active state chagne on the neutron port? | 13:31 |
dmitriis | sean-k-mooney: yes, on this line https://github.com/openstack/nova/blob/e28afc564700a1a35e3bf0269687d5734251b88a/nova/virt/libvirt/driver.py#L7198 | 13:32 |
dmitriis | this state gets updated via the info cache from what I understand | 13:32 |
sean-k-mooney | not quite | 13:33 |
sean-k-mooney | it will but that is not related | 13:33 |
sean-k-mooney | this is the admin state | 13:33 |
dmitriis | I see | 13:33 |
sean-k-mooney | we skip port that have been marked as down | 13:34 |
dmitriis | sean-k-mooney: is it OK if I extend has_bind_time_event to accept `None` for the `migration` argument? https://github.com/openstack/nova/blob/0e0196d979cf1b8e63b9656358116a36f1f09ede/nova/network/model.py#L488-L497 | 13:34 |
dmitriis | I don't have it in _get_neutron_events https://github.com/openstack/nova/blob/e28afc564700a1a35e3bf0269687d5734251b88a/nova/virt/libvirt/driver.py#L7191-L7198 and I'd rather not try to instantiate one there | 13:35 |
sean-k-mooney | i think so i have been trying to figure out what that should look like and or if you should jsut replace migration with same_host=True/False | 13:35 |
kashyap | sean-k-mooney: Hmm, this page should also list "bochs" as an option, right? - https://docs.openstack.org/glance/xena/admin/useful-image-properties.html | 13:36 |
kashyap | I think I forgot to updte the docs for that :-( I'm going to do that now | 13:36 |
sean-k-mooney | yes eventually and you should update the glance metadefs if you have not already done so | 13:36 |
kashyap | For hw_video_model, I mean | 13:36 |
* kashyap goes to check | 13:36 | |
sean-k-mooney | https://github.com/openstack/glance/blob/master/etc/metadefs/compute-libvirt-image.json#L71-L84 | 13:37 |
sean-k-mooney | that used to generate the horizon and heat dropdown menues | 13:37 |
sean-k-mooney | you should be able to do both in the same patch | 13:38 |
kashyap | Right; I updated this file: etc/metadefs/compute-libvirt-image.json and also this file: doc/source/admin/useful-image-properties.rst | 13:39 |
dmitriis | sean-k-mooney: yes, making `has_bind_time_event` accept a boolean seems cleaner than passing None. There aren't any other migration attributes used so far in it. | 13:39 |
kashyap | sean-k-mooney: I wonder if I should add a belated release note to the Glance docs? | 13:40 |
dmitriis | sean-k-mooney: I've put generic wording into the spec for now https://review.opendev.org/c/openstack/nova-specs/+/787458/12/specs/yoga/approved/integration-with-off-path-network-backends.rst#343 I think it encompasses whatever we decide implementation-wise. | 13:41 |
sean-k-mooney | dmitriis: right it also an encapsulation thing e.g. pass the minium it need rather then large objects | 13:41 |
dmitriis | sean-k-mooney: yes, agreed | 13:41 |
opendevreview | Takashi Kajinami proposed openstack/nova master: Deprecate [wsgi] secure_proxy_ssl_header https://review.opendev.org/c/openstack/nova/+/817927 | 13:43 |
sean-k-mooney | dmitriis: something like this i think woudl make sense https://paste.opendev.org/show/811000/ | 13:45 |
sean-k-mooney | actuly result &= self['vnic_type'] != 'smartnic' shoudl be result |= self['vnic_type'] == 'smartnic' | 13:46 |
kashyap | Posted the docs: https://review.opendev.org/c/openstack/glance/+/817950 (Add "bochs" as a possible video model) | 13:46 |
* kashyap backports it to the Glance's Xena branch | 13:46 | |
sean-k-mooney | i.e. if the vnic_type is smartnic then we know the vif has bind time events | 13:47 |
dmitriis | sean-k-mooney: yeah, that's certainly better than my original workaround https://git.launchpad.net/~dmitriis/+git/nova/commit/?h=2021-09-10-off-path-net-backends&id=76bd42c59a33d3c9d410444ac32d3babde472ba4 | 13:49 |
kashyap | sean-k-mooney: Thank you for the quick review | 13:52 |
Zer0Byte | hey guys | 15:37 |
Zer0Byte | im having a issue with nova for one reason i can't live migrate vms from one node to a specific node im getting this error while im trying to move to these node Failed to retrieve allocations for consumer d9f8c2ed-74b0-4eb9-8b1d-30f397ec2102: Expected to find allocations for source node resource provider f7751b40-7f22-4b2f-8d40-b1479a369969. Retry the operation without forcing a destination host. | 15:39 |
Zer0Byte | checking openstack resource provider | 15:39 |
Zer0Byte | don't show the specifc node | 15:39 |
Zer0Byte | who should create on openstack resource provider these record? | 15:39 |
sean-k-mooney | Zer0Byte: nova | 15:43 |
sean-k-mooney | are you upgrading form a deployment without placement to one with placment | 15:44 |
sean-k-mooney | if so there are some specific steps you need to do to enusre the allcoation are created | 15:44 |
Zer0Byte | no was runinng perfect i notice some performance degradation to keystone because switft that make me belive that nova can't communicate with placement on these moment | 15:45 |
sean-k-mooney | hum you suspect that the keystone authention some how did not work when it was trying to retive the allcoations | 15:46 |
sean-k-mooney | have you check the instnace in quetion to see if it has allcoation manually | 15:46 |
Zer0Byte | when the node was registered i suspect keystone was unavailable | 15:46 |
Zer0Byte | i see the node registered on nova service | 15:47 |
Zer0Byte | but not on resource provider | 15:47 |
sean-k-mooney | is the node in "openstack hypervisor list" | 15:47 |
Zer0Byte | yep | 15:48 |
sean-k-mooney | but if you do a openstack resouce provider show <compute node uuid> | 15:48 |
sean-k-mooney | it does not find the RP | 15:48 |
EugenMayer | what is the right way to segment an network into 2 subnets and let those instances reach eachother? (OVN) do i really need an router, a gateway in both subnets and the add a port of each subnet to that router? | 15:49 |
Zer0Byte | dont listed on resource provider list | 15:49 |
sean-k-mooney | EugenMayer: you need a neutorn router to interconnect the subnets | 15:49 |
Zer0Byte | don't appears on resource provider list | 15:50 |
sean-k-mooney | EugenMayer: unless you are using provider networking which you wont be if your using OVN | 15:50 |
EugenMayer | sean-k-mooney ok so that is the only option. | 15:50 |
sean-k-mooney | EugenMayer: more or less yes. if they are on the same neutron network with ovn technically they should have l2 conenctivty and be able to comunicate using the mac adress only | 15:51 |
sean-k-mooney | but arp for a non local subnet will always be sent to the current subnets routeer and the network stack withotuh a static route will assume a upstream reouter can interconnect the subnets | 15:52 |
EugenMayer | sean-k-mooney: i have an issue with the fact that i can only add a router with those 2 interfaces, if a gateway is presnet. I have a network A (sub1/sub2) and network B (sub1/sub2). I want an instance to be in A sub1 with the default gateway of Asub1, while this instnace is also in Bsub1. The problem is, if Bsub1 also has a gateway (which i need to | 15:52 |
EugenMayer | create the router so Bsub1 can reach Bsub2) the gateway of Bsub1 is pushed | 15:52 |
EugenMayer | (default gateway) | 15:52 |
sean-k-mooney | so in generall you will create a tenant network and a rounter to connect the tenant network to the external network | 15:53 |
sean-k-mooney | and if you want to have 2 subnets on that tenant network you just add a second interface to the existing router for the second subnet | 15:53 |
sean-k-mooney | Zer0Byte: ok if it does not appear in the RP list then if you check the compaute agent log on that host it should have an error | 15:54 |
sean-k-mooney | Zer0Byte: in the logs o fthe update_avaiable_resouces periodic task if not earlier | 15:55 |
EugenMayer | sean-k-mooney my problem is with the default gateway pushed, if an instance has 2 interfaces which both have a gateway | 15:56 |
sean-k-mooney | right that is a common problem | 15:56 |
sean-k-mooney | you can disable the gateway on the subnet | 15:56 |
sean-k-mooney | which should prevent that | 15:57 |
sean-k-mooney | EugenMayer: i think you do "openstack subnet unset --gateway <subnet>" | 15:58 |
EugenMayer | sean-k-mooney yes, that is what i did, then i cannot route Bsub1 to Bsub2 - you cannot add the interface | 15:59 |
EugenMayer | if a subnet has not gateway, it cannot be added to the router to route to the subnets | 15:59 |
sean-k-mooney | you can add the interface but you have to specify the ip manually to be what would be the normal gateway ip | 15:59 |
EugenMayer | so you mean override the gateway via cloud-init? | 15:59 |
sean-k-mooney | so when adding the router interface you will need to give it the .1 address or whatever adress is correct for the cidr | 16:00 |
sean-k-mooney | no | 16:00 |
sean-k-mooney | do openstack subnet unset --gateway <subnet b> | 16:00 |
Zer0Byte | RROR nova.scheduler.client.report [req-4d4167be-e58d-42eb-bf49-ffeb0a21d42c - - - - -] [req-6400ff72-11e0-4953-9980-463cfacf7df1] Failed to retrieve resource provider tree from placement API for UUID 1c684ec9-6ab2-4ef7-8451-c989a0b90f45. Got 503: {"message": "The server is currently unavailable. Please try again at a later time.<br /><br />\nThe Keystone service is temporarily unavailable.\n\n", "code": "503 Service Unavailable", | 16:00 |
Zer0Byte | "title": "Service Unavailable"}. | 16:00 |
Zer0Byte | is exactly what i suspect | 16:00 |
Zer0Byte | now my question is how i force the node to populate on placement | 16:00 |
sean-k-mooney | then do "openstack port create --network <my net> --fixed-ip subnet=<subnet>,ip-address=192.168.2.1 router_subnet_b_port" | 16:03 |
EugenMayer | sean-k-mooney sorry i'am confused. Let met try to explain my dilemma more clearly, maybe you can point me to my mistake. Network A(sub1) and Network B(sub1,sub2) exist. Now i have to options. I disable the gw on B-sub1 so if my instance is in A-sub1 and B-sub1, the default gateway of A-sub1 is used. If i disable that gw, i'am no longer able to | 16:03 |
EugenMayer | create a neutron router to bridge between B-sub1 and B-sub2, since a gateway in B-sub1 seems to be required for that | 16:03 |
sean-k-mooney | then do openstack router add port <router> <port uuid for router_subnet_b_port> | 16:04 |
sean-k-mooney | EugenMayer: if the gateway is disable i think you have to manually create the port rather then addign the subnet to the router a is descibed above ^ | 16:05 |
EugenMayer | interesting | 16:05 |
sean-k-mooney | Zer0Byte: i think you jsut need to restart the compute agent | 16:05 |
sean-k-mooney | it should try again the next time the perodic runs if your keystone issues are resovled | 16:05 |
EugenMayer | so i should stick to B sub1/sub2 with gw disabled on sub1 and try to workarround the issue to bridge Bsub1 to Bsub2 | 16:06 |
sean-k-mooney | so your real issue is that dhcp is going to fight with the default router right | 16:08 |
sean-k-mooney | normally i fix this with cloud-init manually and set the metic | 16:08 |
sean-k-mooney | what sound like would be useful would be some way to configure the metric per subnet | 16:08 |
sean-k-mooney | if you never needed to have two port on the same vm (from both subnets) this woudl not normlay be an issue | 16:09 |
EugenMayer | well i need that in my case. It is an kubernetes control-plane while A is its 'intranet' for others to access apps internally and also get routed to a floating IP. B is the cluster internal network | 16:11 |
EugenMayer | sean-k-mooney i'am not sure what your advice is. Try to go with disabling the gw and try adding the port manually or try to use cloud-init to somehow change the metric for DHCP | 16:12 |
Zer0Byte | @sean-k-mooney restarning the compute agent don;t work also no error on logs :( | 16:14 |
Zer0Byte | for understand the flow nova-agent (the service isntaled on the compute node) will contact placement directly right | 16:15 |
hyang[m] | Hi Nova core reviewers, can someone help to check the patch https://review.opendev.org/c/openstack/nova/+/811521? It can resolve a bug that affects both Nova and Neutron. | 16:39 |
sean-k-mooney | hyang[m]: that is partly a feature and partly a bug | 16:55 |
sean-k-mooney | i say its partly a feature as support for neutron rbac in nova is a new feature | 16:55 |
sean-k-mooney | its partly a bug in that that should have been part of the development of security group shareing in neutorn | 16:56 |
sean-k-mooney | there shoudl have been a sibligh spec to cover the nova changes | 16:56 |
sean-k-mooney | actully is this a new neutron featfure this cycle | 16:56 |
sean-k-mooney | if so then you shoudl convert that patch to a blueprint | 16:57 |
Zer0Byte | is very weird i ssue | 16:59 |
Zer0Byte | i put the agent on debug | 16:59 |
Zer0Byte | Final resource view: name=tc-m2mem-compute2-1210 is registering on placement | 16:59 |
Zer0Byte | but don;t show as a resource | 16:59 |
sean-k-mooney | hyang[m]: actully this should have a spec or specless bluerpint its a new feature not a bug | 16:59 |
sean-k-mooney | hyang[m]:this si not something that should be backported | 17:00 |
bauzas | folks, gentle reminder that tomorrow is spec review day | 17:05 |
bauzas | sharpen your pens | 17:05 |
gibi | bauzas: good reminder | 17:06 |
gibi | thanks | 17:06 |
bauzas | gibi: you can thank sean-k-mooney for ringing me a bell in my empty closet that's called "a brain" | 17:07 |
gibi | sean-k-mooney: thanks | 17:07 |
gibi | :D | 17:07 |
sean-k-mooney | as i said downstream the only reason i rememebred is i tought i missed it last week so i checked :) | 17:08 |
* bauzas notes he'll mark his agenda for next review day in 1 month | 17:08 | |
gibi | bauzas: I think you can abandon these https://review.opendev.org/q/project:openstack/nova-specs+status:open+label:Code-Review-2 as all of them are heavily outdated | 17:09 |
gibi | just to have a clean spec list for tomorrow | 17:10 |
Zer0Byte | any another idea @sean-k-mooney ? | 17:11 |
sean-k-mooney | Zer0Byte: have you tried openstack resource provider list --name tc-m2mem-compute2-1210 | 17:16 |
Zer0Byte | got a empty response | 17:16 |
sean-k-mooney | Zer0Byte: assuming this is libvirt what does virsh hostname return | 17:16 |
sean-k-mooney | does it have a domain appended to it? | 17:17 |
Zer0Byte | no | 17:17 |
Zer0Byte | im using juju to install the compute nodes | 17:17 |
Zer0Byte | so the configuration is always persistent | 17:17 |
sean-k-mooney | ok libvirt via the python bindign provides the hostname to nova which we use for the RP name | 17:18 |
Zer0Byte | in this case we should have a empty record with id right? | 17:18 |
Zer0Byte | uuid | 17:18 |
sean-k-mooney | well we should have a resouce provider with the hostname as the name and the compute node uuid( not the compute service uuid) as the uuid | 17:19 |
sean-k-mooney | and then if you do an inventory list you shoudl see the reouces that are tracked | 17:19 |
sean-k-mooney | so cpu ram disk ectra | 17:19 |
sean-k-mooney | Zer0Byte: if you are not seeing the RP in placment then you should be sing an error in the compute agent log trying to create the RP at some point | 17:20 |
Zer0Byte | yeah from the other nodes yes | 17:20 |
Zer0Byte | i can see the usage form inventory | 17:21 |
Zer0Byte | my doubt here is how nova is "reporting allocations" to placement without a RP uuid | 17:23 |
Zer0Byte | or to a non visible RP | 17:23 |
Zer0Byte | Instance 21a17b74-400f-4dce-99ed-555e1ae80512 actively managed on this compute host and has allocations in placement: {'resources': {'VCPU': 8, 'MEMORY_MB': 65536}}. _remove_deleted_instances_allocations /usr/lib/python3/dist-packages/nova/compute/resource_tracker.py:1538 | 17:23 |
EugenMayer | sean-k-mooney thank you for that metric trick with cloud-init. Surely not perfect since it makes the entire instance very topology aware, but i guess that info leaked into it never the less | 17:25 |
sean-k-mooney | well normally the netowrks routers and isntance are created by the tenant | 17:27 |
sean-k-mooney | so they are creating that toplogy themselves | 17:28 |
sean-k-mooney | in your case its sligly different since you are migrating/addpoting workload form your previous enviornment | 17:28 |
sean-k-mooney | i wonder if dhcp has the concept of a metiric it can pass | 17:28 |
sean-k-mooney | if so it might be worth wile asking neutron to support setting that via the subnet | 17:29 |
Zer0Byte | i will try something @sean-k-mooney the uuid from RP is auto generated or is from the ocmpute host? | 17:29 |
sean-k-mooney | it would be nice if you could jsut declare that via the subnet and not need to do it per instance | 17:29 |
sean-k-mooney | Zer0Byte: its autogenerate when the compute agent first connect to the db | 17:29 |
sean-k-mooney | Zer0Byte: you can fined it if you have access to the db by looking at the compute_nodes table in the cell db | 17:30 |
Zer0Byte | ok i wil try that | 17:30 |
sean-k-mooney | or you can get it via the rest api vai the hyperviors api | 17:30 |
Zer0Byte | recreate manually the rp | 17:30 |
Zer0Byte | to see if after i generate it i can get the allocations that the compute node is reporting | 17:31 |
EugenMayer | i'll try sean-k-mooney | 17:31 |
sean-k-mooney | to create the allocation i think you whould have to run the allcoation audit or heal allocation nova/placment manage commands | 17:33 |
*** jgwentworth is now known as melwitt | 17:34 | |
EugenMayer | interesting, there is extra_dhcp_option on the port | 17:38 |
EugenMayer | https://registry.terraform.io/providers/terraform-provider-openstack/openstack/latest/docs/resources/networking_port_v2#extra_dhcp_option | 17:38 |
EugenMayer | also here | 17:41 |
EugenMayer | https://docs.openstack.org/python-openstackclient/latest/cli/command-objects/port.html | 17:41 |
dasp | question: if I want to make a constant hardcoded in nova a configurable value instead, would tha require a spec or not? | 17:41 |
dmitriis | sean-k-mooney, gibi: I'll be around tomorrow for the spec review day in case you don't have time for a re-review until then (I resubmitted the spec so the approvals are gone) | 17:47 |
Zer0Byte | well recreted the record but still n data on it | 17:48 |
hyang[m] | sean-k-mooney: got it, thanks for your review. I'll add a blueprint for https://review.opendev.org/c/openstack/nova/+/811521. Regarding your comment https://review.opendev.org/c/openstack/nova/+/811521/comment/6954e488_0d70cd6e/ do you think the additional request in nova is acceptable given the current Neutron API behavior or the Neutron API behavior must be changed first (I'm not sure if it is possible)? | 17:48 |
Zer0Byte | @sean-k-mooney if the node appears as service in nova service probably will don't try to register on placement? | 17:57 |
opendevreview | Daniel Speichert proposed openstack/nova master: conf: add [libvirt]/no_compression_image_types = qcow2 https://review.opendev.org/c/openstack/nova/+/817995 | 17:58 |
Zer0Byte | ohh | 17:59 |
Zer0Byte | i see what happening | 17:59 |
Zer0Byte | checking on placement database | 17:59 |
Zer0Byte | as resouce_provider_id i have id 1 | 17:59 |
Zer0Byte | and the instance reported by the affected node is saving as ID 1 | 18:00 |
Zer0Byte | :s | 18:00 |
sean-k-mooney | hyang[m]: the addtional request is proably workabole but really since this is a new exttion being added this cycle i woudl jsut change it now before addign the nova support | 18:34 |
sean-k-mooney | hyang[m]:without another exteion we wont be abel to detech support for usign a signel query | 18:34 |
sean-k-mooney | hyang[m]: im going to call it a day. tl;dr if this api had already shipped in an upstream release of neutron (not neutron-lib) then i would ahve suggested addign ascond extention for the more effeicnt query but since it has not shipped yet i think we shoudl just fix it and hold the neutorn-lib release until that is done | 18:48 |
hyang[m] | sean-k-mooney: the neutron api extension and related server code is already released in both neutron-lib and neutron https://review.opendev.org/c/openstack/neutron-lib/+/812617 https://review.opendev.org/c/openstack/neutron/+/811242. Since the behavior using shared=true for SGs is same as other existing resources in Neutron like networks, I'm not sure if Neutron team will want to change it (and change for all the resources apis). | 19:11 |
opendevreview | Dmitrii Shcherbakov proposed openstack/nova-specs master: Integration With Off-path Network Backends https://review.opendev.org/c/openstack/nova-specs/+/787458 | 19:12 |
opendevreview | Dmitrii Shcherbakov proposed openstack/nova-specs master: Integration With Off-path Network Backends https://review.opendev.org/c/openstack/nova-specs/+/787458 | 19:41 |
Zer0Byte | alright | 19:50 |
Zer0Byte | i found the issue | 19:50 |
Zer0Byte | @sean-k-mooney i have two regions and by mistake someone when provisioning put placement on the second region as regionone | 19:51 |
Zer0Byte | now the issue is solved | 19:51 |
Zer0Byte | also changed the endpoints url | 19:51 |
Zer0Byte | but nova-agent is still trying to connect to second region placement | 19:52 |
Zer0Byte | found the bug | 20:09 |
Zer0Byte | =) | 20:09 |
opendevreview | Artom Lifshitz proposed openstack/nova master: api-ref: server rescue adminPass injection is conf-dependant https://review.opendev.org/c/openstack/nova/+/818022 | 21:04 |
opendevreview | Daniel Speichert proposed openstack/nova master: conf: add [libvirt]/no_compression_image_types = qcow2 https://review.opendev.org/c/openstack/nova/+/817995 | 21:19 |
opendevreview | Daniel Speichert proposed openstack/nova master: conf: add [libvirt]/no_compression_image_types = qcow2 https://review.opendev.org/c/openstack/nova/+/817995 | 21:25 |
Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!