opendevreview | Merged openstack/nova master: Detect host renames and abort startup https://review.opendev.org/c/openstack/nova/+/863920 | 07:12 |
---|---|---|
dvo-plv | Hello, All. Could you please review our new changes in the next blueprint https://review.opendev.org/c/openstack/nova-specs/+/859290 | 08:33 |
opendevreview | Maxim Monin proposed openstack/nova master: Server Rescue leads to Server ERROR state if base image is deleted https://review.opendev.org/c/openstack/nova/+/872385 | 08:33 |
sean-k-mooney | hum ^ is expected if you dont pass an image to use instead | 08:41 |
sean-k-mooney | dvo-plv: we are well after spec freeze; specs won't reopen until ~March | 08:41 |
dvo-plv | Okay, I see. We talked about our approach to adding NT NIC support, so we made a PoC and I have pushed new code to make our approach easier to understand | 08:50 |
sean-k-mooney | ok so Feature Freeze is in 2 weeks, RC1 will be about 2 weeks after that, and then the master branch will reopen for the 2023.2 release | 08:51 |
sean-k-mooney | in the interim you can rebase your spec to the 2023.2 folder once that is created and it can be reviewed | 08:52 |
sean-k-mooney | has the neutron work been accepted yet | 08:53 |
sean-k-mooney | and if so is it implemented/reviewed or pending | 08:53 |
opendevreview | ribaudr proposed openstack/os-traits master: Add 'COMPUTE_SHARE_LOCAL_FS' https://review.opendev.org/c/openstack/os-traits/+/872185 | 09:11 |
sean-k-mooney | bauzas: gibi: i'm happy with both sahid's series and dan's at this point and think we can proceed with merging both. if ye can re-review them this morning that would be great | 09:31 |
bauzas | sean-k-mooney: yup I wanted to look at it yesterday, will do it this morning | 09:32 |
sahid | o/ ++ guys, I will be around if you want me to change or add something | 09:33 |
sean-k-mooney | there were some trivial nits but i was fine with a followup patch for those | 09:33 |
sahid | sure I will do that | 09:34 |
sean-k-mooney | at this point i would prefer to land the changes so that the sdk/osc changes can merge etc. and so you can avoid any conflicts on the api or compute service version | 09:34 |
sahid | ++ | 09:35 |
opendevreview | Rodolfo Alonso proposed openstack/os-vif master: Implement "BaseCommand" result property https://review.opendev.org/c/openstack/os-vif/+/872391 | 09:53 |
sean-k-mooney | ralonsoh: ok so that is a preemptive measure to allow ovsdbapp to be updated when it's released. | 09:55 |
ralonsoh | sean-k-mooney, yes | 09:56 |
sean-k-mooney | you're cutting it kind of close | 09:56 |
ralonsoh | sean-k-mooney, I know that, that's why I'm speeding it | 09:56 |
sean-k-mooney | well, more that the non-client lib freeze is thursday week | 09:57 |
opendevreview | Kashyap Chamarthy proposed openstack/nova stable/xena: libvirt: Add a workaround to skip compareCPU() on destination https://review.opendev.org/c/openstack/nova/+/871975 | 09:57 |
opendevreview | Kashyap Chamarthy proposed openstack/nova stable/xena: Add a workaround to skip hypervisor version check on LM https://review.opendev.org/c/openstack/nova/+/851205 | 09:57 |
opendevreview | Kashyap Chamarthy proposed openstack/nova stable/xena: libvirt: At start-up rework compareCPU() usage with a workaround https://review.opendev.org/c/openstack/nova/+/872011 | 09:57 |
ralonsoh | sean-k-mooney, it is in two weeks | 09:57 |
sean-k-mooney | so both os-vif and ovsdbapp need to have this merged by then | 09:57 |
sean-k-mooney | ralonsoh: no that is feature freeze | 09:57 |
sean-k-mooney | not the non-client lib freeze | 09:57 |
sean-k-mooney | the non-client lib freeze is the 9th | 09:58 |
sean-k-mooney | https://releases.openstack.org/antelope/schedule.html | 09:58 |
bauzas | sahid: sean-k-mooney: series sent to the gate with an ask for a FUP for 2 nits | 09:58 |
sean-k-mooney | thanks. if you still have energy to review, i tested dan's series this morning | 09:59 |
sean-k-mooney | it worked as expected although one error could be better | 10:00 |
sean-k-mooney | again i think that's fixable in a follow up too | 10:00 |
sean-k-mooney | so i dont think we need to wait for that | 10:00 |
sean-k-mooney | ralonsoh: i'll try and loop back to the os-vif change once ci has run | 10:01 |
ralonsoh | sean-k-mooney, thanks a lot | 10:01 |
sean-k-mooney | i know it's not actually used right now so it can't break, just want to make sure everything else is fine with it | 10:01 |
opendevreview | Jorge San Emeterio proposed openstack/nova master: Dividing global privsep profile https://review.opendev.org/c/openstack/nova/+/871729 | 10:02 |
bauzas | sean-k-mooney: working hard on cutting the fake sysfs dir btw. for my own series | 10:05 |
bauzas | definitely too large for our gate | 10:05 |
sean-k-mooney | you only need a small subset of it currently | 10:06 |
sean-k-mooney | if you need me to generate new data i can | 10:07 |
bauzas | sean-k-mooney: yeah, I need to cut some numbers, unless you have another smaller sysfs, my proposal is just to drop a few cpus and related info | 10:32 |
sean-k-mooney | you can drop entire trees in the fake file system | 10:36 |
sean-k-mooney | like the numa nodes and memory | 10:36 |
opendevreview | Merged openstack/nova stable/zed: Improving logging at '_allocate_mdevs'. https://review.opendev.org/c/openstack/nova/+/871413 | 10:37 |
bauzas | once I'm done with downstream stuff, I'll cut | 10:38 |
opendevreview | Jorge San Emeterio proposed openstack/nova master: WIP: Moving privsep profiles to nova/__init__.py https://review.opendev.org/c/openstack/nova/+/872010 | 10:48 |
gibi | dansmith sean-k-mooney: I've approved the rest of the stable compute uuid series. | 10:55 |
sean-k-mooney | cool | 10:56 |
bauzas | ditto | 10:56 |
sean-k-mooney | did you have any issues or concerns | 10:56 |
bauzas | even the top patch which was WIP yesterday ? | 10:56 |
gibi | I dont see any wip patches | 10:56 |
sean-k-mooney | dan pushed stuff yesterday evening | 10:56 |
sean-k-mooney | after you signed off | 10:56 |
gibi | https://review.opendev.org/q/topic:bp%252Fstable-compute-uuid | 10:56 |
sean-k-mooney | i woke up at 5 am today so i reviewed and tested all the new patches this morning | 10:57 |
gibi | I think we are in good shape here | 10:57 |
sean-k-mooney | bauzas: dansmith took the suggestion of adding a STUB_COMPUTE_ID class property | 10:58 |
sean-k-mooney | https://review.opendev.org/c/openstack/nova/+/872204/5/nova/test.py#178 | 10:58 |
sean-k-mooney | and that allowed them to get the final tests working | 10:58 |
sean-k-mooney | by stubbing _ensure_existing_node_identity by default except in tests that are testing it | 10:59 |
sean-k-mooney | bauzas: if you're interested in the extra manual tests i did, my notes are here https://etherpad.opendev.org/p/Stable-compute-uuid-manual-testing#L382 | 11:00 |
gibi | sean-k-mooney: yeah I saw that, makes sense | 11:01 |
gibi | our compute start / restart logic in func test is a bit messy | 11:01 |
sean-k-mooney | ya but its a useful mess most of the time :) | 11:02 |
sean-k-mooney | gibi: did you intend to +w https://review.opendev.org/c/openstack/nova/+/872220 bauzas do you want to have a look or will i send it into the gate | 11:03 |
sean-k-mooney | gibi: you set review priority +2 which you may or may not have intended :) | 11:04 |
gibi | sean-k-mooney: my bad, fixed it | 11:10 |
gibi | sean-k-mooney, sahid: Am I correct here https://review.opendev.org/c/openstack/nova/+/858384/41/doc/api_samples/os-evacuate/v2.95/server-evacuate-find-host-req.json ? | 11:10 |
gibi | I think targetState is only part of the RPC API but not the REST API | 11:11 |
bauzas | sean-k-mooney: gibi: sorry my internal brain concurrency mechanism is currently locked with a downstream semaphore | 11:13 |
sean-k-mooney | gibi: correct, only RPC, not the REST API | 11:16 |
sean-k-mooney | gibi: it used to be in the rest api but we removed it | 11:16 |
sean-k-mooney | that tells me our api sample tests are not validating extra fields | 11:17 |
gibi | sean-k-mooney: OK, then lets fix that sample in a FUP. other than that I have no issue with the evacuate series, but I only skimmed it as it was already approved | 11:17 |
bauzas | gibi: ++ and thanks for the spot | 11:27 |
sahid | thank you guys I'm building a patch to fix all the points that you noticed | 11:30 |
opendevreview | Kashyap Chamarthy proposed openstack/nova stable/wallaby: Add a workaround to skip hypervisor version check on LM https://review.opendev.org/c/openstack/nova/+/851206 | 11:47 |
opendevreview | Kashyap Chamarthy proposed openstack/nova stable/wallaby: libvirt: At start-up rework compareCPU() usage with a workaround https://review.opendev.org/c/openstack/nova/+/872402 | 11:47 |
opendevreview | Merged openstack/nova master: compute: enhance compute evacuate instance to support target state https://review.opendev.org/c/openstack/nova/+/858383 | 11:49 |
opendevreview | Merged openstack/nova master: api: extend evacuate instance to support target state https://review.opendev.org/c/openstack/nova/+/858384 | 11:49 |
sahid | i'm not sure about what should be changed for openstacksdk and python-openstackclient? | 12:19 |
sahid | a release note would be enough? | 12:19 |
sean-k-mooney | you need to bump the max microversion | 12:20 |
sean-k-mooney | thats about it | 12:20 |
sean-k-mooney | you could add help text for evacuate | 12:21 |
sean-k-mooney | to explain the new behavior in osc | 12:21 |
sean-k-mooney | that would also be a good addition | 12:21 |
sahid | yes i was thinking about that too | 12:21 |
opendevreview | Maxim Monin proposed openstack/nova master: Server Rescue leads to Server ERROR state if base image is deleted https://review.opendev.org/c/openstack/nova/+/872385 | 12:38 |
artom | sahid, I think I'll need to add the 2.94 bump before yours though (for the FQDN hostname) | 13:05 |
artom | Don't think I need to do anything else, since we don't appear to validate the hostname anywhere in the client, so it can already be an FQDN | 13:05 |
artom | sahid, actually, I suspect you can just bump directly to 2.95 and be done with it | 13:05 |
artom | Yeah, we don't do anything clientside | 13:07 |
*** dasm\|off is now known as dasm | 13:44 | |
sahid | artom: thank you ! | 13:51 |
opendevreview | Sahid Orentino Ferdjaoui proposed openstack/nova master: fup: support evacuate target state https://review.opendev.org/c/openstack/nova/+/872413 | 13:56 |
sahid | artom: i think i don't get where we should bump this version ? | 13:58 |
opendevreview | Jean-Sébastien Bevilacqua proposed openstack/nova master: Add Lustre support to nova https://review.opendev.org/c/openstack/nova/+/853786 | 14:12 |
artom | sahid, I don't know off the top of my head either, maybe I'll do both when I find it | 14:24 |
artom | sahid, so 2.95 doesn't actually change anything in the API itself, there's just a new default instance state? | 14:49 |
artom | after evacuation? | 14:49 |
opendevreview | Maxim Monin proposed openstack/nova master: Server Rescue leads to Server ERROR state if base image is deleted https://review.opendev.org/c/openstack/nova/+/872385 | 15:03 |
opendevreview | Artom Lifshitz proposed openstack/python-novaclient master: Bump microversion to 2.95 https://review.opendev.org/c/openstack/python-novaclient/+/872418 | 15:09 |
artom | sahid ^^ | 15:09 |
artom | Hrmm, so how do we make this work for openstackclient? AFAICT there is no max microversion declaration anywhere | 15:11 |
artom | Does it just magically work if users pass --os-compute-api-version=2.95? | 15:12 |
artom | Oh, and we have no multinode functional tests for osc | 15:15 |
artom | So we can't even test 2.95 | 15:15 |
* artom does a func test for 2.94 only | 15:20 | |
opendevreview | Artom Lifshitz proposed openstack/nova-specs master: Amend FQDN in hostname spec to reflect implementation https://review.opendev.org/c/openstack/nova-specs/+/872422 | 15:26 |
*** ksambor is now known as NICK-afk | 16:08 | |
*** rpittau is now known as elfosardo | 16:34 | |
*** elfosardo is now known as rpittau | 16:43 | |
opendevreview | Dan Smith proposed openstack/nova master: Protect against a deleted node id file https://review.opendev.org/c/openstack/nova/+/872204 | 16:47 |
opendevreview | Dan Smith proposed openstack/nova master: Move comment about _destroy_evacuated_instances() https://review.opendev.org/c/openstack/nova/+/872348 | 16:47 |
opendevreview | Artom Lifshitz proposed openstack/python-novaclient master: Bump microversion to 2.95 https://review.opendev.org/c/openstack/python-novaclient/+/872418 | 16:55 |
opendevreview | Stephen Finucane proposed openstack/nova master: db: Remove legacy migrations https://review.opendev.org/c/openstack/nova/+/872428 | 17:08 |
opendevreview | Stephen Finucane proposed openstack/nova master: db: Remove the legacy 'migration_version' table https://review.opendev.org/c/openstack/nova/+/872429 | 17:08 |
stephenfin | sean-k-mooney: gibi: that should fix SQLA 2.0 compat ^ | 17:08 |
stephenfin | I think it's okay to drop them completely. We've supported automatic migration to alembic since Wallaby. Antelope will be 5 releases later which spans even the biggest fast-forward upgrade interval. Also, even with FFU we expect folks to run DB upgrades on each version so | 17:19 |
opendevreview | Dan Smith proposed openstack/nova master: Check our nodes for hypervisor_hostname changes https://review.opendev.org/c/openstack/nova/+/872220 | 17:23 |
opendevreview | Dan Smith proposed openstack/nova master: Protect against a deleted node id file https://review.opendev.org/c/openstack/nova/+/872204 | 17:23 |
opendevreview | Dan Smith proposed openstack/nova master: Move comment about _destroy_evacuated_instances() https://review.opendev.org/c/openstack/nova/+/872348 | 17:23 |
opendevreview | Dan Smith proposed openstack/nova master: Abort startup if nodename conflict is detected https://review.opendev.org/c/openstack/nova/+/872432 | 17:23 |
dansmith | gdi | 17:23 |
sean-k-mooney | stephenfin: ill take a look shortly | 18:11 |
opendevreview | Sylvain Bauza proposed openstack/nova master: cpu: interfaces for managing state and governor https://review.opendev.org/c/openstack/nova/+/868236 | 18:37 |
opendevreview | Sylvain Bauza proposed openstack/nova master: libvirt: let CPUs be power managed https://review.opendev.org/c/openstack/nova/+/821228 | 18:37 |
opendevreview | Sylvain Bauza proposed openstack/nova master: WIP: enable cpus when an instance is spawning https://review.opendev.org/c/openstack/nova/+/868237 | 18:37 |
bauzas | sean-k-mooney: looks like there is a discrepancy between the guest pcpu and the numa topology blob : | 18:37 |
bauzas | https://paste.opendev.org/show/bcAxuCSeroU2VHjfqpUx/ | 18:37 |
sean-k-mooney | thats the bug i reported last week | 18:40 |
sean-k-mooney | oh the power series | 18:40 |
sean-k-mooney | bauzas: let me check if you are using the right data set | 18:41 |
bauzas | nevermind, I found the root cause | 18:41 |
sean-k-mooney | ok | 18:41 |
sean-k-mooney | by the way just so you are aware | 18:41 |
sean-k-mooney | core 0 can't generally be turned off | 18:42 |
sean-k-mooney | it's special in the kernel | 18:42 |
sean-k-mooney | more generally the first core in a socket can be special in the same way | 18:42 |
bauzas | sean-k-mooney: found the discrepancy reason : https://paste.opendev.org/show/bAQTjXRRqaxXPO11HPq6/ | 18:42 |
bauzas | tl;dr: cpu_pinning property is wrong | 18:43 |
bauzas | I'll change my functest to use the pcpuset | 18:43 |
bauzas | s/use/verify | 18:43 |
sean-k-mooney | or you're reading it wrong | 18:43 |
sean-k-mooney | it looks correct to me | 18:44 |
sean-k-mooney | cpu_pinning_raw={0=0,1=1,2=2,3=3,4=6} that is a dict of logical guest core to host core | 18:44 |
sean-k-mooney | so topology.cpu_pinning returned {0, 1, 2, 3, 6} | 18:45 |
bauzas | I'm confused | 18:45 |
sean-k-mooney | which are the host cores the vm cores are pinned to | 18:45 |
bauzas | are those numbers the vcpu ones ? | 18:45 |
sean-k-mooney | the keys 0-4 | 18:45 |
sean-k-mooney | are logical guest cpu cores 0-4 | 18:45 |
sean-k-mooney | the values are the ids of the host cores those vcpus are pinned to | 18:46 |
opendevreview | Sylvain Bauza proposed openstack/nova master: WIP: enable cpus when an instance is spawning https://review.opendev.org/c/openstack/nova/+/868237 | 18:46 |
bauzas | then, the guest.vcpu.cpu set is wrong | 18:46 |
bauzas | from libvirt | 18:46 |
sean-k-mooney | can you show me the libvirt xml | 18:46 |
bauzas | gimme a sec | 18:47 |
* bauzas actually wonders what's the best for getting them from the functest | 18:47 | |
bauzas | calling the domain I guess | 18:47 |
sean-k-mooney | you should use the instance numa topology blob | 18:47 |
sean-k-mooney | that is the single source of truth | 18:48 |
bauzas | Laptop froze, had to reboot | 18:51 |
opendevreview | Dan Smith proposed openstack/nova master: Stable compute uuid functional tests https://review.opendev.org/c/openstack/nova/+/872441 | 18:52 |
*** gibi is now known as gibi_pto | 19:04 | |
opendevreview | Dan Smith proposed openstack/nova master: Stable compute uuid functional tests https://review.opendev.org/c/openstack/nova/+/872441 | 19:12 |
opendevreview | Sylvain Bauza proposed openstack/nova master: WIP: enable cpus when an instance is spawning https://review.opendev.org/c/openstack/nova/+/868237 | 19:35 |
bauzas | sean-k-mooney: updated ^ | 19:35 |
sean-k-mooney | bauzas: ack i'll take a look tomorrow | 19:38 |
sean-k-mooney | dansmith: ah i was going to ask but i see that there was an issue with ironic on your series | 19:38 |
dansmith | sean-k-mooney: only because I added that missing node file check before the ironic exclusion | 19:39 |
dansmith | before I added that, it worked fine on ironic | 19:39 |
dansmith | but it's running another job with the two reversed now, should be fine, but we should wait to be sure | 19:39 |
sean-k-mooney | ack i can take a look again tomorrow | 19:40 |
sean-k-mooney | i also see you started adding func tests in a follow up to codify some of the manual tests | 19:40 |
sean-k-mooney | oh and you addressed the compute node create traceback | 19:41 |
sean-k-mooney | cool let us know when it's ready to re-review | 19:41 |
sean-k-mooney | i think you have addressed everything i found in my manual testing at this point. i can quickly run through the list again tomorrow | 19:43 |
dansmith | sean-k-mooney: no, I can't really address the traceback (on startup) without a change to oslo.service AFAIK | 19:48 |
dansmith | but it will no longer be a trace if it happens during periodic | 19:48 |
sean-k-mooney | i was referring to https://review.opendev.org/c/openstack/nova/+/872432/1/nova/compute/manager.py | 19:49 |
sean-k-mooney | sorry not that | 19:49 |
sean-k-mooney | https://review.opendev.org/c/openstack/nova/+/872432/1/nova/compute/resource_tracker.py | 19:49 |
dansmith | sean-k-mooney: if you could re-+W this before you go, that can still merge https://review.opendev.org/c/openstack/nova/+/872220/3 | 19:49 |
dansmith | and then we'll have less for tomorrow | 19:49 |
sean-k-mooney | sure | 19:50 |
dansmith | sean-k-mooney: yeah, but in order to get service startup to abort, we still have to raise and you'll get a trace in the logs | 19:50 |
dansmith | sean-k-mooney: that one was just hit by a rebase accidentally | 19:50 |
sean-k-mooney | i was just skimming the later two patches by the way | 19:50 |
sean-k-mooney | dansmith: ok and the agent will abort on start in that case? | 19:50 |
dansmith | yes, | 19:51 |
sean-k-mooney | i'll test it tomorrow either way, just wondering what to expect | 19:51 |
sean-k-mooney | cool | 19:51 |
dansmith | the reason it wasn't before is we swallow and ignore Exception, except for specific ones, so now this makes InvalidConfiguration abort, if startup=True | 19:51 |
sean-k-mooney | presumably in a decorator | 19:51 |
dansmith | so that patch is mostly just to make sure we catch the duplicate error specifically, turn it into InvalidConfiguration, and then allow InvalidConfiguration on startup to abort us | 19:52 |
simondodsley | Question from a customer using Queens (I know EOL and unsupported), is there a way to migrate a boot volume to a new backend wit ha shutdown instance? | 19:52 |
dansmith | no | 19:52 |
dansmith | just in our own wrapper around update_available_resource() | 19:52 |
sean-k-mooney | ah ok | 19:52 |
dansmith | you'll see when you look closer.. it was fairly obvious, there are just lots of layers | 19:52 |
sean-k-mooney | ack | 19:53 |
sean-k-mooney | simondodsley: so by ha shutdown instance you mean instance ha is in use but the instance is stopped | 19:53 |
simondodsley | i believe that is what they are asking | 19:53 |
sean-k-mooney | simondodsley: if it's a boot from volume guest i would still expect a cinder volume retype or volume migration could be used | 19:53 |
sean-k-mooney | nova does not have any apis for this so you would have to ask the cinder folks i think | 19:54 |
simondodsley | they tried the cinder retype and it didn't work. Nova got confused and attached the new volume with the wrong vd device | 19:54 |
simondodsley | or is that definitely a cinder/os-brick thing? | 19:54 |
sean-k-mooney | vd device? | 19:55 |
sean-k-mooney | as in /dev/vda /dev/vdb in the guest | 19:55 |
sean-k-mooney | if so the device path in the guest is not actually used when you are using libvirt | 19:56 |
sean-k-mooney | we can't actually control that | 19:56 |
sean-k-mooney | but if you mean on the host, that sounds like a bug, but i'm not sure if it's a nova one or an os-brick/cinder one | 19:56 |
sean-k-mooney | most of the host block device management is done by os-brick | 19:57 |
sean-k-mooney | dansmith: oh ya that was right in front of me https://review.opendev.org/c/openstack/nova/+/872432/1/nova/compute/manager.py#10491 | 20:01 |
dansmith | yep, that's it, and you can see earlier we raise on startup for reshape things | 20:02 |
sean-k-mooney | yep | 20:02 |
dansmith | I wasn't expecting us to have a "log and swallow" exception handler there, so it took me a while to realize why it *was* running at startup, but not stopping us | 20:02 |
sean-k-mooney | ya | 20:03 |
sean-k-mooney | ok actually going now o/ the first patch is on its way | 20:03 |
dansmith | thanks | 20:04 |
*** dasm is now known as dasm\|off | 22:40 | |
opendevreview | melanie witt proposed openstack/nova master: Reproducer for bug 2003991 unshelving offloaded instance https://review.opendev.org/c/openstack/nova/+/872470 | 22:51 |
opendevreview | melanie witt proposed openstack/nova master: Enforce quota usage from placement when unshelving https://review.opendev.org/c/openstack/nova/+/872471 | 22:51 |
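
Note on the 18:44-18:46 exchange: the pinning data quoted there maps logical guest vCPU ids (keys) to host core ids (values). The following is a minimal sketch of that reading, not nova code; the dict is transcribed from the log and the helper names `guest_vcpus` and `pinned_host_cpus` are hypothetical, introduced only for illustration.

```python
# Pinning data as quoted in the log, written as a plain Python dict.
# Keys: logical guest vCPU ids. Values: host core ids they are pinned to.
cpu_pinning_raw = {0: 0, 1: 1, 2: 2, 3: 3, 4: 6}


def guest_vcpus(pinning):
    """Return the set of logical guest vCPU ids (the dict keys)."""
    return set(pinning.keys())


def pinned_host_cpus(pinning):
    """Return the set of host core ids the guest is pinned to (the dict values)."""
    return set(pinning.values())


if __name__ == "__main__":
    print("guest vCPUs:", sorted(guest_vcpus(cpu_pinning_raw)))
    print("host cores: ", sorted(pinned_host_cpus(cpu_pinning_raw)))
```

Read this way, a test that wants the host cores in use should compare against the values, while the keys only enumerate the guest's own vCPUs.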