*** tosky has quit IRC | 00:23 | |
*** LinPeiWen has joined #openstack-nova | 00:29 | |
*** spatel has joined #openstack-nova | 00:46 | |
*** spatel has quit IRC | 00:52 | |
*** lifeless has quit IRC | 00:53 | |
*** lifeless has joined #openstack-nova | 00:55 | |
*** tbachman has quit IRC | 01:13 | |
*** tbachman has joined #openstack-nova | 01:14 | |
openstackgerrit | melanie witt proposed openstack/nova master: Enable test_volume_backed_live_migration in tempest https://review.opendev.org/c/openstack/nova/+/528104 | 01:17 |
---|---|---|
*** gyee has quit IRC | 01:25 | |
*** sapd1 has joined #openstack-nova | 01:30 | |
brinzhang0 | bauzas, sean-k-mooney: I have replied the cyborg shelve/unshleve patch, pls review again, and I think there is no need to update in, one comments need to move the comments, and I will addressed in the followed up patch | 01:30 |
*** martinkennelly has quit IRC | 01:43 | |
*** zenkuro has joined #openstack-nova | 02:04 | |
*** macz_ has joined #openstack-nova | 02:12 | |
*** spatel has joined #openstack-nova | 02:26 | |
*** spatel has quit IRC | 02:28 | |
*** psachin has joined #openstack-nova | 03:30 | |
*** hemanth_n has joined #openstack-nova | 03:38 | |
*** mkrai has joined #openstack-nova | 03:40 | |
*** mgoddard has joined #openstack-nova | 04:08 | |
*** sapd1_x has joined #openstack-nova | 04:39 | |
*** ratailor has joined #openstack-nova | 04:40 | |
*** sapd1 has quit IRC | 04:43 | |
*** ociuhandu has joined #openstack-nova | 05:08 | |
*** ociuhandu has quit IRC | 05:13 | |
*** dave-mccowan has quit IRC | 05:19 | |
*** evrardjp has quit IRC | 05:33 | |
*** evrardjp has joined #openstack-nova | 05:33 | |
*** sapd1_x has quit IRC | 05:38 | |
*** whoami-rajat__ has joined #openstack-nova | 05:41 | |
*** zzzeek has quit IRC | 05:42 | |
*** zzzeek has joined #openstack-nova | 05:43 | |
*** macz_ has quit IRC | 05:45 | |
*** zzzeek has quit IRC | 05:50 | |
*** zzzeek has joined #openstack-nova | 05:52 | |
*** sapd1_x has joined #openstack-nova | 05:57 | |
*** zzzeek has quit IRC | 05:59 | |
*** zzzeek has joined #openstack-nova | 06:00 | |
*** zzzeek has quit IRC | 06:05 | |
*** zzzeek has joined #openstack-nova | 06:11 | |
*** zzzeek has quit IRC | 06:18 | |
*** zzzeek has joined #openstack-nova | 06:19 | |
*** zzzeek has quit IRC | 06:46 | |
*** zzzeek has joined #openstack-nova | 06:49 | |
*** sapd1_x has quit IRC | 06:51 | |
openstackgerrit | Mamduh proposed openstack/os-vif stable/queens: Refactor code of linux_net to more cleaner and increase performace https://review.opendev.org/c/openstack/os-vif/+/765941 | 06:53 |
openstackgerrit | Mamduh proposed openstack/os-vif stable/queens: Fix - os-vif fails to get the correct UpLink Representor https://review.opendev.org/c/openstack/os-vif/+/765983 | 06:53 |
*** mkrai has quit IRC | 06:56 | |
*** sapd1_x has joined #openstack-nova | 07:06 | |
*** zzzeek has quit IRC | 07:09 | |
*** zzzeek has joined #openstack-nova | 07:12 | |
*** zzzeek has quit IRC | 07:21 | |
*** zzzeek has joined #openstack-nova | 07:22 | |
*** dklyle has quit IRC | 07:46 | |
*** macz_ has joined #openstack-nova | 07:46 | |
*** macz_ has quit IRC | 07:50 | |
*** zzzeek has quit IRC | 07:50 | |
*** zzzeek has joined #openstack-nova | 07:54 | |
*** mkrai has joined #openstack-nova | 07:59 | |
*** sapd1_x has quit IRC | 07:59 | |
*** rpittau|afk is now known as rpittau | 08:01 | |
*** tesseract has joined #openstack-nova | 08:03 | |
*** zzzeek has quit IRC | 08:16 | |
*** zzzeek has joined #openstack-nova | 08:19 | |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Remove dead code from SchedulerReportClient https://review.opendev.org/c/openstack/nova/+/769467 | 08:21 |
*** andrewbonney has joined #openstack-nova | 08:23 | |
*** elod_pto is now known as elod | 08:48 | |
*** CeeMac has joined #openstack-nova | 09:01 | |
*** zzzeek has quit IRC | 09:03 | |
*** zzzeek has joined #openstack-nova | 09:04 | |
*** zenkuro has quit IRC | 09:26 | |
*** mkrai has quit IRC | 09:27 | |
*** zenkuro has joined #openstack-nova | 09:27 | |
*** ociuhandu has joined #openstack-nova | 09:34 | |
*** alexe9191 has joined #openstack-nova | 09:41 | |
alexe9191 | Hello There, | 09:42 |
*** zzzeek has quit IRC | 09:42 | |
alexe9191 | I was wondering if I can run a newer version of nova-api/scheduler/conductor on an older version of nova-compute. Nova-compute would be kili for instance and the newer version of the API services would be on Pike or Rocky for instance? | 09:42 |
*** zzzeek has joined #openstack-nova | 09:43 | |
gibi | alexe9191: nova keeps compatibility between version N+1 controller and version N compute service | 09:44 |
gibi | so you can have Rocky controller services and Pike compute services | 09:45 |
gibi | sorry, so you can have Rocky controller services and Queens compute serivces | 09:45 |
*** mkrai has joined #openstack-nova | 09:46 | |
gibi | as Q + 1 = R | 09:46 |
alexe9191 | interesting, what out of curiosity. Does this also work the other way around? | 09:46 |
alexe9191 | so nova-compute R and nova-api Q ? | 09:46 |
*** macz_ has joined #openstack-nova | 09:47 | |
*** derekh has joined #openstack-nova | 09:48 | |
gibi | no, the controller services need to be upgraded first | 09:49 |
alexe9191 | I am also guessing that I can't immediately upgrade the nova-compute from kilo to rocky in one go? This also has to be done in a rolling fashion? So basically, K -> L -> etc > R on the nova-computes ? | 09:50 |
*** zzzeek has quit IRC | 09:51 | |
gibi | alexe9191: what we test is upstream is rolling upgrade as you guessed. There are some support for fast forward upgrade where you move more than one release forward in single step | 09:51 |
gibi | https://wiki.openstack.org/wiki/Fast_forward_upgrades | 09:51 |
*** martinkennelly has joined #openstack-nova | 09:52 | |
*** macz_ has quit IRC | 09:52 | |
*** zzzeek has joined #openstack-nova | 09:53 | |
alexe9191 | I honestly fail to understand that a little bit as the documentation is not really clear on fast forward upgrades | 09:54 |
*** slaweq has joined #openstack-nova | 09:55 | |
sean-k-mooney | alexe9191: fast forward upgrades are not really a feature of the comonet project more an installer feature that the compoent project like nova try to not intentionally break | 09:55 |
sean-k-mooney | stictly speaking nova only support n to n+1 | 09:55 |
alexe9191 | So basically this fast forward *feature* is more of a wrapper around what nova support ? | 09:56 |
sean-k-mooney | not quite it a way of runnign all the db updates and migration for the intermidat release so that you can skip them | 09:57 |
sean-k-mooney | on the nova side we try to keep that logic around for multiple release so that if you skip 3 version of the compute agent it will still have the compatibley code | 09:57 |
*** ociuhandu has quit IRC | 09:57 | |
sean-k-mooney | so nova "supports" fast forward upgrades | 09:58 |
alexe9191 | So for instance running the DB migration scripts from newton while one is running Kilo ? | 09:58 |
sean-k-mooney | by keeping the compat code for several release beyond when it was striclty reuired | 09:58 |
sean-k-mooney | when you do an FFU there is a period where the contolplane is in accessable. | 09:59 |
sean-k-mooney | basiclly you move the contoler services to newton while the computes are on kilo | 09:59 |
sean-k-mooney | then you move the compute to newton | 09:59 |
sean-k-mooney | when you do that in principal the newton contoler cannot manage the kilo compute but the vms keep running | 10:00 |
sean-k-mooney | when you bring the compute back up to newton manageablity is restored | 10:00 |
*** zzzeek has quit IRC | 10:00 | |
*** zzzeek has joined #openstack-nova | 10:00 | |
sean-k-mooney | in pratice if there has been no RPC major version bump between the two version you have deploy the contoler could tehoretically mange compute more then 1 releasee old | 10:01 |
sean-k-mooney | but we dont test that | 10:01 |
alexe9191 | ok, and rolling upgrade being moving everything at once from one release to the other, both the APi & the nova-compute agents, K, L, N, ... | 10:01 |
sean-k-mooney | there is typically 5-6 release betwwen major rpc version bumps | 10:01 |
alexe9191 | That's very helpful to know! | 10:02 |
*** ociuhandu has joined #openstack-nova | 10:02 | |
sean-k-mooney | alexe9191: rolling upgrades are where you move the contolplane in one go to n+1, then move the computes in a "rolling" or incremental fashion 1 by 1 | 10:02 |
sean-k-mooney | rolling upgrades allow you to maintain managmeablity of the computes with mixed verions | 10:03 |
sean-k-mooney | but we only test a delta of 1 verions | 10:03 |
sean-k-mooney | we write the code such that it could be a larger delta but the test matix is too large for use to offialy support that | 10:04 |
alexe9191 | Right, so if upgrading K to L. I could basically roll the upgrade to the compute nodes one rack at a time from K to L | 10:04 |
sean-k-mooney | yes | 10:04 |
alexe9191 | So in theory one could do N+2 but that's not tested. | 10:04 |
sean-k-mooney | yes in princiapl n->n+2 should work provided there is not an rpc major version bump in between | 10:05 |
sean-k-mooney | but really you would be the first to test that | 10:05 |
alexe9191 | I'd rather stick to what you guys are doing upstream in that case. | 10:06 |
sean-k-mooney | we have been on RPC version 5.X for about 5 releases now | 10:06 |
*** mkrai has quit IRC | 10:06 | |
sean-k-mooney | alexe9191: ya if this is a prod deployment that is likely your safest option | 10:06 |
alexe9191 | I am guessing there are major changes between K and R in RPC versions, is that documented in the release note? | 10:06 |
*** mkrai_ has joined #openstack-nova | 10:06 | |
alexe9191 | Yes, which sounds that doing the rolling-upgrades is the safest supported solution. | 10:07 |
sean-k-mooney | i think it would be in the release notes but i can quickly check the code | 10:08 |
alexe9191 | Fantastic, if I could also know which file you are checking in the source tree that would be great as well. | 10:08 |
sean-k-mooney | we have each verion bump documented with a comment https://github.com/openstack/nova/blame/master/nova/compute/rpcapi.py#L76 | 10:09 |
sean-k-mooney | https://github.com/openstack/nova/blame/master/nova/compute/rpcapi.py#L385 | 10:09 |
alexe9191 | Awesome! | 10:09 |
sean-k-mooney | the aliase are the quickest way to check | 10:09 |
alexe9191 | Interesting, so K to P is on the same Major RPC release. | 10:10 |
sean-k-mooney | so kilo was 4.0 | 10:10 |
sean-k-mooney | rocky was 5.0 | 10:10 |
sean-k-mooney | technically 5.0 is the ame as 4.22 | 10:10 |
sean-k-mooney | so if rocky still has the 4.22 compat code then you could pin the rpc version to 4.0 in principal | 10:11 |
sean-k-mooney | but it proably will have bugs | 10:11 |
alexe9191 | This is really helpful! | 10:12 |
sean-k-mooney | honestly i doubt we actully manged to keep perfect rpc compate and i suspect rocky has had the 4.x compat code removed | 10:13 |
sean-k-mooney | but looking quickly it looks like the compat code was maybe removed in stine | 10:14 |
*** zzzeek has quit IRC | 10:14 | |
sean-k-mooney | without testign it first however i would not bet on it it fully working | 10:14 |
alexe9191 | If I am not mistaken, this could mean that in theory I could run Newton API on kilo nova compute. Since the RPC API is on the same major version. | 10:15 |
sean-k-mooney | in theory yes | 10:15 |
sean-k-mooney | in practice you would likely be the first to try that and you would have to let us know if it works :) | 10:16 |
alexe9191 | I will most definitely do that :) | 10:16 |
sean-k-mooney | you would pin the RPC version of the newton contolplane to 4.0 | 10:16 |
alexe9191 | This should be done in nova.conf I could imagine ? | 10:16 |
sean-k-mooney | you can do that via the upgrade levels in the nova.conf | 10:16 |
*** zzzeek has joined #openstack-nova | 10:17 | |
alexe9191 | I am gonna let you guys know once I test that! Thanks a lot for all of the helpful information. | 10:19 |
bauzas | sean-k-mooney: alexe9191: well, Kilo is very prehistoric | 10:19 |
bauzas | and from the RPC PoV, we only started to introduce massively nova objects by this time | 10:20 |
sean-k-mooney | it is but so is newton :P | 10:20 |
bauzas | at newton, we mostly stopped to use dicts on RPC calls | 10:20 |
bauzas | while during kilo, most of the values were those | 10:20 |
sean-k-mooney | mostly but we still have a few even on master | 10:20 |
bauzas | speaking of RPC compat, I'd say a nova boot *could* work | 10:21 |
sean-k-mooney | bauzas: having random dicts of stings migth actully help in this case | 10:21 |
sean-k-mooney | provide the required data is there | 10:21 |
bauzas | maybe | 10:21 |
bauzas | anyway, all of this is very experimental | 10:22 |
bauzas | but I sincerely hope alexe9191 is not expecting live-migration to work with this env :) | 10:23 |
sean-k-mooney | oh yes it is. | 10:23 |
alexe9191 | bauzas no and this is not a production environment where I will be testing. Our of curiosity i am going to run an experiment but for production I am probably gonna do the rolling upgrade. | 10:26 |
bauzas | alexe9191: anyway, I'm very curious about your experiment results :) | 10:28 |
bauzas | I'm personnally not opposed to have operators breaking the N+1 rule by running very old computes if they wish | 10:29 |
bauzas | provided they understand the maintenance price | 10:29 |
bauzas | but... Kilo/Rocky is like a gymnast front oversplit to me :) | 10:31 |
bauzas | front side* even | 10:31 |
sean-k-mooney | one of the cost is really you can only safely use the max api version of the oldest compute | 10:31 |
sean-k-mooney | as in microversion | 10:31 |
sean-k-mooney | you might be able to use later microverions but if they needed an rpc change it wont work | 10:32 |
sean-k-mooney | we will backlevel the request but you wont get teh behavior the new microverion implies | 10:32 |
gibi | stephenfin: I finished reviewing the migration compacting series, nice work! | 10:33 |
stephenfin | gibi: Thanks :) | 10:33 |
gibi | I have couple of questions in the middle but overall I think we are good | 10:33 |
stephenfin | If you want me to review anything, just let me know. I'm almost done on the os-hypervisors fixup (many rabbit holes) | 10:33 |
stephenfin | Sweet. Will take a look shortly | 10:33 |
sean-k-mooney | bauzas: by the way i have not got around to doing the full code review for the cyborg shevle patches but did my respoce make sense | 10:35 |
*** dtantsur|afk is now known as dtantsur | 10:35 | |
sean-k-mooney | regarding the conductor chagnes | 10:35 |
*** zzzeek has quit IRC | 10:35 | |
bauzas | sean-k-mooney: I'll look at them later today, I was on a meeting just before and I want to rebase my own routed networks series | 10:37 |
sean-k-mooney | no worries | 10:37 |
bauzas | in other words, I lag :D | 10:37 |
*** slaweq has quit IRC | 10:38 | |
*** zzzeek has joined #openstack-nova | 10:38 | |
gibi | stephenfin: there is a trivial review https://review.opendev.org/c/openstack/nova/+/769467 you can lok at | 10:38 |
sean-k-mooney | hehe well i was indening to do a fully review yesterady and havent started other then to respond ot your comments so your still ahead of me | 10:39 |
stephenfin | gibi: have it open already :) | 10:39 |
gibi | besides that I have nothing :) | 10:40 |
gibi | I need to do a bunch of rebases | 10:40 |
sean-k-mooney | speaking of rebases i wil be rebasing my vdpa spec in a day or so but if people have time it would be nice if ye could review that spec https://review.opendev.org/c/openstack/nova-specs/+/764999 and also the numa port polices spec https://review.opendev.org/c/openstack/nova-specs/+/765901 | 10:49 |
sean-k-mooney | i intend to start pushing some standalone patches for those this week namely the xml generation for vdpa and some constants for the newtuon extensions | 10:50 |
sean-k-mooney | the larger change will still be a few weeks out but i would like to start submitting an merging some of the smaller patches which would be dead code but easilly reviewable while i work on the more complex part but the spec need to be approved first before that can happen | 10:51 |
*** mkrai_ has quit IRC | 11:03 | |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Use the non polling notification waiter in func test https://review.opendev.org/c/openstack/nova/+/758445 | 11:07 |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Create a fixture around fake_notifier https://review.opendev.org/c/openstack/nova/+/758446 | 11:07 |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Use NotificationFixture for legacy notifications too https://review.opendev.org/c/openstack/nova/+/758448 | 11:07 |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Test the NotificationFixture https://review.opendev.org/c/openstack/nova/+/758450 | 11:09 |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Move fake_notifier impl under NotificationFixture https://review.opendev.org/c/openstack/nova/+/758451 | 11:09 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: api: Drop statistics-style fields from os-hypervisors https://review.opendev.org/c/openstack/nova/+/764040 | 11:16 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: tests: Remove 'test_extended_hypervisors' https://review.opendev.org/c/openstack/nova/+/769519 | 11:16 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: api: Normalize exception handling for os-hypervisors https://review.opendev.org/c/openstack/nova/+/769520 | 11:16 |
stephenfin | gibi: Done ^ Sorry for the delay | 11:17 |
* stephenfin looks at DB patches | 11:17 | |
gibi | on it | 11:17 |
*** masterpe has quit IRC | 11:20 | |
*** LinPeiWen has quit IRC | 11:25 | |
*** masterpe has joined #openstack-nova | 11:29 | |
openstackgerrit | Sylvain Bauza proposed openstack/nova master: Add requested_networks field to RequestSpec object https://review.opendev.org/c/openstack/nova/+/749977 | 11:29 |
openstackgerrit | Sylvain Bauza proposed openstack/nova master: WIP: Add a routed networks scheduler pre-filter https://review.opendev.org/c/openstack/nova/+/749068 | 11:29 |
sean-k-mooney | does anywone know if shareing resouce providers actully work by the way | 11:43 |
sean-k-mooney | its in relateion to http://lists.openstack.org/pipermail/openstack-discuss/2021-January/019647.html | 11:43 |
sean-k-mooney | specifically for the RBD image backend | 11:43 |
sean-k-mooney | if we used provider.yaml to set the local disk_gb inventory to 0 and created a shareing resocue provider with a disk_gb inventory for the ceph pool | 11:44 |
lyarwood | I don't think anyone was working on that | 11:44 |
sean-k-mooney | then added the MISC_SHARES_VIA_AGGREGATE trait to the RP and adedd the RP and the comptue to an aggreate would it work | 11:45 |
sean-k-mooney | lyarwood: they have not been for like 2 years | 11:45 |
sean-k-mooney | but i tought the placment side was done | 11:45 |
sean-k-mooney | i just dont know if anyone has ever tested it | 11:45 |
gibi | sean-k-mooney: we don't have functional or other integration tests for it so I assume that it does not work | 11:45 |
lyarwood | no idea sorry | 11:46 |
sean-k-mooney | hehe ya that my base assumtion with openstack too | 11:46 |
sean-k-mooney | if its not tested its broken | 11:46 |
sean-k-mooney | if it did work technially operators/installer tooll could configure it and actully get correct reporting for shared storage this way | 11:47 |
sean-k-mooney | the real question is will the placement allocation candiate include the inventory form the sharing provider | 11:47 |
sean-k-mooney | using the request we currently make | 11:48 |
sean-k-mooney | it "should" but i have no idea if it will actully work | 11:48 |
sean-k-mooney | placment has functrion api test for shared resouces https://github.com/openstack/placement/blob/master/placement/tests/functional/gabbits/shared-resources.yaml | 11:49 |
sean-k-mooney | https://github.com/openstack/placement/blob/master/placement/tests/functional/gabbits/shared-resources.yaml#L135-L143 | 11:50 |
sean-k-mooney | it looks like it should work | 11:50 |
sean-k-mooney | if you read those top to bottom | 11:51 |
gibi | sean-k-mooney: I'm not sure that our resize and migrate logic works properly with sharing providers | 11:52 |
sean-k-mooney | if i had time i would really like to test that and write that up in docs because it would be greate do document how to correctly deploy using sharing agggreats to model storage | 11:52 |
gibi | currently we copy the allocation including the part that is allocated from sharing providers | 11:53 |
gibi | even if the same sharing provider shares with the new destination | 11:53 |
sean-k-mooney | we would just do double allcoations then | 11:53 |
sean-k-mooney | which i think is ok | 11:53 |
sean-k-mooney | it would reconsile the vaules wehn we do resize confirm | 11:53 |
gibi | hm, yes | 11:54 |
sean-k-mooney | i mean its not perfect but its not dangourous | 11:54 |
sean-k-mooney | it would jsut cause some failures wehn near capasity | 11:54 |
*** ociuhandu has quit IRC | 11:59 | |
*** evrardjp has left #openstack-nova | 12:00 | |
*** ociuhandu has joined #openstack-nova | 12:00 | |
gibi | stephenfin: left feedback in https://review.opendev.org/c/openstack/nova/+/764040 | 12:00 |
sean-k-mooney | gibi: do we have any ci tests using provider.yaml | 12:01 |
gibi | sean-k-mooney: hm, good question | 12:01 |
gibi | I don't think we have tempest tesd | 12:01 |
gibi | test | 12:02 |
sean-k-mooney | im wondering if i could maybe test this theory with a change to the ceph job | 12:02 |
gibi | we have some functional test | 12:02 |
gibi | https://github.com/openstack/nova/blob/ccb2e11129d4a0730b0ba19a7f49ed8fc5a05f6d/nova/tests/functional/compute/test_resource_tracker.py#L665 | 12:03 |
*** ociuhandu has quit IRC | 12:05 | |
sean-k-mooney | specificly i was thinking of configure both compute nodes to report 0 disk_gb and having a pretest hook create the sharing resouce provider | 12:05 |
gibi | I think it can be done | 12:06 |
sean-k-mooney | ya i think it would be worth testing at least | 12:07 |
sean-k-mooney | ill have to try and make time to do it. | 12:08 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: api: Drop statistics-style fields from os-hypervisors https://review.opendev.org/c/openstack/nova/+/764040 | 12:08 |
stephenfin | gibi: done | 12:08 |
lyarwood | sean-k-mooney: I'd be interested in helping with that | 12:09 |
lyarwood | once I've finished this machine type stuff | 12:10 |
stephenfin | gibi++ thanks :) | 12:11 |
stephenfin | gmann: When you're around, could do with some input on https://review.opendev.org/c/openstack/nova/+/765798/ I'm kind of lost as to what the next steps are /o\ | 12:12 |
sean-k-mooney | hehe i was hoping you would say that. long term it would be nice to be able to do it automatically but if this worked we could do it via an install and docuemnt it for this release and figure out how to make it work out of the box in the future | 12:12 |
gibi | stephenfin: further comment in https://review.opendev.org/c/openstack/nova/+/764040 | 12:12 |
*** ratailor has quit IRC | 12:14 | |
lyarwood | sean-k-mooney: yup agreed | 12:14 |
*** tosky has joined #openstack-nova | 12:15 | |
*** lpetrut has joined #openstack-nova | 12:17 | |
*** slaweq has joined #openstack-nova | 12:17 | |
sean-k-mooney | the main downside to this approch is without a reshape this would only work for new installs | 12:19 |
sean-k-mooney | but it would work for new installs so that would at least be progress | 12:19 |
sean-k-mooney | for existing deployments we would need to reshape the allcoation form the compute node rp to the shareing one | 12:20 |
sean-k-mooney | or sharing ones, you coudl have 1 ceph cluster per az or cell or something consiveably | 12:21 |
sean-k-mooney | aggrates would take care of modeling that however | 12:21 |
lyarwood | yeah the upgrade case isn't going to be fun to handle tbh | 12:23 |
*** sapd1 has joined #openstack-nova | 12:24 | |
sean-k-mooney | no matter what we do i dont see that improving much. sure if we can write a reshap but it will he complicated | 12:25 |
sean-k-mooney | i suspect a nova manage command would acatully be a better approch | 12:25 |
*** ociuhandu has joined #openstack-nova | 12:26 | |
*** ociuhandu has quit IRC | 12:26 | |
sean-k-mooney | where you spcify the sharing rp and the compute host to reshape | 12:26 |
sean-k-mooney | hum actully i guess placment-mange not nova mange as this is all on the placement side | 12:27 |
*** ociuhandu has joined #openstack-nova | 12:27 | |
*** ociuhandu has quit IRC | 12:28 | |
*** ociuhandu has joined #openstack-nova | 12:29 | |
*** ociuhandu has quit IRC | 12:35 | |
*** zzzeek has quit IRC | 12:46 | |
*** zzzeek has joined #openstack-nova | 12:49 | |
*** ociuhandu has joined #openstack-nova | 12:51 | |
*** martinkennelly has quit IRC | 12:54 | |
*** ociuhandu has quit IRC | 13:02 | |
*** hemanth_n has quit IRC | 13:06 | |
*** ociuhandu has joined #openstack-nova | 13:07 | |
*** brinzhang0 has quit IRC | 13:12 | |
*** ociuhandu has quit IRC | 13:12 | |
*** slaweq has quit IRC | 13:15 | |
*** sapd1 has quit IRC | 13:19 | |
*** ociuhandu has joined #openstack-nova | 13:33 | |
sean-k-mooney | lyarwood: by the way have you reviewd https://review.opendev.org/c/openstack/cinder-specs/+/766732 | 13:35 |
sean-k-mooney | its propsoing extending os-brick to spawn a deamon process ot monitor and heal NVMEoF volules created over MD raids | 13:36 |
sean-k-mooney | e.g. havign it actily monitoing the underlying raid config and healing it | 13:37 |
*** ociuhandu has quit IRC | 13:37 | |
sean-k-mooney | unfortunetly that is now approved on the cinder cide but i dont think this should be in the scope of os-brick to do personlally | 13:37 |
sean-k-mooney | gibi: lyarwood did this come up in the nova cinder cross project dicusstion at the ptg? | 13:38 |
sean-k-mooney | gibi: lyarwood i dont see any nova review on the spec at all | 13:38 |
sean-k-mooney | this is the os-brick patch https://review.opendev.org/c/openstack/os-brick/+/768576 | 13:39 |
sean-k-mooney | gibi: we had some discussion with qqmber about this on monday | 13:41 |
lyarwood | sean-k-mooney: iirc it came up in the cinder track ahead of the nova track starting up | 13:50 |
lyarwood | sean-k-mooney: it's definitently not in n-cpu's wheel house to look after stuff like this so I'm not sure where you would have it if not os-brick? | 13:51 |
sean-k-mooney | in a standalone agnet | 13:52 |
sean-k-mooney | the fact is it would be executing in the n-cpu process | 13:52 |
sean-k-mooney | granted as a child process spawned by os-brick | 13:53 |
sean-k-mooney | but in any case it would be in the nova-compute contianer | 13:53 |
*** alexe9191 has quit IRC | 13:53 | |
sean-k-mooney | for me it makes far more sense to make this its own standalone deamon | 13:53 |
*** ociuhandu has joined #openstack-nova | 13:55 | |
lyarwood | Yeah I get that but to rearch that layer between os-brick and another local agent would be a huge amount of work | 13:55 |
lyarwood | ah wait | 13:55 |
lyarwood | isn't that the proposal anyway | 13:55 |
* lyarwood opens the spec again | 13:55 | |
sean-k-mooney | the propsoal is to add an agent into os-brick and have os-bick spawn the agent into a new process from the nova compute agent | 13:56 |
sean-k-mooney | so a seperate deamon coudl use os-brick to do all the work | 13:56 |
sean-k-mooney | what i dont like is the magic spawning of a seperate process . granted we kind of do that for privsep but that is differnt | 13:57 |
sean-k-mooney | the lifetime fo rhte privesep process is related to the lifetiem of the nova-compute service | 13:57 |
lyarwood | right | 13:57 |
sean-k-mooney | the lifetime or this agent would not be | 13:57 |
lyarwood | sorry I had this the wrong way around | 13:58 |
lyarwood | I thought they already had a deamon somewhere | 13:58 |
* lyarwood adds the spec to his list to review in full later | 13:58 | |
sean-k-mooney | its already approved on the cinder side | 13:58 |
sean-k-mooney | i would have -1 it if it wasnt merged. | 13:59 |
*** nweinber has joined #openstack-nova | 14:10 | |
*** dave-mccowan has joined #openstack-nova | 14:11 | |
openstackgerrit | Stephen Finucane proposed openstack/nova master: api: Drop statistics-style fields from os-hypervisors https://review.opendev.org/c/openstack/nova/+/764040 | 14:13 |
stephenfin | gibi: third time lucky :D ^ | 14:14 |
openstackgerrit | Lee Yarwood proposed openstack/nova-specs master: libvirt: Update instance machine type stash spec https://review.opendev.org/c/openstack/nova-specs/+/769547 | 14:42 |
openstackgerrit | Lee Yarwood proposed openstack/nova master: WIP libvirt: Record the machine_type of instances in system_metadata https://review.opendev.org/c/openstack/nova/+/767533 | 14:42 |
openstackgerrit | Lee Yarwood proposed openstack/nova master: WIP nova-manage: Add commands for managaing instance machine type https://review.opendev.org/c/openstack/nova/+/769548 | 14:42 |
sean-k-mooney | stephenfin: how is your review queue at the moment | 14:59 |
stephenfin | sean-k-mooney: Manageable. What's up? | 15:00 |
sean-k-mooney | can you take a look at this small bug fix https://review.opendev.org/c/openstack/nova/+/767368 and when you have time my spec for port numa polices https://review.opendev.org/c/openstack/nova-specs/+/765901 | 15:01 |
sean-k-mooney | i still have to update the vdpa spec but that what im going to do now | 15:01 |
stephenfin | sure thing | 15:02 |
gibi | sean-k-mooney: if nova needs to know (communicate with) a new agent (spawned by os-brick) then we need a nova spec about it. If the os-brick change is transparent to nova then meh | 15:02 |
gibi | sean-k-mooney: do we define what is in a nova compute container upstream? | 15:03 |
gibi | stephenfin: will look in a sec | 15:03 |
sean-k-mooney | personally im a little uncomfortably with our dependient libs effectivly injecting arbitry code that executes as a persitent deamon | 15:03 |
sean-k-mooney | gibi: no nova does not kolla/ooo/loci do | 15:04 |
sean-k-mooney | we do try to keep it at 1 process per container for the most part bar privsep | 15:04 |
gibi | then those projects needs to be involved to the dicussion as they are impacted by a new agent | 15:04 |
gibi | stephenfin: I'm +2 on https://review.opendev.org/c/openstack/nova/+/764040 | 15:05 |
sean-k-mooney | the way its proposed it would jsut run in the continer without there interventions or knowladge | 15:05 |
sean-k-mooney | although in the contier case the new agent would still implictly share the same lifetime as the nova agent since tehy are in the same container | 15:05 |
stephenfin | gibi: Great. Thanks! | 15:06 |
gibi | stephenfin: new you are the closest to grab the microversion 2.88 | 15:06 |
gibi | sean-k-mooney: that is actually a good argument for a separate service as they want independent lifetime | 15:07 |
stephenfin | gibi: Yup. I'm going to ask gmann to take a look at that later, assuming he's back from his PTO | 15:07 |
gibi | stephenfin: cool | 15:07 |
*** sapd1 has joined #openstack-nova | 15:20 | |
*** dklyle has joined #openstack-nova | 15:26 | |
*** derekh has quit IRC | 15:27 | |
*** lpetrut has quit IRC | 15:27 | |
*** ociuhandu has quit IRC | 15:34 | |
*** mgoddard has quit IRC | 15:41 | |
*** mgoddard has joined #openstack-nova | 15:41 | |
*** spotz has quit IRC | 15:44 | |
*** ociuhandu has joined #openstack-nova | 15:46 | |
*** alexe9191 has joined #openstack-nova | 15:46 | |
*** mgoddard has quit IRC | 15:47 | |
*** ociuhandu has quit IRC | 15:56 | |
openstackgerrit | Kashyap Chamarthy proposed openstack/nova master: libvirt: Parse the 'os' element from domainCapabilities https://review.opendev.org/c/openstack/nova/+/673790 | 15:59 |
*** mlavalle has joined #openstack-nova | 16:01 | |
*** macz_ has joined #openstack-nova | 16:10 | |
*** macz_ has quit IRC | 16:10 | |
*** macz_ has joined #openstack-nova | 16:11 | |
dansmith | lyarwood: I was just about to +2 that machine type sysmeta spec rev, but saw that stephenfin already did.. I have annoying unrelated questions, one of which might be an accidental state omission | 16:11 |
dansmith | lyarwood: if it 's cool, I'll just +2 and let you quickly look if you want to throw the other state in there to avoid a rev-to-the-rev, | 16:12 |
dansmith | and I can just +W if I'm wrong or you want it separate | 16:12 |
lyarwood | dansmith: just on a call now, happy to answer your questions before this lands. I'll take a look once this is over. | 16:16 |
dansmith | lyarwood: ack | 16:16 |
*** jamesden_ is now known as jamesdenton | 16:46 | |
*** gyee has joined #openstack-nova | 16:50 | |
*** ociuhandu has joined #openstack-nova | 16:51 | |
*** ociuhandu has quit IRC | 16:52 | |
*** ociuhandu has joined #openstack-nova | 16:52 | |
*** ociuhandu has quit IRC | 16:52 | |
*** tesseract has quit IRC | 17:02 | |
*** sapd1 has quit IRC | 17:07 | |
*** mgoddard has joined #openstack-nova | 17:10 | |
*** ircuser-1 has joined #openstack-nova | 17:20 | |
*** sapd1 has joined #openstack-nova | 17:21 | |
lyarwood | dansmith: replied, need to drop for dinner but I'll respin this evening. | 17:22 |
dansmith | ack | 17:23 |
*** psachin has quit IRC | 17:23 | |
*** ociuhandu has joined #openstack-nova | 17:27 | |
gmann | stephenfin: ack, I will review today. | 17:30 |
*** ociuhandu has quit IRC | 17:33 | |
*** rpittau is now known as rpittau|afk | 17:43 | |
*** sapd1 has quit IRC | 17:47 | |
stephenfin | gmann: thanks | 17:47 |
*** dtantsur is now known as dtantsur|afk | 18:27 | |
*** zenkuro has quit IRC | 18:37 | |
*** andrewbonney has quit IRC | 18:56 | |
openstackgerrit | sean mooney proposed openstack/nova master: [WIP] test numa and vcpu topologies https://review.opendev.org/c/openstack/nova/+/769601 | 18:58 |
sean-k-mooney | melwitt:^ | 18:58 |
melwitt | thanks | 18:59 |
sean-k-mooney | i wonder is there an easy way to print the xml that would be created | 19:00 |
*** ociuhandu has joined #openstack-nova | 19:13 | |
*** tbachman has quit IRC | 19:17 | |
*** bnemec has quit IRC | 19:18 | |
sean-k-mooney | ah i can use OS_DEBUG=1 and add assert False | 19:20 |
sean-k-mooney | and it then dumps all the logs | 19:20 |
sean-k-mooney | http://paste.openstack.org/show/801461/ | 19:22 |
sean-k-mooney | the numa toplogy is correct but the cpu toplogy is not | 19:24 |
sean-k-mooney | <topology sockets="8" cores="1" threads="1"/> | 19:24 |
melwitt | a-ha, cool (about the logs) | 19:24 |
*** bnemec has joined #openstack-nova | 19:24 | |
sean-k-mooney | that was the xml for https://review.opendev.org/c/openstack/nova/+/769601/1/nova/tests/functional/libvirt/test_numa_servers.py#158 | 19:24 |
melwitt | ack | 19:25 |
sean-k-mooney | oh my extra specs are wrong | 19:25 |
sean-k-mooney | wiat did i copy those form the customer | 19:25 |
sean-k-mooney | no they have the correct ones im missing cpu_ | 19:26 |
*** ociuhandu has quit IRC | 19:26 | |
*** tbachman has joined #openstack-nova | 19:26 | |
sean-k-mooney | ok i repoduced it now | 19:28 |
sean-k-mooney | http://paste.openstack.org/show/801463/ | 19:28 |
melwitt | omg ! | 19:29 |
sean-k-mooney | that is the error | 19:29 |
melwitt | yes, looks like you've got it. sweet | 19:29 |
*** bnemec has quit IRC | 19:30 | |
melwitt | yeah there's the IndexError which is a separate thing but once that is fixed it will still be an erroneous failure to schedule. need to open I guess two bugs, one for the IndexError and one for the scheduling fail | 19:30 |
openstackgerrit | sean mooney proposed openstack/nova master: [WIP] test numa and vcpu topologies https://review.opendev.org/c/openstack/nova/+/769601 | 19:31 |
sean-k-mooney | the index error i think is jsut use trying to get the first entry in the list of toplogies | 19:32 |
*** bnemec has joined #openstack-nova | 19:32 | |
sean-k-mooney | but its empty so that is simple to fix | 19:32 |
melwitt | correct | 19:32 |
sean-k-mooney | actully just lookking a littel more up in the logs http://paste.openstack.org/show/801464/ | 19:33 |
sean-k-mooney | we have the limits printed | 19:33 |
sean-k-mooney | so it has the flaovr limits there Flavor limits 2:2:8 | 19:34 |
melwitt | yeah. that is showing on the customer case too | 19:34 |
sean-k-mooney | ya so the issue is the preference of 0,0,0 | 19:34 |
melwitt | yeah but we use 0 (on master) and -1 (on queens) to mean "no preference" | 19:35 |
sean-k-mooney | yep so we need to adjust that logic slightly | 19:35 |
sean-k-mooney | so that the closet match code works | 19:35 |
*** bnemec has quit IRC | 19:37 | |
*** bnemec has joined #openstack-nova | 19:39 | |
melwitt | sean-k-mooney: did you want to work on the fix? or did you want me to stack a change on top of your test? | 19:41 |
sean-k-mooney | im testing a quick hack but sofar its not working so if you want to stack a change on top go for it | 19:43 |
sean-k-mooney | that is what im trying currrntly but the two things i have tired so far have no effect :) | 19:43 |
melwitt | oh heh. well it's up to you | 19:44 |
sean-k-mooney | oh its becasue its using the numa toplogy object i think | 19:46 |
openstackgerrit | Merged openstack/nova master: tests: Remove 'test_extended_hypervisors' https://review.opendev.org/c/openstack/nova/+/769519 | 19:46 |
sean-k-mooney | ill give it one more try then im going for dinner and its all yours | 19:46 |
melwitt | hehe, k. well it was your idea on how to fix it :) so if you want to stick with it that wfm. if not, I can give it a try | 19:48 |
openstackgerrit | sean mooney proposed openstack/nova master: [WIP] fix max cpu topologies with numa affinity https://review.opendev.org/c/openstack/nova/+/769614 | 19:51 |
sean-k-mooney | this is probaly not right but it passes the new test i added | 19:51 |
melwitt | ack | 19:52 |
sean-k-mooney | the vcpu toplogy cannot cahnge per numa node | 19:52 |
sean-k-mooney | so i dont actully think we shoul dbe looking at the numa node there | 19:52 |
sean-k-mooney | hum i just ran all the libvirt func tests and they passed | 19:53 |
sean-k-mooney | im running the full set now | 19:53 |
sean-k-mooney | :( i hit the subunit parser bug.... | 20:00 |
sean-k-mooney | File "/opt/repos/nova/.tox/functional/lib/python3.8/site-packages/subunit/v2.py", line 227, in _write_packet | 20:01 |
sean-k-mooney | raise ValueError("Length too long: %r" % base_length) | 20:01 |
sean-k-mooney | ValueError: Length too long: 5072548 | 20:01 |
sean-k-mooney | there are dissadvatages to OS_DEBUG=1 | 20:01 |
sean-k-mooney | ok im going to have dinner and we can see what the ci says but i thinnk that working acourdign to to the func tests that have run but i think i have disabled too much | 20:03 |
sean-k-mooney | melwitt: wem might actully be able to just remove https://github.com/openstack/nova/blob/ccb2e11129d4a0730b0ba19a7f49ed8fc5a05f6d/nova/virt/hardware.py#L616-L637 | 20:07 |
sean-k-mooney | it was added by https://github.com/openstack/nova/commit/770ab8eeb72b184ac6164aeabb89c4bf45f938a9 | 20:07 |
sean-k-mooney | but in not conviced the resoning is correct | 20:07 |
melwitt | hm ok | 20:08 |
sean-k-mooney | the guest vcpu toplogy is indepenet of its numa toplogy | 20:09 |
sean-k-mooney | this patch was trying to support different guest vcpu toplogies per numa node which we have never supported | 20:10 |
sean-k-mooney | at least looking at https://github.com/openstack/nova/commit/770ab8eeb72b184ac6164aeabb89c4bf45f938a9#diff-027adff210e1ed4c63524a486cb988702337880b5c0773482200a136e391ecfbR773 | 20:11 |
sean-k-mooney | it looks like we still have some of those unit tests https://github.com/openstack/nova/blob/master/nova/tests/unit/virt/test_hardware.py#L792-L881 | 20:16 |
sean-k-mooney | the last one is definetly not requesatable via the api | 20:17 |
sean-k-mooney | https://github.com/openstack/nova/blob/master/nova/tests/unit/virt/test_hardware.py#L858-L883 | 20:17 |
*** ociuhandu has joined #openstack-nova | 20:17 | |
sean-k-mooney | oh my func test run passed locally too. | 20:17 |
sean-k-mooney | ok really getting dinner now o/ | 20:18 |
*** ociuhandu has quit IRC | 20:18 | |
*** ociuhandu has joined #openstack-nova | 20:19 | |
*** ociuhandu has quit IRC | 20:19 | |
*** ociuhandu has joined #openstack-nova | 20:20 | |
*** whoami-rajat__ has quit IRC | 20:21 | |
*** ociuhandu has quit IRC | 20:29 | |
*** ociuhandu has joined #openstack-nova | 20:34 | |
*** ociuhandu has quit IRC | 20:43 | |
openstackgerrit | Lance Bragstad proposed openstack/placement master: Bump oslo.log version to 4.3.0 https://review.opendev.org/c/openstack/placement/+/760229 | 20:53 |
*** ociuhandu has joined #openstack-nova | 20:56 | |
*** ociuhandu has quit IRC | 21:03 | |
*** jmlowe has quit IRC | 21:33 | |
*** ociuhandu has joined #openstack-nova | 21:36 | |
*** ociuhandu has quit IRC | 21:40 | |
*** ociuhandu has joined #openstack-nova | 21:44 | |
*** ociuhandu has quit IRC | 21:51 | |
*** ociuhandu has joined #openstack-nova | 21:54 | |
*** ociuhandu has quit IRC | 21:58 | |
*** ociuhandu has joined #openstack-nova | 22:10 | |
gmann | stephenfin: done, left comment for https://review.opendev.org/c/openstack/nova/+/769520/1/nova/api/openstack/compute/hypervisors.py#363 | 22:15 |
gmann | stephenfin: and other two patches of this BP also | 22:15 |
*** nweinber has quit IRC | 22:17 | |
*** ociuhandu has quit IRC | 22:19 | |
*** rcernin has joined #openstack-nova | 22:27 | |
*** alexe9191 has quit IRC | 22:42 | |
*** bnemec has quit IRC | 22:45 | |
*** zzzeek has quit IRC | 22:53 | |
*** zzzeek has joined #openstack-nova | 22:55 | |
*** bnemec has joined #openstack-nova | 23:01 | |
*** zzzeek has quit IRC | 23:08 | |
*** eharney has quit IRC | 23:09 | |
*** zzzeek has joined #openstack-nova | 23:09 | |
*** ociuhandu has joined #openstack-nova | 23:25 | |
*** ociuhandu has quit IRC | 23:30 | |
*** tosky has quit IRC | 23:51 |
Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!