Monday, 2020-05-11

*** brinzhang has joined #openstack-nova00:13
*** tetsuro has joined #openstack-nova00:16
*** swp20 has joined #openstack-nova00:39
*** songwenping_ has joined #openstack-nova00:41
*** swp20 has quit IRC00:45
*** Liang__ has joined #openstack-nova01:11
*** yaawang has quit IRC01:35
*** yaawang has joined #openstack-nova01:36
*** sapd1 has joined #openstack-nova01:48
*** ircuser-1 has quit IRC02:19
*** sapd1 has quit IRC03:14
*** sapd1_x has joined #openstack-nova03:14
*** slaweq has joined #openstack-nova03:17
*** slaweq has quit IRC03:22
*** tetsuro has quit IRC03:28
*** ratailor has joined #openstack-nova03:56
*** tetsuro has joined #openstack-nova04:16
*** evrardjp has quit IRC04:36
*** evrardjp has joined #openstack-nova04:36
*** belmoreira has joined #openstack-nova04:42
openstackgerritElod Illes proposed openstack/nova stable/rocky: libvirt: check job status for VIR_DOMAIN_EVENT_SUSPENDED_MIGRATED event  https://review.opendev.org/71123304:44
*** songwenping_ has quit IRC04:46
*** belmoreira has quit IRC04:46
*** huaqiang has quit IRC04:54
*** udesale has joined #openstack-nova05:15
*** vishalmanchanda has joined #openstack-nova05:29
*** songwenping_ has joined #openstack-nova05:32
*** links has joined #openstack-nova05:52
*** dpawlik has joined #openstack-nova06:10
*** dpawlik has quit IRC06:13
*** slaweq has joined #openstack-nova06:17
*** dpawlik has joined #openstack-nova06:18
*** alex_xu has joined #openstack-nova06:18
openstackgerritXinran WANG proposed openstack/os-resource-classes master: Add new resource class for QAT card.  https://review.opendev.org/72631406:40
*** lpetrut has joined #openstack-nova06:41
*** maciejjozefczyk has joined #openstack-nova06:58
*** ttsiouts has joined #openstack-nova06:59
*** tinwood is now known as tinwood-afk07:06
*** ccamacho has joined #openstack-nova07:11
*** ccamacho has quit IRC07:12
*** ccamacho has joined #openstack-nova07:29
*** tosky has joined #openstack-nova07:34
*** rpittau|afk is now known as rpittau07:36
*** ralonsoh has joined #openstack-nova07:40
*** belmoreira has joined #openstack-nova07:46
*** nightmare_unreal has quit IRC07:47
*** nightmare_unreal has joined #openstack-nova08:01
*** dtantsur|afk is now known as dtantsur08:04
gibigood morning Nova08:05
*** tinwood-afk is now known as tinwood08:05
lyarwoodMorning \o08:07
*** eandersson7 has joined #openstack-nova08:07
*** arxcruz has quit IRC08:10
*** arxcruz has joined #openstack-nova08:10
*** kukacz_ has joined #openstack-nova08:11
openstackgerritKevin Zhao proposed openstack/nova master: [WIP] CI: add tempest-integrated-compute-aarch64 job  https://review.opendev.org/71443908:12
*** kukacz has quit IRC08:15
*** eandersson has quit IRC08:15
*** eandersson7 is now known as eandersson08:15
openstackgerritKevin Zhao proposed openstack/nova master: [WIP] CI: add tempest-integrated-compute-aarch64 job  https://review.opendev.org/71443908:24
*** ttsiouts has quit IRC08:24
*** ttsiouts has joined #openstack-nova08:25
*** martinkennelly has joined #openstack-nova08:26
*** ttsiouts has quit IRC08:28
*** ttsiouts has joined #openstack-nova08:28
*** ttsiouts has quit IRC08:28
*** ttsiouts has joined #openstack-nova08:29
*** ttsiouts has quit IRC08:30
*** ttsiouts has joined #openstack-nova08:31
*** salmankhan has joined #openstack-nova08:35
*** derekh has joined #openstack-nova08:37
openstackgerritStephen Finucane proposed openstack/nova master: objects: Remove 'NovaObjectDictCompat' from 'Migration'  https://review.opendev.org/72357208:37
openstackgerritStephen Finucane proposed openstack/nova master: objects: Remove 'NovaObjectDictCompat' from 'InstancePCIRequest'  https://review.opendev.org/72357308:37
*** salmankhan has quit IRC08:42
*** salmankhan has joined #openstack-nova08:42
*** links has quit IRC08:43
*** links has joined #openstack-nova08:54
*** xek has joined #openstack-nova09:03
*** kevinz has quit IRC09:07
brinzhanggibi, bauzas: When I researching the define nova cyborg interaction notification, I found some code logical not suitable, so I09:07
brinzhangsubmmit a patch to optimize it, would you like to review https://review.opendev.org/#/c/726564/09:07
*** kevinz has joined #openstack-nova09:07
openstackgerritLiang Fang proposed openstack/nova master: [WIP] rbd patch for volume local cache  https://review.opendev.org/72676209:21
*** Liang__ has quit IRC09:30
*** ttsiouts has quit IRC09:31
*** ttsiouts has joined #openstack-nova09:32
*** ttsiouts has quit IRC09:36
bauzasgood morning Nova09:38
*** ttsiouts has joined #openstack-nova09:41
*** priteau has joined #openstack-nova09:44
*** happyhemant has joined #openstack-nova10:15
*** rpittau is now known as rpittau|bbl10:15
*** tetsuro has quit IRC10:26
stephenfinlyarwood, bauzas: would you do me the honours? https://review.opendev.org/#/c/710238/10:33
* bauzas just has read too fast10:34
stephenfinhahaha10:34
bauzas"would you do me the homous" ?10:34
* bauzas is hungry10:34
gibiI got an interesting support case downstream. Does nova calculate the disk usage of its own image cache on the compute?10:36
stephenfingibi: I don't think nova includes anything except instances in those calculation10:37
stephenfin*s10:37
gibiall the non-nova disk usage should be configured in reserved_host_disk_mb10:37
stephenfinI'd expect image cache to be included in the reserved host config10:37
stephenfinyeah10:37
gibifor that I would need to know the maximum size of the image cache10:38
bauzasstephenfin: I'm not a multi-attach specialist10:38
bauzasbut I wonder why we were avoiding QEMU>2.1010:38
gibido we have a way to maximize the size of the nova image cache?10:38
gibiI mean limit10:38
stephenfinbauzas: I think kashyap explained that to me at some point. Let me look10:38
stephenfinit's a weird conditional, for sure10:38
stephenfingibi: I'm not aware of any, but I suspect there must be something. /me looks10:39
gibiI tried to find it but I failed10:39
bauzasstephenfin: yeah I suspect something was borked10:39
bauzasbut this whole comment is confusing10:40
bauzasit's an "or" clause10:40
bauzasso in theory, we should only support multiattach if QEMU<2.1010:40
bauzasbut I suspect the wording being incorrect, hence the confusion10:40
stephenfingibi: Yeah, I can't see anything either. Sounds like a gap :-\10:42
stephenfinbauzas: okay, the context in in https://bugzilla.redhat.com/show_bug.cgi?id=137824210:45
openstackbugzilla.redhat.com bug 1378242 in libvirt "QEMU image file locking (libvirt)" [Unspecified,Closed: errata] - Assigned to pkrempa10:45
*** links has quit IRC10:45
gibistephenfin: thanks for confirming. I will do a problem reproduction and file a bug but I feel this will be considered as a feautre request10:45
stephenfintl;dr: QEMU added a feature that broke multi-attach, which necessitated a new libvirt feature to fix it again10:45
gibifrom upstream perspective10:45
bauzasstephenfin: cool, and the bug is readable without being internal10:46
*** links has joined #openstack-nova10:46
bauzasstephenfin: okay, so the libvirt version is superseding the QEMU one10:47
stephenfingibi: Perhaps. A quick look suggests we have documented multiple error codes for the API though, so if it was classified as a bug, it should be a backportable one10:47
gibistephenfin: which API you are referring to?10:48
bauzasstephenfin: worth respinning a better commit msg explaining this ?10:48
stephenfinbauzas: Yup. You either need to use an older version of QEMU, or you need a newer version of libvirt to workaround the changes in newer QEMU10:48
stephenfinbauzas: Good call. Let me do that10:48
bauzasand the fact we have a recent QEMU isn't a problem since libvirt fixes this10:48
bauzasstephenfin: thanks10:49
stephenfingibi: The host aggregate image caching API. Commit 339129870692467b703220dbc3905fd8bffe6a8310:49
gibistephenfin: ohh. This goes way beyond that. As nova cached images before that API was added.10:50
gibibased on images download for new instance boots10:51
stephenfinAh, the old just-in-time caching behavior?10:51
gibijepp10:51
gibias far as I see that is also not limited in size10:51
lyarwoodgibi: is this with the libvirt virt driver?10:52
gibilyarwood: yes libvirt10:52
gibilyarwood: the instances dir is on local file system10:52
lyarwoodgibi: right then you're correct that isn't limited AFAIK10:52
gibiisn't it even a security concern? Can I will the disk via the cache to prevent nova-compute for booting VMs?10:53
gibis/will/fill/10:53
lyarwoodgibi: that should be taken into consideration when attempting to schedule instances to the node10:54
lyarwoodgibi: the cache is just a way of sharing the base image between instances10:54
gibilyarwood: but it doesn't as far as I see10:54
lyarwoodgibi: are you providing a unique image with every request?10:55
gibilyarwood: if I have a raw base image and two qcow2 guest image then I potentially use original image.size + instance 1 flavor.disk + instance 2 flavor.disk, but nova only calculate the instance 1 flavor.disk + instance 2 flavor.disk as used10:56
openstackgerritStephen Finucane proposed openstack/nova master: libvirt: Remove MIN_LIBVIRT_MULTIATTACH  https://review.opendev.org/71023810:57
stephenfinbauzas: ^10:57
gibi(I ignore disk_available_least as placement does not use that just he old removed DiskFilter used that)10:57
lyarwoodgibi: don't we report the adjusted available space to placement taking that into account?10:58
lyarwoodgibi: the size of the RAW base file that is10:58
lyarwoodgibi: and the potential size of the two qcow2 instance disks?10:58
lyarwood_get_disk_over_committed_size_total ?10:59
lyarwoodah right11:00
lyarwoodwe don't actually report that back up sorry11:00
gibihttp://paste.openstack.org/show/793375/ I don't see adjustements in placement based on this11:00
lyarwoodewww11:01
gibilyarwood: for me either the size of the cache should be configurable (and then we can account for that in reserved_host_disk_mb) or nova needs to report cached disk usage as used in placement11:02
lyarwoodgibi: yeah I'd say the latter, I honestly thought we did already.11:02
*** martinkennelly has quit IRC11:03
gibilyarwood: this is a bug in pike for one of our customers so I also have to think about a backportable solution11:04
gibilyarwood: do you think that determining the size of the image cache is easy?11:04
gibiis it just some file system calls in the _base dir, isn't it?11:05
lyarwoodgibi: yes for file based backends11:05
gibiI have close to zero knowledge on non file based backends behavior11:06
*** jsuchome has joined #openstack-nova11:06
lyarwoodgibi: we don't cache in rbd iirc11:07
* lyarwood spins up an env to play with this11:07
gibiis there a way to turn of the cache?11:07
gibifor file based backend? (that would be a workaround in my downstream issue)11:07
lyarwoodgibi: I don't think so, you can disable the manager but I think we still cache things at creation time11:09
lyarwoodgibi: the manager just doesn't run to clean things up11:10
* lyarwood -> lunch back in 2011:10
gibilyarwood: thanks11:10
* gibi follows lyarwood's example and goes for food11:10
*** songwenping_ has quit IRC11:14
*** tbachman has joined #openstack-nova11:19
*** songwenping_ has joined #openstack-nova11:32
*** iurygregory has quit IRC11:37
*** toabctl has quit IRC11:43
openstackgerritSasha Andonov proposed openstack/nova master: rbd_utils: increase _destroy_volume timeout  https://review.opendev.org/70576411:57
*** rpittau|bbl is now known as rpittau11:58
*** iurygregory has joined #openstack-nova11:58
nightmare_unrealhow can I determine default api version used by osc CLI ? suppose if we don't specify --os-compute-api-version , which version will it take and how it's determined ?12:07
*** raildo has joined #openstack-nova12:09
lyarwoodnightmare_unreal: https://docs.openstack.org/api-guide/compute/microversions.html#version-discovery - I think there's also an osc command for that12:10
nightmare_unrealThanks :)12:10
*** priteau has quit IRC12:16
*** tkajinam has quit IRC12:31
*** eharney has joined #openstack-nova12:31
*** links has quit IRC12:38
*** links has joined #openstack-nova12:39
*** songwenping_ has quit IRC12:51
openstackgerritStephen Finucane proposed openstack/nova master: docs: Note the ``hw_numa_nodes`` image property  https://review.opendev.org/68384912:52
*** sapd1_x has quit IRC12:59
*** iurygregory has quit IRC13:01
*** nweinber has joined #openstack-nova13:02
*** iurygregory has joined #openstack-nova13:02
*** lbragstad has joined #openstack-nova13:06
*** udesale_ has joined #openstack-nova13:06
*** udesale has quit IRC13:09
*** lbragstad has quit IRC13:21
*** nweinber has quit IRC13:22
*** lbragstad has joined #openstack-nova13:22
*** nweinber has joined #openstack-nova13:22
*** artom has joined #openstack-nova13:24
*** zzzeek has quit IRC13:25
*** zzzeek has joined #openstack-nova13:26
*** zzzeek has quit IRC13:26
*** zzzeek has joined #openstack-nova13:27
*** owalsh has quit IRC13:30
*** owalsh has joined #openstack-nova13:31
openstackgerritStephen Finucane proposed openstack/nova master: SR-IOV passthrough: Check PF only if VF is enabled  https://review.opendev.org/47664213:34
jsuchomedansmith: Hi, I've updated related patchets about that direct rbd download (including the spec) could you give it another look?13:36
*** dpawlik has quit IRC13:38
*** dpawlik has joined #openstack-nova13:42
openstackgerritStephen Finucane proposed openstack/nova master: Poison netifaces.interfaces() in tests  https://review.opendev.org/67177313:46
dansmithjsuchome: yep will queue for today13:46
*** beekneemech is now known as bnemec13:47
*** ttsiouts has quit IRC13:47
*** ttsiouts has joined #openstack-nova13:48
*** ratailor has quit IRC13:49
openstackgerritStephen Finucane proposed openstack/nova master: Raise if flavor and image disagree on hide_hypervisor_id  https://review.opendev.org/66336513:51
*** ttsiouts has quit IRC13:52
*** brinzhang_ has joined #openstack-nova13:58
*** ttsiouts has joined #openstack-nova14:01
*** brinzhang has quit IRC14:02
*** brinzhang has joined #openstack-nova14:02
*** brinzhang_ has quit IRC14:03
*** awalende has joined #openstack-nova14:06
*** brinzhang_ has joined #openstack-nova14:11
*** brinzhang has quit IRC14:14
*** ttsiouts has quit IRC14:17
openstackgerritLee Yarwood proposed openstack/nova stable/train: Revert "nova shared storage: rbd is always shared storage"  https://review.opendev.org/72686114:17
openstackgerritLee Yarwood proposed openstack/nova stable/stein: Revert "nova shared storage: rbd is always shared storage"  https://review.opendev.org/72686214:17
*** ttsiouts has joined #openstack-nova14:17
openstackgerritLee Yarwood proposed openstack/nova stable/rocky: Revert "nova shared storage: rbd is always shared storage"  https://review.opendev.org/72686314:17
*** dtantsur is now known as dtantsur|brb14:17
openstackgerritLee Yarwood proposed openstack/nova stable/queens: Revert "nova shared storage: rbd is always shared storage"  https://review.opendev.org/72686414:18
*** ttsiouts has quit IRC14:21
sean-k-mooneyby the way are we planning to backport https://review.opendev.org/#/c/663365/ upstream14:23
sean-k-mooneyits the fix for bug@ #183172314:24
sean-k-mooneybug: #183172314:24
openstackbug 1831723 in OpenStack Compute (nova) "The flavor hide_hypervisor_id value can be overridden by the image img_hide_hypervisor_id" [Undecided,In progress] https://launchpad.net/bugs/1831723 - Assigned to Stephen Finucane (stephenfinucane)14:24
sean-k-mooneydownstream i think we would want to backport that as im sure customer will hit it at some point14:24
sean-k-mooneybut it might be nice to backport upstream too but im not sure its allowed14:25
sean-k-mooneylyarwood: ^ any toughts on the topic14:25
sean-k-mooneywe are just adding a namespaced version of an existing unnamsepaced extra_spec14:26
*** lpetrut has quit IRC14:26
sean-k-mooneydownstream that is not conisderd an api change since extra_specs are not part of the api14:26
lyarwoodsean-k-mooney: reading14:26
openstackgerritJames Page proposed openstack/nova stable/queens: hardware: fix memory check usage for small/large pages  https://review.opendev.org/72686714:27
sean-k-mooneylyarwood: actully that is not the patch i ment to link14:28
lyarwoodsean-k-mooney: right you had me slightly confused tbh14:29
sean-k-mooneythat one we might also want to backport but one sec14:29
lyarwoodyeah that's looks valid to backport14:29
lyarwoodthat*14:30
sean-k-mooneyhttps://review.opendev.org/#/c/722187/14:30
sean-k-mooneythat is the one i ment14:30
sean-k-mooneyfor https://bugs.launchpad.net/nova/+bug/184193214:30
openstackLaunchpad bug 1841932 in OpenStack Compute (nova) "hide_hypervisor_id extra_specs in nova flavor cannot pass AggregateInstanceExtraSpecsFilter" [Low,In progress] - Assigned to Stephen Finucane (stephenfinucane)14:30
lyarwoodsean-k-mooney: so we deprecate hide_hypervisor_id in that change but still provide backward compatability so at first glance I think we can backport this upstream?14:33
sean-k-mooneylyarwood: correct deprecated but still supported14:33
sean-k-mooneywe would have to drop the validation changes14:33
sean-k-mooneyform the patch14:33
sean-k-mooneybut the rest of it i think would be fine14:34
lyarwoodah right that only just landed14:34
openstackgerritJames Page proposed openstack/nova stable/queens: hardware: fix memory check usage for small/large pages  https://review.opendev.org/72686714:34
openstackgerritJames Page proposed openstack/nova stable/queens: Fix overcommit for NUMA-based instances  https://review.opendev.org/72686814:34
*** dklyle has joined #openstack-nova14:35
sean-k-mooneylyarwood: yep but that should be fine and easy to call out on the intiall backport patch14:35
sean-k-mooneyits pretty self contained14:35
*** jhesketh has quit IRC14:35
*** sapd1_x has joined #openstack-nova14:40
*** brinzhang_ has quit IRC14:40
*** MrWatson has quit IRC14:58
*** dtantsur|brb is now known as dtantsur14:59
*** mlavalle has joined #openstack-nova14:59
*** NostawRm has joined #openstack-nova15:00
jsuchomedansmith: thanks, I'll address the changes in spec15:01
*** jharris has joined #openstack-nova15:02
-openstackstatus- NOTICE: Our CI mirrors in OVH BHS1 and GRA1 regions were offline between 12:55 and 14:35 UTC, any failures there due to unreachable mirrors can safely be rechecked15:08
*** klindgren_ has joined #openstack-nova15:09
*** klindgren has quit IRC15:09
gibilyarwood: reported a bug for the image cache issue https://bugs.launchpad.net/nova/+bug/187802415:12
openstackLaunchpad bug 1878024 in OpenStack Compute (nova) "disk usage of the nova image cache is not counted as used disk space" [Undecided,New]15:12
gibilyarwood: could you please check if it make sense what I wrote there?15:12
gibidansmith: you worked with the image cache recently so you might be interested ^^15:13
dansmithyeah, reading now15:13
gibithanks15:13
dansmithif the op has a separate mount for the cache, this wouldn't be a problem15:13
dansmithso it's not going to affect everyone the sae15:14
dansmith*same15:14
gibiyeah, what the downstream customer has is a simple disk partition for the nova instances_path15:14
gibiand the cache is under the instances_path15:14
dansmithright, which I'm sure is common15:14
lyarwoodgibi: ack thanks just on calls for a while but will look once I'm off15:14
gibilyarwood: thanks15:15
dansmiththe problem with something like A is that when you're scheduling,15:15
dansmithyou don't know whether or not the image is already on the remote system, so you don't know whether to look for hosts with 2*$size disk space or not15:15
gibidansmith: true15:16
dansmithB doesn't really work either because you can't assume you can purge your way out of the cache limit15:16
dansmithif you boot a hundred instances from different images, you can't prevent the image cache from going over the desired size,15:17
dansmithunless you refuse to boot instances there, which people will complain about because there is plenty of disk space and not understand15:17
gibicould nova-compute periodically update a separate DISK_GB allocation in placement based on the actual size of the image cache15:18
gibi?15:18
dansmithnot periodically, but synchronously with the decision to cache an image (either during boot or otherwise)15:18
dansmithotherwise you have a race15:18
dansmithI'd have to think about that, but we'd need to only do that if the images and cache are on the same filesystem, otherwise we'd count against the wrong total15:20
gibiit will be racey anyhow due to what you said about the problem of requesting allocation for the cache during scheduling15:20
dansmithI mean racing for disk space, which could go badly if you lose, not just racing for image boot15:21
dansmithbut yes, the scheduler is never going to know whether or not to pick a host based on cache availability, so you always have that15:21
*** belmoreira has quit IRC15:21
dansmithwe could go totally crazy and create an allocation for each image, by image uuid and after selecting a host, the scheduler could check to see if there was an allocation for that image against the host's provider to decide if it thinks it will fit :)15:22
dansmiththat has some nice benefits, but it's a little crazy and there's still plenty of room for racing of course,15:22
dansmithand plenty of room for exhausting all the candidates in a small query set, leading to non-ideal looping of retries15:23
dansmithplus we'd have to have a separate provider for the cache disk if they're separate15:23
gibicomplicated indeed15:27
gibiI will pass  the workaround of having the cache on a different partition to downstream15:27
gibiat least that is something that the downstream project can do15:27
aarentsgibi: dansmith we have this issue, painfull one15:28
dansmithgibi: ack15:28
aarentsgibi yep  in some case we put cahe in another file system to get ride of this15:29
gibidansmith: the old DiskFilter had the disk_available_least info to prevent overallocation but we removed the DiskFilter15:29
dansmithyeah, fair point15:30
gibiwe might want to re-introduce something like diks_available_least as a filter? or a pre-filter with placement support?15:30
dansmithI think diskfilter had plenty of other problems, like  the other way where the filter behavior conflicted with the hypervisors listing, which definitely causes support cases15:31
*** belmoreira has joined #openstack-nova15:31
dansmithnot that they agree now, but..15:31
dansmithgibi: I don't think we'd want that to be a pre-filter because you'd have to provide either an inclusion or exclusion list of all hosts to placement each time15:32
dansmithnot like a trait or aggregate, but "any one of these hosts: [... array of 5000 ...]15:32
gibiyeah I don't want to bring back the whole DiskFilter, just bring back the extra information to the scheduler / placement that how much actual disk space is free under the instance_path15:34
gibias an idea15:34
*** belmoreira has quit IRC15:34
dansmithgibi: all I'm saying is doing it as a pre-filter is the wrong place15:34
gibiack15:35
dansmithwe might still be reporting that value such that a filter can check it15:35
*** sapd1_x has quit IRC15:37
dansmithone other option is that the cache_images() thing that I added is setup to work as a call, returning information about presence,15:38
dansmithso if we were to pass it a "don't download just check" flag, or a "download in background" flag, then we would get back an indication if it's present or not,15:38
dansmithwhich would let the scheduler know whether or not to filter out hosts with 1x or 2x the disk space15:39
dansmiththat's pretty heavy, and would likely need to be done from conductor,15:40
dansmithbut it's a little less racy than checking some minutes-old disk free amount and assuming it's going to work15:40
dansmithanother cloudy way to look at this is to say we should just have people weigh hosts on free space,15:40
dansmithin line with our "we don't schedule at capacity" project goal15:41
gibidansmith: so if the rpc call reports that the image is not cached, then we would add 2x disk space the allocation candidate query, but only allocate 1x disk space on the selected host for the instance, then on the compute side the image cache code would allocate the other 1x disk space in placement for the cache15:42
dansmithno15:42
dansmithwe've already done the a-c query at that point15:42
dansmithwe'd just use that to advise us which of the a-c are valid15:43
gibiahh yeah, you have to now which host you send the rpc call15:43
dansmithright15:43
gibithe weigher thing is good for big deployments but will fall short for edge. As far as I understand my downstream report is from a really small edge site close to capacity. :/15:46
*** links has quit IRC15:47
dansmithindeed, although I think I'd argue that for highly constrained situations the separate filesystem is the right approach there anyway, given the (a) usually constrain-able image sets for edge and (b) the need to avoid the race that we'll have in some form anyway15:47
dansmithbut yep, it's not a great answer for someone that just wants it to work ideally15:47
gibiI will definitely suggest the separate partition for now as I feel whathever solution we come up with (if any) it will not be backportable15:48
dansmithfor sure15:48
dansmithfwiw, making a pre-call to cache_images(background=True) would theoretically give us some lower time-to-boot performance in other cases15:49
dansmithI'm really not sure whether that's a terrible idea or not, but it's an interesting thought15:50
dansmithsimilar to the cyborg case of starting the programming at bind time from the conductor in parallel to the rest of the instance bringup15:50
gibiit would have a side effect to cache image to a compute that otherwise will not be selected15:50
gibihm or not15:51
dansmithin the tight case you mean right? We'd start caching an image on a host that the scheduler was going to exclude anyway, which is true15:51
*** sapd1_x has joined #openstack-nova15:52
gibiyeah for the thight case when the image would fit into the cache but the instance root disk would not any more15:52
dansmithyep, for sure15:52
gibibut that really tight15:52
gibiwhat if for thight cases we allow disabling the cache entirely? it is thight so no space for cache15:53
dansmithdoing that would require a substantial resdesign of the whole image backend for libvirt I think15:54
*** swp20 has joined #openstack-nova15:54
gibiis it because we assume that there is a backing file for the root fs image which happen to be the cached image?15:55
*** sapd1_x has quit IRC15:56
dansmithI'm actually not sure what happens if you configure qcow2 and "flatten_images" actually, I'd have to look15:56
dansmiththat might have the same effect, I'm not sure15:56
*** markguz_ has joined #openstack-nova15:58
gibido you mean force_raw_images conf option?15:59
gibior use_cow_images = False? or a linear combination of the two :)16:00
markguz_Hi nova folks. i have an instance that failed that i migrated (not live) to another compute host, but upon restarting it errors with16:00
markguz_Unsupported VIF type binding_failed convert '_nova_to_osvif_vif_binding_failed'16:00
dansmithgibi: yeah there's some combination that results in full flattening, but I'm not sure what they are16:00
dansmithgibi: not sure if tat actually results in the cache image going away, or getting copied or what16:01
dansmithactually, as to be the latter I think since we have to expand the size of it16:01
*** swp20 has quit IRC16:01
markguz_i've tried the various solutions found via google but non work16:01
dansmithso you probably end up with 2x the space initially at least, and then you'd need to immediately purge the original or something16:01
gibidansmith: ack16:02
*** gyee has joined #openstack-nova16:04
gibidansmith: hm even when the flat backend copies the image to raw it does update the cache to keep the base image https://github.com/openstack/nova/blob/d6450879c7f7dd19366b6f002301fbbf87918026/nova/virt/libvirt/imagebackend.py#L58516:05
gibidansmith: anyhow thanks for your thoughts I have to drop today soon so I will add some summary of this discussion to the bug.16:08
*** rpittau is now known as rpittau|afk16:09
dansmithgibi: yeah, that's what I was thinking above when I said "but then you'd need to immediately purge"16:10
dansmithgibi: the cache is trying to be a cache16:10
gibiyeah, I see now16:12
*** sapd1_x has joined #openstack-nova16:13
*** dtantsur is now known as dtantsur|afk16:19
openstackgerritTakashi Natsume proposed openstack/nova master: Remove six.reraise  https://review.opendev.org/72689816:23
*** swp20 has joined #openstack-nova16:24
*** swp20 has quit IRC16:29
*** swp20 has joined #openstack-nova16:30
gibilyarwood, dansmith: updated the bug 1878024 with what we talked about above.16:30
openstackbug 1878024 in OpenStack Compute (nova) "disk usage of the nova image cache is not counted as used disk space" [Undecided,New] https://launchpad.net/bugs/187802416:30
dansmithcool16:30
gibiand now I go and bake some bread for dinner16:30
gibisee you tomorrow16:31
*** hemna_ has joined #openstack-nova16:32
*** swp20 has quit IRC16:34
*** udesale_ has quit IRC16:34
*** swp20 has joined #openstack-nova16:34
*** hemna has quit IRC16:34
*** evrardjp has quit IRC16:36
*** evrardjp has joined #openstack-nova16:36
*** brinzhang has joined #openstack-nova16:37
*** amodi has quit IRC16:39
*** jharris has quit IRC16:40
*** mlavalle has quit IRC16:44
*** mlavalle has joined #openstack-nova16:46
*** ttsiouts has joined #openstack-nova16:57
*** derekh has quit IRC17:01
*** maciejjozefczyk_ has joined #openstack-nova17:06
*** swp20 has quit IRC17:07
*** swp20 has joined #openstack-nova17:08
*** maciejjozefczyk has quit IRC17:09
*** maciejjozefczyk has joined #openstack-nova17:10
*** swp20 has quit IRC17:12
*** swp20 has joined #openstack-nova17:12
*** maciejjozefczyk_ has quit IRC17:14
*** sapd1_x has quit IRC17:22
*** ttsiouts has quit IRC17:31
*** nightmare_unreal has quit IRC17:39
*** jsuchome has quit IRC17:39
*** swp20 has quit IRC17:46
*** swp20 has joined #openstack-nova17:56
*** salmankhan has quit IRC17:57
*** ttsiouts has joined #openstack-nova17:58
*** swp20 has quit IRC18:03
*** happyhemant has quit IRC18:05
*** ttsiouts has quit IRC18:10
*** ttsiouts has joined #openstack-nova18:11
*** ralonsoh has quit IRC18:11
openstackgerritMerged openstack/nova master: Support for --force flag for nova-manage placement heal_allocations command  https://review.opendev.org/71539518:15
openstackgerritMerged openstack/nova stable/queens: Include only required fields in ironic node cache  https://review.opendev.org/72486218:15
openstackgerritMerged openstack/nova stable/queens: Lowercase ironic driver hash ring and ignore case in cache  https://review.opendev.org/72305418:16
openstackgerritMerged openstack/nova stable/rocky: Add config option for neutron client retries  https://review.opendev.org/72281918:16
*** ttsiouts has quit IRC18:16
openstackgerritMerged openstack/nova master: Suppress remaining policy warnings in unit tests  https://review.opendev.org/72627218:16
markguz_anyone know how to get out of Unsupported VIF type binding_failed convert '_nova_to_osvif_vif_binding_failed' hell?18:22
*** ircuser-1 has joined #openstack-nova18:25
*** iurygregory has quit IRC18:25
*** ttsiouts has joined #openstack-nova18:38
*** iurygregory has joined #openstack-nova18:38
*** maciejjozefczyk has quit IRC18:54
*** amodi has joined #openstack-nova18:56
*** toabctl has joined #openstack-nova18:57
*** dkehn has joined #openstack-nova19:20
*** ccamacho has quit IRC19:25
*** brinzhang_ has joined #openstack-nova19:32
openstackgerritHarshavardhan Metla proposed openstack/nova master: [Nova] Add reference to Placement installation guide  https://review.opendev.org/72693619:35
*** brinzhang has quit IRC19:35
*** factor has joined #openstack-nova19:36
*** jmlowe has quit IRC19:42
markguz_for anyone that's interested setting neutron.ml2_port_bindings.vif_type to "ovs" in the database fixed this for me19:42
sean-k-mooneymarkguz_: if you get that error its because neutron failed to bind the port19:43
markguz_i know19:43
sean-k-mooneywhich normally means there was an error on cthe compute node19:43
markguz_but once it happens it seems next to impossible to fix it via the normal methods.19:44
sean-k-mooneymarkguz_: you fix it by setting the host filed to "" or "none" then back to the original hostname19:44
markguz_the instance will not boot due the "binding_failed" being written into the vif_type field19:45
markguz_sean-k-mooney: or by updating that field i just mentioned in the neutron db19:45
sean-k-mooneyyes but you can do "openstack --os-cloud=admin port set --host none baf2b165-797b-4305-bc6b-5b63250b890d" follow by " openstack --os-cloud=admin port set --host workstation baf2b165-797b-4305-bc6b-5b63250b890d"19:46
sean-k-mooneyto do it from the api without db hacking19:46
sean-k-mooneyit will actully cause port binding to happen properly recalualting the correct values19:46
*** jmlowe has joined #openstack-nova19:46
markguz_sean-k-mooney: ok. thanks for that19:46
sean-k-mooneyso i did that yesterday becauses i was swaping form the iptables firewall dirver to the ovs one19:47
markguz_this happened to me when i did a non-live migrate of a shutdown instance to a new host19:48
sean-k-mooneyhum it should not happen in that case19:48
sean-k-mooneysomething obviosly went wrong there should be an error in the neutron server log19:49
markguz_yeah. haven't had time to deep dive. Was focussed on getting the instance back online19:49
sean-k-mooneyist the instance still in resize_verify or did this happen after that point19:50
sean-k-mooneyif you had not confirmed the migrate/resize then you could have reverted19:50
sean-k-mooneyif you had then ya db edit or unest and reset the host to rebind the port then hard reboot19:51
*** ttsiouts has quit IRC20:26
*** nweinber has quit IRC20:42
*** dpawlik has quit IRC20:46
*** xek has quit IRC20:59
*** awalende has quit IRC21:05
*** awalende has joined #openstack-nova21:06
*** jangutter_ has quit IRC21:09
*** awalende has quit IRC21:10
*** raildo has quit IRC21:17
*** raildo has joined #openstack-nova21:17
*** raildo has quit IRC21:39
*** brinzhang has joined #openstack-nova21:51
*** brinzhang_ has quit IRC21:54
*** mgariepy has joined #openstack-nova22:00
*** slaweq has quit IRC22:08
*** slaweq has joined #openstack-nova22:09
*** slaweq has quit IRC22:13
*** KeithMnemonic has joined #openstack-nova22:14
*** tkajinam has joined #openstack-nova22:55
*** tosky has quit IRC22:57
*** markguz_ has quit IRC23:47
*** kevinz has quit IRC23:52

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!