*** markvoelker has joined #openstack-nova | 00:11 | |
*** markvoelker has quit IRC | 00:15 | |
*** bnemec has joined #openstack-nova | 00:16 | |
*** markvoelker has joined #openstack-nova | 00:26 | |
*** bnemec has quit IRC | 00:35 | |
*** gyee has quit IRC | 00:41 | |
*** bbowen__ has quit IRC | 00:48 | |
*** bbowen__ has joined #openstack-nova | 00:48 | |
*** spsurya has joined #openstack-nova | 00:59 | |
*** hongbin has joined #openstack-nova | 01:04 | |
*** ricolin has joined #openstack-nova | 01:05 | |
openstackgerrit | sean mooney proposed openstack/nova stable/pike: lxc: make use of filter python3 compatible https://review.opendev.org/676502 | 01:15 |
---|---|---|
*** dtruong has quit IRC | 01:24 | |
*** dtruong has joined #openstack-nova | 01:25 | |
*** igordc has quit IRC | 01:35 | |
*** boxiang has joined #openstack-nova | 01:41 | |
openstackgerrit | Takashi NATSUME proposed openstack/nova master: Fix missing rule description in HACKING.rst https://review.opendev.org/677589 | 01:43 |
*** liuyulong has quit IRC | 01:51 | |
*** igordc has joined #openstack-nova | 01:58 | |
*** sapd1_x has joined #openstack-nova | 02:07 | |
openstackgerrit | Merged openstack/nova master: [Trivial]Remove unused helper filter_and_format_resource_metadata https://review.opendev.org/676038 | 02:21 |
*** sapd1_x has quit IRC | 02:26 | |
*** sapd1_x has joined #openstack-nova | 02:28 | |
*** igordc has quit IRC | 02:49 | |
*** mkrai_ has joined #openstack-nova | 02:52 | |
openstackgerrit | Takashi NATSUME proposed openstack/nova master: Add a hacking rule for non-existent assertions https://review.opendev.org/677596 | 03:00 |
*** BjoernT has joined #openstack-nova | 03:12 | |
*** sapd1_x has quit IRC | 03:15 | |
*** BjoernT has quit IRC | 03:17 | |
*** BjoernT has joined #openstack-nova | 03:17 | |
openstackgerrit | Boxiang Zhu proposed openstack/nova master: Fix live migration break group policy simultaneously https://review.opendev.org/651969 | 03:26 |
*** igordc has joined #openstack-nova | 03:33 | |
*** psachin has joined #openstack-nova | 03:33 | |
*** janki has joined #openstack-nova | 03:41 | |
*** mkrai_ has quit IRC | 04:00 | |
*** hongbin has quit IRC | 04:06 | |
*** mkrai has joined #openstack-nova | 04:30 | |
*** mkrai has quit IRC | 04:30 | |
*** jaosorior has quit IRC | 04:34 | |
*** mkrai has joined #openstack-nova | 04:34 | |
*** tetsuro has joined #openstack-nova | 04:42 | |
openstackgerrit | melanie witt proposed openstack/nova master: Verify archive_deleted_rows --all-cells in post test hook https://review.opendev.org/672840 | 04:44 |
openstackgerrit | melanie witt proposed openstack/nova master: Verify archive_deleted_rows --all-cells in post test hook https://review.opendev.org/672840 | 04:44 |
*** ratailor has joined #openstack-nova | 05:02 | |
*** BjoernT has quit IRC | 05:06 | |
*** udesale has joined #openstack-nova | 05:10 | |
*** tetsuro has quit IRC | 05:12 | |
*** BjoernT has joined #openstack-nova | 05:12 | |
*** udesale has quit IRC | 05:14 | |
*** tetsuro has joined #openstack-nova | 05:18 | |
openstackgerrit | Takashi NATSUME proposed openstack/nova master: Add a hacking rule for non-existent assertions https://review.opendev.org/677596 | 05:23 |
*** BjoernT has quit IRC | 05:29 | |
*** sapd1_x has joined #openstack-nova | 05:33 | |
*** tetsuro has quit IRC | 05:33 | |
*** ash2307 has left #openstack-nova | 05:42 | |
*** sridharg has joined #openstack-nova | 05:42 | |
openstackgerrit | Merged openstack/nova master: doc: Trivial fixes to API version history https://review.opendev.org/649561 | 05:46 |
openstackgerrit | Takashi NATSUME proposed openstack/nova master: Add a hacking rule for useless assertions https://review.opendev.org/677610 | 05:46 |
*** jaosorior has joined #openstack-nova | 05:49 | |
openstackgerrit | Merged openstack/nova master: Add blocker migration for completing services.uuid migration https://review.opendev.org/673620 | 05:50 |
*** sapd1_x has quit IRC | 06:04 | |
*** dpawlik has joined #openstack-nova | 06:20 | |
*** jaosorior has quit IRC | 06:21 | |
*** igordc has quit IRC | 06:30 | |
*** tetsuro has joined #openstack-nova | 06:47 | |
*** tetsuro has quit IRC | 06:48 | |
*** tetsuro has joined #openstack-nova | 06:48 | |
*** tetsuro has quit IRC | 06:50 | |
*** tetsuro has joined #openstack-nova | 06:50 | |
*** sridharg has quit IRC | 06:52 | |
*** aojea has joined #openstack-nova | 07:01 | |
*** brinzhang_ has joined #openstack-nova | 07:01 | |
alex_xu | efried: sean-k-mooney dansmith, here is the refined version about using db persistent https://etherpad.openstack.org/p/vpmems-non-virt-driver-specific-new. I also sent email to summary the pros/cons of two ways http://lists.openstack.org/pipermail/openstack-discuss/2019-August/008625.html | 07:02 |
*** jaosorior has joined #openstack-nova | 07:04 | |
*** tetsuro has quit IRC | 07:04 | |
*** brinzhang has quit IRC | 07:05 | |
*** sridharg has joined #openstack-nova | 07:05 | |
*** brinzhang has joined #openstack-nova | 07:05 | |
*** brinzhang_ has quit IRC | 07:06 | |
*** trident has quit IRC | 07:10 | |
*** maciejjozefczyk has joined #openstack-nova | 07:12 | |
*** sridharg has quit IRC | 07:12 | |
*** rcernin has quit IRC | 07:14 | |
*** boxiang has quit IRC | 07:14 | |
*** boxiang has joined #openstack-nova | 07:14 | |
*** trident has joined #openstack-nova | 07:17 | |
*** udesale has joined #openstack-nova | 07:24 | |
*** sapd1_x has joined #openstack-nova | 07:25 | |
*** xek has joined #openstack-nova | 07:28 | |
*** ivve has joined #openstack-nova | 07:39 | |
*** rpittau|afk is now known as rpittau | 07:40 | |
*** bhagyashris has joined #openstack-nova | 07:50 | |
*** ralonsoh has joined #openstack-nova | 07:52 | |
*** zbr is now known as zbr|ooo | 07:56 | |
*** takashin has left #openstack-nova | 08:00 | |
*** jaosorior has quit IRC | 08:00 | |
openstackgerrit | Brin Zhang proposed openstack/nova master: Specify availability_zone to unshelve https://review.opendev.org/663851 | 08:07 |
*** boxiang has quit IRC | 08:17 | |
*** boxiang has joined #openstack-nova | 08:17 | |
*** mkrai has quit IRC | 08:25 | |
*** mkrai has joined #openstack-nova | 08:25 | |
*** boxiang has quit IRC | 08:28 | |
*** boxiang has joined #openstack-nova | 08:28 | |
*** tkajinam has quit IRC | 08:29 | |
*** shilpasd has joined #openstack-nova | 08:33 | |
*** ociuhandu has joined #openstack-nova | 08:35 | |
*** tetsuro has joined #openstack-nova | 08:46 | |
*** elod_off is now known as elod | 08:47 | |
*** tetsuro has quit IRC | 08:51 | |
*** mkrai has quit IRC | 08:53 | |
openstackgerrit | Ghanshyam Mann proposed openstack/python-novaclient master: Microversion 2.77: Support Specifying AZ to unshelve https://review.opendev.org/665136 | 09:00 |
*** Alon_KS has joined #openstack-nova | 09:18 | |
*** jaosorior has joined #openstack-nova | 09:19 | |
*** jawad_axd has joined #openstack-nova | 09:20 | |
*** Conqueror has joined #openstack-nova | 09:22 | |
*** derekh has joined #openstack-nova | 09:26 | |
*** mkrai has joined #openstack-nova | 09:28 | |
*** shilpasd has quit IRC | 09:32 | |
*** sapd1_x has quit IRC | 09:53 | |
openstackgerrit | Merged openstack/nova master: [Trivial]Remove unused helper _get_instances_by_filters https://review.opendev.org/676036 | 09:56 |
*** klindgren_ has quit IRC | 09:59 | |
*** klindgren has joined #openstack-nova | 09:59 | |
*** rpittau is now known as rpittau|bbl | 10:14 | |
*** shilpasd has joined #openstack-nova | 10:28 | |
*** bhagyashris has quit IRC | 10:31 | |
aspiers | alex_xu: I will have another PS for you very soon | 10:34 |
aspiers | alex_xu: I'm adding the missing tests | 10:34 |
*** jchhatbar has joined #openstack-nova | 10:47 | |
*** janki has quit IRC | 10:49 | |
*** shilpasd has quit IRC | 10:49 | |
*** ociuhandu has quit IRC | 10:49 | |
*** hemna has quit IRC | 10:54 | |
*** owalsh is now known as owalsh|away | 11:08 | |
openstackgerrit | Merged openstack/nova master: Libvirt: add support for vPMU configuration. https://review.opendev.org/671338 | 11:15 |
*** ociuhandu has joined #openstack-nova | 11:20 | |
*** ociuhandu has quit IRC | 11:25 | |
*** udesale has quit IRC | 11:26 | |
*** nnsingh has joined #openstack-nova | 11:27 | |
*** udesale has joined #openstack-nova | 11:27 | |
*** nnsingh has left #openstack-nova | 11:27 | |
*** yikun has joined #openstack-nova | 11:28 | |
*** tbachman has quit IRC | 11:42 | |
*** shilpasd has joined #openstack-nova | 11:42 | |
alex_xu | aspiers: thanks a lot | 11:51 |
*** mdbooth has joined #openstack-nova | 11:52 | |
aspiers | alex_xu: just finished now, submitting ... | 11:52 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Split fake host capabilities into reusable variables https://review.opendev.org/677710 | 11:52 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Track libvirt host/domain capabilities for multiple machine types https://review.opendev.org/673151 | 11:53 |
mdbooth | Heads up: the libvirt tpool proxy doesn't work reliably: https://bugs.launchpad.net/nova/+bug/1840912 Potentially the cause of a bunch of interesting behaviours | 11:53 |
openstack | Launchpad bug 1840912 in OpenStack Compute (nova) "libvirt calls aren't reliably using tpool.Proxy" [Undecided,New] | 11:53 |
openstackgerrit | Brin Zhang proposed openstack/nova master: Specify availability_zone to unshelve https://review.opendev.org/663851 | 11:53 |
alex_xu | aspiers: I will try another around tomorrow | 11:55 |
aspiers | alex_xu: thanks. I just updated the test comment and added some simple tests to test_host.py | 11:56 |
aspiers | alex_xu: so it should be easy to review | 11:56 |
alex_xu | aspiers: cool | 11:58 |
*** udesale has quit IRC | 12:02 | |
*** rpittau|bbl is now known as rpittau | 12:10 | |
openstackgerrit | Stephen Finucane proposed openstack/nova master: docs: Rewrite host aggregate, availability zone docs https://review.opendev.org/667133 | 12:27 |
stephenfin | alex_xu: Think you could send this on its way? https://review.opendev.org/#/c/675570/ | 12:28 |
*** udesale has joined #openstack-nova | 12:32 | |
*** tbachman has joined #openstack-nova | 12:33 | |
openstackgerrit | Adam Spiers proposed openstack/nova master: Split fake host capabilities into reusable variables https://review.opendev.org/677710 | 12:36 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Track libvirt host/domain capabilities for multiple machine types https://review.opendev.org/673151 | 12:36 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Extract new base class for provider usage functional tests https://review.opendev.org/676964 | 12:37 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Provide HW_CPU_X86_AMD_SEV trait when SEV is supported https://review.opendev.org/638680 | 12:37 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Add extra spec parameter and image property for memory encryption https://review.opendev.org/664420 | 12:37 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Extract SEV-specific bits on host detection https://review.opendev.org/636334 | 12:37 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Add <launchSecurity> and <driver iommu='on' /> to config.py https://review.opendev.org/636318 | 12:37 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Apply SEV-specific guest config when SEV is required https://review.opendev.org/644565 | 12:38 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Enable booting of libvirt guests with AMD SEV memory encryption https://review.opendev.org/666616 | 12:38 |
*** jchhatba_ has joined #openstack-nova | 12:39 | |
shilpasd | stephenfin: Hi, small doubt, at https://github.com/openstack/nova/blob/master/nova/api/openstack/compute/views/versions.py#L43 | 12:40 |
shilpasd | for multiple choices reply, links getting printed | 12:41 |
shilpasd | and if we given wrong request path, that getting added to response | 12:41 |
stephenfin | shilpasd: Could you give me an example? | 12:41 |
stephenfin | Expected vs. actual | 12:41 |
shilpasd | actual: "links": [{"href": "http://10.232.48.206/compute/v2.1/compute/v4.1/servers/detail", "rel": "self"}] | 12:42 |
*** jchhatbar has quit IRC | 12:42 | |
shilpasd | expected: "links": [{"href": "http://10.232.48.206/compute/v2/servers/detail", "rel": "self"}] | 12:43 |
*** jchhatba_ has quit IRC | 12:43 | |
sean-k-mooney | its reading v4.1 | 12:43 |
sean-k-mooney | as the server uuid | 12:43 |
sean-k-mooney | actully no its not | 12:43 |
shilpasd | i have purposfully given v4.1 in req path | 12:44 |
sean-k-mooney | ya i guessed that | 12:44 |
sean-k-mooney | compute/v4.1/servers/detail was what you used | 12:45 |
shilpasd | curl -g -i -X GET http://<host_ip>/compute/v4.1/servers/detail -H "Accept: application/json" -H "OpenStack-API-Version: compute 2.74 <AUTH_TOKEN>" -H "X-OpenStack-Nova-API-Version: 2.74" | 12:45 |
shilpasd | yes | 12:45 |
shilpasd | sean-k-mooney: yes | 12:45 |
aspiers | Can anyone comment on whether six.iteritems() should be avoided in nova? Currently our hacking extensions allow it despite http://lists.openstack.org/pipermail/openstack-dev/2015-June/066391.html | 12:46 |
sean-k-mooney | well we proably should be validaing the links if nothing else | 12:46 |
shilpasd | i referred https://docs.openstack.org/api-guide/compute/versions.html | 12:46 |
stephenfin | I'm trying to figure out where we parse the URL, but it sounds like a legit bug | 12:46 |
stephenfin | I assume you meant to request 'http://<host_ip>/compute/v2.1/servers/detail' ? | 12:46 |
*** ociuhandu has joined #openstack-nova | 12:46 | |
*** ociuhandu has quit IRC | 12:47 | |
stephenfin | I'd ask cdent but he's not around | 12:47 |
shilpasd | stephenfin: yes | 12:47 |
*** ociuhandu has joined #openstack-nova | 12:47 | |
shilpasd | but purposefully requested 'http://<host_ip>/compute/v4.1/servers/detail' | 12:47 |
sean-k-mooney | aspiers: am i dont htink the time it examples in that post are good tests | 12:48 |
aspiers | sean-k-mooney: read the whole thread | 12:48 |
aspiers | also currently there are only two uses of six.iteritems in the whole of nova | 12:48 |
sean-k-mooney | we have removed som uses of it recently | 12:49 |
sean-k-mooney | personally given we intend to kill python 3 support in 2 months i could live with items at this point | 12:49 |
stephenfin | aspiers: Personally, I'd just avoid it unless you know you need it | 12:50 |
aspiers | stephenfin: yes, I've just removed it from my patches | 12:50 |
sean-k-mooney | but as a general preference i dont think we should write slow code by default | 12:50 |
aspiers | stephenfin: but I'm wondering whether we should add a hacking check | 12:50 |
sean-k-mooney | aspiers: i think that is overkill | 12:50 |
stephenfin | We can, but we have less than three months of needing to care about Python 2.7 left | 12:51 |
stephenfin | so yeah, overkill | 12:51 |
sean-k-mooney | maybe in Ussuri | 12:51 |
aspiers | ok | 12:51 |
aspiers | sean-k-mooney: in other news, https://review.opendev.org/#/c/677710 | 12:51 |
sean-k-mooney | e.g. in ussuri we should not be writing new code that uses six at all | 12:51 |
stephenfin | sean-k-mooney: Within reason. I'll take readable code over slightly faster code in 90% of cases | 12:51 |
stephenfin | Yeah, if anything we could start stripping out six | 12:51 |
stephenfin | where it made sense, that is | 12:52 |
sean-k-mooney | stephenfin: sure but it adds up. if the perfomace is close readablity win if it way more readabel but way slower we should not use it in performance critical parts of nova or any project | 12:52 |
aspiers | sean-k-mooney: need you to check https://review.opendev.org/#/c/673151/13..15/nova/tests/unit/virt/libvirt/test_host.py | 12:53 |
sean-k-mooney | aspiers: ah your adding the fake data for capablites too. i had that in my orginial version but then dropped it as it was not needed | 12:53 |
aspiers | sean-k-mooney: not adding, more like moving and updating | 12:54 |
sean-k-mooney | that looks like a potentally good approch to take | 12:54 |
sean-k-mooney | refactoring :) | 12:54 |
*** eharney has joined #openstack-nova | 12:55 | |
sean-k-mooney | im in two minds about your test__get_machine_types fucntion | 12:56 |
sean-k-mooney | i dont like use hardcoding data like that in tests that are that coupled but i guess its ok | 12:56 |
sean-k-mooney | i guess its fine since it matching the fixture but it would be nic if you could get teh expected arch/machine types form the fixture | 12:57 |
sean-k-mooney | ill think about it over lunch but ill proably +1 goes to actully have lunch now | 12:58 |
aspiers | sean-k-mooney: thanks :) | 12:59 |
*** lbragstad has joined #openstack-nova | 13:00 | |
*** ociuhandu has quit IRC | 13:00 | |
*** davee_ has joined #openstack-nova | 13:05 | |
*** nweinber__ has joined #openstack-nova | 13:14 | |
mdbooth | dansmith: I've got a much better feeling about this one: https://bugs.launchpad.net/nova/+bug/1840912 | 13:17 |
openstack | Launchpad bug 1840912 in OpenStack Compute (nova) "libvirt calls aren't reliably using tpool.Proxy" [Undecided,New] | 13:17 |
*** davee_ has quit IRC | 13:18 | |
*** davee_ has joined #openstack-nova | 13:19 | |
*** dulek has joined #openstack-nova | 13:20 | |
dulek | Hi! This is probably trivial, but I see this in n-cpu logs on a fresh DevStack instance: `libvirtError: Cannot access storage file '/opt/stack/data/nova/instances/403824ef-422f-4e23-9e3a-d897766aa70d/disk' (as uid:107, gid:107)`. Any ideas how to get rid of that? | 13:20 |
*** tesseract has joined #openstack-nova | 13:22 | |
*** tesseract has quit IRC | 13:22 | |
dansmith | mdbooth: okay, have to do something first but then I'll look | 13:29 |
mdbooth | dansmith: That's just the bug description, btw. Patch incoming. Patch might touch a couple of testing weird spots, so I'm expecting discussion/rework. | 13:30 |
*** brinzhang_ has joined #openstack-nova | 13:31 | |
dansmith | mdbooth: I'll be looking for why this wasn't a thing in the first eight years of its life, of course :) | 13:33 |
mdbooth | dansmith: I think it was ;) I suspect there's a point 2 to this bug which is why libvirt is being slow enough for this customer to trigger it. | 13:33 |
mdbooth | However, we tested it and it's definitely not using the proxy | 13:33 |
*** brinzhang has quit IRC | 13:34 | |
*** ratailor has quit IRC | 13:36 | |
*** munimeha1 has joined #openstack-nova | 13:39 | |
*** shilpasd has quit IRC | 13:39 | |
*** mriedem has joined #openstack-nova | 13:39 | |
*** eharney has quit IRC | 13:40 | |
dansmith | mdbooth: ah, I see, looks like a good lead | 13:42 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Make memtune parameters consistent with libvirt docs and code https://review.opendev.org/636301 | 13:46 |
*** zbr|ooo is now known as zbr | 13:47 | |
*** udesale has quit IRC | 13:47 | |
*** boxiang has quit IRC | 13:48 | |
*** jawad_axd has quit IRC | 13:48 | |
*** boxiang has joined #openstack-nova | 13:49 | |
*** jawad_axd has joined #openstack-nova | 13:49 | |
*** jawad_axd has quit IRC | 13:49 | |
*** BjoernT_ has joined #openstack-nova | 13:50 | |
*** jawad_axd has joined #openstack-nova | 13:50 | |
*** jawad_ax_ has joined #openstack-nova | 13:52 | |
openstackgerrit | Adam Spiers proposed openstack/nova master: Track libvirt host/domain capabilities for multiple machine types https://review.opendev.org/673151 | 13:53 |
*** jawad_axd has quit IRC | 13:55 | |
*** bnemec has joined #openstack-nova | 13:55 | |
*** jawad_ax_ has quit IRC | 13:56 | |
openstackgerrit | Matthew Booth proposed openstack/nova master: libvirt: Fix service-wide pauses caused by un-proxied libvirt calls https://review.opendev.org/677736 | 13:57 |
*** pvradu has joined #openstack-nova | 14:00 | |
*** pvradu has quit IRC | 14:05 | |
*** shilpasd has joined #openstack-nova | 14:10 | |
openstackgerrit | Adam Spiers proposed openstack/nova master: Extract new base class for provider usage functional tests https://review.opendev.org/676964 | 14:10 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Provide HW_CPU_X86_AMD_SEV trait when SEV is supported https://review.opendev.org/638680 | 14:10 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Add extra spec parameter and image property for memory encryption https://review.opendev.org/664420 | 14:10 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Extract SEV-specific bits on host detection https://review.opendev.org/636334 | 14:10 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Add <launchSecurity> and <driver iommu='on' /> to config.py https://review.opendev.org/636318 | 14:11 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Apply SEV-specific guest config when SEV is required https://review.opendev.org/644565 | 14:11 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Enable booting of libvirt guests with AMD SEV memory encryption https://review.opendev.org/666616 | 14:11 |
*** gregwork has quit IRC | 14:14 | |
*** mrhillsman has quit IRC | 14:14 | |
*** portdirect has quit IRC | 14:14 | |
*** coreycb has quit IRC | 14:14 | |
*** eharney has joined #openstack-nova | 14:15 | |
*** kmalloc has quit IRC | 14:15 | |
*** mrhillsman has joined #openstack-nova | 14:15 | |
*** jmlowe has quit IRC | 14:16 | |
*** mordred has quit IRC | 14:16 | |
*** jbryce has quit IRC | 14:16 | |
*** mgagne has quit IRC | 14:16 | |
*** csatari has quit IRC | 14:16 | |
*** logan- has quit IRC | 14:16 | |
*** arne_wiebalck has quit IRC | 14:16 | |
*** irclogbot_3 has quit IRC | 14:17 | |
*** logan_ has joined #openstack-nova | 14:17 | |
*** mgagne has joined #openstack-nova | 14:17 | |
*** ebbex has quit IRC | 14:18 | |
*** irclogbot_2 has joined #openstack-nova | 14:18 | |
*** ebbex has joined #openstack-nova | 14:18 | |
*** portdirect has joined #openstack-nova | 14:18 | |
*** logan_ is now known as logan- | 14:18 | |
*** coreycb has joined #openstack-nova | 14:18 | |
*** gregwork has joined #openstack-nova | 14:18 | |
*** jmlowe has joined #openstack-nova | 14:19 | |
*** csatari has joined #openstack-nova | 14:19 | |
*** kmalloc has joined #openstack-nova | 14:19 | |
*** jbryce has joined #openstack-nova | 14:19 | |
*** arne_wiebalck has joined #openstack-nova | 14:19 | |
*** Jeffrey4l_ has quit IRC | 14:19 | |
openstackgerrit | Matthew Booth proposed openstack/nova master: libvirt: Fix service-wide pauses caused by un-proxied libvirt calls https://review.opendev.org/677736 | 14:20 |
*** Jeffrey4l has joined #openstack-nova | 14:21 | |
*** mordred has joined #openstack-nova | 14:26 | |
*** mdbooth has quit IRC | 14:31 | |
*** mdbooth has joined #openstack-nova | 14:35 | |
*** pvradu has joined #openstack-nova | 14:36 | |
openstackgerrit | Adam Spiers proposed openstack/nova master: libvirt: Parse the 'os' element from domainCapabilities https://review.opendev.org/673790 | 14:36 |
*** ociuhandu has joined #openstack-nova | 14:39 | |
mriedem | mdbooth: can we avoid the libvirt-python test dependency? i think that will mean local dev needs to install a native package, from what i remember years ago anyway | 14:39 |
mdbooth | mriedem: Yeah, I threw that in there to have a starting point, tbh | 14:40 |
mriedem | and it was a source of problems before too | 14:40 |
*** davee_ has quit IRC | 14:40 | |
mriedem | ok | 14:40 |
mdbooth | mriedem: I mostly wanted to get a full tempest run against it. | 14:40 |
mriedem | so WIP it? | 14:40 |
mdbooth | mriedem: Will do. | 14:40 |
mriedem | thanks | 14:41 |
*** jaosorior has quit IRC | 14:41 | |
mdbooth | mriedem: I really want eyes on it, though :) Also, ideas for useful testing without using libvirt. | 14:41 |
mdbooth | Might look at something using fakelibvirt | 14:42 |
mriedem | i assumed fakelibvirt | 14:42 |
mriedem | since that's what we always use | 14:42 |
sean-k-mooney | mdbooth: you porbaly could mock the call to inspect | 14:42 |
sean-k-mooney | and pass in a set of different moduels for it to use and assert they were wrapped | 14:42 |
mdbooth | sean-k-mooney: The test I really want to do is that Host.list_instance_domains() returns <list of tpool.Proxy(virDomain)> | 14:43 |
*** hemna has joined #openstack-nova | 14:43 | |
mdbooth | If I can get that, I'll be confident that a bunch of things are lined up | 14:43 |
sean-k-mooney | i think that would be doable with out the really libvirt module | 14:44 |
mdbooth | I think so | 14:44 |
*** tbachman has quit IRC | 14:44 | |
mriedem | were you going to open an issue against eventlet for the not-handling-list thing? | 14:44 |
sean-k-mooney | we might have to asser it returns <list of tpool.Proxy(mock)> | 14:44 |
*** mlavalle has joined #openstack-nova | 14:45 | |
sean-k-mooney | or fakelibvirt.virDomain but we should be able to cover the behaivor i think | 14:45 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: Remove 'hw:cpu_policy', 'hw:mem_page_size' extra specs from API samples https://review.opendev.org/675338 | 14:48 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: scheduler: Flatten 'ResourceRequest.from_extra_specs', 'from_image_props' https://review.opendev.org/674894 | 14:48 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: Add support for translating CPU policy extra specs, image meta https://review.opendev.org/671801 | 14:48 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: libvirt: Fold in argument to '_update_provider_tree_for_vgpu' https://review.opendev.org/676729 | 14:48 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: Add reshaper for PCPU https://review.opendev.org/674895 | 14:48 |
*** dtantsur has joined #openstack-nova | 14:48 | |
*** ratailor has joined #openstack-nova | 14:49 | |
dtantsur | hey folks. TheJulia and I might be seeing a race condition during ironic node take over. | 14:49 |
dtantsur | it seems that when the hash ring rebalances, the old n-cpu may clean up the node records, including the resource provide in placement | 14:50 |
sean-k-mooney | node takeover e.g. when a node is moved to a different ironic compute service | 14:50 |
dtantsur | yep | 14:50 |
*** my_nd has joined #openstack-nova | 14:50 | |
sean-k-mooney | is this for all node or just ones in mantaince mode | 14:50 |
dtantsur | it was for a normal node | 14:50 |
dtantsur | apparently under some condition the new (receiving) n-cpu may not re-create the resource provider for the node it took over | 14:51 |
sean-k-mooney | dtantsur: ok mnaser noted there is an issue with nodes in mainance mode being deleted | 14:51 |
*** pvradu has quit IRC | 14:51 | |
dtantsur | I think it was fixed by mriedem recently | 14:51 |
sean-k-mooney | proably i just head about it a day or two ago | 14:52 |
*** lbragstad has quit IRC | 14:52 | |
mnaser | sean-k-mooney, dtantsur, TheJulia: https://review.opendev.org/#/c/675704/ is the stack you might want to look at | 14:52 |
openstackgerrit | Matt Riedemann proposed openstack/nova master: Remove deprecated [neutron]/url option https://review.opendev.org/677755 | 14:52 |
sean-k-mooney | dtantsur: its possible the reciving node check if the RP exists pefore teh souce node deletes it | 14:52 |
dtantsur | yep, that's my guess | 14:53 |
dtantsur | relevant log lines: https://etherpad.openstack.org/p/zCQ8vPcGX6 | 14:53 |
* dtantsur pokes jroll | 14:53 | |
sean-k-mooney | the first patch looks relevent https://review.opendev.org/#/c/675704/ | 14:53 |
jroll | ? | 14:54 |
mriedem | mordred: efried: i'm removing the old deprecated [neutron]/url option in https://review.opendev.org/677755 and i noticed that the neutron install guide still uses it: https://bugs.launchpad.net/neutron/+bug/1840930 - as i mentioned in that neutron bug, we could just replace with endpoint_override but i don't think that's what we want, right? | 14:54 |
openstack | Launchpad bug 1840930 in neutron "Networking service in neutron - install guide says to configure nova with [neutron]/url which is deprecated" [Undecided,New] | 14:54 |
mriedem | mordred: efried: we want to avoid endpoint_override and just have config for letting KSA do it's thing with the service catalog | 14:54 |
dtantsur | jroll: see scrollback, potential race condition on take over | 14:54 |
dtantsur | you may know this code better than me | 14:54 |
TheJulia | So an important note from what dtantsur is mentioning is that we're seeing this in our grenade job, which means this is when stein is still running from what we see in the logs | 14:54 |
mriedem | stein doesn't have the fix yet | 14:55 |
mriedem | waiting for stable reviews | 14:55 |
mriedem | https://review.opendev.org/#/q/topic:bug/1839560+branch:stable/stein | 14:55 |
sean-k-mooney | TheJulia: is this only before we do the upgrade. | 14:55 |
jroll | there we go | 14:55 |
jroll | dtantsur: side question: does the new compute service re-add it on the following RT run? | 14:56 |
dtantsur | jroll: nope, never | 14:56 |
TheJulia | sean-k-mooney: well, we never actually get to perform the upgrade | 14:56 |
jroll | oh boy | 14:56 |
dtantsur | it becomes orphaned essentially | 14:56 |
efried | mriedem: Right; for people who just gotta specify, they can s/url/endpoint_override/, but it is better to just let the service catalog do its thing. | 14:56 |
mriedem | efried: yeah, i think the question is - for an install guide, we don't want to document the exceptional path | 14:56 |
sean-k-mooney | TheJulia: i was wondering if ye tried it manually but ya it would fail tempest and not upgrade so make sense | 14:56 |
mriedem | so i was just going to remove the url mention from the install guide | 14:56 |
*** mdbooth has quit IRC | 14:57 | |
mriedem | dtantsur: TheJulia: i'm curious, this has been a thing since rocky, so why are you hitting it now? | 14:57 |
mriedem | new tests? | 14:57 |
TheJulia | sean-k-mooney: we wouldn't even make it to tempest, there is a resource check where placement is consulted | 14:57 |
TheJulia | if we don't have the number of "baremetal" nodes, we eventually time out and fail | 14:57 |
*** davee_ has joined #openstack-nova | 14:57 | |
dtantsur | mriedem: I'm curious as well. The simplest explanation is that it's become more likely for some unrelated reason. | 14:57 |
mriedem | https://review.opendev.org/#/c/571535/ is since rocky i mean | 14:57 |
efried | mriedem: We should just be able to document "do this the same way you would do any service backed by the catalog". mordred might have a central document somewhere that recommends best practices for that. If we don't have that doc, we should. | 14:57 |
* dtantsur remembers how switching to pypy on one of his projects uncovered a few completely unexpected races because of how much faster it was | 14:58 | |
sean-k-mooney | TheJulia: well if we can recreate it reliably we proably should add a depens on against the stien backport and see if that resolves it | 14:58 |
TheJulia | We've seen some fun variations in performance so I guess any changes are going to cause us to find races more often.... | 14:58 |
mriedem | jroll: before the fix, the old compute will delete the comptue node and resource provider, but the compute node record is soft deleted; the new compute will try to create the compute node record with the same uuid - which has a unique constraint on it - and fail, so it won't re-create the resource provider | 14:58 |
TheJulia | sean-k-mooney: could work if it survives a couple rechecks... I think the overall job failure rate has been only 20-30%. | 14:59 |
*** mdbooth has joined #openstack-nova | 14:59 | |
mriedem | i don't know if this matters for y'all, but when the old compute deletes the compute node and resource provider, it will also delete any allocations for an instance on that node/provider, and the new compute won't re-create those allocations... | 15:00 |
sean-k-mooney | that is still high enough to be concerned | 15:00 |
mriedem | back in ocata/pike the new could RT would re-create the allocations | 15:00 |
mriedem | but that code was removed... | 15:00 |
dtantsur | ouch | 15:00 |
jroll | mriedem: yep, that makes sense. we're seeing something similar in ocata recently, but I can't decide if this is related (as the compute_nodes table is also affected) | 15:00 |
jroll | um, that allocations thing seems... really bad | 15:00 |
mriedem | jroll: the "re-use the node uuid for the compute node uuid" thing was only rocky | 15:00 |
jroll | mriedem: right, that's just why I was asking for more info | 15:01 |
sean-k-mooney | jroll: deleteing all the allocation is require to be able to delet eh placmeent RP | 15:01 |
mriedem | jroll: https://review.opendev.org/#/c/602481/ | 15:01 |
sean-k-mooney | if we dont do that then we cant delete the rp wehn we delete the compute service reliably | 15:02 |
jroll | sean-k-mooney: sure, but in an ironic world where we churn these "compute nodes" all the time, that seems scary | 15:02 |
mriedem | ^ removed the thing so that the ironic driver would always tell the RT to report the allocations | 15:02 |
jroll | maybe I'm misunderstanding how this works | 15:02 |
jroll | and I really don't have time to deep dive this part of nova today :( | 15:02 |
mriedem | since i never think if ironic node re-balancing stuff, and we don't test it (at least in nova) i guess i didn't think about this | 15:02 |
mriedem | i would also think that johnthetubaguy (who added this workaround i think) and stackhpc would have noticed a problem by now | 15:03 |
mriedem | but i don't know what version of openstack they are giving to customers | 15:03 |
mriedem | https://review.opendev.org/#/c/602481/ was stein | 15:03 |
sean-k-mooney | when we are rebalcne we are moving the compute nodes between compute service right. and in an ironic case we have mupliple compute nodes (1 per server) each with there one placment RP associates to a singel compute service correct | 15:04 |
sean-k-mooney | just making sure i have that right in my head | 15:04 |
mriedem | ironic node : compute node : instance = 1:1:1 | 15:04 |
jroll | correct, until the end - there is a placement RP for each ironic node / compute node | 15:04 |
mriedem | + : resource provider | 15:04 |
sean-k-mooney | jroll: yep that is want i ment | 15:05 |
sean-k-mooney | one placment RP per compute/ironic node | 15:05 |
jroll | cool, you are correct then | 15:05 |
sean-k-mooney | durign a reblance we are not actully delet ing he nova compute service we are just moveing the comnpute nodes so we really should not need to update placmeent | 15:06 |
sean-k-mooney | or rather delete the compute node/placmente RP | 15:06 |
*** priteau has joined #openstack-nova | 15:07 | |
mriedem | the compute manager doesn't know you're doing a re-balance | 15:07 |
sean-k-mooney | all we should need to do on the nova side is modify the service to compute node mapping but i dont know how we do that today | 15:07 |
jroll | except that a rebalance happens when the number of compute services changes. so if you shut down a compute service (which deletes compute nodes and RPs), ironic nodes get moved to another, and those compute nodes and RPs are re-created | 15:07 |
mriedem | it knows, "oh i was managing node x before, but now the driver says i'm not, so delete" | 15:07 |
mriedem | jroll: yeah and before rocky, those new compute nodes and RPs would all have a unique uuid | 15:08 |
mriedem | since rocky, they re-use the node uuid | 15:08 |
jroll | yeah | 15:08 |
mriedem | so i think with my recent fix that's all fine, | 15:09 |
mriedem | the question is what happens with allocations against the re-balanced node resource provider | 15:09 |
mriedem | b/c i'm guessing since stein those are gone | 15:09 |
sean-k-mooney | mriedem: right but im wonder that im not "managing because the drive said so, therefor delete is required" | 15:09 |
jroll | yeah, that's what worries me | 15:09 |
mriedem | i guess someone needs to do some testing | 15:09 |
sean-k-mooney | in a libvirt case we would normaly delete the compute service and have it delete the compute nodes under neat it | 15:10 |
mriedem | sean-k-mooney: the compute service doesn't delete itself, | 15:10 |
mriedem | that happens in the API | 15:10 |
mriedem | DELETE /os-services/{service_id} | 15:10 |
sean-k-mooney | well yes | 15:10 |
mriedem | which is related to a number of other fun issues | 15:10 |
* TheJulia reads back | 15:10 | |
sean-k-mooney | i know that but that is what i ment | 15:10 |
mriedem | which are in the ML | 15:10 |
sean-k-mooney | we externally trigger the deletion of the compute service when removing a libvir host | 15:11 |
sean-k-mooney | does vmware do something similar to ironic | 15:11 |
sean-k-mooney | they have multiple compute nodes in the same service too right | 15:11 |
mriedem | http://lists.openstack.org/pipermail/openstack-discuss/2019-June/007135.html | 15:11 |
mriedem | sean-k-mooney: no, vmware driver does not do that since kilo or liberty | 15:12 |
mriedem | TheJulia: tl;dr, i think you need https://review.opendev.org/#/q/topic:bug/1839560+branch:stable/stein for your grenade failures, | 15:13 |
mriedem | and there might be a regression due to https://review.opendev.org/#/c/602481/ where allocations in placement are lost during a re-balance | 15:14 |
mriedem | but someone needs to test that | 15:14 |
TheJulia | mriedem: sean-k-mooney: I added a depends-on flag to one of my test patches for the patches above, if I see the issue re-appear on that job, I'll let you all know | 15:14 |
mriedem | so if the lost allocations on re-balance is a real thing, i might have a solution, which i wrote down in a comment in https://review.opendev.org/#/c/602481/ | 15:18 |
mriedem | b/c the code that deletes the resource provider and it's allocations pre-dates the removal of the RT allocation healing code, and unsurprisingly we lost track of all those random one off moving parts | 15:19 |
mriedem | especially since we don't have functional testing of that stuff in tree | 15:19 |
*** ivve has quit IRC | 15:21 | |
*** gyee has joined #openstack-nova | 15:22 | |
*** pvradu has joined #openstack-nova | 15:25 | |
* efried <== doc | 15:33 | |
*** efried is now known as efried_afk | 15:33 | |
openstackgerrit | Adam Spiers proposed openstack/nova master: libvirt: Parse the 'os' element from domainCapabilities https://review.opendev.org/673790 | 15:37 |
gibi | mriedem: hi! I have no problem moving the small parts of the cross-cell resize series to the front. I can re-apply my +2 if you rebase the series | 15:39 |
mriedem | gibi: ok thanks | 15:39 |
openstackgerrit | Matt Riedemann proposed openstack/nova master: FUP for I5576fa2a67d2771614266022428b4a95487ab6d5 https://review.opendev.org/677769 | 15:46 |
*** tbachman has joined #openstack-nova | 15:50 | |
*** mkrai has quit IRC | 15:50 | |
*** mkrai has joined #openstack-nova | 15:53 | |
*** mgagne has quit IRC | 15:54 | |
*** mgagne has joined #openstack-nova | 15:55 | |
aspiers | anyone object to me splitting libvirt/test_driver.py into smaller pieces? at 23k LoC it's almost *double* the size of the next largest file in nova (compute/test_compute.py in case you're curious) | 15:55 |
aspiers | my editor really hates such a huge file, and so do I | 15:55 |
aspiers | efried_afk, sean-k-mooney: ^^^ | 15:56 |
*** igordc has joined #openstack-nova | 15:56 | |
aspiers | of course this will probably cause a few merge conflicts so maybe we need to time it right | 15:56 |
sean-k-mooney | aspiers: you mean it does not like 10s of 1000s or lines | 15:56 |
aspiers | sean-k-mooney: yes, syntax highlighting something that big is really expensive | 15:57 |
aspiers | unless your highlighting algorithm cuts corners | 15:57 |
*** pvradu has quit IRC | 15:58 | |
sean-k-mooney | pycharm does not mind too much | 15:58 |
sean-k-mooney | once it has built the index | 15:58 |
aspiers | Yikes this is a huge list https://review.opendev.org/#/q/project:openstack/nova+path:nova/tests/unit/virt/libvirt/test_driver.py+status:open+branch:master | 15:59 |
aspiers | Still, we could split stuff out in smaller steps to be less disruptive | 16:00 |
aspiers | I was thinking to move all the capabilities / domcaps tests out for a start | 16:00 |
aspiers | libvirt/driver.py is 10k LoC, that should also be split up | 16:01 |
*** pvradu has joined #openstack-nova | 16:02 | |
aspiers | anyway I'm done for today | 16:02 |
*** KeithMnemonic1 has joined #openstack-nova | 16:02 | |
aspiers | probably won't do much the rest of this week | 16:02 |
aspiers | since I've been doing way too much overtime recently | 16:02 |
*** KeithMnemonic has quit IRC | 16:03 | |
*** rpittau is now known as rpittau|afk | 16:05 | |
*** pvradu has quit IRC | 16:05 | |
*** ricolin has quit IRC | 16:05 | |
*** markvoelker has quit IRC | 16:08 | |
sean-k-mooney | aspiers: yes it shoudl but with the rate of change that is hard to do without a dedicated effort | 16:12 |
*** ratailor has quit IRC | 16:12 | |
sean-k-mooney | aspiers: and ya dont burn out. | 16:12 |
mriedem | efried_afk: did cern reach out to you about the deepcopy of the ProviderTree? i think it came up last week, but was reminded after reading https://techblog.web.cern.ch/techblog/post/nova-ironic-at-scale/ | 16:21 |
openstackgerrit | Matthew Booth proposed openstack/nova master: libvirt: Fix service-wide pauses caused by un-proxied libvirt calls https://review.opendev.org/677736 | 16:21 |
*** markvoelker has joined #openstack-nova | 16:22 | |
mriedem | aspiers: you mean moving driver.py tests out of test_driver.py? but leave the code in driver.py? that would be confusing to me, and becomes a mess for git history tracking | 16:22 |
mriedem | there is probably a ton of copy/paste patterns in libvirt driver tests that could be a simple cleanup to start with, rather than moving things *out* of the test_driver.py module | 16:23 |
*** damien_r has quit IRC | 16:35 | |
*** dtantsur is now known as dtantsur|afk | 16:37 | |
*** mdbooth has quit IRC | 16:37 | |
*** igordc has quit IRC | 16:41 | |
*** boxiang has quit IRC | 16:42 | |
*** spsurya has quit IRC | 16:43 | |
*** boxiang has joined #openstack-nova | 16:43 | |
*** davee_ has quit IRC | 16:45 | |
*** Garyx has quit IRC | 16:45 | |
*** xek has quit IRC | 16:47 | |
*** Garyx has joined #openstack-nova | 16:49 | |
*** davee_ has joined #openstack-nova | 16:54 | |
*** ivve has joined #openstack-nova | 16:56 | |
*** bbobrov has quit IRC | 17:00 | |
*** derekh has quit IRC | 17:00 | |
openstackgerrit | Matt Riedemann proposed openstack/nova master: WIP: Get instances outside of update_available_resource lock https://review.opendev.org/677790 | 17:02 |
*** psachin has quit IRC | 17:04 | |
*** mkrai has quit IRC | 17:19 | |
*** damien_r has joined #openstack-nova | 17:19 | |
*** priteau has quit IRC | 17:21 | |
*** lbragstad has joined #openstack-nova | 17:21 | |
openstackgerrit | Merged openstack/nova master: Move calls to ovs-vsctl to privsep. https://review.opendev.org/639282 | 17:29 |
openstackgerrit | Merged openstack/nova master: Move setting of device trust to privsep. https://review.opendev.org/639283 | 17:32 |
*** ociuhandu has quit IRC | 17:32 | |
*** mdbooth has joined #openstack-nova | 17:33 | |
*** nweinber_ has joined #openstack-nova | 17:35 | |
*** macz has joined #openstack-nova | 17:36 | |
*** nweinber__ has quit IRC | 17:37 | |
*** mdbooth has quit IRC | 17:38 | |
*** my_nd has quit IRC | 17:43 | |
efried_afk | aspiers: I love the idea of splitting huge files. sean-k-mooney, syntax highlighting isn't the problem so much as the inspection. Every change causes it to reinspect the file, which takes forever on monsters like test_libvirt. If I'm spending any amount of time there, I have to turn inspection off. | 17:44 |
*** damien_r has quit IRC | 17:45 | |
dansmith | efried_afk: without any other context or target articles, | 17:45 |
dansmith | that first sentence sounds like it belongs on your online dating profile | 17:46 |
dansmith | "I like long walks on the beach and splitting huge files" | 17:46 |
*** efried_afk is now known as dansmith1 | 17:46 | |
*** dansmith1 is now known as efried | 17:46 | |
sean-k-mooney | efried_afk: i have not had that problem but it does annoy me that thost files are so large | 17:47 |
efried | dansmith: It was a straight copy/paste | 17:47 |
dansmith | efried: I figured | 17:47 |
dansmith | Male seeking cybernetic female analog capable of introspection of long python files and/or automated splitting thereof | 17:47 |
efried | mriedem: I don't recall anything specific about a ProviderTree deepcopy... | 17:47 |
*** shilpasd has quit IRC | 17:48 | |
efried | oo, that's good dansmith. Clearly you have a lot of experience here. | 17:48 |
dansmith | my kink knows no bounds | 17:48 |
efried | dansmith: are you happy with https://review.opendev.org/#/c/671072/ series at this point? | 17:51 |
efried | gibi: also ^ | 17:52 |
dansmith | efried: I haven't circled back | 17:52 |
openstackgerrit | Eric Fried proposed openstack/nova master: Remove deprecated [neutron]/url option https://review.opendev.org/677755 | 17:56 |
aspiers | mriedem: I would happily move the corresponding code out from driver.py at the same time if that was feasible | 17:56 |
aspiers | I'm not sure why it would become a mess for git history tracking? even git blame has for a long time been able to figure out when stuff gets moved between files | 17:57 |
aspiers | efried: exactly | 17:57 |
mriedem | i'm not a fan of the tests for something in one module living in another module | 17:57 |
aspiers | mriedem: that said, pointers on stuff to deduplicate are welcome | 17:57 |
mriedem | and like i said, i think you can just refactor the duplication in test_driver.py for lots of tests to probably make it smaller | 17:58 |
aspiers | I think our messages crossed :) ^^^ | 17:58 |
mriedem | this isn't an exercise i'm the one that's wanting to take on, so that's up to you, | 17:58 |
mriedem | but i know there is a ton of just copy test foo and tweak 1-2 lines for test bar | 17:58 |
mriedem | i also have no problem with the thing in pycharm | 17:59 |
efried | aspiers: git history is an issue, unfortunately - afaik blame will show the new commit on the new file. You would have to back up to see the previous iteration, which is awkward. (I've run into this a number of times already in placement, and it's pretty debilitating.) | 18:00 |
mriedem | correct, it sucks | 18:00 |
mriedem | the mox removal series is a prime example | 18:00 |
mriedem | it also causes issues with merge conflicts for backports | 18:00 |
sean-k-mooney | well perhaps this is jsut somethin we need to be aware of and think should we be moving things out of the driver into modules used by the driver in the future | 18:00 |
aspiers | efried: no, blame really can show where it came from | 18:01 |
sean-k-mooney | aspiers: yes but git blame on github crashes on those files | 18:01 |
sean-k-mooney | so you have to do it locally | 18:01 |
aspiers | ? | 18:01 |
efried | what's the blame trick, without iterating backward across commits? | 18:01 |
aspiers | mriedem's point about backports is a much more convincing argument against | 18:01 |
aspiers | however, if our policy is "you can't move stuff around, ever" then we're in for a miserable future | 18:02 |
mriedem | i'm not against refactoring giant methods in the libvirt driver module, deduplicating test code in test_driver, etc. i just don't want to see random tests moving out of test_driver into some other module when the code being tested is still in driver.py | 18:02 |
aspiers | mriedem: sure, that's why I said <aspiers> mriedem: I would happily move the corresponding code out from driver.py at the same time if that was feasible | 18:03 |
mriedem | aspiers: no one said that was the policy | 18:03 |
sean-k-mooney | aspiers: try opening https://github.com/openstack/nova/blame/master/nova/virt/libvirt/driver.py | 18:03 |
*** igordc has joined #openstack-nova | 18:03 | |
aspiers | mriedem: ok good :) | 18:03 |
aspiers | was beginning to sound that way | 18:03 |
mriedem | as a nova troll i have a reputation to maintain | 18:04 |
aspiers | :) | 18:04 |
mriedem | as immovable object | 18:04 |
sean-k-mooney | i think we just dont want test to be move with out the code also moving and am not sure that patches that only move code are a good idea | 18:04 |
*** lbragstad_ has joined #openstack-nova | 18:04 | |
aspiers | efried: try git gui blame for instance | 18:04 |
*** lbragstad_ has quit IRC | 18:04 | |
aspiers | some tooltips include "Copied Or Moved Here By:" | 18:04 |
aspiers | can't remember if github or gitea do the same | 18:05 |
sean-k-mooney | aspiers: i think i makes sense to consider moving things if you are refacoritn the code already. | 18:05 |
aspiers | but local is way faster anyway IME | 18:05 |
aspiers | sean-k-mooney: definitely agree with that | 18:05 |
aspiers | well, it's an idea to bear in mind anyway | 18:06 |
aspiers | 23k LoC in one file does not seem sane to me ;-) | 18:06 |
sean-k-mooney | 23K of code to test 9.75K of code and we still miss things... | 18:07 |
sean-k-mooney | the driver is not that bad coverage wise but ther are other part that are less awsome... | 18:07 |
efried | aspiers: git gui appears to be... a GUI :( :( :( | 18:08 |
aspiers | efried: shocker! who'd have thought? | 18:08 |
efried | guess it would take some getting used to. | 18:08 |
sean-k-mooney | aspiers: that is not a standard git command is it a python extentions | 18:09 |
efried | anyway, I agree enormous files suck, and being stuck with them because of sucky reasons also sucks. | 18:09 |
efried | sean-k-mooney: apt install git-gui | 18:09 |
efried | that's if you're running ubuntu. | 18:09 |
efried | If you're running another distro... just shoot yourself and be done. | 18:09 |
sean-k-mooney | PopOS which is based on ubuntu yes | 18:10 |
aspiers | efried: watch your mouth! | 18:10 |
efried | (sorry, I had a bad experience with rhel a decade ago and still haven't recovered) | 18:10 |
aspiers | you ran RHEL as a desktop? eww | 18:10 |
sean-k-mooney | although i have a rhel8 system for the last while... not sure i will keep it but it currently works | 18:10 |
aspiers | Fedora is OK | 18:10 |
efried | yeah, that was my problem | 18:10 |
aspiers | try openSUSE :) | 18:10 |
efried | is that still a thing?? | 18:10 |
efried | Is it written in FORTRAN? | 18:10 |
* aspiers just about manages not to swear | 18:11 | |
sean-k-mooney | aspiers: i used tumbelweed for a while and mostly liked it | 18:11 |
aspiers | ok, good time to leave | 18:11 |
* aspiers goes for a run | 18:11 | |
sean-k-mooney | not sure i liked yast but it was nice to have up to date packages | 18:11 |
*** markvoelker has quit IRC | 18:16 | |
*** mchlumsky has joined #openstack-nova | 18:18 | |
*** markvoelker has joined #openstack-nova | 18:22 | |
*** markvoelker has quit IRC | 18:30 | |
*** ralonsoh has quit IRC | 18:35 | |
*** bbobrov has joined #openstack-nova | 18:35 | |
*** ralonsoh has joined #openstack-nova | 18:35 | |
mriedem | melwitt: a couple of things in the archive patch https://review.opendev.org/#/c/507486/ | 18:49 |
mriedem | and a couple of nits in the nova-next patch on top of that | 18:49 |
mriedem | both should be small | 18:50 |
*** nweinber_ has quit IRC | 18:50 | |
mriedem | melwitt: efried: i think we're ok to merge https://review.opendev.org/#/c/675218/6 now yeah? | 18:50 |
melwitt | thanks | 18:51 |
* efried reinstates +2 | 18:51 | |
melwitt | yeah I think all is well | 18:51 |
*** factor has quit IRC | 18:52 | |
mriedem | onto openstack resource provider inventory set --aggregate | 18:55 |
*** maciejjozefczyk has quit IRC | 18:56 | |
melwitt | heh | 18:57 |
*** mriedem has quit IRC | 18:58 | |
*** mriedem has joined #openstack-nova | 18:59 | |
*** ralonsoh has quit IRC | 19:05 | |
melwitt | I was panicking with the potential loss of my launchpad bug text after a 503 "TimeoutError" but found when I reloaded the page, it posted everything I had written | 19:10 |
melwitt | phhheeewwww | 19:10 |
mriedem | happens a lot, just have to keep refreshing | 19:13 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Add extra spec parameter and image property for memory encryption https://review.opendev.org/664420 | 19:17 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Extract SEV-specific bits on host detection https://review.opendev.org/636334 | 19:21 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Add <launchSecurity> and <driver iommu='on' /> to config.py https://review.opendev.org/636318 | 19:21 |
mriedem | melwitt: i think i have an idea for how to make one of multiple rps fail to set inventory in aggregate in https://review.opendev.org/#/c/640898/11 | 19:22 |
mriedem | add allocations to one of them | 19:22 |
openstackgerrit | melanie witt proposed openstack/nova master: Make a failure to purge_db fail in post_test_hook.sh https://review.opendev.org/677806 | 19:24 |
melwitt | mriedem: nice! thanks, I was stumped about that | 19:25 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Add extra spec parameter and image property for memory encryption https://review.opendev.org/664420 | 19:25 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Extract SEV-specific bits on host detection https://review.opendev.org/636334 | 19:26 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Add <launchSecurity> and <driver iommu='on' /> to config.py https://review.opendev.org/636318 | 19:26 |
*** nweinber_ has joined #openstack-nova | 19:27 | |
mriedem | melwitt: same, until about 5 minutes ago :) | 19:28 |
melwitt | \o/ | 19:29 |
mriedem | looking at the placement code i'm not totally sure if it handles the "drop inventory below what's allocated but don't remove the resource class" or if it's just "can't remove a resource class inventory that has allocations" - i see the latter in the placement code but not sure about the former, but i'm pretty sure it enforces that | 19:29 |
mriedem | efried: ^ right?! | 19:29 |
mriedem | hmm | 19:31 |
mriedem | https://review.opendev.org/#/c/640898/11 | 19:31 |
efried | mriedem: You can reduce inventory below what's allocated. You can't set total to zero (I'm 95% sure of that, would have to check). | 19:31 |
mriedem | ops | 19:31 |
mriedem | efried: not before 1.26 anyway.. | 19:31 |
mriedem | yeah so if your allocations exceed capacity it only logs a warning | 19:32 |
mriedem | https://github.com/openstack/placement/blob/master/placement/objects/resource_provider.py#L619 | 19:32 |
mriedem | if you try to remove a resource class entirely that has allocations it'll fail https://github.com/openstack/placement/blob/master/placement/objects/resource_provider.py#L84 | 19:32 |
mriedem | so melwitt i guess one of the providers has to have some resource class inventory and a consumer is using that rc, and then in set inventories you drop/omit that rc, | 19:33 |
mriedem | it could just be some CUSTOM_FOO in your test | 19:33 |
melwitt | ah ok | 19:33 |
melwitt | sophisticated | 19:33 |
mriedem | efried: can't set total to 0 but you can set reserved = total with 1.26 | 19:35 |
efried | yes | 19:35 |
efried | we did that specifically so you could take an ironic node out of the pool | 19:35 |
mriedem | yeah i know | 19:35 |
mriedem | mel is trying to trigger a failure on one provider's PUT inventories out of a set of providers in an aggregate | 19:36 |
mriedem | anyway, think it's figured out, i just didn't realize placement would let you do the other thing here https://github.com/openstack/placement/blob/master/placement/objects/resource_provider.py#L619 | 19:36 |
efried | does it matter what the failure is? | 19:36 |
efried | You could PUT a resource class that doesn't exist yet | 19:36 |
melwitt | like a custom class placement doesn't yet know about? | 19:37 |
efried | yeah | 19:37 |
melwitt | ahhh that would be easier | 19:37 |
efried | (careful if you're trying to drive this stuff through the report client - some paths will automatically ensure RCs for you) | 19:38 |
melwitt | I was thinking inside the box of the standard classes | 19:38 |
efried | well, you could make those fail too by setting total=0 :) | 19:38 |
melwitt | this is for osc-placement so should be safe there | 19:38 |
efried | or total=foo for that matter | 19:39 |
melwitt | ok, well, thanks for making this sound really stupidly easy :P | 19:39 |
efried | hell, getting an inventory PUT to *fail* is easy! | 19:39 |
efried | getting it to *succeed* is the hard part | 19:39 |
melwitt | well hang on | 19:39 |
melwitt | needs to fail for only one RP though | 19:40 |
efried | are you putting multiple inventories at once? | 19:40 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Apply SEV-specific guest config when SEV is required https://review.opendev.org/644565 | 19:40 |
efried | like via a single REST call? | 19:40 |
melwitt | this is a new --aggregate command which will loop over RPs in an aggregate and do the same inventory set on all of them | 19:40 |
melwitt | and I need to fail one in the middle | 19:40 |
efried | right, so you need to fail one (entire) REST call in a loop of REST calls | 19:40 |
melwitt | for test coverage for a "log error and continue" behavior | 19:40 |
melwitt | yeah | 19:41 |
efried | so yeah, you can just make a bogus inventory value in that one call | 19:41 |
efried | which is kinda what you're talking about doing I guess. | 19:41 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Enable booting of libvirt guests with AMD SEV memory encryption https://review.opendev.org/666616 | 19:41 |
melwitt | yeah but that's not what the command wants to do, it does the same 'set' for each | 19:41 |
melwitt | ideally I'd want to muck with one of the RPs ahead of time such that it would fail an identical request while the others succeed | 19:42 |
melwitt | so I think I will have to do the allocation thing | 19:43 |
*** markvoelker has joined #openstack-nova | 19:44 | |
efried | melwitt: Yeah, you can add a CUSTOM_FOO to (only) that RP and create an allocation against it. Then since you're PUTting the same inventory (which presumably lacks CUSTOM_FOO) against all the providers, it'll fail for that one since it would amount to removing the CUSTOM_FOO inventory. | 19:45 |
efried | Is that what mriedem suggested? | 19:45 |
melwitt | yes | 19:45 |
efried | smart man | 19:45 |
melwitt | yup | 19:45 |
*** markvoelker has quit IRC | 19:49 | |
*** BjoernT_ is now known as BjoernT | 19:50 | |
* mriedem writes it down | 19:54 | |
*** davee_ has quit IRC | 19:54 | |
melwitt | mriedem: how are you finding the good link to the post_test_hook run under ara-report? | 19:55 |
*** factor has joined #openstack-nova | 19:55 | |
mriedem | you know to get to the ara-report right? https://0bde141365cc25860c5b-0b668b63edb9a2dd0711fb78b2335400.ssl.cf1.rackcdn.com/672840/11/check/nova-next/daad30a/ara-report/ | 19:56 |
melwitt | yeah | 19:56 |
mriedem | i find the post.yaml with 1 task after the devstack-tempest run | 19:56 |
melwitt | oh, I was clicking the > arrow | 19:56 |
melwitt | and just saw a ton of hashes | 19:56 |
mriedem | expand that, then click on the 'changed' box | 19:56 |
melwitt | thanks | 19:57 |
mriedem | that gives you a popup, | 19:57 |
mriedem | then hit the chain link icon in the top right | 19:57 |
melwitt | got it, thank you | 19:58 |
mriedem | efried: reading your reply to alex's thread, why do you say that the numa and pci fields in the instance migratoin context are hypervisor specific? as far as i know they aren't, since they are used by libvirt and i think the hyperv driver as well during moves | 20:05 |
efried | mriedem: then they're specific to libvirt and hyperv. | 20:06 |
efried | that's luck more than anything else | 20:06 |
mriedem | you mean because they wouldn't work for powervm or what? | 20:06 |
efried | yeah, that's the example that springs to my mind obviously. | 20:06 |
efried | also pretty n/a for ironic if I'm not mistaken. | 20:07 |
mriedem | ironic doesn't migrate | 20:07 |
mriedem | but yeah | 20:07 |
efried | so like, *really* n/a :P | 20:08 |
mriedem | those fields are only populated if the instance has numa/pci stuff | 20:08 |
mriedem | which baremetal instances won't have | 20:08 |
efried | right. And in alex_xu's new world, `resources` would only get populated if the instance has local resources to track. | 20:08 |
efried | Any time we just need a count, we leave that bit empty and just use the placement inventory structure. | 20:09 |
mriedem | yeah i haven't read the whole message yet | 20:15 |
mriedem | nor the etherpad | 20:15 |
mriedem | only what i've gathered from irc conversations | 20:15 |
mriedem | efried: mordred: also, this is the patch to remove [neutron]/url from the neutron nova/neutron compute install guide in favor of a KSA-tastic world https://review.opendev.org/677810 | 20:16 |
mriedem | hopefully i didn't misrepresent in the commit message | 20:16 |
openstackgerrit | Adam Spiers proposed openstack/nova master: Enable booting of libvirt guests with AMD SEV memory encryption https://review.opendev.org/666616 | 20:19 |
efried | mriedem: nicely done, +1 | 20:22 |
mriedem | \o/ | 20:27 |
mriedem | i know you sdk/kds/sta people are particular | 20:27 |
mriedem | heh, i just realized that archive_deleted_rows returns 1 meaning, we archived some stuff, keep going until you get rc=0, | 20:28 |
mriedem | but if the command blows up in some unexpected way, we return 1 (rather than like 255) | 20:28 |
efried | that's eek | 20:28 |
mriedem | dansmith: that seems not ideal yeah? ^ | 20:28 |
mriedem | https://github.com/openstack/nova/blob/master/nova/cmd/manage.py#L2625 | 20:29 |
dansmith | hmm, | 20:29 |
dansmith | I thought we had an error case | 20:29 |
dansmith | oh | 20:29 |
dansmith | you mean if it bubbles up to main | 20:30 |
mriedem | right | 20:30 |
mriedem | TypeError or something dumb | 20:30 |
dansmith | yeah | 20:31 |
dansmith | honestly, that "return 1" there in main probably conflicts with other things too, no? | 20:32 |
dansmith | 255 is a common thing for "I dunno wtf happened" | 20:32 |
mriedem | right, nova-status returns 255, that's what got me looking at this | 20:33 |
openstackgerrit | Matt Riedemann proposed openstack/nova master: Document archive_deleted_rows return codes https://review.opendev.org/677819 | 20:38 |
efried | dansmith: couple easy ones, if you have a sec: https://review.opendev.org/#/c/676707/ and successor? | 20:38 |
dansmith | mriedem: yeah I dunno what danger might lurk by changing that now, but surely seems like the right thing | 20:40 |
mriedem | https://bugs.launchpad.net/nova/+bug/1840978 | 20:40 |
openstack | Launchpad bug 1840978 in OpenStack Compute (nova) "nova-manage commands with unexpected errors returning 1 conflict with expected cases of 1 for flow control" [Undecided,New] | 20:40 |
mriedem | yeah, just said that in the bug report, | 20:40 |
mriedem | i think it's just a simple upgrade reno yeah? | 20:40 |
dansmith | mriedem: well, more like the impact to tripleo, osa, etc | 20:41 |
dansmith | this potentially affects lots of manage commands | 20:41 |
dansmith | although this is really SHTF level fail cases, so nobody should really be depending on that | 20:41 |
mriedem | i counted simple_cell_setup, map_instances and heal_allocations as returning 1 in non-error cases | 20:43 |
efried | nobody should be relying on 1 being a real failure, because if they are, they're f'ed anyway. | 20:44 |
dansmith | mriedem: no, I mean cases in all the other commands that could raise | 20:44 |
efried | so I think this is a situation where it's okay to change it. | 20:44 |
dansmith | efried: that's my point, the only case would be somewhere that someone tests "is something done yet" by checking if something fails, and uses ret=1 to check | 20:44 |
openstackgerrit | Eric Fried proposed openstack/nova master: Revert "Filter UnsupportedServiceVersion warning" https://review.opendev.org/677825 | 20:44 |
efried | right, which is invalid. | 20:45 |
efried | I'm agreeing with you | 20:45 |
efried | mark your calendar | 20:45 |
dansmith | I don't think it's invalid | 20:46 |
openstackgerrit | Matt Riedemann proposed openstack/nova master: Revert "Filter UnsupportedServiceVersion warning" https://review.opendev.org/677825 | 20:46 |
dansmith | there are probably cases where there's no other way to tell if a thing is done yet or not | 20:46 |
*** damien_r has joined #openstack-nova | 20:49 | |
*** damien_r has quit IRC | 20:49 | |
openstackgerrit | Matt Riedemann proposed openstack/nova master: Revert "Filter UnsupportedServiceVersion warning" https://review.opendev.org/677825 | 20:50 |
mriedem | efried: just rebased on top of the change mentioned in the commit | 20:50 |
efried | yeah, good plan | 20:50 |
efried | though the warning would be gone anyway since 0.34.0 is in u-c | 20:51 |
*** damien_r has joined #openstack-nova | 20:58 | |
*** damien_r has quit IRC | 20:58 | |
*** nweinber_ has quit IRC | 20:59 | |
openstackgerrit | Matt Riedemann proposed openstack/nova master: Document archive_deleted_rows return codes https://review.opendev.org/677819 | 21:06 |
openstackgerrit | Matt Riedemann proposed openstack/nova master: Change nova-manage unexpected error return code to 255 https://review.opendev.org/677832 | 21:06 |
openstackgerrit | James E. Blair proposed openstack/nova master: Fix issues building PDF docs https://review.opendev.org/677028 | 21:14 |
*** altlogbot_3 has quit IRC | 21:16 | |
openstackgerrit | Matt Riedemann proposed openstack/nova master: Document map_instances return codes in table format https://review.opendev.org/677835 | 21:16 |
* mriedem goes to mow | 21:18 | |
*** mriedem has quit IRC | 21:18 | |
*** BjoernT has quit IRC | 21:22 | |
*** rcernin has joined #openstack-nova | 21:27 | |
*** altlogbot_0 has joined #openstack-nova | 21:37 | |
*** altlogbot_0 has quit IRC | 21:38 | |
*** altlogbot_2 has joined #openstack-nova | 21:41 | |
*** altlogbot_2 has quit IRC | 21:42 | |
*** takashin has joined #openstack-nova | 21:50 | |
*** ivve has quit IRC | 21:57 | |
*** markvoelker has joined #openstack-nova | 21:59 | |
*** bnemec has quit IRC | 21:59 | |
*** munimeha1 has quit IRC | 22:06 | |
*** mlavalle has quit IRC | 22:16 | |
*** markvoelker has quit IRC | 22:24 | |
openstackgerrit | Merged openstack/nova master: Deprecate Aggregate[Core|Ram|Disk]Filters https://review.opendev.org/673496 | 22:27 |
openstackgerrit | Merged openstack/nova master: Re-use DB MetaData during archive_deleted_rows https://review.opendev.org/675218 | 22:27 |
*** threestrands has joined #openstack-nova | 22:34 | |
*** hemna has quit IRC | 22:35 | |
*** markvoelker has joined #openstack-nova | 22:35 | |
*** rcernin has quit IRC | 22:40 | |
*** markvoelker has quit IRC | 22:40 | |
*** eharney has quit IRC | 22:41 | |
*** rcernin has joined #openstack-nova | 22:43 | |
*** tkajinam has joined #openstack-nova | 22:56 | |
*** macz has quit IRC | 23:00 |
Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!