Tuesday, 2024-06-11

*** bauzas_ is now known as bauzas03:44
*** bauzas_ is now known as bauzas04:09
*** bauzas_ is now known as bauzas04:22
*** bauzas_ is now known as bauzas05:27
hamidlotfi_Hi there,06:05
hamidlotfi_An instance whose volume is on CEPH, after abnormally restarting the Compute server where that instance is located, that instance does not come up and displays the following error:06:05
hamidlotfi_https://www.irccloud.com/pastebin/n5z4qgB7/06:05
hamidlotfi_ I checked all of those values for the disk_cachemodes parameter,06:05
hamidlotfi_network=directsync 06:05
hamidlotfi_network=none06:05
hamidlotfi_network=writethrough06:05
hamidlotfi_but in all of the values, the result is the same and I had that error.06:05
*** elodilles_pto is now known as elodilles07:26
*** bauzas_ is now known as bauzas07:46
opendevreviewSahid Orentino Ferdjaoui proposed openstack/nova master: scheduler: fix _get_sharing_providers to support unlimited aggr  https://review.opendev.org/c/openstack/nova/+/92166508:09
*** bauzas_ is now known as bauzas09:17
sean-k-mooneygibi: can you review https://review.opendev.org/c/openstack/nova/+/710848 and the proceedign repoducer again11:16
sean-k-mooneywhen you have time of course but this has been arrond for a long time and if you agree i would like to back port that to stable branches11:17
gibiadded to my queue12:09
*** ykarel_ is now known as ykarel13:10
WJeffsHey, I might be being blind, but is there any options for Numa placement when allocating them to a VM. Say we have a HV with 8 numa groups across 2 cpus, and I want to allocate all the cores from 2 groups, thats no issue, but it gives me 1+6, but is there a way we can make it always use numa on same physical cpu at all times?13:15
sean-k-mooneyWJeffs: form the same cpu package no. if you have a numa affined guest and ask for 2 numa nodes we will either pack or balance the vm across the numa node depenign on a config option13:59
sean-k-mooneyWJeffs: in the past the beahvior was to pack leading to poor performance we now default to spreading 13:59
sean-k-mooneyWJeffs: we did have a request ot supprot socket level optimiations for amd cpus in the past14:00
sean-k-mooneybut we have not had time to implemtn that14:00
WJeffssean-k-mooney: ah it is for AMD Genoa, I guess that is the same part, we are already creating the numa configuration inside the VM and that works great and gives us good performance, now we are trying to test some extra configurations and found that the numas are spread across sockets15:00
WJeffsI'll have a look for the request/blueprint and see if there is anything we can do to help on this.15:01
sean-k-mooneyWJeffs: this is basicaly all contolled by https://github.com/openstack/nova/blob/master/nova/virt/hardware.py#L2449-L2511 currently15:05
sean-k-mooneyto do what you want we would need to add the socket info int the numa toplogy15:05
sean-k-mooneyand then add an addtional sort here https://github.com/openstack/nova/blob/master/nova/virt/hardware.py#L248115:06
sean-k-mooneyto sort the nodes by socket15:06
sean-k-mooneysorry not there just below it https://github.com/openstack/nova/blob/master/nova/virt/hardware.py#L248815:07
sean-k-mooneythe les is for pinned cpus15:07
*** bauzas_ is now known as bauzas15:25
WJeffssean-k-mooney: Thanks, I'll take a look and see if there is something can do to help15:38
sean-k-mooneythe main gap right now is we dont track the socket id in the host numa topogy data stucture15:40
sean-k-mooneyor the numa distance btween numa ndoes15:40
sean-k-mooneysince that really waht you want to minimise15:40
sean-k-mooneylibvirt provide both we jsut dont collect it right now15:40
WJeffsyea, thats the main thing we are trying to do. So its collect from libvirt -> use that data that needs to be added.15:41
sean-k-mooneyya so artom  looked at this brifly a year or two ago15:41
sean-k-mooneywe have a downstream tracker in our backlog to eventually look at this https://issues.redhat.com/browse/OSPRH-3615:42
sean-k-mooneybut we jsut didnt have time or custoemr ask to priories it15:42
artomI even have/had WIP patches up, at some point15:42
artomJust the basics to collect those distance from libvirt15:42
sean-k-mooneyhttps://review.opendev.org/q/topic:%22bp/libvirt-smarter-cpu-placement%2215:42
sean-k-mooneyyep ^15:42
sean-k-mooneythere in merge conflict but its proably pretty trivial15:43
sean-k-mooneythis was the bluepritn https://blueprints.launchpad.net/nova/+spec/libvirt-smarter-cpu-placement15:43
sean-k-mooneybtu not much detail there15:43
WJeffsah wow, it was pretty close then, just come work to tidy up.15:44
sean-k-mooneytests, ensuring this works for live mirgation that sort of thing15:47
WJeffsappreciate all the hard work guys15:47
sean-k-mooneyi dont recal exactly how far artom got but its a good starting point15:47
artomThose two patches are literally all I had.15:47
artomThe hard part is integrating that into the placement of VMs15:48
bauzasnova meeting in 5 mins here15:55
bauzasI could be a bit lagging, don't be afraid15:55
bauzas#startmeeting nova16:00
opendevmeetMeeting started Tue Jun 11 16:00:22 2024 UTC and is due to finish in 60 minutes.  The chair is bauzas. Information about MeetBot at http://wiki.debian.org/MeetBot.16:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.16:00
opendevmeetThe meeting name has been set to 'nova'16:00
bauzas#link https://wiki.openstack.org/wiki/Meetings/Nova#Agenda_for_next_meeting16:00
gibio/16:00
elodilleso/16:00
fwiesel\o16:01
dansmitho/16:01
auniyalo/16:01
bauzasmy current time : 16:01
bauzas64 bytes from par21s19-in-f4.1e100.net (142.250.179.68): icmp_seq=27972 ttl=112 time=193 ms16:01
bauzasanyway, let's start16:01
bauzas#topic Bugs (stuck/critical) 16:02
bauzas#info No Critical bug16:02
bauzas#info Add yourself in the team bug roster if you want to help https://etherpad.opendev.org/p/nova-bug-triage-roster16:02
bauzasanything to discuss ?16:02
bauzaslooks not16:03
bauzas#topic Gate status 16:03
bauzas#link https://bugs.launchpad.net/nova/+bugs?field.tag=gate-failure Nova gate bugs 16:03
bauzas#link https://etherpad.opendev.org/p/nova-ci-failures-minimal16:03
bauzas#link https://zuul.openstack.org/builds?project=openstack%2Fnova&project=openstack%2Fplacement&pipeline=periodic-weekly Nova&Placement periodic jobs status16:03
bauzaswe had a nova-emulation master job failure https://zuul.openstack.org/build/2f26196745fe4cc8a93b5453317710ee16:03
sean-k-mooneythe fix for that job is still not merged16:04
sean-k-mooneyit keeps hitting job timeout or unrelated failures in other jobs16:04
bauzasbecause of when deleting some instance, it was not found16:04
sean-k-mooneyi have started to look in to how to gather some data to deteimn if we should bump the timeouts in the integrated-compute jobs and its decendent jobs16:04
bauzasok16:05
sean-k-mooneythe simple solution woudl be to temporally add 30mins to the integrated comptue job while i try and figure out what a more reasonable value would be16:05
bauzasbut it was not a timeout issue 16:06
sean-k-mooneynot in this case16:06
sean-k-mooneybut the patch that fixes the OOM issue in that job16:06
sean-k-mooneyis being blocked by timouts in other jobs16:06
bauzasah ok16:06
sean-k-mooneyanyway i think we can move on for now16:07
bauzas#info Please look at the gate failures and file a bug report with the gate-failure tag.16:07
bauzas#info Please try to provide meaningful comment when you recheck16:07
bauzas#topic Release Planning 16:07
bauzas#link https://releases.openstack.org/dalmatian/schedule.html16:07
bauzas#info Dalmatian-2 in 3 weeks16:07
bauzas#info we'll have a spec review day on July 3rd16:08
bauzas(which is a Wednesday)16:08
bauzas#action bauzas to tell this in the mailing-list16:08
bauzas(on July 2nd I'll be off)16:09
bauzaswould people prefer to have this day on Monday, as July 4th is some US holiday16:09
bauzas?16:09
bauzasis it me lagging or nobody replied ?16:11
dansmithnobody replied16:11
bauzasthoughts then ?16:11
dansmithit's right around the american holiday16:12
dansmithmost of us will be plotting to blow up our neighbors on the following day16:12
sean-k-mooneyi have no real prefernce os i think tis also fine if we do it semi async16:13
sean-k-mooneyi.e. if us folks wont be around that day and feel like doing reviews on monday then the rest of us will already have that feedback16:13
sean-k-mooneyon wednesday16:13
bauzasok, let's do this on monday and I'll explain that it can be async for the week16:15
bauzasmoving on16:15
bauzas#topic Review priorities 16:15
bauzas#link https://etherpad.opendev.org/p/nova-dalmatian-status16:15
bauzasnothing to report, I'm planning to review specs but my lag doesn't help much today16:16
bauzashopefully my connection will come back tomorrow16:16
* bauzas won't tell how his fiber is good16:16
bauzasmoving on16:17
bauzas#topic Stable Branches 16:17
bauzaselodilles: please16:17
elodilles#info stable gates should be OK16:17
elodilles#info nova 29.0.2 released for 2024.1 Caracal stable series16:17
elodillesthanks for the review o/16:17
elodilles#info stable branch status / gate failures tracking etherpad: https://etherpad.opendev.org/p/nova-stable-branch-ci16:17
elodillesand that's all from me about stable16:18
bauzascool16:20
bauzasthanks16:20
bauzas#topic vmwareapi 3rd-party CI efforts Highlights 16:20
bauzasfwiesel: anything to tell ?16:20
fwiesel#info Fixed bug in resolving the right branch in case of renaming from "stable/" to "unsupported/"16:20
fwieselNot much, but a small change... I query now the gerrit api to find out how the actual branch name is for each project.16:21
fwieselAny questions or feedback?16:21
elodilles(* "unmaintained" o:))16:21
bauzasyeah, unmaintained is the right wording16:22
bauzasthanks fwiesel for that16:22
fwieselAh, sorry. Yeah, in the code it is correct :)16:22
elodilles:)16:22
fwieselThat's then from my side wrt the 3rd-party CI16:23
bauzascool16:24
bauzas#topic Open discussion 16:24
bauzasanything anyone ?16:24
fwieselHopefully a quick question: I have a draft for a blueprint in etherpad: https://etherpad.opendev.org/p/nova-lazy-metadata-loading 16:24
fwieselHow do I continue from there on?16:24
fwieselRegister it right away and put that in the url?16:25
fwieselOr ask here for feedback? (as I have done then implicitly)16:25
bauzaswell, usually, we discuss specless blueprints in our meetings16:26
bauzasif you want to have it accepted, it's the right time16:26
bauzasor do you want to discuss the design ?16:26
* bauzas clicks but it will take a bit of time16:26
bauzasoh, that's a spec template, nevermind16:27
bauzasI think you can now create the blueprint, link it in the spec and upload that spec16:27
fwieselin nova-specs then?16:28
bauzasyou need to register a blueprint with https://blueprints.launchpad.net/nova/+spec/lazy-metadata-loading16:28
bauzasthen you need to upload the file you created in the nova-specs/2024.2/approved repor16:28
bauzass/repor/directory16:28
bauzasand then push that commit to gerrit16:29
fwieselOkay, thanks. Will do16:30
bauzasfwiesel: pro-tip, in the commit msg (in the nova-specs repo), add a specific "blueprint lazy-metadata-loading" tag in the text16:32
bauzassomething like "Proposes blueprint lazy-metadata-loading"16:32
bauzasit will automatically link the spec in the blueprint whiteboard16:32
bauzasideally, create local branch from that name, so gerrit will have this topic 16:33
fwieselGreat, I'll do that.16:33
bauzasokay, anything else ?16:36
bauzasok, I think we can close the meeting 16:40
bauzasthanks guys 16:40
bauzas#endmeeting16:40
opendevmeetMeeting ended Tue Jun 11 16:40:52 2024 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)16:40
opendevmeetMinutes:        https://meetings.opendev.org/meetings/nova/2024/nova.2024-06-11-16.00.html16:40
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/nova/2024/nova.2024-06-11-16.00.txt16:40
opendevmeetLog:            https://meetings.opendev.org/meetings/nova/2024/nova.2024-06-11-16.00.log.html16:40
elodillesthanks o/16:41
fwiesel\o16:42
*** bauzas_ is now known as bauzas17:46
*** bauzas_ is now known as bauzas19:44
*** bauzas_ is now known as bauzas19:57
*** bauzas_ is now known as bauzas20:05
*** bauzas_ is now known as bauzas20:50
*** bauzas_ is now known as bauzas20:58
*** bauzas_ is now known as bauzas21:06
*** bauzas_ is now known as bauzas23:22
*** bauzas_ is now known as bauzas23:55

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!