Wednesday, 2024-06-19

ttxLarge Scale SIG meeting here in one hour!07:59
amorinWill be there!08:17
ttxsongwenping: hoping you can join the meeting in 15 minutes so we address questions on your doc!08:42
stanHow do I observe the meeting, from here in this chat?08:49
amorinstan: yes, this will be an IRC meeting08:55
ttxAnd everyone can participate!08:56
ttx#startmeeting large_scale_sig09:00
opendevmeetMeeting started Wed Jun 19 09:00:01 2024 UTC and is due to finish in 60 minutes.  The chair is ttx. Information about MeetBot at http://wiki.debian.org/MeetBot.09:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.09:00
opendevmeetThe meeting name has been set to 'large_scale_sig'09:00
ttxHi everyone, welcome to our monthly Large Scale SIG meeting!09:00
amorino/09:00
ttx#topic Rollcall09:00
ttxping felix.huettner songwenping 09:00
ttxOur agenda is at:09:01
ttx#link  https://etherpad.opendev.org/p/large-scale-sig-meeting09:01
ttxWaiting a few minutes in case other participants join late09:02
songwenping\o/09:02
songwenpinghi ttx09:02
ttxsongwenping: hi! Glad you could make it09:02
ttxOK, let's get started09:03
ttx#topic Brainstorm OpenInfra Live next episode ideas09:03
ttxIn previous meetings we discussed a potential new episode, after unsuccessfully trying to crowdsource one frmo the rset of the community09:03
ttxamorin: we were considering one around infrastructure for GPUs, did you manage to convince anyone at OVH around that?09:04
amorinI completely forgot to talk about it unfortunately, sorry for this09:04
amorinso I will ask, adding in my local todo right now09:04
ttxI was wondering if we could get https://www.nexgencloud.com/ to talk09:05
ttxThey are a one of the biggest buyers of GPUs recently and run an openstack cloud09:05
amorinwhat is the idea in your mind?09:05
amorinhow openstack and gpu can work together?09:05
amorinor how is it consumed by customers?09:05
amorinor the usage of GPU in cloud?09:06
ttxSpecific challenges in providing a large scale GPU cloud, I guess09:06
ttxidentifying any gap09:06
songwenpingGPU management? our product adapt many kinds of GPUs, like A09:06
ttxtrying to anticipate questions the next GPU cloud deployer may have09:07
amorinso, so more related to infrastructure than customer use cases09:07
ttxyeah... Would not mind some shiny workload example too, but that's a bit orthogonal to our SIG purpose09:07
songwenpingA100, A40, V100, P100 and so on.09:07
ttxCould be more of a panel thing09:07
amorinok, I have a guy for this in the team, will ask if he is willing to join/talk about it09:08
ttxExperience operating an OpenStack GPU cloud those days09:08
ttxcool. We'll reach out to Nexgen see if they are interested09:08
ttxand then open it up to others09:08
ttxprobably somethign we'd do in ~October09:09
ttxSeptember we'll be busy at OpenInfra Summit Asia09:09
amorinack, so we have time to refine this, that'd good09:09
ttxand July-August will be tricky09:09
ttx#agreed let's try to do a panel episode around Experience operating an OpenStack GPU cloud09:10
ttx#action amorin to confirm an OVHCloud speaker09:10
amorinack09:10
ttx#action ttx to see if someone from nexgen would be interested09:10
ttx#info targeting October timeframe09:10
amorinmaybe have sylvain bauza in the talk as well? he is involved in GPU and openstack a lot09:11
ttxyeah that's a good idea...09:11
ttx#info Sylvain Bauza could bring the development angle09:11
ttxI'll give it some extra thought and pull Allison in for extra ideas09:12
ttxmoving on to next topic09:12
ttx#topic Large scale doc09:12
ttxsongwenping sent a great report to the mailing-list09:12
ttx#link https://etherpad.opendev.org/p/large-scale-inspur09:12
ttxThere were some open followup questions09:12
amorinyes, that's great, thanks!09:13
ttxmnaser asked "How did you adjust the max number of conns for RabbitMQ and for the relay I assume you used https://docs.ovn.org/en/latest/tutorials/ovn-ovsdb-relay.html ?"09:13
ttxthan amorin had questions too09:13
ttxthen*09:13
amorinyup, I am eager to learn more about what you wanted to achieve and what you exactly did to fix your deployment09:14
songwenpingamorin, good question. we want to manage more nodes as there are big requirement for customer.09:16
ttxsongwenping: did you see those questions on the mailing-list? ideally you would respond there so that everyone benefits09:16
songwenpingwe use k8s infrastructure to deploy openstack09:16
songwenpingsorry, maybe i miss the mail09:17
amorine.g. you mentionned booting 3k instances and having scheduler / placement issue. Is it because you ask those 3k instance in one shot?09:17
ttxsongwenping: still here?09:19
songwenpingyeah09:20
songwenpingi am finding the mail.09:20
songwenpingbut still not find :(09:20
ttxah, let me link09:20
ttx#link https://lists.openstack.org/archives/list/openstack-discuss@lists.openstack.org/thread/ISIG5TG4DYCTDTP4ZJNJFYCSUVYMX5BT/09:21
ttxyou will see both questions there ^09:21
songwenpingamorin, yes, we send requests to create 3k instances in one shot.09:21
amorinthat make sense then, that's an unusual use case, amazing!09:22
ttxideally you would reply by email to the mailing-list again, adressing mnaser's and amorin's questions09:22
ttxthat way everyone else can see the answers09:22
amorinyes, sounds good to me also09:22
ttxsongwenping: would that work for you?09:23
songwenpingttx, could you please forward the mail to me?09:23
amorinit's weird you did no receive it, maybe check you spam box?09:24
ttxcan you see them at the link I just posted?09:24
ttxhttps://lists.openstack.org/archives/list/openstack-discuss@lists.openstack.org/thread/ISIG5TG4DYCTDTP4ZJNJFYCSUVYMX5BT/09:24
songwenpingi can see at the link09:24
ttxok perfect09:24
ttx#action songwenping to reply to the questions on the mailing-list09:25
ttxamorin: is there anything new in the report that could be documented in the large-scale sig doc?09:25
songwenpingbut i canot reply at the link.09:26
amorinI believe yes, we can have something new to add to the doc09:26
ttxI'll forward you both emails now09:27
amorinhowever, we need to explain your use-case correctly also, because, e.g. max_connections = 100000 is unusual and maybe counter productive09:27
songwenpingttx, thansks.09:27
amorinthe rabbit config you did also, I need to understand the details of it09:28
amorinmaybe your situation could also be improved if you switch to quorum queues09:28
amorinI dont know for now to be honest09:28
amorinlet's continue the mail thread09:28
ttxOK emails forwarded... let me know if you receive them :)09:29
songwenpingamorin, we donnot use quorum queues.09:29
ttxOK let's continue the discussion on the mailing-list and we'll see if we can extract a few things from the story to add to the doc09:31
amorinyup09:31
ttx#topic Next meeting(s)09:31
songwenpingttx, exactly not yet receive.09:31
ttxNormally the next meeting would be on July 17, but I won't be around. Should we skip for summer and do next one September 18?09:31
ttxsongwenping: sent to the inspur.com address you used to post 09:32
songwenpingamorin, i will complete the rabbit detail optimization on the etherpad.09:32
ttxgreat!09:32
amorinthanks09:32
songwenpingttx, recevied just now, thanks.09:33
amorinjuly 17 I will also be off09:33
ttxOK so that one is a skip for sure09:33
amorinwe can maybe skip meetings this summer, agre09:33
ttxWe could keep the August 21 one if you are around09:34
amorinI should be there09:35
ttxOK let's keep it on the agenda09:35
ttx#info next meeting, August 21 on IRC09:35
ttx#topic Open discussion09:35
ttxAnything else we should cover today?09:36
amorinmaybe stan you were there to talk about something?09:36
ttxstan: still around?09:38
amorinnothing more on my side09:39
ttxalright then09:39
ttx#endmeeting09:39
opendevmeetMeeting ended Wed Jun 19 09:39:44 2024 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)09:39
opendevmeetMinutes:        https://meetings.opendev.org/meetings/large_scale_sig/2024/large_scale_sig.2024-06-19-09.00.html09:39
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/large_scale_sig/2024/large_scale_sig.2024-06-19-09.00.txt09:39
opendevmeetLog:            https://meetings.opendev.org/meetings/large_scale_sig/2024/large_scale_sig.2024-06-19-09.00.log.html09:39
amorinthank you!09:39
songwenpingnothing from myside09:39
songwenpingbye09:40
ttxThanks amorin and songwenping for participating!09:40

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!