03:02:26 <Sundar> #startmeeting openstack-cyborg 03:02:27 <openstack> Meeting started Thu Oct 24 03:02:26 2019 UTC and is due to finish in 60 minutes. The chair is Sundar. Information about MeetBot at http://wiki.debian.org/MeetBot. 03:02:28 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 03:02:30 <openstack> The meeting name has been set to 'openstack_cyborg' 03:02:43 <Sundar> #topic Who's here 03:02:46 <Sundar> o/ 03:02:48 <shaohe_feng> seems you did not finish last meeting 03:02:54 <s_shogo> Hi all 03:02:58 <s_shogo> #info s_shogo 03:03:02 <shaohe_feng> o/ 03:03:02 <chenke> #info chenke 03:03:05 <chenke> 0/ 03:03:12 <chenke> hi all~ 03:03:14 <shaohe_feng> #info shaohe_feng 03:03:19 <Yumeng> #info Yumeng 03:03:24 <Sundar> Hi y'all 03:03:30 <Sundar> #topic Agenda 03:03:40 <Sundar> https://wiki.openstack.org/wiki/Meetings/CyborgTeamMeeting#Agenda 03:03:42 <shaohe_feng> what does y' means? 03:03:57 <Sundar> :) "you all" 03:04:22 <Sundar> Anything to add to the agenda? 03:04:30 <xinranwang> hi all 03:04:37 <Sundar> Hey xinranwang 03:04:49 <Sundar> #topic Summit and PTG 03:05:26 <Sundar> First, re. project update at the summit, I got clarification after the last meeting. 03:05:39 <shaohe_feng> will miss the Team dinner :') 03:05:41 <Sundar> It is actually on. But it is not recorded as video. 03:06:27 <Sundar> Sorry to have mis-stated before I got the clarification. Yumeng, I will create the slides and we can work on them. 03:07:01 <Sundar> shaohe_feng: we will get you a good dinner in Beijing ;) 03:07:33 <shaohe_feng> Thanks 03:07:48 <shaohe_feng> you will visit Beijing? 03:07:58 <shaohe_feng> after PTG 03:08:05 <Sundar> PTG: #link http://ptg.openstack.org/ 03:08:18 <Yumeng> Sundar: ok. 03:08:26 <Sundar> All the info for PTG is here. 03:08:45 <Sundar> It turns out that nearly all projects get only a table, not a separate room. 03:09:21 <Sundar> http://ptg.openstack.org/ptg.html Diable is a table, not a room 03:09:25 <Sundar> *Diablo 03:09:48 <Sundar> Not sure how well that works, but that's the case for most projects 03:11:35 <Sundar> Etherpad: The current etherpad (https://etherpad.openstack.org/p/cyborg-ptg-ussuri) will need to be merged into the final one: https://etherpad.openstack.org/p/shanghai-ptg-cyborg 03:11:49 <chenke> We have three days 03:11:49 <Sundar> I will take care of it, once we have enough content in the current etherpad 03:12:14 <Sundar> chenke: True -- but that is a misleading because Wednesday includes the summit too 03:12:36 <chenke> Yes. 03:12:48 <Sundar> I have activities on Wednesday, such as office hours for Cyborg 03:13:23 <Sundar> Also, the project onboarding is part of the PTG this time. So, we will be spending some time talking to newcomers who want to contribute to Cyborg 03:13:42 <Sundar> I suspect it will really be 2 days of discussion 03:14:44 <chenke> It seems that time is sufficient. 03:14:54 <Sundar> Finally: the team photo shoot is part of the PTG: on Thu at 2 pm (after lunch). But that is only 10-15 min 03:15:19 <Sundar> Does anybody have any questions or comments on the PG or the summit? 03:15:21 <Sundar> *PTG 03:17:35 <Sundar> OK, moving on 03:17:43 <Sundar> #topic Doc patches 03:18:24 <Sundar> The documentation has merged into stable/train -- yay! Thanks, xinranwang, Yumeng, chenke and all for proposing/reviewing it. 03:18:43 <chenke> Great. 03:19:00 <Sundar> Now Train has truly left the station for Cyborg. ;) 03:20:02 <Sundar> I see soem new patches from Yumeng and others for docs. Yumeng, you don't expect this to be backported to Train, right? 03:21:27 <Yumeng> sundar: no, no need to backported to Train. what do you think? 03:21:32 <Sundar> We need to check how this differs from other API doc that got merged 03:21:42 <Yumeng> https://review.opendev.org/#/c/690539/ 03:22:35 <Sundar> Your patch has more detail. 03:23:51 <Sundar> OK, anything else on docs? We need similar content for ARQs, etc., I suppose. 03:23:56 <Yumeng> I took keystone api-ref as a reference, they have deprecated and current APIs 03:24:27 <Yumeng> here is the keystone api-ref: https://docs.openstack.org/api-ref/identity/ 03:24:31 <Yumeng> take a look 03:24:38 <Sundar> Cool. 03:24:54 <Sundar> #topic Other patches 03:25:19 <Sundar> https://review.opendev.org/#/c/685542/ 03:25:58 <Sundar> My question is, should we expend more effort on the ksa_adapter and older methods, or should we aim to move to openstacksdk? 03:27:09 <chenke> According to eric's advice. He suggest we use openstacksdk. 03:28:33 <Sundar> So, what should be the status of this patch? 03:28:50 <chenke> It is easier to use than before. 03:29:23 <chenke> If your devstack env test ok. 03:29:47 <chenke> I suggest to merge this patch and this https://review.opendev.org/#/c/690509/2. 03:30:19 <Sundar> Ok, I'll try it out. The 2nd one is definitely an improvement 03:30:31 <xinranwang> Please review this patch too, after async bind merged, GPU is not supportted. https://review.opendev.org/#/c/688239/ 03:30:37 <chenke> Ye. 03:30:49 <chenke> Sundar: Thanks. 03:31:20 <xinranwang> And there is a bug in placement client, please review this one too :) https://review.opendev.org/#/c/688231/ 03:31:31 <Yumeng> Sundar: I think we can merge patch 685542, no more effort on ksa_adapter and openstacksdk for now. 03:32:36 <Sundar> xinranwang, Yumeng: yes to both 03:33:31 <Sundar> Any other patches to be highlighted today? I know there are a bunch of others that need review. 03:35:04 <Sundar> #topic AoB 03:35:21 <Sundar> If there is nothing else, we can get back 25 minutes. 03:36:11 <shaohe_feng> Will we support Orchestrator components(or system IAAS sotrware tools) also want to leverage accelerators? 03:36:25 <shaohe_feng> These components may run on the same host node with VM guest. 03:36:39 <shaohe_feng> Seems this is out of cyborg control. (should we add a reservation mechanism for them?) 03:36:44 <Sundar> Could you elaborate or provide links? 03:37:11 <shaohe_feng> I have attend a meeting about accelerators. 03:38:12 <shaohe_feng> Get the information, not only VM need accelerator, but maybe some other applications run on the host node also need accelerators 03:39:02 <Sundar> It is not common in an OpenStack cluster to have VMs and non-VM processes running on the same compute node. 03:39:51 <shaohe_feng> I meam the non-VM processes are openstack required components 03:40:03 <shaohe_feng> Not sure you know RDT. 03:40:24 <Sundar> Yes, I know RDT. Are you referring to RMD? 03:40:34 <shaohe_feng> No 03:40:37 <shaohe_feng> RDT 03:40:53 <shaohe_feng> the use case is: 03:41:04 <shaohe_feng> Divide cache into 2 parts. 03:41:29 <Sundar> Resource Director technology (RDT) is a set of Intel processor features for cache/memory management. They can be used in OpenStack in different ways. 03:41:42 <shaohe_feng> 1 parts are reserved for system IAAS sotrware run on the node 03:42:04 <shaohe_feng> another parts are reserved for VM. 03:42:49 <Sundar> That part is outside Cyborg though -- it relates to cache/memory and should be configured by the operator. How does it relate to accelerators? 03:43:17 <shaohe_feng> We defined, cyborg can only allocate accelerators to nova, right? 03:43:48 <shaohe_feng> I means like RDT. 03:43:58 <shaohe_feng> we have 5 accelerators. 03:44:11 <shaohe_feng> we divided into 2 parts. 03:45:20 <shaohe_feng> 1 parts have 1 acc, it is out of cyborg control. It is reserved for system IAAS sotrware. Admin allocate them 03:45:56 <shaohe_feng> another parts have the remain 4 acc, cyborg allocate them to nova. 03:46:09 <shaohe_feng> similar like RDT use case. 03:46:38 <shaohe_feng> on we can simple define cyborg. 03:47:20 <shaohe_feng> if cyborg run on a host node. we not allow to allocate accelerators to system IAAS sotrware any more. 03:47:38 <shaohe_feng> we can note this on cyborg docs. 03:47:44 <Sundar> If the admin wants to keep aside some accelerators, he/she can update Placement to mark that resource provider as in-use. 03:48:22 <Sundar> From Cyborg's POV it created the RP but Placement will never pick that RP because it sees it as in-use 03:49:26 <xinranwang> As we know, cinder has enable qat driver, I guess shaohe want to know if Cyborg can help cinder manage this kind of accelerator, is that what you want say, shaohe_feng? 03:49:44 <shaohe_feng> not only cinder 03:49:53 <shaohe_feng> also for some VPN on the host. 03:50:02 <chenke> what does POV means? 03:50:02 <shaohe_feng> and storage 03:50:08 <xinranwang> yes, cinder is just an example 03:50:12 <Sundar> chenke: point of view 03:50:17 <shaohe_feng> DPDK, SPDK also need . 03:50:25 <shaohe_feng> yes, it is an example 03:50:49 <shaohe_feng> so does that means, let cyborg run first before other system IAAS sotrware, it report resources to placement first 03:51:34 <shaohe_feng> and amdin reserve some accelerators, and start the remain system IAAS sotrwares 03:52:11 <shaohe_feng> and amdin reserve some accelerators to the remain system IAAS sotrwares 03:52:57 <Sundar> Not sure where this discussion is headed. If Cinder is using a device, operator should not configure Cyborg driver for that device. If the admin wants to dedicate some networking/storage devices for DPDK/SPDK and they were claimed by Cyborg, the admin could still mark them in placement, as I said. 03:53:01 <shaohe_feng> We can add this usage as a guideline in cyborg doc. 03:54:04 <shaohe_feng> Sundar: can we add this to the cyborg doc? 03:54:17 <Sundar> shaohe_feng: It may be better to wait for concrete usage by operators and get their feedback, instead of adding hypothetical use cases in the docs -- they could be confusing. 03:54:19 <shaohe_feng> maybe DPDK as example. 03:55:10 <s_shogo> I also interested in disabling to allocate some accelerators, as maintenance. 03:55:43 <Sundar> s_shogo: Only when no VM is using that device, right? 03:55:56 <s_shogo> Sundar: yes 03:56:00 <shaohe_feng> Maybe we had better not allow DPDK to use accelerators any more. This can make things more simple. 03:56:09 <xinranwang> Currently, operator plays this role. 03:57:18 <shaohe_feng> For example, had better to avoid DPDK to use some network accelerator. 03:57:50 <shaohe_feng> Then things is simple. 03:57:53 <Sundar> s_shogo: Would the admin-Placement method work for now? We had plans to add health reporting and enabling/disabling devices. Once we have the /v2/devices API, we could add this functionality. Seems ok? 03:58:46 <Sundar> shaohe_feng: Cyborg does not support network resources today. 03:59:02 <shaohe_feng> yes, just a example. 03:59:35 <shaohe_feng> QAT can be used for some network accelerations 03:59:47 <shaohe_feng> and it can be used by DPDK. 04:00:00 <shaohe_feng> not sure cyborg will support QAT 04:00:23 <s_shogo> Sundar: Yes, the approach is good. I want to discuss the methods continuously with example usecase for maintenance and that's requirements. 04:00:26 <shaohe_feng> Or cyborg just plan to manager only VM used accelerators. 04:01:37 <Sundar> shaohe_feng: Do you have specific knowledge that somebody wants to have several QATs in one host, some for VMs, and some for DPDK on the host? 04:02:45 <Sundar> s_shogo: Sure, let's make this a discussion topic at the PTG. Your I could create an etherpad with the problem statement and some ideas. 04:03:15 <Sundar> *You or I 04:04:25 <shaohe_feng> usually,they use kolla to deploy the system IAAS sotrwares include (placement), How do kolla know what's accelerators in placement? 04:04:30 <s_shogo> Sundar : Thanks, I'll update the etherpad (part of the topic is already written) 04:05:31 <Sundar> My suggestion: let's get Nova integ done and make something functional. Then we can start talking about more advanced use cases. 04:05:52 <Sundar> We are over the time. Is there anything else? 04:06:49 <shaohe_feng> Yes, the meeting tells use DPDK can use accelerators. But they does not say DPDK can not live with VM on same host. 04:07:10 <shaohe_feng> Ok. 04:07:36 <shaohe_feng> seems, we did not support much advanced use cases at present. 04:07:40 <shaohe_feng> step by step 04:08:10 <Sundar> shaohe_feng: Let's take this offline. 04:08:16 <Sundar> Good. Thanks a lot, everybody. Have a good day! 04:08:20 <Sundar> #endmeeting