03:03:04 #startmeeting openstack-cyborg 03:03:05 Meeting started Wed Jun 5 03:03:04 2019 UTC and is due to finish in 60 minutes. The chair is Sundar. Information about MeetBot at http://wiki.debian.org/MeetBot. 03:03:06 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 03:03:08 The meeting name has been set to 'openstack_cyborg' 03:03:16 #topic Roll call 03:03:36 #info Sundar 03:04:24 wangzhh, ikuo_o, Biwei: o/ 03:04:34 #info yikun 03:04:41 #info wangzhh 03:04:45 #info ikuo_o 03:04:47 Hi yikun 03:04:56 #info Biwei 03:04:58 hey, Sundar 03:05:07 Agenda: https://wiki.openstack.org/wiki/Meetings/CyborgTeamMeeting#Agenda 03:05:13 ANything to add to that? 03:06:05 #topic Python cyborg client decision 03:06:10 Coco said she has other bussiness. Will miss this meeting. 03:06:24 Please see the thread starting from http://lists.openstack.org/pipermail/openstack-discuss/2019-May/006543.html 03:06:41 wangzhh, thanks for letting us know 03:07:38 For the client, we have 2 options: write our own plugin (as many other projects seem to have done) or integrate with openstackclient repo (less work, but need to depend on the repo owners for reviews) 03:08:56 For the 2nd option, there are some guidelines and rules, which seem ok to me. But we have to discuss with the repo owners about their opinions 03:08:57 Hi all 03:09:04 #info xinranwang 03:09:11 Hi xinranwang 03:09:34 Writing a plugin may not be that bad, given the previous examples 03:09:39 What do you all think? 03:09:58 So, we have to choose one from these two option? 03:10:46 yikun, do you have any other option? These seem like the 2 ways that are recommended or have precedents. We could do neither and write our own client -- but that is not recommended 03:10:47 yikun, I think so. Do u have other suggestions? 03:10:56 I think writing our own plugin is better 03:11:16 Thanks for summarizing. 03:11:26 More indenpendency 03:11:34 Yes, I also vote the first one. Agree with xinran. 03:12:31 It is better to write plugin, but I heard the option is not prepared by OSC. 03:12:48 Maybe my misunderstanding... 03:12:49 OK, just want to sure any other way to complete, and I also vote to the 1st 03:12:58 ikuo_o: "option is not prepared by OSC" -- what do you mean? 03:13:33 OSC folks are not recomending against plugin -- they gave us the options. 03:13:52 please also consider the case where cyborg can run as a standalone project 03:14:24 xinranwang: Sure, both options should be compatible with standalone usage 03:15:17 I see. I thought the plugin option is difficult for OSC side, but maybe my misunderstanding 03:15:17 If it is realized in other works, plugin is better I think. 03:15:19 Sundar, do u have any example of these two options? Maybe two different projects with two options. 03:15:51 We do have the cyborgclient repo with some V1 APIs implemented, right? 03:16:00 wangzhh: There are many examples of plugins: https://docs.openstack.org/python-openstackclient/pike/contributor/plugins.html 03:16:06 Thx. 03:16:20 https://github.com/openstack/python-cyborgclient is that what you are talking about? Xinran 03:17:11 Biwei: Ah yes, thanks 03:17:14 wangzhh: Some examples of commands integrated with OSC: https://github.com/openstack/python-openstackclient/tree/master/openstackclient 03:17:22 wangzhh: I think the nova is 2nd(https://github.com/openstack/python-openstackclient/blob/master/setup.cfg), and placement is 1st example 03:17:23 Includes compute, identity, etc. 03:17:52 https://github.com/openstack/osc-placement 03:17:52 Cool, Thx Sundar, yikun. 03:18:13 xinranwang: The cyborg client for v1 API is neither: it uses osc-like syntax but is not a plugin 03:18:34 Can we complete it by 1st way first, and we our client is stable, and we switch it to 2nd way? 03:18:48 ikuo_o: IIUC, you are also voting for plugin? or are you still considering? 03:18:51 Can we complete it by 1st way first, and when our client is stable, and we switch it to 2nd way? 03:19:15 Sundar: I vote plugin. 03:19:19 yikun: What would motivate us to switch like that? 03:20:03 We need +2+a right at first, because we perhaps frequently at first version. 03:20:04 Is there any necessity to switch to 2nd way? 03:20:33 xinranwang: I think the only reason is sundar mentioned, the osc team want 2nd way. - -# 03:21:01 OK. Shall we say it is unanimously agreed that we will go with the plugin option (at least for now)? 03:21:21 yikun: lol 03:21:26 ok 03:21:42 Sundar: I am fine with 1st one 03:21:45 yes, plugin sounds good 03:22:15 #agreed Cyborg client v2 shall be written as a osc plugin. It was previously decided that it will use the openstack SDK. 03:22:25 Thanks, Biwei and all 03:22:45 +1, and I think the placement repo would be a good example for us, https://github.com/openstack/osc-placement 03:22:53 #topic Cyborg API fixtures in Nova 03:23:08 Please see Line 409 in https://review.opendev.org/#/c/603955/13/specs/train/approved/nova-cyborg-interaction.rst,unified 03:23:32 #link https://review.opendev.org/#/c/603955/13/specs/train/approved/nova-cyborg-interaction.rst@409 03:23:46 Many Nova developers have asked for a functional test in Nova which will call fake CYborg API. This is not the same as a fake CYborg driver, which we discussed before 03:24:35 IIUC, it is enough to do tempest CI with real hardware (no fake driver needed). That is already driven by Xinran and Biwei 03:25:18 But it seems we still need to an upstream CI, not 3rd party CI, which can be a Cyborg API fixture 03:25:34 First, does anybody have any questions on that? 03:26:06 what do you mean by upstream CI? 03:26:25 and 3rd party CI 03:26:55 I want to know those too. 03:27:30 3rd party CI involves equipment from a company outside of OpenStack open infra (Zuul). Upstream CI means that it is all in ZUul, maintained by the foundation, not another company 03:27:50 For example, Intel may set up a FPGA-based CI and integrate that with Zuul: that's 3rd party CI 03:28:17 If we have a CI running in a VM with devstack, that would be upstream 03:28:24 Good? 03:29:10 I guess the comments from matt was talking about is add cyborg api fixture in nova test code? 03:29:25 Like something, the placement done: https://github.com/openstack/nova/blob/master/nova/tests/functional/fixtures.py#L79-L90 03:29:30 yikun: Yes 03:29:59 I think Nova developers are coming from the historical experience that 3rd party hardware/CI sometimes beaks, and may take time to fix. 03:30:06 *breaks 03:30:38 A non-hardware-dependent CI is easier to maintain by the community and more reliable, even if it is not an end-to-end test 03:30:51 ok I see 03:30:55 I understand what 3rd parth CI do, but I am confused about what upstream CI do exactly 03:31:24 is it used for let nova call fake Cyborg API? 03:31:37 xinranwang: They are functional or tempest tests that do not rely on specialize d hardware 03:32:07 Yes, when Nova patches are submitted, they will automatically run tests that will simulate caling Cyborg to get device profiles and ARQs 03:32:43 The functional test fixture will return some values without actually calling Cyborg 03:33:07 You could think of it as mocking CYborg API 03:33:12 Ok, so no more fake driver required in upstream CI, just simulate API layer and return right data? 03:33:19 Yes 03:33:27 Sundar: do you mean we should make upstream CI instead the CI by Xinran and Biwei? 03:33:53 Sundar: got it, thanks 03:34:02 ikuo_o: Xinran/Biwei will work on real hardware i.e. 3rd party CI 03:34:12 that sounds like a different effort from the tempest plugin. Do we need both? Sundar 03:34:26 I think we need both. ;) 03:34:34 Oh got ya 03:34:44 If we do a fake Cyborg driver, we will be doing tempest tests 2 ways, which seems redundant. 03:35:02 I see. 03:35:28 The downside is that the tempest CI will test only FPGAs, not GPUs or anything else 03:35:47 The common Cyborg code will not be exercised by 3rd party tempest CI, as planned now 03:36:08 Sorry -- badly phrased 03:36:26 The Cyborg code for GPUs and other devices will not be tested by 3rd party tempest CI 03:38:27 Does FPGA mean Intel FPGA? 03:38:42 Yes 03:38:59 Depending on your interest, we could push for a fake GPU/generic driver (with tempest) OR fake Cyborg API (functional test) 03:39:07 No need for both IMHO 03:40:04 With tempest, bugs in Cyborg will break Nova tests. 03:40:16 But, of course, we will not have any bugs ;) 03:41:26 Thanks. should we choose the two option (fake GPU/generic driver (with tempest) OR fake Cyborg API (functional test)) now? 03:41:41 tempest test runs in real env, how to use fake driver with tempest 03:42:18 xinranwang: the fake driver will return driver OVOs and implement the same API calls as a regular driver 03:42:39 You mean you cannot put up a VM with a fake driver? 03:43:28 yes, we cannot boot VM with accelerators with fake driver 03:44:08 ikuo_o: We have to implement _some_ tests for the Nova patches to merge. It looks like the 3rd party tempest may take sometime and it is not clear that Nova folks will accept that alone 03:44:57 I got it. 03:45:20 xinranwang: there may be some way to fake that success too? It has been suggested an an option by Nova folks too. E.g. Line 20 of https://etherpad.openstack.org/p/ptg-train-xproj-nova-cyborg 03:46:51 Anyways, may be we can take this up in tomorrow's Zoom meeting, because we have only 15 minutes left for today 03:47:16 Ok 03:47:20 #topic Cyborg specs 03:47:33 Device/driver discovery spec: https://review.opendev.org/#/c/593726 03:48:03 We have already implemented it, though the spec is not merged. Do we need the spec? 03:48:53 any different between specs and impletation? If not I think we didn't need it anymore 03:49:19 yikun +1 03:49:29 +1 03:49:58 The spec was probably obsolete anyway :) So there may be differences. But that may not be relevant anyway. 03:50:46 Do the +1s mean 'we do not need it' or 'if no difference, we don't need it' ? 03:51:28 Thanks Sundar. Then the spec seems unnecessary 03:51:43 we do not need it +! 03:52:00 Good, thanks ikuo_o. I like that answer :) 03:52:21 I already abandoned it. I'll remove it from Nova spec as well 03:52:55 specs/index.rst: https://review.opendev.org/#/c/658498/ I think yikun already did that. SO, abandon this? 03:53:56 https://review.opendev.org/#/c/651061/9/doc/source/index.rst@12 03:53:58 Yes 03:54:17 I fixed it in the other merged patch. 03:54:25 Thanks, yikun. Done. 03:54:48 Please review these specs: https://review.opendev.org/#/c/602978/, https://review.opendev.org/#/c/605237/ 03:55:46 OK, add to my review list 03:55:57 For device profiles, some more changes _may_ be needed in the future for 'nested magic'. See http://eavesdrop.openstack.org/irclogs/%23openstack-placement/%23openstack-placement.2019-06-04.log.html#t2019-06-04T13:17:48 03:56:13 But we can always add a change spec 03:56:40 #topic Cyborg patches 03:57:00 Q: Why not use asserts? They are complementary to exceptions, IMHO 03:57:37 Like I said in the code reviews, other projects use them. And they are useful to check internal program logic. E.g. a function must return a list 03:58:11 If we make exceptions for everything, there will be too many exceptions or, worse, developers will not put the check 03:58:52 yikun said they produce unclear messages 03:59:20 But we can say: assert x == y: "Bad state for x, differ s from y" 03:59:53 "They" means exceptions? 04:00:01 Both the ways of the assert or exception is okay to me, :) But I prefer to use the exception. 04:00:26 ikuo_o: "they" mean asserts 04:01:14 yikun: OK. For e.g., would you put an exception to check that a function returns a list? 04:03:07 Anyways, this is food for thought :) 04:03:37 OK, I think assert and exception has their use case. :) so, I said, both okay to me 04:03:44 If you are all ok, I'll leave the asserts in the code. IMHO, it is a good practice, recommended in software engineering books 04:03:54 yikun, ok. Thanks ;) 04:04:02 Sure. 04:04:11 #topic AoB 04:04:17 There is a Zoom meeting tomorrow 04:04:36 Mostly to wrap up the spec and patch reviews as much as possible 04:04:45 Ok, I will see you tomorrow. 04:05:01 ikuo_o and all, see you all tomorrow :) 04:05:08 ok will join it, and thereare some patches need you guys eyes: 04:05:15 https://review.opendev.org/#/c/658987 04:05:15 ^ the generic driver spec have been approved, and the patch is ready for review. 04:05:15 https://review.opendev.org/#/c/660874/ 04:05:15 ^ also the huawei ascend driver is also ready to merged 04:05:29 Great, will review 04:05:35 any question, you can leave your comments, thanks. :) 04:05:37 Good day, everybody! 04:05:41 #endmeeting