*** jkilpatr has quit IRC | 06:53 | |
*** joseppc has joined #openstack-cyborg | 08:52 | |
*** joseppc has joined #openstack-cyborg | 08:52 | |
*** jkilpatr has joined #openstack-cyborg | 11:02 | |
*** jkilpatr has quit IRC | 11:07 | |
*** jkilpatr has joined #openstack-cyborg | 11:07 | |
*** mikeH has joined #openstack-cyborg | 11:43 | |
*** NokMikeR has joined #openstack-cyborg | 12:43 | |
*** crushil has quit IRC | 13:32 | |
*** crushil has joined #openstack-cyborg | 13:49 | |
*** skelso has joined #openstack-cyborg | 13:58 | |
*** sekelso has joined #openstack-cyborg | 14:00 | |
*** sekelso has quit IRC | 14:02 | |
*** sekelso has joined #openstack-cyborg | 14:03 | |
*** skelso has quit IRC | 14:04 | |
*** zhipeng has joined #openstack-cyborg | 14:07 | |
*** sekelso has quit IRC | 14:10 | |
*** sekelso has joined #openstack-cyborg | 14:10 | |
*** sekelso has quit IRC | 14:33 | |
*** sekelso has joined #openstack-cyborg | 14:34 | |
*** sekelso is now known as skelso | 14:35 | |
*** zhipeng has quit IRC | 14:41 | |
*** mpaolino has joined #openstack-cyborg | 14:47 | |
jkilpatr | anyone alive in here? | 14:53 |
---|---|---|
jkilpatr | oh man I should check on my commits more often. | 14:54 |
NokMikeR | barely | 14:54 |
zhipengh[m] | lol | 14:54 |
*** zhipeng has joined #openstack-cyborg | 14:58 | |
zhipeng | #startmeeting openstack-cyborg | 15:00 |
openstack | Meeting started Wed Mar 29 15:00:18 2017 UTC and is due to finish in 60 minutes. The chair is zhipeng. Information about MeetBot at http://wiki.debian.org/MeetBot. | 15:00 |
openstack | Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. | 15:00 |
openstack | The meeting name has been set to 'openstack_cyborg' | 15:00 |
zhipeng | #topic Roll Call | 15:00 |
crushil | \o | 15:00 |
zhipeng | \o | 15:01 |
jkilpatr | o/ | 15:01 |
zhipeng | waiting for more folks if we have | 15:02 |
skelso | o/ | 15:03 |
mpaolino | \o | 15:03 |
zhipeng | okey let's proceed | 15:06 |
zhipeng | #topic BP review | 15:07 |
zhipeng | #link https://review.openstack.org/#/q/project:openstack/cyborg | 15:07 |
zhipeng | i must appologize i was buried with kubecon work this week so haven't got the time for the reviews | 15:07 |
zhipeng | i see _gryf posted a lot of good comments | 15:08 |
zhipeng | as well as rushil and others | 15:08 |
zhipeng | if we have any outstanding issues, please feel free to shout out | 15:08 |
jkilpatr | do we want to consolidate all DB interaction to the api end point (presumably on the controllers) or have the agent on the computes update the database themselves. | 15:11 |
jkilpatr | essentially that's a question of where to handle communication and parallelism. | 15:11 |
zhipeng | if we choose the latter one, does it mean the controller side will be sometimes out of sync with the agents ? | 15:13 |
jkilpatr | it would make it a concern, it would be possible to prevent if we tried. | 15:14 |
jkilpatr | on the other hand if the agent's don't store anything themselves then we can lose accelerator state if anything disrupts the agent. | 15:14 |
zhipeng | how could we prevent the out-of-sync problem ? using heartbeat ? | 15:17 |
crushil | I would prefer the latter option> And I believe heartbeat can be an option | 15:17 |
jkilpatr | just make sure both sides refresh info from the DB when it might have changed. So proper cache invalidation. | 15:18 |
jkilpatr | and probably more DB load | 15:18 |
crushil | But the agent should definitely keep its database updated | 15:18 |
zhipeng | but in real deployment that always hard to manage, I always hear complaints from our product team about the heartbeat | 15:19 |
zhipeng | cache invalidation is prong to go wrong | 15:19 |
zhipeng | is there a way for us to simplify so that we could avoid the common shortcomings | 15:19 |
jkilpatr | so agent <-> rabbit <-> api end point <-> db | 15:20 |
jkilpatr | that way we don't need invalidation because the end point is the only one that interacts with the DB | 15:20 |
crushil | Isn't that overkill though? Going through hoops to update db? | 15:22 |
crushil | Can | 15:22 |
crushil | Can't we have agent maintain a local copy of the db and agent keeps updating it? | 15:23 |
zhipeng | but if say we have 3 compute node that all have FPGA based iNIC on it | 15:23 |
jkilpatr | why does the agent need info about all accelerators? it only needs to manage one machine (there are many agents on many compute nodes, but each one only needs to care about it's own scope) I guess they could have a mini db each but what good is that. | 15:24 |
zhipeng | if the DB is updated only via local copy | 15:24 |
zhipeng | i'm terminating my thoughts here~~ | 15:24 |
crushil | Well, I feel that the agent needs to have a more efficient way of updating the db | 15:25 |
jkilpatr | maybe we just have the agent update the db for things that are longer term? Like a new accelerator? | 15:26 |
zhipeng | how about the config changes | 15:26 |
zhipeng | that would be part of the life cycle mgmt usually | 15:27 |
zhipeng | or we have two seperate DBs (sorta), the main one resides with the control node | 15:28 |
jkilpatr | I guess agents need to read from the db for config updates, we don't want to obligate the operator to ssh into every node to change a config value. | 15:28 |
zhipeng | like jkilpatr it only get updated on major events | 15:28 |
zhipeng | and agent locally manages a cache | 15:28 |
zhipeng | for the more constant changes ? | 15:28 |
crushil | That seems fine, although how would you define major events? | 15:29 |
zhipeng | the info in the locan cache should be of no concern of either other compute nodes or the control node | 15:29 |
zhipeng | like when accelerator attached, created, deleted | 15:30 |
zhipeng | and for small changes | 15:30 |
zhipeng | agent could advertise to the api-endpoint | 15:30 |
zhipeng | as a bulk of its local cache info, if it deemed important by the control node | 15:31 |
zhipeng | so what I'm think is that user could explicitly define events that they care | 15:31 |
zhipeng | in the cyborg.conf for example | 15:31 |
zhipeng | and those events will be aggregated and reported by the agents to the api-end-point | 15:32 |
* _gryf waves late | 15:32 | |
crushil | So, you're suggesting maintaining a central db and a local cached db. The central db would be updated on all events and cached db will be updated only on select events? | 15:32 |
zhipeng | alongside the usual big event | 15:32 |
ttk2[m] | Jkilpatr here had to go mobile. | 15:32 |
zhipeng | central/main DB will get updated on all the major events (attach/detach/delete/create) and smaller events that user specified | 15:33 |
zhipeng | the local copy is just cache for all the local updates, refreshed constantly | 15:33 |
zhipeng | that is my thinking | 15:33 |
_gryf | zhipeng, +1 for the cache instead of full blown db | 15:34 |
ttk2[m] | Sounds like a workable compromise. | 15:34 |
crushil | that sounds ok to me | 15:34 |
zhipeng | great :) then the DB design would remain the same, and there will be an additional design on the local cache for the agent | 15:35 |
zhipeng | is this ttk2[m] or crushil's ? | 15:35 |
ttk2[m] | I'll do the agent cache. | 15:36 |
zhipeng | fantastic :) | 15:37 |
crushil | cool | 15:37 |
zhipeng | #action Justin to update with the agent cache design | 15:37 |
ttk2[m] | Should I put that in the same blueprint? | 15:37 |
zhipeng | i think you could do that | 15:37 |
zhipeng | it should not be a big deal | 15:38 |
zhipeng | okey any other topics ? | 15:40 |
zhipeng | any questions on Roman's BP re the interaction between Nova and Cyborg | 15:40 |
crushil | nothing from me | 15:43 |
zhipeng | well let's keep review the spec anyways :P | 15:44 |
zhipeng | #topic AoB | 15:44 |
zhipeng | any other buisness ? | 15:44 |
ttk2[m] | Do we want to set a blueprint finish goal? | 15:44 |
zhipeng | ttk[m] what do you mean ? for a deadline date ? | 15:44 |
_gryf | ttk2[m], like a deadline or smth? | 15:44 |
ttk2[m] | Yes. Just something to keep things moving. | 15:45 |
_gryf | in other projects that depends on how big is the core reviewers pipe | 15:45 |
zhipeng | i would love to set a deadline lol | 15:46 |
_gryf | in nova there are deadlines, so they can be sure, that core reviewers spend quality time on desired and accepted features | 15:46 |
zhipeng | what we called in China is the lazy cancer kicked in all the time :P | 15:46 |
_gryf | otoh, I've seen projects, (smaller ones), that they obey only openstack release schedule | 15:47 |
zhipeng | I think setting a deadline/milestone would be a good idea | 15:47 |
_gryf | that fine. | 15:47 |
zhipeng | so that everyone on the same page and pace | 15:47 |
zhipeng | others ? | 15:48 |
ttk2[m] | As long as it's not too tight. | 15:48 |
crushil | So, for current specs, should we say EOR should be the deadline? | 15:48 |
zhipeng | (if we have a milestone then maybe we need a sprint someday ...) | 15:48 |
zhipeng | crushil for specs that would be too relax | 15:48 |
zhipeng | we need to get code in for Pike | 15:48 |
crushil | I meant implementation of the specs | 15:48 |
zhipeng | yes that'd agre | 15:49 |
zhipeng | I would agree | 15:49 |
ttk2[m] | Let's just put a deadline on specs for now. | 15:49 |
zhipeng | Apr 15th ? | 15:49 |
ttk2[m] | That's what 3 meetings? | 15:49 |
crushil | lol | 15:49 |
zhipeng | sounds about right | 15:50 |
zhipeng | too relax or too tight ? lol | 15:50 |
crushil | I think for most of the specs, the owners know what needs to be added | 15:50 |
crushil | So, it is slightly on the more relaxed side | 15:50 |
zhipeng | it is the reviews that takes time | 15:51 |
* _gryf seen blueprints which lasts several months, before being accepted ;] | 15:51 | |
crushil | Fair | 15:51 |
zhipeng | _gryf we are not that famous project so we could move faster XD | 15:51 |
_gryf | which obviously means slipping to the next release :> | 15:51 |
crushil | Let's do April 15th then | 15:51 |
ttk2[m] | +1 | 15:51 |
zhipeng | #vote Apr 15th as the deadline for specs | 15:52 |
zhipeng | okey wrong cmd ... | 15:52 |
zhipeng | but anyway we should all agree on this | 15:52 |
zhipeng | #info Apr 15th for the first milestone on spec freeze | 15:52 |
zhipeng | #agreed Apr 15th for the first milestone on spec freeze | 15:53 |
zhipeng | great discussions folks | 15:53 |
zhipeng | any other buisness ? | 15:53 |
zhipengh[m] | My Chromebook just died.. | 15:54 |
crushil | zhipengh[m], Get a real computer. :P | 15:54 |
zhipengh[m] | lol give me money | 15:55 |
*** joseppc has quit IRC | 15:55 | |
*** zhipeng has quit IRC | 15:58 | |
zhipengh[m] | Okey if no other biz, let me try to end meeting using this handle... Not sure it will work | 15:58 |
zhipengh[m] | #endmeeting | 15:58 |
zhipengh[m] | Bummer... | 15:59 |
_gryf | #endmeeting | 15:59 |
_gryf | lol | 15:59 |
*** NokMikeR has quit IRC | 16:01 | |
*** mpaolino has quit IRC | 16:21 | |
*** mikeH has quit IRC | 17:16 | |
*** mikeH has joined #openstack-cyborg | 17:40 | |
*** zhipeng has joined #openstack-cyborg | 17:50 | |
zhipeng | the longest meeting ever... | 17:51 |
zhipeng | #endmeeting | 17:51 |
openstack | Meeting ended Wed Mar 29 17:51:05 2017 UTC. Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4) | 17:51 |
openstack | Minutes: http://eavesdrop.openstack.org/meetings/openstack_cyborg/2017/openstack_cyborg.2017-03-29-15.00.html | 17:51 |
openstack | Minutes (text): http://eavesdrop.openstack.org/meetings/openstack_cyborg/2017/openstack_cyborg.2017-03-29-15.00.txt | 17:51 |
openstack | Log: http://eavesdrop.openstack.org/meetings/openstack_cyborg/2017/openstack_cyborg.2017-03-29-15.00.log.html | 17:51 |
*** zhipeng has quit IRC | 17:55 | |
*** mikeH has quit IRC | 18:08 | |
*** mikeH has joined #openstack-cyborg | 18:10 | |
*** crushil has quit IRC | 18:16 | |
*** crushil has joined #openstack-cyborg | 18:17 | |
*** openstackstatus has joined #openstack-cyborg | 18:43 | |
*** ChanServ sets mode: +v openstackstatus | 18:43 | |
*** crushil has quit IRC | 19:09 | |
*** crushil has joined #openstack-cyborg | 19:26 | |
*** crushil has quit IRC | 19:58 | |
*** joseppc has joined #openstack-cyborg | 20:09 | |
*** joseppc has joined #openstack-cyborg | 20:09 | |
*** mikeH has quit IRC | 20:34 | |
*** mikeH has joined #openstack-cyborg | 21:13 | |
*** skelso has quit IRC | 21:26 | |
*** skelso has joined #openstack-cyborg | 21:26 | |
*** skelso has quit IRC | 22:10 | |
*** mikeH has quit IRC | 22:24 |
Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!