14:00:10 #startmeeting neutron_drivers 14:00:11 Meeting started Fri Jun 14 14:00:10 2019 UTC and is due to finish in 60 minutes. The chair is mlavalle. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:00:12 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:00:14 The meeting name has been set to 'neutron_drivers' 14:00:36 hi 14:00:52 hi 14:03:00 hi 14:03:33 let's wait 1 more minute 14:05:06 ok, let's get going. we need the minimum legally required quorum 14:05:23 #topic RFEs 14:05:57 well, maybe not.... anyways. let's start discussing it: https://bugs.launchpad.net/neutron/+bug/1824856 14:05:58 Launchpad bug 1824856 in neutron "[RFE] Add new state "ERROR" for HA routers" [Wishlist,Confirmed] 14:06:36 hi 14:06:40 sorry late 14:06:40 hey 14:06:45 also sorry 14:07:19 we have one RFE for today: https://bugs.launchpad.net/neutron/+bug/1824856 14:07:20 Launchpad bug 1824856 in neutron "[RFE] Add new state "ERROR" for HA routers" [Wishlist,Confirmed] 14:08:30 so this was originally reported by jlibosva in https://bugzilla.redhat.com/show_bug.cgi?id=1574950 - but I took care of it recently and pushed it upstream 14:08:31 bugzilla.redhat.com bug 1574950 in openstack-neutron "HA router can become master but won't be reported in database" [Medium,New] - Assigned to amuller 14:10:56 This would be visible to the operator in the "status" attribute of the router, right? 14:11:52 yes, as third possible state 14:11:58 sorry, status :) 14:13:19 I think at some point we had something similar in mind, since the constant already exists: https://opendev.org/openstack/neutron-lib/src/branch/master/neutron_lib/constants.py#L402 14:14:16 I wasn't even aware of this constant :) 14:15:25 how do you report error_message to api user? add the field to routers? 14:17:17 mlavalle: it was used by vmware etc iirc 14:17:19 yamamoto: yes, I think we can add some field like "status details" or something like that 14:20:33 will it be machine-readable? or just for humans? 14:21:05 yamamoto: what do You mean exactly? 14:21:30 are You asking about possible information in this new field? 14:22:26 something like sub-error-code, or some random text in natural language 14:22:27 well, it was added to Neutron here: https://review.opendev.org/#/c/399505/ 14:23:14 yamamoto: I was rather thinking about some text in natural language, maybe pass there directly error message from node on which it happend 14:23:30 so e.g. info about "can't access file ..." or something like that 14:24:57 ok i got it 14:25:32 All errors will be recorded, or there always be only one 14:25:33 and the constant was added for ha routers 14:25:54 liuyulong: I don't think we should record all errors from the past 14:26:02 so I was thinking about only last one 14:26:36 yeah, I think the last one is the relevant one 14:26:42 Yes, this will be important, otherwise the database may not happy if we store large information 14:26:44 mlavalle: but now this constant isn't used in neutron at all: http://codesearch.openstack.org/?q=ROUTER_STATUS_ERROR&i=nope&files=&repos= 14:26:57 oh I know 14:27:03 I pointed that out in the RFE 14:27:21 yes, I see now, sorry :) 14:27:49 the way it was used was also related to ha routers 14:28:46 yes, but it was used to show error on db level, during router creation 14:29:07 and now we want to use during router's lifecycle :) 14:29:39 So if the router is back to normal state, then we need to manually or automatically delete the error info? 14:30:03 yeap 14:30:34 I am good with this RFE 14:30:56 liuyulong: I was thinking that it would be done automatically - in same way like now status is changed e.g. from BACKUP to ACTIVE and vice versa 14:31:10 Such as, a router admin state down-up can sometimes fix the router status. 14:31:55 mlavalle: there might not have been a neutron constant. but i remember some plugins were using "ERROR" for router status 14:33:09 yamamoto, l3_ha mode can now enter the ERROR state. 14:33:25 i'm fine with the RFE 14:33:29 yes, in the git history around this constant I see patches related to the decoupling of plugins from the neutron repo 14:34:12 https://github.com/openstack/neutron/blob/master/neutron/db/l3_hamode_db.py#L421 14:34:35 it was probably reintroduced by the patch above and then removed whent we adopted neutron-lib, which used another constant: https://review.opendev.org/#/c/489398/ 14:35:00 mlavalle: oh, so ^^ looks that it's still there as in initial patch You linked earlier but now other constant is used :) 14:35:22 the value is the same: "ERROR" 14:35:55 It should be fixed IMO, : ) 14:36:20 yes, value is the same :) 14:37:01 haleyb: what do you think? 14:37:04 so in such case also we can put some detailed info in this new field :) 14:37:12 +1 from me 14:37:13 slaweq, when or what's the proper time in the l3 agent side to automatically remove the error info? 14:38:34 liuyulong: this error would come usually from neutron-keepalived-state-change so if this would be changed, it should be changed in Neutron's db also 14:41:05 So the original bugzilla bug says, neutron-keepalived-state-change may have some potential problem, so we will also rely on that to update the database? 14:41:53 liuyulong: yes 14:43:15 ERROR etc without prefix were originally for lbaas. i guess we can remove them at this point. 14:44:26 One issue I can image is if neutron-keepalived-state-change does not spawned, error clean or store to DB? 14:45:34 liuyulong: I think that now if it will not spawn, last status shouldn't be cleaned 14:46:08 Maybe I'm over concerned. : )_ 14:46:32 so let's move ahead with this RFE, then 14:46:42 +1 14:46:49 thx 14:47:48 That's it for today guys 14:47:54 thanks for attending 14:48:00 have a great weekend! 14:48:10 #endmeeting