16:00:09 <mlavalle> #startmeeting neutron_performance 16:00:10 <openstack> Meeting started Mon Sep 23 16:00:09 2019 UTC and is due to finish in 60 minutes. The chair is mlavalle. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:00:11 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 16:00:13 <openstack> The meeting name has been set to 'neutron_performance' 16:00:36 <bcafarel> hello! 16:00:43 <rubasov> hi! 16:00:51 <jrbalderrama> hello 16:00:57 <mlavalle> hi there! 16:02:30 <mlavalle> ok, that is probably enough wait 16:02:36 <mlavalle> #topic Updates 16:02:49 <mlavalle> are any updates from the past two weeks? 16:03:35 <rubasov> sorry, I don't have any 16:03:42 <mlavalle> that's cool 16:03:57 <mlavalle> We appreciatte all you have done so far 16:03:58 <bcafarel> only "following someone's poc review" here 16:04:56 <mlavalle> ok, so some of you are aware that over the past few days I continued working with https://review.opendev.org/#/c/678438/ 16:05:45 <mlavalle> before I explain what I did, please see if you can download https://storage.gra1.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_e2f/678438/12/check/neutron-rally-task/e2f821c/controller/logs/screen-q-svc.txt.gz 16:06:09 <mlavalle> it is going to take some time 16:07:35 <mlavalle> here's what I did in the last revision of this patch: 16:08:24 <mlavalle> 1) The profiling becomes configurable with options in the default section of neutron.conf 16:09:23 <mlavalle> I added an option named "enable_code_profiling" 16:10:14 <mlavalle> when that is true, profiling will take place in the case of those methods decorated "profile" 16:10:49 <mlavalle> currently just create_port and update_port in the ml2 plugin 16:11:17 <mlavalle> 2) I enabled the profiling for the rally job 16:12:41 <mlavalle> 3) I add the result of each profiled call to the log file so anybody investigating performance only needs to look at the neutron server log in the rally job 16:13:07 <mlavalle> that is why I asked you to download ^^^^ 16:14:13 <njohnston_> wow, it is taking a while 16:14:44 <jrbalderrama> 624M ! 16:15:06 <mlavalle> 4) At the beginning of each profile report in the log file I also log the corresponding osprofiler id like the one shown here: https://storage.gra1.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_e2f/678438/12/check/neutron-rally-task/e2f821c/results/report.html.gz#/NeutronNetworks.create_and_bind_ports/output 16:15:27 <bcafarel> get it in a shell, it will leave it compressed (42M gz) 16:16:16 <mlavalle> that way we will be able to correlate the cProfile output with the osprofiler output 16:16:18 <rubasov> and a decent distro will decompress it on the fly if you open the gz with less 16:17:01 <njohnston_> or zcat | less, zgrep is also helpful if you know what you're looking for 16:17:15 <mlavalle> if you have downloaded the file search for "cumtime" 16:17:58 <mlavalle> and you will find the first table of profiling results 16:18:09 <mlavalle> does this all make sense? 16:19:00 <bcafarel> so that's the os-profiler table output? 16:19:27 <mlavalle> no if you find cumtime, you will see the cProfiler resutl 16:19:33 <rubasov> and we'll have one table like that for each call of create_port and update_port? 16:19:43 <mlavalle> correct rubasov 16:20:23 <mlavalle> and at the top of each table, the corresponding osprofiler id should be logged as well 16:20:33 <mlavalle> so we can correlate both views 16:20:38 <mlavalle> makes sense? 16:21:10 <njohnston_> +1 16:22:32 <mlavalle> the problem we had with the osprofiler data is that it was difficult to correlate with specific sections of the code 16:23:18 <rubasov> but now we can add the decorator wherever we want it, cool 16:23:39 <mlavalle> with this approach now we can see the osprofiler trace and relate it to the cProfile output and see sections of the code are doing what, who is calling who and who is being called by who 16:25:26 <mlavalle> rubasov: yes, I could remove the decorators in the patch from the ml2 plugin and merge the patch in its current form 16:25:55 <mlavalle> if we do that, the prospective performance investigator would follow these steps: 16:26:15 <mlavalle> 1) decide what portion of the code she / he wants to analyze 16:26:53 <mlavalle> 2) create a DNM patch adding the decorator to those methdos relevant to the investigation 16:27:17 <mlavalle> 3) See the results in the neutron server log of the rally job 16:28:03 <mlavalle> 4) correlate with the osprofiler output. I am looging both the osprofiler parent id and the trace id, so we all can go at whatever level of granularity we want 16:28:36 <mlavalle> I will even add a section to the devref to explain the process 16:28:39 <mlavalle> makes sense? 16:29:06 <bcafarel> I was about to say, noting these steps in dev doc would be great :) 16:31:02 <njohnston_> I have places where I could make use of this right now 16:31:08 <njohnston_> very awesome work 16:31:22 <rubasov> makes perfect sense 16:31:45 <mlavalle> yeah I know you are working on performance, njohnston_ . I've been thinking of you all along 16:32:01 <mlavalle> there is another approach though.... 16:32:25 <mlavalle> not exclusive with the previous one. In fact, they can be complementary 16:33:25 <mlavalle> here it goes: 16:33:45 <mlavalle> 1) I can decorate methods like this https://github.com/openstack/neutron/blob/master/neutron/api/v2/base.py#L372 16:34:08 <mlavalle> in the base API controller. I can decorate index, show, create, update and delete 16:34:44 <mlavalle> that way we will have data and all the API calls that we exercise in the rally job for each patch 16:35:10 <mlavalle> 2) I can also decorate the loop methos in the agents 16:35:22 <mlavalle> which are also generating osprofiler traces 16:36:00 <mlavalle> and if, during a performance investigation, we need more granularity, we can go with the previous approach 16:36:12 <mlavalle> makes sense? 16:40:41 <njohnston_> yep, makes sense! 16:40:59 <mlavalle> ok, here's the plan for the next two weeks 16:41:36 <mlavalle> 1) I will experiment decorating the base controller. If infra can take it, that is the way we will go 16:41:52 <mlavalle> 2) I will decorate the agent loop methods 16:42:12 <mlavalle> 3) I will add a section to devref 16:43:47 <mlavalle> 4) after merging this code (after the freeze is over) I will start analyzing data for the create / bind port combo. My goal is to have data and findings to discuss in Shanghai 16:44:17 <mlavalle> and hoepfully you can leverage this by then, njohnston_, so we can also discuss your findings 16:44:32 <njohnston_> yes definitely 16:45:15 <mlavalle> rubasov: with the work you are doing, we will be able to correlate at the agent level as well 16:45:41 <rubasov> cool 16:45:54 <mlavalle> ok, that is the end of my update 16:46:03 <rubasov> will find some time to get the code in better shape 16:46:09 <mlavalle> anything else we should discuss today? 16:49:20 <mlavalle> ok, thanks for attending 16:49:25 <mlavalle> have a great week 16:49:30 <bcafarel> same! 16:49:45 <rubasov> you too 16:50:49 <mlavalle> #endmeeting