*** k_mouza has joined #openstack-nova | 00:00 | |
*** k_mouza has quit IRC | 00:05 | |
openstackgerrit | Merged openstack/nova stable/train: api: Set min, maxItems for server_group.policies field https://review.opendev.org/752707 | 00:05 |
---|---|---|
openstackgerrit | Merged openstack/nova stable/rocky: post live migration: don't call Neutron needlessly https://review.opendev.org/750673 | 00:06 |
openstackgerrit | Merged openstack/nova stable/queens: [placement] Add status and links fields to version document at / https://review.opendev.org/751239 | 00:06 |
*** macz_ has joined #openstack-nova | 00:15 | |
*** JamesBenson has joined #openstack-nova | 00:17 | |
*** macz_ has quit IRC | 00:20 | |
*** JamesBenson has quit IRC | 00:22 | |
*** artom has quit IRC | 00:28 | |
*** JamesBenson has joined #openstack-nova | 00:35 | |
*** hamalq has quit IRC | 00:43 | |
*** gyee has quit IRC | 01:04 | |
*** macz_ has joined #openstack-nova | 01:09 | |
*** Liang__ has joined #openstack-nova | 01:14 | |
*** macz_ has quit IRC | 01:14 | |
openstackgerrit | Merged openstack/nova stable/stein: Set different VirtualDevice.key https://review.opendev.org/749415 | 01:29 |
*** spatel has joined #openstack-nova | 01:44 | |
*** jangutter_ has joined #openstack-nova | 01:55 | |
*** jangutter_ has quit IRC | 01:55 | |
*** jangutter has quit IRC | 01:55 | |
*** jangutter has joined #openstack-nova | 01:56 | |
*** rcernin has quit IRC | 02:26 | |
*** Yumeng has joined #openstack-nova | 02:45 | |
*** rcernin has joined #openstack-nova | 02:49 | |
*** brinzhang has joined #openstack-nova | 03:00 | |
*** zzzeek has quit IRC | 03:10 | |
*** ircuser-1 has joined #openstack-nova | 03:11 | |
*** zzzeek has joined #openstack-nova | 03:13 | |
*** lemko has joined #openstack-nova | 04:01 | |
*** dave-mccowan has quit IRC | 04:03 | |
*** manoj_kumar_kata has joined #openstack-nova | 04:04 | |
*** lemko4 has quit IRC | 04:04 | |
*** jangutter_ has joined #openstack-nova | 04:08 | |
*** jangutter_ has quit IRC | 04:08 | |
*** jangutter has quit IRC | 04:08 | |
*** jangutter has joined #openstack-nova | 04:09 | |
*** manoj_kumar_kata has quit IRC | 04:10 | |
*** ratailor has joined #openstack-nova | 04:10 | |
openstackgerrit | Hemanth Nakkina proposed openstack/nova master: Update pci stat pools based on PCI device changes https://review.opendev.org/749175 | 04:15 |
*** jangutter_ has joined #openstack-nova | 04:29 | |
*** jangutter_ has quit IRC | 04:30 | |
*** jangutter has quit IRC | 04:31 | |
*** jangutter_ has joined #openstack-nova | 04:31 | |
*** evrardjp has quit IRC | 04:33 | |
*** evrardjp has joined #openstack-nova | 04:33 | |
*** spatel has quit IRC | 04:38 | |
*** manoj_kumar_kata has joined #openstack-nova | 04:39 | |
*** efried1 has joined #openstack-nova | 04:50 | |
*** eharney has quit IRC | 04:51 | |
*** efried has quit IRC | 04:52 | |
*** efried1 is now known as efried | 04:52 | |
*** eharney has joined #openstack-nova | 05:04 | |
*** tetsuro has joined #openstack-nova | 05:21 | |
*** tetsuro has quit IRC | 05:25 | |
*** zzzeek has quit IRC | 05:30 | |
*** zzzeek has joined #openstack-nova | 05:32 | |
*** manoj_kumar_kata has quit IRC | 05:35 | |
*** manoj_kumar_kata has joined #openstack-nova | 05:41 | |
*** tetsuro has joined #openstack-nova | 05:44 | |
*** tetsuro has quit IRC | 05:46 | |
*** manoj_kumar_kata has quit IRC | 06:03 | |
*** manoj_kumar_kata has joined #openstack-nova | 06:04 | |
*** Liang__ has quit IRC | 06:05 | |
*** Liang__ has joined #openstack-nova | 06:08 | |
*** vishalmanchanda has joined #openstack-nova | 06:09 | |
*** Yumeng has quit IRC | 06:20 | |
*** tetsuro has joined #openstack-nova | 06:24 | |
*** manoj_kumar_kata has quit IRC | 06:32 | |
*** manoj_kumar_kata has joined #openstack-nova | 06:32 | |
*** slaweq has joined #openstack-nova | 06:35 | |
*** manoj_kumar_kata has quit IRC | 06:37 | |
*** manoj_kumar_kata has joined #openstack-nova | 06:37 | |
*** tetsuro has quit IRC | 06:46 | |
*** tetsuro has joined #openstack-nova | 06:50 | |
*** ralonsoh has joined #openstack-nova | 06:53 | |
*** tetsuro has quit IRC | 06:55 | |
*** tetsuro has joined #openstack-nova | 06:57 | |
*** andrewbonney has joined #openstack-nova | 07:01 | |
*** tetsuro has quit IRC | 07:03 | |
*** tetsuro has joined #openstack-nova | 07:03 | |
openstackgerrit | Mamduh proposed openstack/os-vif master: [WIP] get_ifname_by_pci_address compare phys_port_id https://review.opendev.org/758352 | 07:04 |
*** manoj_kumar_kata has quit IRC | 07:08 | |
*** freenzyfriday has joined #openstack-nova | 07:13 | |
*** tesseract has joined #openstack-nova | 07:20 | |
*** rpittau|afk is now known as rpittau | 07:22 | |
*** tetsuro has quit IRC | 07:32 | |
*** rcernin has quit IRC | 07:35 | |
*** rcernin_ has joined #openstack-nova | 07:35 | |
*** tetsuro has joined #openstack-nova | 07:37 | |
openstackgerrit | Hemanth Nakkina proposed openstack/nova master: Update pci stat pools based on PCI device changes https://review.opendev.org/749175 | 07:41 |
*** dtantsur|afk is now known as dtantsur | 07:41 | |
*** rcernin_ has quit IRC | 07:42 | |
*** tosky has joined #openstack-nova | 07:42 | |
*** tetsuro has quit IRC | 07:46 | |
*** tetsuro has joined #openstack-nova | 07:47 | |
*** tetsuro has quit IRC | 07:50 | |
*** Liang__ has quit IRC | 08:12 | |
*** CeeMac has joined #openstack-nova | 08:12 | |
*** Liang__ has joined #openstack-nova | 08:13 | |
*** ociuhandu has joined #openstack-nova | 08:16 | |
*** tetsuro has joined #openstack-nova | 08:17 | |
*** freenzyfriday has quit IRC | 08:22 | |
*** derekh has joined #openstack-nova | 08:39 | |
*** k_mouza has joined #openstack-nova | 08:49 | |
*** k_mouza_ has joined #openstack-nova | 08:52 | |
*** k_mouza has quit IRC | 08:55 | |
*** tkajinam is now known as tkajinam|away | 09:17 | |
*** tkajinam|away is now known as tkajinam | 09:17 | |
*** jangutter has joined #openstack-nova | 09:23 | |
bauzas | good late morning Nova | 09:24 |
*** zzzeek has quit IRC | 09:25 | |
*** tetsuro has quit IRC | 09:26 | |
*** jangutter_ has quit IRC | 09:26 | |
*** zzzeek has joined #openstack-nova | 09:29 | |
*** rcernin_ has joined #openstack-nova | 09:32 | |
*** tetsuro has joined #openstack-nova | 09:36 | |
*** rcernin_ has quit IRC | 09:36 | |
gibi | o/ | 09:37 |
*** tetsuro has quit IRC | 09:40 | |
lyarwood | Morning morning | 09:55 |
*** rcernin_ has joined #openstack-nova | 09:56 | |
stephenfin | gibi: Heads up that I've added specless BP approval for https://blueprints.launchpad.net/nova/+spec/compact-havana-db-migrations to the agenda for today's meeting. I won't be around though so I can move to next week if it proves to be any way contentious | 10:11 |
* stephenfin lol'd when he read the summary for https://blueprints.launchpad.net/nova/+spec/compact-havana-db-migrations from the *Icehouse* release | 10:11 | |
stephenfin | "...and make it easier to move to Alembic in the future." /o\ | 10:11 |
stephenfin | that went super well, evidently | 10:12 |
tosky | stephenfin: is that related to the usage of sqlalchemy-migrate? | 10:15 |
stephenfin | tosky: Dumping the use of it, yes | 10:15 |
tosky | so sqlalchemy-migrate is still a dependency, but it's kind of abandoned (x namespace), I've tried to fix the zuul jobs but of course most of the people marked as maintainers are not around | 10:16 |
stephenfin | tosky: It's dead. Alembic is apparently a far better designed tool that we should have switched to ages ago but, you know, ENOTIME | 10:24 |
stephenfin | but our number of DB migrations has plummeted in recent releases so no better time, I'm thinking. We shall see | 10:25 |
*** rcernin_ has quit IRC | 10:25 | |
openstackgerrit | Merged openstack/nova stable/victoria: Follow up for cherry-pick check for merge patch https://review.opendev.org/757613 | 10:31 |
*** rcernin_ has joined #openstack-nova | 10:42 | |
gibi | stephenfin: ack. I don't expect too much pushback on that | 10:47 |
openstackgerrit | Lee Yarwood proposed openstack/nova master: Add regression test for bug #1899835 https://review.opendev.org/758239 | 10:50 |
openstack | bug 1899835 in OpenStack Compute (nova) "n-cpu attempts to disconnect volumes after early pre_live_migration failures on the destination during a live migration" [Undecided,New] https://launchpad.net/bugs/1899835 - Assigned to Lee Yarwood (lyarwood) | 10:50 |
lyarwood | stephenfin: ^ another pre_live_migration functional test if you have time today | 10:50 |
*** tbachman has quit IRC | 11:02 | |
*** rcernin_ has quit IRC | 11:06 | |
*** Liang__ has quit IRC | 11:07 | |
*** lemko has quit IRC | 11:07 | |
tosky | stephenfin: I know it's dead, but as long as projects still depend on it (because ENOTIME), it's not really dead :) | 11:09 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: Compact pre-Juno database migrations https://review.opendev.org/758394 | 11:17 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: TODO: Compact pre-Kilo database migration https://review.opendev.org/758395 | 11:17 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: Compact pre-Liberty database migrations https://review.opendev.org/758396 | 11:17 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: TODO: Compact pre-Mitaka database migrations https://review.opendev.org/758397 | 11:17 |
openstackgerrit | Stephen Finucane proposed openstack/nova master: WIP: Compact pre-Newton database migrations https://review.opendev.org/758398 | 11:17 |
stephenfin | gibi: Think you could take a spin through https://review.opendev.org/#/q/status:open+project:openstack/nova+branch:master+topic:remove-xenapi maybe early next week too? I assume we can merge now that V is out the door | 11:20 |
stephenfin | lyarwood: +2 to that and its predecessor | 11:22 |
stephenfin | <3 functional tests | 11:22 |
stephenfin | lyarwood: Not nova, but could you buzz through https://review.opendev.org/#/c/755887/ and https://review.opendev.org/#/c/755888/ | 11:22 |
gibi | stephenfin: thanks for the reminder, will do | 11:23 |
*** lemko has joined #openstack-nova | 11:23 | |
stephenfin | Does anyone know where the doc that mriedem wrote on novaclient-OSC gaps has gone? Think I'm going to spend some time this cycle closing that out too | 11:24 |
gibi | it is an etherpad... looking.. | 11:25 |
gibi | https://etherpad.opendev.org/p/compute-api-microversion-gap-in-osc | 11:25 |
*** elod is now known as elod_afk | 11:26 | |
stephenfin | gibi++ thanks! | 11:28 |
gibi | stephenfin: I also thought about the relationship of o.vo and mypy. o.vo handles the fields by generating properties for them at runtime, so the static checker is clueless. But I think we might be able to generate a .pyi interface file for o.vos that has the fields with static types to help mypy | 11:31 |
*** zzzeek has quit IRC | 11:33 | |
*** zzzeek has joined #openstack-nova | 11:34 | |
*** ratailor has quit IRC | 11:37 | |
*** belmoreira has joined #openstack-nova | 11:38 | |
*** k_mouza_ has quit IRC | 11:40 | |
*** k_mouza has joined #openstack-nova | 11:46 | |
sean-k-mooney | that makes sense, although we can probably also extend mypy to parse ovo, but the pyi file is probably simpler | 11:47 |
*** bbowen has quit IRC | 11:48 | |
*** bbowen has joined #openstack-nova | 11:48 | |
gibi | sean-k-mooney: yeah a plugin to mypy is also a possibility but as far as I see the plugin interface of mypy is unstable | 11:49 |
sean-k-mooney | i have not really looked at it | 11:50 |
gibi | https://mypy.readthedocs.io/en/stable/extending_mypy.html#extending-mypy-using-plugins "The plugin system is experimental and prone to change. I" | 11:51 |
*** rcernin_ has joined #openstack-nova | 11:59 | |
sean-k-mooney | we could likely write a small script that would loop over the nova/objects dir and produce the pyi files and run that in the tox env | 12:04 |
sean-k-mooney | we could optionally commit them or just have them generated on the fly | 12:05 |
*** raildo has joined #openstack-nova | 12:06 | |
*** lbragstad has quit IRC | 12:07 | |
gibi | sean-k-mooney: yeah, that is the direction of my thinking | 12:14 |
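The stub-generation idea discussed above could look roughly like the following. This is only a minimal sketch under stated assumptions: `FakeField`, `FakeMigration`, and the `FIELD_TYPE_TO_ANNOTATION` mapping are hypothetical stand-ins invented for illustration, not the real oslo.versionedobjects API, and a real script would walk nova/objects and inspect the actual field classes.

```python
"""Sketch: emit .pyi stub text for objects that declare fields in a dict."""

# Hypothetical mapping from field kind to a static annotation. A real script
# would derive this from the actual o.vo field classes.
FIELD_TYPE_TO_ANNOTATION = {
    "IntegerField": "int",
    "StringField": "str",
    "BooleanField": "bool",
    "UUIDField": "str",
}


class FakeField:
    """Minimal placeholder for a field object (not the real o.vo API)."""

    def __init__(self, kind, nullable=False):
        self.kind = kind
        self.nullable = nullable


class FakeMigration:
    """Placeholder object: fields live in a dict, so mypy cannot see them."""

    fields = {
        "id": FakeField("IntegerField"),
        "uuid": FakeField("UUIDField"),
        "dest_host": FakeField("StringField", nullable=True),
        "hidden": FakeField("BooleanField"),
    }


def render_stub(cls):
    """Return .pyi source declaring one typed attribute per declared field."""
    lines = ["import typing", "", "", f"class {cls.__name__}:"]
    for name, field in sorted(cls.fields.items()):
        # Unknown field kinds fall back to typing.Any rather than guessing.
        annotation = FIELD_TYPE_TO_ANNOTATION.get(field.kind, "typing.Any")
        if field.nullable:
            annotation = f"typing.Optional[{annotation}]"
        lines.append(f"    {name}: {annotation}")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_stub(FakeMigration))
```

Whether the generated files are committed or produced on the fly in the tox env is the open question from the discussion; the sketch only covers rendering the stub text.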
lyarwood | stephenfin: FWIW I wanted to sort the evacuate command out in osc | 12:16 |
lyarwood | TIL sean-k-mooney had a change for this | 12:16 |
lyarwood | added to the PTG pad | 12:19 |
sean-k-mooney | lyarwood: yep it's more or less done but we want to rename it back to evacuate | 12:28 |
sean-k-mooney | it's been on my todo list for a while if you feel like taking it over | 12:28 |
sean-k-mooney | if not i can try and do it before the ptg | 12:28 |
lyarwood | sean-k-mooney: so was that outcome, that we wanted to stay with evacuate for now? | 12:29 |
lyarwood | sean-k-mooney: if so I can rework it now | 12:29 |
sean-k-mooney | lyarwood: thats what dansmith wanted | 12:30 |
sean-k-mooney | also since i want to add recreate as a new action it's probably better not to rename | 12:30 |
brinzhang | gibi, sean-k-mooney: I encounter a question, I want to backport https://review.opendev.org/#/q/status:merged+project:openstack/nova+branch:master+topic:bp/numa-aware-live-migration to Rocky release | 12:30 |
brinzhang | gibi, sean-k-mooney: Because there is a big gap between its minor versions, I only merged the changes below the api, including the RPC Version changes, but changed rpc_version=5.3 to rpc_version=5.1 to adapt to the version number of Rocky release. | 12:30 |
sean-k-mooney | lyarwood: the main concern was people thinking it was a new thing if we called it resurrect | 12:31 |
sean-k-mooney | brinzhang: that's a feature, not a bug | 12:31 |
sean-k-mooney | brinzhang: so you can't backport it | 12:31 |
lyarwood | sean-k-mooney: ack yup np, I'll rework it now in that case | 12:31 |
brinzhang | gibi: but when I request a live migration, rpcversion=5.1 is not accepted as valid | 12:31 |
sean-k-mooney | brinzhang: rpc backports can't really be done | 12:31 |
brinzhang | sean-k-mooney: I know, but if I upgrade the rocky release to ussuri, that will have more impact than a backport, so I just want to backport it | 12:33 |
brinzhang | but encountered the rpcversion invalid issue | 12:33 |
sean-k-mooney | brinzhang yes | 12:33 |
sean-k-mooney | you can't backport rpc changes because it will break upgrades and rolling deployments | 12:33 |
sean-k-mooney | we have tried downstream and it really did not go well | 12:34 |
*** artom has joined #openstack-nova | 12:34 | |
brinzhang | does it need to change the oslo.messaging to adopt 5.1? | 12:34 |
brinzhang | I got +1 on the basis of the rpc version of the Rocky version, which is not a complete backport | 12:35 |
sean-k-mooney | +1 from who? | 12:35 |
sean-k-mooney | we should not attempt to backport this upstream | 12:36 |
brinzhang | +1 from rocky rpc version 5.0-->5.1, and change the target_compute version from 35 to 36 | 12:36 |
brinzhang | I looked into the service db table, the compute node version can be 36, but when we restart the compute service, the rpc version is still 5.0 | 12:37 |
brinzhang | sean-k-mooney: Backport is a last resort | 12:38 |
brinzhang | base on https://github.com/openstack/nova/blob/stable/rocky/nova/compute/rpcapi.py#L351, +1 for the rpcversion | 12:39 |
*** elod_afk is now known as elod | 12:39 | |
brinzhang | refer this patch: https://review.opendev.org/#/c/634606/86 | 12:40 |
*** macz_ has joined #openstack-nova | 12:40 | |
*** nweinber has joined #openstack-nova | 12:41 | |
gibi | brinzhang: I've never tried backporting an RPC change. | 12:44 |
sean-k-mooney | gibi: stephenfin has | 12:45 |
gibi | I did assist with backporting API changes downstream, there we simply put the new API into the latest microversion of that branch. It is ugly but it works | 12:45 |
*** macz_ has quit IRC | 12:45 | |
sean-k-mooney | that is, ya, less than good if you have to ship it to others | 12:46 |
brinzhang | gibi: I want to know what key points I have overlooked. This method of incrementing the version number based on the rocky version is the same as adding a feature to the master, but why doesn't it take effect? | 12:46 |
sean-k-mooney | our rule is if it has an object, db, api or rpc change it's not backported | 12:46 |
brinzhang | the rules are clear :D | 12:47 |
sean-k-mooney | am have you pinned your rpc version in the config | 12:48 |
gibi | sean-k-mooney: yeah, it was a compromise where the necessary risk and future cost was accepted (and I don't have to touch it so I'm fine with it) | 12:48 |
brinzhang | in theory my approach should work, but it does not | 12:48 |
sean-k-mooney | gibi: we have considered doing the same for api changes but decided not to for feature discovery | 12:49 |
gibi | brinzhang: your problem might be related to the RPC version alias on the given branch (simply guessing here) | 12:49 |
sean-k-mooney | e.g. we did not want to create interoperability issues between clouds | 12:50 |
sean-k-mooney | gibi: yep that is what i was wondering about for the config | 12:50 |
sean-k-mooney | more or less | 12:50 |
sean-k-mooney | e.g. is it pinned to a version and or an alias | 12:50 |
brinzhang | gibi: alias? what does this mean? | 12:51 |
brinzhang | RPC version alias? | 12:52 |
gibi | brinzhang: https://docs.openstack.org/nova/latest/user/upgrade.html#concepts | 12:52 |
sean-k-mooney | https://github.com/openstack/nova/blob/stable/rocky/nova/compute/rpcapi.py#L354-L364 | 12:53 |
*** lbragstad has joined #openstack-nova | 12:53 | |
brinzhang | sean-k-mooney: we didn't change the VERSION_ALIASES | 12:54 |
brinzhang | it is as the same as your paste | 12:54 |
sean-k-mooney | yes but what do you have your rpc version set to in the config | 12:54 |
sean-k-mooney | blank, auto, an alias or a value | 12:55 |
brinzhang | the default, because we dont config the rpcversion in conf file | 12:55 |
sean-k-mooney | so i think that will default to the newest version supported by the current node | 12:56 |
sean-k-mooney | meaning if you have not applied the change to all services some will be 5.0 and others will be 5.1 | 12:56 |
sean-k-mooney | auto i think figures out the max version supported by all nodes | 12:57 |
sean-k-mooney | for the numa migration at least both compute nodes and the conductor would need 5.1 | 12:57 |
brinzhang | sean-k-mooney: yeah, I can try your suggestion, I changed the code on two nodes in my 3 node cluster | 12:58 |
brinzhang | but stopped the nova-* services on the other node | 12:58 |
sean-k-mooney | i think we store the version in the db somewhere | 12:59 |
sean-k-mooney | so stopping it might not be enough. dansmith is who i normally ask about this because i generally try to avoid it | 12:59 |
brinzhang | I just find the compute version in service db table | 12:59 |
sean-k-mooney | this is still the video i go to to understand this stuff https://www.youtube.com/watch?v=wVXiw09XP04 i wonder how much of that still applies | 13:00 |
brinzhang | https://github.com/openstack/nova/blob/stable/rocky/nova/conf/rpc.py this conf file does not seem to contain the auto or default value you mentioned | 13:01 |
brinzhang | yeah, I will watch the video tomorrow, need to jump the firewall | 13:02 |
sean-k-mooney | auto is not the default | 13:02 |
brinzhang | if I want to use auto, how to config? | 13:02 |
sean-k-mooney | https://github.com/openstack/nova/blob/stable/rocky/nova/conf/upgrade_levels.py | 13:03 |
sean-k-mooney | but also that is the file you want | 13:03 |
sean-k-mooney | the default is unset which means use the latest version this code knows | 13:03 |
sean-k-mooney | https://github.com/openstack/nova/blob/stable/rocky/nova/conf/upgrade_levels.py#L55-L61 | 13:03 |
sean-k-mooney | [upgrade_levels]/compute=auto | 13:04 |
brinzhang | got it, thanks | 13:04 |
sean-k-mooney | in this case you want 5.1 | 13:04 |
sean-k-mooney | so you can just set that | 13:05 |
sean-k-mooney | you also need to do it for the conductor | 13:05 |
sean-k-mooney | https://github.com/openstack/nova/blob/stable/rocky/nova/conf/upgrade_levels.py#L117 | 13:05 |
brinzhang | yeah, I will try, two ways: 1. set [upgrade_levels]/compute=auto on the nova compute node, and then do the live migration, | 13:05 |
sean-k-mooney | you need to set it for both compute and conductor | 13:06 |
brinzhang | 2. replace the nova code on all nodes, and then try again | 13:06 |
brinzhang | ack | 13:06 |
sean-k-mooney | i assume you have also backported all the ovo changes | 13:06 |
sean-k-mooney | e.g. the change to the migrate_data object to store the numa topology | 13:07 |
brinzhang | yes, I backported all of the bp's patches | 13:08 |
sean-k-mooney | there have been some bugfixes too but it more or less should be fine | 13:08 |
sean-k-mooney | brinzhang: you are aware you could manually live migrate numa instances before this too, right | 13:09 |
sean-k-mooney | we just don't recalculate the xml | 13:09 |
sean-k-mooney | so if you have 1 free compute node you can live migrate without these backports to do a rolling upgrade | 13:09 |
sean-k-mooney | you just need to ensure the cpus and hugepages etc. are free on the dest | 13:10 |
brinzhang | yes | 13:10 |
brinzhang | I will test with at least two nodes, because I need to make sure this feature works; otherwise it carries some risk, which is not allowed | 13:11 |
brinzhang | Later I will search for related bug fixes and backport | 13:12 |
sean-k-mooney | have you also backported sriov live migration i assume not | 13:13 |
brinzhang | sriov live migration we implemented ourselves, so we don't need to backport it ^^ | 13:13 |
brinzhang | If we upgrade in the future, we will use the merged code too | 13:14 |
sean-k-mooney | the reason i bring that up is numa migration merged after sriov and at least 1 of the patches in the numa series is adding compatibility with sriov live migration | 13:14 |
sean-k-mooney | you're going from rocky to train right | 13:15 |
brinzhang | yes, it is included, I see your point | 13:15 |
sean-k-mooney | train is the version that had both so once you're on train you should be fine | 13:15 |
brinzhang | maybe from rocky to ussuri | 13:15 |
brinzhang | because we also want to use the pmem feature too ^ | 13:16 |
brinzhang | sean-k-mooney: thanks for your help | 13:17 |
brinzhang | I will go home now, have a holiday sean-k-mooney, gibi :D | 13:17 |
gibi | brinzhang: o/ | 13:17 |
dansmith | sean-k-mooney: compute=auto will try to select the version supported by the oldest compute service in the system, based on service_version | 13:41 |
sean-k-mooney | dansmith:yes | 13:41 |
sean-k-mooney | the max version supported by the oldest | 13:41 |
sean-k-mooney | which is why i was saying to use 5.1 | 13:42 |
sean-k-mooney | not auto | 13:42 |
sean-k-mooney | since brinzhang is not updating all the nodes | 13:42 |
dansmith | ah, you said [05:56:27] <sean-k-mooney>so i think that will default to the newest version supported by the current node | 13:42 |
dansmith | but you meant that for if it's unset or something? | 13:42 |
sean-k-mooney | yes unset | 13:42 |
dansmith | okay I was skimming the backscroll and thought you were saying that was the auto behavior, | 13:43 |
dansmith | but I see now | 13:43 |
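The three behaviours clarified above (unset, auto, explicit pin) can be modelled as a small function. This is a simplified sketch, not nova's implementation: `select_compute_rpc_version` and its parameters are hypothetical names, and the real `auto` logic works from the service_version recorded for each compute service rather than from a list of RPC version strings.

```python
def parse_version(text):
    """Turn an 'X.Y' RPC version string into a comparable (major, minor) tuple."""
    major, minor = text.split(".")
    return int(major), int(minor)


def select_compute_rpc_version(setting, local_max, service_maxes):
    """Pick the RPC version cap for messages sent to compute services.

    setting       -- None when the option is unset, "auto", or an explicit "X.Y"
    local_max     -- newest version the locally installed code knows about
    service_maxes -- newest version each compute service in the deployment
                     supports (hypothetical input for this sketch)
    """
    if setting is None:
        # Unset: use the latest version this code knows, regardless of peers.
        return local_max
    if setting == "auto":
        # Auto: the highest version that the oldest service still understands.
        return min(service_maxes, key=parse_version)
    # Explicit pin, e.g. "5.1".
    return setting


if __name__ == "__main__":
    deployment = ["5.0", "5.1", "5.1"]  # one node was not patched
    for setting in (None, "auto", "5.1"):
        print(setting, "->", select_compute_rpc_version(setting, "5.1", deployment))
```

With one unpatched node in the deployment, `auto` settles on 5.0, which is why pinning 5.1 explicitly was suggested for the partially updated cluster.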
*** lbragstad has quit IRC | 13:55 | |
bbowen | sean-k-mooney: Do you know the latest on https://trello.com/c/f0lLrWxf/2465-swisscom-schweiz-agsev302710209bz1860395neutronrarp-packets-after-live-migration-are-dropped-by-br-link-due-to-a-timing-issue-be, I thought you were making a change and it was not on neutron team. was a hint in rhos-prio to go back to compute | 13:58 |
bbowen | ^now on neutron | 13:59 |
lyarwood | bbowen: wrong channel :) | 13:59 |
bbowen | ty | 13:59 |
*** vishalmanchanda has quit IRC | 14:01 | |
*** tkajinam has quit IRC | 14:20 | |
*** rcernin_ has quit IRC | 14:27 | |
*** macz_ has joined #openstack-nova | 14:28 | |
*** macz_ has quit IRC | 14:33 | |
artom | stephenfin, I think I'd like a video chat to explain https://review.opendev.org/#/c/756070/6/nova/virt/hardware.py if you have some time today or next week | 14:34 |
artom | Not so much the change itself, more the context around it | 14:34 |
artom | You set `instance_cell.min_threads = threads_no`, and I'm trying to work backwards to where threads_no is coming from, and it's... hard :P | 14:44 |
*** iurygregory has quit IRC | 14:57 | |
*** iurygregory has joined #openstack-nova | 14:57 | |
*** macz_ has joined #openstack-nova | 15:01 | |
*** abhishekk is now known as abhishekk|away | 15:06 | |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Use the non polling notification waiter in func test https://review.opendev.org/758445 | 15:07 |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Create a fixture around fake_notifier https://review.opendev.org/758446 | 15:07 |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Use NotificationFixture for legacy notifications too https://review.opendev.org/758448 | 15:09 |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Test the NotificationFixture https://review.opendev.org/758450 | 15:10 |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Move fake_notifier impl under NotificationFixture https://review.opendev.org/758451 | 15:10 |
gmann | gibi: I have doctor appointment and would not be able to join today meeting. | 15:13 |
gibi | gmann: ack, thanks for the headsup | 15:13 |
*** belmoreira has quit IRC | 15:18 | |
*** mlavalle has joined #openstack-nova | 15:22 | |
*** tbachman has joined #openstack-nova | 15:32 | |
*** dklyle has joined #openstack-nova | 15:36 | |
*** gyee has joined #openstack-nova | 15:39 | |
lyarwood | stephenfin: you've been working on LM bugs recently with the periodic jobs right? | 15:42 |
lyarwood | stephenfin: did you find any issues with _sync_instance_power_state running *after* an instance had already paused? | 15:42 |
artom | lyarwood, I think he's on PTO this EU afternoon - he cancelled a meeting we had with Brian and Erwan | 15:49 |
lyarwood | ah forgot my bad | 15:49 |
lyarwood | thanks artom | 15:49 |
gibi | nova meeting starts in 10 minutes on #openstack-meeting-3 | 15:51 |
*** rpittau is now known as rpittau|afk | 15:52 | |
*** lbragstad has joined #openstack-nova | 15:55 | |
*** dave-mccowan has joined #openstack-nova | 15:56 | |
*** k_mouza has quit IRC | 16:00 | |
*** k_mouza has joined #openstack-nova | 16:11 | |
dansmith | melwitt: ahh, maybe the separate joins are only happening when we grab multiple instances | 16:13 |
dansmith | if that's the case, then just making the get-by-uuid path work the same would be easy | 16:14 |
melwitt | is there a specific part of the code you're looking at that makes you think that? | 16:15 |
*** k_mouza has quit IRC | 16:15 | |
dansmith | I shouldn't say "separate joins"... they're called "manual joins" in the db api, because they're separate queries | 16:15 |
dansmith | melwitt: look at this: https://github.com/openstack/nova/blob/master/nova/db/sqlalchemy/api.py#L1278 | 16:15 |
dansmith | melwitt: and this: https://github.com/openstack/nova/blob/master/nova/db/sqlalchemy/api.py#L1328 | 16:16 |
melwitt | ahh | 16:16 |
dansmith | when we moved flavor info into sysmeta, we caused a giant explosion on every instance list, right around icehouse, | 16:17 |
*** songwenping_ has joined #openstack-nova | 16:17 | |
dansmith | so we moved to this manual join method where we query the metadata tables separately, but in bulk, to avoid the n:m join | 16:17 |
dansmith | so the single case is similar, just where n=1, but if m is large enough, still unideal | 16:18 |
sean-k-mooney | we basically need to use a subquery for the userdata | 16:18 |
sean-k-mooney | and get the rest of the data normally | 16:18 |
*** k_mouza has joined #openstack-nova | 16:18 | |
dansmith | subquery or separate query? not sure how the former will help | 16:19 |
sean-k-mooney | separate query | 16:19 |
dansmith | yeah, we do that on list, just don't on get, that's my point | 16:19 |
sean-k-mooney | yeah | 16:19 |
dansmith | all the machinery is there from list, so get can just call it but with the single instance we care about | 16:20 |
sean-k-mooney | i didn't look at the code specifically but this sounded familiar as i think we already optimised this once before for that list case | 16:20 |
*** songwenping__ has quit IRC | 16:20 | |
sean-k-mooney | it's just one of those things that works fine for small numbers and then explodes because it's n*m complexity | 16:21 |
sean-k-mooney | in size | 16:21 |
dansmith | for list, yeah | 16:21 |
sean-k-mooney | well in this case it's joining userdata with each metadata property | 16:21 |
dansmith | for single get n=1, but m is large | 16:21 |
dansmith | right | 16:21 |
sean-k-mooney | ya so it's still linear instead of O(1) | 16:22 |
sean-k-mooney | not as bad but still kind of sucks | 16:22 |
dansmith | linearly big is still big :) | 16:22 |
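The difference between the joined get and the "manual join" approach can be illustrated with an in-memory SQLite database. This is a sketch under assumptions: the two-table schema is a simplified, hypothetical stand-in for nova's tables, and the helper names are invented for illustration. It only shows how much user_data is carried back when it is joined against m metadata rows versus fetched once.

```python
import sqlite3


def build_db(metadata_count, user_data_size):
    """Create one instance with a large user_data blob and m metadata rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE instances (uuid TEXT PRIMARY KEY, user_data TEXT)")
    conn.execute(
        "CREATE TABLE instance_metadata (instance_uuid TEXT, key TEXT, value TEXT)"
    )
    conn.execute(
        "INSERT INTO instances VALUES (?, ?)", ("inst-1", "x" * user_data_size)
    )
    conn.executemany(
        "INSERT INTO instance_metadata VALUES (?, ?, ?)",
        [("inst-1", f"key{i}", f"value{i}") for i in range(metadata_count)],
    )
    return conn


def fetch_joined(conn, uuid):
    """Single joined query: user_data is repeated on every metadata row."""
    rows = conn.execute(
        "SELECT i.user_data, m.key, m.value FROM instances i "
        "LEFT JOIN instance_metadata m ON m.instance_uuid = i.uuid "
        "WHERE i.uuid = ?",
        (uuid,),
    ).fetchall()
    user_data_chars = sum(len(row[0]) for row in rows)
    return rows, user_data_chars


def fetch_manual(conn, uuid):
    """Manual join: two separate queries, user_data is returned exactly once."""
    (user_data,) = conn.execute(
        "SELECT user_data FROM instances WHERE uuid = ?", (uuid,)
    ).fetchone()
    metadata = conn.execute(
        "SELECT key, value FROM instance_metadata WHERE instance_uuid = ?",
        (uuid,),
    ).fetchall()
    return user_data, metadata, len(user_data)


if __name__ == "__main__":
    conn = build_db(metadata_count=50, user_data_size=1000)
    _, joined_chars = fetch_joined(conn, "inst-1")
    _, _, manual_chars = fetch_manual(conn, "inst-1")
    print("joined:", joined_chars, "manual:", manual_chars)
```

In the joined form the amount of user_data returned grows linearly with the number of metadata rows even for a single instance (n=1), which matches the "linearly big is still big" point.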
*** k_mouza has quit IRC | 16:23 | |
*** martinkennelly has joined #openstack-nova | 16:26 | |
*** k_mouza has joined #openstack-nova | 16:27 | |
openstackgerrit | melanie witt proposed openstack/nova stable/ussuri: Follow up for cherry-pick check for merge patch https://review.opendev.org/758463 | 16:30 |
*** k_mouza has quit IRC | 16:31 | |
*** eharney has quit IRC | 16:33 | |
*** ociuhandu_ has joined #openstack-nova | 16:35 | |
*** ociuhandu has quit IRC | 16:38 | |
*** dtantsur is now known as dtantsur|afk | 16:39 | |
*** ociuhandu_ has quit IRC | 16:40 | |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Ignore PCI devices with 32bit domain https://review.opendev.org/756697 | 16:43 |
*** lpetrut has joined #openstack-nova | 16:44 | |
sean-k-mooney | gibi: i have seen devices that use the 32bit domain that can be assigned to qemu | 16:45 |
sean-k-mooney | although i was using a patched qemu | 16:45 |
gibi | sean-k-mooney: interesting. Could you check if the patch included support for 32 bit domain? there is a link in the commit message for the qemu code | 16:46 |
sean-k-mooney | it was an internal intel build for unrelated hardware | 16:46 |
sean-k-mooney | it was never upstreamed :) | 16:46 |
gibi | melwitt: improved https://review.opendev.org/756697 | 16:46 |
sean-k-mooney | gibi: looking at your patch i think it's fine but it would be nice to add some docs for this limitation somewhere | 16:47 |
sean-k-mooney | maybe in the pci whitelist docs somewhere | 16:47 |
gibi | sean-k-mooney: we already limit the whitelist to 16 bit | 16:47 |
melwitt | gibi: thanks, will look | 16:47 |
sean-k-mooney | gibi: but do we document that limitation or just do it in code | 16:48 |
gibi | sean-k-mooney: good point, we actually do not https://docs.openstack.org/nova/latest/configuration/config.html#pci.passthrough_whitelist | 16:49 |
gibi | I will add extra information to the config description | 16:49 |
sean-k-mooney | ya we just say "PCI address of the device. Both traditional glob style and regular expression syntax is supported." | 16:49 |
sean-k-mooney | if we can add a note that we only support 16 bit domains i'll be happy | 16:50 |
gibi | sean-k-mooney: I will do | 16:50 |
gibi | but not today. As it is beer time for me now | 16:51 |
gibi | see you all tomorrow | 16:51 |
gibi | o/ | 16:51 |
sean-k-mooney | gibi: redhatters will be off tomorrow | 16:51 |
sean-k-mooney | its a company day | 16:51 |
sean-k-mooney | just an fyi | 16:51 |
gibi | sean-k-mooney: thanks for the heads up, enjoy your free day | 16:52 |
sean-k-mooney | ill try to not spend it working on ci :) | 16:53 |
melwitt | sean-k-mooney: am I missing something or does that make all invalid configs ignored? maybe I just have a wrong understanding | 16:55 |
sean-k-mooney | i have not done a full review but ill take a look now | 16:56 |
sean-k-mooney | you're wondering about https://review.opendev.org/#/c/756697/4/nova/pci/devspec.py | 16:57 |
sean-k-mooney | the transformation of PciConfigInvalidWhitelist to PciInvalidDevice | 16:57 |
sean-k-mooney | this change actually does not take into account the glob "*:whatever" addresses or the regex support | 16:58 |
*** lpetrut has quit IRC | 16:59 | |
sean-k-mooney | self.domain = pci_addr.get('domain', REGEX_ANY) also does not truncate to just 16 bits today | 16:59 |
melwitt | I don't know this parsing code really, but I was just wondering would this mask all/most actual invalid configs | 16:59 |
*** derekh has quit IRC | 17:00 | |
sean-k-mooney | so it's catching PciConfigInvalidWhitelist | 17:01 |
sean-k-mooney | which is only raised here https://github.com/openstack/nova/blob/master/nova/pci/devspec.py#L51-L66 | 17:01 |
sean-k-mooney | but it would catch any case where the value exceeds the max | 17:01 |
sean-k-mooney | or where it was invalid | 17:01 |
*** hamalq has joined #openstack-nova | 17:02 | |
melwitt | yeah, that's where I'm wondering is there a way to tell when it's a 32bit domain parse fail? so that we could raise something different? | 17:02 |
sean-k-mooney | we can look at the data we parsed | 17:02 |
melwitt | I see | 17:03 |
sean-k-mooney | we could also split the exceptions | 17:03 |
melwitt | kinda like the "if v > maxval" | 17:03 |
sean-k-mooney | ya so https://github.com/openstack/nova/blob/master/nova/pci/devspec.py#L57-L66 | 17:03 |
sean-k-mooney | these are the two cases where it's currently raised | 17:03 |
sean-k-mooney | but we can have the maxval branch raise a subclass of PciConfigInvalidWhitelist | 17:04 |
sean-k-mooney | and then catch that | 17:04 |
melwitt | yeah, so we are currently hitting the "if v > maxval" condition | 17:04 |
sean-k-mooney | e.g. PciConfigInvalidWhitelistMaxExceeded | 17:04 |
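The idea of splitting the exceptions could look roughly like the sketch below. It is not the nova code: `parse_hex_field` and `MAX_DOMAIN` are hypothetical names, the base exception is a local stand-in for the nova one, and `PciConfigInvalidWhitelistMaxExceeded` is only the name floated in the discussion.

```python
class PciConfigInvalidWhitelist(Exception):
    """Local stand-in for the nova exception of the same name."""


class PciConfigInvalidWhitelistMaxExceeded(PciConfigInvalidWhitelist):
    """Hypothetical subclass: the value parses but exceeds the maximum."""


MAX_DOMAIN = 0xFFFF  # the 16-bit PCI domain limit discussed in the log


def parse_hex_field(value, maxval):
    """Parse a hex string, telling malformed input apart from overflow."""
    try:
        parsed = int(value, 16)
    except ValueError:
        # Non-hex input: still a plain configuration error.
        raise PciConfigInvalidWhitelist("not a hex value: %r" % value)
    if parsed > maxval:
        # Overflow: raise the subclass so callers can catch only this case.
        raise PciConfigInvalidWhitelistMaxExceeded(
            "%r exceeds maximum %#x" % (value, maxval))
    return parsed
```

Because the new exception is a subclass, existing handlers for `PciConfigInvalidWhitelist` keep working, while code that wants to skip only oversized addresses can catch the narrower type.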
sean-k-mooney | yes so we want to ignore those devices | 17:05 |
sean-k-mooney | the other case is where we get non-hex addresses | 17:05 |
melwitt | so we want to ignore all that are > maxval | 17:05 |
sean-k-mooney | i guess we could ignore those too | 17:05 |
sean-k-mooney | these are coming from libvirt | 17:05 |
sean-k-mooney | well actually no they are not | 17:05 |
sean-k-mooney | it's the whitelist at this point although some of this code is common | 17:06 |
melwitt | ok, that's where I was confused was, I was assuming we only wanted to ignore the ones > maxval. do we want to ignore non hex addresses? just saying that this patch seems to just make the validations all ignored. is there a reason to keep the validations? | 17:07 |
sean-k-mooney | am well we should not allow you to whitelist devices that are invalid | 17:11 |
sean-k-mooney | i need to look at the bug | 17:11 |
melwitt | ok. I'll try to write more comments | 17:13 |
sean-k-mooney | so i think the issue is that we use this function to filter the pci devices returned by libvirt | 17:14 |
sean-k-mooney | so even if you don't have a whitelist value that is 32bits it will explode when we try to filter the value | 17:14 |
*** k_mouza has joined #openstack-nova | 17:14 | |
sean-k-mooney | we do nova.pci.devspec.WhitelistPciAddress.match() | 17:15 |
sean-k-mooney | we probably don't want to catch the exception where gibi is | 17:16 |
sean-k-mooney | instead we probably want to only catch it and skip the device where we do the match | 17:16 |
melwitt | ok, so this issue is NOT to do with the whitelist at all. I see | 17:18 |
melwitt | so the whitelist parsing code is reused to parse the devices returned by libvirt | 17:18 |
melwitt | ok, I was missing that | 17:18 |
*** k_mouza has quit IRC | 17:19 | |
sean-k-mooney | so here https://github.com/openstack/nova/blob/master/nova/pci/manager.py#L118-L122 | 17:19 |
sean-k-mooney | yep we reuse https://github.com/openstack/nova/blob/2c3ff1004e8812da0bc4e5502762e6b9a7650846/nova/pci/whitelist.py#L80-L88 | 17:20 |
melwitt | ok, this way we're sure we're not parsing a whitelist | 17:20 |
*** andrewbonney has quit IRC | 17:20 | |
sean-k-mooney | so what we do is we build a device spec object for every entry in the whitelist | 17:20 |
sean-k-mooney | then for every device returned by the virt driver we construct a second one and see if they match | 17:21 |
sean-k-mooney | it's the libvirt address that is 32bit | 17:21 |
melwitt | but the exception guessed raised in __init__ not in the match() call? | 17:22 |
melwitt | *is | 17:22 |
*** brinzhang has quit IRC | 17:22 | |
sean-k-mooney | from here https://github.com/openstack/nova/blob/2c3ff1004e8812da0bc4e5502762e6b9a7650846/nova/pci/devspec.py#L217-L238 | 17:23 |
*** brinzhang has joined #openstack-nova | 17:23 | |
melwitt | ohhhh | 17:23 |
melwitt | guh | 17:23 |
openstackgerrit | Sylvain Bauza proposed openstack/nova master: Fix the vGPU dynamic options race https://review.opendev.org/758470 | 17:23 |
melwitt | so devices_json is a list of whitelist pci objects | 17:24 |
melwitt | maybe not... I'll look. this is so confusing to me | 17:26 |
sean-k-mooney | it's a json blob we construct from the data we get from libvirt | 17:26 |
sean-k-mooney | the nodedev xmls | 17:26 |
sean-k-mooney | it looks like a list of these https://github.com/openstack/nova/blob/131f7606c179bd08bde3b7ae0e6bb0b59acf1545/nova/tests/unit/pci/test_manager.py#L32-L41 | 17:28 |
melwitt | yeah, just that device_assignable is from the Whitelist class | 17:28 |
sean-k-mooney | i have to look this up every time but it's generated here https://opendev.org/openstack/nova/src/branch/master/nova/virt/libvirt/driver.py#L7127-L7150 | 17:29 |
sean-k-mooney | i rewrote this recently | 17:30 |
sean-k-mooney | but we get all the nodedevs from libvirt that are pci devices or nics | 17:31 |
melwitt | thanks | 17:31 |
melwitt | oh it's the dev_filter that's the whitelist, facepalm. sorry | 17:31 |
melwitt | I have gone through this before too, but still mess it up | 17:32 |
sean-k-mooney | dev_filter is an instance of the Whitelist class yes | 17:33 |
sean-k-mooney | which has one field, specs, which holds all the whitelist entries | 17:33 |
sean-k-mooney | then we just do | 17:34 |
sean-k-mooney | for spec in self.specs: | 17:34 |
sean-k-mooney |     if spec.match(dev): | 17:34 |
sean-k-mooney |         return True | 17:34 |
sean-k-mooney | return False | 17:34 |
sean-k-mooney | so if any of the entries match the json serialisation of the pci device from the virt driver we return true | 17:34 |
melwitt | ok | 17:35 |
sean-k-mooney | that match is the one defined in WhitelistPciAddress | 17:35 |
*** k_mouza has joined #openstack-nova | 17:36 | |
sean-k-mooney | which does https://github.com/openstack/nova/blob/2c3ff1004e8812da0bc4e5502762e6b9a7650846/nova/pci/devspec.py#L238 | 17:36 |
sean-k-mooney | which eventually explodes | 17:37 |
sean-k-mooney | it's actually exploding here https://github.com/openstack/nova/blob/2c3ff1004e8812da0bc4e5502762e6b9a7650846/nova/pci/devspec.py#L237 | 17:37 |
sean-k-mooney | when we construct the dev object to compare against from the address we get from libvirt | 17:38 |
sean-k-mooney | that's why it's exploding in init | 17:38 |
sean-k-mooney | we construct the object from the string to reuse the parsing that was written for the whitelist | 17:39 |
sean-k-mooney | so we don't have to have it twice | 17:39 |
melwitt | is pci_phys_addr the whitelist entry? | 17:40 |
*** k_mouza has quit IRC | 17:40 | |
sean-k-mooney | no that is the pci address from libvirt | 17:41 |
sean-k-mooney | self.pci_address_spec is the object from the whitelist | 17:41 |
sean-k-mooney | aren't the names great | 17:41 |
sean-k-mooney | so easy to follow and not confusing at all | 17:41 |
melwitt | /o\ | 17:42 |
*** martinkennelly has quit IRC | 17:42 | |
sean-k-mooney | melwitt: it's not taking this branch https://github.com/openstack/nova/blob/2c3ff1004e8812da0bc4e5502762e6b9a7650846/nova/pci/devspec.py#L231-L234 | 17:43 |
sean-k-mooney | but it would explode if it did | 17:43 |
melwitt | well, what I mean is, IIUC (which I probably don't) one of those construct of PhysicalPciAddress is "correctly" parsing a whitelist entry and the other one is parsing an address from libvirt. is my understanding wrong? | 17:43 |
sean-k-mooney | that is for pci devices that are capable of sriov | 17:43 |
melwitt | oh, ok. nevermind my question then | 17:43 |
sean-k-mooney | for PFs, if you list a PF we allow all the VFs, if they are allocated, to be used | 17:44 |
sean-k-mooney | that is what that branch is for | 17:44 |
sean-k-mooney | so if you whitelist a PF it's a shortcut to whitelist the VFs | 17:44 |
melwitt | that makes sense | 17:44 |
sean-k-mooney | anyway the tl;dr for gibi is we should catch the exception here https://github.com/openstack/nova/blob/2c3ff1004e8812da0bc4e5502762e6b9a7650846/nova/pci/whitelist.py#L85-L87 instead | 17:46 |
sean-k-mooney | so that if should be in a try | 17:47 |
sean-k-mooney | and we return false if we get the PciConfigInvalidWhitelist exception | 17:48 |
sean-k-mooney | which is what he is doing here https://review.opendev.org/#/c/756697/4/nova/pci/manager.py | 17:48 |
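A minimal sketch of the tl;dr above, assuming the match loop quoted earlier in the log. `FakeSpec` and its address parsing are hypothetical stand-ins for the real device spec classes, and the exception is a local stand-in for the nova one; only the try/except placement reflects the suggestion.

```python
class PciConfigInvalidWhitelist(Exception):
    """Local stand-in for the nova exception of the same name."""


class FakeSpec:
    """Hypothetical whitelist entry that matches on a 16-bit PCI domain."""

    def __init__(self, domain):
        self.domain = domain

    def match(self, dev):
        # Parsing the device address can raise, mirroring the failure
        # described in the log for 32-bit domains reported by libvirt.
        dev_domain = int(dev["address"].split(":")[0], 16)
        if dev_domain > 0xFFFF:
            raise PciConfigInvalidWhitelist("domain exceeds 16 bits")
        return dev_domain == self.domain


class Whitelist:
    def __init__(self, specs):
        self.specs = specs

    def device_assignable(self, dev):
        for spec in self.specs:
            try:
                if spec.match(dev):
                    return True
            except PciConfigInvalidWhitelist:
                # The device address cannot be parsed, so skip the device
                # rather than letting the exception propagate.
                return False
        return False
```

With this placement, a whitelist entry that is itself invalid still fails when the whitelist is parsed, and only unparseable device addresses are silently skipped.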
sean-k-mooney | back to your original question | 17:48 |
sean-k-mooney | https://review.opendev.org/#/c/756697/4/nova/pci/devspec.py | 17:48 |
sean-k-mooney | is still going to raise an exception if you input an invalid whitelist | 17:49 |
sean-k-mooney | so i think the compute agent will still fail to start in that case | 17:49 |
sean-k-mooney | but we support 3 different types of addresses. | 17:50 |
*** k_mouza has joined #openstack-nova | 17:50 | |
sean-k-mooney | gibi only extended this one since it's the one we use for the virt driver objects | 17:51 |
sean-k-mooney | since gibi is not catching the exception it will still fail here | 17:52 |
sean-k-mooney | https://github.com/openstack/nova/blob/2c3ff1004e8812da0bc4e5502762e6b9a7650846/nova/pci/whitelist.py#L52-L79 | 17:52 |
sean-k-mooney | when we are parsing the whitelist if it had a 32bit domain | 17:52 |
sean-k-mooney | this will raise the new exception | 17:53 |
sean-k-mooney | spec = devspec.PciDeviceSpec(ds) | 17:53 |
sean-k-mooney | line 75 | 17:53 |
*** k_mouza has quit IRC | 17:55 | |
ganso | hi folks! If anyone could please spare a few minutes to look at this stable/train backport, should be really simple and quick as the other ones: https://review.opendev.org/#/c/757870/ | 18:21 |
*** artom has quit IRC | 18:22 | |
*** tesseract has quit IRC | 18:22 | |
*** artom has joined #openstack-nova | 18:22 | |
ganso | melwitt, lyarwood, elod ^ | 18:22 |
*** ralonsoh has quit IRC | 18:22 | |
melwitt | ganso: ack will look later today | 18:24 |
ganso | melwitt: thank you! | 18:24 |
*** ociuhandu has joined #openstack-nova | 18:55 | |
*** ociuhandu has quit IRC | 18:59 | |
*** gregwork has quit IRC | 19:26 | |
elod | ganso: the backport looks good to me, just as it was ok to ussuri. +2 | 20:12 |
ganso | elod: thanks! | 20:12 |
*** nweinber has quit IRC | 20:14 | |
*** k_mouza has joined #openstack-nova | 20:50 | |
*** k_mouza has quit IRC | 20:54 | |
*** k_mouza has joined #openstack-nova | 20:54 | |
*** raildo has quit IRC | 21:13 | |
*** jobewan has quit IRC | 21:14 | |
*** k_mouza has quit IRC | 21:14 | |
*** slaweq has quit IRC | 21:31 | |
*** brinzhang_ has joined #openstack-nova | 21:35 | |
*** brinzhang has quit IRC | 21:39 | |
*** rcernin_ has joined #openstack-nova | 22:02 | |
*** rcernin_ has quit IRC | 22:19 | |
*** slaweq has joined #openstack-nova | 22:22 | |
*** hamalq has quit IRC | 22:32 | |
*** hamalq has joined #openstack-nova | 22:33 | |
*** slaweq has quit IRC | 22:38 | |
*** tkajinam has joined #openstack-nova | 22:59 | |
*** macz_ has quit IRC | 23:03 | |
*** mlavalle has quit IRC | 23:23 | |
*** tosky has quit IRC | 23:23 | |
*** dklyle has quit IRC | 23:35 | |
*** k_mouza has joined #openstack-nova | 23:51 | |
*** k_mouza has quit IRC | 23:55 | |
*** zzzeek has quit IRC | 23:55 | |
*** zzzeek has joined #openstack-nova | 23:59 |