Friday, 2023-11-17

tkajinamhi. seems skip level grenade job is broken because of the recent tooz release which contains https://review.opendev.org/c/openstack/tooz/+/89135508:18
tkajinamthe problem seems to be that etcd3 in stable/2023.1 is 3.3 which does not provide /v3 endpoint and it's not updated to 3.4 or later during upgrade process.08:19
tkajinamwondering if we can(or should) update etcd during the process08:19
tkajinamhttps://zuul.opendev.org/t/openstack/builds?job_name=grenade-skip-level-always&project=openstack/nova08:19
tkajinamexamples ^^^08:19
opendevreviewTakashi Kajinami proposed openstack/devstack stable/2023.1: Stable-only: Force v1beta etcd api  https://review.opendev.org/c/openstack/devstack/+/90122108:27
fricklertkajinam: IMO upgrading etcd in grenade would seem sensible. I'm not sure about possible operator impact though08:29
opendevreviewTakashi Kajinami proposed openstack/devstack stable/2023.1: Stable-only: Force v1beta etcd api  https://review.opendev.org/c/openstack/devstack/+/90122308:29
tkajinamprobably this may be an alternative solution ^^^08:30
fricklerthat may fix the testing, but doesn't seem feasible for existing deployments, at least I wouldn't know how to communicate this properly to affected parties08:33
tkajinamfrickler, we have an interface to override api version so operators need to set the one according to the etcd version they have. When they have etcd 3.3 then they may start with v1alpha, then upgrade etcd to 3.4 and then switch to v1beta.  then upgrade etcd to 3.5 and switch to v108:33
tkajinamwe can consider detecting the available versions automatically. though. we discussed it a while ago but no progress was made. I can take time to look into it later.08:41
opendevreviewTakashi Kajinami proposed openstack/devstack stable/2023.1: Stable-only: Force v1beta etcd api  https://review.opendev.org/c/openstack/devstack/+/90122109:10
-opendevstatus- NOTICE: Gerrit will be unavailable for a short time starting at 15:30 UTC as it is upgraded to the 3.8 release. https://lists.opendev.org/archives/list/service-announce@lists.opendev.org/thread/XT26HFG2FOZL3UHZVLXCCANDZ3TJZM7Q/14:07
dansmithtkajinam: you think that's only stable related? I see it on master too: https://zuul.opendev.org/t/openstack/build/f3154ea9c84143259e4d325cb96f384a15:07
dansmithI see I'm late to the party from reading -discuss15:20
dansmithdamn timezones :)15:20
tkajinamdansmith, the problem is caused by the config file generated at the source version so we need it only in stable/2023.115:29
dansmithtkajinam: okay perhaps I was conflating that and the etcd thing, but the latter is definitely seen on master15:30
dansmith(i.e. non-grenade things)15:30
tkajinamhttps://bugs.launchpad.net/python-tooz/+bug/204381015:30
tkajinamah, wait15:30
tkajinamhmm. that specific failure seems to be caused by15:31
tkajinam2023-11-17 14:56:36.287910 | controller | The conflict is caused by:15:31
tkajinam2023-11-17 14:56:36.287947 | controller |     etcd3 0.12.0 depends on grpcio>=1.27.115:31
tkajinam2023-11-17 14:56:36.287980 | controller |     The user requested (constraint) grpcio===1.58.015:31
tkajinambut that's not related to our recent discussion. maybe it's a new thing, but the error looks quite strange15:32
tkajinamgrpcio===1.58.0 does not conflict with grpcio>=1.27.115:32
dansmithwonderful15:33
tkajinammaybe we have to drop etcd3 driver. we deprecated it a while ago because python-etcd3 is unmaintained. if etcd3 is causing strange thing then it's the time.15:34
tkajinamthough I may attempt to find what that strange error is15:34
dansmithI guess I'm surprised everyone isn't hitting this ^ because it looks fairly deterministic.. is it worth a recheck or no?15:36
tkajinamI'm trying to check whether the error consistently appears. though I can't check zuul/gerrit now because of server errors, likely caused by upgrade going on now15:38
tkajinamI'd suggest recheck once15:38
dansmithyup, pokay15:39
opendevreviewJames Parker proposed openstack/whitebox-tempest-plugin master: Add plugin specific sriov vnic port parameter  https://review.opendev.org/c/openstack/whitebox-tempest-plugin/+/90023916:33
-opendevstatus- NOTICE: The Gerrit upgrade is complete, however we have Zuul offline in parallel for a schema migration, so any events occurring during this time will be lost (requiring a recheck or similar to trigger jobs once it returns to service); we'll update again once this is complete.16:34
-opendevstatus- NOTICE: Zuul is fully back in service now, but any events occurring prior to 17:05 UTC may need a recheck to trigger jobs.17:13

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!