Tuesday, 2026-06-09

*** ykarel_ is now known as ykarel10:32
dtantsurHi folks! I'd appreciate review on https://review.opendev.org/c/openstack/project-config/+/991483 please13:01
*** jgilaber_ is now known as jgilaber14:01
clarkbfungi: I know we're going into the board meeting tunnel I +2'd ^ with my note about the branches situation I think it will be fine but maybe double check what I wrote in case it jogs memory before we approve?14:55
fungiyeah, it sounds correct to me14:56
fungiapproved it14:56
clarkbsounds good14:58
opendevreviewMerged openstack/project-config master: Add 3 more Metal3 projects to untrusted-projects  https://review.opendev.org/c/openstack/project-config/+/99148315:09
dtantsurThank you!15:38
dtantsurHmm, is it just me or did gerrit go for a walk a minute ago?15:40
dansmithsame here15:40
dtantsurPings work and curl went through eventually, but Firefox times out.15:41
dtantsurYeah, curl -I finishes successfully in 97 seconds. More AI happiness?15:43
clarkbsystem load is relatively low and gerrit is up and running. I wonder if it is the proxy getting overloaded15:45
fungiproxy as in apache? i can take a look at the server-status scorecard15:50
clarkbfungi: is apache status trackign enabled on review? I do suspect it is apache running out of slots. Though I can see logs from about 9 hours ago where it complains about running out of max request workers15:50
clarkbfungi: yes apache15:50
clarkber though I can see it complaining in the error.log about that 9 housr ago I don't see current complaints15:51
fungiit's definitely acting like it's overloaded, just trying to `wget http://localhost/server-status` from a shell on the server is hanging right now15:52
fungiso definitely acting like it may be out of worker slots15:52
fungifinally returned to me and it looks like all workers are either open with no current process or reading requests15:53
fungiso all running workers are full up15:54
clarkbok in the past we've avoided increasing those limits due to the thread pool sizes on the gerrit side15:54
fungii'll try to characterize/summarize the pending requests15:54
clarkbthanks15:54
fungithough it looks like the bulk of them haven't actually gotten the request in yet, it's just showing up as empty15:55
clarkbfungi: look at `ss -npt | less` or similar. One ip address tands out15:58
clarkboh thats our own address?15:58
clarkboh I'm looking at he columns wrong duh thats the local side15:58
clarkbbut that does show an imbalance between frontend connections and connections to the backend15:59
clarkbso its like we're not even trying to connect to gerrit?15:59
fungii see a single ipv6 address in the server-status output with about 20 outstanding requests15:59
fungianother with 6, one with 4, a few with 2, and about 180 with only one request each16:00
clarkbsanity checking the backend further gerrit show-queue looks fine16:01
clarkbso  Ithink this is on the frontend16:01
fungimight be a good excuse to add anubis there16:02
clarkbI see a bunch of git upload packs getting through16:05
clarkbin theory most of that traffic should be funneled to the gitea farm16:05
clarkb(but if that is the traffic that is the problem anubis may not be immediately helpful)16:06
ykarelhi can ^ also make zuul gate result not getting reported and patch not being merged?16:17
ykarelall jobs passed https://zuul.openstack.org/buildset/ce5b2b3bf586458c88f1d57d928cc5ee but it says buildset in progress16:18
ykarelclarkb, fungi ^16:18
fungiykarel: possibly, zuul does use the gerrit rest api in order to be able to return inline review comments16:18
fungiso if it can't establish an https connection to gerrit it may impact result reporting16:19
ykarelack, so need to recheck or possible to merge based on above results itself? this was top in the gate queue16:19
fungiseems like zuul considers the buildset in progress but the change item is no longer enqueued in any pipeline according to the status view16:22
clarkbits possible that the db reporting didn't happen because gerrit reporting failed?16:22
opendevreviewDmitriy Rabotyagov proposed openstack/project-config master: Deprecate Vitrage project  https://review.opendev.org/c/openstack/project-config/+/98287816:22
clarkbthe data you see in your zuul web ui for historical dbs comes from the database and that is a separate reporting step16:23
fungiclarkb: ooh, okay right this isn't in-memory state it's pulling from sql so yes may have not gotten recorded there16:23
ykarelwhen i was watching the status page, there were 3-4 patches in top of the gate queue which had all the jobs passed, at that time gerrit was not loading for me16:23
fungiykarel: so yes, i expect that change possibly others failed to report, which will be evidenced by a missing verified vote from the zuul user, a recheck will unfortunately test them all over again but should work as long as gerrit doesn't start having problems again16:48
fungiit seems fine at the moment after forcefully restarting the main apache daemon a little while ago. we're still going through logs and coming up with some potential mitigations, though it's as of yet unclear what caused it to stop recycling worker processes or forwarding requests to gerrit16:51
opendevreviewStephen Finucane proposed openstack/pbr master: Add Resolute, py314 testing  https://review.opendev.org/c/openstack/pbr/+/98993822:06

Generated by irclog2html.py 4.1.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!