Monday, 2024-12-02

opendevreviewDr. Jens Harbott proposed opendev/system-config master: Update IRC channel docs  https://review.opendev.org/c/opendev/system-config/+/93680607:11
fricklerinfra-root: ildikov: ^^ that's what I came up with, please have a look and feel free to amend if needed07:12
opendevreviewAlbin Vass proposed zuul/zuul-jobs master: prepare-workspace-git: Make it possible to sync a subset of projects  https://review.opendev.org/c/zuul/zuul-jobs/+/93682812:46
opendevreviewAlbin Vass proposed zuul/zuul-jobs master: prepare-workspace-git: Make it possible to sync a subset of projects  https://review.opendev.org/c/zuul/zuul-jobs/+/93682812:53
opendevreviewKarolina Kula proposed zuul/zuul-jobs master: DNM Switch to KVM  https://review.opendev.org/c/zuul/zuul-jobs/+/93602312:54
opendevreviewKarolina Kula proposed openstack/diskimage-builder master: DNM Testing on KVM  https://review.opendev.org/c/openstack/diskimage-builder/+/93602412:55
opendevreviewDr. Jens Harbott proposed opendev/system-config master: Update IRC channel docs  https://review.opendev.org/c/opendev/system-config/+/93680613:07
ildikovfrickler: Thank you! I added one question to see if I interpreted the new text as intended, but otherwise the change looks good. The '-1' is for the typo.13:16
opendevreviewAlbin Vass proposed zuul/zuul-jobs master: prepare-workspace-git: Make it possible to sync a subset of projects  https://review.opendev.org/c/zuul/zuul-jobs/+/93682813:52
opendevreviewDr. Jens Harbott proposed opendev/system-config master: Update IRC channel docs  https://review.opendev.org/c/opendev/system-config/+/93680613:54
fungimailing list server maintenance begins in an hour, at 1500z: https://etherpad.opendev.org/p/lists-openinfra-org-migration13:58
fungiall advance steps are completed, so an hour from now we'll pick up with step #4 to kick it off13:59
fricklerfungi: small question on the config change, but not critical I think. also I'll be kind of around in case something goes badly wrong14:09
opendevreviewJeremy Stanley proposed opendev/system-config master: Move OpenInfra mailing lists to new domain  https://review.opendev.org/c/opendev/system-config/+/93630314:34
fungithanks frickler!14:35
fungimaintenance starts in 5 minutes, i've got a root screen session open on lists0114:54
opendevreviewKarolina Kula proposed zuul/zuul-jobs master: DNM Switch to KVM  https://review.opendev.org/c/zuul/zuul-jobs/+/93602314:56
fungiand the session is logging to ~root/screenlog.0 for posterity14:57
fungi#status log Mailing lists services are offline for the next two hours for maintenance, but messages will be deferred and delivered once work has concluded: https://lists.opendev.org/archives/list/service-announce@lists.opendev.org/thread/NQ3CPI2OX2TKE3OWNATUBA477T4SYTUZ/15:00
opendevstatusfungi: finished logging15:00
fungistopping exim415:00
fungistopping mailman containers15:00
fungibacking up the database now, this will take a few minutes to complete15:01
fungithat's done, now i'm applying the 15 (!) sql update queries we need for this15:05
fungithe bounceevent table queries didn't match any rows, i'm skeptical but it's possible that's fine. i'll check the table contents when i'm done15:09
fungithere are only 28 rows in that table at the moment, and none of them are for lists.openinfra.dev subscribers, so it's fine15:14
fungimy latest edit to 936303 seems like it's problematic, taking a look real quick though it's not critical that it merge right now15:15
fungiyeah, trivial typo for one of the errors...15:17
fungiother one is an oopsie on my part, assert assumed a path edit where there was none15:18
opendevreviewJeremy Stanley proposed opendev/system-config master: Move OpenInfra mailing lists to new domain  https://review.opendev.org/c/opendev/system-config/+/93630315:20
fungithat should hopefully take care of it15:20
fungiback to the maintenance tasks, skipping step 9 for the moment15:21
fungiand made a note that we shouldn't go past step 17 until that gets done15:22
fungifor step 10, i'll make sure the patch to the apache vhost config reflects the last edit to 93630315:23
fricklersorry for having been late with my review, we could also merge PS3 and add the testing in a followup?15:25
fungii've manually applied the latest vhost config from 936303 now15:31
fungiapache is restarted, and i'm restarting the mailman containers now15:33
fungithe various components take a couple of minutes to start, then we can check redirects and content manually (but have to accept the incorrect cert temporarily)15:34
fungithe redirect for the example url is working correctly, but i'm still getting a 503 service unavailable for the moment15:36
fungiokay, it's responding to me now15:38
fungihttps://lists.openinfra.dev/archives/list/foundation@lists.openinfra.dev/thread/QQ6CUBG337L3UP7FXKLWTCHBJISTQBOC/ gets correctly redirected to https://lists.openinfra.org/archives/list/foundation@lists.openinfra.org/thread/QQ6CUBG337L3UP7FXKLWTCHBJISTQBOC/ and has content15:38
fungiso that's good15:39
fungistarting exim back up15:40
fungii'm going to reply to my foundation ml message about the maintenance now and make sure it comes through and ends up in the archive15:41
clarkbfungi: school run is complete anything I can do to be helpful with the mm3 work?15:48
fungii think we're still on track, i just sent a message to the foundation ml, though it hasn't arrived yet15:48
clarkbinfra-root I would be super appreciative if we can land https://review.opendev.org/c/opendev/system-config/+/936305/ sometime today with a plan for restarting Gerrit later in my day as system load falls. That will put us in a good spot to do the upgrade of gerrit at the end of the week as I can retest upgrading latest to latest bugfix releases15:49
fungiokay, i'm going to start tracing my message through mta logs15:52
fungi2024-12-02 15:45:48 1tI8cI-0007Zc-Mz => foundation@lists.openinfra.dev R=dnslookup T=remote_smtp H=lists.openinfra.dev [2001:4800:7813:516:be76:4eff:fe04:5423] X=TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256 CV=no DN="C=UK,O=Exim Developers,CN=lists01.opendev.org" C="250 OK id=1tI8cO-005Vn8-LL"15:53
fungithat's from my mta, so far so good15:53
fungi2024-12-02 15:45:48 1tI8cO-005Vn8-LL ** foundation@lists.openinfra.dev R=mailman_router T=mailman_transport H=localhost [127.0.0.1]: SMTP error from remote mail server after RCPT TO:<foundation@lists.openinfra.dev>: 550 Requested action not taken: mailbox unavailable15:55
fungihah! my local edits forgot about the forwarding aliases15:55
fungilemme add those quickly15:55
clarkbah ok you sent to the old address and that failed due to missing aliases makes sense15:55
fungiright, i wanted to test the forwarding aliases, exactly what i forgot to hand-apply15:56
fungii'll make a mental note to adjust system-config-run-lists3 to also log the /etc/aliases file in future15:58
fungii've temporarily stopped exim again while i'm working to fix this bit15:58
fungiokay, /etc/aliases.domain has been edited to match what 936303 should generate, and i'm starting exim again16:04
fungialso 936303 is passing tests, so i've un-wipped it16:08
fungigoing to try re-sending my message to the foundation ml now16:09
fungifoundation@lists.openinfra.dev host lists.openinfra.dev [2001:4800:7813:516:be76:4eff:fe04:5423] SMTP error from remote mail server after RCPT TO:<foundation@lists.openinfra.dev>: 550 Unrouteable address16:16
fungiboth lists.openinfra.dev and lists.openinfra.org are in the local_domains list in exim4.conf, and foundation@lists.openinfra.dev is aliased to foundation@lists.openinfra.org in the aliases.domain file16:17
clarkbmaybe its unroutable beacuse it asked mailman to take delivery and mailman refused?16:18
clarkbthough I guess we have exim logs to check first right?16:18
fungii'm digging into the logs on lists01 now to see if there's more detail16:18
fungiyep16:18
corvusexim -d -bt foundations@lists.openinfra.dev16:18
fungi2024-12-02 16:13:53 H=azathoth.yuggoth.org [2001:4802:7802:102:be76:4eff:fe20:6e0c] X=TLS1.2:ECDHE_SECP256R1__RSA_SHA256__AES_256_GCM:256 CV=no F=<fungi@yuggoth.org> rejected RCPT <foundation@lists.openinfra.dev>: Unrouteable address16:18
corvusfungi: i'm just catching up, but how is it supposed to route?16:19
fungithe entry in /etc/aliases.domain should reroute it to foundation@lists.openinfra.org and hand it off to the mailman_router16:20
fungifile check: /var/lib/mailman/core/var/lists/${local_part}.${domain}16:20
fungiexpanded file: /var/lib/mailman/core/var/lists/foundation.lists.openinfra.org16:20
fungistat() yielded -116:20
fungiyeah, so we do need to move some files16:20
fungii'm going to stop exim and mailman containers again for a moment while i adjust those16:21
fungidid a `find /var/lib/mailman/ -name "*openinfra.dev*"` and other than old pipermail archives (which i need to reevaluate whether they still go to the right place now that we have blanket redirects), it's just the perlist directories under /var/lib/mailman/core/var/lists/ that need renaming16:23
corvus++16:24
fungiokay, those have been moved, services are back up again, will try to send a message once more16:35
clarkbI got it16:39
fungias did i. it made it into the archive too: https://lists.openinfra.org/archives/list/foundation@lists.openinfra.org/thread/QQ6CUBG337L3UP7FXKLWTCHBJISTQBOC/16:39
fungithe new domain shows up as desired in relevant headers for list id and urls16:41
fungii think we're all set to approve 936303 at this point and get the ssl cert in place16:41
clarkbrereviewing now16:44
clarkbfungi: whats the deal with the updated rewrite rules? I guess the old ones just didn't work?16:45
fungiyeah, my reading of the N option was apparently wrong, something was causing it to not get applied16:46
fungiand also i needed to make the match string explicit there, %{SERVER_NAME} didn't work though maybe %{HTTP_HOST} would have16:46
fungiand i made the http vhost rewrite more thorough since otherwise it would still have resulted in a second redirect to clients16:47
JayFfungi: I think the bounce processing stuff is busted? I just got an email to -owners that *my own address* was removed from the list16:48
clarkbfungi: +2 from me16:48
JayFif I, as one of the active admins of the list, got my stuff disabled; wouldn't that mean we might have disabled a bunch of other valid members, too?16:49
fungiJayF: yeah, i wanted to look at that next today. i ran into the same thing over the weekend, it seems my bounce score got incremented because my mailserver rejected a message for openstack-discuss-owner about a message held for moderation which included spam content16:49
JayFHonestly I kinda would suggest, if possible, we revert *all* the bounce removals since the change. It's already hard enough to communicate with users when we don't have bugs that kick 'em off the list.16:50
JayFif literally list admins can't stay subscribed, who can?16:50
fungibut also there are subscribers getting disabled because of dmarc signature mismatches for posts from people at cisco and fujitsu, both of these issues suggest that the separate verp probes aren't being applied correctly. hoping to look into it as soon as i'm done withthis maintenance16:51
JayFYeah, this reinforces to me that we should, if possible (and I know this is easy to say as someone who wouldn't have to be doing the work) revert the removals of recent16:51
fungiand yes, i can reenable all the disabled subscriptions too16:51
JayFthank you :)16:51
clarkbso can JayF 16:51
clarkbits in the members list I believe and entries can be toggled back over16:52
JayFI can do it manually, one at a time16:52
JayFis there a bulk way to do it?16:52
fungii may be able to script something more conveniently, it's hundreds of subscribers at this point16:52
clarkbbut also I think this points to maybe being less aggressive in bounce processing rather than disabling it? Trying to deliver hundreds of emails a year to invalid addresses is problematic for other reasons16:53
fungithe bulk of whom probably are defunct addresses, but without manually parsing the ndr associated with each one's disablement it's hard to know16:53
JayFMy suggestion to revert for now doesn't say we shouldn't re-enable after making it less aggressive, just that we clearly have false positives that we should cleanup first imo16:55
clarkbsure16:55
opendevreviewJeremy Stanley proposed opendev/system-config master: Fix old openinfra.dev Pipermail archive rewrites  https://review.opendev.org/c/opendev/system-config/+/93686017:04
fungiminor followup fix ^17:05
fungii should probably add a regression test for that too17:09
fungimmm, though those only work when the files exist, which they won't on our test nodes because they're the result of manual import work, so not really easy to test without adding mock archive data17:13
fungias soon as 936303 merges, and assuming we're not already in an hourly prod run, i'll take lists01 back out of the disable list so we will hopefully get the letsencrypt update on it as immediately as possible17:22
clarkbfungi: if you get a chance can you review 936305 I'd like to land that today and restart gerrit on 3.9.8 so that I can rerun through testing using the latest images this week17:43
fungiyou bet17:45
opendevreviewJames E. Blair proposed zuul/zuul-jobs master: Add mirror-container-images role and job  https://review.opendev.org/c/zuul/zuul-jobs/+/93557417:47
opendevreviewMerged opendev/system-config master: Move OpenInfra mailing lists to new domain  https://review.opendev.org/c/opendev/system-config/+/93630317:50
fungiremoved lists01 from the disable list17:58
fungiinfra-prod-letsencrypt is running now18:01
fungihttps://lists.openinfra.org/ has a working cert now18:13
clarkband hopefully we didn't recreate the old name and old lists :)18:13
clarkbI doubt we did18:13
fungii need to dig through the deploy log on bridge before i'm sure18:14
fungibut was waiting for infra-prod-service-lists3 to finish (just completed)18:14
fungithe task to write /etc/aliases.domain changed, presumably because the entries i hand-added didn't end up in the same order as ansible would have sorted them18:33
fungisimilarly for the docker-compose file which subsequently triggered a pull and restart18:33
fungithe vhost config got updated too, temporarily reverting my manual application of https://review.opendev.org/93686018:35
clarkbthat all sounds like expected actions though?18:35
fungiso far, yes, i'm still reading...18:36
fungiwhich triggered an apache reload18:36
fungiso other than some checks that reported changed: true, that was it. looks right18:37
fungiand that's it for the maintenance18:38
fungiapproved 936305 for the gerrit point releases18:39
clarkbthanks18:41
clarkbwill plan to do the restart sometime in my afternoon as things calm down18:41
clarkband then tomorrow after meetings my intention is to rerun through the upgrade process with the new images, update notes as necessary, and finish up any remaining todos. I think I need a change to update the image we deploy with ansible for example18:42
opendevreviewJeremy Stanley proposed opendev/system-config master: Enable extra VERP probes in Mailman  https://review.opendev.org/c/opendev/system-config/+/93687319:11
fungiinfra-root: JayF: ^ that's what we wanted all along, i think. for some reason i thought that was on by default19:13
fungialso i've closed out the root screen session on lists01, but the transcript from it is saved as ~root/screen.lists01.maintenance.2024-12-02.log19:14
clarkb+2 from me19:15
fungihttps://lists.mailman3.org/archives/list/mailman-users@mailman3.org/message/4YFOLEIPOXPZD4Q6H73LT4RIAQPRNDIE/ looks like what we need for reenabling subscriptions on openstack-discuss19:19
fungii'll start on that in parallel19:19
fungi#status log Reenabled 325 subscriptions which had been recently disabled for excessive bounces on the openstack-discuss mailing list19:27
fungiJayF: ^19:27
opendevreviewMerged opendev/system-config master: Fix old openinfra.dev Pipermail archive rewrites  https://review.opendev.org/c/opendev/system-config/+/93686019:30
opendevreviewMerged opendev/system-config master: Update Gerrit images to 3.9.8 and 3.10.3  https://review.opendev.org/c/opendev/system-config/+/93630519:30
fungionce 936873 lands, any new subscription disablements should indicate legitimately defunct addresses19:30
JayFAwesome, thanks. 19:43
fungihuh, the jjb maintainers are continuing to develop it and tag new releases (according to the deprecation warning we got that jenkins-job-builder-6.4.2.tar.gz is a non-normalized sdist filename)19:57
clarkbfungi: its still quite popular I think20:14
fungithat's pretty awesome20:15
clarkbimage promotion succeeded for 936305 so I think we can update to the latset 3.9.8 image when we are ready20:15
clarkbfungi: on the mailman side of things are we sufficiently caught up that a gerrit restart at ~21:30 should be fine?20:17
clarkbthinking if we get gerrit done by then maybe we can also update lodgeit20:17
fungiclarkb: absolutely, though getting 936873 merged soonish would be good if any other infra-root has a moment to look it over20:17
fungiand yeah, getting lodgeit knocked out too would be amazing20:18
clarkbI feel liek the verp setting is straightforward enoguh we could just go ahead and approve that20:18
clarkbany objections?20:18
fungicorvus sometimes has opinions on mailing list management. i think more generally though we were all operating under the assumption that this was the default behavior in mm3 (possibly my fault for misinterpreting documentation and/or mailman-users discussions)20:20
fungithanks frickler!20:23
corvussgtm20:24
clarkbI've just noticed that our screenshots of gerrit 3.9.8 show we're missing some font glyphs for the arrows showing you what change you are on in the change relation chain20:27
clarkbI don't think that is a major issue though and half suspect it isn't new with 3.9.8 either20:27
clarkbwe render one of the utf8 blocks with the code inside within the test env and simply adding more fonts would likely fix it20:28
clarkbI'm finding my held node now to confirm I don't see the same on my local browser with 3.9.820:28
clarkbconfirmed not an issue with my local browser so must be a problem with the test env20:29
clarkbopendevorg/gerrit   3.9       9ba502024800 <- this is the docker image we're currently running for gerrit 3.9.7.x21:07
funginoted21:08
clarkbGetting ready to do a quick update to the new 3.9.8 image. Does this look good #status notice We are updating Gerrit to the latest 3.9 version in preparation for Friday's Gerrit upgrade to 3.10. You may notice a short outage of Gerrit.21:08
clarkbThen also last time we did an upgrade it was asked that we announce it a lot more to remind people a day or so in advance so I'll probably #status notice and send antoher email on Thursday if it looks like we're going to proceed21:09
fungisure, wfm, though i usually lead with the fact that the service is going offline momentarily (followed by reasons)21:09
clarkbhow about this #status notice Gerrit will have a short outage while we update to the latest 3.9 release in preparation for our 3.10 upgrade on Friday21:10
clarkbthats a bit shortere and more to the point21:10
clarkbI've started a screen on review02. I'll send that revised notice at 21:30 UTC then proceed with a pull, verification of hte new image, down, mv of waiting queue, then up -d21:12
fungiloks great, yep21:15
fungilooks21:15
fungiattached to the screen session21:16
clarkbI went ahead and pulled the image as I can do that first and docker inspect shows that image matches https://hub.docker.com/layers/opendevorg/gerrit/3.9/images/sha256-cea988456b9c158ba9920b745e3484c2ba76e1ea7125468e5ebcb7204b146aa721:26
clarkb#status notice Gerrit will have a short outage while we update to the latest 3.9 release in preparation for our 3.10 upgrade on Friday21:30
opendevstatusclarkb: sending notice21:30
-opendevstatus- NOTICE: Gerrit will have a short outage while we update to the latest 3.9 release in preparation for our 3.10 upgrade on Friday21:30
clarkbI wonder if we can make the bot go faster21:33
* clarkb waits patiently21:33
opendevstatusclarkb: finished sending notice21:34
clarkbok proceeding now21:34
clarkbINFO  com.google.gerrit.pgm.Daemon : Gerrit Code Review 3.9.8-1-gcf4c706eb8-dirty ready21:35
clarkbweb loads for me but I'm still waiting for diffs to show up again21:36
clarkbfungi: there is a traceback trying to discover the openid endpoint. In an incognito tab I do get it to redirect all the way to ubuntu one just fine but haven't logged in yet (I'll work on that after diffs show up) any chance you can login with your extra user too to check that works as epxected?21:38
clarkbI half suspect the problem is related to the link involved (it was using gitiles?)21:38
fungilooking21:38
clarkbok I do get diffs now21:38
clarkbwe don't even publish gitiles links anymore too so I wonder if that is a bot21:39
fungii was logged out of the webui already, so logged in with my normal account21:39
fungiit redirects back to https://review.opendev.org// without logging me in21:39
clarkbya I see a few more of those errors in the log now arg21:40
clarkbok so now what :/21:40
fungitrying with my secondary test accound21:40
fungiaccount21:40
clarkbit says cannot discover openid then lists the openid path21:41
fungimmm, i get https://review.opendev.org/SignInFailure,SIGN_IN,Contact+site+administrator but it's possible i previously disabled that account for a test21:41
clarkbwhich I'm wondering if those don't line up with external account ids for some reason21:41
fungiwere there changes to the openid functionality?21:41
clarkbhttps://www.gerritcodereview.com/3.9.html#398 not that I was aware of21:42
fungimaybe we were testing too soon? my normal account is logged in now and getting the normal dashboard21:42
clarkbI wonder if it is related to the account caches21:42
fungiooh, mayhaps21:42
clarkbI restarted my tail of the log to see if we still get those21:43
clarkband now I'll try to login in a different or incognito browser21:43
clarkbfungi: it redirected me to // but I was logged in21:46
clarkband since I restarted my tail only hte what i think is bogus gitiles link has exploded not other people (nor myself)21:46
fungiyeah, having a bit of deja-vu about the extra trailing /21:46
clarkbI'm going to log out again and then login on a change page and see what happens21:46
clarkbstarting from a change page it wasn't happy then I logged in again and then it worked21:49
clarkband the openid it says it cannot discover in the error_log message seems to match what I have listed under my identities21:49
fungithat's definitely weird... maybe something is cached browser-side initially and not matching up?21:51
clarkbthis was in an incognito tab so shouldn't be using teh cache I don't think21:51
clarkband yes I feel like we've run into something similar in the past21:51
clarkbI want to say its something like gerrit caches openid stuff on startup and if the remote isn't happy when you get problems?21:52
clarkbI don't remmber the specifics or feel confident in that only to say I think we have hit something similar in the past and I feel like the solution wasn't really a fxi fix just somethign to live with21:52
clarkbtracing the back and forth between openid server and gerrit the openid server sends us back to gerrit openid login url then that 302's to location: https://review.opendev.org//21:57
clarkbI feel like maybe something changed to where gerrit is redirecting to / explicitly and maybe we're not proxypassing that properly? I don't see anything obviously wrong in the login handshake but it does send us explicitly to //22:00
clarkbI would've expected it to send us to whatever page we were on before22:00
clarkbya the return_to argument is https://review.opendev.org/OpenIDstuff22:01
clarkbso its gerrit making the decision to redirect to // from there and in the process landing us in a weird page for the user22:02
fungiwe could try dropping the trailing / from the destinations in our ProxyPass and ProxyPassReverse directives, but mostly stabbing in the dark now22:03
fungiat least it's not hard to test if it's only apache config adjustments22:05
clarkbI think the fail to discover thing is maybe just noise22:06
clarkbI just did a new login and got that traceback to trip but it let me in. Didnd't change the behavior re // but I was able to log in22:06
clarkboh wait maybe I didn't log in22:06
clarkbthe failure to discover is a timeout error to the openid server22:06
opendevreviewMerged opendev/system-config master: Enable extra VERP probes in Mailman  https://review.opendev.org/c/opendev/system-config/+/93687322:07
clarkbthe annoying thing is this seems to work the vast majority of the time (except for the // problem) which is making it hard to undersatnd what is noise and what isn't22:09
clarkbI think that what may be happening with the discovery errors is when you log in after the dance gerrit fetches your openid info from the backend to populate things like name and so on22:10
clarkbname email? and that can fail for existing users and its fine but new users might fail hard?22:10
clarkbbut it also doesn't happen 100% of the time as I can login multiple times and it only fails occasionally22:10
fungiyeah, this is weird, and i want to say it wasn't the case with the previous version... but maybe that was a matter of timing and there's something separate going on with ubuntu one openid?22:13
clarkbya fwiw canonical web url does have a /22:14
fungimmm22:14
clarkbwhich makes me wonder if our proxy pass is appending one to the reverse path22:14
clarkband maybe older gerrit was chomping the trailing / but doesn't now or something22:15
fungii'll buy that22:17
clarkbhttps://gerrit-review.googlesource.com/c/gerrit/+/44220122:18
clarkbthis is almost certainly the cause22:18
clarkbI don't know why yet but it has to do with login redirects and is new in 3.9.822:19
clarkbso ya I think we ignore the discovery problem that we see tracebacks for for the moment under the assumption it is the ubuntu side having an occasional sad and now focus on understanding ^ to address login redirect ebhavior22:19
clarkband this will affect 3.10 too so I guess good to rip the bandaid off22:20
clarkbhttps://gerrit-review.googlesource.com/c/gerrit/+/421238 is the original change. I was hoping it would have mroe info22:20
clarkbI half wonder if we went from going from /login/stuff to //login/stuff and then that initial / prefix is being carried all the way through the login process22:21
clarkbI'll try and trace that now22:21
clarkbits the other way around its trying to login to /login//22:23
fungioh, huh...22:25
clarkbactually it does /login/%2F22:25
clarkband it makes that same request even when logging in from a cahgne page22:25
fungiwow, so encodes the / from the login url i guess? which then doesn't get deduplicated22:26
clarkbya though looking at the change itself it seems like the /login/%2f behavior should've been there before. Its only the new prefix stuff that has changed22:27
clarkband manually logging in via navigation to /login/ produces the same // result post login22:28
clarkbso maybe this isn't related? it just seems too coincidental to not be somehow22:28
clarkbI'm going to ask upstream22:29
clarkbfungi: can you check replication is happy for 936873? that was the last thing on my todo list after I got derailed by the login fun22:30
fungioh, sure22:31
clarkbI suspect we don't need to rollback if this is the only issue22:32
fungihttps://opendev.org/opendev/system-config agrees the merge commit for that is current22:32
clarkbas it is annoying but workable22:32
clarkbok if I manually login via /login it works. /login/ and /login/%2f produce the broken https://review.opendev.org// result22:35
clarkbin that change getBaseUrl must return '/' otherwise we wouldn't be sent to /login/%2f22:38
clarkbit would still redirect to /login/ but maybe assign chomps trailing /'s or something22:39
clarkbfungi: I'm going to dive into the code and diffs between 3.9.7 and 3.9.8 to try and understand this more. My read on the situation is that this is annoying but not a fatal issue and we don't need to revert22:43
clarkbif your read is different feel free to chime in and we can work on a revert instead. However, I half expect that 3.10 is affected too based on the cherry picking of the suspected change and that would mean the upgrade would need to be sorted out if we revert22:44
clarkbcompare https://gerrit.googlesource.com/gerrit/+/refs/tags/v3.9.8/polygerrit-ui/app/elements/core/gr-router/gr-router.ts#450 to https://gerrit.googlesource.com/gerrit/+/refs/tags/v3.9.7/polygerrit-ui/app/elements/core/gr-router/gr-router.ts#457 there is actually more difference22:47
clarkbhttps://gerrit-review.googlesource.com/c/gerrit/+/442162 maybe22:49
fungiyeah, pressing forward makes more sense than rolling back. if we end up with a bunch of complaints we can revisit22:56
corvusclarkb: oh are you talking about on current review.o.o?  for some reason i thought you were looking at a preview job23:10
clarkbcorvus: yes this is production after the 3.9.7 -> 3.9.8 update we just did23:11
clarkbmy goal was to get that done today so that I can test the 3.9.8 -> 3.10.3 upgrades before the upgrade Friday (hopefully as early as tomorrow to start that testing)23:12
corvusgot it.23:12
clarkbbut I'm likt 99% certain this is fallout from a fairly big refactoring upstream did around paths and changing urls and redirects23:12
clarkbso would affect us on 3.10.3 too (and maybe 3.10.2 depending if/when they backported this stuff to 3.10)23:12
clarkbI've found at least one bug in their code but its not directly related to this I don't think23:15
clarkblooking at https://review.rdoproject.org/r/q/status:open+-is:wip the sign in button /login/ path is actually capturing the query or change page info there23:24
clarkbso that when you login you go back to that spot23:24
fungithey're on 3.7.8, so fairly far back23:26
clarkbya but that was good for some new info23:26
clarkbif I hard refresh (maybe just a regular refresh will do I haven't tested yet) a change page or the path that a single / redirects you to then the sign in url path acts like the rdoproject gerrit's and appends the full path behind the login/23:27
clarkbthe problem I think ultimately is that the sign in path is not being updated to match the page you are on23:28
clarkbwhich previously it was so you're always ending up at the dashboard page or whatever23:28
clarkbits possible the // path problem has been there forever but / isn't really a valid path in gerrit it always redirects you to something else23:28
clarkbya I think the problem is that gerrit sin't updating the url to follow you as you either get redirected or navigate around23:32
clarkbbut ya I believe this to be the issue23:35
clarkband I think I can live with that for now knowing what is going on23:35
clarkbI've posted some of this debugging process upstream in discord23:35
clarkbI'll see if they want me to file an issue etc23:36
fungii guess the discord-matrix bridge for that channel died some time ago23:37
clarkbyes. I've brought it up but current gerrit membership dioesn't know how that was setup or how todebug it23:37
clarkbgerrit's upstream gerrit doesn't seem to have this problem with the sign in button23:37
clarkboh wait no it does23:38
clarkbgo to https://gerrit-review.googlesource.com and then hover the sign in button you'll see its a login// too23:38
clarkbnavigate to a change and same thing23:38
clarkbif I login there it redirects me to my personal dashboard even if I started on a change page23:39
opendevreviewVladimir Kozhukalov proposed zuul/zuul-jobs master: Respect image registry in container roles  https://review.opendev.org/c/zuul/zuul-jobs/+/93690923:43
clarkbto summarize the problem is with Gerrit's construction of the url in the 'sign in' buttons. They are lacking sufficient context to send you back to where you want to go. Since / was never really a valid url that was never expected to work but is what we end up falling back on in these cases23:47
clarkbI think this is ok. We can just ask people to live with it for a bit particularly since Gerrit 3.11 upstream also seems to exhibit the same behavior so its either downgrade and wait a while or upgrade and deal with it 23:47
clarkbfungi: any objection to me closing the root screen on review now?23:47
clarkbwe recorded the image id for 3.9.7 above if we do want to fallback (or we can rebuild onto 3.9.7 new images)23:48
clarkbbut other than the login weirdness I think this is looking ok ? and the login weirdness is understood well enough to deal with it23:48
fungiclarkb: no objection23:52
clarkbdone23:57
clarkbupstream has asked if I can test with a certain change reverted (442162) I'll work on that tomorrow as I have to sort out a meeting agenda now and send that out before dinner23:58
clarkbmy plan is to push that revert to stable-3.9 upstream then depends-on locally and hold the 3.9 system-config-run job23:58
clarkbshould be straightforward once I figure out how to push to upstream again23:58

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!