Thursday, 2026-06-18

opendevreviewdaniel.pawlik proposed openstack/ci-log-processing master: Fix Docker image upload in post pipeline  https://review.opendev.org/c/openstack/ci-log-processing/+/99387907:40
opendevreviewMerged openstack/ci-log-processing master: Fix Docker image upload in post pipeline  https://review.opendev.org/c/openstack/ci-log-processing/+/99387908:12
dpawlikhi fungi clarkb Could you check what is the issue that the ci-log-processing-upload-image job does not work? https://zuul.opendev.org/t/openstack/builds?job_name=ci-log-processing-upload-image&skip=0 08:44
dpawlikI guess it might be something related to secret, but if you can take a look what is on the executor side, would be great 08:45
dpawlikI would like to update the logscraper/logsender today, next week I would like to perform opensearch upgrade to 3.5 as AWS console suggested 08:45
dpawlikI built that on the host. So now would be good just to update on Dockerhub :)08:53
mnasiadkadpawlik: there was an earlier discussion about users of the OpenSearch platform - do you know any?08:56
dpawlikProbably melwitt is still using it, also other team related to network, maybe slaweq 09:05
dpawlikmaybe ykarel_ or sean-k-mooney09:09
dpawlikit's just guessing09:09
sean-k-mooneymnasiadka: i use it often to debug ci failure and understand if its a one of or persitent failure09:18
mnasiadkaOk, so that’s purely RedHat - I was just interested :)09:18
sean-k-mooneyoftend in this case is perhaps once a month09:18
mnasiadkaDo you have any tools for that?09:18
sean-k-mooneyso09:19
sean-k-mooneyhttps://opensearch.logs.openstack.org/_dashboards/app/data-explorer/discover/#?_g=(filters:!(),refreshInterval:(pause:!t,value:0),time:(from:now-30d,to:now))&_a=(discover:(columns:!(hosts_region,project),interval:d,sort:!()),metadata:(indexPattern:'94869730-aea8-11ec-9e6a-83741af3fdcd',view:discover))&_q=(filters:!(('$state':(store:appState),meta:(alias:!n,disabled:!f,index:'09:19
sean-k-mooney94869730-aea8-11ec-9e6a-83741af3fdcd',key:project,negate:!f,params:(query:openstack%2Fnova),type:phrase),query:(match_phrase:(project:openstack%2Fnova)))),query:(language:kuery,query:'message:%22Cursor%20needed%20to%20be%20reset%22'))09:19
sean-k-mooneywell https://opensearch.logs.openstack.org/_dashboards/app/data-explorer/discover/09:19
sean-k-mooneyis the upstream opensearch09:19
sean-k-mooneythat is open to everyone09:19
sean-k-mooneyits not redhat sepcific09:19
sean-k-mooneywe used ot have elasticsearch and logstash before it09:20
mnasiadkaOh boy, I love that query language09:20
sean-k-mooneythe login if you dont know it the super secure openstack openstack09:21
sean-k-mooneymnasiadka: so ya you can use it to look at basiclly the last 10 days of ci logs09:22
sean-k-mooneyso when i see contibtors recheckign becasue of X more then once ot tweice in a few days09:22
sean-k-mooneyi will often see if i know why and if i want a sense for how bad the bleeding is or when it may have started ill check openserch to determin that09:23
mnasiadkaGreat to see it’s used, but the entry bar is not that low ;-)09:24
sean-k-mooneyya its not very visabel that it exits 09:25
sean-k-mooneyyou kind fo have to been around long enouch for some one else to tell you and it does not come up often09:25
sean-k-mooneyusuesll i will convert a dashboard to a tinyurl https://tinyurl.com/2k63h77v09:26
sean-k-mooneyand share that in irc when im debuging something09:26
sean-k-mooneythat issue will be fixed by https://review.opendev.org/c/openstack/nova/+/90078309:28
sean-k-mooneythat just an example of me lookign at a passing job and seeing tracebacks and going that shoudlnot happen09:29
sean-k-mooneythen using opensrach to see how often09:29
sean-k-mooneyin that particalar case it came up while i was lokign at a failure but that traceback was not relevent which made it harder to figureout what the actual issue was09:30
mnasiadkadpawlik: secret decryption failed09:32
mnasiadkadpawlik: https://paste.opendev.org/show/b2lAzJ3k3wlia6QJjbg5/09:33
sean-k-mooneyfrickler: ya i almost alwasy share the urls via a shortenere but it woudl be nice if it was built in.09:33
sean-k-mooneyfrickler: clarkb  if it didnt exist it would not fundementally break my workflow but i woudl jsut stop tryign to use data to determin if an issue is happening often or not and jsut go by how often i see it09:34
sean-k-mooneyi might be special in that if zuul is red on a cahnge i review  even if its a non voting job i almost alwasys click in and at least chcek why09:35
sean-k-mooneyas in even if the patch is not +2w's ill quickly check to see if i can bring that context back into the review whne relevent even if its to just tell the contibtor (ignore this job failre it whas because of X know issue)09:37
dpawlikthanks mnasiadka. Thanks for confirmation 09:46
mnasiadkanp09:48
opendevreviewdaniel.pawlik proposed openstack/ci-log-processing master: Update registry credentials  https://review.opendev.org/c/openstack/ci-log-processing/+/99388809:56
dpawliklet's see if my comment "secret encrypted with opendev/system-config repo" is actual09:57
opendevreviewMerged openstack/ci-log-processing master: Update registry credentials  https://review.opendev.org/c/openstack/ci-log-processing/+/99388810:34
dpawlikmnasiadka still it fails.10:48
dpawlikCould you check if issue is with decryption? If yes, now I will take  openstack/ci-log-processing  and remove the comment10:49
opendevreviewdaniel.pawlik proposed openstack/ci-log-processing master: Re-encrypt registry password with openstack/ci-log-processing  https://review.opendev.org/c/openstack/ci-log-processing/+/99389610:52
opendevreviewMerged openstack/ci-log-processing master: Re-encrypt registry password with openstack/ci-log-processing  https://review.opendev.org/c/openstack/ci-log-processing/+/99389611:20
opendevreviewGuillaume Boutry proposed openstack/project-config master: sunbeam: retire all single charm repositories  https://review.opendev.org/c/openstack/project-config/+/90366611:37
mnasiadkadpawlik: I see some recent runs of that job passed?11:51
dpawlikindeed! Thanks Michal12:07
opendevreviewSeongsoo Cho proposed openstack/openstack-zuul-jobs master: Add Weblate client Ansible role  https://review.opendev.org/c/openstack/openstack-zuul-jobs/+/99143213:09
fungidpawlik: it looks to me like the cause is that you missed https://lists.opendev.org/archives/list/service-announce@lists.opendev.org/thread/WBBLBI6ZS6FA6Q5ZMH4C2MWPL3WG3H24/13:30
fungiso it's been broken since january13:30
fungii guess nobody used it in the past 5 months (or they got stale/misleading results to their queries)13:30
fungiaccording to git history, today was the first time that secret was updated in years13:31
fricklerI guess this is a good moment to remind people not only to re-encrypt the old credentials, but disable those and create and upload new ones13:35
sean-k-mooneyfungi: wait are you saying that when i have been using opensearch for the last 6 months the data was all old?13:41
sean-k-mooneybecause i was seeign result form more recent then that i.e. Jun 16, 2026 13:42
fungisean-k-mooney: maybe it didn't rely on that job working then13:42
sean-k-mooneyya im not sure checkign now woudl not really show either way if its now working13:43
sean-k-mooneybut i gueslly query for the last 30 days13:43
sean-k-mooneyand typically get back 10 days worth of results13:43
sean-k-mooneyi think that woudl have retued nothing if that was entrily broken13:43
fungiyeah, so it was the ci-log-processing-upload-image job's dockerhub credential, i guess what broke was the ability to push new versions of the docker image, for which there had been none needed anyway13:44
sean-k-mooneyah that makes sense13:44
fungiwhich means the service was probably fine, just couldn't be updated13:44
sean-k-mooneyso it was getting the data fine but we could not update it13:45
fungii initially thought it was something like a write credential for pushing log data into opensearch, but i guess that's handled elsewhere13:45
sean-k-mooneywe had a leak or zuul secrete cve backin january right?13:45
sean-k-mooneyso ye rotatted the base zuul salt13:46
sean-k-mooneyand invlaidate all the secrete if i recall properly13:46
sean-k-mooneyso https://review.opendev.org/c/openstack/ci-log-processing/+/993896 was just updating them13:47
fungiyes, security vulnerability that could have exposed the per-project encryption keys for secrets, which we fixed but had no way to be sure they hadn't ended up in someone's hands first13:47
sean-k-mooneyhence frickler's reminded to jost use upload the same one with the new public key13:47
sean-k-mooneybut to also roate it on the backend13:48
fungiwhich is why in the ml post i linked above, we encouraged everyone to take the opportunity to also reset any credentials or keys that had been encrypted as secrets13:48
fungias an added precaution13:49
sean-k-mooneyfungi: on an unrelated note i discoverd my zuul ci didnt have file system currption becuase the pod was using the empty dir k8s volume type meaning the git caches were drop on pod recreates13:50
sean-k-mooneyfungi: so my issue was with tryign to clone nova or cinder a 34Kbps13:51
fungioof, that sounds... painful13:51
sean-k-mooneyim currently debating fixing that13:52
sean-k-mooneyor jsut redinstalling those servers with openstack13:52
sean-k-mooneyproxmox is not bad but there is no zuul/nodepools supprot and if i ever want to do anyting that cant run in a contianer like devstask my current setup cant accomidate that13:53
fungiwriting a proxmox driver for zuul-launcher probably wouldn't be all that hard13:54
fungithough i don't know enough about the platform to say that for sure13:55
opendevreviewGuillaume Boutry proposed openstack/project-config master: sunbeam: retire all single charm repositories  https://review.opendev.org/c/openstack/project-config/+/90366614:19
dpawlikfungi hey, yes, I miss that mail. I have not time to make any changes in ci-log-processing, especially that I recently switched team15:19
fungiyeah, seems like it didn't have any direct impact until now, when you were updating the image15:19
melwittre: the backscroll, yes I am a OpenSearch user :)15:42
mnasiadkasean-k-mooney: I had the same problem with zuul-operator, ended up having a local fork with a lot of improvements, because the upstream project… well… doesn’t have traction :)15:48
mnasiadkaBut yes, emptydir doesn’t help15:48
sean-k-mooneymnasiadka: well in this case the main issue for my install is networking15:51
sean-k-mooneyi need to debug why its so slow15:51
sean-k-mooneybut yes im also runing a fork with 2 minor tweaks15:52
sean-k-mooneyi needed to change the ssh key permissions15:52
sean-k-mooneyand something else minor15:52
mnasiadkaCloning nova with the default zuul git config nearly always times out, I had to tune the timeouts - because the defaults are sort of aggressive, but I can understand why15:53
sean-k-mooneyhttps://github.com/SeanMooney/zuul-operator/commit/d2dc4be75a8402c8159764dc24423487add72fe4 and https://github.com/SeanMooney/zuul-operator/commit/89ce1ae82de739c25155c3262578b841efd975eb15:53
mnasiadkaAnd restarting zuul components with emptydir and losing cache is a bit meh15:53
sean-k-mooneywell yes nova is indeed a big boy15:53
clarkbmnasiadka: opendev tunes those defaults too. I don't think that is a bug but it is something you need to be aware of if dealing with larger repos15:53
sean-k-mooneyalthogh normlaly when my networkin is not borked this vm has 4-5G down and about a thent of that up15:54
mnasiadkaclarkb: let’s say running zuul is a journey, and I’ve been on one for the last 6 months I guess ;)15:54
sean-k-mooneyits actlly not so hard to do its just  a vim like learnign curve the first time15:55
sean-k-mooneyalso that kind of unfair the docker compoes is really easy15:55
sean-k-mooneytryign to run it on k8s is less so15:56
clarkbthere are definitely upsides and downsides to each approach. I do think that more people should be thinking about using simpler tools15:57
clarkbk8s is great at a certain scale. But I'm not sure that most people using it for every last thing actually benefit much15:58
mnasiadkaI started with docker compose, but I got tempted by an existing Kubernetes cluster and zuul-operator, which in the beginning worked great, but then I wanted to customize more and more - and now I have like 10 patches on top of upstream16:01
mnasiadkaAnd seriously with a running service it’s hard now to go to some other solution, because the migration will take time16:02
clarkbyup I'm speaking broadly. I don't actually have anything against k8s or people using k8s. I think that it became a default and often times it seems like overkill16:03
mnasiadkaActually Zuul running in k8s is kind of neat, because I can scale any component and k8s will scale the workers accordingly16:04
mnasiadkaBut getting there was surely some amount of work I didn’t anticipate :)16:05
mnasiadka(As in how hard can that be kind of attitude)16:05
sean-k-mooneyclarkb: i used k8s for my current ci because i wante to learn how flux worked for ci/cd16:14
sean-k-mooneyi have been very tempeted over the year to add zuul to kolla-ansible16:14
sean-k-mooneybecause that is so much simpler and easier to debug16:14
clarkbone big upside to k8s (particularly if someone else manages the cluster) is that you can stop thinking so hard about the system layer (OS upgrades etc)16:15
clarkbI think the scaling argument is less useful if you're running a static fleed of k8s nodes to make that happen (though I know you can auto scale that layer too I don't think that is as common)16:15
clarkbs/fleed/fleet/16:15
sean-k-mooneyjust so long as your not on the hook to now maintain that k8s instacne as well :)16:15
clarkbyup exactly and for many that is the case because you can pay amazon to do it pretty cheapyl16:15
sean-k-mooneyso its defnintly ceaper on power/hardware costs for me not to but if i think about my personal time i likely shouuld just pay someone else too16:16
sean-k-mooneybut on the other hand its good to have a playgorund to learn these things in lower risk envs16:17
clarkbAs for downsides there are a lot of weird behaviors that are not always apparent upfront that you learn later that make things awkward from a process perspective. Like how to resize a volume that is part of a volume claim template. Or dealing with the chowning on volume attachment. Or the default common ingress implementation suddenly not being maintained on short notice16:19
clarkbits all solvable. Its just a different set of problems to "I need to upgrade this operating system"16:20
*** haleyb is now known as haleyb|out22:49

Generated by irclog2html.py 4.1.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!