Tuesday, 2023-08-15

opendevreviewJan Gutter proposed openstack/devstack master: Update etcd version to 3.4.27  https://review.opendev.org/c/openstack/devstack/+/89135307:49
fricklergmann: kopecmartin: freezer is another global_venv victim. do we want to add symlinks to devstack for non-core projects, too, or should we expect this to be done in the project's devstack plugin? https://zuul.opendev.org/t/openstack/build/8eb8ee33b26a42229a7c0f18118a876308:27
fricklermakes me wonder how many of these near-dead projects we might have been missing. it also raises the question of who should be considered responsible for doing such adaptions08:35
kopecmartinfrickler: how many projects are we talking about? .. in general, I'd say this can be used as a push to either update the projects or retire them .. if no one complains that the projects stopped working with devstack, they're in fact dead and making any changes to support them isn't cost effective09:35
fricklerkopecmartin: how many projects are affected is difficult to tell, since they tend to not run any jobs, so no way to tell whether things are failing10:04
fricklerif you take a look at how many zuul config errors there still are remaining one year after the queue config change, and how many I had to fix for others, maybe >50% of OpenStack? https://review.opendev.org/q/topic:fix-zuul-errors10:05
kopecmartin#startmeeting qa15:00
opendevmeetMeeting started Tue Aug 15 15:00:19 2023 UTC and is due to finish in 60 minutes.  The chair is kopecmartin. Information about MeetBot at http://wiki.debian.org/MeetBot.15:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.15:00
opendevmeetThe meeting name has been set to 'qa'15:00
kopecmartin#link https://wiki.openstack.org/wiki/Meetings/QATeamMeeting#Agenda_for_next_Office_hours15:00
lpiwowaro/15:00
kopecmartino/15:01
kopecmartinlet's get to it15:03
kopecmartin#topic Announcement and Action Item (Optional)15:03
kopecmartintomorrow PTL and TC nominations start15:03
kopecmartin#link https://governance.openstack.org/election/15:03
kopecmartinlet me know if anyone wanted to be PTL for QA15:04
kopecmartinif there won't be any candidates, i'll probably continue 15:04
kopecmartin#topic Bobcat Priority Items progress15:05
kopecmartin#link https://etherpad.opendev.org/p/qa-bobcat-priority15:05
lpiwowarI would be definitely for keeping you as a PTL!:)15:05
kopecmartin:) 15:05
* kopecmartin checking the doc15:05
kopecmartinlast week we merged the 'Add option to install everything in global venvs'15:06
kopecmartin#link https://review.opendev.org/c/openstack/devstack/+/55893015:06
kopecmartinthat broke a few projects, e.g. freezer15:07
kopecmartin#link https://zuul.opendev.org/t/openstack/build/8eb8ee33b26a42229a7c0f18118a876315:07
kopecmartinhowever, it seems freezer isn't active, there haven't been updates in the project for some time15:07
kopecmartinthanks to the patch , which had impact on multiple projects, we'll see how many projects are still active and needed by users15:08
fricklerit also broke nova, neutron and glance15:08
fricklerbut I fixed those already15:08
kopecmartinright, thanks for that15:08
kopecmartinif anyone hits a similar issue like the one above, please, let us know in this channel15:09
kopecmartine.g.15:09
kopecmartin    FileNotFoundError: [Errno 2] No such file or directory: 'freezer'15:09
frickleror "command not found", although we have that way too often even in passing jobs, too15:10
kopecmartinthe patch which added debian-bookworm job to devstack was merged as well15:10
kopecmartin#link https://review.opendev.org/c/openstack/devstack/+/887547/715:10
kopecmartindoes it mean this priority item is finished? - 'Support for Debian 12 (bookworm) in devstack'15:10
fricklerand gmann proposed a PTI patch that will make it the default platform for 2024.115:11
fricklerI think it is finished module the ongoing cleanup work, yes15:11
frickler*modulo15:12
kopecmartinnice15:12
kopecmartin#topic Gate Status Checks15:13
kopecmartin#link https://review.opendev.org/q/label:Review-Priority%253D%252B2+status:open+(project:openstack/tempest+OR+project:openstack/patrole+OR+project:openstack/devstack+OR+project:openstack/grenade)15:13
kopecmartinno patches there15:13
kopecmartinanything urgent to review?15:13
lpiwowarThis is not urgent => https://review.opendev.org/c/openstack/tempest/+/890798 ... But if someone has time to comment and do the review it would be great:)15:14
kopecmartinsure15:15
lpiwowarthanks!15:15
kopecmartin#topic Bare rechecks15:17
kopecmartin#link https://etherpad.opendev.org/p/recheck-weekly-summary15:17
kopecmartinno updates since last week15:17
kopecmartin#topic Periodic jobs Status Checks15:17
kopecmartinperiodic stable full15:18
kopecmartin#link https://zuul.openstack.org/builds?pipeline=periodic-stable&job_name=tempest-full-yoga&job_name=tempest-full-xena&job_name=tempest-full-zed&job_name=tempest-full-2023-115:18
kopecmartinperiodic stable slow15:18
kopecmartin#link https://zuul.openstack.org/builds?job_name=tempest-slow-2023-1&job_name=tempest-slow-zed&job_name=tempest-slow-yoga&job_name=tempest-slow-xena15:18
kopecmartinperiodic extra tests15:18
kopecmartin#link https://zuul.openstack.org/builds?job_name=tempest-full-2023-1-extra-tests&job_name=tempest-full-zed-extra-tests&job_name=tempest-full-yoga-extra-tests&job_name=tempest-full-xena-extra-tests15:18
kopecmartinperiodic master15:18
kopecmartin#link https://zuul.openstack.org/builds?project=openstack%2Ftempest&project=openstack%2Fdevstack&pipeline=periodic15:18
kopecmartintempest-full-centos-9-stream is broken15:18
kopecmartin#link https://zuul.openstack.org/builds?job_name=tempest-full-centos-9-stream15:18
kopecmartinCould not find versioned identity endpoints when attempting to authenticate. Please check that your auth_url is correct. Internal Server Error (HTTP 500)15:19
kopecmartinalthough i'm not sure if that's the original cause15:19
fricklerAug 12 03:17:04.971803 np0034931512 devstack@keystone.service[44769]: failed to open python file /opt/stack/data/venv/bin/keystone-wsgi-public15:20
fricklerI think global_venv is still broken on c9s, so the flag needs to be set in tempest to disable it15:21
kopecmartinthanks! i'll propose the patch 15:22
fricklernot sure why tempest doesn't inherit this https://review.opendev.org/c/openstack/devstack/+/558930/38/.zuul.yaml#70315:22
kopecmartinhmm, i'll check the inheritance path15:23
fricklerdo we want a periodic job for bookworm, too? seems I didn't add one yet15:24
kopecmartinhm, do we have other distros in periodic too? 15:25
kopecmartinthat reminds me, i should add bookworm to the distros check in the agenda15:25
fricklerwell we have this centos job15:26
kopecmartinright, i guess it won't hurt 15:27
kopecmartin#topic Distros check15:28
kopecmartincs-915:28
kopecmartin#link https://zuul.openstack.org/builds?job_name=tempest-full-centos-9-stream&job_name=devstack-platform-centos-9-stream&skip=015:28
kopecmartindebian15:28
kopecmartin#link https://zuul.openstack.org/builds?job_name=devstack-platform-debian-bullseye&skip=015:28
kopecmartinfocal15:28
kopecmartin#link https://zuul.opendev.org/t/openstack/builds?job_name=devstack-platform-ubuntu-focal&skip=015:28
kopecmartinrocky15:28
kopecmartin#link https://zuul.openstack.org/builds?job_name=devstack-platform-rocky-blue-onyx15:28
kopecmartinopenEuler15:28
kopecmartin#link https://zuul.openstack.org/builds?job_name=devstack-platform-openEuler-22.03-ovn-source&job_name=devstack-platform-openEuler-22.03-ovs&skip=015:28
kopecmartinall seems more or less good15:30
kopecmartin#topic Sub Teams highlights15:30
kopecmartinChanges with Review-Priority == +115:30
kopecmartin#link https://review.opendev.org/q/label:Review-Priority%253D%252B1+status:open+(project:openstack/tempest+OR+project:openstack/patrole+OR+project:openstack/devstack+OR+project:openstack/grenade)15:30
kopecmartinno patches15:30
kopecmartin#topic Open Discussion15:30
kopecmartinanything for the open discussion?15:30
lpiwowarI have one not that important question. 15:31
lpiwowarWould it be OK to extend account-generator to generate system scoped admin? 15:31
lpiwowarCurrently, it does not generate it and it causes failure in the tempest-full-test-account-py3 job. 15:31
lpiwowar#link https://baadd3700d0f959a2148-132839ae31c66569d89d81463638a953.ssl.cf1.rackcdn.com/890798/7/check/tempest-full-test-account-py3/1e6521b/testr_results.html15:31
kopecmartinhmm, system-scoped admin , isn't it deprecated? 15:32
fricklerthat url looks baadd ... *scnr*15:32
lpiwowarSo we should drop the system scouped admin in the test that is failing? 15:32
kopecmartin:D15:32
lpiwowar:D15:33
* kopecmartin trying to find the srbac doc, one sec15:33
fricklermaybe ask gmann as our srbac specialist15:33
lpiwowarfrickler: +1 15:33
kopecmartintry to check this, maybe there's something which would suggest what to do with that15:35
kopecmartin#link https://governance.openstack.org/tc/goals/selected/consistent-and-secure-rbac.html15:35
lpiwowarYes, I know this doc. I will try to take a look at this again:).15:35
kopecmartinphase115:37
kopecmartin... "Keeping everything as project scoped will make sure to fail the operations performed with a system scoped token (which does not have project_id) early with 403 instead of failing it with 500 in the lower layer." ...15:37
kopecmartinthat refers to system-scoped token in general, does it include the admin one too? my assumption would be yes15:38
kopecmartinbut better to double check15:38
kopecmartinoh, can't the test fail because of that ^^15:38
kopecmartin?15:38
lpiwowarThe test is failing because of missing system scoped admin creds in the accounts.yaml. 15:39
lpiwowarhttps://opendev.org/openstack/tempest/src/branch/master/tempest/scenario/test_unified_limits.py#L3215:39
lpiwowar"Details: No credentials matching role: admin, scope: system specified in the accounts file"15:39
fricklermaybe the test should be skipped in that case, instead of failing?15:40
lpiwowarDynamic creds will create the system scoped admin on the go ... 15:40
lpiwowarfrickler: skipped when pre-prov creds are used?15:40
fricklerskipped when "No credentials matching role: admin, scope: system specified in the accounts file"15:41
lpiwowarfrickler: ok, that sounds good to me:)15:41
fricklerthat's it then?15:43
lpiwowarfrickler, kopecmartin: +1, thanks for the help!15:44
kopecmartinlpiwowar:  you can try to propose a wip patch to see what happens if you add 'admin' to the test's credentials 15:44
kopecmartinand meanwhile figure out what's right, not just working, solution, i'd try follow up with gmann 15:45
kopecmartinlet's move on15:45
kopecmartin#topic Bug Triage15:45
kopecmartin#link https://etherpad.openstack.org/p/qa-bug-triage-bobcat15:45
kopecmartini've recorded the numbers but haven't had time to go through the bugs :/ 15:45
kopecmartinthe numbers look stable though15:46
kopecmartinanyway, that's all from my side15:46
lpiwowarI just want to quickly mention this. Last week I talked with professor Corey we talked with at the PTG in Vancouver kopecmartin. It seems that we might have a group of students working on Tempest starting next year. I just wanted to inform everyone that we could potentially have some skilled students joining us to lend a hand :)15:46
kopecmartinin theory sounds good, in reality it will require quite a lot of work on management as the students will require guidance, however, it will be awesome if they help us with something 15:49
kopecmartinwe're always looking for new contributors 15:49
lpiwowarFrom what I heard from the professor Corey I think that the students might help us a lot:).15:50
kopecmartinthat'd be great, thanks for the initiative lpiwowar, i'm looking forward to some results  :) 15:50
kopecmartinlet's wrap it up, i still need to propose one or 2 patches before my eod (9 minutes :D )15:51
lpiwowarkopecmartin: I will keep you updated:) 15:51
kopecmartinthanks15:51
lpiwowarkopecmartin: gl :D 15:51
kopecmartinthank you everyone, see you next week 15:51
kopecmartin#endmeeting15:51
opendevmeetMeeting ended Tue Aug 15 15:51:38 2023 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)15:51
opendevmeetMinutes:        https://meetings.opendev.org/meetings/qa/2023/qa.2023-08-15-15.00.html15:51
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/qa/2023/qa.2023-08-15-15.00.txt15:51
opendevmeetLog:            https://meetings.opendev.org/meetings/qa/2023/qa.2023-08-15-15.00.log.html15:51
gmannlpiwowar: for RBAC, system scope are dropped except ironic and keystone. do you need that for those? 15:58
opendevreviewMartin Kopec proposed openstack/tempest master: Remove Glance v1 APIs tests  https://review.opendev.org/c/openstack/tempest/+/89059215:58
dansmithare we running out of disk space on workers? https://zuul.opendev.org/t/openstack/build/cb4edb50665247f99e0111f96817190f/log/job-output.txt#2151916:02
opendevreviewMartin Kopec proposed openstack/tempest master: Centos-9 isn't yet compatible with global_venv  https://review.opendev.org/c/openstack/tempest/+/89151716:03
gmannlpiwowar: we can support that in account generator file but that need more fixes than just system scope16:05
dansmithhere's a different ENOSPC manifestation: https://zuul.opendev.org/t/openstack/build/3c3e8c2364a94ccfb155d5e9829a4b4716:21
elodillesgmann kopecmartin : if you could have a quick look at this rocky-eol patch, that would be awesome: https://review.opendev.org/c/openstack/releases/+/88794816:25
clarkbdansmith: those errors are on the zuul side since logs have to be copied from test machines to the executors in order to upload from there. Zuul does limit per build disk usage but I think that isn't with quotas and is instead via a monitor that will kill builds early instead so you'd get a different error in that case16:26
dansmithclarkb: okay I dunno what to make of that :)16:26
clarkbbasically someone with zuul admin access needs to go look at those servers cc fungi16:27
clarkblooking at disk utilization of ze12 and ze10 graphs there is a definite jump in usage16:27
clarkbzuul itself won't have upgraded yet for the week so this is unlikely to be changes to the zuul service and instead is some sort of change on the job side using more disk16:27
clarkbI haven't had breakfast yet or loaded ssh keys so will be a bit before I can look more closely16:28
clarkblooks like a wide spread thing across all executors though16:28
lpiwowargmann: The failing test works with keystone and glance. I'm not sure whether system scope is really needed there. I can check.16:30
dansmithclarkb: ack thanks16:33
gmannlpiwowar: glance does not need system scope. keystone does.16:50
gmannelodilles: thanks. done16:54
-opendevstatus- NOTICE: Zuul job execution is temporarily paused while we rearrange local storage on the servers16:55
elodillesgmann: thanks too!16:56
-opendevstatus- NOTICE: Zuul job execution has resumed with additional disk space on the servers17:44
fungidansmith: hopefully no more enospc errors17:52
dansmithfungi: many thanks17:53
fungiwhen replacing the executors some time back we missed that we had the service data on a separate disk and didn't set the new servers up that way, so they've been chugging along putting it on the 40gb rootfs. surprisingly it only just now started to become a problem17:54
dansmithI'm sure we've all been there :P17:54
funginow the executors have a dedicated 70gb filesystem for that data17:55
fungiwe rejiggered all 12 in under an hour. teamwork makes the dream work17:55
fungithankfully the executors have a disk utilization governor, so for the most part they avoided taking builds (which was severely limiting throughput but also at least meant very few builds that ran failed to have space)17:57
fungion https://grafana.opendev.org/d/21a6e53ea4/zuul-status you can see in the executors graph where they stopped accepting most builds around 14:50 in order to avoid running out of disk18:00
fungialso the used hdd graph tells most of the story18:00
fungiin https://grafana.opendev.org/d/21a6e53ea4/zuul-status?orgId=1&viewPanel=26&from=now-90d&to=now you can see where the executors were replaced in the first two weeks of july, we just didn't notice the jump in the graph18:03
melwittjust saw an interesting fail on one of my patches "rsync: [receiver] write failed on "/var/lib/zuul/builds/690f066ff653490e8e0deaba0291e2a6/work/logs/controller/logs/devstack.journal.gz": No space left on device (28)"18:12
clarkbmelwitt: ya thats the issue dansmith was talking about above. It should be better now. If you see those occuring with timestamps after ~17:37UTC let us know18:15
melwittclarkb: cool, thanks. I searched for "no space" in my client and it didn't come back with anything so I thought it was new .. mah bad18:18

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!