Thursday, 2026-05-07

opendevreviewMichal Nasiadka proposed openstack/ansible-collection-kolla master: CI: Switch linters to lint-requirements.txt  https://review.opendev.org/c/openstack/ansible-collection-kolla/+/98758405:46
opendevreviewMichal Nasiadka proposed openstack/ansible-collection-kolla master: CI: Switch linters to lint-requirements.txt  https://review.opendev.org/c/openstack/ansible-collection-kolla/+/98758405:48
opendevreviewMichal Nasiadka proposed openstack/ansible-collection-kolla master: CI: Switch linters to lint-requirements.txt  https://review.opendev.org/c/openstack/ansible-collection-kolla/+/98758405:48
opendevreviewIlia Petrov proposed openstack/kolla-ansible master: Fix Skyline nginx service proxy paths  https://review.opendev.org/c/openstack/kolla-ansible/+/98758705:50
opendevreviewMerged openstack/ansible-collection-kolla master: CI: Switch linters to lint-requirements.txt  https://review.opendev.org/c/openstack/ansible-collection-kolla/+/98758406:13
mnasiadkablanson[m], bbezak, frickler: https://review.opendev.org/c/openstack/releases/+/987590 gazpacho rc1 patch - please try to not merge anything in master before 2026.1 is branched :)07:25
bbezakkk07:25
opendevreviewIlia Petrov proposed openstack/kolla-ansible master: Fix Skyline nginx service proxy paths  https://review.opendev.org/c/openstack/kolla-ansible/+/98758707:34
blanson[m]ack 07:42
opendevreviewPierre Riteau proposed openstack/kayobe master: Bump Ansible collections and roles  https://review.opendev.org/c/openstack/kayobe/+/98759408:03
opendevreviewOpenStack Release Bot proposed openstack/ansible-collection-kolla stable/2026.1: Update .gitreview for stable/2026.1  https://review.opendev.org/c/openstack/ansible-collection-kolla/+/98759708:11
opendevreviewOpenStack Release Bot proposed openstack/ansible-collection-kolla stable/2026.1: Update TOX_CONSTRAINTS_FILE for stable/2026.1  https://review.opendev.org/c/openstack/ansible-collection-kolla/+/98759808:11
opendevreviewOpenStack Release Bot proposed openstack/ansible-collection-kolla master: Update master for stable/2026.1  https://review.opendev.org/c/openstack/ansible-collection-kolla/+/98759908:11
tafkamaxHi I have a question. `neutron_dhcp_agent_dnsmasq_qdhcp-5225f6f9-dc56-42e8-b0aa-d866b49b36ed` <-- when we ran stop on our containers these still remained active on compute nodes. 08:14
tafkamaxI think these are generated if dhcp is enabled on a subnet?08:14
opendevreviewOpenStack Release Bot proposed openstack/kolla-ansible stable/2026.1: Update .gitreview for stable/2026.1  https://review.opendev.org/c/openstack/kolla-ansible/+/98760008:14
opendevreviewOpenStack Release Bot proposed openstack/kolla-ansible stable/2026.1: Update TOX_CONSTRAINTS_FILE for stable/2026.1  https://review.opendev.org/c/openstack/kolla-ansible/+/98760108:14
opendevreviewOpenStack Release Bot proposed openstack/kolla-ansible master: Update master for stable/2026.1  https://review.opendev.org/c/openstack/kolla-ansible/+/98760208:14
tafkamaxIs this OK if these remaing running or should they be stopped some other way?08:14
opendevreviewOpenStack Release Bot proposed openstack/kolla stable/2026.1: Update .gitreview for stable/2026.1  https://review.opendev.org/c/openstack/kolla/+/98760308:15
tafkamaxThey are spawned with this image quay.io/openstack.kolla/neutron-dhcp-agent:2025.1-ubuntu-noble08:15
opendevreviewOpenStack Release Bot proposed openstack/kolla stable/2026.1: Update TOX_CONSTRAINTS_FILE for stable/2026.1  https://review.opendev.org/c/openstack/kolla/+/98760408:15
opendevreviewOpenStack Release Bot proposed openstack/kolla master: Update master for stable/2026.1  https://review.opendev.org/c/openstack/kolla/+/98760508:15
tafkamax* They are spawned with this image quay.io/openstack.kolla/neutron-dhcp-agent:2025.1-ubuntu-noble08:15
tafkamax* They are spawned with this image neutron-dhcp-agent:2025.1-ubuntu-noble08:15
mnasiadkayes, that’s how it should be - neutron-dhcp-agent stop should not stop the services running08:17
mnasiadkaBut I think we should have a look in some operational commands how to clean up :)08:17
tafkamaxoh okay08:18
tafkamaxOr migrate them to a second compute host?08:18
tafkamaxR/N I shutdown all services on a single compute08:19
tafkamaxfor maintenance08:19
mnasiadkaNo, neutron-dhcp-agent should manage the number of running services (whether you configure it to maintain ha config or not)08:21
mnasiadkaYou can just remove them08:21
tafkamaxok08:21
tafkamaxopendevreview: seems like a small fix, we are experiencing this aswell08:22
tafkamaxwill test it out on our test cluster08:22
opendevreviewSeunghun Lee proposed openstack/kayobe master: DNM: Test --continue-on-unreachable  https://review.opendev.org/c/openstack/kayobe/+/91051108:26
opendevreviewSeunghun Lee proposed openstack/kayobe master: DNM: Test --continue-on-unreachable  https://review.opendev.org/c/openstack/kayobe/+/91051108:27
ViiCongrats on the 2026.1 release! I think a lot of really great work went into this cycle, huge thanks to everyone involved :)08:28
opendevreviewPierre Riteau proposed openstack/kayobe master: Add release note for broken conditionals  https://review.opendev.org/c/openstack/kayobe/+/98662908:28
blanson[m]Taavi Ansper: I'll check it out, can't be merged before the rc thing, but you can backport afterward 08:30
tafkamaxWe will probably cherry pick if works on test08:31
blanson[m]how's your skyline experience btw ? 08:31
blanson[m]we'd like to deploy it, some old school people are reluctant about it. from what I've seen recently it seemed to work fine ? 08:32
opendevreviewMerged openstack/ansible-collection-kolla stable/2026.1: Update .gitreview for stable/2026.1  https://review.opendev.org/c/openstack/ansible-collection-kolla/+/98759708:34
opendevreviewMerged openstack/ansible-collection-kolla stable/2026.1: Update TOX_CONSTRAINTS_FILE for stable/2026.1  https://review.opendev.org/c/openstack/ansible-collection-kolla/+/98759808:36
opendevreviewMerged openstack/kolla stable/2026.1: Update .gitreview for stable/2026.1  https://review.opendev.org/c/openstack/kolla/+/98760308:37
fprzewoznblanson[m] I've implemented it recently - TLS enabled, paired with OIDC IdP. It worked almost fine, for SSO got issue with https://bugs.launchpad.net/skyline-apiserver/+bug/2083564 (for now I got just skyline_keystone_url override on config set to {{ keystone_public_url }}/v3/. Second issue I got was08:39
fprzewoznhttps://bugs.launchpad.net/skyline-apiserver/+bug/2133859 (resolved by using 2025.2 image). And third was https://bugs.launchpad.net/kolla-ansible/+bug/2091935 but I fixed it in https://review.opendev.org/c/openstack/kolla-ansible/+/980968 08:39
opendevreviewPierre Riteau proposed openstack/kayobe master: [WIP] Add support for --use-test-images option  https://review.opendev.org/c/openstack/kayobe/+/98760708:39
opendevreviewMerged openstack/kolla stable/2026.1: Update TOX_CONSTRAINTS_FILE for stable/2026.1  https://review.opendev.org/c/openstack/kolla/+/98760408:41
fprzewoznBut if you are planning on running 2025.2 without SSO, then it should work out of the box :) 08:42
opendevreviewSeunghun Lee proposed openstack/kolla-ansible master: Add flag for MariaDB heuristic recovery  https://review.opendev.org/c/openstack/kolla-ansible/+/96167508:42
blanson[m]fprzewozn: we plan on using sso, 2025.2 or 2026.1 hopefully nothing older for skyline the sso part would be internal use only so I guess we can educate people while this is fixed in skyline ? 08:42
blanson[m]client facing wouldn't use sso (for now)08:43
fprzewoznin SSO deployment pay attention to endpoints and firewalls, I've encountered some totally unrelated error messages where the issue was that during auth redirect it wasn't able to connect from internal interface to it's own external one 08:47
fprzewoznofc same case as for Horizon with SSO, but here you got 2 ports  08:47
opendevreviewMerged openstack/kolla-ansible stable/2026.1: Update .gitreview for stable/2026.1  https://review.opendev.org/c/openstack/kolla-ansible/+/98760008:48
opendevreviewMerged openstack/kolla-ansible stable/2026.1: Update TOX_CONSTRAINTS_FILE for stable/2026.1  https://review.opendev.org/c/openstack/kolla-ansible/+/98760108:51
tafkamax<blanson[m]> "how's your skyline experience..." <- Most ppl use it more than horizon09:00
blanson[m]Taavi Ansper: do they ? I might be living under a rock tbh 09:01
tafkamaxIn my org i mean09:02
tafkamaxusers and admin(s)09:02
blanson[m]oh ! yh I make do with horizon but it's getting old... 09:04
blanson[m]and there are some interesting new panels in skyline like the barbican one that would be really useful for clients09:04
tafkamaxI am running 2025.2 in production with SSO 09:05
tafkamaxoidc and keycloak09:05
tafkamaxI had the issue I fixed myself.09:05
tafkamaxwith the federation stuff09:05
tafkamaxand also there was a bug in skyline itself, but that was fixed quickly.09:05
tafkamaxoh fprzewozn linked the skyline-apiserver bug. Interesting that for me it works, but he still mentions its not working.09:06
opendevreviewPierre Riteau proposed openstack/kayobe master: [WIP] Add support for --use-test-images option  https://review.opendev.org/c/openstack/kayobe/+/98760709:14
fprzewozntafkamax I had this bug on 2025.1 and switched skyline to 2025.2 tag 09:32
opendevreviewPierre Riteau proposed openstack/kayobe master: [WIP] Add support for --use-test-images option  https://review.opendev.org/c/openstack/kayobe/+/98760709:42
opendevreviewFranciszek Przewoźny proposed openstack/kolla-ansible master: Allow overwriting Prometheus exporter listen addresses  https://review.opendev.org/c/openstack/kolla-ansible/+/98328109:44
tafkamaxblanson:  You had EXP with cyborg.09:50
tafkamaxI got my GPU to show under accelerators.09:50
tafkamaxI suppose I need to create a profile. But I don't know what options to put there. openstack accelerator device profile create --help09:51
tafkamaxThere are not many guides09:51
blanson[m]hum 09:55
blanson[m]let me see my notes 09:56
tafkamaxcan i specify hostname?10:02
blanson[m]$ openstack accelerator device profile create gpu_a40 '["resources:PGPU":"1","trait:CUSTOM_GPU_PRODUCT_ID_2235":"required"]' is my example10:03
tafkamaxaha10:03
blanson[m]you get the trait name from os resource provider trait list 10:04
tafkamaxHmm, could cyborg be not fully working if i get these for some CLI invocations: 'Proxy' object has no attribute 'get_attribute'10:04
tafkamaxopenstack accelerator device attribute show 10:04
*** jhorstmann is now known as Guest883810:04
blanson[m]and then in nova, os flavor set --property 'accel:device_profile=gpu_a40' gpu.a40.xlarge10:05
blanson[m]I have no cyborg capable cluster on hand to test the cli 10:06
tafkamaxaha I got CUSTOM_NVIDIA_26B9 from trait list10:09
tafkamaxi used the trait list on the GPU itself10:10
blanson[m]in resource provider list you should have something like <hostname>_0000:xx:00.0 ? 10:10
tafkamaxYes its there10:10
tafkamaxit had two traits that and CYBORG_OWNER10:10
blanson[m]then trait list on this you should have CUSTOM_GPU_NVIDIA and CUSTOM_GPU_PRODUCT_XXXXXX ? or something ? 10:11
tafkamaxjust CUSTOM_NVIDIA_26B910:12
tafkamaxi wonder if its because no nvidia-smi is installed?10:12
blanson[m]I also have, from my notes: add intel/amd_iommu=on, iommu=pt, and bind vfio-pci driver to all GPUs10:13
tafkamaxhmm well iommu10:18
tafkamax[    3.477685] iommu: Default domain type: Translated10:18
tafkamax[    3.477685] iommu: DMA domain TLB invalidation policy: lazy mode10:18
tafkamax[    3.524567] pci 0000:c0:00.3: Adding to iommu group 010:18
tafkamaxand so on...10:18
tafkamaxvfio-pci was not enabled on the hypervisor10:19
tafkamaxmodprobe vfio-pci10:20
tafkamaxCreate or edit /etc/modprobe.d/vfio.conf and add your IDs:10:21
tafkamaxoptions vfio-pci ids=xxxx:yyyy,xxxx:zzzz10:21
tafkamaxDoes that sound reasonable?10:21
blanson[m]it probbly does 10:27
blanson[m]I remember us going nuclear and patching initramfs to early load it 10:28
blanson[m]but it's probbly overkill10:28
tafkamaxi will continue this #_oftc_#openstack-cyborg:matrix.org 10:44
tafkamaxI tried a simple profile with a single trait10:44
tafkamaxand it failed aswell10:44
opendevreviewPierre Riteau proposed openstack/ansible-collection-kolla master: Add Docker config option for Prometheus endpoint  https://review.opendev.org/c/openstack/ansible-collection-kolla/+/93671711:33
blanson[m]woooo 2026.111:58
blanson[m]mnasiadka: I assume everything is good and we can start have merges again ? 11:59
mnasiadkaI think let’s hold off for tomorrows images upload - and try running all CI jobs on 2026.1 - and focus on fixing them12:00
mnasiadkaIn case we’ll need to backport any fixes before doing a final GA release of gazpacho12:00
mnasiadkaBecause now it’s only rc112:00
blanson[m]ack 12:01
blanson[m]will try to get ahead of tmrw images and build some in our repo this PM to see if there are any catastrophic failures12:01
opendevreviewPierre Riteau proposed openstack/kayobe master: Add support for --use-test-images option  https://review.opendev.org/c/openstack/kayobe/+/98760712:02
butjar_We have observed error messages in Horizon in our most recent kolla deployments stating “Too many open files” which manifested as 5xx server errors in the UI. The error appears to have also affected the neutron CI: https://review.opendev.org/c/openstack/neutron/+/971064. Moreover, https://review.opendev.org/c/openstack/kolla-ansible/+/971801 was recently merged, which might fix the issue.12:29
butjar_After some research we think the issue is related to some changes in docker engine v29, since ulimits where decreased drastically: https://docs.docker.com/engine/release-notes/29/#packaging-updates-9.12:31
butjar_I'm not sure if the issue is already known, and if this is the wrong place for raising it. Should we file a bug at launchpad for it?12:33
opendevreviewPierre Riteau proposed openstack/kayobe master: Add support for using Kolla test images  https://review.opendev.org/c/openstack/kayobe/+/98760712:43
opendevreviewPiotr Milewski proposed openstack/kolla-ansible master: prometheus: extend openstack-exporter service disabling and tuning flags  https://review.opendev.org/c/openstack/kolla-ansible/+/98764712:45
opendevreviewPiotr Milewski proposed openstack/kolla-ansible master: prometheus: extend openstack-exporter service disabling and tuning flags  https://review.opendev.org/c/openstack/kolla-ansible/+/98764713:13
opendevreviewPierre Riteau proposed openstack/kayobe master: Add support for using Kolla test images  https://review.opendev.org/c/openstack/kayobe/+/98760713:45
tafkamaxblanson: got it working, in the end the compute node was left disabled from maintenance14:59
tafkamaxand the query from placement had !COMPUTE_NODE_DISABLED14:59
tafkamax:D14:59
tafkamaxvm attached the gpu instantly and nvidia-smi just works15:05
blanson[m]nice, if you ever get vgpus working I'd like to know how 15:06
blanson[m]lol15:06
tafkamaxWe don't want to pay nvidida tax15:06
blanson[m]because I pulled my hair out with them15:07
blanson[m]you could use intel b50 pro 15:07
tafkamaxwe have amd gpu, but i see amd is not supported in the cyborg drivers15:07
blanson[m]and up 15:07
tafkamaxhmm true15:07
tafkamaxthat is a good idea15:07
blanson[m]driver is in the kernel15:07
blanson[m]everything is free15:07
blanson[m]cyborg support is ???15:07
blanson[m]but in theory it's just virtual functions from cyborg pov, should work just fine ? 15:08
blanson[m](says the guy that never tested it)15:08
tafkamaxthat is actually an very good idea. as we have 1U nodes and we got 2 spare L40S though and just added 1 currently, we will add the 2nd one later on. so the other nodes we coould try the intel b5015:08
tafkamaxand our R&D might use it in their phys machine if umm it doesnt work15:08
tafkamaxand if it doesnt it should just PCIE passthrough?15:08
mnasiadkabutjar_: we’re setting them in K-A for each container - see https://github.com/openstack/kolla-ansible/blob/627ed9b9a8f8dcb7885de00a956c65bbbb369af0/ansible/group_vars/all/common.yml#L10817:39
opendevreviewMichal Nasiadka proposed openstack/kolla-ansible master: cinder: Copy multipath.conf into cinder-volume container  https://review.opendev.org/c/openstack/kolla-ansible/+/98773217:44

Generated by irclog2html.py 4.1.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!