Wednesday, 2024-10-09

opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Fail in when installing RabbitMQ/Erlang versions not possible  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93179808:01
noonedeadpunkI think we should look into adding infra caching mirrors for rabbitmq08:07
noonedeadpunkwe're getting too much timeouts lately08:07
jrosserwell that would not be the first time i had asked if that could happen08:07
jrosserbut anyway :/08:07
noonedeadpunkI can't recall what were reasons not to do that?08:07
noonedeadpunkexcept diskspace?08:07
jrosserrequired space iirc08:07
jrosserand being told just to use the distro package08:08
noonedeadpunkpffffff08:08
jrosserwe just try again :)08:08
noonedeadpunkyeah08:09
kleiniStill on A and trying to get hosts from Ubuntu 20.04 to 22.04: How would you migrate VMs from a compute host away? I am looking for as much automation as possible.09:15
noonedeadpunknova host-evacuate-live?09:21
noonedeadpunkBut I think you'll need to disable compute for scheduling first09:21
noonedeadpunkhttps://docs.openstack.org/nova/latest/admin/evacuate.html09:22
noonedeadpunkoops. it's just for offline evacuate09:22
kleiniI found that documentation, too. But there is no documentation about evacuating a compute host, that is still alive with alive VMs.09:24
noonedeadpunkWell, command is still there https://docs.openstack.org/python-novaclient/latest/cli/nova.html#nova-usage09:25
noonedeadpunk`host-evacuate-live`09:25
noonedeadpunkah, that's the link to the section: https://docs.openstack.org/python-novaclient/latest/cli/nova.html#nova-host-evacuate-live09:26
noonedeadpunkIIRC - that part never made it to openstack client or sdk09:26
noonedeadpunkI had some WIP patch for that, but never finished it09:26
kleiniokay, then I will stick to my while read ID; do openstack server migrate --live-migration --wait $ID; done < <(openstack server list --all-projects --host computeSomething)09:28
kleiniinteresting: nova host-evacuate-live seems to utilize some faster block migration than openstack server migration --live-migration --block-migration according to network throughput in monitoring09:45
noonedeadpunkit should be exactly the same actually09:51
noonedeadpunkas after all it does almost same as you've pasted, just if refactor it to use xargs instead :D09:52
kleininova host-evacuate-live works pretty well. 1 live migration running, all others queued, no more interaction except monitoring needed09:54
opendevreviewMerged openstack/openstack-ansible-rabbitmq_server master: Update rabbitmq and erlang repo locations to the current recommended locations.  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93175310:20
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Use json formatter for feature_flags fetcher  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93190311:40
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Register _cluster_state as a separate variable  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93190511:45
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Register _cluster_state as a separate variable  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93190511:46
noonedeadpunkI'm thinking and can't decide if https://review.opendev.org/c/openstack/openstack-ansible-plugins/+/931801/1/playbooks/rabbitmq_server.yml should be part of the playbook or role11:48
noonedeadpunkas it feels it might make sense to add it to the role directly11:48
noonedeadpunkbut then we need way bigger refactoring11:48
noonedeadpunk(which I guess I'm about to od now11:48
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-plugins master: Remove feature flags pre-upgrade step  https://review.opendev.org/c/openstack/openstack-ansible-plugins/+/93180111:57
andrewbonneynoonedeadpunk: that step is intentionally before 'stop rabbitmq nodes which are not the upgrader' as it doesn't work when they're down. It did always seem odd that the stop step happened in the playbook though, so if that moved to the role it would be no bad thing11:59
noonedeadpunkyeah. I\m jsut looking at completely refactoring how upgrades are done12:03
noonedeadpunkin order to make them rolling rather then blue/green or whatever we have12:03
opendevreviewMerged openstack/openstack-ansible-os_tacker master: Ensure that first/last host detection is deterministic  https://review.opendev.org/c/openstack/openstack-ansible-os_tacker/+/92464012:11
luvnhi everyone! yesterday i installed openstack-ansible @ master in an ubuntu vm and everything was working fine12:17
luvnhowever, today i noticed that all openstack-related commands are returning gateway timeout (http 504)12:17
luvnand it looks like my / directory is pretty much full12:18
noonedeadpunko/12:18
luvnhas anyone encountered this problem before?12:18
noonedeadpunkI assume that was AIO or smth?12:18
luvnyes, aio!12:18
noonedeadpunkAs there're quite extensive logging 12:18
noonedeadpunkSo potentially you need to clean-up journald12:19
noonedeadpunkAlso, we're using sparse files, which advertise more diskspace then there's actually available on system12:19
noonedeadpunkso if you go with real data usage on volumes or vms inside - that can go out of control12:19
jrosseri think that the AIO (as it's primarily a test fixture) enables debug logging on pretty mich everything12:20
jrosserso that will go pretty nuts with the log volumes12:20
luvni was basically just testing some volumes functionalities with a cirros instance12:20
luvnand that's right, pretty much all containers run on debug mode12:20
luvn(or services)12:20
jrosser  /var/lib/nova can get large as well12:20
luvnto clean up journald is it as simple as `journalctl --vacuum-size=<size>`, something like that?12:21
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Use json formatter for feature_flags fetcher  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93190312:26
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Register _cluster_state as a separate variable  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93190512:26
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Register _cluster_state as a separate variable  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93190512:27
noonedeadpunkyup12:28
noonedeadpunkI thin kwe do have some defaults set for this though...12:28
noonedeadpunkhttps://opendev.org/openstack/openstack-ansible/src/branch/master/tests/roles/bootstrap-host/templates/user_variables.aio.yml.j2#L44-L5012:29
luvnthat's interesting12:30
luvni freed up some space but i'm not sure it helped12:32
luvnis it a problem if i reboot my host with aio deployed?12:32
noonedeadpunkso, we had history of not recovering after reboots.... but I _think_ it's better now12:32
mgariepymaybe the losetup stuff won't come back12:33
noonedeadpunkyeah12:33
noonedeadpunkthere were issues with ordering of losetup and lvm12:33
noonedeadpunkeven if losetup is back, it might not be loaded as lvm devices12:33
luvnbetter not to do that then haha12:36
mgariepywhat is consuming the disk space ?12:36
luvnthe top consuming directories in my vm are: /openstack (64gb) and /var (23gb)12:38
luvnfor instance12:38
luvn /dev/vda1        97G   95G  2.8G  98% /12:39
mgariepymaybe check in the /openstack dir to see which one is consuming space or if it's spread accross all dir ?12:42
luvninside the /openstack the heaviest files are the *.img ones, swift{1,2,3}.img, nova.img, manila.img, cinder.img and ceph{1,2,3}.img12:44
luvnthey are all listed as 1.0T files, but i guess it's not their real size, though?12:44
noonedeadpunknah, these are sparse files12:44
noonedeadpunkbut it's where your images and volumes will end up in12:45
luvnafter all these *.img files the heaviest one is a logfile from the galera container (1.0G approx.)12:46
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Move verification of cluster_state to a separate file  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93190513:08
opendevreviewMerged openstack/openstack-ansible-os_octavia master: Move rpc_conn_pool_size to oslo_messaging_rabbit  https://review.opendev.org/c/openstack/openstack-ansible-os_octavia/+/93046913:24
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Refactor rabbitmq_upgrade behavior  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93196214:55
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-plugins master: Move flags check and service stop into role  https://review.opendev.org/c/openstack/openstack-ansible-plugins/+/93180114:58
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Refactor rabbitmq_upgrade behavior  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93196214:59
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Use json formatter for feature_flags fetcher  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93190315:04
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Move verification of cluster_state to a separate file  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93190515:04
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Move verification of cluster_state to a separate file  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93190515:05
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Refactor rabbitmq_upgrade behavior  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93196215:05
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Move verification of cluster_state to a separate file  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93190515:29
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Refactor rabbitmq_upgrade behavior  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93196215:29
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Refactor rabbitmq_upgrade behavior  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93196215:32
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Cleanup unneeded upgrade tasks  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93197316:04
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Move RabbitMQ restart to handlers  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93197416:28
opendevreviewDmitriy Rabotyagov proposed openstack/openstack-ansible-rabbitmq_server master: Use rabbitmq_cluster_state task instead of command  https://review.opendev.org/c/openstack/openstack-ansible-rabbitmq_server/+/93197516:35
noonedeadpunk /o\16:35
noonedeadpunkI wonder how many things I've broke...16:36
noonedeadpunk*broken16:36
noonedeadpunkAnd still a patch to allow force re-bootstrapping is needed16:36
noonedeadpunkand then potentially - 4.0 upgrade16:38
noonedeadpunkI tried to split that all into chunks as much as I could16:40
jrosser8-O blimey16:50
jrosserwhat could possibly go wrong :)16:50
noonedeadpunkyeah, lol16:55

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!