Friday, 2021-05-14

*** gyee has quit IRC01:21
*** spatel_ has joined #openstack-ansible02:17
*** spatel_ is now known as spatel02:17
*** spatel has quit IRC02:19
*** evrardjp has quit IRC02:33
*** evrardjp has joined #openstack-ansible02:33
*** macz_ has joined #openstack-ansible03:32
*** macz_ has quit IRC03:37
*** cyberpear has quit IRC04:06
openstackgerritDmitriy Rabotyagov proposed openstack/openstack-ansible-galera_server master: Revert "Update mariadb version to 10.5.10"  https://review.opendev.org/c/openstack/openstack-ansible-galera_server/+/79110704:15
openstackgerritDmitriy Rabotyagov proposed openstack/openstack-ansible master: Bump ansible-base to 2.10.9  https://review.opendev.org/c/openstack/openstack-ansible/+/79129304:16
openstackgerritDmitriy Rabotyagov proposed openstack/openstack-ansible master: Add option to remove group from inventory  https://review.opendev.org/c/openstack/openstack-ansible/+/79127704:17
openstackgerritDmitriy Rabotyagov proposed openstack/openstack-ansible master: Change order of swift and gnocchi installation  https://review.opendev.org/c/openstack/openstack-ansible/+/79126104:18
openstackgerritDmitriy Rabotyagov proposed openstack/openstack-ansible-os_gnocchi master: Switch gnocchi service name to service  https://review.opendev.org/c/openstack/openstack-ansible-os_gnocchi/+/79125404:22
*** macz_ has joined #openstack-ansible05:33
*** macz_ has quit IRC05:38
noonedeadpunkjrosser:  with depends-on revert maria bump things do not fail anymore06:34
*** klamath_atx has joined #openstack-ansible06:39
jrosserwell we should merge that revert ASAP06:43
*** andrewbonney has joined #openstack-ansible07:13
*** partlycloudy has quit IRC07:23
*** partlycloudy has joined #openstack-ansible07:26
*** macz_ has joined #openstack-ansible07:34
noonedeadpunkI think I will submit a bug07:35
noonedeadpunkhm, well, now I still see issues even with depends on, ie for adjutant... https://review.opendev.org/c/openstack/openstack-ansible-os_adjutant/+/777607/07:36
openstackgerritDmitriy Rabotyagov proposed openstack/openstack-ansible-os_adjutant master: Install mysql client libraries  https://review.opendev.org/c/openstack/openstack-ansible-os_adjutant/+/77760707:36
noonedeadpunkoh, it's another depends on :)07:37
*** macz_ has quit IRC07:38
openstackgerritDmitriy Rabotyagov proposed openstack/openstack-ansible master: Decrease manila tempest coverage  https://review.opendev.org/c/openstack/openstack-ansible/+/79120207:39
*** tosky has joined #openstack-ansible07:47
*** sakharkar has joined #openstack-ansible07:57
sakharkarIs Openstack Ansible Victoria supports all endpoints on SSL?07:58
*** sshnaidm|afk is now known as sshnaidm|pto08:00
noonedeadpunksakharkar: yep, it does08:06
noonedeadpunkthere's haproxy_ssl_all_vips variable, and you can set openstack_service_adminuri_proto and openstack_service_internaluri_proto to https08:08
*** masterpe has quit IRC08:37
*** masterpe has joined #openstack-ansible08:40
openstackgerritMerged openstack/openstack-ansible-galera_server master: Revert "Update mariadb version to 10.5.10"  https://review.opendev.org/c/openstack/openstack-ansible-galera_server/+/79110708:52
*** jcath has joined #openstack-ansible09:16
jcathhello~~~ jrosser09:17
*** jcath has quit IRC09:20
*** jcath has joined #openstack-ansible09:21
*** prometheanfire has quit IRC09:24
*** prometheanfire has joined #openstack-ansible09:24
*** jcath has quit IRC09:27
*** macz_ has joined #openstack-ansible09:35
sakharkarnoonedeadpunk: I tried the deployment and is failing at setup openstack.yaml while creating database for keystone,09:39
*** macz_ has quit IRC09:40
sakharkarnoonedeadpunk: Deployment fail error : http://paste.openstack.org/show/805378/09:44
openstackgerritDmitriy Rabotyagov proposed openstack/openstack-ansible master: Bump ansible-base to 2.10.9  https://review.opendev.org/c/openstack/openstack-ansible/+/79129309:44
noonedeadpunksakharkar: I think it's not related to SSL and I believe your setup-infrastructure.yml has failed somewhere first time you ran it?09:45
noonedeadpunkthe thing here is that your utility container was not setup properly. So I'd suggest re-running utility-install.yml (maybe with -e venv_rebuild=true in case it's broken)09:47
sakharkarnoonedeadpunk: My user_variable.yaml looks like this http://paste.openstack.org/show/805379/09:48
noonedeadpunkI'm not sure it's related atm10:05
*** macz_ has joined #openstack-ansible11:36
*** macz_ has quit IRC11:40
*** jawad_axd has joined #openstack-ansible12:12
*** mgariepy has quit IRC12:21
*** mgariepy has joined #openstack-ansible12:35
*** jawad_axd has quit IRC12:36
*** jawad_axd has joined #openstack-ansible12:36
*** spatel_ has joined #openstack-ansible13:15
*** spatel_ is now known as spatel13:15
spatelFolks, any idea, last night we hit by large DDoS and that isolate my galera cluster so now one of node saying this - ERROR 1047 (08S01): WSREP has not yet prepared node for application use13:17
spatelGoogle saying this is what happened when your node isolate..13:17
*** macz_ has joined #openstack-ansible13:37
*** d34dh0r53 has joined #openstack-ansible13:38
openstackgerritDmitriy Rabotyagov proposed openstack/openstack-ansible-os_neutron master: Implement uWSGI for neutron-api  https://review.opendev.org/c/openstack/openstack-ansible-os_neutron/+/48615613:39
*** macz_ has quit IRC13:41
openstackgerritMerged openstack/openstack-ansible-os_nova master: setup.cfg: Replace dashes with underscores  https://review.opendev.org/c/openstack/openstack-ansible-os_nova/+/78889213:48
noonedeadpunkand connection for all ports is open between them?13:50
*** cyberpear has joined #openstack-ansible13:57
*** jawad_axd has quit IRC13:58
admin0spatel, as long as any one of your node is good.. galera can be recovered and fixed14:13
admin0but you have to do some stuff manually also14:14
spateladmin0 thanks after reboot node-1 (bad one) fix issue.14:15
spatelI just reboot lxc container i meant14:15
noonedeadpunkoh, I expected you did that at first place:)14:25
noonedeadpunkeventually on Train I catched bug where galera didn't want to re-sync, when I decided to re-create one of the containers. It was keep saying that this new clean container is up to date and synced for some reason...14:26
noonedeadpunk(believe that some sort of bug)14:26
openstackgerritMerged openstack/openstack-ansible stable/train: Prepare Train to EM  https://review.opendev.org/c/openstack/openstack-ansible/+/79065514:40
noonedeadpunkFolks ,did anybody used Cyborg for managing GPUs or other sr-iov devices?14:42
jrosserI only ever did sriov with nova/neutron, did pci pass through with nova and also vgpu with nova14:51
jrosserfeels like a lot of crossover between nova and cyborg and not really obvious where it’s all heading14:52
noonedeadpunkoh, so you can just sr-iov gpu? as in nova doc they're talking about nics only, and saw https://indico.cern.ch/event/776411/contributions/3345183/attachments/1851624/3039917/02_-_vGPUs_with_OpenStack_-_Accelerating_Science.pdf where has been said it's not implemented (no idea when it was)14:53
noonedeadpunkI mean, nvidia and amd does vgpu pretty differently14:53
noonedeadpunkwhile nvidia tesla (which require licensing) does fully virtualize gpus with their driver, AMD jsut create SR_IOV devices14:54
jrosserI did nvidia vgpu pretty straightforward with nova14:54
jrosserbut yes licensing (inside the VM doh!) is needed14:54
noonedeadpunkyeah, I actully was reading Cyborg specs for half a day and still have no idea why it's needed. as looks like nova wrapper of the days where there was no proper placement features...14:55
jrosserI think there are cases for generic accelerators which need firmware uploading to have a particular function14:56
jrosserlike fpga cards, that seems to be what I understood cyborg for14:56
jrosserbut I see they add smartnic also and then I’m confused14:57
noonedeadpunkI see. I was kind of thinking about AMD as well, as because of SR-IOV you can place them separatelly from servers in rack and just do pci-e over ethernet (or smth like that)14:57
noonedeadpunkwell, at day 1 presentation they were talking about sr-iov drives like nic, gpus... and I got confused there14:58
*** macz_ has joined #openstack-ansible14:58
noonedeadpunkand well, for W they added nvidia vgpu support https://specs.openstack.org/openstack/cyborg-specs/specs/wallaby/approved/vgpu-driver-proposal.html14:58
noonedeadpunkso returning back to vgups, I hoped to be able to have gpu on every compute, since gpu stack will be standalone, and accessible through net15:00
*** spatel has quit IRC15:00
noonedeadpunkbut I think I won't know for sure if it's possible to make this work without some POC env, which be super costy...15:01
jrosseryou can prototype with T4 for nvidia15:01
jrosserthat does all the vgpu stuff15:01
noonedeadpunkwell, I know that nvidia will work with their vgpu implementation15:02
jrosserstill $$$$ but less than $$$$$$15:02
noonedeadpunkI'm not sure about AMD :)15:02
noonedeadpunkthey have pretty different concept of vgpus https://www.amd.com/en/graphics/workstation-virtual-graphics15:03
jrosserone of the subtle things is now the gpu gets divided, if you want multiple vm sharing a gpu15:03
jrosser*how15:03
*** spatel_ has joined #openstack-ansible15:03
noonedeadpunkwell, yes, with SR-IOV you can't do qos, and protect from noisy neighbours15:03
*** spatel_ is now known as spatel15:03
noonedeadpunkwhile nvidia with full virtualisation solves that15:04
spatelnoonedeadpunk thanks for that info.. my openstack is queens15:04
jrosserfor nvidia the operator has to decide in nova config “divide this into quarter GPU” or whatever15:05
jrosserand it’s very static setup, even though the underlying nvidia stuff does not have that limitation any more15:05
noonedeadpunkbut I think you can pass more like several cores into instance?15:05
jrosserwould need to check that, not sure15:06
noonedeadpunkit should be defined with flavor isn't it?15:06
spatelsriov is technology it can apply anywhere, like gpu, disk IO, nic etc.. i don't think its only for NIC15:07
noonedeadpunk`openstack flavor set vgpu_1 --property "resources:VGPU=1"`15:07
jrosserreally it depends on the workload, if everyone wants the same allocation it’s all very simple15:08
noonedeadpunknot scenario of public cloud :)15:08
jrosserbut if you want to offer very small gpu instances for developers and a farm of the largest possible instance for rendering or something, then you have to think really hard about how you allocate those sizes to the physical gpu15:09
noonedeadpunkyeah, agree15:10
noonedeadpunkfrom other side with native sr-iov like amd do - everybody can use like 100% of gpu...15:11
noonedeadpunkand all others are struggling15:12
jrosseryeah, a bit unrelated but I got some U.2 nvme format pcie video encoders coming15:13
jrosserthe go in where nvme drives would normally go and will need pci passthrough15:13
noonedeadpunkand still no-use for Cyborg? what a pity :)15:14
jrosserreally interesting to see general purpose pci devices turning up in disk drive form factor15:14
noonedeadpunkyeah, that's true actually15:14
noonedeadpunkand I think I like that, because pci is super limited in amount, when you might want to have raid controller or proper networking card...15:15
jrosseroh well AMD server :)15:15
noonedeadpunkyeah, we're also about to switch to these :)15:16
noonedeadpunkBut I think I will go nvidia way as well. At least that's know to work (and even had some experience with that)15:19
*** spatel has quit IRC15:38
openstackgerritMerged openstack/openstack-ansible-haproxy_server master: Use integrated tests for haproxy_server  https://review.opendev.org/c/openstack/openstack-ansible-haproxy_server/+/79009015:51
*** macz_ has quit IRC16:12
*** gyee has joined #openstack-ansible16:29
*** spatel_ has joined #openstack-ansible16:38
*** spatel_ is now known as spatel16:38
*** rh-jlabarre has quit IRC17:16
*** andrewbonney has quit IRC17:41
spatelI know i asked this question before but again i would like to ask if someone has better answer, How do you guys do capacity planning in private or public cloud?17:58
spatelmy management folks like to know these number for sales team and for other reason also17:59
*** macz_ has joined #openstack-ansible21:03
*** macz_ has quit IRC21:07
*** macz_ has joined #openstack-ansible21:08
*** spatel has quit IRC21:35
*** macz_ has quit IRC21:45
*** macz_ has joined #openstack-ansible21:49
*** dpawlik has quit IRC22:45
*** dpawlik7 has joined #openstack-ansible22:53
*** tosky has quit IRC23:11
*** macz_ has quit IRC23:14

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!