Friday, 2024-11-01

gokhan_ hello noonedeadpunk , I have nearly completed multi region deployment with osa. but I had to do multiple things manually. there is a one problem left. We can not get quotas from region 2. there are logs with os-quota-set with http 200 status but on client side we are getting unknown http 500 error. I don't know why quota set is not working on region2  11:38
noonedeadpunkgokhan_: and what;s in logs? as 500 shold return nice stack trace11:48
gokhan_noonedeadpunk, sorry 504  error not 500 11:50
gokhan_there is no trace 11:50
noonedeadpunkso 504 that's no backend?11:50
noonedeadpunktimout11:50
noonedeadpunkhuh11:50
noonedeadpunkI guess that quota is eing asked from the keystone11:51
noonedeadpunkso I wonder if it's reachable from the internal network?11:51
noonedeadpunkas otherwise you might need to set to use keystone over public11:51
gokhan_it is weird that ı can see region2 admin quotas from horizon which is on region1 11:53
noonedeadpunkyou share the keystone, right?11:54
gokhan_yes keystone is reachable on internal network,  is it possible to connect region1 keystone internal endpoint from region2 11:56
gokhan_yes 2 regions are using same keystone 11:56
gokhan_noonedeadpunk, but I can list neutron and volume quotas, the problem is on compute only 12:01
jrossergokhan_: this is horizon giving you trouble?12:06
gokhan_no, I am running command on region2 utility container12:06
jrosserah ok, sure12:07
jrosserdid you try —debug?12:07
jrosserreason I asked about horizon is that we see severe regression in horizon performance in the admin pages for compute with caracal12:08
gokhan_yes I tried it. I am sharing debug logs 12:09
gokhan_jrosser, https://paste.openstack.org/show/b2xIlQpM1TVT8EIFgTfF/12:13
jrosserand what actually is 10.0.3.98:8774?12:16
jrosserthat is your internal vip for one of the regions?12:18
gokhan_yes is it is internal vip for region 2 12:18
jrosserthen next thing to do is see if for some reason haproxy returned that, or the backend service12:20
jrosserthe answer will be in the log12:20
gokhan_I checked nova api os compute logs but there is nothing. it is weird that sometimes I see os-quota-set  requests with http status 200 12:22
gokhan_nova client commands are working except quota 12:25
jrosserthe 504 has to come from somehere though12:40
jrosseryou should tail all the logs whilst making the request12:40
gokhan_jrosser, in nova api logs quota set request http status is 200 but on haproxy http status is 504 12:56
gokhan_it is weird 12:56
jrosserhaproxy can hit a timeout and 504 before the backend is done, if the request takes a long time13:00
jrosserthe various letters in the haproxy log tell you about this iirc13:00
gokhan_jrosser, I tried with using nova api container, but it takes too much time 13:05
gokhan_and it worked 13:05
gokhan_I alse see warning logs about keystone.discover tries to connect region 1 keystone with internal vip but it is not accesible13:06
gokhan_maybe problem is about keytone but I am not sure 13:07
gokhan_jrosser, Nov 01 16:01:58 hyperconverged01-nova-api-container-77e8b260 nova-api-wsgi[6004]: 2024-11-01 16:01:58.370 6004 WARNING keystoneauth.discover [None req-2a9fa0e5-1d08-48be-9d18-e73bb24c147a 2087a7fcaba54a4aa9bee259b2dd6766 758f99377e4a4c7f9bed7cdbe122739b - - default default] Failed to contact the endpoint at http://10.0.3.98:5000 for discovery. Fallback to using that endpoint as the base url.13:10
gokhan_this is not expected,it is region1 internal vip 13:11
noonedeadpunkthat's why I asked if keystone is reachable 13:27
noonedeadpunkand I guess expected depends on where keystone is placed13:27
noonedeadpunkas with shared keystone services will tend to connect to it13:27
noonedeadpunkor you stretched the database instead, and have a keystone backend in each region connecting to the same DB?13:28
gokhan_noonedeadpunk, keystone is reachable on public vips but not on internal vips 13:28
noonedeadpunkok, you said opposite thing an hour ago13:28
gokhan_noonedeadpunk, yes "I stretched the database instead, and have a keystone backend in each region connecting to the same DB", 13:29
noonedeadpunkok, then you need to have overrides per region group_vars, of there keystone endpoint is I assume13:30
noonedeadpunkor well13:30
noonedeadpunknot exactly...13:30
noonedeadpunkwhat I did - is that we have internal endpoints set as fqdn13:30
noonedeadpunkand we resolve them differently per region13:31
noonedeadpunkthough keystone endpoint is different from others13:33
noonedeadpunkie region1.cloud.com, region2.cloud.com, identity.cloud.com13:33
gokhan_so you mean identity.cloud.com is used for inernal endpoint in both region 13:35
gokhan_interna13:35
noonedeadpunkwell, it's internal.identity.cloud.com, but yeah13:35
noonedeadpunkand then I added override for hosts record to resolve it differently13:36
noonedeadpunkwith openstack_host_custom_hosts_records13:37
gokhan_noonedeadpunk, ok thanks I get it:) it seems if we connects internal vips  for keystones , it can also work 13:39
noonedeadpunkor you can override to use public IPs instead13:39
noonedeadpunkthough you'd need to do that everywhere, as we don't have a variable to configre that right now everywhere13:40
noonedeadpunkthough might be good idea to add one...13:40
gokhan_noonedeadpunk, thanks I have to go now, may be ask new questions later :) 13:41
BaronvaileWhat can I do if my ORG is decypting every SSL connection and it is causing the bootstrap-ansible script to fail?20:27
jrosserBaronvaile: it would be helpful if you can share an example of what happens20:28
jrosserfor example, do you mean they man-in-the-middle all https connections and replace the cert with their own wildcard?20:29
Baronvailejrosser: During the bootstrap-ansible.sh run I get an ERROR: The certificate of 'opendev.org' is not trusted. followed by one that it 'doesn't have a known issuer'20:30
jrosserand does anything like curl or wget work for https things?20:31
BaronvaileThis happenes because our Palo Alto's firewalls are setup the decrypt and injust the appliance name into the certificate.20:31
jrosserlike I say it would be most helpful if you can share the actual error at somewhere like paste.opendev.org20:32
jrosserotherwise it is very hard to give you specific advice20:32
BaronvaileIt might. I had to put a GIT_SSL_NO_VERIFY=true on the deployment host git environment just to get past that part.20:32
Baronvailejrosser: Thanks. I'll try to paste the text next week and bounce back here.20:34
jrosserif it were me, I would ignore everything to do with openstack-ansible until you can be sure that your host is setup correctly to deal with the firewall20:34
jrosserusually this involves getting the CA certificate that the firewall is using and installing it into the relevant trust stores on your host20:34
jrosserand make sure wget/curl and python are correctly using the system trust store20:35
BaronvaileThanks. I'll look getting the certificate to load into the local store.20:38
jrosserand just a tip that python needs specific config to use that20:40
jrosserthe python “requests”20:40
jrosserlibrary uses a bundled set of CA certs and won’t see your extra cert unless you adjust things20:41

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!