Friday, 2019-01-25

*** macza has quit IRC00:05
openstackgerritMerged openstack/openstack-ansible stable/rocky: Update systemd_service role for lock path fix  https://review.openstack.org/63275100:08
*** strattao has quit IRC00:18
*** markvoelker has quit IRC00:20
*** macza has joined #openstack-ansible00:33
*** cmart has quit IRC00:36
prometheanfirewhere are the rules for nat set up?00:36
prometheanfirefor the containers00:36
*** macza has joined #openstack-ansible00:37
logan-https://github.com/openstack/openstack-ansible/blob/master/tests/roles/bootstrap-host/tasks/prepare_networking.yml#L159-L19700:37
logan-oh sorry, that's for the gate00:37
logan-https://github.com/openstack/openstack-ansible-lxc_hosts/blob/master/templates/lxc-system-manage.j2#L76-L11100:38
prometheanfirelogan-: well, I am kinda running gate00:39
prometheanfireI ran run_tests.sh then tox -e functional from the tests repo00:39
prometheanfireoff an infra image00:40
logan-the first is covering things like nova tempest vms reaching the internet. the second is what you see in the gate and prod handling masq for the eth0 interface in the containers00:40
prometheanfireok, and dnsmasq is in charge of the reload00:42
prometheanfirehad to restart the lxc-dnsmasq service, works now, will have to look into it if it occurs again00:43
*** tstrul has joined #openstack-ansible00:47
*** tstrul has quit IRC00:48
*** bgmccollum has quit IRC00:48
*** tstrul has joined #openstack-ansible00:48
openstackgerritMatthew Thode proposed openstack/openstack-ansible-lxc_container_create master: Add gentoo support  https://review.openstack.org/63309200:48
*** tstrul has quit IRC00:50
*** tstrul has joined #openstack-ansible00:50
*** tstrul has quit IRC00:51
*** tstrul has joined #openstack-ansible00:52
*** nurdie has joined #openstack-ansible00:54
*** TxGirlGeek has joined #openstack-ansible00:54
*** tstrul has quit IRC00:55
*** tstrul has joined #openstack-ansible00:55
*** tstrul has quit IRC00:58
*** tstrul has joined #openstack-ansible00:59
*** nurdie has quit IRC00:59
*** tstrul has quit IRC00:59
prometheanfireevery time I rerun the functional tests networking breaks01:07
*** sawblade6 has quit IRC01:07
*** sawblade6 has joined #openstack-ansible01:08
*** gyee has quit IRC01:10
openstackgerritMatthew Thode proposed openstack/openstack-ansible-memcached_server master: add gentoo support to memcached role  https://review.openstack.org/63309301:11
*** bgmccollum has joined #openstack-ansible01:17
openstackgerritMatthew Thode proposed openstack/openstack-ansible-lxc_hosts master: add gentoo support  https://review.openstack.org/60839301:21
*** TxGirlGeek has quit IRC01:30
*** cmart has joined #openstack-ansible01:37
openstackgerritMichael Vollman proposed openstack/openstack-ansible-os_manila master: Basic working os_manila role  https://review.openstack.org/61193001:42
cloudnullevenings all, sorry I was away most the day.02:10
cloudnullCeeMac did you make things go ?02:10
cloudnullis it working now ?02:11
cloudnullprometheanfire - we are using networkd in the containers, or it should be.02:11
*** cmart has quit IRC02:14
*** vollman has quit IRC02:24
*** markvoelker has joined #openstack-ansible02:25
prometheanfirecloudnull: my problem is that I have to restart lxc-dnsmasq every retest02:32
cloudnullwhy?02:33
prometheanfiredunno, focusing on rabbit now02:54
prometheanfireI'm in the phase where I just need to make minor changes (I hope)02:54
*** markvoelker has quit IRC02:55
*** ebbex has quit IRC02:55
*** chandankumar has quit IRC02:57
*** chandankumar has joined #openstack-ansible02:59
prometheanfireone thing I'll probably need to do is set up a passed through dir for portage binpkgs03:00
prometheanfirecloudnull: question about rabbitmq upgrade check03:05
prometheanfirehttps://github.com/openstack/openstack-ansible-rabbitmq_server/blob/master/tasks/rabbitmq_upgrade_check.yml#L3503:05
openstackgerritKevin Carter (cloudnull) proposed openstack/openstack-ansible-os_nova master: Fix default init config override for nova-compute  https://review.openstack.org/63310403:05
prometheanfireshell: "equery l net-misc/rabbitmq-server --format '$version' | tail -n 1"03:06
*** udesale has joined #openstack-ansible03:06
prometheanfirethat's how I was trying to do it, but it returns nothing when not installed (correctly)03:08
prometheanfirebut....03:08
prometheanfirefatal: [infra1]: FAILED! => {"msg": "The conditional check 'not installed_rabbitmq.stdout | search(rabbitmq_package_version)' failed. The error was: error while evaluating conditional (not installed_rabbitmq.stdout | search(rabbitmq_package_version)): '_rabbitmq_package_version' is undefined\n\nThe error appears to have been in03:08
prometheanfire'/root/.ansible/roles/rabbitmq_server/tasks/rabbitmq_upgrade_check.yml': line 71, column 3, but may\nbe elsewhere in the file depending on the exact syntax problem.\n\nThe offending line appears to be:\n\n\n- name: Compare installed version of RabbitMQ with new version variable\n  ^ here\n"}03:08
prometheanfireI think that this wouldn't work with zero output03:20
prometheanfire    - not installed_rabbitmq.stdout | search(rabbitmq_package_version)03:20
prometheanfireso I think it's a bug?03:20
prometheanfirecloudnull: here you are for a little better context03:27
prometheanfirehttps://pasted.tech/pastes/0db21523579dde0ef26992716ed1f28210c13e25.raw03:27
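The traceback above reports that '_rabbitmq_package_version' is undefined, which suggests the failure comes from the version variable rather than from the empty equery output. A minimal Python sketch of how the conditional behaves with empty stdout, assuming Ansible's `search` test behaves like Python's `re.search` (the helper name and the version strings are hypothetical, not taken from the role):

```python
import re


def needs_upgrade(installed_stdout, wanted_version):
    """Mirror `not installed_rabbitmq.stdout | search(rabbitmq_package_version)`.

    The wanted version is used as a regular expression, so dots match any
    character, exactly as they would with re.search.
    """
    return re.search(wanted_version, installed_stdout) is None


# Hypothetical version strings, for illustration only.
print(needs_upgrade("", "3.7.7"))       # nothing installed, so no match
print(needs_upgrade("3.7.7", "3.7.7"))  # same version installed, so a match
```

Under that assumption an empty string simply fails to match, so the check would report that an upgrade (or install) is needed instead of raising an error.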
*** markvoelker has joined #openstack-ansible03:52
*** markvoelker has quit IRC04:25
*** shyamb has joined #openstack-ansible04:38
*** macza has quit IRC04:55
*** udesale has quit IRC05:01
*** udesale has joined #openstack-ansible05:04
*** shyamb has quit IRC05:07
*** shyamb has joined #openstack-ansible05:07
*** shyamb has quit IRC05:12
*** shyamb has joined #openstack-ansible05:13
*** markvoelker has joined #openstack-ansible05:22
*** shyamb has quit IRC05:32
*** shyamb has joined #openstack-ansible05:32
*** nurdie has joined #openstack-ansible05:39
*** macza has joined #openstack-ansible05:39
*** spsurya has joined #openstack-ansible05:40
*** udesale has quit IRC05:45
*** udesale has joined #openstack-ansible05:46
*** markvoelker has quit IRC05:54
*** shyamb has quit IRC05:55
*** shyamb has joined #openstack-ansible05:56
*** shyamb has quit IRC06:17
*** shyamb has joined #openstack-ansible06:21
*** macza_ has joined #openstack-ansible06:24
*** kopecmartin|off is now known as kopecmartin|devc06:27
*** macza has quit IRC06:28
*** macza_ has quit IRC06:28
*** shyamb has quit IRC06:34
*** shyamb has joined #openstack-ansible06:35
*** shyam89 has joined #openstack-ansible06:38
*** shyamb has quit IRC06:41
*** shyam89 has quit IRC06:42
*** shyamb has joined #openstack-ansible06:43
*** shyam89 has joined #openstack-ansible06:48
*** shyamb has quit IRC06:49
*** nurdie has quit IRC07:05
*** nurdie has joined #openstack-ansible07:05
*** nurdie has quit IRC07:09
*** sohny has joined #openstack-ansible07:13
openstackgerritOpenStack Proposal Bot proposed openstack/openstack-ansible master: Imported Translations from Zanata  https://review.openstack.org/63315007:21
openstackgerritMatthew Thode proposed openstack/openstack-ansible-rabbitmq_server master: add gentoo support to rabbitmq role  https://review.openstack.org/63315207:26
openstackgerritMatthew Thode proposed openstack/openstack-ansible-tests master: add Gentoo jobs as non-voting  https://review.openstack.org/60810207:27
prometheanfirestuck on Apply rabbitmq policies, but closer07:27
*** ebbex has joined #openstack-ansible07:35
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Added dependencies of os_tempest role  https://review.openstack.org/63272607:43
prometheanfirestrace showed in wait, killed and finished, on to galera (tomorrow)07:44
*** shyam89 has quit IRC07:48
*** shyam89 has joined #openstack-ansible07:48
*** markvoelker has joined #openstack-ansible07:52
*** shyam89 has quit IRC08:20
*** markvoelker has quit IRC08:25
*** shardy has joined #openstack-ansible08:25
chandankumarjrosser: Hello08:35
chandankumarjrosser: where we store netstat output in the osa jobs logs?08:35
*** tosky has joined #openstack-ansible08:49
*** shardy has quit IRC08:50
*** shardy has joined #openstack-ansible08:51
chandankumarI basically need to check whether port 22 is open or not in centos 7 tempest jobs08:52
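Independently of what the job logs collect, a port can also be probed from outside the host. A minimal sketch, assuming plain TCP reachability is what matters here (the function name is hypothetical, and this does not replace the netstat output being asked about):

```python
import socket


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection raises OSError (including timeouts and refused
        # connections) when the port cannot be reached.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Calling `port_is_open("<host>", 22)` against the node under test would show whether sshd is reachable, but not which process is listening, which is what netstat on the host itself adds.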
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Added dependencies of os_tempest role  https://review.openstack.org/63272608:55
*** shyamb has joined #openstack-ansible08:59
*** hamzaachi has joined #openstack-ansible09:00
*** shyamb has quit IRC09:16
*** shyamb has joined #openstack-ansible09:16
*** shyamb has quit IRC09:20
*** shyamb has joined #openstack-ansible09:20
*** markvoelker has joined #openstack-ansible09:22
*** hamzaachi has quit IRC09:22
*** udesale has quit IRC09:29
*** udesale has joined #openstack-ansible09:30
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: [DNM] debug gate failure  https://review.openstack.org/63317309:36
*** asettle has joined #openstack-ansible09:40
*** rgogunskiy has joined #openstack-ansible09:47
*** markvoelker has quit IRC09:55
*** shyamb has quit IRC10:08
*** aedc has joined #openstack-ansible10:10
odyssey4meoh man, the Q->R upgrade periodic just passed within 30 secs of the limit: http://zuul.openstack.org/build/3880243188e64944a5e293c76fdb7b9a10:17
chandankumarodyssey4me: Hello10:18
chandankumarodyssey4me: where can i find in the logs whether port 22 is open or not?10:18
chandankumarodyssey4me: i mean netstat output in centos7 tempest failure logs?10:18
odyssey4mechandankumar for role tests, this is the script that collects the logs: https://github.com/openstack/openstack-ansible-tests/blob/master/test-log-collect.sh10:18
*** shardy has quit IRC10:19
openstackgerritMerged openstack/openstack-ansible master: Imported Translations from Zanata  https://review.openstack.org/63315010:19
*** shardy has joined #openstack-ansible10:20
*** hamzaachi has joined #openstack-ansible10:21
*** shyamb has joined #openstack-ansible10:21
*** shyamb has quit IRC10:28
*** rgogunskiy has quit IRC10:31
*** shyamb has joined #openstack-ansible10:31
*** rgogunskiy has joined #openstack-ansible10:32
*** shyamb has quit IRC10:36
openstackgerritChandan Kumar proposed openstack/openstack-ansible-tests master: Gather different port status on different hosts  https://review.openstack.org/63317910:38
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Always generate stackviz irrespective of tests pass or fail  https://review.openstack.org/63196710:42
*** macza has joined #openstack-ansible10:47
*** macza_ has joined #openstack-ansible10:48
*** macza has quit IRC10:51
*** markvoelker has joined #openstack-ansible10:52
*** macza_ has quit IRC10:52
*** shyamb has joined #openstack-ansible10:55
*** shardy has quit IRC10:55
*** noonedeadpunk has joined #openstack-ansible11:01
*** shardy has joined #openstack-ansible11:10
*** shardy has quit IRC11:15
*** shardy has joined #openstack-ansible11:17
*** shardy has quit IRC11:22
*** shyam89 has joined #openstack-ansible11:23
*** markvoelker has quit IRC11:25
*** shyamb has quit IRC11:26
*** shyam89 has quit IRC11:34
*** shardy has joined #openstack-ansible11:35
*** shardy has quit IRC11:39
chandankumarodyssey4me: need some help on this review https://review.openstack.org/#/c/633179/11:46
chandankumarodyssey4me: http://logs.openstack.org/79/633179/1/check/openstack-ansible-functional-centos-7/460d822/logs/ara-report/result/49a48f9f-2710-4cf2-9882-3a9f1efa189b/11:46
odyssey4mechandankumar hosts: all, but delegate to localhost? that makes absolutely no sense - why not just target localhost?11:47
*** hamzaachi has quit IRC11:47
*** shyamb has joined #openstack-ansible11:47
odyssey4mechandankumar oh, I see what you're trying to do - you're trying to make use of the fact cache to get the listening ports?11:48
chandankumarodyssey4me: yes11:49
chandankumarodyssey4me: I just copied it from openstack-ansible11:49
odyssey4mechandankumar but only for localhost?11:49
jrosserthat is my ugly code :/11:49
chandankumarodyssey4me: https://github.com/openstack/openstack-ansible/blob/master/playbooks/listening-port-report.yml11:49
jrosserit visits 'all', registers the netstat output and then writes a txt file of all of that on localhost11:50
*** CeeMac has quit IRC11:50
chandankumarshould i add a single task to run only the netstat command and gather the output in a file?11:50
odyssey4mejrosser ah ok, I see it now11:50
odyssey4mechandankumar the play needs user:root for that to work11:51
jrossernetstat on the host wont see in the containers11:51
jrosserso it ended up like that11:51
chandankumarodyssey4me: ok11:51
odyssey4mechandankumar also, netstat may need to also be installed on every host - I dunno if it's there by default11:51
ionihello guys, i know it's not related to openstack ansible but mostly you guys run a cloud of different dimensions. have you guys had this problem with resources that werent deallocated from a compute node after a migration? here is a log:  https://paste.xinu.at/ecVs8/11:52
jrosseriirc i wrote this really for multinode deployments to go and grab from everywhere, and afaik netstat was available11:52
jrosserbut that may be an ubuntu-ism11:52
*** shardy has joined #openstack-ansible11:53
openstackgerritChandan Kumar proposed openstack/openstack-ansible-tests master: Gather different port status on different hosts  https://review.openstack.org/63317911:58
*** CeeMac has joined #openstack-ansible11:59
*** CeeMac has quit IRC12:02
*** CeeMac has joined #openstack-ansible12:02
chandankumarodyssey4me: I am trying to add the dependencies here https://review.openstack.org/#/c/632726/ but it is failing12:07
odyssey4mechandankumar that's odd12:09
odyssey4meI don't know why that's failing like that, given nothing else is changing.12:12
odyssey4meThere will definitely be a clash between config_template in openstack-ansible-plugins and config_template. I don't know why the transition is taking so long - perhaps evrardjp can comment?12:13
*** shyamb has quit IRC12:16
evrardjpI didn't get the chance to start to work on that?12:16
evrardjpsimply as that :)12:16
evrardjpwe need to be coordinated on it12:16
odyssey4meevrardjp then perhaps we should schedule a day to get it done?12:16
evrardjpyeah that sounds fair12:16
odyssey4meor we put up all the patches needed, but -w them, then approve them all in the appropriate sequence?12:17
evrardjpif possible in european timezones would be easier for me and to get a faster ci12:17
evrardjpit depends on how far we want to go too12:19
odyssey4meI'm away at FOSDEM next Friday. I'll see you at FOSDEM, so perhaps we can put together a etherpad to plan what needs to be done, and can send it to the ML for comment - then set the date to get it done.12:19
evrardjplgtm12:19
chandankumarodyssey4me: odyssey4me if we get it done, then we can also remove this copied stuff https://github.com/ceph/ceph-ansible/blob/master/library/config_template and script12:20
chandankumarand make them dependent on it12:20
odyssey4mechandankumar yes, although that may require that we have infra publish the config_template role to ansible-galaxy12:21
*** nurdie has joined #openstack-ansible12:24
jamesdentonmronin12:27
jamesdentonerrrr... good morning12:27
*** nurdie has quit IRC12:27
*** nurdie has joined #openstack-ansible12:28
CeeMacmorning jamesdenton12:31
CeeMacwell, afternoon here :)12:31
jamesdenton:)12:31
*** nurdie has quit IRC12:32
chandankumarodyssey4me: port patch worked but this time http://logs.openstack.org/79/633179/2/check/openstack-ansible-linters/a0f260c/job-output.txt.gz#_2019-01-25_12_06_35_56361312:55
odyssey4mechandankumar 'ANSIBLE0012 Commands should not change things if nothing needs doing'12:55
odyssey4meadd 'changed_when: false' to the netstat command12:56
chandankumarok12:56
odyssey4meits purpose is to gather a form of fact12:56
odyssey4mefact gathering should not result in a task being 'changed'12:56
openstackgerritChandan Kumar proposed openstack/openstack-ansible-tests master: Gather different port status on different hosts  https://review.openstack.org/63317912:59
chandankumarodyssey4me: yes, thanks :-)12:59
chandankumarodyssey4me: jrosser here is the output from last run http://logs.openstack.org/79/633179/2/check/openstack-ansible-functional-centos-7/f3465eb/logs/host/listening_port_report.txt.txt.gz13:01
*** eumel8 has joined #openstack-ansible13:05
*** hamzaachi has joined #openstack-ansible13:06
CeeMacok, fun question time13:14
CeeMacsay, hypothetically, I'd been an idiot and forgot to exclude my LB VIP from the range allocatable for containers, and one of the galera containers picked up the address13:15
CeeMacif i removed that container (destroy and remove from inventory)13:16
CeeMacand fixed the used_ips, then ran setup-hosts again, that would generate a new container with a new (non-conflicting) ip13:17
CeeMacif i then re-ran the os-galera playbook, would it just add another container to the same galera cluster and not try to recreate the whole cluster?13:17
*** strattao has joined #openstack-ansible13:18
CeeMachypothetically >_>13:18
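A quick way to catch this kind of clash before deploying is to check that the VIP is covered by a used_ips entry. A minimal sketch, assuming each entry is either a single address or an inclusive "start,end" range; the function name and the addresses in the usage note are hypothetical:

```python
import ipaddress
from typing import List


def is_reserved(ip: str, used_ips: List[str]) -> bool:
    """Return True if ip is covered by any used_ips entry.

    Each entry is assumed to be a single address ("10.0.0.5") or an
    inclusive "start,end" range ("10.0.0.1,10.0.0.50").
    """
    addr = ipaddress.ip_address(ip)
    for entry in used_ips:
        parts = [part.strip() for part in entry.split(",")]
        start = ipaddress.ip_address(parts[0])
        end = ipaddress.ip_address(parts[-1])
        if start <= addr <= end:
            return True
    return False
```

For example, `is_reserved("172.29.236.9", ["172.29.236.1,172.29.236.50"])` is true, so a VIP at that address would not be handed out to a container under the stated assumption about the entry format.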
jamesdentonthis is all starting to make sense now :D13:20
CeeMacalso, is there some command i need to run to remove that galera container from the cluster first?13:20
* CeeMac facepalms13:20
jamesdentonI can't say for sure. There could be something useful in here: https://docs.openstack.org/openstack-ansible/latest/admin/maintenance-tasks/galera.html13:22
*** nurdie has joined #openstack-ansible13:22
*** nurdie has quit IRC13:23
CeeMacactually, would amending the inventory file manually to change the IP cause any issues?13:23
CeeMacthat would be more straightforward13:23
*** nurdie has joined #openstack-ansible13:24
jamesdentonYou might be able to get away with destroying the container as described, then changing the IP in the inventory, then rebuilding it and rerunning respective playbooks. Caveat Emptor.13:24
CeeMacyeah, thats what I was thinking13:26
CeeMaci can just edit the json file directly right, or is there a dynamic-inventory command to change ip?13:28
*** nurdie has quit IRC13:28
jamesdentonI think you'd have to munge by hand13:29
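Editing the JSON by hand can be scripted so that every occurrence of the old address is changed consistently. A minimal sketch that makes no assumption about the inventory's real layout: it walks the whole document and replaces only string values that are exactly the old IP. The function names are hypothetical, and the file should be backed up before it is rewritten:

```python
import json


def replace_ip(node, old_ip, new_ip):
    """Recursively replace string values equal to old_ip with new_ip."""
    if isinstance(node, dict):
        return {key: replace_ip(value, old_ip, new_ip) for key, value in node.items()}
    if isinstance(node, list):
        return [replace_ip(item, old_ip, new_ip) for item in node]
    return new_ip if node == old_ip else node


def rewrite_inventory(path, old_ip, new_ip):
    """Rewrite a JSON inventory file in place; back it up before calling."""
    with open(path) as handle:
        inventory = json.load(handle)
    updated = replace_ip(inventory, old_ip, new_ip)
    with open(path, "w") as handle:
        json.dump(updated, handle, indent=4, sort_keys=True)
```

Dictionary keys and strings that merely contain the address are left alone, so any other file that records the container IP still has to be checked separately.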
CeeMack13:29
CeeMaccool i'll give that a go, should hopefully make everything else just work!13:30
CeeMacor break it completely13:30
* CeeMac shrugs13:30
jamesdentonbreaking things is part of the journey13:30
CeeMacthats how we learn right?13:30
CeeMaci'll need to re-run haproxy-install as well right? to update the new container ip?13:32
CeeMacor fudge the config manually?13:32
jamesdentonyes13:32
jamesdentonrerun13:32
CeeMack. thanks13:32
CeeMacis it worth removing the ansible facts information for that container as well?13:37
jamesdentonsure13:42
CeeMacok, i've changed the IP in the containers ansible_facts file, the openstack_inventory.json and the openstack_hostnames_ips.yml file13:45
CeeMacthink that covers everything?13:46
jamesdentonyou can verify the changes by running inventory/dynamic_inventory.py13:47
chandankumarodyssey4me: https://bugs.launchpad.net/openstack-gate/+bug/1808010 tempest issues13:48
openstackLaunchpad bug 1808010 in OpenStack Compute (nova) "Tempest cirros ssh setup fails due to lack of disk space causing config-drive setup to fail forcing fallback to metadata server which fails due to hitting 10 second timeout." [Medium,Confirmed]13:48
jamesdenton /opt/openstack-ansible/scripts/inventory-manage.py --list-host might even work13:48
CeeMacseems to look ok13:49
jamesdentoncool.13:49
CeeMacso, run setup-hosts --limit <host>,<container> first then os-galera --limit <host>,<container> ?13:50
jamesdentonmore or less13:51
jamesdenton+ haproxy13:51
CeeMacah, yes13:51
CeeMacnot forgetting haproxy13:51
*** sohny has quit IRC13:51
odyssey4mechandankumar so https://github.com/openstack/openstack-ansible-os_tempest/blob/master/defaults/main.yml#L242-L250 needs changing up then?13:52
chandankumarodyssey4me: yes13:52
chandankumardoing that13:52
odyssey4mefantastic, thanks13:52
chandankumarodyssey4me: updating image to 3.613:52
chandankumarodyssey4me: from where we get sha256 hash?13:53
chandankumarodyssey4me: there I am seeing only md5sums13:53
odyssey4mechandankumar just download it and run sha256sum against the file13:53
chandankumarodyssey4me: ok13:54
odyssey4mechandankumar you could also change it to use md5 instead13:57
odyssey4mechecksum: "md5:..."13:57
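The same digest that sha256sum prints can be produced in Python and formatted like the "md5:..." string quoted above. A minimal sketch, assuming the consuming variable accepts the `<algorithm>:<hexdigest>` form for sha256 as well (the function name is hypothetical):

```python
import hashlib


def checksum_string(path, algorithm="sha256", chunk_size=1024 * 1024):
    """Return '<algorithm>:<hexdigest>' for the file at path."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        # Read in chunks so large images do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return "{}:{}".format(algorithm, digest.hexdigest())
```

Running it against the downloaded image gives a value that can be pasted in as-is; passing `algorithm="md5"` yields the md5 variant instead.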
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Update cirros from 3.5 to 3.6  https://review.openstack.org/63320813:57
chandankumarodyssey4me: ^^13:58
odyssey4mechandankumar perhaps 'Related-Bug: xxxx' should be used?13:59
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Update cirros from 3.5 to 3.6  https://review.openstack.org/63320814:00
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Update cirros from 3.5 to 3.6  https://review.openstack.org/63320814:01
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Update cirros from 3.5 to 3.6  https://review.openstack.org/63320814:03
odyssey4mechandankumar ok, let's see how the gates respond14:04
odyssey4medo you think this resolves the gate issue we've had for a few days?14:05
chandankumarodyssey4me: I think so14:09
chandankumarodyssey4me: it started happening once we enabled config-drive i think14:10
odyssey4mechandankumar here's hoping that it works :)14:15
chandankumarodyssey4me: cloudnull jrosser https://review.openstack.org/633179 port scanning is good to go14:16
CeeMacjamesdenton, so that seems to have resolved that little issue14:23
jamesdentonglad to hear!14:23
jamesdentonis there a 'but' in there?14:24
*** rgogunskiy has quit IRC14:45
*** dave-mccowan has joined #openstack-ansible14:46
*** hamzaachi has quit IRC14:49
*** strattao has quit IRC14:50
*** dave-mccowan has quit IRC14:51
chandankumarodyssey4me: one of the job just failed while downloading image14:51
*** Soopaman has joined #openstack-ansible14:55
*** strattao has joined #openstack-ansible15:01
*** spsurya has quit IRC15:07
CeeMacjamesdenton, sorry, got distracted elsewhere.  No 'but' so far!15:19
CeeMacalthough I've rediscovered another issue i never managed to resolve in previous runs15:19
CeeMacuploading images through the dashboard doesn't appear to work15:19
CeeMacjust trying to work through logs15:20
*** vollman has joined #openstack-ansible15:22
*** sohny has joined #openstack-ansible15:27
jamesdentoncloudnull with the systemd role, is it possible to force a service to restart due to changes to a different service or task?15:28
*** ostackz has joined #openstack-ansible15:28
*** spsurya has joined #openstack-ansible15:30
chandankumarodyssey4me: http://logs.openstack.org/08/633208/4/check/openstack-ansible-functional-distro_install-centos-7/4525118/job-output.txt.gz#_2019-01-25_14_48_36_06495315:32
chandankumarodyssey4me: it is showing in distro jobs also15:33
chandankumarodyssey4me: I am not sure what to do in this case?15:34
ostackzodyssey4me hi, Im trying to rerun all rocky playbooks(after not being able to add single compute node), but stuck with this https://pastebin.com/raw/j5mqc9Xr  Could you point me how that can be solved? Need to clear caches, how?15:38
*** pamsoo has joined #openstack-ansible15:39
cloudnullmornings15:39
cloudnulljamesdenton yes15:39
cloudnullyou can make a service PartOf another15:40
cloudnullhttps://www.freedesktop.org/software/systemd/man/systemd.unit.html#PartOf=15:40
cloudnullyoud have to add that as a config_template override at this time however if we think it'd be useful as a general thing within the role we can add that15:40
openstackgerritChandan Kumar proposed openstack/openstack-ansible-os_tempest master: Update cirros from 3.5 to 3.6  https://review.openstack.org/63320815:42
chandankumarodyssey4me: ^^ libselinux fix in same patch15:42
*** pamsoo has quit IRC15:45
broken_oneostackz: it could be that the python package versions being asked to install either are outdated for your ubuntu version or they dont exist in pip's db15:45
broken_onemy team ran into a similar issue on centos7 and we cheated by installing them in the base OS rather than letting OSA try to.. YMMV15:48
chandankumarbroken_one: we were already installing it there15:49
chandankumarbroken_one: but the interpreter that ansible is run with needs to have libselinux python available15:49
chandankumarbroken_one: https://github.com/openstack/openstack-ansible-os_tempest/blob/master/vars/redhat-7.yml#L2115:50
ThiagoCMCGuys, I'm facing a hard time to deploy OSA (on Ubuntu 18) with "ceph-install.yml". I'm confused. The following doc: https://docs.openstack.org/openstack-ansible/rocky/user/ceph/full-deploy.html - looks incomplete.15:51
ThiagoCMCThen, I found this: http://bicofino.io/2017/09/05/openstack-ocata-deployment-with-openstack-ansible-and-ceph/15:51
ThiagoCMCWhich have more details on its openstack_user and user_vars15:51
FrankZhangchandankumar: hey man, I saw your commit gathering port status. Is this one related to centOS gating debugging? Thanks15:52
ThiagoCMCand also this: https://www.openstackfaq.com/openstack-ansible-ceph/15:52
ThiagoCMCI have a friend with huge experience in ceph-ansible, the cluster is healthy!15:52
ThiagoCMCAlso, Nova Hypervisors are seeing the Ceph storage space! And from ceph-mon containers, I can run `ceph` commands...15:53
chandankumarFrankZhang: https://review.openstack.org/#/c/633208/ will fix the issue15:53
ThiagoCMCHowever, Glance is failing to upload images  and "os-cinder-install.yml" is failing to finish.15:53
ostackzbroken_one trying to figure out how to work around this. You mean install pip with apt? Wondering if this is what breaks it "You are using pip version 18.1, however version 19.0.1 is available."15:53
ThiagoCMCWho can help me with this?  :-P15:53
FrankZhangchandankumar: cool thanks!15:53
ThiagoCMCSorry about sending too many messages...  ^_^15:54
*** hamzaachi has joined #openstack-ansible15:54
ThiagoCMCadmin0, can you ping me when you're available?  :-P15:55
chandankumarThiagoCMC: is virtualenv requests issue?15:55
ThiagoCMCchandankumar, I don't think so...15:55
ThiagoCMCchandankumar, when I try to upload an image to glance, it stalls and fails. Even small raw cirros image.15:56
*** udesale has quit IRC15:56
broken_oneostackz: maybe try chandankumar suggestion first15:59
broken_oneostackz: the pip version was not an issue for us16:00
broken_oneostackz: using yum was the way we fixed some python libs and using pip for a couple of others16:00
jamesdentonthanks, cloudnull!16:01
chandankumarbroken_one: if it is an import error request issue then install pip == 8.0.1  if not then I need to take a look at the error16:01
*** TxGirlGeek has joined #openstack-ansible16:02
broken_onechandankumar: confirming we are still doing the "cheaty" way16:03
ThiagoCMCchandankumar, this is a screenshot where I can see that Nova is connected to Ceph: https://imgur.com/a/o15SPyi - so, it looks good but, not for Glance, neither Cinder.16:06
odyssey4meostackz remember this chat? http://eavesdrop.openstack.org/irclogs/%23openstack-ansible/%23openstack-ansible.2019-01-23.log.html#t2019-01-23T15:25:3416:07
odyssey4meostackz we got to the bottom of why that was happening: http://eavesdrop.openstack.org/irclogs/%23openstack-ansible/%23openstack-ansible.2019-01-24.log.html#t2019-01-24T17:22:5316:07
*** macza has joined #openstack-ansible16:08
*** cmart has joined #openstack-ansible16:09
broken_oneostackz chandankumar we aren't using the cheaty way in the kickstart files anymore so it appears OSA fixed the issue16:10
broken_oneat least for centos716:11
ostackzodyssey4me sorry not following, could that be Im in wrong git branch?16:12
*** macza has quit IRC16:13
*** aedc has quit IRC16:15
*** aedc has joined #openstack-ansible16:15
odyssey4meostackz yep, if you were getting error relating to the blazar role, then you were using OSA master, not rocky16:17
ostackzodyssey4me this pattern makes me think I somehow have managed to checkout stein, right? ...19.0.0.0b1...16:17
odyssey4meyep, exactly16:18
ostackzso, need to wipe out everything and start over I guess. There seems nothing fixable16:18
broken_oneostackz: you can see which branch you are working on by running $ git branch16:18
odyssey4meostackz well, yeah - you can't roll openstack back :/16:19
CeeMacostackz, I hadn't changed much so i was able to checkout stable/rocky again16:19
CeeMacnot sure if you are in the same boat16:19
*** shyamb has joined #openstack-ansible16:20
ostackzCeeMac my playbooks failed along the way asking to upgrade galera and rabbitmq like "-e rabbitmq_upgrade=true". I did that, but I guess that was bad for rocky and will have to start from scratch.16:21
openstackgerritJames Denton proposed openstack/ansible-role-systemd_service master: Allow PartOf to be defined in unit section  https://review.openstack.org/63323516:22
jamesdentoncloudnull >> https://review.openstack.org/#/c/633235/16:22
openstackgerritJames Denton proposed openstack/openstack-ansible-os_neutron master: [WIP] Deploy Vector Packet Processing (VPP) Platform for Neutron  https://review.openstack.org/63164416:25
CeeMacostackz, yeah that doesn't sound great16:35
CeeMacunless you complete the upgrade and just roll with it16:35
prometheanfirewhat's the 'current' desired version of rabbit in master?16:36
broken_oneostackz chandankumar: ok we fixed our python lib issue using this: https://bugs.launchpad.net/networking-midonet/+bug/173031416:37
openstackLaunchpad bug 1636567 in openstack-ansible "duplicate for #1730314 devstack mitaka installation fails with error "Running setup.py bdist_wheel for libvirt-python: finished with status 'error'" in Ubuntu 16.10" [High,Fix released] - Assigned to Jesse Pretorius (jesse-pretorius)16:37
broken_onerepo_build_upper_constraints_overrides:16:38
broken_one - libvirt-python==4.10.016:38
chandankumarbroken_one: cool!16:39
*** spatel has joined #openstack-ansible16:43
spatelcloudnull: ^^16:43
spateli have ELK question16:43
cloudnulljamesdenton one nit but otherwise nice!16:43
cloudnullspatel whats up?16:43
spatelI have 10 node ELK cluster running and its heavily used in production.. index rate is close to million/s16:44
spatelDo you know how do i create separate replication network in ELK?16:44
*** tstrul has joined #openstack-ansible16:45
spatelI want to set MTU9000 on replication network so it won't interfere with rest of the network..16:45
spateli didn't find any setting in ELK doc to set different replication network16:46
jamesdentoncloudnull nit away!16:46
jamesdentona global option.. systemd_partof... would default to what, null?16:46
*** macza has joined #openstack-ansible16:48
chandankumarodyssey4me: http://logs.openstack.org/08/633208/5/check/openstack-ansible-functional-distro_install-centos-7/eabbeab/logs/openstack/tempest1/stestr_results.html still failing at WARN: failed: route add -net "0.0.0.0/0" gw "192.168.74.1"16:48
chandankumarodyssey4me: I need we can disable config drive option in tempest.conf16:49
*** redrobot has quit IRC16:51
odyssey4mebroken_one oh dear, that's going to hurt badly at some point - we don't build the libvirt wheel on purpose because it's so tied to the libvirt binary - instead we symlink it from the host into the venv16:55
*** asettle has quit IRC16:57
broken_oneodyssey4me: it is what we had to do to make it work16:57
broken_onewe are open to a better solution16:58
cloudnulljamesdenton just undefined .16:58
cloudnullthen add a note in the defaults/main.yml how its used.16:58
cloudnullsimilar to the "systemd_after" option16:58
odyssey4mebroken_one it's the end of my day, but I'd be happy to talk through it in more detail on monday16:58
broken_onesure thing.  have a good weekend :)16:59
cloudnullspatel nothing we have at the moment17:00
cloudnullhowever we could certainly map that out.17:00
jrossercloudnull: i wonder if thats packetbeat on * for little gain?17:00
jrosserspatel: do you know which elasticsearch indexes are responsible for that? You can look in the management tabs in kibana to see which are the busy ones17:01
*** stuartgr has quit IRC17:02
*** stuartgr has joined #openstack-ansible17:03
spateljrosser: looking..17:05
spatelI have 32G memory on data node and i allocated 28G for JAVA HEAP17:06
*** shyamb has quit IRC17:07
odyssey4meOK, I’m out for the day & weekend.17:09
odyssey4meHave a great w/end folks!17:09
chandankumarodyssey4me: Have a great weekend, see ya on monday:-)17:09
*** sohny has quit IRC17:10
jrossero/ enjoy17:10
*** sohny has joined #openstack-ansible17:10
spateljrosser: check this out https://ibb.co/RyvnfLF17:11
*** gyee has joined #openstack-ansible17:11
spatelThat is my index management tab17:11
jrosseriirc there is a view that you can order by index rate (i'm not in front of mine just now to check)17:12
*** mrhillsman is now known as mrhillsman_lunch17:13
prometheanfirehttps://github.com/ansible/ansible/pull/51340 may help, if it ever gets looked at17:14
*** sohny has quit IRC17:15
*** hamzaachi has quit IRC17:17
cloudnullspatel jrosser looks like we can set the publish host, which controls the flow of intra elasticsearch node traffic "https://www.elastic.co/guide/en/elasticsearch/reference/current/modules-network.html#advanced-network-settings"17:27
cloudnullmight it be something worth exposing by default?17:27
prometheanfirecloudnull: you ever use async and retries together?17:28
cloudnulli have not17:28
cloudnullthough i think we do that in a couple places?17:28
cloudnulllike, lxc-hosts when creating the cache?17:28
prometheanfireok17:29
spatelcloudnull: as per your link network.publish_host should be my replication network right ?17:30
cloudnullyes.17:30
spatelnetwork.bind_host should be data network where client will pump data17:30
spatelLet me see..17:30
cloudnullif we exposed that, it'd need to do an interface and address lookup17:30
jrosserspatel: what actual issue are you seeing - too much traffic on a particular interface?17:31
spatelI am seeing my data nodes are very slow and same time seeing lots of traffic between nodes... look like replication traffic..17:32
spatelOne thought was maybe not having mtu 9000 set is killing my OS performance..17:33
cloudnullhttps://pasted.tech/pastes/f17fef5c5a17ff46c13d6e66c863edd9b5a7e12f - maybe something like that to set the replication network17:34
spatelother thought was reduce JVM memory and give more memory to OS, this is my current setting ->  ( 32G total = 28G (jvm) + 4G (os) )17:34
cloudnullsetting the MTU to 9000 might help, however elk traffic shouldn't be fragmenting very much, if at all.17:35
jrosserreducing the index rate can only help17:38
jrosserthats why i keep asking if you deployed packetbeat17:38
cloudnull^ thats a good question17:38
jrosserand having ram available for filesystem cache is always a bonus17:39
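
The memory split discussed above (32G total = 28G JVM + 4G OS) can be checked against the commonly cited Elasticsearch guidance of giving the heap no more than about half of physical RAM and keeping it under the compressed-oops threshold. The sketch below is a minimal illustration; the function name and the 30 GB cap are assumptions made for this example, not values taken from the conversation.

```python
def suggested_heap_gb(total_ram_gb: float, heap_cap_gb: float = 30.0) -> float:
    """Half of physical RAM, capped below the compressed-oops threshold.

    heap_cap_gb is an assumed conservative cap; the exact threshold
    varies by JVM and platform.
    """
    return min(total_ram_gb / 2, heap_cap_gb)


total = 32.0  # data node RAM mentioned in the log
heap = suggested_heap_gb(total)
print(f"heap {heap:.0f}G, left for OS and filesystem cache {total - heap:.0f}G")
```

Under these assumptions a 32G node would get a 16G heap, leaving the other half for the OS and the filesystem cache that Lucene relies on.
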
*** sohny has joined #openstack-ansible17:39
*** sohny has quit IRC17:39
*** sohny has joined #openstack-ansible17:39
*** aedc has quit IRC17:40
jrossergoing to mtu9000 will only make a marginal difference, and if you don't see a bunch of 'red' kernel time in htop and can attribute it to ip stack processing then it's likely not going to help17:40
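
As a rough illustration of why jumbo frames tend to give only a marginal bandwidth gain, the sketch below compares TCP payload efficiency at MTU 1500 and MTU 9000. It assumes plain IPv4 and TCP headers of 20 bytes each with no options and ignores Ethernet framing; the function name is made up for this example.

```python
IPV4_HEADER = 20  # bytes, assuming no IP options
TCP_HEADER = 20   # bytes, assuming no TCP options


def payload_efficiency(mtu: int) -> float:
    """Fraction of each IP packet that carries TCP payload."""
    return (mtu - IPV4_HEADER - TCP_HEADER) / mtu


for mtu in (1500, 9000):
    print(f"MTU {mtu}: {payload_efficiency(mtu):.1%} payload")
```

The difference in usable bandwidth is only a couple of percent. The larger effect of MTU 9000 is fewer packets per byte transferred, which matters mainly when kernel IP stack processing is the bottleneck.
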
*** sohny has quit IRC17:43
*** sohny has joined #openstack-ansible17:43
*** sohny has quit IRC17:45
*** spatel has quit IRC17:57
ThiagoCMCGuys, I just did a fresh install of OSA Rocky / Ubuntu, heat.log shows:18:02
ThiagoCMCThe option "__file__" in conf is not known to auth_token18:02
ThiagoCMCThe option "here" in conf is not known to auth_token18:02
ThiagoCMCCouple warnings...18:03
ThiagoCMCWhen I try to access: https://172.29.235.250/project/stacks/ - error.18:03
ThiagoCMCError: Unable to retrieve stack list.18:05
ThiagoCMCAny idea?18:05
*** shardy has quit IRC18:06
ThiagoCMC"root@vuosctrl-3-heat-api-container-d6d7a3d4:/etc/heat# grep here * -r" shows nothing...  lol18:06
ThiagoCMCweird18:06
ThiagoCMCI'm running: `openstack-ansible os-heat-install.yml` again18:10
ThiagoCMC"Playbook execution success"18:10
ThiagoCMCHowever, Heat still doesn't work.18:12
ThiagoCMC:-(18:12
ThiagoCMCThere are two more warnings...18:12
ThiagoCMCDeprecated: Option "deferred_auth_method" from group "DEFAULT" is deprecated for removal (Stored password based deferred auth is broken when used with keystone v3 and is not supported.).  Its value may be silently ignored in the future.18:12
ThiagoCMCkeystonemiddleware.auth_token [-] AuthToken middleware is set with keystone_authtoken.service_token_roles_required set to False. This is backwards compatible but deprecated behaviour. Please set this to True.18:12
ThiagoCMCMaybe that's why it's broken?18:13
*** redrobot has joined #openstack-ansible18:14
*** mrhillsman_lunch is now known as mrhillsman18:15
*** nurdie has joined #openstack-ansible18:23
*** hamzaachi has joined #openstack-ansible18:25
*** nurdie has quit IRC18:28
*** Soopaman has quit IRC18:29
*** aedc has joined #openstack-ansible18:44
*** nurdie has joined #openstack-ansible18:44
*** nurdie has quit IRC18:48
*** kstev has joined #openstack-ansible18:59
*** tstrul has quit IRC19:08
*** kstev has quit IRC19:15
openstackgerritMatthew Thode proposed openstack/openstack-ansible-openstack_hosts master: add gentoo support  https://review.openstack.org/60832519:36
*** electrofelix has quit IRC19:37
kaiokmoThiagoCMC: did you see if the heat services are running inside the container?19:39
ThiagoCMCkaiokmo, yes, heat api, cfn and engine are running.19:41
ThiagoCMCI can even restart them with systemctl19:41
*** mgariepy has joined #openstack-ansible19:43
ThiagoCMCkaiokmo, after restarting heat-api: https://paste.ubuntu.com/p/NyDNMdSkcH/ <- tail log19:47
* prometheanfire wonders if rabbit needs wait_for so that policy isn't trying to be set before it's fully running19:48
*** hamzaachi has quit IRC19:48
prometheanfire"stderr": "Error: this command requires the 'rabbit' app to be running on the target node. Start it with 'rabbitmqctl start_app'.\nArguments given:\n\t-q -n rabbit@infra1 list_policies -p /19:48
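
The wait-before-setting-policy idea above can be sketched as a simple polling loop. This is a generic illustration in Python, not the Ansible change that was later proposed; `wait_for_port` is a hypothetical helper, and using the AMQP listener port (5672 by default) as a readiness signal for the 'rabbit' app is an assumption.

```python
import socket
import time


def wait_for_port(host: str, port: int, timeout: float = 60.0, delay: float = 1.0) -> bool:
    """Poll until a TCP connection to host:port succeeds or the timeout elapses."""
    deadline = time.monotonic() + timeout
    while True:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=delay):
                return True
        except OSError:
            if time.monotonic() >= deadline:
                return False
            time.sleep(delay)


if __name__ == "__main__":
    # 5672 is the default AMQP port; adjust for the actual deployment.
    print(wait_for_port("127.0.0.1", 5672, timeout=5.0))
```

A caller would run the policy commands only once this returns True, and fail or retry otherwise.
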
ThiagoCMCAlso, my `openstack user list --os-cloud=default` works fine.19:50
*** hamzaachi has joined #openstack-ansible19:52
openstackgerritKevin Carter (cloudnull) proposed openstack/openstack-ansible-ops master: Read the path for the logstash queue path  https://review.openstack.org/63326519:55
cloudnullThiagoCMC there was a heat issue we were working the other day with guilhermesp20:12
cloudnullsomething to do with the policy files being out of place?20:12
cloudnulli'm not sure what the resolution of that was?20:12
cloudnullThiagoCMC : maybe that's related to what you're seeing20:13
*** strattao has quit IRC20:14
*** kstev has joined #openstack-ansible20:18
*** TxGirlGeek has quit IRC20:20
*** aedc has quit IRC20:22
ThiagoCMCcloudnull, I tried to re-deploy it many times already, always starting fresh (from MaaS), same result, Heat isn't working with latest OSA/Rocky stable branch...  =/20:24
*** mgariepy has quit IRC20:25
*** nurdie has joined #openstack-ansible20:27
*** kstev has quit IRC20:29
cloudnulldoes it say something about permission errors?20:29
cloudnullanything interesting in the heat logs ?20:30
*** med_ has quit IRC20:30
*** nurdie has quit IRC20:31
jrosserThis looks related to those messages https://review.openstack.org/#/c/515291/20:33
ThiagoCMCjrosser, yes, looks the same! https://paste.ubuntu.com/p/NyDNMdSkcH/20:36
jrosseri think that is saying you should ignore those messages?20:36
ThiagoCMCWell, thing is that Heat isn't working.20:37
ThiagoCMCAnd, OSA Rocky uses Keystone v3, right?20:37
*** spsurya has quit IRC20:37
ThiagoCMC"Stored password based deferred auth is broken when used with keystone v3 and is not supported."20:37
ThiagoCMCHard to ignore this one, if OSA/Rocky is set to keystone v3 only. I'm not sure.20:38
ThiagoCMC`openstack stack list` returns: "ERROR: Internal Error"20:39
ThiagoCMCFrom utility container20:40
jrosserthe most useful info will be in the heat service log files i suspect20:44
*** DanyC has joined #openstack-ansible20:45
openstackgerritMatthew Thode proposed openstack/openstack-ansible-rabbitmq_server master: fix rabbit slow starts causing errors  https://review.openstack.org/63327220:45
ThiagoCMCHeat service log?20:46
ThiagoCMCI can only see heat.log inside of its container...20:46
*** nurdie has joined #openstack-ansible20:48
openstackgerritMichael Vollman proposed openstack/openstack-ansible-os_nova master: Avoid distro installing unused services  https://review.openstack.org/63327520:51
*** nurdie has quit IRC20:52
openstackgerritMichael Vollman proposed openstack/openstack-ansible-os_cinder master: Avoid distro installing unused services  https://review.openstack.org/63327620:53
*** hamzaachi has quit IRC20:59
openstackgerritMichael Vollman proposed openstack/openstack-ansible-os_neutron master: Avoid distro installing unused services  https://review.openstack.org/63327721:02
*** errr has quit IRC21:04
openstackgerritMatthew Thode proposed openstack/openstack-ansible-rabbitmq_server master: add gentoo support to rabbitmq role  https://review.openstack.org/63315221:10
*** TxGirlGeek has joined #openstack-ansible21:16
*** DanyC has quit IRC21:20
prometheanfireosa needs galera/mariadb, is it specifically 'MariaDB Galera Cluster 10.0 Series' from https://downloads.mariadb.org/ or can it be mariadb from the links above that?21:21
*** TxGirlGeek has quit IRC21:28
*** errr has joined #openstack-ansible21:28
ThiagoCMCAlso, `openstack stack list --debug` shows: "RESP BODY: {"explanation": "The server has either erred or is incapable of performing the requested operation.", "code": 500, "error": {"traceback": null, "type": "SSLError"}, "title": "Internal Server Error"}"21:33
prometheanfirefor mariadb, are any of these features needed by it?21:33
prometheanfirebackup bindist galera pam perl server systemd (-client-libs) -cracklib -debug -extraengine -innodb-lz4 -innodb-lzo -innodb-snappy -jdbc -jemalloc -kerberos -latin1 -libressl -mroonga -numa -odbc -oqgraph -profiling -rocksdb (-selinux) -sphinx -sst-mariabackup -sst-rsync -sst-xtrabackup -static -systemtap -tcmalloc -test -tokudb -xml -yassl21:33
ThiagoCMC"openstack user list --os-cloud=default" works!21:33
prometheanfirenuma might be useful if that is passed into containers, not sure if that scheduling is numa aware outside or is passed through21:34
*** demtwistas has joined #openstack-ansible21:38
*** demtwistas has quit IRC21:41
*** cmart has quit IRC21:41
ThiagoCMCcloudnull, jrosser, found something! I guess... On haproxy after running `openstack stack list`:21:41
ThiagoCMChaproxy[22408]: 10.0.3.241:40892 [25/Jan/2019:21:40:54.123] keystone_service-front-1/1: SSL handshake failure21:41
ThiagoCMCNot sure if it's related21:42
ThiagoCMCThis is actually, happening from time to time without my intervention...21:43
ThiagoCMCI'm using self-sign SSL cert for my lab, created like this: https://www.digitalocean.com/community/tutorials/how-to-create-a-self-signed-ssl-certificate-for-apache-in-ubuntu-18-0421:52
ThiagoCMCI used this many times in the past without problems...21:52
*** spatel has joined #openstack-ansible21:59
-spatel- total used free shared buff/cache available21:59
-spatel- Mem: 31G 30G 344M 15M 791M 479M21:59
-spatel- Swap: 4.0G 477M 3.5G21:59
spatelMy looks bad..21:59
spateli think i should reduce jvm memory from 28 to 24G22:00
*** TxGirlGeek has joined #openstack-ansible22:07
jrosserThiagoCMC: dig around this a bit https://github.com/openstack/openstack-ansible-os_heat/commit/785fcfd33d29ddfee54f09cd6bf126990d64e4dd22:13
jrosserPerhaps something that previously would have hit the internal endpoint now hits the external one22:14
jrosserAnd then it doesn’t understand your self signed very, perhaps22:14
jrosser*cert22:14
openstackgerritMatthew Thode proposed openstack/openstack-ansible-galera_client master: add gentoo support to galera_client  https://review.openstack.org/63328922:18
*** spatel has quit IRC22:32
*** cmart has joined #openstack-ansible22:47
openstackgerritMatthew Thode proposed openstack/openstack-ansible-openstack_hosts master: add gentoo support  https://review.openstack.org/60832522:49
ThiagoCMCjrosser, checking that, thanks!22:50
*** KeithMnemonic has quit IRC22:51
ThiagoCMCIs there a way to tell OSA that I'm using self-sign certs so it will just accept it as a "normal" cert?22:52
openstackgerritMatthew Thode proposed openstack/openstack-ansible-galera_client master: add gentoo support to galera_client  https://review.openstack.org/63328922:53
*** eumel8 has quit IRC23:15
cloudnullThiagoCMC nothing in osa directly, however if you sync your certs out to your environment (containers, hosts, etc) that'd work23:18
cloudnullyou could also do something like so https://learn.hashicorp.com/vault/secrets-management/sm-pki-engine23:20
cloudnullah, reading back.23:20
*** kopecmartin|devc is now known as kopecmartin|off23:20
cloudnullif that SSL certificate on Haproxy is a public endpoint you could use something like letsencrypt to get a real certificate?23:21
cloudnullotherwise, --insecure on your client call should do the trick23:21
*** cmart has quit IRC23:23
*** cmart has joined #openstack-ansible23:28
*** partlycloudy has joined #openstack-ansible23:48
ThiagoCMCThat's true! I'll try letsencrypt! I forgot how easy is to get valid certs these days...  =P23:53
ThiagoCMCMaybe OSA should add support for letsencrypt!  :-P23:53
