*** pcaruana has quit IRC | 00:04 | |
*** pcaruana has joined #openstack-ansible | 00:16 | |
*** macz_ has joined #openstack-ansible | 01:00 | |
*** macz_ has quit IRC | 01:05 | |
*** DanyC has joined #openstack-ansible | 01:13 | |
*** DanyC has quit IRC | 01:18 | |
*** DanyC has joined #openstack-ansible | 01:47 | |
*** DanyC has quit IRC | 01:52 | |
*** spatel has joined #openstack-ansible | 02:03 | |
*** openstacking_123 has quit IRC | 02:41 | |
*** macz_ has joined #openstack-ansible | 02:49 | |
*** macz_ has quit IRC | 02:53 | |
*** threestrands has joined #openstack-ansible | 03:39 | |
*** DanyC has joined #openstack-ansible | 03:48 | |
*** DanyC has quit IRC | 03:53 | |
*** spatel has quit IRC | 04:11 | |
*** DanyC has joined #openstack-ansible | 04:24 | |
*** DanyC has quit IRC | 04:29 | |
snadge | ERROR: Could not find a version that satisfies the requirement rally===3.0.0 | 04:33 |
snadge | oh i see.. im not the only one | 04:33 |
*** evrardjp has quit IRC | 04:36 | |
*** evrardjp has joined #openstack-ansible | 04:36 | |
*** udesale has joined #openstack-ansible | 04:42 | |
*** dave-mccowan has joined #openstack-ansible | 05:14 | |
*** miloa has joined #openstack-ansible | 05:18 | |
*** DanyC has joined #openstack-ansible | 06:15 | |
*** DanyC has quit IRC | 06:54 | |
*** rpittau|afk is now known as rpittau | 07:17 | |
*** mensis has joined #openstack-ansible | 07:19 | |
CeeMac | morning | 07:21 |
*** tosky has joined #openstack-ansible | 07:29 | |
*** kleini has joined #openstack-ansible | 07:35 | |
ioni | aio setups can be rebooted and all the networking comes up correctly ? | 07:43 |
*** jbadiapa has joined #openstack-ansible | 07:50 | |
*** DanyC has joined #openstack-ansible | 08:16 | |
*** DanyC has quit IRC | 08:24 | |
*** DanyC has joined #openstack-ansible | 08:24 | |
*** threestrands has quit IRC | 08:26 | |
openstackgerrit | Arx Cruz proposed openstack/openstack-ansible master: DNM - POC - Collect logs https://review.opendev.org/702676 | 08:32 |
*** andrewbonney has joined #openstack-ansible | 08:33 | |
*** thuydang has joined #openstack-ansible | 08:48 | |
*** mensis has quit IRC | 09:00 | |
openstackgerrit | Arx Cruz proposed openstack/openstack-ansible master: DNM - POC - Collect logs https://review.opendev.org/702676 | 09:18 |
*** thuydang has quit IRC | 09:18 | |
*** thuydang has joined #openstack-ansible | 09:24 | |
*** gshippey has joined #openstack-ansible | 09:30 | |
jrosser | ioni: i'm not sure about that, it's certainly not tested | 09:30 |
*** sshnaidm|afk is now known as sshnaidm | 09:30 | |
jrosser | noonedeadpunk: any conclusion on rally? I have been testing out using meta: end_play to make it not deploy on < py 3.6 | 09:32 |
noonedeadpunk | ioni: on reboot the loop drives do not come up | 09:33 |
jrosser | did we have a patch for that? | 09:33 |
noonedeadpunk | jrosser: haven't finished yet. ended up on deploying sandbox, will continue in several minutes. | 09:33 |
ioni | noonedeadpunk, ok, thanks | 09:33 |
noonedeadpunk | I'm concerned a bit about not deploying rally, as this will result in non-working rally for users | 09:34 |
noonedeadpunk | on train and stein | 09:34 |
noonedeadpunk | and I know ppl are deploying it... | 09:34 |
openstackgerrit | Merged openstack/openstack-ansible-ops master: Expose Journalbeat queue configuration https://review.opendev.org/715969 | 09:43 |
kleini | Interesting discovery: My deployment of Keystone works if I use OSA 19.0.8 and Keystone logs have the same warning messages. So there is something wrong in stable/stein HEAD with Keystone bootstrapping. | 09:50 |
noonedeadpunk | kleini: that's interesting thing... | 09:51 |
noonedeadpunk | Will test that out | 09:51 |
noonedeadpunk | and will hold off new release then | 09:51 |
jrosser | noonedeadpunk: i was looking at the failing jobs last night and saw something broken with keystone | 09:55 |
jrosser | i have a meeting right now but can dig it out shortly | 09:55 |
noonedeadpunk | wondering how https://review.opendev.org/#/c/714309/ has passed the CI then... | 09:56 |
noonedeadpunk | kleini: were you trying stable/stein with https://review.opendev.org/#/c/714309/ or without it? | 09:56 |
noonedeadpunk | it it updated keystone... | 09:56 |
noonedeadpunk | oh... actually it didn't... | 09:57 |
kleini | I tested including that change. | 09:58 |
*** DanyC has quit IRC | 10:16 | |
*** DanyC has joined #openstack-ansible | 10:16 | |
*** DanyC has quit IRC | 10:20 | |
kleini | btw openstack_service_setup_host on utility containers does not work with 19.0.8, as openstacksdk is then missing in that container | 10:32 |
kleini | General question: Should OpenStack-Ansible always work with arbitrary values for all the possible configuration options? This results in a nearly untestable options matrix and I doubt I will get a stable deployment that way... | 10:33 |
*** weshay|ruck has quit IRC | 10:35 | |
*** weshay_ has joined #openstack-ansible | 10:35 | |
jrosser | kleini: this is working in my environment for the setup host http://paste.openstack.org/show/791390/ | 10:43 |
kleini | So I need both settings? I only added the first one. | 10:47 |
*** DanyC has joined #openstack-ansible | 10:56 | |
*** DanyC has quit IRC | 10:56 | |
*** DanyC has joined #openstack-ansible | 10:56 | |
*** thuydang has quit IRC | 11:04 | |
*** rpittau is now known as rpittau|bbl | 11:05 | |
openstackgerrit | Merged openstack/openstack-ansible-os_neutron master: Missing document start "---" https://review.opendev.org/715103 | 11:06 |
*** DanyC has quit IRC | 11:12 | |
*** DanyC has joined #openstack-ansible | 11:13 | |
*** DanyC has quit IRC | 11:17 | |
*** DanyC has joined #openstack-ansible | 11:19 | |
jrosser | kleini: yes i think you need both to pick up the python libraries installed into the utility venv inside the utility container | 11:22 |
kleini | okay, thanks | 11:23 |
jrosser | kleini: regarding your question about many possible config values, OSA provides "sensible defaults" which are tested in CI. It is expected that you'll need to adjust those for your particular deployment, referring to the documentation for the openstack services | 11:23 |
jrosser | in addition we allow arbitrary key/value to be injected into pretty much all the config files from your user variables, so you are very much in control of what config goes where | 11:24 |
kleini | thanks for your help and your answer. But failing a lot with clean deployments from scratch makes me suspicious whether we can use it to run a production system. Or we need to learn much more than we currently know. | 11:30 |
*** kopecmartin has quit IRC | 11:51 | |
*** kopecmartin has joined #openstack-ansible | 11:52 | |
openstackgerrit | Arx Cruz proposed openstack/openstack-ansible master: DNM - POC - Collect logs https://review.opendev.org/702676 | 11:56 |
*** kopecmartin has quit IRC | 11:58 | |
*** weshay_ has quit IRC | 11:59 | |
*** kopecmartin has joined #openstack-ansible | 12:01 | |
*** rh-jelabarre has joined #openstack-ansible | 12:03 | |
jrosser | kleini: there are lots of people using OSA for production clouds right now, don't worry | 12:05 |
*** kopecmartin has quit IRC | 12:06 | |
jrosser | however it is still up to you to make a lot of decisions about networking and storage just like it would be with any other openstack deployment | 12:06 |
*** kopecmartin has joined #openstack-ansible | 12:07 | |
kleini | storage is an existing Ceph. I only need to add the corresponding configuration to OSA. ceph.conf is described, but it is not described how to add glance ceph client passwords and so on. | 12:08 |
kleini | ephemeral will be on local SSDs | 12:08 |
kleini | images and volumes on Ceph | 12:08 |
jrosser | you need to point OSA to your existing ceph cluster | 12:09 |
kleini | network is currently 10G fiber, tagged VLAN, redundant switches and Open vSwitch on every node for bridges | 12:09 |
jrosser | here is what I have in user_variable_storage.yml to set up access to an external ceph cluster http://paste.openstack.org/show/791393/ | 12:10 |
kleini | sure, I need to point OSA to the existing ceph cluster, but this is not described anywhere | 12:10 |
*** weshay_ has joined #openstack-ansible | 12:10 | |
kleini | thanks for your paste, I found something similar. again missing is, how to configure passwords | 12:11 |
*** weshay_ is now known as weshay|ruck | 12:12 | |
jrosser | there is a very small note about external ceph here https://docs.openstack.org/openstack-ansible/latest/user/ceph/full-deploy.html | 12:13 |
jrosser | but that really should be better | 12:13 |
jrosser | which passwords do you need to configure? | 12:13 |
kleini | glance_ceph_client: glance configures, that glance authenticates as user glance on Ceph. and where does the password for this glance user in Ceph come from? | 12:14 |
*** rpittau|bbl is now known as rpittau | 12:15 | |
kleini | that base64 encoded string in the client.glance.keyring file in /etc/ceph | 12:15 |
noonedeadpunk | it's auth not via password but with cephx | 12:16 |
noonedeadpunk | https://docs.ceph.com/docs/emperor/rados/operations/auth-intro/ | 12:16 |
jrosser | setting up the ceph pools and permissions on an external cluster is outside the scope of OSA, and if you were using ceph-ansible to do that it would be via something like this https://github.com/ceph/ceph-ansible/blob/4ac99223b2dff5cf264e1b1632bf89583bff3a25/roles/ceph-defaults/defaults/main.yml#L726-L731 | 12:17 |
noonedeadpunk | and token should be generated with ceph_client | 12:17 |
noonedeadpunk | s/generated/taken/ | 12:17 |
jrosser | giving the ip of the ceph mons to OSA, the expectation is that ansible can connect to the mon and retrieve the keyrings | 12:17 |
jrosser | that is not always possible, so you can also provide the keyrings manually if you are not allowed to connect to the mon with your OSA ansible | 12:18 |
jrosser | noonedeadpunk: we really need better docs for *all* of this :/ | 12:18 |
noonedeadpunk | ++ cant agree more | 12:18 |
kleini | okay, need to read about that, I thought it was a username and a password. Maybe OSA can fetch the keyrings but I doubt it. Then everybody would be able to access the Ceph just by knowing the user glance exists there. | 12:19 |
jrosser | i don't think thats how it works | 12:20 |
jrosser | the keyring needs to be present on the ceph client | 12:20 |
kleini | okay, so ceph client is in my case a glance container and glance in it, created by OSA. How does OSA place the keyring in this container? | 12:20 |
noonedeadpunk | and osa reaches mon where admin keyring is present... | 12:20 |
noonedeadpunk | so it gets keyring from there and place inside osa container | 12:21 |
kleini | sorry, mon is not reachable by OSA. very different system, where I have no access at all | 12:21 |
noonedeadpunk | actually in train you can just provide keyrings folder to osa | 12:21 |
noonedeadpunk | and don't give it access to mons | 12:21 |
kleini | that's what I need | 12:21 |
jrosser | kleini: like i said earlier there are two possible mechanisms supported in OSA for external ceph clusters | 12:22 |
jrosser | either you allow ansible to ssh into the mon and grab the key, or you provide the keys manually on the deploy host | 12:22 |
jrosser | they are then copied to the necessary places | 12:22 |
jrosser | in your case you need to request that the necessary keys are generated by the ceph cluster admin | 12:23 |
jrosser | the example in ceph-ansible is probably exactly what you need | 12:23 |
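For readers following the keyring discussion: a cephx keyring such as client.glance.keyring is a small INI-style text file holding a secret key per entity, which is why no separate password is configured. The following is a minimal Python sketch of reading such a file, assuming the usual `[entity]` / `key = ...` layout. The function name `parse_keyring`, the sample key, and the sample caps are placeholders for illustration and do not come from OSA or from the cluster discussed here.

```python
def parse_keyring(text):
    """Parse cephx keyring text into {entity: {option: value}}.

    Illustrative helper only; it is not part of OSA or Ceph.
    """
    entities = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines and comments.
        if not line or line.startswith(("#", ";")):
            continue
        # A section header names the cephx entity, e.g. [client.glance].
        if line.startswith("[") and line.endswith("]"):
            current = line[1:-1].strip()
            entities[current] = {}
        elif "=" in line and current is not None:
            # Split on the first "=" only: base64 keys may end with "=".
            name, value = line.split("=", 1)
            entities[current][name.strip()] = value.strip().strip('"')
    return entities


# Placeholder content; a real key is issued by the Ceph cluster admin.
SAMPLE = (
    "[client.glance]\n"
    "\tkey = QUJDREVGR0hJSktMTU5PUFFSU1RVVldYWVo=\n"
    '\tcaps mon = "profile rbd"\n'
    '\tcaps osd = "profile rbd pool=images"\n'
)

if __name__ == "__main__":
    for entity, options in parse_keyring(SAMPLE).items():
        print(entity, sorted(options))
```

Whoever holds this file can authenticate as that entity, so knowing that a user named glance exists is not enough to gain access.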
noonedeadpunk | I guess we didn't have option to provide keys in stein? | 12:23 |
noonedeadpunk | or we did backport that... | 12:24 |
jrosser | here is the patch? https://review.opendev.org/#/c/671762/ | 12:25 |
noonedeadpunk | oh, it's even in master.... | 12:25 |
noonedeadpunk | but I think it's pretty safe to take this exact role from master... | 12:26 |
jrosser | or we backport it | 12:26 |
noonedeadpunk | or this, yes:) | 12:26 |
jrosser | because this is a very reasonable use case | 12:26 |
openstackgerrit | Jonathan Rosser proposed openstack/openstack-ansible-ceph_client stable/train: Importing keyrings from files rather than from mons https://review.opendev.org/716274 | 12:26 |
jrosser | kleini: which branch are you using? | 12:27 |
noonedeadpunk | stein | 12:27 |
jrosser | thats not a clean backport | 12:27 |
jrosser | let me quickly see how bad the conflict is | 12:28 |
kleini | I am currently testing with 19.0.8, 19.0.11, stable/stein. But I will test 20.0.2, too. | 12:30 |
*** rh-jelabarre has quit IRC | 12:31 | |
*** rh-jelabarre has joined #openstack-ansible | 12:31 | |
kleini | whatever runs better with my necessary configuration changes: Users domain with LDAP auth, existing Ceph, dedicated network hosts, nspawn containers, neutron OVS | 12:32 |
openstackgerrit | Jonathan Rosser proposed openstack/openstack-ansible-ceph_client stable/stein: Importing keyrings from files rather than from mons https://review.opendev.org/716278 | 12:33 |
kleini | what is the nspawn equivalent for lxc_container_allow_restarts=false? | 12:34 |
jrosser | kleini: perhaps this? https://github.com/openstack/openstack-ansible-nspawn_container_create/blob/master/defaults/main.yml#L58-L62 | 12:38 |
maharg101 | I'm setting openstack_venv_python_executable: python3 in all.yml, but finding that the lxc containers don't have a python3. Is there a simple fix ? Python3 appears to be required by some roles now e.g. rally | 12:38 |
jrosser | maharg101: which branch are you deploying? | 12:38 |
*** udesale_ has joined #openstack-ansible | 12:38 | |
*** thuydang has joined #openstack-ansible | 12:38 | |
jrosser | kleini: you should find that the file defaults/main.yml in each of the OSA ansible roles is the "external interface" to the role. variables that are intended for you to override should all be listed in the defaults | 12:39 |
kleini | thanks, that's a good hint | 12:40 |
*** udesale has quit IRC | 12:41 | |
maharg101 | jrosser: stable/train | 12:43 |
jrosser | on centos7? | 12:43 |
maharg101 | yep | 12:43 |
jrosser | that is a known issue, rally is "branchless", i.e it does not respect the standard openstack release cycles | 12:43 |
*** rh-jelabarre has quit IRC | 12:44 | |
jrosser | there was a new version released this week which needs >= py3.6 so that has broken | 12:45 |
jrosser | if you do not need rally please just comment out the rally playbook from playbooks/setup-openstack.yml | 12:45 |
maharg101 | gotcha - thanks jrosser | 12:45 |
jrosser | noonedeadpunk is currently looking to see if we have a solution | 12:46 |
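The breakage described here comes down to an interpreter version gate: the new rally release needs Python >= 3.6. A minimal Python sketch of such a check follows. The names `RALLY_MIN_PYTHON` and `can_deploy_rally` are made up for illustration; this is not how OSA or rally implement the restriction.

```python
import sys

# Minimum interpreter version stated in the discussion above.
RALLY_MIN_PYTHON = (3, 6)


def can_deploy_rally(version_info=None):
    """Return True when the interpreter is new enough for the new rally.

    version_info defaults to the running interpreter; any sequence whose
    first two items are (major, minor) is accepted.
    """
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) >= RALLY_MIN_PYTHON


if __name__ == "__main__":
    print("rally deployable here:", can_deploy_rally())
```

A deployment could skip the rally playbook when a check like this fails, which is roughly what the `meta: end_play` experiment mentioned earlier aims at.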
*** jamesden_ has quit IRC | 12:47 | |
maharg101 | so setting openstack_venv_python_executable: python3 in all.yml didn't seem to work so well regardless of rally | 12:47 |
kleini | 20.0.2 fails building the venv in the utility container with Python3: "Failed building wheel for ujson". Is that a known issue on Ubuntu 18.04? | 12:47 |
maharg101 | kleini: yes that's what I'm hitting, in the wheel build virtualenv | 12:49 |
maharg101 | TASK [python_venv_build : Create the wheel build virtualenv (if it does not exist)] | 12:49 |
kleini | TASK [python_venv_build : Build wheels for the packages to be installed into the venv] | 12:50 |
kleini | only some little steps further | 12:50 |
*** jamesden_ has joined #openstack-ansible | 12:53 | |
jrosser | yeah ujson..... just a moment | 12:53 |
*** rh-jelabarre has joined #openstack-ansible | 12:54 | |
jrosser | maharg101: i don't think you should adjust the python executable like that | 12:58 |
jrosser | the deployment is python2 on centos for very good reason | 12:59 |
jrosser | which is why on stable/train it says openstack_venv_python_executable: "{{ (ansible_distribution == 'CentOS') | ternary('python2', 'python3') }}" | 12:59 |
maharg101 | jrosser: yep was just trying to get it past the rally error, have reverted ! | 12:59 |
jrosser | kleini: can you give some more context for your 20.0.2 error, maybe paste the relevant things to paste.openstack.org? like the few tasks leading up to it failing? | 13:02 |
kleini | http://paste.openstack.org/show/791398/ | 13:03 |
kleini | sorry, I tried to upgrade my 19.0.8 deployment to 20.0.2, just to test, whether that works... | 13:03 |
*** weshay has joined #openstack-ansible | 13:23 | |
*** weshay|ruck has quit IRC | 13:25 | |
noonedeadpunk | seems that paste is not full... | 13:26 |
noonedeadpunk | last line stripped | 13:26 |
jrosser | i am just starting a stable/train lxc aio here to see if i can reproduce | 13:28 |
jrosser | i have a meeting now so will let that just run in the background for a while | 13:28 |
maharg101 | huge thanks for your help jrosser and sending best wishes to all in the OSA community | 13:31 |
jrosser | thankyou :) | 13:31 |
kleini | huge thanks from me, too! | 13:32 |
kleini | this helps a lot! | 13:32 |
*** macz_ has joined #openstack-ansible | 13:38 | |
*** thuydang has left #openstack-ansible | 13:38 | |
*** openstacking_123 has joined #openstack-ansible | 13:40 | |
*** macz_ has quit IRC | 13:42 | |
noonedeadpunk | jrosser: I'm thinking if we should better use py2 for stretch because of py3.5 on board rather than py3 venvs? | 13:43 |
jrosser | we could move it into the same class of deploy as centos7 | 13:44 |
jrosser | and we have a transition already on stable/train? | 13:44 |
noonedeadpunk | At least we've dropped it on master | 13:44 |
noonedeadpunk | so yeah | 13:45 |
jrosser | that would mean a breaking change on train? | 13:45 |
jrosser | or can we just do part of it | 13:45 |
noonedeadpunk | I'm just afraid that with some time number of packages which will break for it will grow... | 13:46 |
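The idea floated here can be sketched in Python as two selection rules: the stable/train default quoted earlier (python2 on CentOS, python3 elsewhere) and the proposed variant that also puts Debian stretch on python2 because it ships py3.5. This is only an illustration of the proposal under discussion; the function names are hypothetical and nothing here reflects merged OSA behaviour.

```python
def current_rule(distribution):
    """Mirror of the stable/train default: python2 only on CentOS."""
    return "python2" if distribution == "CentOS" else "python3"


def proposed_rule(distribution, release):
    """Proposal from the chat: treat Debian stretch like CentOS.

    'release' is the distribution codename, e.g. "stretch" or "buster".
    """
    if distribution == "Debian" and release == "stretch":
        return "python2"
    return current_rule(distribution)


if __name__ == "__main__":
    for distro, rel in [("CentOS", "7"), ("Debian", "stretch"), ("Ubuntu", "bionic")]:
        print(distro, rel, current_rule(distro), proposed_rule(distro, rel))
```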
*** openstacking_123 has quit IRC | 13:49 | |
jrosser | kleini: maharg101 my AIO is past the utility container setup without error | 13:56 |
jrosser | this is from checking out stable/train | 13:56 |
jrosser | if things are going strange on a multinode deployment its sometimes useful to have an AIO build handy to compare against | 13:57 |
maharg101 | jrosser: containerised ? | 14:00 |
jrosser | with lxc, yes | 14:02 |
jrosser | ok setup-infrastructure just finished OK | 14:02 |
maharg101 | I don't know what difference AIO could make.. | 14:03 |
jrosser | this is what i did http://paste.openstack.org/show/791401/ | 14:03 |
jrosser | an AIO will use the auto-generated config, thats probably the largest difference | 14:04 |
jrosser | here is where ujson is needed http://paste.openstack.org/show/791403/ | 14:07 |
noonedeadpunk | oh, btw | 14:14 |
noonedeadpunk | ujson has been fixed with 2.0.3 version released in beginning of march | 14:15 |
noonedeadpunk | (or even 2.0.0) | 14:15 |
noonedeadpunk | so we probably should write it down to our overrides of constraints | 14:15 |
noonedeadpunk | I'll offer some patch I guess | 14:16 |
jrosser | noonedeadpunk: thats already in requirements u-c i think | 14:16 |
noonedeadpunk | but not for stable | 14:16 |
noonedeadpunk | for master only | 14:17 |
jrosser | i was interested to try to reproduce why that might fail on train | 14:17 |
jrosser | but i can't | 14:17 |
noonedeadpunk | It depends on hardware and things | 14:17 |
noonedeadpunk | Last week there was another person here who faced the same issue | 14:17 |
jrosser | oh is this a compiler specific thing? | 14:17 |
noonedeadpunk | yeah | 14:17 |
noonedeadpunk | there was even a ML regarding that... | 14:17 |
noonedeadpunk | http://lists.openstack.org/pipermail/openstack-discuss/2020-February/012376.html | 14:18 |
noonedeadpunk | sorry it's not the beginning of the thread:( | 14:18 |
*** jamesden_ has quit IRC | 14:21 | |
*** dave-mccowan has quit IRC | 14:23 | |
jrosser | noonedeadpunk: i have a patch for ujson | 14:25 |
jrosser | should we do master and backport or direct to the stable branches? | 14:25 |
noonedeadpunk | I think direct but thought to include it to https://review.opendev.org/#/c/715215/ | 14:26 |
noonedeadpunk | not sure about stein though - it seems that it raises issue only for py3 compiler... | 14:26 |
noonedeadpunk | we don't need it for master as it's already in u-c | 14:27 |
jrosser | shall i leave for you to add there? | 14:27 |
jrosser | it's just one line after all | 14:27 |
noonedeadpunk | I just edit global-requirements as well so it may be a conflict there... | 14:27 |
jrosser | also we need to unblock the lxc_<> roles | 14:28 |
noonedeadpunk | yeah | 14:28 |
jrosser | those are now preventing other things from merging | 14:28 |
noonedeadpunk | jrosser: or actually, we can build py3.6 for stretch with pyenv.... | 14:32 |
noonedeadpunk | but not sure if it's not too much effort... | 14:33 |
*** dave-mccowan has joined #openstack-ansible | 14:33 | |
jrosser | stretch is the biggest corner case we have as it's py3.5 | 14:37 |
jrosser | i figure we'd have to add pyenv to openstack-hosts? | 14:39 |
*** spatel has joined #openstack-ansible | 14:48 | |
spatel | Is there any vendor ethernet nic support SR-IOV with security-group feature? | 14:49 |
spatel | Mellanox doesn't support security-group with sriov | 14:49 |
jrosser | afaik the security group part sits either in iptables or ovs | 14:50 |
jrosser | so it doesnt really exist when using normal SRIOV | 14:50 |
jrosser | i would be checking out what the current state is with mellanox ASAP2 with regard to security groups | 14:51 |
*** Adri2000 has quit IRC | 14:51 | |
kleini | I doubt, that security groups can be run on hardware through SR-IOV. That is too complex. | 14:55 |
*** Adri2000 has joined #openstack-ansible | 14:56 | |
jrosser | mellanox have an openflow switch on the nic | 14:57 |
jrosser | in terms of the silicon a lot of this is possible, depending on things being wired up correctly into ovs offloading | 14:57 |
kleini | interesting | 15:00 |
*** macz_ has joined #openstack-ansible | 15:03 | |
*** dave-mccowan has quit IRC | 15:04 | |
*** dave-mccowan has joined #openstack-ansible | 15:06 | |
openstackgerrit | Dmitriy Rabotyagov (noonedeadpunk) proposed openstack/openstack-ansible stable/train: Fix python2 support https://review.opendev.org/715215 | 15:09 |
openstackgerrit | Dmitriy Rabotyagov (noonedeadpunk) proposed openstack/openstack-ansible stable/train: Bump SHAs for stable/train https://review.opendev.org/714310 | 15:13 |
noonedeadpunk | jrosser: I missed - what was the result for keystone on stable/stein? | 15:14 |
noonedeadpunk | Have we things to fix there or I can suggest a release? | 15:14 |
*** spatel has quit IRC | 15:14 | |
jrosser | i need to find it again - not sure which branch actually | 15:17 |
noonedeadpunk | I guess kleini had problems with stable/stein | 15:19 |
noonedeadpunk | while on the latest tag things were working | 15:19 |
noonedeadpunk | I can spawn another sandbox in case you've lost yours | 15:19 |
jrosser | this was in a zuul job perhaps a one off | 15:20 |
jrosser | stacktrace in the keystone log | 15:20 |
kleini | This happened for me with stable/stein including that mentioned commit. it does not happen with 19.0.8. will test tomorrow 19.0.11 | 15:21 |
*** spatel has joined #openstack-ansible | 15:22 | |
jrosser | noonedeadpunk: i cannot find the job with the odd keystone error | 15:25 |
jrosser | i'm fairly sure the exception was "slice indices must be integers or None" | 15:28 |
*** tsturm2 has joined #openstack-ansible | 15:28 | |
*** DanyC has quit IRC | 15:46 | |
*** DanyC has joined #openstack-ansible | 15:46 | |
jrosser | noonedeadpunk: this is part of the problem with the lxc roles https://review.opendev.org/#/c/707943/13/test-vars.yml | 15:49 |
jrosser | one of the lxc_container_create tests checks for there being no apparmor profile present | 15:49 |
jrosser | but we now define one unconditionally | 15:50 |
*** DanyC has quit IRC | 15:51 | |
* noonedeadpunk tries to recall the reason why I added that | 15:52 | |
jrosser | the unconfined profile is necessary to make buster work | 15:53 |
noonedeadpunk | but it breaks everything else? | 15:53 |
noonedeadpunk | not everything, but our tests I guess | 15:54 |
*** DanyC has joined #openstack-ansible | 15:54 | |
jrosser | yes, the side effect is breaking this https://github.com/openstack/openstack-ansible-lxc_container_create/blob/master/tests/test-containers-functional.yml#L53-L58 | 15:54 |
jrosser | because it is included with vars_file: common/test-vars.yml it is very hard to override for a specific host | 15:56 |
jrosser | and even then, it might be necessary to make a container start at all on buster | 15:56 |
jrosser | in which case that test can never pass | 15:56 |
*** udesale_ has quit IRC | 16:01 | |
noonedeadpunk | #startmeeting openstack_ansible_meeting | 16:04 |
openstack | Meeting started Tue Mar 31 16:04:37 2020 UTC and is due to finish in 60 minutes. The chair is noonedeadpunk. Information about MeetBot at http://wiki.debian.org/MeetBot. | 16:04 |
openstack | Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. | 16:04 |
*** openstack changes topic to " (Meeting topic: openstack_ansible_meeting)" | 16:04 | |
openstack | The meeting name has been set to 'openstack_ansible_meeting' | 16:04 |
noonedeadpunk | #topic office hours | 16:04 |
*** openstack changes topic to "office hours (Meeting topic: openstack_ansible_meeting)" | 16:04 | |
arxcruz | o/ | 16:04 |
jrosser | o/ | 16:05 |
noonedeadpunk | o/ | 16:06 |
*** rpittau is now known as rpittau|afk | 16:07 | |
noonedeadpunk | ok, so, arxcruz, take a world:) | 16:08 |
noonedeadpunk | *word | 16:08 |
noonedeadpunk | lol | 16:08 |
arxcruz | hehe | 16:08 |
arxcruz | so, we are working on consolidating our skip list in one single repository | 16:08 |
arxcruz | https://opendev.org/openstack/openstack-tempest-skiplist | 16:09 |
arxcruz | the idea is to have a tool that will give you a list of tests to be skipped based on job, release, and installer (tripleo, osa, etc) | 16:09 |
arxcruz | also, an ansible module to call it directly from ansible | 16:09 |
arxcruz | we want it integrated with os_tempest as much as possible as well | 16:09 |
arxcruz | the idea is call something like tempest-skip --release master --job bla | 16:10 |
arxcruz | and it return the skipped tests that we can pass to tempest | 16:10 |
arxcruz | if anyone is interested in help, you are more than welcome, we are now in phase of discuss what the tool will do, and how | 16:11 |
arxcruz | so it's a good start point :) | 16:11 |
arxcruz | we are doing this, because now, tripleo have jobs per component | 16:11 |
arxcruz | tripleo-component-compute | 16:11 |
arxcruz | tripleo-component-network | 16:11 |
arxcruz | and sometimes we see tests failing in one job, but not in the other, because the component have a bug or whatever other reason | 16:12 |
arxcruz | so we need now to be able to have a skip list per job/release | 16:12 |
arxcruz | and we were for a long time wanting to have the skip list in their own repository | 16:12 |
arxcruz | instead of use the one we are using right now, that is from our now deprecated validate-tempest role | 16:12 |
arxcruz | if osa are interested on this approach, it would be nice to coordinate collaboration :) | 16:13 |
arxcruz | that's it :) | 16:13 |
noonedeadpunk | ok, I see. Not really sure I got how ansible module should act. Like what it should do except running that command and what output it will provide? | 16:14 |
arxcruz | the mvp is call this command, and it return a list of the tests to be skipped, that can be saved in a txt file and pass to tempest | 16:14 |
arxcruz | as we are doing today | 16:15 |
arxcruz | have an ansible module is just an idea if that will be done, or it would be easier to just call the command we are discussing | 16:15 |
jrosser | vars_files: "{{ release ~ '/' ~ job ~ '/skiplist.yml' }}" | 16:15 |
noonedeadpunk | Ok, so it's output can be registered and passed to tempest role include as a variable? | 16:15 |
arxcruz | yes | 16:16 |
arxcruz | probably can be done | 16:16 |
arxcruz | as i said, we are in the beginning | 16:16 |
arxcruz | planning everything | 16:16 |
noonedeadpunk | actually yes, I like jrosser's way of thinking... | 16:16 |
jrosser | this can probably be an ansible role that is called with branch/job and a var name | 16:16 |
jrosser | it then set_fact that var name | 16:17 |
jrosser | then everything is nicely decoupled | 16:17 |
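The decoupled approach suggested here has two small pieces: locate a per-release, per-job skip list file, and expose its contents under a variable name chosen by the caller. A Python sketch of both follows, assuming the `release/job/skiplist.yml` layout from the vars_files suggestion. The function names and the variable name `example_skip_list` are hypothetical and are not os_tempest variables.

```python
import posixpath


def skiplist_vars_file(release, job, filename="skiplist.yml"):
    """Build the relative path release/job/skiplist.yml."""
    return posixpath.join(release, job, filename)


def skiplist_fact(var_name, tests):
    """Return a one-key mapping, analogous to set_fact on a chosen name."""
    return {var_name: list(tests)}


if __name__ == "__main__":
    print(skiplist_vars_file("master", "bla"))
    print(skiplist_fact("example_skip_list", ["tempest.api.ExampleTest.test_one"]))
```

Because the caller picks the variable name, the consuming role does not need to know where the list came from.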
arxcruz | yup, can be done in this way | 16:18 |
arxcruz | but i really looking for more integration between tripleo and osa :D | 16:18 |
jrosser | maybe these can all co-exist | 16:18 |
arxcruz | and have it integrated in os_tempest role | 16:18 |
arxcruz | not only for us, but to osa | 16:18 |
jrosser | i expect OSA would prefer something natively ansible in preference to a cli tool | 16:18 |
arxcruz | and that's why I wanted to have an ansible module or role | 16:19 |
jrosser | sure | 16:19 |
jrosser | is there anything you would like to specifically integrate in os_tempest? | 16:19 |
jrosser | roles calling roles can get messy | 16:20 |
arxcruz | I would like that the skip list used by osa be there as well :) | 16:20 |
arxcruz | of course cores would be by both groups | 16:20 |
jrosser | right - so if we could set a var with a role that generated the skip list we can pass it to os_tempest today | 16:20 |
arxcruz | yup | 16:21 |
arxcruz | we can work in this direction | 16:21 |
jrosser | and that would get wired in somewhere like this https://github.com/openstack/openstack-ansible/blob/master/playbooks/os-tempest-install.yml#L31-L33 | 16:21 |
jrosser | i have to be afk for a while | 16:22 |
arxcruz | sure | 16:22 |
jrosser | noonedeadpunk maybe you have some thoughts too? | 16:22 |
arxcruz | anyway, we are working now on how the tool will work, so it might take a while until we are in a position to make everything work together | 16:23 |
arxcruz | so, all help are welcome :) | 16:23 |
arxcruz | that's all from me | 16:23 |
noonedeadpunk | Yeah, I actually think that roles should remain as lightweight as possible. As we have option to write blacklists it's good to use it. If something needs to be adjusted in os_tempest regarding format of passed variables - it's good | 16:24 |
noonedeadpunk | But not sure that we should add this module as a requirement to the role | 16:24 |
noonedeadpunk | As it will influence not so well in case of role standalone usage | 16:24 |
arxcruz | I see | 16:24 |
arxcruz | yeah, we can think about it in the future | 16:25 |
arxcruz | when we have something to show :D | 16:25 |
noonedeadpunk | actually even if we make such dependency - another var should be passed to notify whether use it or not | 16:26 |
noonedeadpunk | But I think that we also may be using your blacklisting role for our CI jobs as well | 16:27 |
noonedeadpunk | so I probably pretty interested to have such tooling | 16:29 |
arxcruz | cool :) | 16:30 |
arxcruz | glad to hear :) | 16:30 |
*** evrardjp has quit IRC | 16:36 | |
*** evrardjp has joined #openstack-ansible | 16:36 | |
velmeran | So I got everything up and running last night, could login to the web interface and even uploaded an image. This morning looking at things I found my compute node is not there. | 16:39 |
velmeran | looking at the system, the service for the neutron agent crashing/restarting constantly | 16:39 |
velmeran | neutron-linuxbridge-agent: 2020-03-31 09:38:10.766 18509 ERROR neutron.plugins.ml2.drivers.linuxbridge.agent.linuxbridge_neutron_agent [-] Interface eth12 for physical network flat does not exist. Agent terminated! | 16:39 |
noonedeadpunk | velmeran: sorry we have kinda meeting here :p | 16:40 |
noonedeadpunk | at least trying to have:) | 16:40 |
velmeran | ah no problem | 16:40 |
noonedeadpunk | Ok, so another thing I wanted to say is that our rocky finally entered EM | 16:40 |
noonedeadpunk | and I hope that train bump will be merged soon as well | 16:41 |
noonedeadpunk | btw, openstack seems not to be supporting python 3.5 which comes with debian stretch | 16:41 |
noonedeadpunk | however, we deploy venvs on py3.5 there and CI says it's working | 16:41 |
noonedeadpunk | so we can kinda continue doing that or can actually rollback to py2... | 16:42 |
noonedeadpunk | which will be kinda regression for users | 16:42 |
* jrosser back | 16:53 | |
*** pcaruana has quit IRC | 16:56 | |
noonedeadpunk | jrosser: do you have some thoughts on this? | 16:59 |
jrosser | the easiest thing would be to not deploy rally, on stretch | 16:59 |
noonedeadpunk | In terms of rally, it can be deployed on py3 I believe | 17:00 |
*** ryneq has quit IRC | 17:00 | |
jrosser | so the issue there is the lack of py3 support on train | 17:01 |
noonedeadpunk | the thing is that py3.5 has not been tested according to https://governance.openstack.org/tc/reference/runtimes/train.html#python-runtime-for-train | 17:01 |
jrosser | hrrm well yes then the whole business of deploying on stretch is not supported on that basis? | 17:01 |
noonedeadpunk | smth like that | 17:02 |
noonedeadpunk | despite it works now | 17:02 |
jrosser | maybe we start small | 17:02 |
noonedeadpunk | (probably) | 17:02 |
jrosser | backport the necessary changes to python_venv_build, which we are going to need anyway | 17:02 |
jrosser | and then switch over just the utility host stuff | 17:02 |
jrosser | but it will still fail though? | 17:02 |
jrosser | because 3.5 | 17:03 |
noonedeadpunk | nope. but we don't run tempest against all projects tbh | 17:03 |
jrosser | i thought the main issue was the installation of rally requiring >= py3.6 | 17:03 |
noonedeadpunk | what do you want to backport for python_venv_build? | 17:03 |
noonedeadpunk | jrosser: yeah, in case it's from master | 17:04 |
jrosser | i fear we may be talking about different things :) | 17:04 |
noonedeadpunk | but I think we can bump rally to 1.7 and live with it | 17:04 |
noonedeadpunk | Also we maybe should do it in a better way but currently it's not easy without a circular dependency | 17:05 |
noonedeadpunk | ok. so I think we have 2 problems now. Rally that supports only py3.6+ | 17:05 |
noonedeadpunk | and openstack not tested with 3.5 (but it seems to work as for now) | 17:06 |
*** andrewbonney has quit IRC | 17:07 | |
noonedeadpunk | I think between not being able to deploy rally and deploying <3.0.0 it's better to choose deploying <3.0.0? | 17:08 |
jrosser | yes i would agree | 17:08 |
noonedeadpunk | And actually https://review.opendev.org/#/c/715215/ passes for debian | 17:08 |
jrosser | and that patch also fixes centos | 17:09 |
noonedeadpunk | yeah | 17:09 |
noonedeadpunk | it's a bit messy | 17:09 |
noonedeadpunk | but can't imagine another cleaner patch without disabling half of the ci | 17:09 |
jrosser | it's ok - these are all external things that have changed underneath us | 17:10 |
*** pcaruana has joined #openstack-ansible | 17:10 | |
jrosser | i think we would better spend the time getting the backlog of patches in good shape than worry too much about stretch | 17:10 |
jrosser | unless there are some deployments that are depending on something we are missing | 17:11 |
noonedeadpunk | yeah, agree | 17:11 |
noonedeadpunk | so I think we almost have clean branches then | 17:12 |
jrosser | i need to go AFK again (TZ changed this is now an hour later for me) | 17:12 |
noonedeadpunk | except lxc thing | 17:12 |
noonedeadpunk | changed for me as well... | 17:12 |
noonedeadpunk | ok, then I think we're done | 17:12 |
noonedeadpunk | #endmeeting | 17:12 |
*** openstack changes topic to "Launchpad: https://launchpad.net/openstack-ansible || Weekly Meetings: https://wiki.openstack.org/wiki/Meetings/openstack-ansible || Review Dashboard: https://bit.ly/2SAcGAn" | 17:13 | |
openstack | Meeting ended Tue Mar 31 17:12:57 2020 UTC. Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4) | 17:13 |
jrosser | yes there are still small things - like my cleanup for infra pip/virtualenv has broken again | 17:13 |
openstack | Minutes: http://eavesdrop.openstack.org/meetings/openstack_ansible_meeting/2020/openstack_ansible_meeting.2020-03-31-16.04.html | 17:13 |
openstack | Minutes (text): http://eavesdrop.openstack.org/meetings/openstack_ansible_meeting/2020/openstack_ansible_meeting.2020-03-31-16.04.txt | 17:13 |
openstack | Log: http://eavesdrop.openstack.org/meetings/openstack_ansible_meeting/2020/openstack_ansible_meeting.2020-03-31-16.04.log.html | 17:13 |
noonedeadpunk | jrosser: sorry:( | 17:13 |
jrosser | don't worry :) | 17:13 |
noonedeadpunk | oh, seems we're cleaning up too good now https://zuul.opendev.org/t/openstack/build/337476d000a74b0da7e8ff9ea7fc26b5/log/job-output.txt#4876 | 17:15 |
noonedeadpunk | oh, btw, regarding zun, I think we should try to merge that https://review.opendev.org/#/c/692407/ | 17:20 |
noonedeadpunk | and work further only with zun role | 17:20 |
noonedeadpunk | I guess the integration itself is useful and according to sandbox it works ok | 17:20 |
openstackgerrit | Dmitriy Rabotyagov (noonedeadpunk) proposed openstack/openstack-ansible stable/train: Set bionic upgrade job to voting https://review.opendev.org/708238 | 17:21 |
*** jbadiapa has quit IRC | 17:52 | |
mnaser | noonedeadpunk: simple backport https://review.opendev.org/716274 | 17:59 |
mnaser | https://review.opendev.org/655331 another simple change | 18:00 |
mnaser | https://review.opendev.org/690664 another trivial change | 18:03 |
mnaser | and https://review.opendev.org/665238 :) | 18:04 |
noonedeadpunk | last two not so simple as have conflicts | 18:04 |
mnaser | (sorry, i'm going over all of our changes) | 18:04 |
mnaser | oh they do? crap | 18:05 |
mnaser | i don't see them in gertty | 18:05 |
mnaser | oh they do. | 18:05 |
openstackgerrit | Mohammed Naser proposed openstack/openstack-ansible-lxc_container_create master: Use lxc-3 syntax for tests https://review.opendev.org/665238 | 18:05 |
mnaser | that's a rebase | 18:06 |
noonedeadpunk | the last one will fail I guess as tests are broken now:( | 18:06 |
noonedeadpunk | but yeah | 18:06 |
mnaser | i thought they were broken for train only? | 18:06 |
noonedeadpunk | I guess we just didn't try to land anything for master | 18:07 |
noonedeadpunk | but not sure though | 18:07 |
noonedeadpunk | I can rebase these ones | 18:08 |
noonedeadpunk | if you wish | 18:08 |
noonedeadpunk | velmeran: I guess we've lost your questions while meeting( | 18:08 |
noonedeadpunk | what's in your provider_networks? | 18:08 |
noonedeadpunk | I guess you've set br-vlan on eth12? | 18:09 |
mnaser | noonedeadpunk: yeah.. can you also do the retirement of pip_install and repo_build so we don't "Release" them when we don't use them? | 18:10 |
velmeran | Hi, I think I figured out the issue, it's to do with the mapping of physical_interface_mappings = vlan:eth12,flat:br-vlan | 18:10 |
noonedeadpunk | yeah | 18:10 |
noonedeadpunk | velmeran: yep and it's set in openstack_user_config.yml in provider_networks | 18:10 |
noonedeadpunk | I guess you have this https://opendev.org/openstack/openstack-ansible/src/branch/master/etc/openstack_deploy/openstack_user_config.yml.aio.j2#L98-L106 | 18:11 |
noonedeadpunk | So you should use only valid interfaces that you do have on your hosts | 18:11 |
velmeran | I'm on centos, so I don't have a /etc/network/interfaces where someone showed how to make a fake eth12 on the interface for it to bind, so trying to sort that out | 18:11 |
noonedeadpunk | in terms of br-vlan (flat) you can just omit it | 18:11 |
noonedeadpunk | and use vxlan instead | 18:11 |
velmeran | like, just remove the whole -network section for the flat part? | 18:12 |
noonedeadpunk | yep | 18:12 |
velmeran | okay, let me give that a try. | 18:12 |
noonedeadpunk | and re run os-neutron-install.yml | 18:13 |
noonedeadpunk | velmeran: the flat part which relates to br-vlan) | 18:13 |
*** mgariepy has quit IRC | 18:15 | |
*** miloa has quit IRC | 18:19 | |
*** mgariepy has joined #openstack-ansible | 18:21 | |
openstackgerrit | Dmitriy Rabotyagov (noonedeadpunk) proposed openstack/openstack-ansible-galera_server master: tox: Keeping going with docs https://review.opendev.org/690664 | 18:23 |
velmeran | Okay, that seems to have gotten the systems happier with the linuxbridge, but on the compute node I'm now getting errors about the oslo.messaging drivers referencing rabbit, giving me a connection failed. compute node still doesn't seem to show up in any services/agent queries from openstack. | 18:27 |
velmeran | I only did the neutron install playbook, maybe I need to do the full openstack install to fix rabbit? | 18:28 |
noonedeadpunk | and what do they look like?:) | 18:30 |
noonedeadpunk | errors I mean. can you paste them? | 18:31 |
velmeran | Mar 31 11:25:48 opnstack-cmpute-01 neutron-linuxbridge-agent: 2020-03-31 11:25:48.025 1519 ERROR oslo.messaging._drivers.impl_rabbit [req-4e0413d1-054e-4b3b-9bca-f5c4e574d6fe - - - - -] Connection failed: [Errno 113] EHOSTUNREACH (retrying in 6.0 seconds): error: [Errno 113] EHOSTUNREACH | 18:32 |
velmeran | the main controller doesn't have any errors at the moment. | 18:32 |
noonedeadpunk | this looks like some networking issue.... | 18:32 |
noonedeadpunk | So is the mgmt network present on compute nodes and are controllers reachable through it? | 18:33 |
noonedeadpunk | and actually are rabbit containers launched | 18:34 |
*** mgariepy has quit IRC | 18:35 | |
velmeran | humm. so rabbit container is running, but I can't ping it from compute. | 18:36 |
velmeran | should I have a lxcbr0 on compute? | 18:36 |
noonedeadpunk | no | 18:37 |
noonedeadpunk | but you should have mgmt network there | 18:37 |
noonedeadpunk | the one which is on eth1 of rabbit container | 18:37 |
noonedeadpunk | (generally) | 18:37 |
velmeran | yea, controller can ping the rabbit mgmt network address. | 18:37 |
noonedeadpunk | and another controller? | 18:38 |
noonedeadpunk | if you have several ofc | 18:38 |
noonedeadpunk | like management network should be present and reachable on every host and container | 18:39 |
noonedeadpunk | nice, train bump passed ci https://review.opendev.org/#/c/714310/ | 18:40 |
velmeran | just a single controller, compute, storage system. storage can't ping rabbit either | 18:41 |
noonedeadpunk | I guess storage can miss mgmt network if it's required | 18:42 |
noonedeadpunk | storage is connected generally with storage net | 18:43 |
noonedeadpunk | So I think you should look into your networking and how mgmt net is configured | 18:43 |
velmeran | yea, something strange is up. I have that mgmt network setup as a /22 on the switch, router, and on each server, and everything seems to talk to the 10.0.60.X subnet, but the 10.0.61,62,63 aren't talking. | 18:45 |
jrosser | velmeran: is this physical hosts? | 18:46 |
velmeran | no, all three hosts are just VMs, I am mostly just testing deployment before I get some hardware in to do a real deploy. | 18:47 |
*** mgariepy has joined #openstack-ansible | 18:48 | |
jrosser | you will need to make sure that whatever you are using to provide the VM does not do any kind of mac address security | 18:50 |
jrosser | "just VMs" brings a whole load of baggage that you don't get with physical hosts, and what you describe happens very commonly when people try to test out multinode openstack in vmware, for example | 18:50 |
*** DanyC has quit IRC | 18:52 | |
velmeran | yea, I wouldn't be surprised if that was causing issues. right now it seems the containers on the controller can see each other, can see the controller, and can get internet, but they aren't seeing anything else. | 18:53 |
openstackgerrit | Dmitriy Rabotyagov (noonedeadpunk) proposed openstack/openstack-ansible-repo_build master: Retire openstack-ansible-repo_build role https://review.opendev.org/716389 | 18:55 |
jrosser | velmeran: it is likely because the VM is only allowed to send from the mac address that the hypervisor thinks it has | 18:56 |
*** d34dh0r53 has quit IRC | 18:56 | |
jrosser | but these many more interfaces you've created have a bunch of mac/ip which it doesn't understand | 18:57 |
*** cloudnull has quit IRC | 18:57 | |
velmeran | okay. I'll do some research, it's a VMware setup with a dvSwitch, using a trunk port setup to pass vlans in that then get defined by the host. | 18:58 |
jrosser | jamesdenton: ^ do you have any tips here for vmware? | 18:59 |
openstackgerrit | Dmitriy Rabotyagov (noonedeadpunk) proposed openstack/openstack-ansible-repo_build master: Retire openstack-ansible-repo_build role https://review.opendev.org/716389 | 19:00 |
*** tsturm2 has quit IRC | 19:06 | |
*** Soopaman has joined #openstack-ansible | 19:08 | |
openstackgerrit | Dmitriy Rabotyagov (noonedeadpunk) proposed openstack/openstack-ansible-repo_build master: Retire openstack-ansible-repo_build role https://review.opendev.org/716389 | 19:16 |
noonedeadpunk | mnaser: my turn to ping you for a vote :p https://review.opendev.org/#/c/715215/ https://review.opendev.org/#/c/714310/ https://review.opendev.org/#/c/708238/ | 19:17 |
mnaser | noonedeadpunk: nice, +3, does that mean once this lands, stable/train should be unblocked? | 19:19 |
noonedeadpunk | yeah | 19:21 |
noonedeadpunk | actually only https://review.opendev.org/#/c/715215/ would be enough | 19:22 |
noonedeadpunk | (to unblock train) | 19:22 |
mnaser | cool, once that lands, let's do a mass-recheck | 19:23 |
*** DanyC has joined #openstack-ansible | 19:24 | |
*** DanyC has quit IRC | 19:29 | |
openstackgerrit | Merged openstack/openstack-ansible-galera_server master: tox: Keeping going with docs https://review.opendev.org/690664 | 19:30 |
*** mgariepy has quit IRC | 20:21 | |
openstackgerrit | Merged openstack/openstack-ansible-ceph_client stable/train: Importing keyrings from files rather than from mons https://review.opendev.org/716274 | 20:26 |
*** mgariepy has joined #openstack-ansible | 20:34 | |
*** kleini has quit IRC | 20:40 | |
*** gshippey has quit IRC | 20:46 | |
*** spatel has quit IRC | 21:44 | |
*** jamesden_ has joined #openstack-ansible | 22:21 | |
*** DanyC has joined #openstack-ansible | 22:33 | |
*** jamesden_ has quit IRC | 22:37 | |
*** DanyC has quit IRC | 22:39 | |
velmeran | Okay. I got it working... 99% reading, 1% making three simple changes on the dvswitch in vcenter... | 22:44 |
*** Soopaman has quit IRC | 22:49 | |
*** tosky has quit IRC | 22:54 | |
*** macz_ has quit IRC | 23:06 | |
*** ianychoi has quit IRC | 23:15 | |
*** ianychoi has joined #openstack-ansible | 23:16 | |
*** rh-jelabarre has quit IRC | 23:43 | |
openstackgerrit | Merged openstack/openstack-ansible-ceph_client master: Updated from OpenStack Ansible Tests https://review.opendev.org/712359 | 23:49 |
velmeran | So here are my notes on getting things "working" on ESXi/vCenter 6.7 and Centos7 Latest: http://paste.openstack.org/show/791427/ | 23:49 |
velmeran | Hope that helps people out. | 23:49 |
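
The "Interface eth12 for physical network flat does not exist" error discussed above comes down to a mapping that names an interface the host doesn't have. A minimal Python sketch of that check follows, assuming the mapping string has the physnet:interface,physnet:interface shape velmeran quoted. The helper names and the example interface list are hypothetical and are not part of neutron or openstack-ansible.

```python
import socket


def parse_mappings(value):
    """Split 'physnet:iface,physnet:iface' into a {physnet: iface} dict."""
    mappings = {}
    for pair in value.split(","):
        pair = pair.strip()
        if not pair:
            continue
        physnet, _, iface = pair.partition(":")
        mappings[physnet.strip()] = iface.strip()
    return mappings


def missing_interfaces(mappings, present):
    """Return the mappings whose interface is not in the 'present' list."""
    present = set(present)
    return {net: iface for net, iface in mappings.items() if iface not in present}


def host_interfaces():
    """Names of the interfaces on this host (Unix only)."""
    return [name for _index, name in socket.if_nameindex()]


# Mapping string as quoted in the log; the interface list is a made-up example.
quoted = parse_mappings("vlan:eth12,flat:br-vlan")
example_host = ["lo", "eth0", "br-mgmt", "br-vxlan", "br-vlan"]
print(missing_interfaces(quoted, example_host))
```

On a real host, pass host_interfaces() instead of the example list. Anything reported as missing either needs to exist on the host or needs its provider_networks entry removed, which is the route taken in the log.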
Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!
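
When only part of a /22 management network answers, as with the 10.0.60.X versus 10.0.61-63 split described in the log, it's worth first ruling out a wrong prefix length on a host. A small sketch with Python's standard ipaddress module follows, assuming the network is 10.0.60.0/22. The log gives only the /22 size and the 10.0.60-63 range, so the exact network address and the sample host addresses are assumptions.

```python
import ipaddress

# Assumed management network; the log only says "/22" and 10.0.60-63.
MGMT_NET = ipaddress.ip_network("10.0.60.0/22")


def in_network(addr, network=MGMT_NET):
    """True if addr falls inside the given network."""
    return ipaddress.ip_address(addr) in network


# A /22 spans four consecutive /24 blocks, here 10.0.60.0 to 10.0.63.255.
print(MGMT_NET.num_addresses)        # 1024
print(in_network("10.0.61.5"))       # True

# A host mistakenly configured with a /24 would treat 10.0.61.x as off-link.
too_narrow = ipaddress.ip_network("10.0.60.0/24")
print(in_network("10.0.61.5", too_narrow))   # False
```

If every host passes this check, as seems to have been the case here, the addressing is fine and the problem lies elsewhere. In the log it turned out to be settings on the VMware dvSwitch.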