nibalizer | hrm | 00:00 |
---|---|---|
nibalizer | jhesketh: I'll fix it for that | 00:01 |
fungi | del can be used to remove a key from an associative array, or to delete an arbitrary python object reference for that matter | 00:10 |
fungi | sort of the inverse of variable assignment | 00:10 |
* jhesketh 's irc client hasn't been pinging him | 00:19 | |
jhesketh | anteaya: cool thanks | 00:19 |
jhesketh | nibalizer: yeah if the object doesn't exist you get an error (key maybe, I can't remember) | 00:20 |
Clint | yeah, KeyError | 00:27 |
nibalizer | jhesketh: we are wrapping up | 00:28 |
nibalizer | talk to you on the other side | 00:28 |
jhesketh | nibalizer: no trouble | 00:28 |
jhesketh | have a good evening all! | 00:29 |
anteaya | thanks jhesketh, have a great day | 00:29 |
jhesketh | will do | 00:29 |
anteaya | see you tomorrow | 00:29 |
jhesketh | :-) | 00:29 |
*** rfolco has quit IRC | 00:41 | |
*** rfolco has joined #openstack-sprint | 00:43 | |
*** rfolco has quit IRC | 00:43 | |
*** imcsk8_ has joined #openstack-sprint | 01:26 | |
*** baoli has joined #openstack-sprint | 01:58 | |
*** yolanda has quit IRC | 02:28 | |
*** sivaramakrishna has joined #openstack-sprint | 02:46 | |
*** baoli has quit IRC | 03:55 | |
*** baoli has joined #openstack-sprint | 04:03 | |
*** yolanda has joined #openstack-sprint | 04:24 | |
*** baoli has quit IRC | 04:28 | |
*** baoli has joined #openstack-sprint | 04:29 | |
*** baoli has quit IRC | 04:39 | |
clarkb | pretty sure that the issue with making a mirror was security groups | 04:40 |
clarkb | we have two rules without a source ip range | 04:40 |
clarkb | this is also very funky with openstackclient and I do not recommend it | 04:40 |
clarkb | puppet is building a mirror now | 05:02 |
clarkb | if this works we can add dns records in the morning and run nodepool against the cloud | 05:02 |
clarkb | the problem with security group rules was we had the defaults, no port 22 ingress just an intergroup rule which is useless for external communication. I deleted the two intergroup rules (ipv4 and ipv6) and added two rules that allow all ipv4 and all ipv6 | 05:04 |
clarkb | we typically have to do this in every cloud we setup as users so this is expected | 05:04 |
clarkb | http://15.184.54.245/ tada | 05:17 |
clarkb | apache did not start after the first reboot, so going to see if that is consistent but its up | 05:17 |
clarkb | will be down a few minutes while I reboot | 05:18 |
clarkb | `AH00558: apache2: Could not reliably determine the server's fully qualified domain name, using 127.0.0.1. Set the 'ServerName' directive globally to suppress this message`. I think that may be the issue; I had to manually restart apache after rebooting | 05:20 |
clarkb | but we can figure that out in the morning, otherwise we have a mirror | 05:20 |
*** openstack has joined #openstack-sprint | 13:21 | |
*** mrmartin has quit IRC | 13:55 | |
*** rfolco has joined #openstack-sprint | 14:01 | |
pabelanger | clarkb: nice! | 14:03 |
pabelanger | Ya, I think ansible-launch-node could be used to ensure sane defaults for security groups too | 14:04 |
pabelanger | at least give a warning if they are missing | 14:04 |
pabelanger | heading down to lobby for some breakfast | 14:05 |
*** AJaeger has joined #openstack-sprint | 14:19 | |
fungi | on my way to lobby to meet for pancakes | 14:26 |
mordred | pabelanger: yes. I agree | 14:30 |
*** krtaylor has quit IRC | 15:02 | |
*** mrmartin has joined #openstack-sprint | 15:05 | |
*** krtaylor has joined #openstack-sprint | 15:15 | |
*** yolanda has quit IRC | 15:24 | |
*** krtaylor has quit IRC | 15:34 | |
*** krtaylor has joined #openstack-sprint | 15:47 | |
*** baoli has quit IRC | 15:58 | |
*** ricky1 has joined #openstack-sprint | 15:58 | |
*** ricky1 has left #openstack-sprint | 15:59 | |
*** baoli has joined #openstack-sprint | 15:59 | |
*** ricky1 has joined #openstack-sprint | 16:04 | |
mordred | morning everybody - I hit high travel burnout levels and went home instead of joining you in fort collins | 16:08 |
mordred | downside - I don't get to beer with everyone | 16:08 |
mordred | upside - I probably won't snap and kill anyone with an ax | 16:08 |
*** dfflanders has joined #openstack-sprint | 16:10 | |
*** ricky1 has quit IRC | 16:13 | |
jeblair | mordred: we're having a conversation about booting and dns etc | 16:14 |
* crinkle approves not being axemurdered | 16:14 | |
clarkb | rcarrillocruz: http://paste.openstack.org/show/488043/ is what I did to openstackci security groups | 16:14 |
jeblair | mordred: maybe we should get on the pbx? | 16:14 |
clarkb | rcarrillocruz: basically delete the intergroup rule because it is redundant when you allow all traffic and they don't perform well. Then add some rules to allow all ipv4 and ipv6 traffic | 16:14 |
mordred | jeblair: yes. maybe so | 16:15 |
*** yolanda has joined #openstack-sprint | 16:16 | |
*** mrmartin has quit IRC | 16:17 | |
*** yolanda has quit IRC | 16:18 | |
*** yolanda has joined #openstack-sprint | 16:19 | |
rcarrillocruz | ok | 16:22 |
rcarrillocruz | clarkb: | 16:22 |
rcarrillocruz | was able to reconnect to my bouncer | 16:22 |
rcarrillocruz | thx | 16:22 |
mordred | /var/lib/puppet/reports/controller00.hpuswest.ic.openstack.org/ | 16:23 |
mordred | dest: /var/lib/puppet/reports/{{ ansible_fqdn }} | 16:24 |
mordred | src: "{{ puppet_logfile }}" | 16:25 |
mordred | logfile: "{{ puppet_logfile }}" | 16:25 |
clarkb | rcarrillocruz: http://paste.openstack.org/show/488043/ | 16:26 |
rcarrillocruz | ack | 16:26 |
nibalizer | ohai | 16:28 |
nibalizer | i am at the security checkpoint thing | 16:28 |
nibalizer | can someone fetch me? | 16:28 |
Clint | nibalizer: on it | 16:28 |
nibalizer | ty | 16:29 |
mordred | https://review.openstack.org/284236 | 16:30 |
jeblair | Clint: https://review.openstack.org/275491 | 16:30 |
jeblair | clarkb: ^ | 16:30 |
mordred | r = requests.post(endpoint, json=payload, **requests_kwargs) | 16:36 |
mordred | https://review.openstack.org/284243 | 16:42 |
rcarrillocruz | yolanda: https://review.openstack.org/#/c/284246/ | 16:46 |
*** baoli_ has joined #openstack-sprint | 16:46 | |
*** baoli has quit IRC | 16:48 | |
clarkb | rcarrillocruz: you also need to delete the existing group rules | 16:52 |
clarkb | nova falls over when you add too many hosts to a security group that has group rules | 16:54 |
clarkb | so you actually need to delete them :/ | 16:54 |
clarkb | rcarrillocruz: it may be easiest, to delete all rules, then add the ingress and egress back in? | 16:55 |
rcarrillocruz | clarkb: i'm dealing with another thing on east baremetal, where i'm testing things | 16:58 |
rcarrillocruz | http://paste.openstack.org/show/488051/ | 16:59 |
rcarrillocruz | can you tell me if you have two different 'default' secgroups in west | 16:59 |
rcarrillocruz | cos that's another edge case i need to fix then | 16:59 |
clarkb | wow you have two default security groups | 16:59 |
jeblair | clarkb: sigh | 16:59 |
rcarrillocruz | yeah , w.t.f. | 16:59 |
clarkb | rcarrillocruz: are you using admin creds? | 17:01 |
clarkb | I wonder if those are two different security groups for different users | 17:01 |
rcarrillocruz | yeah | 17:01 |
rcarrillocruz | admin | 17:01 |
clarkb | rcarrillocruz: I think that is it | 17:02 |
clarkb | jeblair: https://review.openstack.org/#/c/283944/ | 17:02 |
rcarrillocruz | so what user do you use in west | 17:02 |
rcarrillocruz | openstackci | 17:02 |
rcarrillocruz | ? | 17:02 |
rcarrillocruz | on the clouds.yaml | 17:02 |
rcarrillocruz | ... | 17:02 |
clarkb | rcarrillocruz: I used openstackci because that is where I booted the mirror, but now we have to do the same for openstackjenkins | 17:03 |
clarkb | basically each user needs the same config | 17:03 |
*** mjturek1 has left #openstack-sprint | 17:03 | |
med_ | Is mordred in FOCO/HPE today? | 17:05 |
Clint | in a way | 17:05 |
clarkb | `355739 \| infracloud-west \| ubuntu-trusty \| ubuntu-trusty \| 1456332856 \| None \| None \| building \| 00:00:11:55` | 17:06 |
mordred | med_: nope. I'm at home and have dialed in | 17:06 |
mordred | clarkb: so - it is possible to configure the global default security group | 17:07 |
mordred | clarkb: that changes what the definition of the security group is that gets defined on project creation | 17:07 |
mordred | clarkb: I THINK | 17:07 |
mordred | clarkb: but I'm not sure it's worth doing that - since as you point out we have to fix this on our other clouds too | 17:08 |
clarkb | mordred: last time we looked into that I thought we decided we couldn't do it | 17:08 |
mordred | clarkb: so just having some ansible that makes the security groups be what we want is probably more sane | 17:08 |
clarkb | because neutron is opinionated | 17:08 |
mordred | oh - that's also possible | 17:08 |
fungi | also there's always the chance we need to fix something about our security groups after the project is created, and changing the global default only fixes it for projects you create after you change that, as memory serves | 17:10 |
mordred | yah | 17:12 |
clarkb | mordred: the latest run seems to have run with your changes in and it failed | 17:21 |
clarkb | mordred: the file is logged and noted in the puppet_run_all logfile | 17:21 |
mordred | woot | 17:22 |
clarkb | mordred: I am assuming your next thing is to try and post it directly? | 17:22 |
mordred | clarkb: that is a much nicer error message | 17:23 |
mordred | nibalizer: regarding that ^^ | 17:23 |
mordred | nibalizer: is there any way that puppetdb is breaking because there is no controller cert in the puppetmaster CA even though we're connecting as puppetmaster? | 17:24 |
mordred | nibalizer: the error is: | 17:24 |
mordred | Exception: [Errno 336265218] _ssl.c:355: error:140B0002:SSL routines:SSL_CTX_use_PrivateKey_file:system lib" | 17:24 |
mordred | use_PrivateKey makes me think something is not authing in the way we think | 17:25 |
clarkb | jeblair: OS_CLIENT_CONFIG_FILE=/etc/openstack/all-clouds.yaml python shade-launch-node.py --cloud openstackci-infracloud-west --region RegionOne --flavor mirror --image ubuntu-trusty --config-drive $FQDN | 17:34 |
nibalizer | mordred: woot | 17:37 |
nibalizer | mordred: a little nitpick on one of the patches | 17:44 |
nibalizer | otherwise lgtm | 17:44 |
mordred | nibalizer: k | 17:44 |
nibalizer | can we get another core on 284290 and 284292 | 17:50 |
rcarrillocruz | mordred: was hoping to chat about the ansible creation of initial resources of our clouds thing. I've been pushing random playbooks to system-config (as you probably saw), but I was thinking of pulling the cloud launcher bits from infra-ansible into its own repo, as an ansible role | 17:56 |
rcarrillocruz | so infra can use | 17:56 |
rcarrillocruz | infra-ansible can use it | 17:56 |
rcarrillocruz | other things can use it | 17:56 |
rcarrillocruz | thoughts | 17:56 |
rcarrillocruz | ? | 17:56 |
rcarrillocruz | my idea is that we would have a big yaml | 17:56 |
rcarrillocruz | with all our clouds resources inventory (servers, networks, flavors, etc) | 17:57 |
rcarrillocruz | and the role would just create all them | 17:57 |
rcarrillocruz | right now it assumes it's just a run against a single cloud, but could do it multi-cloud | 17:57 |
rcarrillocruz | clarkb: https://review.openstack.org/#/c/284246/ | 17:57 |
clarkb | rcarrillocruz: that doesn't appear to delete the group rules | 18:00 |
rcarrillocruz | right, because it's idempotent... | 18:00 |
rcarrillocruz | if there are, it will just change them | 18:00 |
rcarrillocruz | if they are not there, it will create them | 18:01 |
clarkb | rcarrillocruz: where are we deleting them then? | 18:01 |
clarkb | we must delete the old inter group rules | 18:01 |
clarkb | rcarrillocruz: see http://paste.openstack.org/show/488043/ | 18:01 |
mordred | nibalizer: I just made three patches in ansible-puppet, system-config and ansible-puppet that you might want to look at | 18:01 |
mordred | nibalizer: they're not urgent | 18:02 |
nibalizer | mordred: fure | 18:02 |
nibalizer | sure | 18:02 |
mordred | rcarrillocruz: yes, I agree with big yaml that the role would just create them | 18:02 |
nibalizer | you might +2a this https://review.openstack.org/#/c/278850/ | 18:02 |
mordred | rcarrillocruz: I mean, I think it's actually just entry in group_vars | 18:02 |
mordred | rcarrillocruz: but yeah | 18:02 |
mordred | rcarrillocruz: and also, the entry for each server could totally have cloud: rax or whatever in its entry | 18:03 |
rcarrillocruz | so | 18:03 |
rcarrillocruz | how about | 18:03 |
rcarrillocruz | i create a github repo | 18:03 |
rcarrillocruz | with the split role | 18:03 |
rcarrillocruz | then i show folks | 18:03 |
rcarrillocruz | then we talk about bringing it into openstack-infra | 18:03 |
rcarrillocruz | ? | 18:03 |
clarkb | we don't need github | 18:03 |
clarkb | maybe I missed something | 18:03 |
rcarrillocruz | i can go ahead and create a project in that namespace | 18:03 |
mordred | clarkb: he's saying "why don't I make the split and put it somewhere" | 18:03 |
rcarrillocruz | right | 18:04 |
mordred | yah. then we can import it | 18:04 |
mordred | yes. we need a launch-node role | 18:05 |
mordred | essentially | 18:05 |
mordred | so we need a repo called 'ansible-role-launch-node' | 18:05 |
jeblair | mordred, clarkb, crinkle: remote: https://review.openstack.org/284325 Set hostname in launch node | 18:07 |
jeblair | 15.184.55.9 | 18:13 |
*** degorenko is now known as _degorenko|afk | 18:14 | |
mordred | rcarrillocruz: this is the role you're talking about splitting right: ubygems/rubygems-mirror ? | 18:15 |
mordred | gah | 18:15 |
rcarrillocruz | no | 18:16 |
rcarrillocruz | def i'm not a ruby guy | 18:16 |
rcarrillocruz | :D | 18:16 |
rcarrillocruz | let me link you | 18:16 |
mordred | rcarrillocruz: this is the role you're talking about splitting right: https://git.openstack.org/cgit/openstack-infra/infra-ansible/tree/roles/setup_openstack_resources | 18:16 |
rcarrillocruz | http://git.openstack.org/cgit/openstack-infra/infra-ansible/tree/roles/setup_openstack_resources | 18:16 |
rcarrillocruz | that one | 18:16 |
mordred | yah | 18:16 |
rcarrillocruz | ++ | 18:16 |
rcarrillocruz | so idea would be | 18:16 |
rcarrillocruz | you feed in a yaml | 18:16 |
rcarrillocruz | that would contain: | 18:16 |
rcarrillocruz | clouds: | 18:16 |
rcarrillocruz | bluebox: | 18:16 |
rcarrillocruz | servers: | 18:17 |
rcarrillocruz | .... | 18:17 |
rcarrillocruz | flavors: | 18:17 |
rcarrillocruz | ..... | 18:17 |
rcarrillocruz | infracloud-west: | 18:17 |
rcarrillocruz | servers: | 18:17 |
rcarrillocruz | ... | 18:17 |
rcarrillocruz | so on and so forth | 18:17 |
crinkle | fungi: nibalizer https://review.openstack.org/#/c/284331 | 18:18 |
clarkb | rcarrillocruz: cloud then region then server is the hierarchy I think | 18:20 |
rcarrillocruz | good call | 18:21 |
rcarrillocruz | region below cloud | 18:21 |
rcarrillocruz | ++ | 18:21 |
jeblair | who wants to review the grafana change? https://review.openstack.org/283312 | 18:23 |
AJaeger | jeblair: I did... | 18:26 |
pabelanger | AJaeger: ^ | 18:27 |
AJaeger | pabelanger: What do you mean? | 18:28 |
rcarrillocruz | yolanda: https://review.openstack.org/#/c/284246/ | 18:29 |
jeblair | AJaeger: thanks! :) | 18:29 |
AJaeger | pabelanger: wrong channel? Did you want to point out the project-config change? | 18:29 |
pabelanger | AJaeger: sorry, lagged out. Was referring to the grafana change | 18:31 |
pabelanger | having some bad wifi atm | 18:32 |
AJaeger | no worries | 18:33 |
mordred | fungi: https://git.openstack.org/cgit/openstack-infra/infra-ansible/tree/roles/setup_openstack_resources | 18:39 |
mordred | gah | 18:39 |
mordred | fungi: https://github.com/emonty/ansible/commit/c737bd48bc4ee246c378898abb80bacdd80c0e2f | 18:39 |
mordred | rcarrillocruz, clarkb: I disagree - I think it should be servers: and each server should have cloud and region as params | 18:40 |
clarkb | mordred: hrm | 18:40 |
mordred | this is because the loops in the yaml will be very weird otherwise | 18:40 |
mordred | region and cloud are parameters to os_server: | 18:40 |
clarkb | I guess either way works | 18:40 |
clarkb | ah | 18:40 |
mordred | so it's an easy loop the one way | 18:41 |
clarkb | yup | 18:41 |
fungi | exciting! so what conditions cause the list to be duplicated? or is this just belt/suspenders on theory that it might happen? | 18:41 |
mordred | fungi: no clue. just belt and suspenders :) | 18:41 |
fungi | okay | 18:41 |
fungi | curious to see whether this makes it go away | 18:41 |
mordred | me too | 18:42 |
mordred | rcarrillocruz, clarkb: also, the yaml can go into group_vars/localhost.yml | 18:43 |
mordred | rcarrillocruz: and can just be referenced that way | 18:43 |
jeblair | fatal: [cacti.openstack.org]: FAILED! => {"changed": false, "failed": true, "msg": "value of logdest must be one of: stdout,syslog, got: ['stdout']"} | 18:44 |
jeblair | hrm. | 18:44 |
jeblair | that kind of looks like an ansible puppet role problem... | 18:45 |
clarkb | jeblair: ya a type mismatch | 18:45 |
clarkb | between string and list of string | 18:46 |
jeblair | i don't know why it would only happen when running the playbook manually... | 18:46 |
jeblair | clarkb: yeah | 18:46 |
mordred | jeblair: how are you running the playbook manually? | 18:47 |
jeblair | ansible-playbook --limit='cacti.openstack.org:localhost' /tmp/jeblair.yaml | 18:47 |
jeblair | mordred: /tmp/jeblair.yaml is a copy of the 'else' playbook | 18:47 |
jeblair | mordred: with !disabled removed from hosts | 18:47 |
mordred | jeblair: /tmp/jeblair.yaml is going to be missing the group_vars | 18:47 |
jeblair | because cacti is in the disabled host list | 18:47 |
mordred | jeblair: you need to also copy the group_vars dir from the playbooks dir | 18:48 |
mordred | jeblair: that said - you have uncovered a bug in the role | 18:48 |
*** mrmartin has joined #openstack-sprint | 18:48 | |
mordred | logdest=dict( | 18:56 |
mordred | required=False, default=['stdout'], | 18:56 |
mordred | choices=['stdout', 'syslog']), | 18:56 |
jeblair | clarkb, pleia2, mordred, nibalizer: root@puppetmaster:~# ./kick.sh cacti.openstack.org | 18:57 |
nibalizer | jeblair: oo | 18:57 |
mordred | jeblair, nibalizer, clarkb: https://review.openstack.org/284348 | 19:00 |
mordred | jeblair, nibalizer, clarkb: https://review.openstack.org/284352 | 19:06 |
nibalizer | pabelanger: http://paste.openstack.org/show/488081/ should get you started | 19:09 |
*** mrmartin has quit IRC | 19:14 | |
rcarrillocruz | clarkb: https://review.openstack.org/#/c/284246/ is good now | 19:22 |
rcarrillocruz | we had to debug some random ansible issue, if you set environment at a task the with_lines command doesn't get it, just the task shell command :/ | 19:22 |
pabelanger | nibalizer: thanks | 19:27 |
mordred | rcarrillocruz: reviewed | 19:30 |
rcarrillocruz | mordred: lol | 19:35 |
rcarrillocruz | i didn't push | 19:36 |
rcarrillocruz | just committed locally | 19:36 |
rcarrillocruz | (facepalm) | 19:36 |
mordred | heh | 19:36 |
rcarrillocruz | check it out again pls | 19:36 |
mordred | rcarrillocruz: ossum | 19:37 |
pabelanger | http://grafana.openstack.org/dashboard/db/nodepool-infra-cloud | 19:54 |
pabelanger | exciting | 19:54 |
jeblair | mordred, fungi, yolanda: remote: https://review.openstack.org/284386 Fix omfra admin creds in all-clouds.yaml | 20:03 |
fungi | gah | 20:04 |
fungi | jeblair: i guess this will allow us to perform admin-level actions from the puppetmaster host rather than limiting it to the openstackci user. seems desirable, but i can see why it was previously openstackci for consistency with clouds where we don't have admin access | 20:07 |
jeblair | fungi: no, it's a new cloud definition: "admin-infracloud-west" | 20:08 |
jeblair | fungi: we have one for bluebox too | 20:08 |
jeblair | fungi: we have 2 cloud dfns for most clouds, and 3 for these | 20:08 |
fungi | oh, yep | 20:09 |
fungi | i missed the admin- prefix on the name there | 20:10 |
clarkb | jeblair: `OS_CLIENT_CONFIG_FILE=/etc/openstack/all-clouds.yaml openstack --os-cloud openstackjenkins-infracloud-west --os-region-name RegionOne server list` | 20:16 |
clarkb | note that image list does not work it gets a 404, nova image-list does work for whatever reason | 20:17 |
AJaeger | Is nibalizer change to run devstack-trusty in infra cloud fine to merge? https://review.openstack.org/#/c/284391 Then I'll +A... | 20:17 |
nibalizer | AJaeger: yes | 20:17 |
nibalizer | we're sprinting as you know | 20:18 |
nibalizer | we think it will work | 20:18 |
AJaeger | nibalizer: I know you're sprinting - just don't know timing ;) | 20:18 |
AJaeger | nibalizer: then let's try it ;) | 20:18 |
clarkb | mordred: `OS_CLIENT_CONFIG_FILE=/etc/openstack/all-clouds.yaml openstack --os-cloud openstackci-infracloud-west --os-region-name RegionOne server list` on the puppetmaster 404s | 20:20 |
mordred | clarkb: ossum | 20:20 |
clarkb | https://jenkins03.openstack.org/computer/ubuntu-trusty-infracloud-west-8281553/ | 20:25 |
clarkb | https://jenkins03.openstack.org/job/swift-coverage-bindep/2/console | 20:30 |
clarkb | the first job on infracloud | 20:30 |
AJaeger | 0.3 jobs ;) in use according to grafana | 20:31 |
AJaeger | wooot! | 20:31 |
nibalizer | https://review.openstack.org/#/c/284398/ to bump to 10 nodes | 20:32 |
jeblair | https://review.openstack.org/281310 | 20:34 |
pabelanger | https://jenkins03.openstack.org/job/swift-coverage-bindep/2/console infracloud-west | 20:36 |
pabelanger | doh | 20:36 |
pabelanger | should read before posting | 20:37 |
rcarrillocruz | \o/ | 20:42 |
pabelanger | should read before posting | 20:44 |
pabelanger | fail | 20:44 |
Clint | job's done | 20:51 |
clarkb | yolanda: http://paste.openstack.org/show/488102/ | 20:57 |
yolanda | thx | 20:57 |
yolanda | clarkb http://paste.openstack.org/show/488103/ | 20:59 |
jhesketh | Morning | 21:05 |
jhesketh | how's it going? | 21:07 |
clarkb | jhesketh: we ran our first job on infracloud via nodepool https://jenkins03.openstack.org/job/swift-coverage-bindep/2/console | 21:08 |
jhesketh | oh exciting :-) | 21:08 |
clarkb | it was slower than expected so now we are digging into why that may be | 21:08 |
jhesketh | okay | 21:09 |
jhesketh | still, that's pretty cool | 21:09 |
jhesketh | kudos to all! | 21:09 |
rcarrillocruz | https://pythonhosted.org/python-hpilo/health.html | 21:15 |
rcarrillocruz | search for 'raid' ^ | 21:16 |
rcarrillocruz | crinkle: ^ | 21:16 |
clarkb | compute035:/sys/devices/pci0000:00/0000:00:07.0/0000:06:00.0/host4/target4:0:0/4:0:0:0 | 21:19 |
clarkb | fungi: ^ | 21:19 |
fungi | thanks | 21:20 |
clarkb | jhesketh: currently looking into what the disk situation is as disk IO (installing packages) was really slow | 21:22 |
clarkb | we seem to have confirmed that / sits on a logical volume provided by an HP raid controller with disks in a raid 0 | 21:23 |
clarkb | RAID bus controller: Hewlett-Packard Company Smart Array G6 controllers (rev 01) | 21:24 |
Clint | cciss-vol-status | 21:24 |
clarkb | cat compute035:/sys/devices/pci0000:00/0000:00:07.0/0000:06:00.0/host4/target4:0:0/4:0:0:0/raid_level | 21:26 |
nibalizer | SpamapS: ohai | 21:26 |
clarkb | says RAID 0 | 21:26 |
SpamapS | o/ | 21:26 |
nibalizer | do you remember the disk situation of the computers in west? | 21:26 |
SpamapS | in the past we had some non-uniformity in the hp west rack w.r.t. disk controllers | 21:26 |
nibalizer | raid etc? | 21:26 |
SpamapS | MANY had no BBWC | 21:27 |
SpamapS | some did | 21:27 |
jeblair | SpamapS: do you want to voice with us? | 21:27 |
SpamapS | At some point we had some upgraded | 21:27 |
jeblair | let me rephrase... | 21:27 |
SpamapS | lol I get it | 21:27 |
jeblair | SpamapS: if you want to voice conference, we can arrange it; let me know. | 21:28 |
SpamapS | yeah I can jump in there | 21:28 |
jeblair | https://wiki.openstack.org/wiki/Infrastructure/Conferencing | 21:29 |
jeblair | room 6013 | 21:29 |
SpamapS | in 2 voice conversations already at the moment, standby | 21:29 |
SpamapS | down to one now | 21:30 |
SpamapS | if I could just get the voices in my head to shut up... ;) | 21:30 |
SpamapS | jeblair: ok, jitsi-ing into that conf room | 21:30 |
jeblair | SpamapS: cool; put the voices in your head on mute ;) | 21:31 |
Clint | P212 controller, single physical 2tb drive in "raid 0" | 21:36 |
crinkle | SpamapS: ubuntu@15.184.52.3 | 21:40 |
*** krtaylor has quit IRC | 21:41 | |
clarkb | CONF.libvirt.disk_cachemodes | 21:42 |
clarkb | is what nova reads to set the cache mode | 21:43 |
clarkb | http://docs.openstack.org/kilo/config-reference/content/list-of-compute-config-options.html | 21:44 |
clarkb | disk,if=none,id=drive-virtio-disk0,format=qcow2,cache=none is what ps reports we are doing | 21:45 |
nibalizer | [ 4.172895] sd 4:0:0:0: [sda] Write cache: disabled, read cache: disabled, doesn't support DPO or FUA | 21:45 |
nibalizer | from dmesg | 21:45 |
clarkb | https://www.suse.com/documentation/sles11/book_kvm/data/sect1_1_chapter_book_kvm.html explains our options | 21:47 |
Clint | and SpamapS recommended unsafe | 21:48 |
clarkb | yup | 21:48 |
Clint | puppet-nova: # [*libvirt_disk_cachemodes*] | 21:51 |
crinkle | ^ | 21:51 |
*** krtaylor has joined #openstack-sprint | 21:55 | |
rcarrillocruz | ctrl all show status | 21:58 |
crinkle | I think https://review.openstack.org/284435 is the setting we want? | 21:58 |
crinkle | should probably try running it on one of the computes before running it on all of them | 21:59 |
rcarrillocruz | http://www.lazysystemadmin.com/2012/01/hpacucli-check-raid-information-from.html | 22:00 |
rcarrillocruz | jeblair: | 22:00 |
clarkb | crinkle: file=/var/lib/nova/instances/758549a0-0dcc-4f53-bb98-647cb62b782b/disk,if=none,id=drive-virtio-disk0,format=qcow2,cache=none | 22:02 |
clarkb | file=unsafe | 22:03 |
jeblair | rcarrillocruz: https://etherpad.openstack.org/p/zmeWythuDL | 22:12 |
jeblair | mordred: how do i get output from an ansible shell command from a playbook? | 22:18 |
jeblair | mordred: see the second part of https://etherpad.openstack.org/p/zmeWythuDL | 22:18 |
mordred | jeblair: looking | 22:18 |
mordred | jeblair: that's untested - gimme a sec and I'll tell you for real | 22:19 |
*** delattec has joined #openstack-sprint | 22:34 | |
*** cdelatte has quit IRC | 22:37 | |
clarkb | 85b1dc74-266c-41c8-aa26-1cee94ea7048 is new instance | 22:41 |
clarkb | https://jenkins01.openstack.org/job/gate-tempest-dsvm-full-ceph/6837/console | 22:41 |
*** rfolco has quit IRC | 22:46 | |
nibalizer | 2016-02-24 23:09:01.863 5846 WARNING nova.compute.api [req-a825bb40-3dc4-470b-a992-ee6cc1c4d0f8 7dbe0f121e424a74be2eed25399e2c75 894a11e0a16a4c29bb8b884c1c70bf2c - - -] [instance: 85b1dc74-266c-41c8-aa26-1cee94ea7048] instance's host compute035.hpuswest.ic.openstack.org is down, deleting from database | 23:09 |
jeblair | nibalizer: remote: https://review.openstack.org/284448 Cacti: add hosts to alternate graph trees | 23:11 |
jeblair | mordred: https://review.openstack.org/284325 | 23:17 |
crinkle | root@controller00:~# nova host-update | 23:18 |
crinkle | usage: nova host-update [--status <enable|disable>] | 23:18 |
crinkle | [--maintenance <enable|disable>] | 23:18 |
crinkle | <hostname> | 23:18 |
crinkle | ^ i think not fiery enough | 23:18 |
rcarrillocruz | are we good to land: | 23:23 |
rcarrillocruz | https://review.openstack.org/#/c/283737/ | 23:23 |
rcarrillocruz | https://review.openstack.org/#/c/283816/ | 23:24 |
rcarrillocruz | https://review.openstack.org/#/c/283870/ | 23:24 |
rcarrillocruz | https://review.openstack.org/#/c/284246/ | 23:24 |
rcarrillocruz | we already marked as completed on whiteboard, so we should merge those | 23:24 |
fungi | jeblair: mordred: clarkb: Clint: https://review.openstack.org/284463 addresses my comments on 284325 | 23:28 |
mordred | fungi: wow. /etc/mailname | 23:28 |
Clint | fungi: why do you want fqdn in /etc/hostname? | 23:29 |
*** mestery has quit IRC | 23:30 | |
*** mestery has joined #openstack-sprint | 23:35 | |
fungi | Clint: don't we? | 23:36 |
Clint | i doubt it | 23:36 |
fungi | oh, we were just making sure the fqdn was in /etc/hosts on the physical hosts | 23:38 |
clarkb | ya we just needed hostname -f to work | 23:38 |
fungi | hrm, but we _do_ want /etc/mailname to have the fqdn then | 23:38 |
Clint | yes | 23:38 |
fungi | grr... yep, you're right. this is what i get for confusing /etc/hostname with the bsds' /etc/myname | 23:40 |
nibalizer | https://review.openstack.org/#/c/284465/1 is pretty straightforward | 23:41 |
clarkb | mordred: 85b1dc74-266c-41c8-aa26-1cee94ea7048 | 23:42 |
clarkb | this instance was killed in the middle of its life for what we think is nova compute changing its name | 23:42 |
clarkb | but it wasn't removed from the db, libvirtd was asked to stop it though | 23:43 |
mordred | http://paste.openstack.org/show/488129/ | 23:44 |
mordred | http://paste.openstack.org/show/488130/ | 23:45 |
crinkle | mordred: okay the other thing is that `nova hypervisor-list \| grep compute035` has two things | 23:48 |
crinkle | so one of them needs to go away | 23:48 |
jeblair | glean: error: unrecognized arguments: --version | 23:50 |
clarkb | that instance is no longer in nodepool | 23:55 |
mordred | crinkle: http://paste.openstack.org/show/488131/ | 23:55 |
mordred | crinkle: which one is good? | 23:55 |
crinkle | mordred: this is the information nova has http://paste.openstack.org/show/488132/ | 23:56 |
crinkle | mordred: so the one that has state 'down' is bad | 23:56 |
mordred | crinkle: so 21 is the bad one | 23:56 |
mordred | crinkle: deleted | 23:57 |
crinkle | mordred: lgtm ty | 23:57 |
mordred | crinkle: should I take out compute004 as well? | 23:57 |
crinkle | mordred: yes please | 23:57 |
rcarrillocruz | - name: Disable glean | 23:59 |
rcarrillocruz | shell: echo 'manual' > /etc/init/glean.override creates=/etc/init/glean.override | 23:59 |
rcarrillocruz | jeblair: ^ | 23:59 |
mordred | crinkle: done | 23:59 |
crinkle | mordred: ty | 23:59 |
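
The `logdest` failure jeblair hit at 18:44 looks like what clarkb described: a list-valued default (`default=['stdout']`) checked against string `choices`. A minimal Python sketch of that mismatch follows. The `check_choice` helper is hypothetical and only mimics the shape of the error; it is an assumption for illustration, not Ansible's actual validation code.

```python
# Hypothetical, simplified stand-in for a "choices" check.
# This is NOT Ansible's real implementation, only an illustration.
CHOICES = ['stdout', 'syslog']


def check_choice(value, choices=CHOICES):
    """Return value if it is one of choices, otherwise raise ValueError."""
    if value not in choices:
        raise ValueError(
            "value of logdest must be one of: %s, got: %r"
            % (",".join(choices), value)
        )
    return value


# A plain string default is a member of the choices list.
ok = check_choice('stdout')

# A list default such as ['stdout'] is not itself a member of the
# choices list, so the check fails even though its one element is valid.
try:
    check_choice(['stdout'])
    list_default_failed = False
except ValueError as exc:
    list_default_failed = True
    error_message = str(exc)
```

Under that assumption, a string default such as `'stdout'` passes the same check, which may be why the problem only showed up once `group_vars` stopped supplying the value.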
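
For the `del` discussion at the top of the log (fungi at 00:10, Clint at 00:27), here is a small Python sketch of the behaviour described. The dictionary contents and variable names are made up for illustration.

```python
# Illustrative data only; the keys and values are assumptions.
settings = {'cloud': 'infracloud-west', 'region': 'RegionOne'}

# del removes a key from a dict (an associative array).
del settings['region']

# Deleting a key that is no longer present raises KeyError.
try:
    del settings['region']
    raised = None
except KeyError as exc:
    raised = exc

# del can also drop a name binding, "the inverse of variable assignment".
# Using the name afterwards raises NameError.
scratch = [1, 2, 3]
del scratch
try:
    scratch
    name_gone = False
except NameError:
    name_gone = True
```

When a missing key should be tolerated, `settings.pop('region', None)` removes it without raising.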