Wednesday, 2017-12-20

*** bfournie has quit IRC00:00
*** bfournie has joined #tripleo00:01
*** moshele has quit IRC00:02
openstackgerritRonelle Landy proposed openstack-infra/tripleo-ci master: ADD MTU settings and Neutron settings adjustment  https://review.openstack.org/52724900:12
*** threestrands has joined #tripleo00:26
*** threestrands has quit IRC00:26
*** threestrands has joined #tripleo00:26
*** slacko_16322 has joined #tripleo00:58
*** itlinux has joined #tripleo00:59
openstackgerritzenghui.shi proposed openstack/tripleo-heat-templates master: Add PTP composable service  https://review.openstack.org/49131701:01
*** dhill_ has quit IRC01:11
mwhahahadmsimard: more dns problems, http://logs.openstack.org/39/526439/5/gate/tripleo-ci-centos-7-containers-multinode/f55c04f/logs/undercloud/home/zuul/vxlan_networking.sh.log.txt.gz#_2017-12-19_23_57_3701:11
dmsimardmwhahaha: is that occurring inside a container ?01:12
mwhahahadmsimard: no01:13
mwhahahaIs undercloud setup bits01:13
mwhahahaOur vxlan config or whatever01:13
dmsimardOk, keep sending those, I have a list01:14
*** chem has quit IRC01:15
*** psachin has joined #tripleo01:24
*** cshastri has joined #tripleo01:26
*** psachin has quit IRC01:28
*** psachin has joined #tripleo01:33
*** slacko_16322 has quit IRC01:37
*** yamahata has quit IRC01:38
*** yamahata has joined #tripleo01:39
*** gfidente|afk has quit IRC01:39
*** jd_ has quit IRC01:45
*** jd_ has joined #tripleo01:47
*** dmacpher has joined #tripleo01:47
*** agopi has joined #tripleo01:52
*** jongwooh has joined #tripleo02:01
*** dprince has quit IRC02:06
*** karthiks has joined #tripleo02:16
itlinuxhello all.. I wonder about this issues.. Introspection of node 13a27ec6-5167-456e-a081-dfd076f48639 timed out.02:23
itlinuxI had ocata running and now trying to run pike I upgraded the UC fine..02:23
itlinuxany tips on this.. since the other look c04c86f6-3024-4590-b294-512cecfcf53d | None | None          | power off   | available          | Tru02:24
itlinuxthanks02:24
*** jlabarre has quit IRC02:26
*** fzdarsky_ has joined #tripleo02:29
jongwoohhow is the result of "openstack baremetal instrospection status <uuid>"?02:29
*** fzdarsky has quit IRC02:30
*** Goneri has quit IRC02:30
*** catintheroof has joined #tripleo02:38
*** atarlov has joined #tripleo02:48
itlinuxlet me check02:49
itlinuxhttp://paste.openstack.org/show/629417/02:50
itlinuxrunning the retry now..02:51
itlinuxIntrospection of node 13a27ec6-5167-456e-a081-dfd076f48639 timed out.02:51
*** catintheroof has quit IRC02:53
*** rlandy|rover has quit IRC03:01
*** threestrands has quit IRC03:03
*** threestrands has joined #tripleo03:04
itlinuxit comes back like this 13a27ec6-5167-456e-a081-dfd076f48639 | None | None          | None        | enroll             | False03:04
*** threestrands has quit IRC03:05
*** threestrands has joined #tripleo03:06
*** threestrands has joined #tripleo03:06
*** threestrands has quit IRC03:07
*** threestrands has joined #tripleo03:07
itlinuxlooks like | last_error             | Failed to change power state to 'power on' by 'rebooting'. Error: IPMI   |03:08
itlinux|                        | call failed: power status.03:08
itlinuxjongwooh: any tips on that..03:09
*** yamahata has quit IRC03:11
*** karthiks has quit IRC03:23
*** psahoo has joined #tripleo03:24
itlinuxI think I found the issue I cannot ipmi to the box!03:25
*** artom has quit IRC03:31
*** artom has joined #tripleo03:31
jongwoohok you found it03:40
*** ramishra has joined #tripleo03:50
*** owalsh_ has joined #tripleo03:55
*** udesale has joined #tripleo03:57
*** threestrands_ has joined #tripleo03:57
*** threestrands has quit IRC03:57
*** threestrands_ has quit IRC03:58
*** threestrands_ has joined #tripleo03:59
*** owalsh has quit IRC03:59
openstackgerritSteve Baker proposed openstack/tripleo-common master: Use skopeo for tag discover  https://review.openstack.org/52894504:06
openstackgerritSteve Baker proposed openstack/tripleo-common master: Move more prepare logic into kolla_builder  https://review.openstack.org/52657904:06
openstackgerritSteve Baker proposed openstack/tripleo-common master: Use push_destination as the registry host in env file  https://review.openstack.org/52861604:06
openstackgerritSteve Baker proposed openstack/tripleo-common master: Prepare action: extra arguments  https://review.openstack.org/52658004:06
openstackgerritSteve Baker proposed openstack/tripleo-common master: WIP Discover every tag on prepare  https://review.openstack.org/52921504:06
*** liverpooler has quit IRC04:14
*** shreshtha has joined #tripleo04:15
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: Add the option to run the container-check script  https://review.openstack.org/50102804:21
*** ykarel has joined #tripleo04:27
*** pgadiya has joined #tripleo04:29
*** pgadiya has quit IRC04:39
*** pdeore has joined #tripleo04:42
*** gvrangan_ has joined #tripleo04:46
*** dpawar has joined #tripleo04:47
*** gvrangan_ has quit IRC04:55
*** pgadiya has joined #tripleo04:56
*** ianw is now known as ianw_pto04:57
*** links has joined #tripleo04:57
openstackgerritMerged openstack/tripleo-common master: Inital import of tripleo ansible inventory code  https://review.openstack.org/52834205:03
*** moshele has joined #tripleo05:15
*** skramaja has joined #tripleo05:27
*** pgadiya has quit IRC05:30
*** pgadiya has joined #tripleo05:31
*** gkadam has joined #tripleo05:43
openstackgerritMerged openstack/instack-undercloud master: Add missing include of ironic::drivers::ansible  https://review.openstack.org/52643905:51
Tenguhello there :)06:05
TenguEmilienM: if you're still up: time to sleep ;)06:05
*** rbrady has quit IRC06:05
Tengumwhahaha: if you're still here, can you point me some location for documenting the new basic auth feature in haproxy?06:05
*** rbrady has joined #tripleo06:06
*** rbrady has joined #tripleo06:06
*** psahoo has quit IRC06:11
*** marios has joined #tripleo06:16
*** psahoo has joined #tripleo06:16
*** d0ugal has quit IRC06:17
*** jaganathan has joined #tripleo06:19
*** d0ugal has joined #tripleo06:22
openstackgerritMichele Baldessari proposed openstack/puppet-tripleo master: Fix up the rabbitmq-ready check  https://review.openstack.org/52740306:24
*** jfrancoa has joined #tripleo06:38
openstackgerritMichele Baldessari proposed openstack/tripleo-quickstart master: Do not use puppet-ceph on newton  https://review.openstack.org/52923406:41
*** agopi has quit IRC06:42
*** agopi has joined #tripleo06:42
*** masco has joined #tripleo06:43
*** karthiks has joined #tripleo06:49
*** janki has joined #tripleo06:49
*** karthiks has quit IRC06:54
*** threestrands_ has quit IRC06:57
*** pdeore has quit IRC06:58
*** agurenko has joined #tripleo06:59
*** pdeore has joined #tripleo07:05
*** karthiks has joined #tripleo07:06
*** rcernin has quit IRC07:08
*** gkadam has quit IRC07:09
*** dsneddon has quit IRC07:12
*** yprokule has joined #tripleo07:12
*** cylopez has joined #tripleo07:22
*** holser__ has joined #tripleo07:28
*** shardy has joined #tripleo07:29
*** dmacpher has quit IRC07:34
*** agopi has quit IRC07:38
*** agopi has joined #tripleo07:38
*** abregman has joined #tripleo07:42
*** ebarrera has joined #tripleo08:00
*** stendulker has joined #tripleo08:06
*** nyechiel has joined #tripleo08:09
*** psahoo has quit IRC08:14
moshelejanki: hi08:23
jankimoshele, hey08:24
moshelejanki: can we talk in bluejeans I have some questions issues with opendaylight deployment08:25
jankimoshele, I have few things lined up. HOw about in an hour?08:26
*** cshastri has quit IRC08:26
moshelejanki: sure ping me when you can08:27
*** jtomasek has joined #tripleo08:27
*** ccamacho has joined #tripleo08:27
jankimoshele, sure and about yesterday's query, there is a dependent ODL patch that is needed - https://git.opendaylight.org/gerrit/#/c/64602/08:29
*** psahoo has joined #tripleo08:30
*** jtomasek has quit IRC08:31
*** jtomasek has joined #tripleo08:32
*** gkadam has joined #tripleo08:32
*** amoralej|off is now known as amoralej08:35
sri_mwhahaha, got it thanks08:37
Tenguhello!08:38
Tengusmall question: is this doc still up-to-date? https://docs.openstack.org/tripleo-docs/latest/install/post_deployment/quiesce_compute.html#quiesce-compute08:38
*** cshastri has joined #tripleo08:39
Tenguapparently, nova account has a ~/.ssh/config that points to a command wrapper and a distinct SSH port, enforcing "nova_migration" user.08:39
Tengumeaning: the ssh key won't be used.08:39
*** agurenko has quit IRC08:40
*** jpena|off is now known as jpena08:44
*** paramite has joined #tripleo08:47
*** pgadiya has quit IRC08:50
*** mdnadeem has joined #tripleo08:51
*** ukalifon has joined #tripleo08:52
*** psahoo has quit IRC08:56
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: DNM: test container updates  https://review.openstack.org/51537208:57
*** anilvenkata has joined #tripleo08:58
*** hjensas has quit IRC09:01
*** chem has joined #tripleo09:03
*** pgadiya has joined #tripleo09:04
skramajashardy: hi, are you working on modifying the deprecated params workflow according the heat based env merging?09:04
skramajashardy: i am planning to add a validation for role-specific parameters in the same workflow, if you in progress, i will wait for it to complete.09:04
shardyskramaja: Hi, no I haven't got to that yet - to be honest the work to move environment merging to heat stalled when I started modifying tripleo-common, because I ran into some request limit problems with heat09:05
shardyskramaja: I'd like to get back to it, but it probably requires changes to heat to load files directly from swift09:05
shardyskramaja: so please feel free to go ahead and make your workflow changes :)09:06
skramajasure shardy09:07
shardyTengu: probably owalsh_ is your best contact for the nova migration questions09:08
*** psahoo has joined #tripleo09:09
Tengushardy: hmm ok. well, I could "fake" it using openstack server migrate --wait --block-migration --live <dest> <id>. As we "only" have 2 computes, evacuating one isn't hard.09:10
owalsh_Tengu: nope, docs are not up to date - https://review.openstack.org/49954309:13
Tenguowalsh_: ah, thanks :)09:14
shardyThanks owalsh_:)09:14
Tenguowalsh_: would be great to release that change :)09:15
openstackgerritSaravanan KR proposed openstack/tripleo-heat-templates master: Configure qemu group setting as hugetlbfs for ovs-dpdk  https://review.openstack.org/52927209:15
openstackgerritSaravanan KR proposed openstack/tripleo-heat-templates master: Removed ovs-dpdk workaround to fix the vhost socket permission  https://review.openstack.org/52927309:15
owalsh_Tengu: indeed :-) I'll take a look at it today09:17
*** owalsh_ is now known as owalsh09:17
Tenguowalsh: thank you :). In the meantime, I'm migrating nodes one by one with the `server migrate' command.09:18
Tenguwokring well so far.09:18
openstackgerritJuan Badia Payno proposed openstack/tripleo-heat-templates master: logging: use service_config_settings for fluentd  https://review.openstack.org/50145809:23
openstackgerritCarlos Camacho proposed openstack/tripleo-heat-templates master: Update templates alias to queens  https://review.openstack.org/52927609:27
*** agopi has quit IRC09:27
*** oidgar has joined #tripleo09:28
*** anilvenkata has quit IRC09:28
*** agurenko has joined #tripleo09:28
oidgarhi everybody, gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master and gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset035-master fails for me for a long time, but I'm not sure if this related to RDO cloud issues from the last days09:29
*** anilvenkata has joined #tripleo09:29
oidgarIt fails because it cannot find brctl command09:29
*** dsariel has joined #tripleo09:29
oidgaranyone here has experience with those gates?09:29
*** fragatina has joined #tripleo09:30
*** fragatina has quit IRC09:31
*** fragatina has joined #tripleo09:31
*** lucas-afk is now known as lucasagomes09:34
*** aditya_r has joined #tripleo09:34
honzamandre: i tried to install an undercloud the old way in hopes of getting a proper hiera instance, but then i run into IP address change issues when using run.sh09:44
*** derekh has joined #tripleo09:44
mandrehonza: you need to tweak your undercloud.conf i believe09:45
*** aditya_ra has joined #tripleo09:45
honzamandre: did you see my messages from last night about the controller_admin_host/hiera issues?09:45
mandrehonza: I didn't09:46
honzamandre: I'm getting a bunch of hiera-related issues, and I remember you telling me about a hack I needed.  I searched my irc logs and emails but couldn't find it.09:46
honzamandre: http://paste.openstack.org/show/629388/09:47
mandrehonza: oh... that! you need a newer puppet-tripleo module09:48
honza!!!09:48
openstackhonza: Error: "!!" is not a valid command.09:48
honzaopenstack: lol09:48
*** aditya_r has quit IRC09:48
honzamandre: how can i get a newer one?09:49
honzait's commented out!09:49
honzaoh my09:49
mandrehonza: https://review.openstack.org/#/c/525761/09:49
honzamandre: thank you so much09:50
openstackgerritOliver Walsh proposed openstack/tripleo-docs master: Remove obsolete section on compute ssh-key setup  https://review.openstack.org/49954309:51
mandrehonza: just uncomment https://github.com/dprince/undercloud_containers/blob/master/doit.sh#L145-L153 and that should get you puppet-tripleo from a checkout09:52
moshelejanki: I hope you haven't forgot me ; )09:57
*** cylopez has left #tripleo09:58
*** florianf has joined #tripleo09:58
jankimoshele, ofcourse not. give me 10 more minutes plz10:00
moshelejanki: sure10:00
lyarwoodalee: pingo, re https://review.openstack.org/#/c/526514/ did you also have a THT change enabling this?10:00
*** psachin has quit IRC10:01
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Add docker-registry service  https://review.openstack.org/52613210:03
*** tosky has joined #tripleo10:05
*** nyechiel has quit IRC10:08
openstackgerritLee Yarwood proposed openstack/tripleo-heat-templates master: nova: Add VerifyGlanceSignatures compute param  https://review.openstack.org/52928610:11
*** psachin has joined #tripleo10:11
lyarwoodalee: ^ https://review.openstack.org/529286 - nova-compute THT change to introduce a param for this, let me know if you already have something and I'll kill this change.10:11
openstackgerritOliver Walsh proposed openstack/tripleo-docs master: Remove obsolete section on compute ssh-key setup  https://review.openstack.org/49954310:11
owalshlyarwood: looks like alee has a related review https://review.openstack.org/52713610:14
*** jeblair has quit IRC10:15
*** bcafarel has quit IRC10:15
owalshlyarwood: NB the potential migration issues10:15
*** jeblair has joined #tripleo10:16
Tenguowalsh: small question: can we migrate a stopped instance?10:16
* lyarwood waits for gerrit's webui to load10:17
owalshTengu: cold migration should work AFAIK10:19
lyarwoodowalsh: ah, using ExtraConfig, would be nice to have a param in THT tbh10:19
Tenguowalsh: hmm ok. had some issues with that one, and apparently "live" is working fine. will stop services and launch the live migration.10:20
*** psachin has quit IRC10:20
owalshTengu: stop services? Need everything running for live migration to work I would think10:23
Tenguowalsh: ah, services in the instance, not node10:23
*** shardy has quit IRC10:24
owalshTengu: ah, ack. Probably will work without stopping services on the instances but certain services don't like live-migration (e.g rabbitmq)10:24
Tenguowalsh: or docker/rancher :)10:24
Tengujust stopped all docker containers, migration's running, and that's it. we don't have prod per se on the openstack, I can do some small downtime :).10:25
*** salmankhan has joined #tripleo10:26
owalshTengu: yea, cold migration might be more appropriate if there is something like rancher running on top. Do you recall what the issue was with cold migration? I landed a few fixes a while back10:27
Tenguowalsh: an issue with the root-wrap thingy, unfortunately I can't get the command output because its log is squashed - that lead me to create this review: https://review.openstack.org/#/c/518695/  but apparently, it doesn't suit people from Oslo :(10:28
TenguI think I also opened an issue one LP, wait.10:29
*** dciabrin__ has joined #tripleo10:30
*** dciabrin_ has quit IRC10:30
Tenguah, related issue, owalsh : https://bugs.launchpad.net/oslo.concurrency/+bug/173118510:31
openstackLaunchpad bug 1731185 in oslo.concurrency "Not enough debug info for "execute"" [Undecided,In progress] - Assigned to Cédric Jeanneret (cjeanneret-c2c)10:31
Tengubut I was more searching for debug logs.10:31
owalshTengu: can't seem to login to rdo gerrit but the patches were https://github.com/rdo-packages/nova-distgit/commit/c34374a867cf022e8c8773ab46ac1a032aa9d29e & https://github.com/rdo-packages/nova-distgit/commit/2955f70ce42e4f62ed7661817ed0c2a1dce600e810:34
*** hewbrocca_afk is now known as hewbrocca10:34
Tenguowalsh: I'll check that - I can't update our current openstack deploy due to the lack of proper lab for update testing, but that's planned.10:35
Tenguowalsh: but your patches seem to meet the thing I stumbled upon, nice catch!10:35
Tenguowalsh: was it backported in Pike?10:36
owalshTengu: yes, think it was backported to Pike and Newton. Looking at the LP though I don't think that's the issue as it is failing in the touch command10:36
Tenguowalsh: hmm. you're right. it was "some time" ago, I don't recall all the details unfortunately.10:37
Tenguand as we're in the (urgent) need to move instance around, I can't afford to re-create this issue right now.10:37
TenguI have to free one of our two computes in order to reinstall it properly.10:37
owalshTengu: np, just interested in (or to blame for) any issues with this10:38
Tenguowalsh: *taking notes* :)10:38
Tenguowalsh: next year I'll be able to test that in better conditions, as we'll integrate 2 new nodes (meaning 4 computes), hence more way to play with instances around.10:39
*** dciabrin__ has quit IRC10:39
Tenguowalsh: so I might ping you back then10:39
owalshTengu: sure10:39
owalshTengu: just FYI while it's fresh in my head... check /var/lib/nova/.ssh/config looks like https://github.com/rdo-packages/nova-distgit/blob/rpm-master/nova-ssh-config10:40
Tenguowalsh: 2s, I think it's the same content, just need to ensure that10:40
owalshTengu: and check the target IP address is allowed in the Match block in /etc/ssh/sshd_config10:41
*** dciabrin has joined #tripleo10:41
owalshTengu: I expect it's one of those files if the touch command is failing10:41
Tenguah, nope. a bit more lines in it. maybe I added them: http://paste.openstack.org/show/629445/  also, port… ?!10:41
Tenguowalsh: the match blocs: http://paste.openstack.org/show/629446/10:42
*** bcafarel has joined #tripleo10:42
owalshTengu: port 2022 is for containers only IIRC10:42
Tenguo_O errr… we didn't deploy with containers…10:43
Tenguah, but sshd is listening on 22 and 202210:43
Tenguso not a problem10:43
*** hjensas has joined #tripleo10:43
*** hjensas has quit IRC10:43
*** hjensas has joined #tripleo10:43
Tenguhmmm10:43
Tenguah, ok, so if a "nova_migration" user hits ssh, it checks if the request IP is 101.6, else drop. didn't understand it correctly. the blocks are OK I think.10:45
owalshTengu: yes10:45
owalshTengu: do the keys exist? Maybe ssh isn't setup at all but live migration isn't using it (i.e it's live_migration_uri isn't set to qemu+ssh in nova.conf)10:46
* owalsh really needs to write up all of the details somewhere10:50
Tenguowalsh: I've checked the key existence back then, and yep, they do exist, and are allowed as well10:50
Tengubut without a proper command log output, I just can't check anything.10:51
Tenguowalsh: I suspect the "/sbin/nologin" to be maybe an issue though10:51
owalshTengu: should be /bin/bash for the nova_migration user, /sbin/nologin for nova10:52
Tenguah, yes, true. different user.10:52
Tengu-.- tricky.10:52
owalshTengu: I think I'll flesh out https://review.openstack.org/499543 with more details on the setup and how to test it10:52
Tenguowalsh: good idea :).10:53
Tenguowalsh: I took the time to check how things were supposed to work for the migration, but I'm not sure I could get all the things.10:54
Tenguhave to go, end-of-year dinner with the colleagues. of course, as I'm in Switzerland, Fondue time :).10:54
owalshTengu; enjoy10:57
*** cshastri has quit IRC10:57
*** agurenko has quit IRC10:58
*** dtantsur|afk is now known as dtantsur11:02
*** yolanda__ has joined #tripleo11:05
*** hewbrocca is now known as hewbrocca_afk11:07
*** yolanda has quit IRC11:08
*** cshastri has joined #tripleo11:10
openstackgerritDougal Matthews proposed openstack/python-tripleoclient master: [WIP] Add a report of the Workflow execution on failure  https://review.openstack.org/52665311:15
openstackgerritMerged openstack/tripleo-heat-templates master: Create flavors for undercloud  https://review.openstack.org/52681011:21
*** moshele has quit IRC11:22
*** stendulker has quit IRC11:25
*** pdeore has quit IRC11:26
*** cshastri has quit IRC11:29
*** nyechiel has joined #tripleo11:30
*** caboucha has joined #tripleo11:31
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: Ignore errors in graphite task  https://review.openstack.org/52930211:31
sshnaidmcores, please some urgent fix ^^11:31
sshnaidmtrown|outtypewww, panda ^^11:31
*** pdeore has joined #tripleo11:36
*** oidgar has quit IRC11:40
*** psachin has joined #tripleo11:40
*** agurenko has joined #tripleo11:45
*** akrivoka has joined #tripleo11:54
*** aditya_ra has quit IRC11:54
*** aditya_ra has joined #tripleo11:54
*** oidgar has joined #tripleo11:56
oidgarhi, does anyone else here encounter non stop gate failures in tripleo-common patches?11:57
*** dciabrin has quit IRC12:00
*** shreshtha has quit IRC12:04
*** salmankhan has quit IRC12:04
*** salmankhan has joined #tripleo12:06
*** aditya_ra has quit IRC12:14
*** dciabrin has joined #tripleo12:18
d0ugaloidgar: do you have an example?12:23
*** bfournie has quit IRC12:23
*** bfournie has joined #tripleo12:23
*** moshele has joined #tripleo12:23
*** dpawar has quit IRC12:26
*** lucasagomes is now known as lucas-hungry12:26
*** bfournie has quit IRC12:28
oidgard0ugal: 1s, upstream gerrit returns "service unavailable..."12:28
*** jlabarre has joined #tripleo12:29
oidgard0ugal: https://review.rdoproject.org/jenkins/job/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset035-master/900/12:29
*** raildo has joined #tripleo12:29
*** pdeore has quit IRC12:29
oidgarhjensas: do we have a meeting now?12:32
d0ugaloidgar: so it is the 3rd party CI?12:32
oidgard0ugal: yes, it fails consistently in the last week12:33
d0ugaloidgar: I think you can ignore the third party CI. I believe it is unstable and hopefully still being worked on12:33
d0ugalI don't really think I have ever seen it pass reliably yet12:33
d0ugalbut I'm not sure who is responsible for it.12:33
oidgard0ugal: so we can merge patches which fails on those gates?12:34
d0ugaloidgar: I have been :)12:34
oidgard0ugal: great, thanks!12:34
hjensasoidgar: we do, but I am on a train today. Network/cell coverage is not sufficent for me to join.12:35
oidgarhjensas: I don't think anyone else is joining so probably it is canceled. thanks12:35
d0ugalNo meetings should be allowed this week :)12:37
cabouchaI too have been noticing failures in gate12:41
cabouchafirst time committing to tripleo12:41
cabouchahttps://review.openstack.org/52750812:42
*** psahoo has quit IRC12:42
cabouchaI'll keep looking to see if it's me12:42
*** dpawar has joined #tripleo12:51
Tenguowalsh: I'm back, just saw your review request. Will check that shortly :).12:52
Tenguhmmm.... gerrit is slow as hell.12:52
*** pgadiya has quit IRC12:53
*** yolanda__ is now known as yolanda12:53
*** jpena is now known as jpena|lunch12:58
*** dmellado has quit IRC13:05
*** dprince has joined #tripleo13:08
Tengugerrit is dead, apparently.13:08
Tengugetting 502 errors.13:08
Tenguoidgar: d0ugal can confirm: 3rd party isn't stable nor reliable for CI - already seen constent failure for a working code (due to timeouts or such non-code related)13:10
oidgarTengu: thanks13:10
d0ugalThanks Tengu13:10
Tenguand I was told by others "nah, don't care". or things like that ;)13:11
Tengudigging in the CI logs is a painful exercise.13:11
*** abregman has quit IRC13:11
*** openstackgerrit has quit IRC13:13
*** fpan has joined #tripleo13:14
Tenguah. according to the ML, gerrit is under maintenance due to some issue.13:15
*** dmellado has joined #tripleo13:15
-openstackstatus- NOTICE: gerrit is being restarted due to extreme slowness13:15
Tenguvoilà :)13:15
*** amoralej is now known as amoralej|lunch13:16
*** openstackgerrit has joined #tripleo13:17
openstackgerritYurii Prokulevych proposed openstack/tripleo-heat-templates stable/pike: Check for yum lock befor all yum* operations.  https://review.openstack.org/52930913:17
*** stevebaker has quit IRC13:18
jaganathand0ugal, please look into https://review.openstack.org/#/c/522265/13:19
*** dmellado has quit IRC13:19
*** hewbrocca_afk is now known as hewbrocca13:20
*** dmellado has joined #tripleo13:21
*** pdeore has joined #tripleo13:22
*** jaganathan has quit IRC13:23
*** rlandy has joined #tripleo13:23
*** rlandy is now known as rlandy|ruck13:24
openstackgerritMartin André proposed openstack/tripleo-quickstart-extras master: DNM: Update quickstart extras with undercloud install for containers  https://review.openstack.org/51744413:26
*** abregman has joined #tripleo13:27
*** dmacpher has joined #tripleo13:27
*** stevebaker has joined #tripleo13:28
*** lucas-hungry is now known as lucasagomes13:28
*** BryanS68 has joined #tripleo13:32
*** rmascena has joined #tripleo13:35
*** raildo has quit IRC13:37
*** pchavva has joined #tripleo13:39
*** rhallisey has quit IRC13:40
*** skramaja has quit IRC13:40
*** jcoufal has joined #tripleo13:43
*** rhallisey has joined #tripleo13:43
openstackgerritMartin André proposed openstack/tripleo-heat-templates master: Add docker-registry service  https://review.openstack.org/52613213:45
weshaymwhahaha, morning.. ping me when you have a sec re: container updates13:46
weshayI gave it a go w/ puppet-nova but I need to verify the results13:47
*** dpawar has quit IRC13:48
*** trown|outtypewww is now known as trown13:48
openstackgerritSven Anderson proposed openstack/tripleo-quickstart master: TEST DON'T MERGE - Enabling EC2-API Tempest tests.  https://review.openstack.org/51513913:49
*** psachin has quit IRC13:49
*** amoralej|lunch is now known as amoralej13:50
openstackgerritMartin André proposed openstack/tripleo-quickstart-extras master: DNM: Update quickstart extras with undercloud install for containers  https://review.openstack.org/51744413:51
*** rlandy|ruck is now known as rlandy|rover13:52
*** catintheroof has joined #tripleo13:52
*** trown is now known as trown|ruck13:52
openstackgerritDmitry Tantsur proposed openstack/tripleo-docs master: Document using Ironic Ansible deploy interface  https://review.openstack.org/52666313:53
*** jpena|lunch is now known as jpena13:54
openstackgerritMarios Andreou proposed openstack/tripleo-quickstart-extras master: Add ansible update into UpgradeInitCommand of repo template  https://review.openstack.org/52826113:58
*** dpawar has joined #tripleo13:58
*** bregman has joined #tripleo13:58
*** rbowen has joined #tripleo14:00
*** bfournie has joined #tripleo14:00
*** agopi has joined #tripleo14:01
*** jwb has joined #tripleo14:01
*** abregman has quit IRC14:02
*** catintheroof has quit IRC14:02
dtantsuransiwen: o/ up for questions re real time virt?14:03
*** jmelvin has joined #tripleo14:04
*** catintheroof has joined #tripleo14:04
*** yprokule has quit IRC14:04
*** yprokule has joined #tripleo14:05
*** liverpooler has joined #tripleo14:07
Tenguowalsh: are you still here? any knowledge about node deletion?14:08
*** pdeore has quit IRC14:10
*** agurenko has quit IRC14:11
Tenguerf… doc for node removal is also deprecated X(14:12
*** agopi has quit IRC14:14
aleemwhahaha, EmilienM , rlandy|rover  https://review.openstack.org/#/c/527136/  failed to get off the ground -- merge conflict somewhere14:18
owalshTengu: don't know much about it, I've run it once or twice maybe14:19
Tenguowalsh: ok. running one right now, got a timeout with some websocket, but apparently the removal is running14:20
Tengustack is updated.14:20
Tengubut the --help for `openstack overcloud node delete' was a bit confusing, when we read the doc in //14:21
Tengumakes me say: doc isn't up-to-date and might create some issues shortly.14:21
*** catinthe_ has joined #tripleo14:22
openstackgerritJohn Fulton proposed openstack/tripleo-common master: Parameterize ceph-ansible forks in Mistral Workflow  https://review.openstack.org/52812414:23
openstackgerritMerged openstack/instack-undercloud master: Load undercloud DB password to a mistral environment  https://review.openstack.org/51829214:24
openstackgerritMarios Andreou proposed openstack/tripleo-heat-templates master: Convert tags to when statements for Q major upgrade workflow  https://review.openstack.org/51090214:25
openstackgerritJohn Fulton proposed openstack/tripleo-common master: Parameterize ceph-ansible forks in Mistral Workflow  https://review.openstack.org/52812414:25
*** catintheroof has quit IRC14:26
rlandy|roveralee: lookinh14:27
rlandy|roverlooking14:27
openstackgerritDougal Matthews proposed openstack/tripleo-quickstart-extras master: Make suer we use quickstart extras from the $WORKSPACE  https://review.openstack.org/52933414:27
openstackgerritJohn Fulton proposed openstack/tripleo-common master: Parameterize ceph-ansible forks in Mistral Workflow  https://review.openstack.org/52812414:28
*** dciabrin has quit IRC14:29
rlandy|roveralee: where did you see a merge conflict?14:30
*** dciabrin has joined #tripleo14:30
aleerlandy|rover, last comment for zuul in https://review.openstack.org/#/c/527136/14:30
aleerlandy|rover, I'm not sure where it happens ..14:31
EmilienMalee: because your patches in Depends-On were updated14:31
EmilienMso Zuul asks you to recheck14:31
aleeEmilienM, ah ok - rechecking14:32
EmilienMTengu: I was sleeping :-)  - have you found on https://docs.openstack.org/tripleo-docs/latest/ ?14:33
*** openstackgerrit has quit IRC14:33
TenguEmilienM: wow, you had a long night then ;)14:33
TenguEmilienM: https://docs.openstack.org/tripleo-docs/latest/install/post_deployment/delete_nodes.html yup, and apparently, according the the --help, the "-e" isn't needed anymore and is deprecated.14:33
*** openstackgerrit has joined #tripleo14:35
openstackgerritDmitry Tantsur proposed openstack/instack-undercloud master: Generate a temporary URL key for Swift "service" project  https://review.openstack.org/52737614:35
*** oidgar has quit IRC14:35
*** lblanchard has joined #tripleo14:37
*** trown|ruck is now known as trown|brb14:39
*** shardy has joined #tripleo14:41
*** ykarel has quit IRC14:43
*** trown|brb is now known as trown14:47
*** oidgar has joined #tripleo14:48
Tenguhmmm. node deletion seems to be stuck in a sub-stack, named overcloud-ComputeSshKnownHostsDeployment-3ffgrpdxnrmw - its name seems to indicate it's just the know host in /etc/ssh and, probably, the /etc/hosts file edition…14:50
*** sshnaidm has quit IRC14:50
mwhahahaweshay: whats up14:50
*** sshnaidm has joined #tripleo14:50
mwhahahaalee: ah blame weshay for messing with the dependencies14:51
*** trown is now known as trown|ruck14:52
*** shreshtha has joined #tripleo14:53
openstackgerritDmitry Tantsur proposed openstack/tripleo-heat-templates master: Enable support for ironic "direct" deploy interface  https://review.openstack.org/52934214:53
* weshay looking at https://review.openstack.org/#/c/515372/ that has a dep on puppet-nova https://review.openstack.org/#/c/529183/ when I look at the rpms getting updated I don't see it. Wondering if I'm correct in understanding that I should see it on a container.. but maybe not14:53
weshayhttp://logs.openstack.org/72/515372/16/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/e0d7055/logs/undercloud/home/zuul/overcloud_prep_containers.log.txt.gz14:53
*** lblanchard has quit IRC14:54
weshayhttp://logs.openstack.org/72/515372/16/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/e0d7055/logs/subnode-2/var/log/yum.log.txt.gz14:54
mwhahahapuppet-nova-12.1.1-0.20171220092840.241e81c.el7.centos.noarch14:54
mwhahahathat's updated14:54
weshayya.. on the host, not the container14:56
*** bregman has quit IRC14:56
weshaydo you have a suggestion of a repo I can dep on?14:56
weshayto see the update in the containers14:56
mwhahahawell alee's patch would be that14:57
mwhahahasince he's deping on nova14:57
weshayhis patch has a long train of deps14:57
weshayso just openstack/nova14:57
mwhahahato check the containers you'd need a normal openstack project14:57
weshaymakes sense14:57
weshayya14:57
weshayk k14:57
weshayrlandy|rover, fyi ^14:57
mandreweshay: let me know when you have a minute to look at the containerized undercloud patch with me14:57
weshaysshnaidm, ^14:58
weshaymandre, ok.. give me 5min14:58
mandreweshay: cool14:58
mwhahahamandre: i heard rumblings that the containerized undercloud was going to be a requirement for FFU, do you know if that's the case and why?14:58
shardyAnyone know how to monitor the status of promotion for the https://trunk.rdoproject.org/centos7/current/ pin?  It seems that's no longer actually trunk, so there's a period after a patch lands where we're using old packages and CI jobs fail where a Depends-On exists14:59
rlandy|roverweshay: thanks for following  this up14:59
mandremwhahaha: from what I understood, it's not a strict requirement for FFU but it would make it more maintainable14:59
mwhahahamandre: ok. i'm not sure i get that maintainable claim15:00
*** catinthe_ has quit IRC15:02
*** ykarel has joined #tripleo15:02
dtantsurI've heard this too, and I'm not sure why baremetal->baremetal is harder than baremetal->containers either15:02
*** catintheroof has joined #tripleo15:03
openstackgerritThomas Herve proposed openstack/tripleo-quickstart-extras master: Use openstack commands in overcloud-deploy.sh  https://review.openstack.org/52934715:03
tbarronbfournie: do you have a view on https://review.openstack.org/#/c/523638/ ?  is it close to merger or not?15:03
tbarronbfournie: I'm lining up all the unresolved dependencies for our manila ceph-nfs work in light of potential Feature Freeze Exceptions, etc. and this is a big one.15:04
bfournietbarron: some of my comments from 4 still aren't addressed, I think Dan's still working on change for the management network, I think its really close but Dan will no better when he's online15:05
bfournies/no/know15:05
tbarronbfournie: k, I'll ask both of you again when the west coast wakes up.  Thanks.15:06
tbarronhjensas: ^^ I see you on that review too, and Welcome!15:06
*** catintheroof has quit IRC15:08
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: DNM: test container updates  https://review.openstack.org/51537215:08
*** nyechiel has quit IRC15:10
weshaymandre, hey15:11
*** shardy has quit IRC15:11
mandrehey weshay, so I was looking at https://review.openstack.org/#/c/517444/ and was wondering what was the reason for only keeping localhost in /etc/hosts15:12
weshaymandre, so initially dprince found duplicate entries in hosts and that was causing issues w/ rabbit according to eck`.  So I first removed the duplicates, and the undercloud was still failing to deploy.15:13
weshayI next checked the overcloud node hosts file and noticed it ONLY had localhost, so figuring the containerized undercloud is more like the previous overcloud deployment.  Also locally it was working for me and noticed the hosts file only had localhost15:14
weshayonce we purged the hosts file from what infra put in .. it started working and completing the undercloud deployment15:15
mandreok, do you mind me removing this? I'm thinking this is messing up with CI15:15
weshaymandre, give it a go, but I don't think it will work15:15
weshayit least it hasn't in the past, maybe something changed15:15
mandreweshay: or do you have an idea why almost all the jobs are red at https://review.openstack.org/#/c/517444/?15:16
*** sshnaidm is now known as sshnaidm|afk15:16
mandrehttp://logs.openstack.org/44/517444/40/check/tripleo-ci-centos-7-containers-multinode/56e2191/logs/undercloud/home/zuul/undercloud_install.log.txt.gz15:17
weshayhttp://logs.openstack.org/44/517444/40/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/baeef16/logs/undercloud/home/zuul/undercloud_install.log.txt.gz15:18
weshayheh15:18
weshayok.. let's take it out and see what happens15:18
*** oidgar has quit IRC15:20
openstackgerritMartin André proposed openstack/tripleo-quickstart-extras master: DNM: Update quickstart extras with undercloud install for containers  https://review.openstack.org/51744415:20
*** myoung is now known as myoung|bbl15:20
mandreweshay: we'll know in 30 min ^^15:20
*** shardy has joined #tripleo15:24
*** masco has quit IRC15:29
*** karthiks has quit IRC15:32
openstackgerritRonelle Landy proposed openstack/tripleo-quickstart-extras master: Fix reproducer script path references for all environments  https://review.openstack.org/52935615:32
*** dpawar has quit IRC15:36
*** fragatina has quit IRC15:37
*** fragatina has joined #tripleo15:37
*** moshele has quit IRC15:38
*** trozet has quit IRC15:40
*** trozet has joined #tripleo15:43
rookshardy: so, * 2 might not be a bad idea15:46
rookshardy had 12, and it was soo slow.15:47
*** hjensas has quit IRC15:47
*** jongwooh has quit IRC15:47
openstackgerritJohn Fulton proposed openstack/tripleo-common master: Parameterize Ansible environment vars in Mistral Workflow  https://review.openstack.org/52812415:51
*** pcaruana has joined #tripleo15:51
*** janki has quit IRC15:53
ansiwendtantsur: sorry, misse you. still around? now I'm up to it :-)15:53
shardyrook: Ack Ok, would you mind commenting on https://review.openstack.org/#/c/529066/ and/or the bug so we can track your results and agree a reasonable default?15:57
shardyrook: thanks for the update!15:57
dtantsuransiwen: hey! I got some good progress with the ansible deploy. I managed to set custom kernel params, see doc https://review.openstack.org/52666315:58
dtantsuransiwen: now I'm going through your google doc, trying to figure out how it all maps to this work15:58
openstackgerritMark Hamzy proposed openstack/tripleo-common master: [WIP] Support multiple architectures  https://review.openstack.org/52800015:58
dtantsuransiwen: so, question #1: why not pre-install packages on the overcloud-full image?15:58
*** moshele has joined #tripleo16:01
*** jongwooh has joined #tripleo16:02
*** jcoufal has quit IRC16:04
*** BryanS68 has quit IRC16:07
openstackgerritJohn Fulton proposed openstack/tripleo-heat-templates master: Parameterize ceph-ansible environment variables  https://review.openstack.org/52812516:07
*** ykarel has quit IRC16:09
*** ykarel has joined #tripleo16:09
*** ykarel has quit IRC16:11
openstackgerritDmitry Tantsur proposed openstack/instack-undercloud master: Generate a temporary URL key for Swift "service" project  https://review.openstack.org/52737616:15
*** jcoufal has joined #tripleo16:24
openstackgerritMerged openstack/tripleo-quickstart master: Correct links for images  https://review.openstack.org/51635316:25
openstackgerritMerged openstack/tripleo-heat-templates master: Check for yum lock befor all yum* operations.  https://review.openstack.org/52898416:25
*** moshele has quit IRC16:29
ansiwendtantsur: sorry, just got a phone call. so: the kernel you can't install on the overcloud image, because it replaces the standard kernel.16:30
dtantsuransiwen: ugh. so, setting up repositories and installing packages is not impossible, but assumes access to the internet during deployment16:31
*** marrusl has quit IRC16:33
openstackgerritMerged openstack/tripleo-common master: SRIOV derive parameters workflows  https://review.openstack.org/52226516:34
*** rbrady is now known as rbrady-afk16:36
*** Goneri has joined #tripleo16:37
owalshansiwen, dtantsur: can have both kernel installed IIRC16:39
dtantsurworth figuring out IMO16:40
EmilienMshardy, rook : any thoughts on https://review.openstack.org/#/c/529130/ ?16:40
ansiwendtantsur: but this requirement is also given for the cirrent script in the doc I sent you, right? so I think that wouldn't be a "regression"16:40
dtantsuransiwen: well, I'm trying to figure out what it takes to move your script to an ironic ansible playbook16:41
ansiwenowalsh: ok, interesting? how to install it without "enable" it? and how to enable it afterwards? over grub default?16:41
dtantsure.g. networking during deploy does not have to be set up the same way as on the final instance (incl. during first boot)16:41
dtantsuri.e. IPA does not have to be able to access internet16:41
dtantsurwhich will complicate fetching packages16:41
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: Fix nodes config path in reproducer script  https://review.openstack.org/52936716:43
sshnaidm|afktrown|ruck, rlandy|rover ^^16:43
trown|rucksshnaidm|afk: rlandy|rover has a similar review16:44
trown|ruckhttps://review.openstack.org/#/c/529356/116:44
*** links has quit IRC16:44
rlandy|roversshnaidm|afk: trown|ruck: already addressed with other changes in https://review.openstack.org/#/c/529356/16:44
rlandy|roversshnaidm|afk: adding you to that review16:44
rlandy|roverfeel free to modify but let;s keep one to avoid merge issues16:45
sshnaidm|afkoops16:45
rlandy|rovernp - my fault for not adding you16:46
*** lucasagomes is now known as lucas-afk16:48
*** moshele has joined #tripleo16:48
openstackgerritMarios Andreou proposed openstack/tripleo-common master: Remove step_tags_to_when function from config download  https://review.openstack.org/52936916:48
openstackgerritJohn Fulton proposed openstack/tripleo-heat-templates master: Parameterize ceph-ansible environment variables  https://review.openstack.org/52812516:50
openstackgerritSayali Lunkad proposed openstack/diskimage-builder master: Adding mapping for SUSE package  https://review.openstack.org/52937016:51
*** moshele has quit IRC16:51
*** cshastri has joined #tripleo16:53
*** jtomasek has quit IRC16:54
ansiwendtantsur: I see... the script is run in different context than the ansible playbook. so I will check all the packages. if we can all add them to the overcloud image in parallel without risks for the current roles, sure, let's do that. otherwise it will be hard to add a risk so late in the cycle.16:56
*** marios has quit IRC16:57
dtantsuransiwen: yep, let's check it first16:57
dtantsuransiwen: are you coming to the office tomorrow? I'll have to run soon today, but we can chat face-to-face16:57
ansiwendtantsur: you come to the brerakfast? ok, cool! let's do that, that'd be great!16:58
*** anilvenkata has quit IRC16:58
dtantsur:)16:58
dtantsurif I figure out how to enter the office this time...16:58
Tenguanyone can help me in order to clean a failed resource in tripleo "overcloud" heat stack? the failed resource is: overcloud-ComputeSshKnownHostsDeployment-3ffgrpdxnrmw - I don't know why this #@|@#¼ tasks is failing, it's supposed to clean removed compute nodes, but is always failing due to some timeout.16:59
Tenguit's starting to really annoy me, as it prevents any stack update, like adding new node -.-16:59
*** ykarel has joined #tripleo17:01
ansiwendtantsur: go to 3rd floor first and take the stairs to 2nd and knock loudly on the door. (it's probably on 2nd floor this time)17:02
dtantsuransiwen: what if I telegram you when I approach the building, so that you meet me on the 3rd floor?17:15
*** mdnadeem has quit IRC17:17
ansiwendtantsur:  you can try, but I'm often too late. :-) but the receptionist can escort you to the breakfast in any case. :-)17:17
dtantsurassuming they know where it is..17:18
ansiwendtantsur: they set it up, so they _must_ know it :-)17:18
*** udesale has quit IRC17:18
dtantsurcool :)17:19
* dtantsur gets late coffee now17:19
*** florianf has quit IRC17:19
*** cshastri has quit IRC17:20
*** ykarel has quit IRC17:21
*** dprince has quit IRC17:23
*** d0ugal has quit IRC17:23
*** trown|ruck is now known as trown|lunch17:25
*** gkadam has quit IRC17:27
*** salmankhan has quit IRC17:29
*** jfrancoa has quit IRC17:29
*** hewbrocca is now known as hewbrocca_afk17:30
*** dtantsur is now known as dtantsur|afk17:34
openstackgerritOliver Walsh proposed openstack/tripleo-puppet-elements master: WIP: add RT kernel to overcloud compute image  https://review.openstack.org/52938117:34
owalshdtantsur|afk, ansiwen: ^^^ that should work I think, installs the RT kernel but restores the default back to the non-RT kernel17:34
dtantsur|afkthanks17:35
* dtantsur|afk goes now17:35
*** d0ugal has joined #tripleo17:35
openstackgerritOliver Walsh proposed openstack/tripleo-puppet-elements master: WIP: add RT kernel to overcloud compute image  https://review.openstack.org/52938117:36
EmilienMshardy: would you mind to review https://review.openstack.org/#/c/526151/ please?17:37
*** salmankhan has joined #tripleo17:37
Tenguwhat would happen if I comment out this bloc and deploy/update the overcloud stack? https://github.com/openstack/tripleo-heat-templates/blob/stable/pike/overcloud.j2.yaml#L450-L45517:41
TenguI think it should drop 3 stacks - and when I uncomment it, it should create it back. Is that right and safe?17:42
*** marrusl has joined #tripleo17:42
Tenguor, shall I flag the stack as "failed" (openstack stack resource mark unhealthy <resource>) and re-deploy?17:43
*** etingof has quit IRC17:43
*** fultonj has quit IRC17:47
Tenguhmmm. apparently, marking the nested stack as unhealthy should do the trick.17:47
*** derekh has quit IRC17:52
rookEmilienM: I mentioned to Shardy that 12 might not be enough workers (very simplistic findings from trying 12 workers with a 90 node deployment)17:53
owalshTengu: ah, so that's why you're using StrictHostKeyChecking no17:53
owalshTengu: why do you need to disabled it?17:53
EmilienMrook: what's the right formula for you then?17:53
rookEmilienM: shardy mentioned the calculation you had *2.17:54
rookso, if you default to the max of 12, it would be 2417:54
EmilienMok17:54
rookEmilienM: which I have tested and that does work much better.17:54
rookHowever, we are around 1GB per worker :/17:55
rookso, back to the memory consumption issue17:55
ansiwenowalsh: oh, cool, thanks!17:55
*** pchavva has quit IRC17:55
EmilienMrook: is https://review.openstack.org/529130 better onw?17:56
EmilienMnow*17:56
rookhttps://snapshot.raintank.io/dashboard/snapshot/zQODzQetB56fGgDahLLAAu2zifpB11Bx?orgId=2 <-- showing the usage17:56
tdasilvamwhahaha, EmilienM looking for some help regarding swift+barbican integration. alee has made the change to install barbican in step3 but now I need to add a little script to create a secret and stick the key_id in the swift conf file. Where should that script be executed?17:58
*** yprokule has quit IRC18:00
Tenguowalsh: ah, well, no, nothing to do with that. unrelated :)18:00
Tenguowalsh: fact is, tripleo deploy process is failing on that precise task, probably due to some error I made earlier. And I can't manage to recover :(18:01
Tenguowalsh: so I'm reading and thinking of a way to sort that situation. I see two possibilities: either mark the specific resource as "unhealthy" in heat, or drop that particular thing from the overcloud.j2 and re-deploy so that it should drop it.18:02
Tenguowalsh: fact is: I think marking it as unhealthy should be the right thing.18:02
mwhahahatdasilva: so i assume it needs to go into the appropriate place in docker_config in step4+18:02
owalshTengu: any custom roles?18:03
*** dprince has joined #tripleo18:03
Tenguowalsh: nope18:03
EmilienMrook: please give feedback on https://review.openstack.org/#/c/529130/18:03
Tenguowalsh: basic ones, but there's a name mapping, and I failed it, and it kind of messed up the compute-related stacks.18:03
*** dsneddon has joined #tripleo18:04
*** fultonj has joined #tripleo18:05
owalshTengu: ok, I've not seen any issues with the ssh known hosts setup, but there's always a first time18:06
Tenguowalsh: :)18:07
Tenguowalsh: so, marking a task as unhealthy should replace it "in-place" right?18:07
owalshTengu: no idea, shardy?18:08
Tenguowalsh: in my case, I have the stack name, it's precisely overcloud-ComputeSshKnownHostsDeployment-3ffgrpdxnrmw, and it has only one resource. Reading the overcloud.j2 makes me think it's only managing ssh known host for computes, so it should NOT impact anything else.18:09
tdasilvamwhahaha: but won't that be executed in every controller node? my thought is that it would be executed once and then stick the result in here: https://review.openstack.org/#/c/525324/2/puppet/services/swift-proxy.yaml@15418:11
tdasilvausing get_param18:11
mwhahahatdasilva: the problem is that under containerization the puppet stuff may not be run at the same time18:11
owalshTengu: all hosts, it's in a {% for role in roles %} loop18:12
mwhahahatdasilva: I think there's a way to run the docker_tasks only on a bootstrap node18:12
mwhahahatdasilva: this is the problem with needing dynamic config items that are not generated prior to deployment18:12
Tenguowalsh: hmm... so it will deploy the compute nodes keys on the other roles? anyway, it's only ssh keys...18:12
rookEmilienM: ack18:12
tdasilvamwhahaha: i heard you :/18:12
TenguI guess I can mark it unhealty… :/18:12
tdasilvai hear you18:12
tdasilvamwhahaha: recently learned that cinder is creating these keys per tenant and wished we had done the same18:13
owalshTengu: yea, combines all of the ssh host keys and generates /etc/ssh/ssh_known_hosts on all hosts (so we don't need StrictHostKeyChecking no)18:13
* owalsh biab18:13
Tenguowalsh: ok. so I don't really have risk marking it. hopefully.18:14
rookfultonj: the last patch you applied set delegate_facts to false, right?18:14
rookSeb is asking me.18:14
Tengushardy: are you here? :)18:14
mwhahahatdasilva: so the setting of config item itself needs to on all nodes, it's jsut teh single generation of the key that you'd need to do once right?18:14
fultonjrook: yes18:14
fultonji harded coded that18:14
rookok18:14
rookI didn't see it hard coded.18:14
rooki looked int he ceph-ansible dir.18:14
*** etingof has joined #tripleo18:14
rookin the*18:14
tdasilvamwhahaha: correct18:15
mwhahahaowalsh: how do we prevent the bootstrap bits for nova from being run on multiple nodes18:15
mwhahahaowalsh: https://github.com/openstack/tripleo-heat-templates/blob/master/docker/services/nova-api.yaml#L193 what i was looking at18:15
fultonjrook: http://ix.io/Dfc18:15
rookI see it here fultonj  site-docker.yml.sample18:15
fultonj^ yep18:15
rookok18:15
rookcool, then we are on the same page.18:16
rookbecuase i see it set to true here : infrastructure-playbooks/rolling_update.yml18:16
fultonjthat playbook isn't used on deploy though18:16
rooknope, that shouldn't be in question18:16
rookhowever, that would override a configuration18:17
*** yamahata has joined #tripleo18:17
mwhahahatdasilva: so i'm not exactly sure the best place to drop the items as it relates to containers so might be a good idea to ask the containers folks. I think you'll probably need a docker_config item but not sure the best way to implement your script18:20
tdasilvamwhahaha: no worries, i'll ping the containers folks, thanks!18:22
*** etingof has quit IRC18:22
*** eck` is now known as eck`gone18:25
fultonjrook: i replied to seb18:25
rookok fultonj18:26
* rook closes window18:26
rookfultonj: where is the best place to chat with seb live?18:26
fultonjrook: i will pm you18:27
*** eck`gone is now known as eck`18:30
*** rhallisey has quit IRC18:31
*** etingof has joined #tripleo18:35
fultonjrook: did you get to run your change with https://gist.github.com/jtaleric/e8c3f6f6137751ab89e20efd8093643b ?18:37
rookit is running now.18:38
fultonjack18:38
openstackgerritSven Anderson proposed openstack/tripleo-quickstart master: TEST DON'T MERGE - Enabling EC2-API Tempest tests.  https://review.openstack.org/51513918:39
*** pchavva has joined #tripleo18:39
*** jtomasek has joined #tripleo18:40
*** salmankhan has quit IRC18:41
openstackgerritRonelle Landy proposed openstack/tripleo-quickstart-extras master: Remove MTU-based tests from the master and pike skip lists  https://review.openstack.org/52829218:42
weshaymwhahaha, k.. this is working.. thanks for the help https://review.openstack.org/#/c/501028/18:46
weshayit's off by default atm, we'll send a patch to turn it on upstream18:46
weshayrlandy|rover, fyi ^18:47
mwhahahak18:47
weshaySlower++18:47
*** rbrady-afk is now known as rbrady18:48
rlandy|roverfinally :) - started September 1518:48
* mwhahaha points out container-check should probably be packaged18:48
owalshmwhahaha: bootstrap_host_exec https://github.com/openstack/tripleo-heat-templates/blob/master/docker/services/nova-api.yaml#L20618:48
mwhahahaowalsh: ah so that ensure it only runs on the bootstrap node18:49
mwhahahatotally not obvious :D18:49
*** oidgar has joined #tripleo18:50
owalshmwhahaha: :-) yea, noop if the hostnames don't match hiera IIRC18:51
*** ebarrera has quit IRC18:51
mwhahahatdasilva: so if you want to run a shell script on a single node you can do so witht he bootstrap_host_exec -^ but the actual config writing out would be hard to do on all systems without publishing that key somewhere. So maybe you just need a script that runs everywhere and includes the bootstrap check18:51
*** ebarrera has joined #tripleo18:53
*** oidgar has quit IRC18:55
*** ebarrera_ has joined #tripleo19:01
*** myoung|bbl is now known as myoung19:01
openstackgerritRonelle Landy proposed openstack-infra/tripleo-ci master: Update containers when the overcloud is containerized  https://review.openstack.org/52939919:04
rookhttps://gist.github.com/jtaleric/4dee30154651ecdedc79ea820a0d3c10 fultonj have you seen this before?19:05
rookOr anyone...19:05
rlandy|roverweshay: ^^ update_containers settings in the testenv files19:05
rookThe gist of the gist... Overcloud deployment is in flight. Node gets beyond build... Node never reboots (from ironic)... So, pinging the IP fails, Deployment gets hung up. The only way to progress the deployment is to do what I dd in the gist.19:06
rookShut down the trouble node, and start it back up.19:06
rookthis used to happen a lot more frequently, which led to people having to baby-sit deployments.19:07
fultonjrook: i have seen that occasionally19:07
rookIt has happened 4x in this scale deployment.19:07
rookI will admit, I haven't seen it for a while.19:07
*** trown|lunch is now known as trown|brb19:08
*** trown|brb is now known as trown19:08
*** trown is now known as trown|ruck19:08
*** ebarrera_ has quit IRC19:10
*** oidgar has joined #tripleo19:11
*** oidgar has quit IRC19:13
*** holser__ has quit IRC19:14
rookfultonj: this seems like a expensive operation : 2017-12-20 19:11:58,144 p=356309 u=mistral |  TASK [ceph-defaults : set_fact fsid ceph_current_fsid.stdout] ******************19:14
weshayneed a third party vote on https://review.openstack.org/#/c/509660/19:19
weshayrlandy|rover, ^19:19
weshayEmilienM, do you have a sec?19:19
rlandy|roverconflict of interest19:19
rlandy|rovermy code19:19
EmilienMweshay: of course19:19
weshaythank you sir19:19
rookrlandy|rover you should of skipped that ethics training.19:19
EmilienMchem: https://review.rdoproject.org/r/#/c/10827/ needs rebase fyi19:20
rlandy|roverrook: lol - the ethics training didn't cover w+1'ing your own code - it should19:20
EmilienMweshay: too late trown|ruck approved it :) but lgtm as well19:21
weshayaight19:23
weshaythanks anyway19:23
EmilienMexciting times, tripleo periodic jobs run again19:23
EmilienMhave they already run today?19:23
*** shardy has quit IRC19:23
EmilienMweshay: re: https://review.openstack.org/#/c/526138/ - thanks again for this work, I would send an email to the ML + a patch in tripleo-docs to announce this cool feature. Thanks19:23
trown|ruckEmilienM: yes... though not with passing results19:24
tdasilvamwhahaha: one way i was thinking about doing is having one exec to create the key, and then a second exec to get the key could be run in all nodes, the problem is that I need to pass an uuid between the create and the get19:25
EmilienMtrown|ruck: what were the failures?19:27
weshayEmilienM, ok.. I'll look at tripleo-docs with a more holistic view with regards to ci again in a bit19:28
trown|ruckEmilienM: i havent got to all of the pike ones yet... master all the code passed, but there is some issue with the qcow image upload, working on getting a bug for that, then looking at pike19:28
EmilienMtrown|ruck: I can help you19:28
*** pcaruana has quit IRC19:29
EmilienMtrown|ruck: if you have a link for the pike ones, I can take a look now19:29
*** oidgar has joined #tripleo19:29
trown|ruckhttps://review.rdoproject.org/jenkins/job/periodic-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset002-master-upload/417/console19:29
trown|ruckhttps://review.rdoproject.org/jenkins/job/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset017-pike/19:29
trown|ruckhttps://review.rdoproject.org/jenkins/job/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-pike/19:29
trown|ruckhttps://review.rdoproject.org/jenkins/job/periodic-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset002-pike-upload/19:29
trown|ruckEmilienM: ^ those are all the ones that have failed on pike19:30
trown|ruckEmilienM: fs17 passed then failed.. so might pass next run19:30
trown|ruckEmilienM: and upload job is probably same as what I am making a bug for on master, but I can check it19:30
*** oidgar has quit IRC19:30
tdasilvamwhahaha: the problem is that I need to get information out of a container and I don't know how that could be done19:30
EmilienMtrown|ruck: ok, thanks. I'll let you know if I find something else19:32
EmilienM"qemu-kvm: cannot set up guest memory 'pc.ram': Cannot allocate memory",19:33
EmilienMfor the ASK [convert-image : convert image] failure19:33
EmilienMtrown|ruck: sounds like something with RDO Cloud maybe19:33
EmilienMon pike it looks like a serious error:19:35
EmilienMhttps://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset017-pike/87b9ed2/undercloud/home/jenkins/overcloud_deploy.log.txt.gz#_2017-12-20_12_56_2019:35
*** dsariel has quit IRC19:36
*** jpena is now known as jpena|off19:37
EmilienMit sounds like a valid puppet error: https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset017-pike/87b9ed2/subnode-2/var/log/journal.txt.gz#_Dec_20_12_56_1319:38
*** rlandy|rover is now known as rlandy|rover|brb19:39
mwhahahatdasilva: can you leverage something similar to what we do for the swift rings?19:39
EmilienMthis: http://paste.openstack.org/show/629493/19:40
EmilienMtrown|ruck: ^ the puppet error on pike19:40
EmilienMmwhahaha: I'm wondering if we miss a backport here /me digging19:40
mwhahahalet me see19:40
EmilienMit's the rabbitmq bundle thing19:40
rookfultonj: the builtin docker module still consumes tons of memory19:41
fultonj:(19:41
mwhahaha"/usr/bin/docker-current: Error response from daemon: invalid header field value \"oci runtime error: container_linux.go:247: starting container process caused \\\"process_linux.go:258: applying cgroup configuration for process caused \\\\\\\"write /sys/fs/cgroup/pids/system.slice/docker-985b1b2ad4bdb6643087afee7885f72d82fa0a50db03976008df692bec4b2d0d.scope/cgroup.procs: no such device\\\\\\\"\\\"\\n\".",19:41
mwhahahaEmilienM: no there's a bug in docker19:41
mwhahahai've seen this before, it's not consistent19:42
EmilienMmhh19:42
EmilienMok maybe but have you seen the rabbitmq thing also?19:42
EmilienMmwhahaha: can you review https://review.openstack.org/#/c/527404/ please?19:43
mwhahahathat's what i'm looking at19:43
mwhahahain postci19:43
tdasilvamwhahaha: was thinking i could stick a little json file in a swift object somewhat similar to swiftrings19:43
mwhahahaEmilienM: https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset017-pike/87b9ed2/postci.txt.gz19:43
tdasilvamwhahaha: this seems like the perfect job for etcd??19:43
mwhahahatdasilva: or stop doing silly stuff as part of the deployment :D19:44
trown|ruckEmilienM: https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset008-pike/a991439/undercloud/home/jenkins/failed_deployment_list.log.txt.gz also failed on pike... also looks puppet related19:44
tdasilvamwhahaha: lol, true true19:44
mwhahahaCould not find resource 'Exec[exec-setfacl-manila-manila]' for relationship from 'Ceph::Key[client.manila]' on node upstream-centos-7-2-node-rdo-cloud-tripleo-400-113.localdomain",19:45
*** ebarrera has quit IRC19:45
tdasilvamwhahaha: but this can't be the only case where we are producing dynamic data and setting it to config files, is it?19:45
tdasilvamwhahaha: passwords are generated by mistral, is that correct?19:45
mwhahahatdasilva: generated prior to deployment19:45
mwhahahatdasilva: so they are just inputs19:46
mwhahahatdasilva: and they don't rely on overcloud services19:46
tdasilvayeah, i see, in my case i actually need part of the deployment ready19:46
tdasilvaright19:46
mwhahahatdasilva: right so octavia is the only other instance of something like this really19:46
mwhahahawhich is what we're working through and it needs ansible stuff wedged in post deploy19:46
mwhahahawhich is really ugly19:46
* tdasilva goes to look at octavia19:46
mwhahahawhich is why i said this need to not be a pattern in openstack servers19:46
mwhahahatdasilva: plz don't, we don't want to repeat that pattern19:47
tdasilvaheh19:47
*** oidgar has joined #tripleo19:47
mwhahahathis is where the openstack services need to be able to handle this themselves and not require deployment/config update steps19:47
*** oidgar has quit IRC19:47
mwhahahaswift is awkward here because these is no shared db19:48
*** pcaruana has joined #tripleo19:48
tdasilvamwhahaha: if i wanted to write a script to be executed using bootstrap_host_exec, where would that script live? tripleo-heat-templates?19:50
*** jtomasek has quit IRC19:50
mwhahahatdasilva: I don't think so because i'm not sure if that's installed by default on the overcloud nodes19:51
*** jobewan has joined #tripleo19:51
tdasilvamwhahaha: ok, let me look at ringbuilder a bit see if I can do something similar19:51
mwhahahatdasilva: owalsh had to do something simialr and i think (unfortunately) we ended up putting it in tripleo-common or something19:53
*** pcaruana has quit IRC19:54
*** moshele has joined #tripleo19:55
Tenguowalsh: apparently, commenting out the bloc in the overcloud.j2.yaml and deploying did what I needed in order to get back to a stable, working stack. I'll uncomment it right after the deploy is over.19:57
* Tengu is happy, because he could correct a really messed up stack19:57
Tengubtw.... a new check might be interesting in the pre-flight checks.19:58
*** dprince has quit IRC20:01
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo stable/pike: Correct typo in manila/share.pp resource chaining  https://review.openstack.org/52940620:03
EmilienMmwhahaha: ^ the puppet error that you found - was a missing backport20:03
mwhahahak20:03
* mwhahaha blames gfidente20:04
EmilienMmwhahaha: we'll need https://review.openstack.org/#/c/527403/ as well (backported)20:04
EmilienMmwhahaha: but https://review.openstack.org/#/c/527404/ first20:04
mwhahahayea20:04
*** oidgar has joined #tripleo20:04
*** oidgar has quit IRC20:05
*** pcaruana has joined #tripleo20:06
*** rmascena__ has joined #tripleo20:08
rookfultonj: when you get back lemme know.20:08
fultonjwhat's up rook ?20:08
rookfultonj so, the fsid -- one area I think we can help... the fsid should be the same across nodes. Any reason why we run, and store the fact across all hosts?20:09
rookvs just run on a single node?20:09
rookasking around, I get the sense that fsid is unique per cluster, not per node.20:09
openstackgerritDavid Peacock proposed openstack/tripleo-quickstart-extras master: Fix failure of UI validation in some shells  https://review.openstack.org/52940720:10
*** rmascena has quit IRC20:10
fultonjyes it's one per cluster20:10
fultonjhow expensive is that operation?20:10
fultonjis it the right thing to optimize?20:11
rookThat is the first spike of 39GB (from what I can tell tracing things).20:11
*** rmascena__ is now known as raildo20:11
fultonjwow20:11
rooksorry 37GB*20:11
fultonjbig enough20:11
rookmoving to docker modules didn't help20:11
fultonjbut the docker module was for downloading and starting the image, right?20:12
fultonjpulling it into the local registry20:12
rookright, i was just mentioning that didn't hlep the utilization either.20:12
*** pcaruana has quit IRC20:13
openstackgerritMatt Young proposed openstack/tripleo-quickstart master: Featureset 22: run tempest (smoke+basic)  https://review.openstack.org/52940820:13
*** rlandy|rover|brb is now known as rlandy|rover20:14
fultonjhttps://github.com/ceph/ceph-ansible/blob/master/roles/ceph-defaults/tasks/facts.yml#L3720:15
rookhttps://github.com/ceph/ceph-ansible/blob/6a9b5c9632a39d290ebf707a21e98f17b064f198/roles/ceph-defaults/tasks/facts.yml#L1720:16
openstackgerritMatt Young proposed openstack/tripleo-quickstart master: Featureset 22: run tempest (smoke+basic)  https://review.openstack.org/52940820:16
rookI really wonder if this is the task that causes the problems fultonj ^^20:16
fultonjseems to record the result,20:16
fultonjor line 5220:16
*** amoralej is now known as amoralej|off20:17
owalshmwhahaha, tdasilva: didn't end up in tripleo-common -  https://github.com/openstack/tripleo-heat-templates/blob/master/docker/services/nova-api.yaml#L12620:17
fultonjbut what would be more resource intensive would be https://github.com/ceph/ceph-ansible/blob/master/roles/ceph-defaults/tasks/facts.yml#L1820:18
mwhahahaowalsh: oh right bash in the THT20:18
fultonjand then the question would be, can we do without that task20:18
owalshmwhahaha: probably could add the script to t-h-t and use get_file instead of inline bash script in yaml20:19
mwhahahaowalsh: not sure which is uglier :D20:20
rookfultonj: i think we sent the same thing20:20
fultonjit's name indicates that it checks if ceph is running (gets the fsid as a side effect)20:20
rookfultonj: I do see this in the stdout / mistral log :   [WARNING]: scp transfer mechanism failed on [192.168.24.71].20:21
rookwhich might be the delegation failing20:21
openstackgerritMatt Young proposed openstack/tripleo-quickstart master: Featureset 22: run tempest (smoke+basic)  https://review.openstack.org/52940820:21
rookwhich must not be all that important?20:21
owalshmwhahaha: yea, reminds me that I meant to move that to tripleo-common when I had more time20:21
fultonjrook: for tripleo... we pass it the fsid20:23
fultonjfrom heat20:23
fultonjnormally ceph-ansible needs to make it and use it but if it's defined... can we skip the task?20:23
fultonjadd an extra when20:24
fultonjor line in the when i shoul say20:24
rookfultonj: oh, it is passed??/20:25
fultonjonly for tripleo, but yes20:25
rookso we could add to the when clause.20:25
fultonji'm not convinced the play's only job is to get the fsid20:25
fultonjbut let's try it as a theory20:25
openstackgerritMark Hamzy proposed openstack/tripleo-common master: [WIP] Support multiple architectures  https://review.openstack.org/52800020:26
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Don't run check-tripleo OVB jobs frm RH1 anymore  https://review.openstack.org/52648120:26
rookfultonj well if it was to really check if ceph is running, failed_when should be true20:26
rookoh, nm... it doesn't have ignore...20:27
rookMaybe have another pre-task to check the state of the container.20:27
openstackgerritMerged openstack/tripleo-upgrade master: Fix missing attribute in upgrade Infrared plugin  https://review.openstack.org/52743720:34
openstackgerritMerged openstack/tripleo-upgrade master: Use parameter to control the docker registry env file creation  https://review.openstack.org/52743820:35
*** oidgar has joined #tripleo20:35
rookfultonj: how is the fsid passed?20:37
fultonjrook: ansible-playbook ... --extra-vars {..., "fsid": "2d87a5e8-8e72-11e7-a223-003da9b9b610", ...}20:38
fultonjrook: you can see it in the executor log20:38
fultonjwe also pass "generate_fsid": false,20:39
fultonjso that could be the easiest when to add20:39
fultonjassuming that's all that that task does20:39
*** oidgar has quit IRC20:40
*** links has joined #tripleo20:41
rookso the current code is really overwriting the fsid with the same value.20:45
*** links has quit IRC20:47
aleerlandy|rover, EmilienM , mwhahaha, weshay - well this is reassuring -- looks like the rebuild container patches worked (mostly)20:49
mwhahahaworked-ish20:49
aleerlandy|rover, mwhahaha , EmilienM , weshay still confirming that it all got pulled in  but ... https://review.openstack.org/#/c/529181/  looks good20:50
aleeat least for the zuul check20:50
aleethat patch pulls in changes from barbican and nova20:51
aleeand all the tests in scenario 2 pass20:51
weshaynice20:51
aleethere are some failures in some of the dependent packages -- looking to see what happened20:51
aleemwhahaha, weshay , rlandy|rover where are the logs to show the container rebuilds?20:53
*** jcoufal has quit IRC20:53
*** catintheroof has joined #tripleo20:54
weshayalee, in /homne/zuul overcloud-prep-containers.log20:54
weshayalee, we don't rebuild.. just install the rpm update on the container20:54
weshayrebuilding would take too long20:54
*** vpickard is now known as vpickard_20:57
*** bfournie has quit IRC20:58
*** dprince has joined #tripleo21:00
*** dsariel has joined #tripleo21:00
*** raildo has quit IRC21:01
openstackgerritDan Prince proposed openstack-infra/tripleo-ci master: Update reviewday project list  https://review.openstack.org/52671221:02
openstackgerritDan Prince proposed openstack/tripleo-heat-templates master: swift_rsync: don't bind mount /run  https://review.openstack.org/51302021:03
openstackgerritwes hayutin proposed openstack/instack-undercloud master: DNM, undercloud containers TESTING ONLY  https://review.openstack.org/51811821:03
openstackgerritwes hayutin proposed openstack/tripleo-quickstart master: DNM: Update undercloud install options for containers  https://review.openstack.org/51744521:06
aleeweshay, rlandy|rover , mwhahaha , EmilienM  so in that last review https://review.openstack.org/#/c/529181/ looking at the overcloud-prep-containers.log, it appears the packages I updated in fact got updated.21:09
aleethat is nova-compute and barbican-*21:09
weshayalee, I love it when a plan comes together21:09
aleeand all the tests passed21:10
rlandy|rovergood news21:10
weshaythat's the way to go out in 201721:10
aleebut, I do not see the logs for those services21:10
weshaymwhahaha, merge it .. merge it21:10
weshaypeer opensource pressure21:10
aleein fact the only logs I don't see are the ones that were changed21:10
mwhahahai think it's already in the gate21:11
weshayoh21:11
weshaymerry xmas to everyone then :)21:11
mwhahahawe still need to get packaged tho21:11
* mwhahaha doesn't like the pip install in quickstart21:11
weshaySlower, let's build a rpm together.. at lowes21:11
aleeif I am looking in the right place, the logs should be in http://logs.openstack.org/81/529181/1/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/0d670f4/logs/subnode-2/var/log/containers/  ? right?21:12
aleeno barbican logs and no nova-compute.log21:12
mwhahahawonder if the rebuilds break the log mounts21:14
openstackgerritMike Fedosin proposed openstack/tripleo-common master: Remove "overcloud-swift-rings" container during overcloud deletion  https://review.openstack.org/52941421:14
aleelooks like https://review.openstack.org/#/c/524064/  timed out -- rechecking ..21:16
mwhahahaalee: ah the might explain the missing logs if it timed out before they got collected21:17
aleemwhahaha, no -- thats a different review21:17
aleemwhahaha, I rechecked that one .. that one reported failure21:18
*** catinthe_ has joined #tripleo21:23
*** catintheroof has quit IRC21:25
openstackgerritMike Fedosin proposed openstack/tripleo-common master: Remove "overcloud-swift-rings" container during overcloud deletion  https://review.openstack.org/52941421:25
aleemwhahaha, weshay , rlandy|rover there are a number of images that got updated.  (not just the ones I modified).  It looks like all of those images now lack log files21:25
* weshay checks another build21:26
aleeyou can see the differences in the same review -- http://logs.openstack.org/36/527136/2/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/cc76cd0/logs/subnode-2/var/log/containers/  this was from a previous run21:26
weshayalee, honestly this sounds more like a bug w/ containers21:27
weshayalee, we're just updating the rpm21:27
weshayhrm...21:28
aleeon the same review.  I looked for Running docker command: /usr/bin/docker push" in the overcloud-image-prep.log and searched for the corresponding containers21:28
aleethat is - corresponding logs21:28
weshaysame thing here21:28
weshayhttp://logs.openstack.org/72/515372/17/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/98c339f/logs/subnode-2/var/log/containers/nova/21:28
weshayno compute log21:28
weshaythis is problem21:28
weshaywill containers ever get yum updated in the field?21:28
weshayalee, we need a lp on this21:29
mwhahahaweshay: this is the thing that needed to get solved for this cycle as how to deploy hotfixes, etc21:29
weshayya21:29
mwhahahawelcome to containers!21:30
aleeweshay, yeah .. you want to file it or shall I?21:30
weshaywooo hooo21:30
mwhahahawere all the old problems are new again21:30
weshaymwhahaha, no worries.. I'm sure another container will fix this21:30
mwhahahabecause solving them the first time wasn't a big enough pain in the bit21:30
weshaySlower, get over here21:30
weshayhttps://www.projectatomic.io/blog/2016/02/dont-run-yum-update-within-a-running-container/21:31
weshayfirst hit21:31
weshayalthough it only says not to because of the time it takes21:32
mwhahahayou wouldn't want to under normal circumstances21:33
mwhahahabecause you'd have to do it ever container launch21:34
mwhahahabut it's ok for testing new packages i guess21:34
weshaydmsimard, ping.. when we build the containers in rdo, are we pulling the latest udpates from centos?21:34
mwhahahayou'd want to rebuild21:34
weshayhrm.. is there a thread on the topic that I've missed?21:34
dmsimardweshay: you're pulling from the base centos7 image21:34
dmsimardwhatever it is21:34
*** catinthe_ has quit IRC21:35
dmsimardweshay: https://hub.docker.com/r/library/centos/tags/ "7" and "latest" dates from 20 days ago21:35
weshaydmsimard, just qq.. wondering if you have seen this...21:35
weshaysay we update openstack/nova on a container.. we loose the nova compute log21:35
*** catintheroof has joined #tripleo21:35
weshayever see anything like that?21:36
*** bnemec has quit IRC21:36
dmsimardme? I have absolutely no clue, mostly because I haven't worked on the underlying implementation21:36
*** threestrands_ has joined #tripleo21:36
weshayk21:36
dmsimardit might be a question for opstools ? I think they worked on logging in general and on fluentd implementation21:36
*** lblanchard has joined #tripleo21:37
dmsimardmwhahaha, weshay: in a containerized workflow, you usually don't run yum update or apt-get update -- you build a new container and redeploy21:37
aleemwhahaha, got a  failure on https://review.openstack.org/#/c/527136/  -- though scenario 2 did succeed ..21:37
weshaydmsimard, yes yes.. this is back to how to do it quickly in ci21:38
weshaydmsimard, you were in on that conversation :)21:38
dmsimardweshay: sure, but you're not supposed to actually run that in a running container21:38
Slowerhmm21:38
dmsimardweshay: you need to add a layer which would be like FROM <the image you want to update> RUN yum -y update21:38
dmsimardand then deploy the new layer resulting image21:39
aleemwhahaha, a recheck will prob fix it -- but just confirming .. patches that were based on top of this one succeeded21:39
Slowerfor the actual yum update that is probably a better idea21:39
SlowerI dunno why I did it the way I did now :)21:39
dmsimardOCI is like read only by default21:39
dmsimardso yeah21:40
*** catintheroof has quit IRC21:40
SlowerI don't see why it would cause us to lose logs though, that's strange21:40
dmsimarddo we have logs of the update ?21:40
dmsimardI have no idea but I'm curious21:40
Slowerthe only thing I can think is that the metadata on the container changed21:41
weshayhttp://logs.openstack.org/72/515372/17/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/98c339f/logs/undercloud/home/zuul/overcloud_prep_containers.log.txt.gz21:41
weshayhttp://logs.openstack.org/72/515372/17/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/98c339f/logs/undercloud/home/zuul/overcloud_prep_containers.log.txt.gz#_2017-12-20_16_40_5921:42
dmsimard"INFO: Removing container" ?21:42
Slowerlots of base OS updates..21:43
Slowerdmsimard: start at container-check21:43
*** lblanchard has quit IRC21:43
Slowerhrrm21:44
dmsimardSlower: /usr/bin/docker run --user root --rm 192.168.24.1:8787/tripleomaster/centos-binary-aodh-api:c8cceebf8e648ce46219026f926047491135a66e_fcf8d179 rpm -qa21:44
Slowerweshay: we have a few problems here21:44
dmsimardSlower: to me, that reads: "start the aodh-api container if it's not already running, run rpm -qa on it and remove the container once you're done"21:45
Slowerdmsimard: so it gets a list of rpms in the container and compares that to the yum database21:45
Slowerthen updates only containers that need it21:45
dmsimardSlower: was the container already running ? maybe removing it messes up the logging ? I dunno, just brainstorming trying to give ideas21:45
Slowerno it wouldn't be running then..21:46
Slowerdmsimard: running it messes up CMD though21:46
Slower/usr/bin/docker run --user root --net host --volume /etc/yum.repos.d:/etc/yum.repos.d --volume /opt:/opt --name yum-update-7 192.168.24.1:8787/tripleomaster/centos-binary-neutron-openvswitch-agent:c8cceebf8e648ce46219026f926047491135a66e_fcf8d179 yum -y update21:46
Slowerand then we commit it after with CMD changed to how it should be21:47
*** trown|ruck is now known as trown|outtypewww21:50
openstackgerritIan Main proposed openstack/instack-undercloud master: DNM: Testing containerized undercloud.  https://review.openstack.org/52941921:53
*** ramishra has quit IRC21:59
*** itlinux_ has joined #tripleo22:00
*** akrivoka has quit IRC22:02
weshaySlower, dmsimard so any changes needed to container-check?22:03
*** Goneri has quit IRC22:07
weshayalee, mwhahaha the issue is not w/ the update http://logs.openstack.org/25/528125/4/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/ba349fc/logs/subnode-2/var/log/containers/nova/22:08
weshayI see several reviews w/o nova compute logs22:08
weshaybut it did work here...22:09
weshayhttp://logs.openstack.org/07/472607/159/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/84a4673/logs/subnode-2/var/log/containers/nova/22:09
dmsimardI WAS TOLD CONTAINERS WOULD SOLVE ALL OF MY PROBLEMS22:10
dmsimard(╯°□°)╯︵ ┻━┻22:10
*** apetrich has quit IRC22:11
*** apetrich has joined #tripleo22:11
weshayalee, mwhahaha appears to be random http://logs.openstack.org/56/529356/1/check/tripleo-ci-centos-7-scenario001-multinode-oooq-container/5f0dc70/logs/subnode-2/var/log/containers/nova/22:12
aleeweshay, perhaps - but what I saw in this case is that it seems pretty much all the images that were updated did not have logs (at least the ones I checked)22:12
weshayalee, I just pointed out 3 runs w/o updates that did not have logs22:13
weshayyes, it's a problem, no it's not being caused by update22:13
aleeweshay, understood -- was there a change that went in over the last couple of days that broke logs?  I think we handle the mounts for these in the same way ..22:14
weshaynot sure.. going to get a bug going22:14
*** fultonj has quit IRC22:16
weshayhttps://bugs.launchpad.net/tripleo/+bug/173949222:17
openstackLaunchpad bug 1739492 in tripleo "nova compute log missing in some containerized deployments" [High,Triaged]22:17
*** itlinux_ has quit IRC22:18
*** Goneri has joined #tripleo22:23
*** itlinux_ has joined #tripleo22:30
*** alee is now known as alee_afk22:31
*** rcernin has joined #tripleo22:32
openstackgerritwaleed mousa proposed openstack/tripleo-heat-templates master: Adding support for role parameters in "environment_generator.py"  https://review.openstack.org/52942222:32
*** trozet has quit IRC22:32
*** jappleii__ has joined #tripleo22:35
tbarronweshay: will containers ever get yum updated in the field?22:36
tbarronweshay: I don't know the plan on this, maybe mburns does?  Storage delivers several hotfixes a week sometimes, mostly in cinder.22:36
*** threestrands_ has quit IRC22:36
tbarronweshay: could do docker build and docker push to a registry (where?); then pull from overcloud nodes?22:37
tbarronweshay: the (where?) on the registry is due to the hotfix being customer specific, not a generally published fix.22:38
tbarronweshay: and there are 'test-only' patches, delivered to customers who are willing to try the band-aid and see if it helps22:38
*** paramite has quit IRC22:41
dmsimardtbarron: that's what I was trying to convey earlier22:44
tbarrondmsimard: oh, you probably did then, I was just catching up, reading backlog, and that issue has been on my mind.22:45
dmsimardtbarron: containers are *usually* treated as read only, if you need to do an update, you re-build and re-deploy -- in the worst case scenario, you start from the image you currently have, add a layer (ex: yum update) and then deploy the new image you got from adding that layer22:45
tbarrondmsimard: exactly22:45
dmsimardI don't know what's the intent, but it's usually what people do with containers22:45
*** pchavva has quit IRC22:46
dmsimardbuilding on top of existing layers is proabably the safest route but it can probably lead to bloated images down the road22:46
tbarrondmsimard: I think that's the intent; all mutable content (config, logs, etc. ) are bind mounts form the host22:46
tbarrons/form/from/22:46
tbarrondmsimard: well, image consolidation is a worthwhile goal but I think it's not an immediate goal22:47
tbarrondmsimard: getting rid of misleading config and packages on the host woule IMO be higher prio22:47
tbarrondmsimard: I'm somewhat concerned that we'll have unanticipated support and maintenance issues22:48
dmsimardtbarron: but anyway, regarding the yum update thing22:48
*** jmelvin has quit IRC22:48
tbarrondmsimard: not an objection to the projectk, but anyways somethihng we'll deal with22:48
dmsimardtbarron: in the context of CI, we might end up in a scenario where we need to rebuild the openstack-nova package because we're doing a depends-on a nova patch for example -- now we'd need to rebuild all the containers with that package in it... but it's actually trickier than that22:49
dmsimardbecause with some packages (i.e, oslo), you'll find those either in all container images or very early on in the hierarchy tree22:50
dmsimardso you end up having to rebuild all container images which is very expensive from inside a job that is already long22:50
tbarrondmsimard: ack.  consider the bugs that are fixed for nova / cinder via the common brick library.22:50
tbarrondmsimard: it's nice that nova can run with its brick and cinder with its brick, but :_22:51
dmsimardthe objective with the yum update workflow in the context of containers in CI is to build the package once, create a local repository and then add a layer which does a yum update on every container image22:51
tbarron:)22:51
dmsimardwhich is significantly faster and less expensive than a full rebuild22:51
*** ManoX has joined #tripleo22:51
dmsimardnow, I don't know the specifics since I'm not intimately involved in that but I was part of the early discussions :)22:51
tbarrondmsimard: got it.  Optimizing for time and consistency across containers and then later looking at space/layer consolidation as another pass may make sense.  But what do I know?22:53
dmsimardtbarron: I'm not sure if the expectation is to use this kind of workflow in the field22:53
dmsimardtbarron: there's no orchestration around it, it's a dumb yum update.. so there's no notion of sql migrations or whatever22:54
*** itlinux_ has quit IRC22:54
*** dsariel has quit IRC22:54
tbarrondmsimard: well, that's why thinking through the hot-fix and test-fix scenarios needs to be done if it hasn't already been done.22:55
tbarrondmsimard: with the old rpm / build system there was a process for these that was understood, ugly but understood.22:55
*** etingof has quit IRC22:56
*** itlinux_ has joined #tripleo23:09
*** dprince has quit IRC23:10
*** etingof has joined #tripleo23:11
*** dhill_ has joined #tripleo23:30
openstackgerritMark Hamzy proposed openstack/tripleo-common master: [WIP] Support multiple architectures  https://review.openstack.org/52800023:34
openstackgerritJohn Fulton proposed openstack/tripleo-heat-templates master: Add new roles for Ceph containerization  https://review.openstack.org/52198923:38
*** itlinux_ has quit IRC23:42
*** itlinux__ has joined #tripleo23:46
*** rlandy|rover is now known as rlandy|bbl23:47
*** moshele has quit IRC23:56

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!