Tuesday, 2018-07-31

*** Goneri has joined #oooq00:12
*** agopi|brb has joined #oooq00:20
*** agopi|brb is now known as agopi00:23
*** Goneri has quit IRC00:28
rlandybbl00:31
*** rlandy has quit IRC00:31
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, legacy-tripleo-ci-centos-7-container-to-container-upgrades-master, legacy-tripleo-ci-centos-7-multinode-1ctlr-featureset037-updates-master @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-3nodes- (1 more message)00:39
*** rfolco|off is now known as rfolco|ruck01:01
*** vinaykns has joined #oooq01:16
*** rfolco|ruck is now known as rfolco|off01:55
*** vinaykns has quit IRC01:57
*** vinaykns has joined #oooq01:57
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, legacy-tripleo-ci-centos-7-container-to-container-upgrades-master, legacy-tripleo-ci-centos-7-multinode-1ctlr-featureset037-updates-master @ https://review.openstack.org/56044502:39
*** links has joined #oooq03:15
*** skramaja has joined #oooq03:50
*** skramaja_ has joined #oooq04:00
*** skramaja has quit IRC04:00
*** jaganathan has joined #oooq04:00
*** jaganathan has quit IRC04:02
*** jaganathan has joined #oooq04:02
*** jaganathan has quit IRC04:03
*** jaganathan has joined #oooq04:03
*** gkadam has joined #oooq04:05
*** yolanda has quit IRC04:09
*** vinaykns has quit IRC04:14
*** ykarel|away has joined #oooq04:23
*** yolanda has joined #oooq04:24
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario008-multinode-oooq-container, tripleo-ci-centos-7-3nodes-multinode @ https://review.openstack.org/567224, stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, legacy-tripleo-ci-centos-7-container-to-container-upgrades-master @  (1 more message)04:39
*** udesale has joined #oooq04:50
*** bogdando has joined #oooq05:10
*** skramaja_ is now known as skramaja05:21
*** ykarel|away has quit IRC05:26
*** jfrancoa has joined #oooq05:29
*** ratailor has joined #oooq05:35
*** ykarel|away has joined #oooq05:55
*** gkadam has quit IRC06:04
*** gkadam has joined #oooq06:05
*** quiquell has joined #oooq06:18
*** ccamacho has joined #oooq06:21
quiquellGoing to cry, we have merged the fix :-)06:22
quiquellmarios: On friday I made a little dashboard-ci demo, in case you are interested I can do it again for you06:23
mariosquiquell: o/06:24
mariosquiquell: which fix06:24
mariosquiquell: tags?06:24
mariosquiquell: dashboard-ci memo?06:24
mariosoh demo06:24
mariosquiquell: cool sure maybe save it for the community call this afternoon?06:24
quiquellmarios: Already did for the rest of the team06:26
quiquellTag fix yes :-)06:26
*** ykarel|away is now known as ykarel06:35
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario008-multinode-oooq-container, tripleo-ci-centos-7-3nodes-multinode @ https://review.openstack.org/567224, stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, legacy-tripleo-ci-centos-7-container-to-container-upgrades-master @  (1 more message)06:39
*** bogdando has quit IRC06:44
*** quiquell is now known as quiquell|bbl07:04
*** jaganathan has quit IRC07:09
*** jaganathan has joined #oooq07:09
*** links has quit IRC07:13
*** bogdando has joined #oooq07:24
*** amoralej|off is now known as amoralej07:35
*** ykarel is now known as ykarel|lunch07:36
*** tesseract has joined #oooq07:42
*** florianf has joined #oooq07:43
*** udesale has quit IRC07:49
*** udesale has joined #oooq07:49
*** tosky has joined #oooq07:55
quiquell|bblmarios: Looking at your bootstrap patch08:07
*** quiquell|bbl is now known as quiquell08:07
quiquellWe don't want to have a pre.yaml with all the tasks ?08:07
*** udesale has quit IRC08:13
*** links has joined #oooq08:13
*** udesale has joined #oooq08:16
*** ykarel|lunch is now known as ykarel08:18
mariosquiquell: you mean https://review.openstack.org/583195 ?08:21
quiquellmarios: Yep08:22
mariosquiquell: it was renamed to 'ceph.yaml' after discussion https://review.openstack.org/#/c/583195/10/playbooks/tripleo-ci/pre.yaml08:23
mariosquiquell: but it is still wired up the same into pre.yaml @ https://review.openstack.org/#/c/583195/12/zuul.d/base.yaml08:24
quiquellmarios: Don't know if we will need it for the reproducer refactoring08:24
*** gkadam has quit IRC08:31
*** gkadam has joined #oooq08:37
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, legacy-tripleo-ci-centos-7-container-to-container-upgrades-master @ https://review.openstack.org/560445, stable/queens: tripleo-ci-centos-7-scenario008-multinode-oooq-container, tripleo-ci-centos-7-3nodes-multinode @  (1 more message)08:39
*** gkadam has quit IRC08:41
*** brault has joined #oooq08:55
*** sshnaidm|afk has quit IRC09:02
*** sshnaidm has joined #oooq09:04
quiquellsshnaidm: Are you there ? have some question with some toci bash variables used at influxdb09:12
*** ratailor has quit IRC09:20
*** ratailor has joined #oooq09:24
sshnaidmquiquell, yep09:25
*** zoli is now known as zoli|lunch09:32
quiquellsshnaidm: They are REMAINING_TIME and STATS_TESTENV09:35
quiquellsshnaidm: I see that REMAINING_TIME is used for the run_with_timeout09:36
quiquellsshnaidm: But we are not running this with reproducer and we will use ansible stuff for that purpose09:36
quiquellsshnaidm: And zuul has its own mechanism for timeouts... so maybe we can remove it ?09:36
quiquellsshnaidm: Even the use of run_with_timeout09:37
quiquellsshnaidm: Zuul is already timing out09:37
sshnaidmquiquell, run_with_timeout was for stopping job before zuul kill it to collect logs, but when we started to collect logs in post.yaml..09:38
sshnaidmquiquell, maybe we can remove it09:38
sshnaidmquiquell, but it will be a problem for OVB jobs09:39
sshnaidmquiquell, because right now OVB hosts are erased right after zuul kills the job (when it times out) and we can't collect logs from them09:39
sshnaidmquiquell, remaining time we get from devstack I think.. need to check if we get it in any way in new zuul way09:41
quiquellsshnaidm: Sure we have, I have a review that prints all the vars09:41
quiquellsshnaidm: You can find the variable here https://review.openstack.org/#/c/581313/09:41
quiquellsshnaidm: Where do we do the cleanup of OVB nodes ?09:42
sshnaidmquiquell, we don't really control cleanup of OVB jobs, it's something between zuul and te-broker09:43
sshnaidmquiquell, I've already checked once if it's possible to hold them, seems like not really09:43
quiquellsshnaidm: We don't send the delete order from toci ?09:43
quiquellsshnaidm: So I don't get the relation with REMAINING_TIME09:43
sshnaidmquiquell, no, all managements of OVB stack is done in te-broker09:43
sshnaidmquiquell, blue?09:44
quiquellsshnaidm: Sure09:44
sshnaidmhttps://bluejeans.com/u/sshnaidm/09:44
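The run_with_timeout idea discussed above (stop the job a little before the outer timeout so that logs can still be collected) can be sketched roughly as follows. This is a minimal Python illustration, not the toci bash implementation; `run_with_budget` and `reserve_seconds` are hypothetical names, and the 300-second reserve is an arbitrary assumption.

```python
import subprocess


def run_with_budget(cmd, remaining_seconds, reserve_seconds=300):
    """Run cmd, stopping it early enough to leave reserve_seconds for log collection.

    Returns the command's exit code, or None if it was skipped or stopped on timeout.
    """
    # Hypothetical budget rule: total remaining time minus a reserve for collecting logs.
    budget = remaining_seconds - reserve_seconds
    if budget <= 0:
        # No time left for the command itself; go straight to log collection.
        return None
    try:
        return subprocess.run(cmd, timeout=budget).returncode
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising, so logs can be collected next.
        return None
```

A caller would treat a `None` result as "ran out of time" and move on to log collection while the hosts still exist, which is the OVB concern raised above.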
*** panda|rover|off is now known as panda|rover09:48
*** ratailor has quit IRC09:51
*** ratailor has joined #oooq09:52
*** jaosorior has quit IRC09:53
*** jaosorior has joined #oooq09:54
*** ratailor has quit IRC09:56
*** ratailor has joined #oooq09:57
ssbarneacan we please merge the pause bugfix? https://review.openstack.org/#/c/583965/10:08
ssbarneait breaks all the time if console is redirected.10:09
panda|roverssbarnea: are you sure connection: local is an effective replacement for local_action ?10:11
panda|roverssbarnea: any reason why you're not using local_action directly ?10:11
ssbarneapanda|rover: because code is much harder to read when using local_action10:13
ssbarneathey have same functionality, but 2nd one is easier to read (and lint)10:13
panda|roverssbarnea: then use delegate_to: 127.0.0.110:14
panda|roverssbarnea: I'm reading docs and connection: has some implications10:14
panda|roverssbarnea: https://docs.ansible.com/ansible/2.6/user_guide/playbooks_delegation.html#local-playbooks10:14
panda|roverssbarnea: the note10:14
panda|roverssbarnea: also if you look at the last example here https://docs.ansible.com/ansible/2.6/user_guide/playbooks_delegation.html#local-playbooks, you can use the args per line syntax10:16
quiquell panda|rover: Good morning, do you have some time for sprint sync ?10:16
panda|roverjust add the argument module:10:16
ssbarneareading it, i need to test to be sure, i know that these nuances can make a big difference.10:16
ssbarneasure, will do.10:17
panda|roverssbarnea: thanks10:17
panda|roverquiquell: sure10:18
ssbarneapanda|rover: there is another way to fix the bug, bumping ansible to 2.5.7 https://review.openstack.org/#/c/587371/10:19
ssbarneasadly jobs seem to be broken for some weird missing --requirements error.10:20
quiquellpanda|rover: Already at your blue10:20
quiquellsshnaidm: Can I do the same with STATS_OOOQ ?10:21
panda|roverssbarnea: yeah, too much to test before we can bump, yours seems really a good solution, instead of pausing blindly10:21
panda|roverssbarnea: I don't want to block too much, if you don't have time to test the suggestion, I can take it back.10:22
ssbarneapanda|rover: i will rework it, also fixing the 35s instead of 30 (marios comment)10:25
ssbarneaclearly quickstart.sh looks broken to me: documentation states --requirements argument but implementation accepts only --requirements-file10:32
ssbarneasshnaidm: I think you should be able to help me with that https://github.com/openstack/tripleo-quickstart/commit/0a30e04efe41c8e1579731c848e4d78f4a3768da10:36
ssbarneayou added "-file" suffix, but documentation line was not updated, also these jobs were not updated. Am I wrong to believe that -file should be removed?10:37
ssbarneawhat i do not understand is how this was not caught by any gate, it is mainly an API change that would break any job passing the --requirements parameter.10:38
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario008-multinode-oooq-container, tripleo-ci-centos-7-3nodes-multinode @ https://review.openstack.org/567224, stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, legacy-tripleo-ci-centos-7-container-to-container-upgrades-master @  (1 more message)10:39
ssbarneapanda|rover: ^ tell me if I am correct/wrong, so I would know if I should make a CR to remove -file or something else.10:39
panda|roverssbarnea: sorry, context ?10:42
*** zoli|lunch is now known as zoli10:47
ssbarneapanda|rover: look at https://github.com/openstack/tripleo-quickstart/commit/0a30e04efe41c8e1579731c848e4d78f4a3768da#diff-8846dd18c9ee9c09dadeee541156c2b8L194 -- and line 27010:52
ssbarneathis caused all rdo builds to fail, example: https://ci.centos.org/job/tripleo-quickstart-gate-master-delorean-quick-basic/6550/console10:54
ssbarneai just want to know if I should remove the -file suffix, or just add back the one that does not have the suffix.10:55
sshnaidmquiquell, yep10:57
sshnaidmssbarnea, yeah, I think you can remove it10:58
sshnaidmssbarnea, I mean -file suffix10:58
ssbarneadoing it right now, thanks.10:58
sshnaidmand also remove these "--requirements" from ci.centos jobs10:59
sshnaidmssbarnea, ^^10:59
sshnaidmit's completely redundant there10:59
sshnaidmoh, we use there another requirements file, ok.. could be replaced by "-r" then11:00
ssbarneasshnaidm: panda|rover https://review.openstack.org/#/c/587384/ -- fix for --req*11:01
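The breakage discussed above comes from an option being renamed without keeping the old spelling. One way to avoid that class of failure is to register both spellings against the same destination. The sketch below uses Python's argparse purely for illustration; quickstart.sh is a bash script and parses its options differently, and the default file name here is an assumption.

```python
import argparse


def build_parser():
    parser = argparse.ArgumentParser(prog="quickstart-sketch")
    # Both long spellings and the short -r map to one destination, so callers
    # passing the older flag keep working after a rename.
    parser.add_argument(
        "-r", "--requirements", "--requirements-file",
        dest="requirements",
        default="requirements.txt",  # hypothetical default, for illustration only
        help="path to a pip requirements file",
    )
    return parser
```

With this shape, a gate job that passes either `--requirements` or `--requirements-file` parses to the same value, so documentation and callers cannot drift apart silently.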
panda|roverssbarnea: is there a bug for the rdo failures ?11:01
ssbarneanope, btw, where should I raise the bugs?11:03
panda|roverssbarnea: in launchpad11:03
ssbarneaok, doing it, refreshing review after.11:04
panda|roverssbarnea: yep, please add the close bug tag in the review11:05
panda|roverssbarnea: paste the bug here, so I can review it and eventually assign tags11:05
ssbarneahttps://bugs.launchpad.net/tripleo-quickstart/+bug/178460811:07
openstackLaunchpad bug 1784608 in tripleo-quickstart "quickstart.sh: ERROR: unknown option: --requirements" [Undecided,New]11:07
ssbarneai am kinda happy to use LP instead of storyboard :)11:08
panda|roverssbarnea: don't get too happy, the plan is to move11:11
ssbarneayeah, i know... hopefully not all plans need to materialize.11:12
ssbarneasshnaidm: panda|rover : now you can vote on https://review.openstack.org/#/c/587384/ again, i mentioned the bug.11:13
panda|roverssbarnea: you said all the rdo jobs were failing for this bug ?11:14
panda|roverssbarnea: do you have example logs ?11:14
ssbarneapanda|rover: yep, check this https://review.openstack.org/#/c/587371/ -- and look at failures, all of them for the same reason which is unrelated to the change.11:15
panda|roversshnaidm: now I'm confused :)11:20
panda|roverssbarnea: do you have time to chat ?11:26
ssbarneasure11:27
ssbarneapanda|rover:  https://bluejeans.com/u/ssbarnea11:29
panda|roverssbarnea: give me 2 minutes11:31
panda|roverssbarnea: I'm there11:33
quiquelldamn: sshnaidm and ssbarnea have even the same number of letters11:33
panda|roverquiquell: duh11:33
ssbarneatry https://bluejeans.com/265541792811:33
panda|roverquiquell: they are both standard unix usernames11:34
panda|rover8 letters11:34
panda|roverlike the old times11:34
quiquellpanda|rover: Unix etiquette11:34
panda|roverssbarnea: I'm there11:34
quiquellsshnaidm: Question, do you have a env of dashboard-ci running ? looks like the bot is not legit11:35
panda|roverssbarnea: ok, let's talk here11:45
ssbarneahttps://bluejeans.com/265541792811:47
*** amoralej is now known as amoralej|lunch11:55
*** rfolco|off is now known as rfolco|ruck12:12
*** panda|rover is now known as panda|rover|lunc12:18
*** ratailor has quit IRC12:32
weshaymorning12:34
weshaypanda|rover|lunc, rfolco|ruck hey.. new kolla bug on horizon.. but good news is that jobs are kicking :)12:34
*** rlandy has joined #oooq12:35
panda|rover|luncnew ? so it's still in horizon, but a different one from sunday ?12:35
weshaypanda|rover|lunc, ya12:35
weshaypanda|rover|lunc, https://trello.com/c/qGAKp5Yw12:36
weshaypanda|rover|lunc, rfolco|ruck also I noticed a bug in the script that creates escalations12:36
weshayshould be fixed now12:36
quiquellweshay: gm, fyi rr dashboard is now at dashboard-ci.tripleo.org the migration to tripleo-infra is done12:37
weshayquiquell, nice12:37
weshayquiquell, what is up w/ the RDO CI stats12:37
weshaywe're saying 90% of those jobs are failing?12:38
weshaythat is coming from the zuul json?12:38
weshayquiquell, ?12:39
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/567224, stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario009-multinode-oooq, legacy-tripleo-ci-centos-7-container-to-container- (1 more message)12:39
weshayquiquell, wouldn't we see more fs001/35 failures ^12:39
*** ykarel is now known as ykarel|away12:40
*** agopi has quit IRC12:40
rlandyrfolco|ruck: panda|rover|lunc: weshay: when you get a chance, pls see reviews from https://trello.com/c/TVsZ3Ut6/877-clean-up-rdo-sf-legacy-code-after-zuulv3-migration - didn't kick the browbeat job. I think we may have to merge some of these?12:40
weshayquiquell, where would you like bugs written for the ruck/rover cockpit?12:42
quiquellweshay: I am back12:42
weshaypanda|rover|lunc, rfolco|ruck btw.. http://cistatus.tripleo.org/promotion/ is back12:43
quiquellweshay: For bugs FIXME section at https://docs.google.com/document/d/1MHflTy9krTFGrZ4nL_PG4AJDlcynWVkau9Mjvnkq094/edit is enough for now12:43
weshayquiquell, k.. thanks12:43
weshayquiquell, so what's up w/ the rdo jobs?12:43
weshay90% fail?12:43
quiquellweshay: Let me check12:43
weshayquiquell, ya.. what is the data source on that12:44
quiquellweshay: The zuulv3 builds API12:45
quiquellHave to document all that at the panels12:45
quiquellweshay: Exploring them here http://dashboard-ci.tripleo.org/d/-UEjGKFmz/exploration?orgId=1&var-influxdb_filter=type%7C%3D%7Crdo12:47
quiquellweshay: They look pretty broken yes12:48
quiquellweshay: Going to verify, doesn't make sense12:49
weshayquiquell, sorry.. what is the url for the json?12:51
quiquellweshay: https://softwarefactory-project.io/zuul/api/tenant/rdoproject.org/builds12:52
panda|rover|luncI'm seeing some of these https://logs.rdoproject.org/94/587394/1/openstack-check/legacy-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-queens-branch/caf9600/job-output.txt.gz#_2018-07-31_12_42_39_921617 in the latest jobs12:53
quiquellweshay: for tqe https://softwarefactory-project.io/zuul/api/tenant/rdoproject.org/builds?project=openstack/tripleo-quickstart-extras12:53
weshayquiquell, thanks12:53
weshayrfolco|ruck, you see https://softwarefactory-project.io/zuul/api/tenant/rdoproject.org/builds  and search for FAILURE12:54
panda|rover|luncrfolco|ruck: ^12:54
weshaysshnaidm, how we doing on getting collaborators on sova?12:55
panda|rover|luncrfolco|ruck: asking in #sf about these, but seems we have similar failures in post pipeline too12:55
sshnaidmweshay, cores were added12:55
sshnaidmweshay, and panda|rover|lunc was collaborator there before12:55
sshnaidmweshay, you should have mail about it12:55
rlandysshnaidm++12:57
hubbotrlandy: sshnaidm's karma is now 412:57
rlandyI missed that promotions/rdocloud job page12:57
sshnaidmrlandy, yeah, should work now12:58
rlandysshnaidm: now we are going to change the jobs names again :)12:58
weshaysshnaidm, thank you!!12:58
sshnaidmweshay, also moved ovb jobs from promotion page to "check" jobs12:59
rlandyyay12:59
weshaysshnaidm, ya.. that was smart12:59
weshaythank you12:59
rlandymarios: hi - how are we going on the pike failure?13:00
*** agopi has joined #oooq13:01
weshayquiquell, sshnaidm so looking at the results from sova and the ruck/rover cockpit.. judging  http://dashboard-ci.tripleo.org/d/cEEjGFFmz/cockpit?panelId=63&fullscreen&orgId=113:01
weshaydoesn't seem right to me13:01
mariosrlandy: waiting on the latest run, it is going to hang/timeout again (from console) almost done watching zuul run13:02
*** ykarel|away has quit IRC13:02
mariosrlandy: it should include logs this time13:03
mariosrlandy: https://trello.com/c/GvjcnJB2/850-translate-tripleosh-bootstrap-subnodes-into-a-series-of-tasks#comment-5b60384c3e7a3cc8bd6f081913:03
quiquellweshay: What's the major difference you see ?13:04
sshnaidmweshay, I don't count nodes failures in sova13:04
rlandymarios: my reproducer did just fine :(13:04
sshnaidmweshay, they don't have logs, so nothing to analyze there13:04
mariosrlandy: yeah thanks very much i got the update13:04
rlandywhich means I didn't reproduce anything13:05
rlandyweird13:05
quiquellsshnaidm, weshay: It's still good to have node_failures at dashboard-ci ?13:05
quiquellweshay: We 'skip' SKIPPED builds at dashboard-ci13:05
weshayquiquell, depends..13:05
sshnaidmweshay, and sova skip "skipped" too13:06
quiquellsshnaidm, weshay: The only difference is the NODE_FAILURE, we can ignore them if you see it appropriate13:06
sshnaidmquiquell, I think worth to have them, otherwise we'll miss it13:06
weshaysshnaidm, quiquell yes.. it's worth it.. if that is the final job state13:07
weshayquiquell, noting that paul put in a change to have zuul retry13:08
quiquellweshay: are NODE_FAILURE going to disappear in the future ?13:08
*** links has quit IRC13:09
sshnaidmquiquell, maybe it's worth to split "last jobs" to upstream and rdo-ci too13:10
sshnaidmso that we'll not be spammed with these node failures13:10
quiquellsshnaidm: Yep agree, we missed that part in the last split exercise13:11
quiquellweshay: are you ok with that ?13:11
weshayya.. last jobs should be split13:12
weshayagree13:12
quiquellweshay, sshnaidm: review coming13:12
chandankumarweshay: trown|outtypewww sshnaidm please have a look at this etherpad when you are free https://etherpad.openstack.org/p/devconfin2018 I need to prepare demo tomorrow13:15
weshaysorry my connection dropped13:17
weshaypanda|rover|lunc, rfolco|ruck you guys want/need to sync?13:17
rfolco|ruckyes... I am looking at master container build failure... want to know if this kolla error is a new one...13:18
rfolco|rucknERROR:kolla.common.utils.ec2-api:Unknown error when pushing\nTraceback (most recent call last):\n  File \"/usr/lib/python2.7/site-packages/kolla/image/build.py\", line 322, in run\n    self.push_image(image)\n  File \"/usr/lib/python2.7/site-packages/kolla/image/build.py\", line 348, in push_image13:18
rfolco|ruckhttp://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-centos-7-master-containers-build/e824a88/job-output.txt.gz13:18
sshnaidmquiquell, hmm.. if we split "check jobs" and "ci stats" too, maybe it's worth to have a different row for "rdo jobs"13:18
*** Goneri has joined #oooq13:19
quiquellsshnaidm: Doing a review for last jobs13:19
weshayrfolco|ruck, /me looking13:19
weshayrfolco|ruck, don't see errors in https://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-centos-7-master-containers-build/e824a88/logs/kolla/13:20
weshayso things built ok13:21
weshayand I can close the horizon bug13:21
sshnaidmquiquell, btw, do you have graphs for "RDO cloud performance"? for me all is empty except "nodes status" graph13:21
quiquellsshnaidm: Nope, they don't appear, I have added a todo/fixme list https://docs.google.com/document/d/1MHflTy9krTFGrZ4nL_PG4AJDlcynWVkau9Mjvnkq094/edit13:22
weshayrfolco|ruck, here's the error https://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-centos-7-master-containers-build/e824a88/logs/kolla/logs/ec2-api.log13:22
weshaypanda|rover|lunc, rfolco|ruck I'm not sure if the periodic jobs are kicking often enough13:23
weshayquiquell, are you filtering for tripleo jobs btw?13:24
weshayor all jobs13:24
weshaylooks like tripleo13:24
weshaysshnaidm, quiquell join my blue for a minute13:25
chandankumarrfolco|ruck: is this one a known issue 2018-07-31 10:52:42 | AmbiguousAuthSystem: Must provide Keystone credentials or user-defined endpoint, error was: cannot import name universaldetector13:25
quiquellweshay: Filtering by projects, all the projects that belong to tripleo13:25
chandankumarhttp://logs.openstack.org/68/584368/8/check/tripleo-ci-centos-7-undercloud-oooq/1670e29/logs/undercloud/home/zuul/undercloud_reinstall.log.txt.gz#_2018-07-31_10_52_4213:25
quiquellweshay: ack13:25
rfolco|ruckchandankumar, apparently yes13:25
quiquellweshay, sshnaidm: split last jobs https://review.rdoproject.org/r/1508913:26
*** links has joined #oooq13:26
rfolco|ruckweshay, master container build for example... 2018-07-30T22:57:50 then... 2018-07-31T04:57:12 --> 6 hours.... looks right?13:27
panda|rover|luncrfolco|ruck: there should be another at 1113:30
panda|rover|luncmore or less13:30
panda|rover|luncnot sure what's the time zone of the log server13:31
*** myoung has joined #oooq13:37
*** skramaja has quit IRC13:38
*** chuck_ has joined #oooq13:41
*** chuck_ is now known as zul13:42
weshayrfolco|ruck, ya.. so I think we need to have the job run again13:42
*** amoralej|lunch is now known as amoralej13:43
weshayrfolco|ruck, that error seems transient13:43
weshayI think13:43
weshaypanda|rover|lunc, rfolco|ruck anything else I can help w/?13:43
rfolco|ruckweshay, I did not open a new bug for container build master coz ... yes, not consistent13:43
weshayrfolco|ruck, ack.. ya that was the right thing imho .. agree w/ you13:44
myoungpanda|rover|lunc, rfolco|ruck: (optional) please update https://etherpad.openstack.org/p/tripleo-ci-squad-meeting @ L62 with any additional ruck/rover status items of note.  Promotion status and alerts have already been added.13:44
weshayrfolco|ruck, are you using http://dashboard-ci.tripleo.org/d/cEEjGFFmz/cockpit?orgId=1 ?13:45
rfolco|ruckweshay, I tried to. Realized it was a bit unstable yet13:47
sshnaidmto everybody who work with rdo cloud: https://pasteboard.co/Hx0I6sH.jpg13:48
rfolco|rucksshnaidm, node_failure is gone?13:49
weshaylolz13:49
weshayrfolco|ruck, it's worth using and fixing :)13:49
weshayrfolco|ruck, ok.. the only other things I see are13:49
weshayrfolco|ruck, rdo phase 1 queens and pike13:50
weshayplease get promotion blockers up on those two builds13:50
weshayand tripleo-ci, pike is at 3days, please check that out13:50
weshaypanda|rover|lunc, anything I can help you out w/?13:50
weshayssbarnea, ping.. want to sync up?13:51
*** vinaykns has joined #oooq13:51
quiquellsshnaidm, weshay: just skipping node_failures https://review.rdoproject.org/r/1509113:53
rfolco|ruckmyoung, do you care about promotion status for the squad mtg ?13:55
weshayquiquell, is the rdo cloud performance data updating for you?13:55
weshayI only see node status13:55
quiquellweshay: There is already a FIXME item for it13:56
quiquellweshay: Didn't have time to work this out13:56
weshayheh. k13:56
weshaynp13:56
quiquellrfolco|ruck: What are the issues you found ?13:56
myoungrfolco|ruck: for the #tripleo meeting every week we document general tripleo CI status in our etherpad (like the other squads do)13:57
myoungrfolco|ruck: i've already populated it with promotion status, and tripleo bugs with 'alert' tag.  Just meant if there's anything above/beyond that you think we should communicate from ruck standpoint, feel free to add it there.13:57
myoung#tripleo meeting starts in 120 sec (ish)13:58
rfolco|ruckmyoung, k13:58
mariosrlandy: come on timeout already http://zuul.openstack.org/stream.html?uuid=734965f901ce459f89ec4024d8a040fe&logfile=console.log13:59
marios!13:59
weshayquiquell, so for rdo ci stats total13:59
mariosrlandy: (pike job on https://review.openstack.org/#/c/583195/ should be done soon)13:59
weshayshould be success / success + failures I suppose?13:59
mariosrlandy: 'done' i mean timeout but hoping to see logs lets see14:00
rlandymarios: looking14:00
quiquellweshay: for this one ? http://dashboard-ci.tripleo.org/d/cEEjGFFmz/cockpit?orgId=1&from=1532872852639&to=1533045652639&var-launchpad_tags=alert&var-promotion_names=current-tripleo&var-promotion_names=current-tripleo-rdo&var-promotion_names=current-tripleo-rdo-testing&var-releases=master&var-releases=queens&var-releases=pike&var-releases=ocata&panelId=203&fullscreen14:00
quiquellOk have to leave for a few14:01
rlandymarios: and there we have a queens branch failure?14:01
*** panda|rover|lunc is now known as panda|rover14:01
*** quiquell is now known as quiquell|bbl14:01
weshayquiquell|bbl, aye14:01
mariosrlandy: no the queens one is green at http://zuul.openstack.org/ tripleo-ci-centos-7-containers-multinode-queens success14:02
rlandyok - that failure was an image issue14:03
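The "success / success + failures" total proposed above for the RDO CI stats, together with the earlier question of whether SKIPPED and NODE_FAILURE builds should count, can be sketched as below. This is a hedged illustration and not the dashboard-ci code: `pass_rate` is a hypothetical helper, and it assumes each build is a dict with a `result` key holding values such as SUCCESS, FAILURE, NODE_FAILURE or SKIPPED, as the builds JSON discussed earlier appears to provide.

```python
from collections import Counter


def pass_rate(builds, ignored=("SKIPPED",)):
    """Return success / (success + failures) over builds, or None if nothing counts.

    Any result other than SUCCESS that is not in `ignored` counts as a failure,
    so NODE_FAILURE is counted unless the caller ignores it explicitly.
    """
    results = Counter(b.get("result") for b in builds)
    # Builds with no result yet (still running) are left out of the ratio.
    counted = {r: n for r, n in results.items() if r is not None and r not in ignored}
    total = sum(counted.values())
    if total == 0:
        return None
    return counted.get("SUCCESS", 0) / total
```

Passing `ignored=("SKIPPED", "NODE_FAILURE")` gives the variant where node failures are left out of the ratio, which makes the effect of that choice on the reported percentage easy to compare.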
*** bogdando has quit IRC14:11
mariospanda|rover: looks like your logs patch works \o/14:13
mariospanda|rover: added comment https://review.openstack.org/#/c/587103/414:13
panda|rover\o/14:16
weshay<mwhahaha> the healthy bits are captured in the docker info log so we can probably check logstash14:18
weshayre: health checks..14:18
weshaypanda|rover, can I get you to look into that please?14:18
*** quiquell|bbl is now known as quiquell14:18
quiquellI am back14:18
panda|roverweshay: what do you want me to look ? if we have the docker health check logs in logstash ?14:21
weshaypanda|rover, I don't think we do, I'd like them added.. also if possible to send that data to https://review.rdoproject.org/grafana/dashboard/db/tripleo-ci?orgId=114:23
ssbarneasshnaidm: please recast vote on https://review.openstack.org/#/c/587384/ i lost it when I updated the message.14:24
*** holser_ has joined #oooq14:24
weshayrlandy, oy.. we have to duplicate all playbooks and templates?14:24
weshayrlandy, /me added paul to the review14:24
rlandyweshay: it looks that way to me - I was hoping panda|rover or rfolco|ruck would say I could do something else14:24
weshay:(14:25
rlandyweshay: I thought it would pick up a base test from tripleo14:25
weshayrlandy, it's worth double checking w/ pb14:25
rlandydup'ed that as well :(14:25
panda|roveras far as I know, roles can be imported, playbooks must be copied14:25
rlandyweshay: absolutely - will add him14:25
panda|roverand I remember Paul first saying it.14:25
weshaypanda|rover, k.. panda|rover so maybe we get the templates into a role?14:25
quiquellweshay, sshnaidm: the NODE_FAILURE graph https://review.rdoproject.org/r/1509214:26
quiquelldrop now, read you tomorrow14:26
weshaypanda|rover, or that doesn't matter so much as it's temporary?14:26
*** quiquell is now known as quiquell|off14:26
weshayquiquell, thanks14:26
rfolco|ruckyes, playbooks have to belong to the same repo where the job is defined14:26
sshnaidmssbarnea, you need to use "Closes-Bug: #111111" in commit message14:26
openstackbug 111111 in tepache (Ubuntu) "Tepache doesn't create a working code" [Undecided,Confirmed] https://launchpad.net/bugs/11111114:26
sshnaidmssbarnea, not "fixes lp"14:26
panda|roverlol14:26
panda|roverssbarnea: and please fix that nasty Tepache bug.14:27
sshnaidmbug 000000114:27
openstackbug 1 in Ubuntu Malaysia LoCo Team "Microsoft has a majority market share" [Critical,In progress] https://launchpad.net/bugs/1 - Assigned to MFauzilkamil Zainuddin (apogee)14:27
sshnaidmheh14:27
sshnaidmmaybe we can close this one ^14:27
panda|roverWONTFIX14:28
weshaylolz14:28
weshayI love that bug14:28
weshayssbarnea, ya.. please read through that openstack doc on commit messages14:28
ssbarneasshnaidm: I copied this text from our documentation... but I just realised that I copied the "bad example".... bad idea to document the how not to do it before the correct one.14:28
weshaythat I sent yesterday.. it will save you time in the future14:28
ssbarneaI read, ..... but not the entire document.14:29
weshayya. it's big :)14:29
ssbarneais not that, i find the idea of putting BAD example before correct ones really,.... unfortunate.14:29
sshnaidmssbarnea, https://docs.openstack.org/infra/manual/developers.html14:29
panda|roverweshay: can we get any design on the dashboard world ? The number, what they do ? these seem always injected out of sprint from individual contributions. We have currently three different dashboards in three different places.14:30
ssbarneaI did google, find the doc, search in it, found text and copied and replaced bug number,... can you blame me for not reading line by line,... maybe a little bit ;)14:30
weshayssbarnea, there are only a few important things to know14:30
weshayssbarnea, I'll cover them w/ you in a minute14:31
ssbarneafunny part is that gerrit did recognize the bug and linked it.14:31
sshnaidmpanda|rover, three?14:31
sshnaidmssbarnea, but not sure it will close LP automatically with that message, should be "Closes" in it14:31
ssbarneai will not touch it now, but I updated my note to use correct syntax.14:32
sshnaidmssbarnea, if it doesn't solve bug completely, you can use "Related-Bug: #222222"14:32
openstackbug 222222 in linux (Ubuntu) "Sony VAIO VGN-SZ430N and other models; Stamina mode doesn't let Ubuntu boot up" [Undecided,Invalid] https://launchpad.net/bugs/22222214:32
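The footer syntax quoted above ("Closes-Bug: #NNN", or "Related-Bug: #NNN" for a partial fix) can be checked mechanically before pushing a review. The following is a small sketch under that assumption; `bug_footers` is a hypothetical helper and it only recognises the two tags mentioned in this discussion, not every tag the OpenStack tooling may accept.

```python
import re

# Footer tags as quoted in the discussion: "Closes-Bug: #NNN" and "Related-Bug: #NNN".
BUG_FOOTER = re.compile(r"^(Closes-Bug|Related-Bug): #(\d+)$", re.MULTILINE)


def bug_footers(commit_message):
    """Return (tag, bug_number) pairs for well-formed bug footers in the message."""
    return [(tag, int(num)) for tag, num in BUG_FOOTER.findall(commit_message)]
```

A message that only says "fixes lp 1784608" yields an empty list, which is the case flagged above as one that may not close the Launchpad bug automatically.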
panda|roversshnaidm: https://review.rdoproject.org/grafana/dashboard/db/tripleo-ci?orgId=1 , http://dashboard-ci.tripleo.org/d/cEEjGFFmz/cockpit?orgId=1, http://cistatus.tripleo.org/14:34
sshnaidmpanda|rover, also http://grafana.openstack.org/14:34
weshayrasca, what's the bug # again?14:36
rascaweshay, looking for it sec14:37
rascaweshay, myoung, there https://bugs.launchpad.net/tripleo/+bug/177280714:38
openstackLaunchpad bug 1772807 in tripleo "default containerized undercloud install with local CA fails with "Error org.freedesktop.DBus.Error.TimedOut"" [High,Triaged]14:38
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/567224, stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, tripleo-ci-centos-7-scenario009-multinode-oooq, legacy-tripleo-ci-centos-7-container-to-container- (1 more message)14:39
*** links has quit IRC14:41
rfolco|rucksshnaidm, --requirements rings a bell? https://ci.centos.org/job/tripleo-quickstart-promote-pike-rdo_trunk-minimal/168/console14:42
sshnaidmrfolco|ruck, yeah. ssbarnea has a fixing patch14:42
rfolco|rucksshnaidm, I remember seeing something, thought was you14:43
rfolco|ruckthx14:43
sshnaidmrfolco|ruck, https://review.openstack.org/#/c/587384/14:43
sshnaidmrfolco|ruck, yeah, I'm the one who broke it :D14:43
ssbarneasshnaidm:  it still needs Verified label to pass: https://review.openstack.org/#/c/587384/14:44
rfolco|rucksshnaidm, I don't know but I feel good hearing that after breaking gate...14:44
rfolco|ruck:)14:44
sshnaidmrfolco|ruck, everybody does it, no worries :)14:44
chandankumarsshnaidm: weshay this https://review.openstack.org/#/c/580384/ and this https://review.openstack.org/#/c/577780/514:45
*** ccamacho has quit IRC14:48
arxcruzsshnaidm: thanks for the review, can you please also review https://review.openstack.org/#/c/577780/ ?14:49
arxcruz:D14:49
arxcruzit's working now14:49
sshnaidmchandankumar, why undercloud, and not overcloud? https://review.openstack.org/#/c/57778014:50
sshnaidmarxcruz, ^^14:50
*** tcw has quit IRC14:50
arxcruzsshnaidm: because tempest runs in undercloud14:50
sshnaidmcan't we run tempest container when overcloud is containerized?14:50
*** tcw has joined #oooq14:51
arxcruzsshnaidm: the run itself doesn't matter if overcloud is containerized or not14:51
arxcruzonly undercloud14:51
arxcruzsshnaidm: for tempest it's transparent, everything is tested via the API14:51
sshnaidmarxcruz, ok14:52
sshnaidmarxcruz, does containerized_undercloud have a default value?14:53
arxcruzsshnaidm: to false iirc14:53
arxcruzsshnaidm: sorry, i don't know14:53
sshnaidmarxcruz, worth checking14:54
sshnaidmarxcruz, also I'd prefer this tempest config to always be at the end of the file, so that people won't add undercloud_containerized settings *after* the tempest config14:54
arxcruzsshnaidm: i added closer to other tempest stuff14:54
sshnaidmarxcruz, worth having it at the end imho14:54
arxcruzokay, i'll update the files14:54
sshnaidmarxcruz, if somebody adds undercloud containers settings after it, these settings won't work14:54
*** jfrancoa has quit IRC14:55
arxcruzsshnaidm: fair enough14:55
rfolco|ruckpanda|rover, can we create an etherpad with thoughts/suggestions/ideas/issues for cockpit ?14:57
panda|roverrfolco|ruck: https://trello.com/c/dyvGWJps/890-dashboards-maintenance14:57
rfolco|ruckok will update there14:58
panda|roverthis way this will officially take time off the sprint, but if quiquell|off needs to address the fixes there will be less time for the sprint, and this must be taken into consideration14:58
*** jfrancoa has joined #oooq14:58
*** links has joined #oooq15:00
*** holser_ has quit IRC15:03
mariosrlandy: i think this is the change weshay was just referring to about the changes being applied from soren https://review.openstack.org/#/c/582963/15:03
weshayssbarnea, sorry you avail?15:06
weshaynow15:06
rascarlandy, https://bluejeans.com/957911389015:06
weshayhttps://bluejeans.com/411356779815:06
ssbarneayep15:06
*** holser_ has joined #oooq15:09
*** links has quit IRC15:12
rlandyweshay: rfolco|ruck: panda|rover: meeting with paul this afternoon re: zuul v3 in rdocloud15:18
panda|roverrlandy: when ?15:19
rfolco|ruckrlandy, can I join ?15:19
rlandypanda|rover: not sure - will ping him later - and collect you all15:19
rlandyrfolco|ruck: of course15:19
panda|roverrlandy: collect us before time out, or post-run will fail.15:20
rlandypanda|rover: ^^???15:20
panda|roverrlandy: a lousy joke, nevermind.15:21
rlandypanda|rover: ok - then I get it15:21
rlandywhen is timeout?15:21
* rfolco|ruck understands this flavor of jokes15:21
panda|roverrlandy: for me is in ~2 hours15:22
rlandypanda|rover: will be after your timeout15:22
rlandyunderstand if you can't make it15:22
chandankumarweshay: sshnaidm I have moved the content for devconf demo to here https://docs.google.com/document/d/1PAhNsxf30m5ZRR--CX3GqB-FsDUj8M5oyedB5rLFaVc/edit15:23
*** udesale has quit IRC15:23
panda|roverrlandy: ok ping me anyway even if I'm off, I'll join if I can15:24
rfolco|ruckpanda|rover, added promotion-blocker tag to https://bugs.launchpad.net/tripleo/+bug/178460815:29
openstackLaunchpad bug 1784608 in tripleo "quickstart.sh: ERROR: unknown option: --requirements" [Critical,Fix released] - Assigned to Sorin Sbarnea (ssbarnea)15:29
rfolco|ruckcoz you know... blocks promotion15:29
panda|roverrfolco|ruck: which jobs does it block ? I don't remember which jobs still use quickstart.sh15:29
rfolco|ruckhttps://ci.centos.org/job/tripleo-quickstart-promote-queens-rdo_trunk-minimal/119/console15:30
rfolco|ruck"rdo_trunk-promote-queens-current-tripleo"15:30
panda|roverrfolco|ruck: phase1 promotions15:30
rfolco|ruckyes15:30
panda|rovermmmhh15:30
rfolco|ruckstill applies right ?15:30
panda|roverI'm not sure if tags apply to anything other than the tripleo-ci promotion15:31
panda|roverI've never used it for this15:31
panda|roverrfolco|ruck: keep it for now15:31
rfolco|ruckok, if you don't add, it won't show up in cockpit for example15:32
rfolco|ruckin the alerts15:32
rfolco|ruckand promotion-blockers15:32
*** holser_ has quit IRC15:33
panda|rover...15:33
*** sshnaidm is now known as sshnaidm|off15:36
*** links has joined #oooq15:49
*** links has quit IRC15:51
*** links has joined #oooq15:52
*** links has quit IRC15:55
*** links has joined #oooq15:55
*** links has quit IRC15:58
*** zoli is now known as zoli|gone15:58
*** links has joined #oooq15:58
*** zoli|gone is now known as zoli15:58
*** links has quit IRC16:01
*** links has joined #oooq16:01
*** jfrancoa has quit IRC16:07
*** marios is now known as marios|gonehome16:08
*** marios|gonehome is now known as marios16:09
*** links has quit IRC16:10
weshayrfolco|ruck, ping16:18
weshayrfolco|ruck, holla at me when you have time16:19
*** links has joined #oooq16:21
weshayrfolco|ruck, /me kicking jobs on https://ci.centos.org/view/rdo/view/promotion-pipeline/job/rdo_trunk-promote-queens-current-tripleo/16:22
weshayqueens phase 1 is at 2 days16:22
rfolco|ruckweshay, lunch... will ping you asap16:23
weshayhttps://ci.centos.org/view/rdo/view/promotion-pipeline/job/rdo_trunk-promote-pike-current-tripleo/200/16:23
weshayoh16:23
weshay08:51:01 ERROR: unknown option: --requirements16:23
weshayssbarnea, fixed this16:23
rfolco|ruckweshay, yep, was waiting this to merge16:24
*** links has quit IRC16:24
weshayrfolco|ruck, ok.. that should have a bug.. for future notice16:24
weshayand marked promotion blocker16:24
*** myoung is now known as myoung|lunch16:24
*** links has joined #oooq16:25
rfolco|ruckweshay, it has16:25
rfolco|ruckhttps://review.rdoproject.org/etherpad/p/ruckrover-sprint1716:25
rfolco|rucktags:added: promotion-blocker16:26
weshayrfolco|ruck, what line?16:26
rfolco|ruck3416:26
*** links has quit IRC16:27
weshayrfolco|ruck, thanks16:27
weshay:)16:27
rfolco|ruckyw16:27
*** links has joined #oooq16:28
weshayrfolco|ruck, didn't see it on http://dashboard-ci.tripleo.org/d/cEEjGFFmz/cockpit16:28
weshayunder Promotion Blockers16:28
rfolco|ruckweshay, coz it did not have the tag one hour ago16:28
weshaynow I do16:28
weshayyes16:28
rfolco|ruckweshay, gave some suggestions for the main tab https://trello.com/c/dyvGWJps/890-dashboards-maintenance16:29
*** links has quit IRC16:30
rfolco|ruckbig charts just to show zuul queue is overkill16:30
rfolco|ruckIMHO16:30
*** links has joined #oooq16:31
rfolco|ruckalerts should show at the top, it gives overall status of ci16:31
panda|roverrfolco|ruck: anything critical to pass to me before I go ?16:31
rfolco|ruckpanda|rover, no... I did not see any progress on the tempest bug for 3-node job16:32
rfolco|ruckhttps://bugs.launchpad.net/tripleo/+bug/178401716:32
openstackLaunchpad bug 1784017 in tripleo "TestNetworkBasicOps.test_network_basic_ops failures" [Critical,Triaged]16:32
weshayrfolco|ruck, it's a pretty good indicator of health imho16:33
rfolco|rucknon-voting, but we need to fix16:33
weshaythe big pie graphs?16:33
*** links has quit IRC16:33
rfolco|ruckdoesn't need to be big like that16:33
*** links has joined #oooq16:33
rfolco|ruck2.5 hours queue and a small graph is enough16:33
rfolco|ruckanyways, this is my perspective16:34
weshayk k16:34
weshayrfolco|ruck, which jobs does this affect? https://bugs.launchpad.net/tripleo/+bug/178401716:34
openstackLaunchpad bug 1784017 in tripleo "TestNetworkBasicOps.test_network_basic_ops failures" [Critical,Triaged]16:34
weshayrfolco|ruck, can you run that through an elastic recheck query16:34
rfolco|rucktripleo-ci-centos-7-3nodes-multinode16:35
rfolco|ruckfailing in check every time16:35
*** links has quit IRC16:35
rfolco|ruckit's non-voting but annoying16:36
*** links has joined #oooq16:36
chandankumarrfolco|ruck: regarding above bug reason is here http://logs.openstack.org/28/585528/8/check/tripleo-ci-centos-7-3nodes-multinode/cd65ccf/logs/subnode-3/var/log/extra/errors.txt.gz#_2018-07-27_09_21_09_02216:37
weshayrlandy, ping me when you chat w/ paul please16:38
weshaymy afternoon is clear16:38
*** links has quit IRC16:38
rlandyweshay:ack16:39
*** links has joined #oooq16:39
hubbotFAILING CHECK JOBS on stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, tripleo-quickstart-extras-gate-newton-delorean-full-minimal, tripleo-ci-centos-7-scenario009-multinode-oooq, legacy-tripleo-ci-centos-7-container-to-container-upgrades-master, legacy-tripleo-ci-centos-7-ovb-3ctlr_1comp- (1 more message)16:39
chandankumarweshay: which script generates extra/errors.txt file?16:39
rfolco|ruckweshay, we changed from qemu to kvm recently16:39
rfolco|ruckchandankumar, ^16:39
weshayrfolco|ruck, correct..16:41
weshaychandankumar, it's in collect logs16:41
*** links has quit IRC16:41
weshaychandankumar, oh interesting16:42
*** links has joined #oooq16:42
weshaywth.. does that show up on other jobs?16:42
chandankumarweshay: if that is the case, we can move collect-logs to be used in all zuulv3 devstack jobs16:42
chandankumarthat would be very interesting16:43
*** sshnaidm|off has quit IRC16:43
weshaychandankumar, not sure what you are saying in that latest comment16:43
weshaychandankumar, I am planning on breaking out the collect-logs role16:43
weshaychandankumar, however it's just one simple command that creates the errors.txt file16:44
weshayif you are looking for that16:44
weshayI can show.. if you want to add it to devstack16:44
weshaychandankumar, need to thank Sagi for the idea though :)16:44
*** links has quit IRC16:44
chandankumarweshay: I mean, in tripleo we use collect-logs to gather all errors in one place on each node; if we can add it to all devstack jobs it will be much easier for other people16:44
weshaychandankumar, sure16:45
weshayyou want the code snip?16:45
*** links has joined #oooq16:45
chandankumarweshay:yup16:45
weshaychandankumar, https://github.com/openstack/tripleo-quickstart-extras/blob/master/roles/collect-logs/tasks/collect.yml#L25016:46
weshaychandankumar, then it's added to logstash16:46
weshaytoo16:46
weshaypretty nice16:46
chandankumarweshay: i will take a look16:47
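The idea of gathering every error line into a single errors.txt can be sketched as follows. This is a minimal sketch and not the command used in the collect-logs role: the match pattern, the function name and the sample log file are all assumptions made for illustration.

```python
import os
import re
import tempfile

# Assumed notion of an "error line"; the real collect-logs command may match differently.
ERROR_PATTERN = re.compile(r"\b(ERROR|FATAL|Traceback)\b")


def collect_errors(log_dir, out_name="errors.txt"):
    """Scan every file under log_dir and write matching lines to log_dir/out_name."""
    out_path = os.path.join(log_dir, out_name)
    hits = []
    for root, _dirs, files in os.walk(log_dir):
        for name in sorted(files):
            path = os.path.join(root, name)
            if path == out_path:
                continue  # do not re-scan a previous summary
            with open(path, errors="replace") as handle:
                for line in handle:
                    if ERROR_PATTERN.search(line):
                        rel = os.path.relpath(path, log_dir)
                        hits.append(f"{rel}: {line.rstrip()}")
    with open(out_path, "w") as out:
        out.write("".join(hit + "\n" for hit in hits))
    return hits


# Hypothetical log content, loosely modelled on the message quoted later in the chat.
with tempfile.TemporaryDirectory() as log_dir:
    with open(os.path.join(log_dir, "nova.log"), "w") as log:
        log.write("INFO all good\nERROR could not find capabilities for domaintype=kvm\n")
    print(collect_errors(log_dir))
```

Keeping the source file name in front of each line makes the summary searchable when it is later indexed.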
*** links has quit IRC16:48
*** amoralej is now known as amoralej|off16:48
*** links has joined #oooq16:48
*** panda|rover is now known as panda|rover|off16:49
*** links has quit IRC16:51
weshaychandankumar, rfolco|ruck http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22could%20not%20find%20capabilities%20for%20domaintype%5C%5C%3Dkvm%5C%22%20AND%20build_status%3AFAILURE16:51
*** links has joined #oooq16:51
weshaychandankumar, rfolco|ruck really odd.. just the 3node and scenario008 job16:52
weshayrfolco|ruck, maybe we're not setting up the right packages on all the nodes in 3node16:53
weshaymarios, ^  is there some diff w/ regards to the package prep in --bootstrap w/ 3node?16:53
rfolco|ruckweshay, I was trying to find nova virt_type for the sub-nodes.... perhaps it uses qemu default for one subnode only ??16:53
rfolco|rucknot finding it in etc16:53
*** links has quit IRC16:54
*** links has joined #oooq16:54
*** links has quit IRC16:56
*** links has joined #oooq16:57
weshayrfolco|ruck, this appears to run on all three http://logs.openstack.org/57/586057/10/check/tripleo-ci-centos-7-3nodes-multinode/a25e19b/logs/undercloud/var/log/bootstrap-subnodes.log.txt.gz16:57
rfolco|ruckweshay, why we don't save nova config under etc/docker/nova ?16:59
weshayask in #tripleo17:00
*** links has quit IRC17:00
weshaythanks chandankumar17:00
weshaychandankumar++17:00
hubbotweshay: chandankumar's karma is now 517:00
chandankumarweshay: sorry what have i done17:00
weshaychandankumar, found that kvm issue17:01
chandankumarweshay: you know Aziza and Sachin from Pune?17:01
weshaychandankumar, ya.. I love them :)17:01
*** links has joined #oooq17:01
weshaysay hi for me17:01
chandankumarweshay: Aziza used to report to you and you hired Sachin17:01
chandankumarweshay: will say it tomorrow17:01
weshaychandankumar, ya. neither really reported to me, I was just the team lead17:01
weshaythose were fun days17:02
weshayeasier than openstack17:02
chandankumarweshay: and I came to know your one's daughter name is Devvi :-)17:02
chandankumar*Devi17:02
weshaychandankumar, ha.. yes!17:02
weshayindian name :)17:02
chandankumarweshay: yes, the name of one of the goddesses :-)17:02
weshayyup17:03
chandankumarweshay: it is quite a small world :-)17:03
weshayheh.. small company17:03
weshayor it was17:03
*** tesseract has quit IRC17:03
*** links has quit IRC17:03
weshaychandankumar, ya.. tell Aziza and Sachin I miss them both :)17:03
chandankumarweshay: sure17:03
*** links has joined #oooq17:04
weshaychandankumar, you and I need to chat about the next few months of tempest work17:04
weshaywhen you have time17:04
weshayrfolco|ruck, thanks for logging that bug17:04
chandankumarweshay: Can we talk tomorrow ?17:05
weshaychandankumar, sure17:05
chandankumarweshay: let me know when you are free, I will schedule a meeting17:05
*** links has quit IRC17:06
*** links has joined #oooq17:07
chandankumarweshay: rfolco|ruck https://review.openstack.org/#/c/570892/ and https://review.openstack.org/570884 can we merge this?17:07
weshayrfolco|ruck, SUCCESS https://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-centos-7-master-containers-build/db7e819/17:09
weshaychandankumar, /me looks17:09
rfolco|ruckchandankumar, will review it asap17:09
*** links has quit IRC17:09
rfolco|ruckweshay, \o/17:09
*** links has joined #oooq17:10
weshaychandankumar, why use the diff parent? https://review.openstack.org/#/c/570892/917:11
*** links has quit IRC17:12
chandankumarweshay: i will fix that17:12
*** links has joined #oooq17:12
chandankumaris someone looking at this job https://ci.centos.org/job/tripleo-quickstart-extras-gate-newton-delorean-full-minimal/6760/17:13
chandankumarit always fails17:13
*** links has quit IRC17:15
*** links has joined #oooq17:15
*** links has quit IRC17:18
*** links has joined #oooq17:18
*** links has quit IRC17:21
*** links has joined #oooq17:21
*** links has quit IRC17:24
*** links has joined #oooq17:24
*** links has quit IRC17:27
*** links has joined #oooq17:27
rfolco|ruckchandankumar, this has been fixed by https://bugs.launchpad.net/tripleo/+bug/178460817:28
openstackLaunchpad bug 1784608 in tripleo "quickstart.sh: ERROR: unknown option: --requirements" [Critical,Fix released] - Assigned to Sorin Sbarnea (ssbarnea)17:28
*** links has quit IRC17:30
*** links has joined #oooq17:30
*** links has quit IRC17:33
*** links has joined #oooq17:33
rfolco|ruckweshay, do you think this should be kvm instead ? https://github.com/openstack/tripleo-quickstart-extras/blob/ee03ae932a012f1eeede89c54248322f8538eab8/roles/overcloud-deploy/files/hardware_environments/virt/hw_settings.yml#L317:34
*** myoung|lunch is now known as myoung17:34
weshayrfolco|ruck, ya17:34
rfolco|ruckweshay, will make a quick test ok ?17:35
rfolco|ruckrun 3-node job only with kvm fix17:35
*** links has quit IRC17:36
*** links has joined #oooq17:37
weshayrfolco|ruck, I think the issue is that the rpms required for kvm are not installed on all the nodes in 3node17:37
weshaybut maybe I'm wrong17:37
rfolco|ruckweshay, qemu-kvm should be enough afaik17:38
*** vinaykns has quit IRC17:38
*** links has quit IRC17:39
*** links has joined #oooq17:40
*** links has quit IRC17:42
*** links has joined #oooq17:42
*** links has quit IRC17:45
*** links has joined #oooq17:46
*** florianf has quit IRC17:46
*** links has quit IRC17:52
*** florianf has joined #oooq18:01
rlandyrasca: ping18:05
rlandyrasca: 3ctlr_1comp or 1?18:05
weshayrlandy, 318:05
rlandyI'm going with 318:05
weshay:)18:05
rlandyobject on the patch pls18:05
rlandyif I did the wrong thing18:05
rlandyhack, hack, hack ....18:06
rlandyagopi: rook: weshay: any issue with merging this review? https://review.openstack.org/#/c/583717/18:07
rlandyI can remove the depends on18:07
rlandyand we can make a test patch for triggering18:08
rlandynot a blocker - just asking18:08
agopirook patched it up rlandy, waiting for it finish running in our CI18:09
rlandyagopi: ack - no rush18:10
agopishould be good to go by tomorrow18:10
rlandynice18:10
rlandythanks18:10
agopirlandy++18:10
hubbotagopi: rlandy's karma is now 1818:10
rlandyweshay: ^^18:10
weshayk18:11
*** holser_ has joined #oooq18:12
rookrlandy: yeah we hit a couple of snags... but should be good after some patches from today.18:18
*** vinaykns has joined #oooq18:27
*** holser_ has quit IRC18:33
*** jaganathan has quit IRC18:36
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/567224, stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, tripleo-quickstart-extras-gate-newton-delorean-full-minimal, tripleo-ci-centos-7-scenario009-multinode- (1 more message)18:39
rfolco|ruckrlandy, keep me in the loop, I can cover you on this work on your PTO18:51
rfolco|ruckif you want, of course18:51
rlandyrfolco|ruck++18:52
hubbotrlandy: rfolco|ruck's karma is now 118:52
rlandyrfolco|ruck: on the upside, it might get you out of ruck/rover shift :)18:54
rfolco|ruckrlandy, shhhh walls have ears18:55
rfolco|ruck:)18:55
rlandyrfolco|ruck: I plan for everything to go wrong while I am hiking some glacier with no internet access18:56
rfolco|ruckrlandy, haha18:56
myoungrlandy: are you headed to glaciers?18:59
rlandymyoung: yep - next week18:59
myoungcoooooooollll18:59
myoungwhere?18:59
myoung(north obviously)18:59
rlandyAlaska18:59
myoungwowzers!18:59
myoungi got to fly over one a few weeks ago.  amazing.19:00
myoungI suspect hiking one will be more of a workout :)19:00
rlandywell, the glacier hike is our last day - so we may be less brave by then19:01
rlandyhttps://review.openstack.org/587603 WIP: DNM: Enable upstream testing of tripleo-ha-utils19:03
rlandyrasca: ^^19:03
rfolco|ruckweshay, I think my theory has good chances of being correct: overcloud deploy uses --libvirt_type kvm when deploying nodes but for tempest what counts is what nova.conf has, should be virt_type = kvm19:12
rfolco|ruckif nothing there, uses qemu as default19:12
rfolco|ruckrlandy, wow I thought you were kidding... enjoy your hiking / skiing in Alaska19:19
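One way to check the theory above is to read the setting straight from a collected nova.conf. The sketch below assumes the option lives in a [libvirt] section under the name virt_type; the function name and the sample config text are hypothetical. It only reports whether the option is set and makes no claim about which default Nova applies when it is missing.

```python
import configparser


def libvirt_virt_type(nova_conf_text):
    """Return the virt_type value from the [libvirt] section, or None if it is not set."""
    # interpolation=None keeps '%' characters in values from being interpreted;
    # strict=False tolerates duplicated sections or options in generated configs.
    parser = configparser.ConfigParser(interpolation=None, strict=False)
    parser.read_string(nova_conf_text)
    return parser.get("libvirt", "virt_type", fallback=None)


# Hypothetical config fragments for illustration.
with_kvm = "[DEFAULT]\ndebug = true\n\n[libvirt]\nvirt_type = kvm\n"
without_setting = "[DEFAULT]\ndebug = true\n"

print(libvirt_virt_type(with_kvm))
print(libvirt_virt_type(without_setting))
```

Running it against the nova.conf from a failing and a passing job would show whether the two actually differ on this option.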
weshayrfolco|ruck++19:19
hubbotweshay: rfolco|ruck's karma is now 219:19
weshayya..19:19
weshayrfolco|ruck, so is that read in from hw_env?19:19
weshayI was looking for that19:19
rfolco|ruckweshay, still looking... I think I'll try to add etc/nova logs for debugging19:20
rfolco|ruckweshay, if i am reading your mind, you thinking "don't go too deep"... ok will add my comments to the bug19:22
weshayrfolco|ruck, well.. I like where you going w/ this.. tbh however keep in mind we have to get master going19:24
weshayrfolco|ruck, https://review.rdoproject.org/zuul/status.htm19:24
weshaylooks like fs001 and 35 failed19:24
weshayrfolco|ruck, I'm rekicking pike/queens rdo p1 jobs quickstart.sh should be fixed19:25
rfolco|ruckweshay, thx for doing this19:25
rfolco|ruckweshay, will look fs001 and 3519:25
rlandyrfolco|ruck: wrt your comments on https://review.openstack.org/#/c/587228/2/playbooks/tripleo-ci/templates/toci_gate_test.sh.j2@22919:44
rlandy^^ I am nit sure19:44
rlandynot19:44
rlandyupgrades still has this included19:44
rfolco|ruckI think upgrades job has been added before we moved to zuulv3 with required-projects and etc19:45
rfolco|ruckif zuul already does that, why you have to manually gate it ?19:46
rlandyrfolco|ruck: afaict from before, the changes were not tested without it19:46
rlandyirrespective of the fact that the local repo is there19:46
rlandythe roles need to be copied correctly19:46
rfolco|ruckrlandy, probably browbeat is expecting to be run from there instead of the zuul src place19:47
rlandyrfolco|ruck: how would we fix that?19:47
rfolco|ruckI might be wrong, would need to understand how browbeat runs and change its call to the new workspace where we copy browbeat that zuul clones19:48
rfolco|ruckrlandy, please paste me links again for the patches or merged code that runs browbeat and I'll make more accurate comments there19:49
rlandyall roles and playbooks etc, are copied via the setup.cfg19:49
rlandyrfolco|ruck: should be the same as extras19:49
* rlandy will paste in a bit19:49
rfolco|ruckthx19:49
rfolco|ruckbrb19:49
rlandyjust setting up a reproducer for rasca's work19:49
weshayrfolco|ruck, this probably infra https://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master/0ec0f44/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz#_2018-07-31_19_03_3019:52
weshayrlandy, have you seen this?20:03
weshayhttps://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset002-master-upload/80c01cd/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz#_2018-07-31_19_01_0720:03
weshayhttps://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset035-master/fa46166/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz#_2018-07-31_19_02_5620:04
* rlandy looks20:05
rlandyno - did something change?20:05
rlandyovercloud deploy20:06
rlandyoh20:06
rlandyweshay: this only in periodic?20:06
rlandyso ...  resources.StorageSubnet.properties.allocation_pools[0].start: "172.18.0.10" does not validate ip_addr (constraint not found) comes from network_environment20:07
rlandyhttps://github.com/openstack/tripleo-heat-templates/blob/master/ci/environments/network/multiple-nics/network-environment.yaml#L1720:08
rlandyweshay: ^^20:08
rlandyline 15 actually20:09
weshayrlandy, ya.. it's across multiple jobs20:09
weshayand ipv620:09
rlandyso that's the complaint20:09
weshayhttps://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset035-master/fa46166/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz#_2018-07-31_19_02_5620:09
weshayhttps://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master/0ec0f44/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz#_2018-07-31_19_03_3020:10
rlandy  state chHeat Stack create failed.20:13
rlandychHeat?20:13
rlandyok - so it's not any particular subnet20:14
rlandyit's the verification thereof20:14
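The reading above, that the value is fine and the validation step is what fails, can be supported with a quick standalone check. This sketch uses Python's standard ipaddress module and is not Heat's ip_addr constraint; the helper name is an assumption.

```python
import ipaddress


def is_ip_addr(value):
    """Return True if value parses as an IPv4 or IPv6 address."""
    try:
        ipaddress.ip_address(value)
    except ValueError:
        return False
    return True


# The address quoted in the error message from the deploy log.
print(is_ip_addr("172.18.0.10"))
```

Since the address itself parses, the "constraint not found" part of the message is the more likely lead.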
rfolco|ruckweshay, tempest tests run nested virt, so kvm-intel module must be present here http://logs.openstack.org/28/585528/8/check/tripleo-ci-centos-7-3nodes-multinode/cd65ccf/logs/undercloud/var/log/extra/lsmod.txt.gz20:30
weshayrfolco|ruck, k20:30
weshayrfolco|ruck, can you check if it's present in a job that is passing that test20:31
rfolco|ruckI got the undercloud lsmod, tempest runs from there... I guess20:32
rfolco|ruckor should check subnode (ctrller)20:32
rfolco|ruckweshay, yes! http://logs.openstack.org/55/587155/1/check/tripleo-ci-centos-7-3nodes-multinode/dff7051/logs/subnode-2/var/log/extra/lsmod.txt.gz20:35
weshayrfolco|ruck, and that's from a working job?20:36
rfolco|ruckexactly, working job has kvm-intel module20:36
rfolco|ruckhttps://docs.openstack.org/devstack/latest/guides/devstack-with-nested-kvm.html20:36
rfolco|ruckneed to check why 1st level vms are not loading it in 80% of the jobs... perhaps this carries from host20:37
*** zul has quit IRC20:37
*** zul has joined #oooq20:38
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/567224, stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, tripleo-quickstart-extras-gate-newton-delorean-full-minimal, tripleo-ci-centos-7-scenario009-multinode- (1 more message)20:39
rfolco|ruckweshay, working job ran on a  Intel Core Processor (Haswell, no TSX)20:39
rfolco|ruckfailed one on Intel Xeon E312xx (Sandy Bridge)20:40
weshayah20:40
rfolco|ruckweshay, https://bugs.launchpad.net/tripleo/+bug/1784017/comments/720:47
openstackLaunchpad bug 1784017 in tripleo "Build of instance was re-scheduled: invalid argument: could not find capabilities for domaintype=kvm" [Critical,In progress] - Assigned to Rafael Folco (rafaelfolco)20:47
weshayhrm20:49
rfolco|ruckbelieve me nested virt runs 30x slower where I come from, this won't make our jobs much faster I believe20:49
weshayrfolco|ruck, it would be better if we could be smarter about when we add that setting20:49
rfolco|ruckthats why devstack kept qemu as default20:49
rfolco|ruckweshay, enable kvm when module kvm-intel is present... this ?20:50
weshayya.. something along those lines20:50
*** vinaykns has quit IRC20:52
weshayrfolco|ruck, what ya think?20:56
weshaydo-able?20:56
weshayshould be right?20:56
rfolco|ruckweshay, yes, think so, trying to implement20:56
weshayrfolco|ruck, thanks man!20:56
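The "enable kvm when the module is present" idea could look roughly like this. It is a sketch under assumptions: it parses lsmod-style text such as the lsmod.txt files collected by the jobs, lsmod prints the module as kvm_intel (with an underscore), and accepting kvm_amd as well is an addition not discussed in the chat. The function name and sample text are hypothetical.

```python
# Modules whose presence is taken to mean hardware virtualization is usable.
# kvm_amd is an assumption added for symmetry; the chat only mentions kvm-intel.
KVM_MODULES = {"kvm_intel", "kvm_amd"}


def pick_virt_type(lsmod_output):
    """Return 'kvm' if a KVM vendor module is listed in lsmod output, else 'qemu'."""
    loaded = {line.split()[0] for line in lsmod_output.splitlines() if line.strip()}
    return "kvm" if loaded & KVM_MODULES else "qemu"


# Hypothetical lsmod excerpts for illustration.
with_kvm = "Module                  Size  Used by\nkvm_intel             170086  0\nkvm                   566340  1 kvm_intel\n"
without_kvm = "Module                  Size  Used by\next4                  571716  1\n"

print(pick_virt_type(with_kvm))
print(pick_virt_type(without_kvm))
```

Falling back to qemu keeps jobs working on hosts without nested virtualization, at the cost of slower guests.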
rlandydoc/source/feature-configuration.rst will kill me in the end20:56
rlandymerge nightmare20:56
weshayrlandy, leave that for me20:56
weshayrlandy, I'll do it, don't waste ur time20:57
rlandyweshay: I have to fix it - I need the review for the reproducer20:57
* rlandy is just complaining - ignore me20:57
*** florianf has quit IRC20:59
*** jtomasek has joined #oooq21:01
*** jtomasek_ has quit IRC21:03
*** Goneri has quit IRC21:08
*** rfolco|ruck is now known as rfolco|off21:09
*** yolanda has quit IRC21:25
*** myoung has quit IRC21:28
*** agopi has quit IRC21:30
weshayrfolco|off, can you cover the program call tomorrow morning?21:31
weshayrfolco|off, status is RED for master, that's all we care about21:31
weshayrfolco|off, blockers include https://bugs.launchpad.net/tripleo/+bug/178471221:31
openstackLaunchpad bug 1784712 in tripleo "ExternalNetwork, InternalApiNetwork, StorageNetwork fail to validate ip_addr" [Critical,Triaged]21:31
weshayhttps://trello.com/c/hkvfxAdX/667-cixtripleoci-rdo-software-factory-3rd-party-jobs-failing-due-to-instance-nodefailure21:32
rfolco|offweshay, ack21:39
weshayrfolco|off, anything you see w/ this filter https://trello.com/b/j4IcIomh/production-chain-escalation?menu=filter&filter=label:TripleO-master21:39
*** vinaykns has joined #oooq21:48
rlandyweshay: ok, so I'll put in a job to parallel https://softwarefactory-project.io/r/#/c/12967/  ... not sure about the test definition though21:55
weshayaye21:56
*** vinaykns has quit IRC22:05
*** jtomasek has quit IRC22:21
rlandyweshay: https://softwarefactory-project.io/r/#/c/13256/ - seem correct?22:32
rlandyadding depends to job22:32
rlandyhttps://review.rdoproject.org/r/#/c/15074/ - didn't work :(22:36
*** vinaykns has joined #oooq22:38
hubbotFAILING CHECK JOBS on stable/queens: tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades @ https://review.openstack.org/567224, stable/ocata: legacy-tripleo-ci-centos-7-ovb-1ctlr_1comp_1ceph-featureset024-ocata @ https://review.openstack.org/564291, master: tripleo-ci-centos-7-3nodes-multinode, tripleo-quickstart-extras-gate-newton-delorean-full-minimal, tripleo-ci-centos-7-scenario009-multinode- (1 more message)22:39
*** vinaykns has quit IRC22:44
rlandyweshay:  https://review.rdoproject.org/r/15097 - sanity check pls22:44
rlandyfixing required projects ...22:51
*** agopi has joined #oooq23:16
*** tosky has quit IRC23:35
*** vinaykns has joined #oooq23:43
