*** hdd has joined #openstack-sahara | 00:25 | |
*** hdd has quit IRC | 00:30 | |
openstackgerrit | Ken Chen proposed openstack/sahara: Enable YARN ResourceManager HA in CDH plugin https://review.openstack.org/213000 | 01:22 |
---|---|---|
*** rodrigod` has joined #openstack-sahara | 01:54 | |
*** rodrigod` is now known as rodrigods | 01:57 | |
*** macjack has joined #openstack-sahara | 02:07 | |
*** logan2 has quit IRC | 02:18 | |
*** logan2 has joined #openstack-sahara | 02:57 | |
openstackgerrit | weiting-chen proposed openstack/sahara-specs: Add new spec for NFS-as-a-data-source blueprint https://review.openstack.org/210839 | 03:05 |
*** saneax has joined #openstack-sahara | 03:58 | |
*** saneax has quit IRC | 04:03 | |
*** openstack has joined #openstack-sahara | 04:18 | |
*** saneax has joined #openstack-sahara | 04:54 | |
*** Poornima has joined #openstack-sahara | 05:17 | |
*** nkrinner has joined #openstack-sahara | 05:27 | |
*** coolsvap|away is now known as coolsvap | 05:32 | |
*** macjack has quit IRC | 05:38 | |
*** hdd has joined #openstack-sahara | 05:38 | |
*** macjack has joined #openstack-sahara | 05:42 | |
*** coolsvap is now known as coolsvap|away | 05:55 | |
*** hdd has quit IRC | 06:24 | |
*** saneax has quit IRC | 06:29 | |
*** macjack has quit IRC | 06:30 | |
*** witlessb has joined #openstack-sahara | 06:32 | |
openstackgerrit | willy lin proposed openstack/sahara: Rename "get_job_status" to "get_job_info" https://review.openstack.org/213611 | 06:34 |
*** macjack has joined #openstack-sahara | 06:36 | |
*** macjack has quit IRC | 06:53 | |
*** macjack has joined #openstack-sahara | 06:55 | |
*** esikachev has joined #openstack-sahara | 06:57 | |
*** macjack has quit IRC | 07:06 | |
-openstackstatus- NOTICE: Gerrit is currently under very high load and may be unresponsive. infra are looking into the issue. | 07:07 | |
*** macjack has joined #openstack-sahara | 07:07 | |
*** macjack has quit IRC | 07:19 | |
*** macjack has joined #openstack-sahara | 07:22 | |
*** Nikolay_St has joined #openstack-sahara | 07:35 | |
openstackgerrit | Evgeny Sikachev proposed openstack/sahara: Update plugin version for transient tests to vanilla 2.7.1 https://review.openstack.org/213622 | 07:39 |
*** vgridnev has joined #openstack-sahara | 07:59 | |
*** sgotliv has joined #openstack-sahara | 08:09 | |
*** sgotliv has quit IRC | 08:38 | |
*** sgotliv has joined #openstack-sahara | 08:52 | |
*** degorenko has joined #openstack-sahara | 08:55 | |
*** sgotliv has quit IRC | 09:02 | |
*** sgotliv has joined #openstack-sahara | 09:14 | |
*** tosky has joined #openstack-sahara | 09:21 | |
*** macjack has quit IRC | 09:31 | |
openstackgerrit | Denis Egorenko proposed openstack/puppet-sahara: Configure rpc options separately from ceilometer notifications https://review.openstack.org/198744 | 09:52 |
*** macjack has joined #openstack-sahara | 10:14 | |
openstackgerrit | Denis Egorenko proposed openstack/puppet-sahara: Remove Sqlite validation for database_connection https://review.openstack.org/213650 | 10:14 |
openstackgerrit | Denis Egorenko proposed openstack/puppet-sahara: Configure rpc options separately from ceilometer notifications https://review.openstack.org/198744 | 10:16 |
openstackgerrit | Denis Egorenko proposed stackforge/sahara-ci-config: Add force options for apt-get upgrade https://review.openstack.org/213651 | 10:21 |
-openstackstatus- NOTICE: review.openstack.org (aka gerrit) is going down for an emergency restart | 10:22 | |
*** ChanServ changes topic to "review.openstack.org (aka gerrit) is going down for an emergency restart" | 10:22 | |
openstackgerrit | Denis Egorenko proposed openstack/puppet-sahara: Configure rpc options separately from ceilometer notifications https://review.openstack.org/198744 | 10:27 |
*** macjack has quit IRC | 10:29 | |
openstackgerrit | Luigi Toscano proposed openstack/sahara: doc, sahara-templates: fix typo https://review.openstack.org/213653 | 10:33 |
openstackgerrit | Denis Egorenko proposed stackforge/sahara-ci-config: Add force options for apt-get upgrade https://review.openstack.org/213651 | 10:44 |
*** ashishb has joined #openstack-sahara | 10:44 | |
openstackgerrit | Evgeny Sikachev proposed stackforge/sahara-ci-config: Update version of vanilla plugin for transient check https://review.openstack.org/213657 | 10:44 |
*** ChanServ changes topic to "OpenStack Sahara // IRC Meetings - http://eavesdrop.openstack.org/#OpenStack_Data_Processing_(Sahara)_Team_Meeting" | 10:49 | |
-openstackstatus- NOTICE: Gerrit restart has resolved the issue and systems are back up and functioning | 10:49 | |
*** sgotliv has quit IRC | 11:01 | |
openstackgerrit | Sergey Lukjanov proposed openstack/sahara: Make infra engine configurable in devstack plugin https://review.openstack.org/213545 | 11:03 |
openstackgerrit | Sergey Lukjanov proposed openstack/sahara: Run scenario tests for the fake plugin in gate https://review.openstack.org/213546 | 11:03 |
openstackgerrit | Andrey Pavlov proposed openstack/python-saharaclient: Adding updates for clusters, jobs, job binary internals https://review.openstack.org/213665 | 11:06 |
openstackgerrit | Merged stackforge/sahara-ci-config: Add force options for apt-get upgrade https://review.openstack.org/213651 | 11:06 |
*** esikachev has quit IRC | 11:11 | |
openstackgerrit | Sergey Lukjanov proposed openstack/sahara: Add sample usage info for pre_test_hook https://review.openstack.org/213667 | 11:11 |
*** esikachev has joined #openstack-sahara | 11:19 | |
openstackgerrit | Evgeny Sikachev proposed stackforge/sahara-ci-config: Update version of vanilla plugin for transient check https://review.openstack.org/213657 | 11:21 |
openstackgerrit | Denis Egorenko proposed openstack/puppet-sahara: Update parameters for Sahara https://review.openstack.org/205988 | 11:21 |
*** sgotliv has joined #openstack-sahara | 11:23 | |
openstackgerrit | Evgeny Sikachev proposed stackforge/sahara-ci-config: Update version of vanilla plugin for transient check https://review.openstack.org/213657 | 11:28 |
openstackgerrit | Denis Egorenko proposed openstack/puppet-sahara: Configure rpc options separately from ceilometer notifications https://review.openstack.org/198744 | 11:30 |
openstackgerrit | Andrey Pavlov proposed openstack/sahara: Formatting and mounting methods changed for ironic https://review.openstack.org/200483 | 11:42 |
*** sgotliv has quit IRC | 11:42 | |
*** sgotliv has joined #openstack-sahara | 11:57 | |
*** Poornima has quit IRC | 11:59 | |
*** chlong has quit IRC | 11:59 | |
*** sgotliv has quit IRC | 11:59 | |
*** sgotliv has joined #openstack-sahara | 12:00 | |
*** witlessb has quit IRC | 12:07 | |
*** witlessb has joined #openstack-sahara | 12:08 | |
openstackgerrit | Andrey Pavlov proposed openstack/sahara: Adding shared and protected resources support https://review.openstack.org/195568 | 12:15 |
openstackgerrit | Andrey Pavlov proposed openstack/sahara: Adding is_public and is_protected fields support https://review.openstack.org/195065 | 12:15 |
*** DWfuturetec has joined #openstack-sahara | 12:15 | |
openstackgerrit | Luigi Toscano proposed openstack/sahara: Scenario tests: store ssh key if resources are retained https://review.openstack.org/213690 | 12:20 |
openstackgerrit | Sergey Lukjanov proposed openstack/sahara: Add sample usage info for pre_test_hook https://review.openstack.org/213667 | 12:21 |
openstackgerrit | Sergey Lukjanov proposed openstack/sahara: Run scenario tests for the fake plugin in gate https://review.openstack.org/213546 | 12:21 |
*** tellesnobrega_af has quit IRC | 12:28 | |
SergeyLukjanov | hey folks, does anybody wants to chair the bug triage day? | 12:29 |
*** tellesnobrega has joined #openstack-sahara | 12:30 | |
openstackgerrit | Vitaly Gridnev proposed stackforge/sahara-ci-config: Start using edp.yaml.mako template https://review.openstack.org/213693 | 12:31 |
*** ekarlso has quit IRC | 12:44 | |
*** ekarlso has joined #openstack-sahara | 12:44 | |
tmckay | hmm, is CI broken? | 12:51 |
vgridnev | tmckay, right now seems not https://sahara.mirantis.com/zuul/ | 12:54 |
tmckay | vgridnev, okay, thanks. I had a rebase several days ago which did not refire for some reason | 12:55 |
openstackgerrit | Merged stackforge/sahara-ci-config: Update version of vanilla plugin for transient check https://review.openstack.org/213657 | 12:55 |
tmckay | just added a recheck | 12:56 |
openstackgerrit | Merged stackforge/sahara-ci-config: Start using edp.yaml.mako template https://review.openstack.org/213693 | 12:56 |
*** shikel has joined #openstack-sahara | 12:58 | |
*** ashishb has quit IRC | 13:02 | |
*** esikachev has quit IRC | 13:03 | |
*** elmiko has joined #openstack-sahara | 13:07 | |
*** DWfuturetec has quit IRC | 13:31 | |
*** egafford has joined #openstack-sahara | 13:35 | |
elmiko | did anyone sign up to run bug triage day? | 13:49 |
elmiko | SergeyLukjanov: ^ | 13:49 |
SergeyLukjanov | elmiko, not yet :) | 13:49 |
SergeyLukjanov | elmiko, do you want to do it? | 13:50 |
tmckay | well, we can all just start triaging bugs :) | 13:50 |
elmiko | SergeyLukjanov: i can help, we mainly need to check all the new untriaged stuff | 13:50 |
elmiko | i'll setup an etherpad | 13:51 |
elmiko | https://etherpad.openstack.org/p/liberty-3-sahara-bug-triage | 13:51 |
openstackgerrit | Merged openstack/sahara: Add recommendation support to Cloudera plugin https://review.openstack.org/193098 | 13:54 |
openstackgerrit | Merged openstack/sahara: Support placeholders in args of job for i/o https://review.openstack.org/206094 | 13:54 |
SergeyLukjanov | elmiko, oh, I've missed your message | 13:57 |
openstackgerrit | Merged openstack/sahara: Add scenario gate testing placeholders https://review.openstack.org/213544 | 13:57 |
SergeyLukjanov | elmiko, I've already mailed the etherpad to openstack-dev | 13:57 |
SergeyLukjanov | http://etherpad.openstack.org/p/sahara-liberty-bug-triage-day | 13:57 |
SergeyLukjanov | prev. week | 13:57 |
SergeyLukjanov | let's use this one to avoid people flustrating on different etherpads in mail list and here ;) | 13:58 |
SergeyLukjanov | I've added link to the wiki page with useful queries and order of the bugs checking by prio | 13:58 |
openstackgerrit | Vitaly Gridnev proposed stackforge/sahara-ci-config: Revert "Update version of vanilla plugin for transient check" https://review.openstack.org/213716 | 14:00 |
*** chlong has joined #openstack-sahara | 14:03 | |
elmiko | SergeyLukjanov: np, i'll copy what i have into the new pad | 14:11 |
vgridnev | I marked https://bugs.launchpad.net/sahara/+bug/1479666 as duplicate of https://bugs.launchpad.net/sahara/+bug/1436425 | 14:14 |
openstack | Launchpad bug 1436425 in Sahara "duplicate for #1479666 [CDH 5.3.0] Too many connection" [Undecided,New] | 14:14 |
uvirtbot | Launchpad bug 1479666 in sahara "Too many connections (dup-of: 1436425)" [Undecided,New] | 14:14 |
openstack | Launchpad bug 1436425 in Sahara "[CDH 5.3.0] Too many connection" [Undecided,New] | 14:14 |
uvirtbot | Launchpad bug 1436425 in sahara "[CDH 5.3.0] Too many connection" [Undecided,New] | 14:14 |
uvirtbot | Launchpad bug 1436425 in sahara "[CDH 5.3.0] Too many connection" [Undecided,New] https://launchpad.net/bugs/1436425 | 14:14 |
elmiko | SergeyLukjanov: ok, copied all "undecided" bugs into the new pad | 14:15 |
elmiko | https://etherpad.openstack.org/p/sahara-liberty-bug-triage-day | 14:16 |
*** egafford has quit IRC | 14:16 | |
*** _crobertsrh is now known as crobertsrh | 14:17 | |
elmiko | vgridnev: thanks | 14:17 |
vgridnev | elmiko, what I should do with that bug in ether pad? | 14:17 |
elmiko | mark it with strike-through (ctrl+s) once it's been triaged | 14:18 |
*** egafford has joined #openstack-sahara | 14:18 | |
elmiko | oops, thats (ctrl+5) | 14:20 |
elmiko | vgridnev: yea, thanks! | 14:20 |
*** DWfuturetec has joined #openstack-sahara | 14:22 | |
vgridnev | We have several bugs that filed for Hadoop 1 in Vanilla and HDP, so I suppose that it should be marked as Invalid or Won't Fix ? | 14:22 |
elmiko | vgridnev: i think won't fix makes sense, with a comment about deprecating hadoop1 | 14:23 |
SergeyLukjanov | IMO Hadoop 1 issues are won't fix | 14:23 |
SergeyLukjanov | elmiko ++ | 14:23 |
tosky | SergeyLukjanov, elmiko, vgridnev: can we can first kill Hadoop 1 and HDP standalone and then kill the bugs | 14:24 |
tosky | order is important sometime :) | 14:24 |
vgridnev | I think that invalid bug: https://bugs.launchpad.net/sahara/+bug/1416968 | 14:24 |
openstack | Launchpad bug 1416968 in Sahara "scaling test failed in cdh integration test" [Undecided,New] - Assigned to lu huichun (lhcxx0508) | 14:24 |
uvirtbot | Launchpad bug 1416968 in sahara "scaling test failed in cdh integration test" [Undecided,New] | 14:25 |
uvirtbot | Launchpad bug 1416968 in sahara "scaling test failed in cdh integration test" [Undecided,New] https://launchpad.net/bugs/1416968 | 14:25 |
elmiko | tosky: i'm ok with that, but we should at least mark the bugs with a comment if we are going to leave them as "undecided" | 14:25 |
tosky | elmiko: sure, even group them; not sure how to do it in launchpad | 14:25 |
tosky | a tag maybe so that they can be found easily? | 14:25 |
elmiko | hmm, vgridnev what is an example of a hadoop1 bug? | 14:26 |
vgridnev | 1min | 14:26 |
vgridnev | https://bugs.launchpad.net/sahara/+bug/1438153 | 14:27 |
openstack | Launchpad bug 1438153 in Sahara "[HDP 1.3.2] With enabled auto security grup EDP jobs do not pass" [Undecided,New] | 14:27 |
uvirtbot | Launchpad bug 1438153 in sahara "[HDP 1.3.2] With enabled auto security grup EDP jobs do not pass" [Undecided,New] | 14:27 |
uvirtbot | Launchpad bug 1438153 in sahara "[HDP 1.3.2] With enabled auto security grup EDP jobs do not pass" [Undecided,New] https://launchpad.net/bugs/1438153 | 14:27 |
elmiko | thanks | 14:27 |
vgridnev | https://bugs.launchpad.net/sahara/+bug/1436372 | 14:28 |
openstack | Launchpad bug 1436372 in Sahara "Java job has "KILLED" state" [Undecided,New] | 14:28 |
uvirtbot | Launchpad bug 1436372 in sahara "Java job has "KILLED" state" [Undecided,New] | 14:28 |
uvirtbot | Launchpad bug 1436372 in sahara "Java job has "KILLED" state" [Undecided,New] https://launchpad.net/bugs/1436372 | 14:28 |
elmiko | tosky, SergeyLukjanov, vgridnev, so for hadoop1 maybe could mark them as "triaged" with a comment about deprecation, and when we have deprecated we can remark the bugs as "won't fix"? | 14:29 |
vgridnev | elmiko, I propose to move them to Won't fix and add target to liberty-3 | 14:29 |
elmiko | vgridnev: yea, but tosky is asking that we not mark them as "won't fix" until we actually deprecate the hadoop1 stuff. since it is still technically in the codebase. | 14:30 |
vgridnev | elmiko, spec for drop already merged | 14:31 |
tosky | vgridnev, elmiko: exactly; first remove the code, then close the bugs, otherwise it's a bit confusing | 14:31 |
tosky | oh | 14:31 |
tosky | I missed the merge | 14:31 |
elmiko | tosky: so, are you ok with us marking them as "won't fix" then? | 14:32 |
tosky | well, if the spec is approved, they are technically zombies | 14:32 |
tosky | so... | 14:32 |
elmiko | k, thanks | 14:32 |
*** Nikolay_St has quit IRC | 14:33 | |
elmiko | ok then, i'm gonna mark some of these and leave comments | 14:33 |
tosky | if you remember, please add a link to the spec when closing them | 14:33 |
elmiko | good idea | 14:33 |
elmiko | tosky: take a look at https://bugs.launchpad.net/sahara/+bug/1436372 | 14:35 |
openstack | Launchpad bug 1436372 in Sahara "Java job has "KILLED" state" [Undecided,Won't fix] | 14:35 |
uvirtbot | Launchpad bug 1436372 in sahara "Java job has "KILLED" state" [Undecided,New] | 14:35 |
uvirtbot | Launchpad bug 1436372 in sahara "Java job has "KILLED" state" [Undecided,New] https://launchpad.net/bugs/1436372 | 14:35 |
elmiko | and see my comment | 14:35 |
tosky | elmiko: right, make sense | 14:35 |
elmiko | k, thanks | 14:36 |
tosky | SergeyLukjanov, vgridnev, elmiko: the spec is only for Vanilla 1 and HDP 1; I think also one of the MapR plugins is Hadoop 1, isn'it ? | 14:36 |
elmiko | good question, /me looks | 14:37 |
vgridnev | tosky, Mapr plugin is some kind of magic in our sahara code | 14:37 |
elmiko | lol | 14:38 |
tosky | O.o | 14:38 |
vgridnev | no one knows what happens there | 14:38 |
elmiko | i actually really like how the mapr plugin is structured | 14:38 |
tosky | can this be raised with MapR developers/contributors? 3.1.1 is the Hadoop 1 iirc | 14:39 |
tosky | or maybe not, let me recheck | 14:40 |
elmiko | tosky: i think it's written to be somewhat flexible with regards to hadoop version | 14:40 |
tosky | uhm, it's not really clear for me from MapR release notes | 14:42 |
elmiko | hence vgridnev's comment ;) | 14:43 |
elmiko | i think you are correct though, 3.1.1 is hadoop 1 | 14:44 |
elmiko | but 4.0.0 looks like it supports both | 14:44 |
crobertsrh | Has anyone else managed to reproduce https://bugs.launchpad.net/sahara/+bug/1477530 | 14:46 |
openstack | Launchpad bug 1477530 in Sahara "Cluster stays in waiting state and then goes into error" [Undecided,New] | 14:46 |
uvirtbot | Launchpad bug 1477530 in sahara "Cluster stays in waiting state and then goes into error" [Undecided,New] | 14:46 |
uvirtbot | Launchpad bug 1477530 in sahara "Cluster stays in waiting state and then goes into error" [Undecided,New] https://launchpad.net/bugs/1477530 | 14:46 |
crobertsrh | wow, attack of the bots | 14:47 |
vgridnev | elmiko, tosky I'm ping Mapr guy, he will verify bugs for Mapr plugin | 14:47 |
elmiko | vgridnev: awesome, thank you | 14:47 |
elmiko | crobertsrh: i have seen that issue before, but i think we might have fixed the indefinite "waiting" status with the timeout stuff. | 14:47 |
crobertsrh | Ok. I see it's with nova networks. I haven't used nova in quite awhile and was hoping to avoid doing so. | 14:48 |
egafford | elmiko: Yeah, the bug specifies that it goes into Error, though, so I think the bug is a follow-up on "configuring forever". | 14:48 |
elmiko | should be easy to test, just turn off the sec groups and auto-config sec. groups, then try to spin a cluster | 14:48 |
elmiko | seems like not a bug to me... | 14:49 |
vgridnev | Do we have one job with nova-network + heat engine on our CI? | 14:51 |
elmiko | not sure about that | 14:52 |
vgridnev | seems that HDP 2.0.6 is testing on that environment | 14:52 |
*** nkrinner has quit IRC | 14:53 | |
egafford | vgridnev: And HDP 2.0.6 has been failing for a while... hm... | 14:54 |
elmiko | ouch | 14:55 |
vgridnev | egafford, not so much | 14:56 |
tmckay | hi folks. https://bugs.launchpad.net/sahara/+bug/1470525, changing the output in the CLI for the job list | 14:56 |
openstack | Launchpad bug 1470525 in Sahara "Sahara CLI does not show start_time in job-list" [Wishlist,New] - Assigned to Henrique Truta (henriquetruta) | 14:56 |
uvirtbot | Launchpad bug 1470525 in sahara "Sahara CLI does not show start_time in job-list" [Wishlist,New] | 14:56 |
uvirtbot | Launchpad bug 1470525 in sahara "Sahara CLI does not show start_time in job-list" [Wishlist,New] https://launchpad.net/bugs/1470525 | 14:56 |
vgridnev | It's passed there: https://review.openstack.org/#/c/207039/ | 14:56 |
tmckay | changed it to wishlist, what about a target? I guess we still have 2 weeks before the freeze. Would we require a spec for this? Should it be posted as a bluepring instead of a bug? | 14:57 |
AndreyPavlov | What do you suppose to do with CLI-related bugs (wich sounds more like wishes)? Spec for new CLI, integrated with openstackclient, is on review. Should we fix the old one? | 14:57 |
htruta | tmckay: I think someone has suggested to put it as a bug | 14:57 |
egafford | tmckay: Honestly, order by UUID is kind of so unuseable that filing it as a bug makes some sense to me. | 14:57 |
tmckay | I'm going to suggest that it be closed wontfix and redone as a blueprint | 14:57 |
egafford | tmckay: There are UX nice-to-haves, but this does strike me as a UX oversight / bug. | 14:58 |
tmckay | htrutra, hey there :) yeah, gray area | 14:58 |
htruta | tmckay: hey :) let me see here | 14:58 |
tmckay | alright, I'm okay leaving it as a bug. Only thought was, since there is a suggestion about how to sort the list, it sounds more like a spec/blueprint to me | 14:59 |
elmiko | AndreyPavlov: i think we should mark those as wishlist and add a comment to the bug about migrating to openstackclient | 14:59 |
egafford | tmckay: Yeah, totally valid that it could be treated as a feature. | 14:59 |
tmckay | either way. If it's a bug, we should target to liberty 3, and if it doesn't get in we can bump to M | 14:59 |
egafford | tmckay: +1. | 14:59 |
tmckay | htrutra, okay, carry on. Wishlist/confirmed/l3 | 15:00 |
tmckay | thanks | 15:00 |
elmiko | AndreyPavlov brings up a good point about sahara cli related stuff. if we are moving to the openstackclient we should reassess how we handle these new features | 15:00 |
htruta | tmckay: found it: https://review.openstack.org/#/c/191875/3//COMMIT_MSG | 15:00 |
htruta | guess I was too slow | 15:00 |
vgridnev | tmckay, it should affects only saharaclient | 15:01 |
vgridnev | not sahara | 15:01 |
tmckay | htruta, heh, aignatov shot me down :) | 15:01 |
egafford | Folks good with targetting https://bugs.launchpad.net/sahara/+bug/1485624 to L3? Oversight related to Main Class and other required config keys on interface map. | 15:01 |
openstack | Launchpad bug 1485624 in Sahara "Main Class required in configs even when mapped in interface" [Undecided,New] | 15:01 |
aignatov | tmckay: how? :) | 15:01 |
uvirtbot | Launchpad bug 1485624 in sahara "Main Class required in configs even when mapped in interface" [Undecided,New] | 15:01 |
uvirtbot | Launchpad bug 1485624 in sahara "Main Class required in configs even when mapped in interface" [Undecided,New] https://launchpad.net/bugs/1485624 | 15:01 |
tmckay | okay, I stand corrected | 15:01 |
tmckay | aignatov, I told htruta his bug should be a bp, you told him in the commit that it should be a bug :) | 15:02 |
aignatov | :) | 15:02 |
tmckay | and it already merged, so it should be fix committed | 15:02 |
egafford | Also, hruta, tmckay: It's worth real thought as to whether that ordering should be put into the service layer (otherwise we get into weird places on pagination.) | 15:02 |
aignatov | sorry for that htruta tmckay | 15:02 |
egafford | Ah, never mind. | 15:02 |
htruta | tmckay: it was a patch before the one you showed... but it should follow the same line | 15:02 |
tmckay | hmm, htrutra, why only "partial bug" on that commit? Is there more? | 15:02 |
tmckay | aignatov, lol, completely np | 15:02 |
elmiko | egafford: +1 on 1485624 | 15:03 |
aignatov | actually this change looks really simple :) | 15:03 |
egafford | elmiko: Cool. | 15:03 |
htruta | tmckay: the first: https://review.openstack.org/#/c/191875 | 15:03 |
aignatov | so I’ve thought that it could look like bug | 15:03 |
htruta | the second: https://review.openstack.org/#/c/197606/ | 15:03 |
htruta | I thought about doing this in Horizon, as well | 15:03 |
htruta | would make my life a lot easier | 15:03 |
elmiko | egafford: are you gonna mark that one as confirmed? | 15:04 |
tmckay | htrutra, hmm, still "closes bug" though, wonder why launchpad didn't pick it up | 15:04 |
egafford | elmiko: Well, in theory, Confirmed is "someone other than the reporter." | 15:04 |
egafford | It's pretty definitely happening, though. | 15:04 |
tmckay | htruta, anything else to do on that, or is it really closed? I think we want "fix committed" on the bug | 15:05 |
tmckay | maybe because the ":" was missing in the commit message? | 15:05 |
htruta | tmckay: was intending to take it to horizon, but didn't have the time for it | 15:06 |
htruta | I'm ok with closing it for now | 15:07 |
tmckay | htrutra, that's okay. that can be a separate issue. Let's close this one, maybe it didn't close because it was against sahara and not the client? | 15:07 |
egafford | htruta: If the order by is in place in the client, won't that propagate to Horizon without a specific change? | 15:07 |
tmckay | egafford, client or CLI? /me looks at the change | 15:08 |
htruta | I don't think so... I've only changed the CLI part | 15:08 |
tmckay | this is just the shell ... not the client | 15:08 |
egafford | tmckay: Ah, yeah, okay, if it's just in the shell, we still need work there. | 15:08 |
egafford | htruta: Cool; thanks for the clarification. | 15:08 |
tmckay | I picked the simplest, and also the most discussed, bug to triage :) | 15:09 |
egafford | tmckay: Nice place to start. :) | 15:09 |
htruta | egafford: np | 15:09 |
elmiko | all i ask is that people update the etherpad once they have triaged a bug | 15:12 |
tmckay | question now is milestone for the client ... | 15:12 |
tmckay | SergeyLukjanov, unsure how to set milestone on https://bugs.launchpad.net/sahara/+bug/1470525, it's already merged, I moved it against the client instead of Sahara | 15:16 |
openstack | Launchpad bug 1470525 in Python client library for Sahara "Sahara CLI does not show start_time in job-list" [Wishlist,Fix committed] - Assigned to Henrique Truta (henriquetruta) | 15:16 |
uvirtbot | Launchpad bug 1470525 in sahara "Sahara CLI does not show start_time in job-list" [Wishlist,New] | 15:16 |
uvirtbot | Launchpad bug 1470525 in sahara "Sahara CLI does not show start_time in job-list" [Wishlist,New] https://launchpad.net/bugs/1470525 | 15:16 |
egafford | tmckay: I remember you had all manner of trouble with VM sizing on the Cloudera default templates. In your expert opinion on that point, is there anything we can actually do about https://bugs.launchpad.net/sahara/+bug/1416969? | 15:16 |
openstack | Launchpad bug 1416969 in Sahara "big flavor cost too resource when running cdh integration test" [Undecided,New] - Assigned to lu huichun (lhcxx0508) | 15:16 |
uvirtbot | Launchpad bug 1416969 in sahara "big flavor cost too resource when running cdh integration test" [Undecided,New] | 15:16 |
uvirtbot | Launchpad bug 1416969 in sahara "big flavor cost too resource when running cdh integration test" [Undecided,New] https://launchpad.net/bugs/1416969 | 15:16 |
tmckay | egafford, nope | 15:17 |
egafford | tmckay: That was my thought. | 15:17 |
openstackgerrit | Sergey Lukjanov proposed openstack/sahara: [WIP] Run scenario tests for the fake plugin in gate https://review.openstack.org/213546 | 15:17 |
tmckay | without the flavors outlined in the default templtes, cdh will hang | 15:17 |
egafford | CDH: Putting the big in big data. | 15:17 |
vgridnev | egafford, I think it's another invalid bug | 15:18 |
egafford | vgridnev: Yeah, agreed; just wanted to confirm. Any objection to invalid status on that one? | 15:18 |
*** chlong has quit IRC | 15:18 | |
egafford | (Or perhaps just a comment to Lu Huichun, for politeness' sake?) | 15:18 |
elmiko | so, are we marking this one as not a bug? https://bugs.launchpad.net/sahara/+bug/1477530 | 15:19 |
openstack | Launchpad bug 1477530 in Sahara "Cluster stays in waiting state and then goes into error" [Undecided,New] | 15:19 |
uvirtbot | Launchpad bug 1477530 in sahara "Cluster stays in waiting state and then goes into error" [Undecided,New] | 15:19 |
uvirtbot | Launchpad bug 1477530 in sahara "Cluster stays in waiting state and then goes into error" [Undecided,New] https://launchpad.net/bugs/1477530 | 15:19 |
elmiko | imo it seems more like a config/operator issue than a bug | 15:19 |
egafford | Well, is anyone successfully using nova-net in their devstack setups atm? | 15:20 |
elmiko | either that or we should change sahara to ensure that there are no blocks to using ssh before starting a cluster | 15:20 |
elmiko | egafford: i just don't see the bug here. the operator doesn't have a security group to allow ssh traffic, of course sahara will fail | 15:21 |
elmiko | the one thing we could do, is check sec.groups to ensure that ssh is available, but that might be a bit much. | 15:21 |
egafford | elmiko: I think the bug is that the auto sec groups aren't being assigned through Nova. | 15:21 |
elmiko | egafford: maybe we should mark as incomplete and ask whether auto-groups were used? | 15:22 |
vgridnev | btw, do we care, that we use some kind of icehouse quichstart? | 15:22 |
egafford | elmiko: Or at least, that's my read on what the issue is, assuming a generous stance toward the reporter. | 15:22 |
egafford | elmiko: Yeah, that's fair. | 15:22 |
elmiko | vgridnev: that's a question for doc days ;) | 15:22 |
elmiko | vgridnev: but i think we should update the quickstart, so yes, i think we should care | 15:23 |
crobertsrh | quickstart should probably be updated | 15:23 |
vgridnev | elmiko, do we want to file bug for that? Just to ensure that it will be fixed in docs-days | 15:23 |
elmiko | hmm, good question vgridnev. i suppose it wouldn't hurt | 15:24 |
vgridnev | elmiko, https://bugs.launchpad.net/sahara/+bug/1485648 | 15:26 |
openstack | Launchpad bug 1485648 in Sahara "Quickstart user guide is too old" [Undecided,New] | 15:26 |
uvirtbot | Launchpad bug 1485648 in sahara "Quickstart user guide is too old" [Undecided,New] | 15:26 |
uvirtbot | Launchpad bug 1485648 in sahara "Quickstart user guide is too old" [Undecided,New] https://launchpad.net/bugs/1485648 | 15:26 |
tmckay | hey, alright, I am the only one that I see that did a strikethrough as elmiko requested. | 15:26 |
egafford | elmiko, tmckay: Marked https://bugs.launchpad.net/sahara/+bug/1416969 incomplete pending notation of a specific flavor that does work for the CDH namenode. Seemed the most polite thing. | 15:26 |
openstack | Launchpad bug 1416969 in Sahara "big flavor cost too resource when running cdh integration test" [Undecided,Incomplete] - Assigned to lu huichun (lhcxx0508) | 15:26 |
uvirtbot | Launchpad bug 1416969 in sahara "big flavor cost too resource when running cdh integration test" [Undecided,Incomplete] | 15:26 |
uvirtbot | Launchpad bug 1416969 in sahara "big flavor cost too resource when running cdh integration test" [Undecided,New] https://launchpad.net/bugs/1416969 | 15:26 |
tmckay | Has anyone actually triaged anything else? Where are the strikethroughs? | 15:26 |
tmckay | am I on the wrong sheet? | 15:26 |
elmiko | tmckay: https://etherpad.openstack.org/p/sahara-liberty-bug-triage-day | 15:26 |
tmckay | https://etherpad.openstack.org/p/liberty-3-sahara-bug-triage | 15:26 |
elmiko | there are a bunch on that page | 15:26 |
tmckay | doh | 15:27 |
elmiko | yea, SergeyLukjanov had sent out the link to his page on the ML | 15:27 |
elmiko | egafford: ack | 15:27 |
tmckay | elmiko, saw that conversation and still picked the wrong one | 15:29 |
elmiko | tmckay: no worries =) | 15:30 |
elmiko | there needs to be a way to delete an etherpad, ooh wait i know | 15:30 |
elmiko | ok, fixed | 15:31 |
*** chlong has joined #openstack-sahara | 15:31 | |
vgridnev | that should be also won't fix: https://bugs.launchpad.net/sahara/+bug/1416992 - we use new scenario tests | 15:32 |
openstack | Launchpad bug 1416992 in Sahara "need two parameters added in the config.py in cdh integration test" [Undecided,New] - Assigned to lu huichun (lhcxx0508) | 15:32 |
uvirtbot | Launchpad bug 1416992 in sahara "need two parameters added in the config.py in cdh integration test" [Undecided,New] | 15:32 |
uvirtbot | Launchpad bug 1416992 in sahara "need two parameters added in the config.py in cdh integration test" [Undecided,New] https://launchpad.net/bugs/1416992 | 15:32 |
elmiko | vgridnev: why won't fix? | 15:32 |
vgridnev | we don't use old integrations tests | 15:33 |
elmiko | ah, ok | 15:33 |
vgridnev | Or invalid, not so sure | 15:33 |
elmiko | so, this should be fixed for the tempest tests then? | 15:33 |
vgridnev | why? | 15:34 |
egafford | Hm: https://bugs.launchpad.net/sahara/+bug/1416968 is likely about the old integration tests, but I'm noting that the current CI pipelines for CDH don't run scaling tests. I can see adapating this bug to be a test coverage bug for scaling for CDH in the scenario tests. | 15:34 |
openstack | Launchpad bug 1416968 in Sahara "scaling test failed in cdh integration test" [Undecided,New] - Assigned to lu huichun (lhcxx0508) | 15:34 |
uvirtbot | Launchpad bug 1416968 in sahara "scaling test failed in cdh integration test" [Undecided,New] | 15:34 |
uvirtbot | Launchpad bug 1416968 in sahara "scaling test failed in cdh integration test" [Undecided,New] https://launchpad.net/bugs/1416968 | 15:34 |
elmiko | maybe i misunderstood what you are saying. are we not checking the cdh plugin during tempest testing? | 15:34 |
elmiko | vgridnev, egafford, sounds like we should mark 1416968 as incomplete and ask a few questions? | 15:36 |
egafford | elmiko: Yup, sounds about right. | 15:36 |
*** chlong has quit IRC | 15:38 | |
vgridnev | elmiko, agreed | 15:39 |
egafford | vgridnev, elmiko: Added a few comments there. | 15:39 |
elmiko | egafford: thanks | 15:39 |
egafford | elmiko: How does one make text background red in etherpad? I may be being dense, but I see no interface for that fanciness. | 15:40 |
*** chlong has joined #openstack-sahara | 15:40 | |
* egafford needs his triaged bugs to be fancier! | 15:40 | |
elmiko | egafford: its based on your user color | 15:40 |
egafford | elmiko: Ah. This makes sense. | 15:40 |
egafford | Thought it signified something entirely different. | 15:41 |
elmiko | nah, AndreyPavlov just happened to be red color for this pad | 15:41 |
egafford | Instead, it signifies apavlov. | 15:41 |
egafford | (A great thing to signify, really.) :) | 15:41 |
elmiko | so, the comments you made to 1416968, do those apply to https://bugs.launchpad.net/sahara/+bug/1416992 as well? | 15:42 |
openstack | Launchpad bug 1416992 in Sahara "need two parameters added in the config.py in cdh integration test" [Undecided,New] - Assigned to lu huichun (lhcxx0508) | 15:42 |
uvirtbot | Launchpad bug 1416992 in sahara "need two parameters added in the config.py in cdh integration test" [Undecided,New] | 15:42 |
uvirtbot | Launchpad bug 1416992 in sahara "need two parameters added in the config.py in cdh integration test" [Undecided,New] https://launchpad.net/bugs/1416992 | 15:42 |
elmiko | well, the first question at least | 15:42 |
*** hdd has joined #openstack-sahara | 15:43 | |
egafford | elmiko: Yeah, these are definitely old. I'll check the scenario tests quick to see if there's feature parity with what this is asking for, though. | 15:43 |
elmiko | egafford: thanks! | 15:43 |
vgridnev | want is the purpose of this bug? | 15:44 |
vgridnev | I don't clearly understand that | 15:45 |
egafford | vgridnev: It's not clear. :) | 15:45 |
elmiko | sounds like a good case to mark it incomplete and ask questions =) | 15:45 |
vgridnev | elmiko, +1 | 15:45 |
tmckay | vgridnev, anything to add on https://bugs.launchpad.net/sahara/+bug/1466876? Looks "incomplete" to me, no info on how to rerpoduce. Have you seen it since? | 15:45 |
openstack | Launchpad bug 1466876 in Sahara "Arguments dropped when creating context" [Undecided,New] | 15:45 |
uvirtbot | Launchpad bug 1466876 in sahara "Arguments dropped when creating context" [Undecided,New] | 15:45 |
uvirtbot | Launchpad bug 1466876 in sahara "Arguments dropped when creating context" [Undecided,New] https://launchpad.net/bugs/1466876 | 15:45 |
vgridnev | tmckay, I have some https://sahara.mirantis.com/logs/39/207039/4/check/gate-sahara-neutron-heat-vanilla_2.6.0-u14/cf5f283/ | 15:47 |
vgridnev | tmckay, it's reproducable on all envs with api and 2 engines | 15:48 |
tmckay | ah, okay, multiple engines. Would you kindly add the above information to the bug, and then someone can confirm it? | 15:49 |
tmckay | vgridnev, also, a note on whether it breaks anything (if not, it can be "low") | 15:49 |
vgridnev | tmckay, I suppose it should invalid, btw | 15:50 |
egafford | Hm; this one is interesting (and has bit me a bit in the past): https://bugs.launchpad.net/sahara/+bug/1419643 | 15:50 |
openstack | Launchpad bug 1419643 in Sahara "saharaclient should check the input param when cluster-create" [Undecided,Triaged] - Assigned to warewang (wangguangcai) | 15:50 |
uvirtbot | Launchpad bug 1419643 in sahara "saharaclient should check the input param when cluster-create" [Undecided,Triaged] | 15:50 |
uvirtbot | Launchpad bug 1419643 in sahara "saharaclient should check the input param when cluster-create" [Undecided,Triaged] https://launchpad.net/bugs/1419643 | 15:50 |
elmiko | egafford: i just triaged that one | 15:50 |
tmckay | vgridnev, why invalid? | 15:50 |
egafford | Is the stdin JSON CLI feature worth potential hanging? | 15:50 |
vgridnev | I was added to setup logging on devstack | 15:50 |
egafford | elmiko: Ah, okay. I don't see Milestone or Importance, though... | 15:50 |
vgridnev | or to handle it's correctly | 15:50 |
elmiko | egafford: i left those open as we need to discuss what should happen. i added it to the meeting agenda for this week, based on alazarev's comments in the bug. | 15:51 |
egafford | elmiko: Sensible. | 15:52 |
elmiko | egafford: i'm not sure the proper response, i suppose if we determine that stdin is sending nothing then we should fail early. | 15:52 |
egafford | elmiko: I think it's supposed to wait for the user to type JSON. | 15:53 |
elmiko | it's also partially a pbkac issue as i've seen many other commands just hang when you pipe stdin to them | 15:53 |
elmiko | egafford: right... how can we know what they are doing at the shell | 15:53 |
egafford | elmiko: But I'm totally unconvinced that's a realistically useful feature, or worth the hanging failure. | 15:53 |
*** vgridnev has quit IRC | 15:54 | |
elmiko | egafford: agreed, if stdin.read() == 0 then we should probably fail | 15:54 |
egafford | elmiko: +1. | 15:54 |
egafford | (Mind if I note the addition to the agenda and this discussion on the bug, for memory's sake?) | 15:55 |
egafford | Ah, you've done so since I last checked. Bully for you! | 15:55 |
egafford | This one (https://bugs.launchpad.net/sahara/+bug/1431460) is a real oddity (CDH uses _ rather than . to separate tokens in config keys.) Does anyone know of a reason in the bowels of CDH why this is appropriate? | 15:56 |
openstack | Launchpad bug 1431460 in Sahara "Different naming pattern for Cluster template parameters in CDH plugin" [Undecided,New] | 15:56 |
uvirtbot | Launchpad bug 1431460 in sahara "Different naming pattern for Cluster template parameters in CDH plugin" [Undecided,New] | 15:56 |
uvirtbot | Launchpad bug 1431460 in sahara "Different naming pattern for Cluster template parameters in CDH plugin" [Undecided,New] https://launchpad.net/bugs/1431460 | 15:57 |
*** egafford is now known as egafford|afk | 15:58 | |
openstackgerrit | Sergey Lukjanov proposed openstack/sahara: [WIP] Run scenario tests for the fake plugin in gate https://review.openstack.org/213546 | 16:04 |
openstackgerrit | Sergey Reshetnyak proposed openstack/sahara-image-elements: Install xfsprogs for ability to formatting volumes in XFS FS https://review.openstack.org/213768 | 16:04 |
openstackgerrit | Merged openstack/sahara-image-elements: Added ability to specify exact package versions for MapR https://review.openstack.org/202090 | 16:08 |
tmckay | egafford|afk, taking a peek at 143160 | 16:14 |
*** egafford|afk is now known as egafford | 16:16 | |
egafford | elmiko: We planning on doing an Importance field review later, once we've assigned status and milestone? Ideally, importance will guide next week's bugfix day. | 16:25 |
elmiko | egafford: i'm ok with making a second pass later in the week for importance. or even reviewing that we start grabbing bugs | 16:26 |
egafford | elmiko: (Being as we're in semi-headless mode, I suppose anyone could just start doing being the change they want to see in the world, but.) Yeah, sounds good. | 16:26 |
egafford | elmiko: I'll definitely want it to be filled and team-approved in the overwhelming majority of cases by Monday, one way or another. | 16:27 |
elmiko | egafford: fair, i'll make sure to followup at the end of each day on the etherpad. i will attempt to make guesses at importance for the bugs. | 16:27 |
elmiko | we can always re-assess if/when necessary | 16:28 |
egafford | elmiko: Yeah. Some lightweight process might be nice there. Maybe recording tentative importance on the etherpad, lazy consensus until Thursday or so? | 16:28 |
elmiko | egafford: +1, i'll make some notes as we go | 16:29 |
egafford | (Where anyone can make a first stab at importance, for discussion?) | 16:29 |
egafford | elmiko: Solid. | 16:29 |
elmiko | yea, i'll just put importance beneath the link, and if you have an idea put it there. i'll also note this in the pad | 16:29 |
egafford | elmiko: +1. | 16:30 |
elmiko | ok, updated the pad | 16:30 |
egafford | elmiko: \o/ | 16:30 |
egafford | elmiko: What's the Importance enum, for those of us who are +2-challenged? | 16:33 |
elmiko | sec | 16:33 |
egafford | elmiko: (Thanks!) | 16:33 |
elmiko | critical, high, medium, low, wishlist | 16:33 |
elmiko | added this to the pad as well | 16:35 |
*** ashishb has joined #openstack-sahara | 16:39 | |
*** vgridnev has joined #openstack-sahara | 16:44 | |
elmiko | egafford: thinking about this more, it makes sense to do two pass. | 16:46 |
egafford | Triage, then prio? | 16:46 |
elmiko | first we determine which are valid bugs, ie not incomplete or invalid | 16:46 |
elmiko | yea, then prio | 16:46 |
egafford | Agreed. | 16:46 |
elmiko | fair, i'm gonna target for starting the prio pass by wednesday evening | 16:47 |
egafford | I think a first pass as we go (marking a starting point on the pad) is still a good call, though, to give us a starting point. | 16:47 |
elmiko | unless we finish triage before then | 16:47 |
elmiko | egafford: +1 | 16:47 |
elmiko | if you have an idea about prio, definitely add a comment on the pad | 16:47 |
*** DWfuturetec has quit IRC | 16:47 | |
egafford | elmiko: +1 right back at you, buddy. :) | 16:47 |
elmiko | hehe | 16:47 |
*** sgotliv has quit IRC | 16:51 | |
*** DWfuturetec has joined #openstack-sahara | 17:04 | |
*** hdd has quit IRC | 17:07 | |
*** hdd has joined #openstack-sahara | 17:09 | |
egafford | vgridnev: Have you seen https://bugs.launchpad.net/sahara/+bug/1436425 happen? Noted that you changed the tag from [CDH 5.3.0] to [CDH]. | 17:13 |
openstack | Launchpad bug 1436425 in Sahara "[CDH] Too many connection" [Undecided,New] | 17:13 |
uvirtbot | Launchpad bug 1436425 in sahara "[CDH] Too many connection" [Undecided,New] | 17:13 |
uvirtbot | Launchpad bug 1436425 in sahara "[CDH] Too many connection" [Undecided,New] https://launchpad.net/bugs/1436425 | 17:13 |
vgridnev | same for cdh https://bugs.launchpad.net/sahara/+bug/1479666 egafford | 17:15 |
openstack | Launchpad bug 1436425 in Sahara "duplicate for #1479666 [CDH] Too many connection" [Undecided,New] | 17:15 |
uvirtbot | Launchpad bug 1479666 in sahara "Too many connections (dup-of: 1436425)" [Undecided,New] | 17:15 |
uvirtbot | Launchpad bug 1436425 in sahara "[CDH] Too many connection" [Undecided,New] | 17:15 |
openstack | Launchpad bug 1436425 in Sahara "[CDH] Too many connection" [Undecided,New] https://launchpad.net/bugs/1436425 | 17:15 |
uvirtbot | Launchpad bug 1436425 in sahara "[CDH] Too many connection" [Undecided,New] https://launchpad.net/bugs/1436425 | 17:15 |
egafford | Okay, so effectively the two bugs confirm one another. :) | 17:16 |
*** degorenko has quit IRC | 17:16 | |
*** IBerezovskiy has quit IRC | 17:19 | |
*** hdd has quit IRC | 17:23 | |
*** hdd has joined #openstack-sahara | 17:25 | |
egafford | vgridnev, SergeyLukjanov, et al.: Is this a Sahara issue or an MOS 7.0 issue? I'm seeing check_cinder pass in our CI, in, say, https://sahara.mirantis.com/logs/22/213622/1/check/gate-sahara-neutron-heat-vanilla_2.7.1-u14/8646c44/console.html. https://bugs.launchpad.net/sahara/+bug/1484535 | 17:29 |
openstack | Launchpad bug 1484535 in Sahara "In scenario tests cinder check fail with trace" [Undecided,New] - Assigned to Evgeny Sikachev (esikachev) | 17:29 |
uvirtbot | Launchpad bug 1484535 in sahara "In scenario tests cinder check fail with trace" [Undecided,New] | 17:29 |
uvirtbot | Launchpad bug 1484535 in sahara "In scenario tests cinder check fail with trace" [Undecided,New] https://launchpad.net/bugs/1484535 | 17:29 |
egafford | (Very odd issue; 'module' object has no attribute 'poll' in select, deep in tempest_lib.) | 17:30 |
vgridnev | egafford, i reproduce same on my macbook, btw it's not reproduced on ci | 17:32 |
egafford | vgridnev: Weird... are you running the upstream codebase (sahara from our tox) or a packaged MOS 7.0 product version? (Just wondering about the "on MOS 7.0" comment in the bug report, and how that would work in CI but not manually with the same code, especially given that the error seems to be pretty definitely saying that the select module, itself, is missing one of its key members.) | 17:35 |
tosky | vgridnev: if you can reproduce, can you please add the details about tempest-lib version etc? I asked something on the bug | 17:36 |
egafford | So odd... they'd have to overwrite select somehow... | 17:37 |
vgridnev | egafford, on upstream codebase, of cource | 17:37 |
egafford | vgridnev: Okay, good to know, just trying to imagine how this could even happen. Thanks very much for clarifying. | 17:37 |
*** DWfuturetec has quit IRC | 17:38 | |
egafford | vgridnev: If you could mark the bug as Confirmed and mark a possible Importance on the etherpad, that'd be really great. | 17:38 |
egafford | vgridnev: http://stackoverflow.com/questions/19740471/cannot-use-python-select-poll-in-mac-os | 17:41 |
egafford | Neat. | 17:41 |
egafford | Or at least, a very real possibility. | 17:41 |
vgridnev | ok, egafford could you please that as invalid with that link? | 17:42 |
egafford | Sure; at the very least, it's a bug in tempestlib that seems likely to only effect macs (and thus, not something to fix in Sahara itself.) | 17:43 |
vgridnev | agreed | 17:43 |
egafford | vgridnev: One imagines the CI lab is running on Ubuntu? | 17:43 |
tosky | vgridnev: do you know if esikachev use a Mac too? | 17:43 |
vgridnev | yep | 17:44 |
egafford | tosky, always with the reasonable QE questions... | 17:44 |
vgridnev | egafford, ci lab installed with ubuntu | 17:44 |
egafford | vgridnev: Cool; this diagnosis makes a lot of sense then. Thanks very much. | 17:45 |
tmckay | interesting devstack tidbit, glance failing to upload more images, although there is space. Restarted swift-proxy service, upload works. | 17:47 |
tmckay | I've run into this before but had not found a solution without ./unstack.sh ^^ | 17:48 |
tosky | egafford: oh, needs to head out in 2 minutes, but please keep an eye on https://review.openstack.org/#/c/212865/ - it escaped my testing | 17:52 |
egafford | tosky: Ack; do we want to make a RH bug about it now so we remember? I can provide that service. :) | 17:53 |
tosky | egafford: let's talk tomorrow! | 17:54 |
egafford | tosky: +1! | 17:54 |
*** tosky has quit IRC | 17:54 | |
egafford | elmiko: You're all Sparky. I'm not! What are your immediate thoughts on https://bugs.launchpad.net/sahara/+bug/1452127? | 17:55 |
openstack | Launchpad bug 1452127 in Sahara "Spark plugin does not pass JAVA_OPTS and configurations" [Undecided,New] | 17:55 |
uvirtbot | Launchpad bug 1452127 in sahara "Spark plugin does not pass JAVA_OPTS and configurations" [Undecided,New] | 17:55 |
uvirtbot | Launchpad bug 1452127 in sahara "Spark plugin does not pass JAVA_OPTS and configurations" [Undecided,New] https://launchpad.net/bugs/1452127 | 17:55 |
*** crobertsrh has left #openstack-sahara | 17:56 | |
tmckay | egafford, never implemented | 17:58 |
tmckay | just a change to the Spark EDP engine | 17:58 |
egafford | tmckay: So is this more of a feature request than a stark bug? | 17:58 |
tmckay | I would say so. It's not the case that it was meant to be there and is broken. | 17:59 |
tmckay | probably not too hard to add, could potentially land in L3 | 17:59 |
egafford | tmckay: Yeah, we have the technology to add two key-mapped configuration types, certainly. | 18:00 |
tmckay | it would be great if the reporter gave us an example -- I am unclear on what java_opts or configs to actually set for spark | 18:00 |
tmckay | or, how to verify that they actually work | 18:00 |
egafford | tmckay: You're core; could you set the milestone and status on that if you'd like to approve it for L3? | 18:01 |
egafford | tmckay: Also, what would be the fun in knowing what you're coding before you code it? ;) | 18:02 |
egafford | (I kid; I kid.) | 18:02 |
*** crobertsrh has joined #openstack-sahara | 18:02 | |
tmckay | ack, I'll set it and ask for an example | 18:02 |
egafford | tmckay: Perfect; thanks. | 18:02 |
elmiko | egafford: did you get everything sorted out? | 18:15 |
openstackgerrit | Sergey Reshetnyak proposed openstack/sahara-image-elements: Install xfsprogs for ability to formatting volumes in XFS FS https://review.openstack.org/213768 | 18:16 |
egafford | elmiko: Think so, yeah. Thanks. Need to take a triage-break for a little while and wrestle with TripleO. Are we aiming for total triage / prio by Wednesday? | 18:19 |
elmiko | egafford: goal is total triage/prio by friday | 18:20 |
elmiko | it would be cool if we could get total triage by wednesday eve., then prio on thurs/fri | 18:21 |
egafford | elmiko: Cool. Looks like we're 20/32 triaged now. :) | 18:21 |
*** ashishb has quit IRC | 18:21 | |
elmiko | egafford: awesome, thanks for keeping an eye on that | 18:21 |
egafford | elmiko: If we can get total triage and prio by Wednesday, though, I can get the bugfix etherpad sorted sensibly for our team meeting Thursday, and we can give everyone a little time to sign up for fixes with prio already assigned. Good stretch goal if we find that we're that awesome. :) | 18:22 |
elmiko | egafford: ack, that's a good goal to work towards | 18:23 |
egafford | elmiko: Seems like we're on track so far, though we may well lose some steam. | 18:23 |
elmiko | i figure we will, but i'll make a pass at trying to prio some of the stuff we have done | 18:24 |
egafford | elmiko: Happy to ride shotgun when you do. | 18:24 |
elmiko | ack | 18:24 |
elmiko | if we can make good use of the etherpad to take first stabs at the prio, we should be in good shape | 18:25 |
egafford | elmiko: Absolutely. We can firm up Thurs/Fri. | 18:25 |
elmiko | nice, apparently Alan Moore is teaming up for a new HP Lovecraft inspired comic book series | 18:26 |
egafford | elmiko: <3 | 18:26 |
tmckay | hey guys, need an opinion on "Spark cannot connect to separate HDFS cluster" (skipping bug number because I want the bots to be quiet) | 18:36 |
egafford | tmckay: :) | 18:36 |
elmiko | shoot | 18:36 |
elmiko | (i looked at that one a little too) | 18:36 |
tmckay | bug is against spark to hadoop 2.4.1 cluster | 18:37 |
tmckay | I just did spark to spark, and spark to Fedora vanilla hadoop 2.6, works fine | 18:37 |
elmiko | yea, i was curious about that as well | 18:37 |
tmckay | venza noted that cdh vs non-cdh might be an issue | 18:37 |
tmckay | also, this was spark 1.3.1 | 18:37 |
elmiko | hmm | 18:37 |
tmckay | so, given that hadoop 2.4.1 is deprecated, and spark version is new, and I didn't reproduce, I'm tempted to say "invalid" | 18:38 |
elmiko | maybe mark as incomplete with a request for more recent versions? | 18:38 |
elmiko | or yea, invalid with a note about versions | 18:38 |
egafford | In this case, if tmckay actively failed to repro with the newer, supported version, I think invalid works. The bug was actually really good about noting versions, so I'm not sure what other info it could provide if incomplete. | 18:39 |
tmckay | I suppose it could still be an issue for juno or kilo -- spark 1.3.1 is newish | 18:39 |
elmiko | yea | 18:39 |
tmckay | hmm, only question might be kilo | 18:39 |
tmckay | this could be a user that does not want to run off the tip | 18:40 |
egafford | tmckay: Ack. This feature was intended to function for Spark in Kilo, yes? | 18:40 |
elmiko | makes good sense | 18:40 |
tmckay | yeah, should theoretically always function | 18:40 |
tmckay | hdfs is hdfs | 18:40 |
egafford | tmckay: Invalid in master and moving it to kilo/stable makes sense to me then. | 18:41 |
tmckay | there was a bug with updating the /etc/hosts file, but if you reference by ip it should work | 18:41 |
tmckay | ok, I'll try a kilo devstack | 18:41 |
tmckay | user also notes though that the foreign hdfs didn't seem to be listening on 8020 or 9000, which sounds like the foreign hdfs was messed up to me ... | 18:42 |
tmckay | also, he tried to use floating ips | 18:43 |
tmckay | user could have messed up networking, in which case it's not going to work either | 18:43 |
tmckay | from January, I'm going to kill it | 18:44 |
*** DWfuturetec has joined #openstack-sahara | 18:44 | |
DWfuturetec | does anyone have proper nodegroup templates (master + worker) for Vanilla Apache Hadoop 2.6.0? I used the old templates from the quickstart guide (actually for 1.2.1), but I think that the master node doesn’t have „jobtracker“ as a process anymore | 18:46 |
elmiko | DWfuturetec: have you tried the default templates? | 18:47 |
elmiko | DWfuturetec: this may be of some help, http://docs.openstack.org/developer/sahara/userdoc/installation.guide.html#optional-installation-of-default-templates | 18:48 |
DWfuturetec | elmiko, thanks for the help | 18:48 |
DWfuturetec | i will try this | 18:48 |
elmiko | DWfuturetec: also, i have a utility in my github that i use to create 2.6.0 clusters. here is the relevant code for make them (it's python btw) | 18:49 |
elmiko | https://github.com/elmiko/psychic-dromedary/blob/master/psydr/cmds/cluster.py#L38 | 18:49 |
openstackgerrit | Merged openstack/sahara: Support manila shares as binary store https://review.openstack.org/204690 | 18:49 |
DWfuturetec | elmiko, Thanks - looks like a handy snippet | 18:50 |
elmiko | DWfuturetec: i use the python-saharaclient for creating the templates, but you could extract that dictionary into a json object if necessary | 18:51 |
tmckay | DWfuturetec, default templates should help, I just used them today :) Let us know if you have any trouble. | 18:55 |
DWfuturetec | elmiko, tmckay - thanks for now .. I’m on it - i will let you know if it worked | 18:56 |
* tmckay hopes so | 18:56 | |
tmckay | I wrote it :) default templates | 18:57 |
DWfuturetec | tmckay .. then there is no doubt, that it won’t work ;-) | 18:57 |
tmckay | with help on the actual templates, that is | 18:57 |
tmckay | heh, thanks for the vote of confidence | 18:57 |
elmiko | lol | 18:59 |
DWfuturetec | the floating_ip placeholder is replaced with e.g. the ext-net network ID ? (floating ip pool) | 19:00 |
elmiko | yes | 19:00 |
tmckay | yes, it is actually the uuid of the network | 19:00 |
elmiko | even though tmckay wishes it was the name ;) | 19:00 |
tmckay | we would like to find a nicer way to specify that | 19:00 |
elmiko | hehe | 19:00 |
tmckay | jinx | 19:00 |
elmiko | i owe you a beverage | 19:00 |
DWfuturetec | another thing: does to floating_ip pool need to be external IPs (let’s say „genuine ipv4“ addresses) or can I use an internal network for my cluster (simple 192.168.1.0/24 network) | 19:04 |
elmiko | you can use an internal network, it just needs to be something that sahara can request floating ip addresses on | 19:05 |
elmiko | it should be a network configured through neutron(or nova-net) as a floating ip pool network | 19:05 |
DWfuturetec | yes it is a neutron-based network with floating ips | 19:05 |
elmiko | ok, then you can just supply the uuid of that network | 19:06 |
*** hdd has quit IRC | 19:12 | |
*** hdd has joined #openstack-sahara | 19:15 | |
vgridnev | egafford, is this bug https://bugs.launchpad.net/sahara/+bug/1436425 just some problems with configuration of OpenStack? | 19:22 |
openstack | Launchpad bug 1436425 in Sahara "[CDH] Too many connection" [Undecided,Confirmed] | 19:22 |
uvirtbot | Launchpad bug 1436425 in sahara "[CDH] Too many connection" [Undecided,Confirmed] | 19:22 |
uvirtbot | Launchpad bug 1436425 in sahara "[CDH] Too many connection" [Undecided,Confirmed] https://launchpad.net/bugs/1436425 | 19:22 |
DWfuturetec | elmiko, I’m a little bit confused .. I’m trying to write my own .json templates without the python script or „sahara-templates“ cli tool .. in your cluster.py, you add „'net_id': mgmt_net,“ to the cluster template, the default template contains „"neutron_management_network": "{neutron_management_network}“,“ instead … which one is the right one and which network UUID do I use for a | 19:23 |
DWfuturetec | a mgmt network? | 19:23 |
elmiko | DWfuturetec: 1sec | 19:24 |
DWfuturetec | no hurry | 19:24 |
egafford | vgridnev: It could certainly be a MySQL misconfiguration. | 19:25 |
egafford | vgridnev: (It looks to me like it is a MySQL misconfiguration, in fact.) | 19:25 |
elmiko | DWfuturetec: so yea, the json template should be "neutron_management_network", "net_id" is something that the python client accepts | 19:26 |
egafford | vgridnev: But, it's been confirmed to happen by multiple sources, so getting to the bottom of root cause seems to still be a good idea, and that might be a non-trivial investigation, so it makes sense to me to keep it as a bug for next week. | 19:26 |
vgridnev | egafford, I didn't find anything except just increasing size of pool in mysql | 19:26 |
DWfuturetec | elmiko, ok thx .. and the mgmt network is just another floating-ip network? | 19:27 |
vgridnev | egafford, that's make sense, agreed | 19:27 |
egafford | vgridnev: I agree that that's probably going to be the fix. :) | 19:27 |
elmiko | DWfuturetec: the management network should be a network that will allow the controller to talk with the cluster | 19:27 |
elmiko | DWfuturetec: usually, in devstack, the management network is fixed IPs (or the "private" net), and the floating_ip network is the "public" network | 19:28 |
egafford | vgridnev: Might be an easy one; might spiral out into crazy puppet module madness. We'll see. | 19:28 |
DWfuturetec | elmiko, well I used the example architecture for my test envorinment, which basically means there is this 10.0.0.0/24 management network between all nodes - but this (of course) won’t show up in neutron, because it’s manually configured within the distribution | 19:29 |
elmiko | DWfuturetec: hmm, not sure how to deal with that. usually the mgmt net would be something defined with the networking service in openstack | 19:30 |
DWfuturetec | elmiko, i will have a look in the documentation - maybe I missed something during the network setup | 19:31 |
elmiko | DWfuturetec: have you played around with devstack? | 19:31 |
DWfuturetec | elmiko, just started it up once .. i soon went to build my own test environment (example architecture on 5 machines, controller/network/compute/block/object1+2) | 19:33 |
*** hdd has quit IRC | 19:33 | |
elmiko | DWfuturetec: ok, i'll share a screenshot of what my networks look like | 19:33 |
DWfuturetec | elmiko, thanks that would be great! | 19:33 |
*** hdd has joined #openstack-sahara | 19:33 | |
elmiko | DWfuturetec: https://mimccune.fedorapeople.org/openstack_networks.png | 19:38 |
elmiko | DWfuturetec: that's what my networks look like, the "public" network is what i use for managment and the "private" network is what i use for floating_ip_pool | 19:38 |
elmiko | the private network is defined only in my demo project | 19:38 |
DWfuturetec | elmiko: okay I will try to configure neutron like this | 19:39 |
elmiko | same with the router too | 19:39 |
elmiko | DWfuturetec: good luck! | 19:39 |
elmiko | does this, https://bugs.launchpad.net/sahara/+bug/1426398 , look like a wishlist to anyone else? | 19:49 |
openstack | Launchpad bug 1426398 in Sahara "Current anti-affiity only allows instances equals to number of hypervisors" [Undecided,New] | 19:49 |
uvirtbot | Launchpad bug 1426398 in sahara "Current anti-affiity only allows instances equals to number of hypervisors" [Undecided,New] | 19:49 |
uvirtbot | Launchpad bug 1426398 in sahara "Current anti-affiity only allows instances equals to number of hypervisors" [Undecided,New] https://launchpad.net/bugs/1426398 | 19:49 |
egafford | elmiko: Hard to call it a stark bug, but it does sound like we can do better (esp. in small private clouds where hypervisor count might not be huge.) | 19:51 |
elmiko | egafford: i'm just trying to figure out what hints we observer now for the schedulers. seems like we have something, also this sounds like a request for an improvement as opposed to an outright bug | 19:52 |
egafford | Also, elmiko, added tag-style statuses to bugs a few minutes ago (tried to reorg bugs into headings, but that trashed all pad history.) Feel free to remove or ask me to do so if it feels cluttering. Yeah, seems like a feature request to me, regardless of impl, unless what we claim we're doing in our docs or specs is Just Wrong. | 19:53 |
elmiko | egafford: ack, i'll look at the pad | 19:54 |
elmiko | egafford: looks good, thanks | 19:55 |
elmiko | man bug triage pwned my day =( | 19:59 |
egafford | elmiko: Oh, totally. | 19:59 |
egafford | I have done almost nothing else. | 19:59 |
egafford | Complete pwnage. | 20:00 |
egafford | At least we've been pwnt for The Community. :) | 20:00 |
crobertsrh | bah, horizon github is down | 20:16 |
egafford | crobertsrh: Bah! | 20:17 |
crobertsrh | I did spend 10 min thinking it was something I did, but I doubt I took out the whole repo. | 20:17 |
egafford | crobertsrh: If you did, it's an impressive feat. | 20:18 |
crobertsrh | true | 20:18 |
DWfuturetec | i just tried to start my cluster, but it failed with „floating ip pool not found“ .. just to be clear -> as floating_ip_pool I use the ID of the network or the subnet ID? how are floating and non-floating networks defined anyway? DHCP + allocation pool disabled/enabled? | 20:35 |
elmiko | DWfuturetec: the uuid as listed by neutron | 20:35 |
egafford | DWfuturetec: The floating_ip_pool should be a public network uuid (not subnet, or private network.) | 20:36 |
DWfuturetec | elmiko: e.g. neutron net-list outputs both .. ID for the network itself + ID for the subnet | 20:36 |
elmiko | i think you just want the ID for the network | 20:37 |
egafford | DWfuturetec: +1 to elmiko on that. | 20:37 |
DWfuturetec | egafford: well I could use my ext-net .. problem is my provider won’t give me a bigger subnet .. I just have 3 floating IPs available in the pool | 20:37 |
DWfuturetec | maybe I could define another „external“ network with floating-ips which doesn’t use public accessible IPv4 addresses | 20:38 |
egafford | DWfuturetec: ...Ah, I see. You can designate one node as a public gateway to your cluster, I know. I haven't worked with that setup extensively, but it's probably a good way to get around your limitation. | 20:40 |
elmiko | DWfuturetec: +1, that would be one way to go. make a truly private network that you control | 20:40 |
DWfuturetec | egafford: elmiko: thanks .. i think i just need to create the already existing networks (10.0.0.0/24 for management, etc.) within neutron again, so I can access them from within - the nodes itself already are communicating over this network and are using another vm as gateway to „outer space“ | 20:42 |
*** crobertsrh is now known as _crobertsrh | 21:33 | |
openstackgerrit | OpenStack Proposal Bot proposed openstack/sahara: Updated from global requirements https://review.openstack.org/212281 | 21:44 |
elmiko | egafford: is this something we would likely port to juno? https://bugs.launchpad.net/sahara/+bug/1433401 | 21:46 |
openstack | Launchpad bug 1433401 in Sahara "In stable/juno branch, cluster launch failed" [Undecided,New] | 21:46 |
uvirtbot | Launchpad bug 1433401 in sahara "In stable/juno branch, cluster launch failed" [Undecided,New] | 21:46 |
uvirtbot | Launchpad bug 1433401 in sahara "In stable/juno branch, cluster launch failed" [Undecided,New] https://launchpad.net/bugs/1433401 | 21:46 |
*** uvirtbot has quit IRC | 21:50 | |
egafford | elmiko: If it is actionable, it may well be something that can only be fixed by a non-cherry-pick commit to juno. | 21:56 |
egafford | elmiko: Depends on the root cause and whether it's still active in master. | 21:57 |
*** DWfuturetec has quit IRC | 21:57 | |
egafford | elmiko: I don't know that it's necessarily a candidate for the L bugfix cycle, but we may want to shove it in there anyway, just so it gets some attention. | 21:59 |
elmiko | egafford: ok, thanks. i just wanted to make sure we are still in the window for fixing juno stuff | 22:04 |
openstackgerrit | OpenStack Proposal Bot proposed openstack/sahara: Updated from global requirements https://review.openstack.org/212281 | 22:06 |
egafford | elmiko: Yeah, I don't believe Juno's EOL; it's only at 2014.2.3. By precedent, it's got at least 6 months and 1 release left in it. | 22:07 |
elmiko | egafford: ack, i'll try to replicate this | 22:08 |
openstackgerrit | Sergey Lukjanov proposed openstack/sahara: [WIP] Run scenario tests for the fake plugin in gate https://review.openstack.org/213546 | 22:10 |
openstackgerrit | Sergey Lukjanov proposed openstack/sahara: Remove never executable code from devstack plugin https://review.openstack.org/213895 | 22:18 |
*** vgridnev has quit IRC | 22:23 | |
*** vgridnev has joined #openstack-sahara | 22:23 | |
*** vgridnev has quit IRC | 22:24 | |
*** chlong has quit IRC | 22:34 | |
openstackgerrit | OpenStack Proposal Bot proposed openstack/sahara: Updated from global requirements https://review.openstack.org/212281 | 22:41 |
openstackgerrit | Merged openstack/puppet-sahara: Remove Sqlite validation for database_connection https://review.openstack.org/213650 | 22:59 |
*** AndreyPavlov has quit IRC | 23:19 | |
*** AndreyPavlov has joined #openstack-sahara | 23:19 | |
*** tiny-hands has joined #openstack-sahara | 23:20 | |
openstackgerrit | Sergey Lukjanov proposed openstack/sahara: [WIP] Run scenario tests for the fake plugin in gate https://review.openstack.org/213546 | 23:27 |
*** egafford has quit IRC | 23:40 |
Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!