18:00:07 #startmeeting sahara 18:00:11 Meeting started Thu Jul 20 18:00:07 2017 UTC and is due to finish in 60 minutes. The chair is tellesnobrega. Information about MeetBot at http://wiki.debian.org/MeetBot. 18:00:12 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 18:00:15 The meeting name has been set to 'sahara' 18:01:55 o/ 18:03:11 waiting a couple more minutes 18:05:19 #topic News/Updates 18:05:50 right 18:05:55 I'm working on ambari images now, and fixing some environment issues, other than that I'm dealing with some downstream stuff 18:06:40 cool, i think you've reviewed all the patches i have up right now, those two SIE things, plus the dashboard thing. Other than that I will have another SIE patch very soon for adding the S3 hadoop jar 18:06:54 awesome 18:06:58 how is that integration going? 18:07:32 uh, i'd feel more comfortable with landing everything else (jb, data store, client work, dashboard work) in queens 18:08:05 but adding it to the images, and providing some reminder in doc about how to run job against s3 manually would be nice 18:08:16 we can start merging small stuff now 18:08:29 yep 18:08:45 aiming to have a bigger window in queens 18:09:01 please keep me up to date on that 18:09:04 btw, will add the s3 jar to the new-style image gen as well, once you merge that 18:09:04 yep 18:09:13 will keep you up to date on everything 18:09:13 cool 18:09:41 thanks 18:09:50 this is very important 18:10:42 that it's it for the "news" topic I think 18:10:49 cool 18:10:51 lets move on 18:11:05 #topic Open Discussion 18:11:13 will skip PTG this time, there isn't updates on that 18:11:20 yep 18:11:53 first thing is, do we want to try to get cdh plugin upgrade done? 18:12:33 shuyingya was working on 5.10 back in april: https://review.openstack.org/#/c/456546/ but now we are on to 5.12 18:12:58 hmm, it was part of the plan yes, shuyingya was supposed to make that happen 18:13:40 5.12? 18:13:51 5.12 came out in june I think 18:14:08 maybe we should try to merge 5.10 for pike and we upgrade to 5.12 in queens 18:14:16 what do you think? 18:14:25 it is very late to start a new patch now 18:14:27 at this point, it's the same amount of work to get 5.10 ready vs 5.12 ready 18:15:03 even considering shuyingya's patch? 18:15:21 ok, it's slightly less work to 5.10 because the patch is there 18:15:30 but it is mostly just changing some numbers around for 5.12 18:15:47 not to mention we still have to do sahara-side change for either version 18:16:05 ok 18:16:17 it makes sense to do to 12 18:16:30 hi esikachev 18:16:33 hi 18:16:35 sorry 18:16:43 hey esikachev, no worries 18:17:04 jeremyfreudberg, are you saying that you are willing to to do the work? /me hopes so 18:17:27 I mean, do you have the time? 18:17:31 tellesnobrega, I will try. and ping you if I end up too busy 18:17:44 jeremyfreudberg, thanks, let me know, I can help out for sure 18:18:22 luckily it easy enough -- shuyingya did some refactoring to cut down on the work 18:18:29 well, not too easy, but easier 18:18:56 cool 18:18:58 that's all I have to say about that topic, we can move on to something else 18:19:10 yeah, we have to formally thank him for the refactoring 18:19:15 cool 18:19:29 i have to step out for 1 minute, I will be back asap 18:19:33 my boss is being my boss 18:19:36 esikachev, do you have any topics in mind to discuss? 18:19:39 tellesnobrega: do you have any updates about hosts for sahara-ci? 18:19:41 yes) 18:19:45 jeremyfreudberg, no worries 18:20:24 esikachev, so about that, we don't have any resources yet 18:20:37 it's sad :( 18:20:47 it is 18:21:14 tosky is looking into adding some tests into the infra tests 18:21:32 at least for vanilla 18:21:32 ok 18:21:47 tellesnobrega, I am back 18:21:48 in the mean time we are looking for resources so we can have a full CI 18:21:48 then I have not any topics 18:22:39 cool 18:22:56 also, I don't have an answer from tsp 18:23:29 that is also sad, let me know when you get some help 18:23:34 some info 18:23:38 ok 18:24:21 tellesnobrega, I can see if my team https://massopen.cloud might have some resources, but really no promises 18:24:23 for ci i mean 18:24:39 jeremyfreudberg, that would be great 18:24:53 I don't take as a promise :) 18:25:52 we need around 64gb 18:25:58 it's min 18:26:29 esikachev, how many machines would be ideal for the CI? 18:26:40 64gb ram per machine? 18:26:45 which machines? 18:26:45 just so I can understand 18:26:48 or just a very big 64gb machine? 18:26:57 it's for devstack 18:27:13 because we have a huge cdh clusters 18:27:25 it is huge :( 18:27:30 "huge" 18:27:32 :) 18:27:34 yep 18:28:03 i will think about restructurisation of mechanism of ci 18:28:20 maybe, we can use less resources 18:28:28 that would be cool, if you could present us the desired architecture 18:28:42 i will try 18:28:48 one more thing, on the topic of testing 18:29:01 we still need to manually check ironic integration still works 18:29:04 before pike relase 18:29:19 jeremyfreudberg, true, i had forgot about that 18:29:28 do you have plans on that? 18:29:56 i don't really have plans 18:30:38 certainly after FF 18:31:00 tellesnobrega: btw, if you will have a tasks for development in sahara, i can help 18:31:21 jeremyfreudberg, yes. I will take a look into how to deploy fake ironic instances and see if I can test it 18:31:36 tellesnobrega, yep, fake ironic was my thought too 18:31:45 esikachev, any task? 18:31:51 we can sync and try to get this going fast 18:32:03 jeremyfreudberg, did you think cdh upgrade as well? 18:32:24 you mean for esikachev to help with that? 18:32:30 if he wants to 18:32:41 yeah 18:32:52 were you thinking into some other task? 18:32:55 sometimes i have a free time 18:33:01 i can 18:33:40 resolving the concerns in https://review.openstack.org/#/c/333273/ ? 18:34:14 I also filed these two bugs https://bugs.launchpad.net/sahara/+bug/1705335 and https://bugs.launchpad.net/sahara/+bug/1705037 lately 18:34:15 Launchpad bug 1705335 in Sahara "[SIE] Cannot build image with oracle jdk" [Undecided,New] 18:34:16 Launchpad bug 1705037 in Sahara "[UI] Caching can be improved" [Undecided,New] 18:34:22 but yes, CDH is the priority 18:34:37 (or rather, plugins upgrading in general) 18:35:06 but i have not any resources for devstack. only my macbook :) 18:35:16 jeremyfreudberg, yes 18:35:19 but 16gb :) 18:35:48 esikachev, if I could suggest I would recommend upgrade cdh to 5.10 and 5.12 18:36:01 yep 18:36:05 most of that work can be done without devstack 18:36:16 ok 18:36:19 thanks 18:36:55 esikachev, when I ask about new CI resources, I can also ask about donating some resources for your general use too 18:36:57 but no promises 18:37:25 thanks, it will be good 18:38:26 I have other topic to discuss, I hit an interesting issue yesterday, I was trying to run a job that its binary was "large", around 115MB 18:38:31 we can use 64gb labs, but we can get large queue of patches 18:39:53 two problems that I saw, first is that it timed out to copy the jar file. This can be solved by changing the timeout 18:40:24 tellesnobrega, is this internal db? or swift? 18:40:43 but the bigger problem here is, sftp.write() from paramike takes a long time to write files when the files get bigger (not sure if linear) 18:40:46 swift 18:41:06 yes, I suggested adding nightly jobs for testing this case 18:41:20 tellesnobrega, i see 18:41:45 looking around, the solution can be increasing the sftp window size 18:41:46 something like this https://review.openstack.org/#/c/367959/ 18:42:32 tellesnobrega: swift will only return success after the data has been fsync'd, so larger objects will take longer to write than smaller objects (regardless of differences in network transfer) 18:43:05 tellesnobrega, I think the real solution is actually something else 18:43:14 notmyname, I don't think the problem was from swift 18:43:24 so right now we have binary retriever inside of sahara itself 18:43:24 https://github.com/openstack/sahara/blob/master/sahara/service/edp/binary_retrievers/internal_swift.py#L36 18:43:26 the problem is when sahara writes the files to the remote isntance 18:43:36 actually we should not do the retrieving inside of sahara 18:43:47 we should instead retrieve it from inside the cluster itself 18:43:55 but that is a bigger discussion 18:44:25 jeremyfreudberg, maybe a bigger discussion (will add to the queens ptg etherpad) 18:45:08 tellesnobrega, yep 18:45:17 one thing that I tried was using sftp.put instead of write and the transfer goes a lot faster 18:45:30 have to check with sahara, just tried a dummy example 18:46:07 cool, let me know what happens 18:46:23 I will create a bug on this 18:47:24 great 18:48:42 anything else to discuss? 18:48:55 not from my side 18:48:59 you? esikachev? 18:49:20 from me too 18:49:26 that's it for mee 18:49:29 *me 18:50:17 ok, thanks guys for showing up, turned out the meeting was better than expected 18:50:27 yep, very good meeting 18:50:31 jeremyfreudberg, I will take a look into those bugs 18:50:37 thanks 18:50:43 keep an eye for the one I'm filing 18:50:59 yep, i already added a small reference to it on the etherpa 18:51:00 d 18:51:02 so i don't forget 18:51:05 thanks esikachev for helping out 18:51:10 cool 18:51:11 thanks 18:51:14 thanks esikachev! 18:51:15 np) 18:51:40 lets keep up the good work :) 18:51:47 #endmeeting