21:00:50 #startmeeting Scientific-SIG 21:00:51 Meeting started Tue May 25 21:00:50 2021 UTC and is due to finish in 60 minutes. The chair is b1airo. Information about MeetBot at http://wiki.debian.org/MeetBot. 21:00:52 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 21:00:54 The meeting name has been set to 'scientific_sig' 21:00:57 time for another exiting Scientific ... what Blair said :) 21:00:59 morning 21:01:26 i'm in a work call with Stig at the moment, so we'll be a bit tardy.. 21:02:14 roger 21:02:25 anything new for our HPC friends? 21:02:36 I saw Belmiro during the last Open Infra Live 21:02:57 https://www.youtube.com/watch?v=yf5iFiCg_Tw 21:03:27 the topic was very relevant to our crowd: " Upgrades in Large Scale OpenStack Infrastructure" 21:07:09 uh oh, time for a Zoom update 21:07:17 #chair martial 21:07:18 Current chairs: b1airo martial 21:08:12 not sure if that is a good sign :) 21:08:32 I haven't seen Belmiro's talk yet, but based on Tim's Twitter feed I have an idea of the content :-) 21:08:43 ;) 21:09:20 it was good, and the questions relevant 21:09:41 anybody else had a chance to view it and/or had comments? 21:10:07 g'day 21:10:54 b1airo: hey 21:11:02 o/ 21:11:17 deja vu :-) 21:15:29 What is the status of our plan for ISC ? 21:15:44 Hi martial, didn't spot you there. 21:15:50 It's true, there's something going ahead. 21:16:16 #chair oneswig 21:16:17 Current chairs: b1airo martial oneswig 21:16:55 I haven't checked what needs to be done but a ticket needs to be organised for one. 21:17:15 I got the email a while back but no action since so unclear 21:17:19 We had a really good panel of speakers involved too IIRC 21:17:41 It's the fatigue with online conferences. 21:18:00 same 21:19:52 welcome Tim 21:20:23 Been having fun with OVN recently 21:20:39 in a "actually not fun" sense 21:20:42 * trandles reads "OVN" and starts twitching 21:22:47 Hardware offload of security groups. It's neat, but requires the latest Mellanox firmware. Too recent if you bought your NICs from an OEM 21:24:20 There's a "seamless" trap to software if you can't offload the flow onto the NIC. RDMA even works - at a small number of gb/s 21:25:46 OVS is like nft - one of those hard-to-learn things you only ever have to learn when something's broken. It doesn't set you up to want to explore it's many treasures. 21:25:58 Or SELinux perhaps ;-) 21:27:09 "treasures" indeed : 21:30:36 Had another interesting issue today with Slurm automation - setup would mysteriously fail in the playbooks, but work if the steps were repeated manually. Turns out the systemd service for slurmctld in OpenHPC does not wait for the service to complete startup. If your config file takes 4s to read and process, other daemons would out-race it and barf on its absence. Gah. 21:31:03 Easily fixed if you wait_for it to bind. 21:31:40 To be fair, my OVS+OVN+HA (no DVR) deploy of RHOSP went very smoothly and is working well. But it is a lot to grok if you've never dug into this stuff before. 21:32:49 I had never dug into it before. I was very happy it "just worked" because then I could start playing with things and watching the results to learn WTF is actually going on in there. 21:33:03 sounds like systemd doing its job then oneswig :-o 21:33:30 OVN does work and is simpler than the OVS Hybrid driver. It's just the contortions that optimisation puts us through. 21:34:22 A colleague once gave a summer intern the project to "serialize systemd services." /o\ 21:34:25 trandles: if you're using it with Ironic you've probably had fun integrating VLANs, metadata and FIPs into OVN together. 21:34:53 oneswig: not doing that fortunately 21:36:15 trandles: is the networking flat for your Ironic deploy? 21:36:20 yes 21:36:45 certainly keeps things simple 21:37:47 Ironically simple... 21:39:09 martial: regarding ISC - yes, I received a note on that last week that I haven't processed yet. has been near the top of my todo list for last couple of weeks to contact the panel and try to get some conversation going and/or setup a pre-meeting ... 21:41:07 I think a pre-meeting is good, although likely hard to schedule. Even a group email thread would help 21:41:32 Ah, kkillsfirst is here. Welcome to the LANL summer intern who will be working on kexec in Ironic. 21:41:46 Wow! Hi kkillsfirst, welcome 21:41:52 Great project trandles 21:41:59 thx oneswig 21:42:28 Will there be a public document at the end of the internship? 21:42:52 How can it be summer already? It hasn't stopped raining for weeks. 21:43:35 Hello, Glad to be here helping. 21:43:50 I'm really hoping by the end of the summer the functionality is merged and documented. That's the goal at least. 21:43:59 Good plan 21:44:28 Julia and her team are chomping at the bit to get Kam started. Probably next week after the obligatory week of LANL hazing. 21:44:39 I mean, training. 21:45:55 I'm sure. 21:46:50 How much deploy time do you think can be saved? 21:47:21 depends a lot on how long it takes to go back through the BIOS 21:47:43 I have some 2TB nodes that take nearly 20 minutes to start booting 21:47:45 of course. A long and arduous journey 21:48:07 ouch 21:50:43 It should be a lot of gain, even doing all the shutdown-style things you have to do before the kexec. Unmounting network filesystems, etc...you do that anyway. For the stateless HPC compute node case you don't really care to shut down many services gracefully. It'll be unmount, downlaod kernel+ramdisk, kexec-pivot_root and go. 21:51:46 Do you think it could work with trusted boot? Perhaps a long shot. Not even sure why I'm asking 21:52:49 Haven't thought about that much... 21:53:15 It would definitely work the way we do things today. We've tested it manually just to see what happens. 21:53:48 Way back in the day, I worked on a LinuxBIOS equivalent for AlphaServers, replacing the BIOS with an embedded Linux kernel. I was always deeply skeptical about chainloading kernels. I want to state for the record I was probably overly skeptical and wrong :-) Only took me 20 years. 21:55:14 :D Interesting. Ahead of your time. 21:55:15 I'm going to have to run...another meeting in 5 minutes. Anyway, kkillsfirst is also on the openstack-scientific-sig slack. 21:55:31 Cool, see you around Kam! 21:55:36 Hopefully more updates to come during the summer :) 21:56:33 In fact we are nearly out of time. 21:57:06 b1airo: can you mail folks for the ISC BoF? That would be a great help 21:57:35 yep absolutely - can do that todaay 21:58:12 Thanks Blair 21:59:20 I'm on vacation next week (woohoo), much though I enjoy yappin' on IRC it aint happening 21:59:37 Enjoy vacation oneswig 21:59:52 i'm not jealous... 21:59:54 Thanks trandles :-) 22:00:04 See you later everyone. I hope to get a tiny cluster working at home this week. :) 22:00:14 b1airo: summer's on the way 22:00:24 Nice kkillsfirst. See y'all next time. 22:00:25 kkillsfirst: nice to meet you. Good luck! 22:00:39 martial: time to close 22:00:47 yep 22:00:57 #endmeeting