21:00:44 #startmeeting scientific-sig 21:00:45 Meeting started Tue May 12 21:00:44 2020 UTC and is due to finish in 60 minutes. The chair is oneswig. Information about MeetBot at http://wiki.debian.org/MeetBot. 21:00:46 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 21:00:48 The meeting name has been set to 'scientific_sig' 21:01:16 #chair b1airo martial 21:01:17 Current chairs: b1airo martial oneswig 21:01:21 hey guys 21:01:26 o/ 21:01:35 How's things? 21:02:13 Doing ok myself, you? 21:02:38 having a good one today. Performance analysis, when it yields something, is fun. 21:03:18 looking into gitlab for swagger/open api usage ... maybe to create a new project for the P2302 effort 21:03:36 As in the NIST cloud federation work? 21:04:17 I believe I had a bit of a breakthrough today 21:05:31 Was struggling to come up with a way to effectively use all the bandwidth to cephfs, I think using a combination of DVR, BGP dynamic routing, and IPv6 gets me there 21:06:02 well the CFRA is out, NIST SP500-332 21:06:07 Does IPV6 work with the other two jmlowe? 21:06:13 right now we are looking at the IEE side 21:06:20 Yes, I believe it does 21:06:30 Just checking :-) 21:06:33 and we want to define a minimalistic API definition 21:06:38 so OpenAPI compatible 21:06:48 and open to all to contribute 21:07:09 added bonus is that floating ip address get the full compute node bandwidth and can bypass the network node 21:07:09 so was thinking gitlab since they have an OpenAPI viewer 21:07:17 for ipv4 21:07:54 (brb) 21:08:29 jmlowe: weren't you using linuxbridge on Jetstream or is this a new project? 21:09:36 yes we are using linuxbridge, and dvr now works with linuxbridge, I'm thinking about ways to reinvent Jetstream 21:10:48 Sorry I'm late. Prior call ran long. 21:10:57 Sounds good jmlowe. I'm wary of DVR but perhaps that's undue 21:11:05 ditto, hello all 21:11:14 hi trandles rbudden, glad you could make it :-) 21:11:22 rbudden do you guys use dvr with linuxbridge? 21:11:26 yes 21:11:34 dvr_noext 21:12:10 since we’re mostly provider networks in our one cloud 21:12:48 With linuxbridge it's relatively simple, proxy arp and dnat in the network node network ns, compute node gets snat rules and dumps things out onto the provider network 21:14:32 a bit of asymetric routing, throw in bgp dynamic routing and you can move the dnat to the compute node 21:14:48 In a parallel universe, I've been learning much about OVS hardware offloads and how to understand their dark magic. 21:15:26 Anything useful to report from your trips to that netherworld? 21:15:42 Yes. Don't ever go there, unless performance makes you do it :-) 21:17:28 I've been going the opposite direction, looking at BGP as an elegant weapon for a more civilized age 21:17:37 Using SRIOV I'm getting line rate from the 100G NIC, but currently only one of a bonded pair attached to my OVS bridge. OVS flow rules imply I should be using both bonded links, active-active. The silicon's not quite on the same page, but I am so close... 21:18:12 jmlowe: BGP to the hypervisor - cool stuff. What do you need to do to make it work? 21:20:15 afaik just the bgp speaker agent from neutron 21:21:02 What does the network fabric look like? 21:22:56 Well, that seems to be left as an exercise for the user 21:23:45 ... you get your users to cable up their own networks? :-) 21:24:48 I think you can use frrouting and vxlan to publish routes to the computes and keep the tenants non the wiser 21:26:47 I'd be very interested to hear how you get on with that. The layer-2 equivalent will look like a poor country cousin once you're done 21:28:19 I'm having an interesting MLAG issue currently, trying to get ingress traffic to use both ports instead of 95% of it coming to one port. 21:32:51 SYN 21:33:03 which lacp mode are you using? 21:33:51 lacp layer 3+4 - should be doing the right thing. I wonder if the hardware hashing isn't including source IP and MAC, that might be the only uniqueness between flows. 21:37:57 I should check, does anyone have any points to raise - agenda items, workshops, etc? 21:43:18 Tim? 21:43:37 Not I, still holding out hope the Berlin Summit might happen 21:43:51 same 21:44:01 jmlowe: +1 21:44:52 For the ex-Vancouver PTG, we should think of those upstream activities we always wanted to do, but never had the time. 21:45:01 I'd say something about the DOE Workflow summer seminar series but the schedule hasn't been posted yet. 21:45:17 so Stig, you are looking to do RTX 2080 passthrough? 21:45:29 Eventually at https://wowoha.org 21:45:57 martial: asking for a friend, really. 21:46:31 Coincidentally I think we'd just did the setup to get them working today. 21:46:40 oh that's cool 21:46:53 Big doubts that Berlin happens. Why SC20 hasn't gone virtual yet I don't know. There's about 0.1% chance I'm going anywhere near Georgia before I can get a vaccine. Plus, with COVID-19, it's worse now. 21:48:18 this is a little sad in truth 21:48:33 I don't see how they can actually pull of a virtual SC20 21:49:15 I don't think they even put the keynotes online from SC19 did they? I remember looking for them some weeks later and didn't find it. 21:50:04 Nope and I even emailed the organizers asking about that. They said to keep checking back but the videos would be posted. I never found them. 21:50:30 lol trandles ! 21:50:51 perhaps someone forgot to hit record on the day :-) 21:51:03 technology is hard 21:51:25 Anyway, SC has some work to do to catch up with the online world. 21:53:02 It'll be interesting to see how ISC20 does online 21:53:24 I heard mixed reviews for Red Hat's summit 21:53:28 an unrelated question - anyone claim to be a slurm aficionado? i'm investigating consistently high "reserved time" and wondering if it is something to do with all the array jobs on this machine 21:53:36 Sounds like the system was overwhelmed somewhat 21:54:28 b1airo: not an aficionado to that degree, sorry 21:54:39 b1airo, I wouldn't saw aficionado but I can put you in touch with our slurm gurus 21:56:09 thanks trandles - i might reach out if i don't hear back from my usual suspects (one local and one at NERSC). on the other hand I could probably just submit a support ticket and then blog about it 21:59:56 Nearly at the hour - anything more to add? 22:00:41 OK y'all, have a good day 22:00:45 #endmeeting