21:00:13 <timburke_> #startmeeting swift 21:00:13 <openstack> Meeting started Wed Jan 20 21:00:13 2021 UTC and is due to finish in 60 minutes. The chair is timburke_. Information about MeetBot at http://wiki.debian.org/MeetBot. 21:00:14 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 21:00:16 <openstack> The meeting name has been set to 'swift' 21:00:26 <timburke_> who's here for the swift meeting? 21:00:31 <seongsoocho> o/ 21:00:31 <kota_> hi 21:00:33 <seongsoocho> hi 21:00:39 <rledisez> o/ 21:00:45 <mattoliverau> o/ 21:01:51 <timburke_> as usual, the agenda's at https://wiki.openstack.org/wiki/Meetings/Swift 21:01:55 <timburke_> first up 21:02:03 <timburke_> #topic reconciler pipelines 21:02:15 <timburke_> #link https://bugs.launchpad.net/swift/+bug/1910804 21:02:17 <openstack> Launchpad bug 1910804 in OpenStack Object Storage (swift) "Encryption doesn't play well with processes that copy cleartext data while preserving timestamps" [Undecided,New] 21:03:00 <acoles> o/ 21:03:18 <timburke_> i just wanted to give an update on how we (nvidia) are addressing it, and get input on what we (as a community) want to do 21:04:16 <timburke_> nvidia is addressing it as an operations issue -- we wrote bad pipelines before, and now we'll have hardcoded reconciler pipelines that more-closely match what's in upstream docs 21:04:56 <timburke_> of course, there's still the upstream patch i wrote to forcibly remove some middlewares from the reconciler pipeline 21:05:02 <timburke_> #link https://review.opendev.org/c/openstack/swift/+/770522 21:05:25 <timburke_> ...but i'm not sure whether that's necessarily the direction we should be going 21:07:24 <acoles> we should probably add some doc/comment in reconciler sample conf to emphasise what not to do in the pipeline (and why) 21:07:58 <timburke_> does anyone have opinions on whether there should be new code to try to prevent the situation, or if it's purely a documentation issue? 21:08:42 <zaitcev> I'm bothered by the fact that middlewares are no longer true middlewares if the core code knows about their intrinsic characteristics and modifies the pileline on the basis of the middleware names. 21:08:59 <zaitcev> Why are they middlewares, then? Might as well re-format them as modules. 21:09:28 <acoles> I guess there are (at least) 3 possible actions: 1. add doc 2. check pipeline and log warnings 3. modify bad piplines 21:10:10 <zaitcev> I'd just dump it on docs, or better yet etc/object-server.conf-sample should have a working pipeline 21:10:33 <zaitcev> And write a comment "do not insert symlink, dlo, slo" here. 21:11:12 <rledisez> I would also add a note in the changelog to emphase that operator must fix their configuration (not sure anyone read documentation diff before an upgrade) 21:11:22 <acoles> zaitcev: it's container-reconciler.conf-sample 21:12:17 <acoles> rledisez: +1 21:13:01 <timburke_> all right, that souds like a plan then. i'll work on getting a docs/sample conf patch together, and plan on calling it out in the next release's changelog. thanks! 21:13:19 <timburke_> #topic ssync and non-durable frags 21:13:26 <timburke_> acoles, how's it going? 21:14:08 <acoles> we're deploying a fix this week: https://review.opendev.org/c/openstack/swift/+/770047 21:14:45 <acoles> and hoping to observe some 'stuck' handoff partitions get sync'd and cleaned up 21:15:21 <timburke_> anything else you need o that for right now? reviews, i suppose? 21:15:27 <acoles> (the problem being non-durable EC fragments preventing handoff partitions being deleted) 21:15:37 <acoles> reviews always welcome 21:15:51 <zaitcev> maybe you could comment on the review with the outcome 21:16:12 <acoles> yes, and/or here next week 21:17:42 <timburke_> sounds good 21:17:53 <timburke_> #topic shard cleanup 21:18:32 <timburke_> mattoliverau, clayg, acoles i think you were all pushing on this a bit over the last week 21:18:37 <mattoliverau> We have 2 patches going in 2 different directions. 21:18:38 <timburke_> how's it going? 21:19:01 <mattoliverau> https://review.opendev.org/c/openstack/swift/+/770529 and https://review.opendev.org/c/openstack/swift/+/771086 21:19:26 <mattoliverau> the problem is this only really happens because autosharding + shrinking doesn't exist 21:20:09 <mattoliverau> so the first potential fix is patch one, that inserts a kind of poison pill, or rather the delete ts of the root when the root 404s 21:20:15 <mattoliverau> so they can get cleaned up 21:20:47 <mattoliverau> but the second path, simply makes reclaiming a deleted root container not possible until it has no shards 21:21:19 <mattoliverau> the second means we could have shards hanging around, but gets us closer to the indended solution 21:21:25 <acoles> the second is a pre-requisite for most other solutions IMHO 21:21:35 <mattoliverau> +1 21:21:54 <acoles> i.e. avoid orphaned shards, then figure out how to delete them so the root can be deleted 21:21:55 <mattoliverau> So I've been playing with acoles' compact shrinking patches 21:22:08 <acoles> but dealing with orphans will be assisted by still having the root around 21:22:27 <acoles> mattoliverau: yeah. that's like the third piece of the story :) 21:22:46 <mattoliverau> So an op can do something about collapsing the empty shards back into the root so they can be reclaimed.. this shrinking code also is in the sharder for when we get to autosharding so it'll happen automagically at some point 21:23:18 <timburke_> i think i like that part the most ;-) 21:23:19 <acoles> https://review.opendev.org/c/openstack/swift/+/765623 -> gives us a tool to get rid of empty shards 21:23:42 <acoles> but again, relies on the root still existing! 21:23:51 <timburke_> will that work against a db that's been marked deleted? have we tested that yet? 21:23:54 <mattoliverau> So I have a feeling maybe option 2 is best because it gets us further down to correct path. 21:24:06 <mattoliverau> yup 21:24:10 <timburke_> cool! 21:24:13 <acoles> I'm almost there on https://review.opendev.org/c/openstack/swift/+/765623 but have one piece left to write 21:24:20 <mattoliverau> well, I think.. that's what I've been testing in my SAIO 21:24:47 <acoles> timburke_: good question, need to add that test 21:25:10 <timburke_> sounds like good progress. anything else you guys need input on there? 21:25:13 <zaitcev> Apropos sharding, I'm going to enable it on Train. So, I'm going to propose a few backports. 21:25:16 <acoles> oh, cool, so mattoliverau did you try than manually? 21:25:37 <acoles> that* 21:25:40 <zaitcev> I don't have +2 on stable branches, so I'll have to ask to approve. 21:26:06 <acoles> timburke_: reviews always welcome :) 21:26:08 <zaitcev> Sorry, I thought you moved on from the orphan shards discussion. 21:26:11 <mattoliverau> acoles: the compact worked to get it down to 1, it's the final collapse I need to test ;) 21:26:12 <timburke_> zaitcev, cool! let us know how it goes. how big are the containers you're looking at, out of curiosity? 21:26:23 <mattoliverau> so your last peice as you will :) 21:26:31 <zaitcev> timburke_: it's the 300GB container that I mentioned. 21:26:46 <acoles> mattoliverau: the final collapse won't work yet, give me another 24 hours, but did you try it on a deleted root? 21:26:47 <timburke_> ah, right! that does sound familiar 21:27:06 <mattoliverau> yeah 21:27:11 <acoles> super 21:27:20 <zaitcev> timburke_: A national phone provider has tens of millions of phones archive their text messages once a day. And all of the objects have expiration tags, so the expirers delete tens of millions of objects each day. 21:27:31 <zaitcev> And all of that in 1 container 21:27:49 <zaitcev> I cannot understand who thought it was a good idea, and how it works at all. They started in Queens. 21:28:10 <acoles> zaitcev: do you know if the object names are monotonically increasing? 21:28:46 <acoles> as in new objects append to end of sorted listing 21:28:49 <zaitcev> acoles: I think the are "username.timestamp.gz" so they should be increasing. 21:29:20 <zaitcev> acoles: wait, nm. User 'a' has all of its archives ahead of user 'z'. 21:29:30 <timburke_> well, at least there will be *some* load-spreading across the shards 21:29:34 <zaitcev> I need to look for sure. 21:29:35 <mattoliverau> I wonder if adding a vaccum facility to containers would be useful for cases like that too. 21:29:54 <zaitcev> Oh it would. Bu 21:29:56 <mattoliverau> rebalacne replicaiton would be a pain 21:30:28 <zaitcev> t by the time I got there, sqlite vacuum would 100% fail with a segfault on that container. Not even running out of RAM, just segfault. 21:30:50 <mattoliverau> lol, well thats useful :P 21:31:14 <acoles> I had this idea once that we could use the fresh db with new epoch trick to, well, get a fresh root db (without any change to shards), but that was just a wild thought 21:31:34 <acoles> just copy what we care about to a fresh db and ditch the old 21:32:29 <zaitcev> In this particular case I calculated the number of objects and it was something like only 17 bytes per object in the .db, maybe 170. I don't think vacuuming was going to help my case. 21:33:02 <timburke_> all right, one last topic for updates 21:33:06 <timburke_> #topic relinker 21:33:38 <timburke_> rledisez, thanks for reviewing https://review.opendev.org/c/openstack/swift/+/769855 ! i pushed up a new patchset to roll in the test changes you recommended 21:34:47 <rledisez> timburke_: yeah, i reviewed it just before the meeting. code looks good, just a concern that should be easy to fix I think. seems almost good to go 21:35:11 <timburke_> thanks! 21:35:33 <timburke_> i haven't gotten back to https://review.opendev.org/c/openstack/swift/+/769632 (better-handle unmounted disks) yet, but hope to this week. i'm not so sure about the patch as written, since it'll log the warning but not change the exit code (which seems like it's still a bit dangerous) 21:35:44 <timburke_> that's all i've got 21:35:50 <timburke_> #topic open discussion 21:35:58 <timburke_> what else should we bring up this week? 21:37:41 <zaitcev> I said my piece about sharding, albeit out of turn. 21:38:09 <rledisez> concerning p 769632 I think the solution is in the way progress is provided to the operator (either recon or a swift-object-relinker status) that would tell "on this device that is in the ring, there is no state file, it's suspect" 21:38:24 <zaitcev> Also, I'm still cracking my head about why Romain's patch to delete Queue fails. It now fails with connect() returning EBADF which is just absurt. 21:39:30 <timburke_> seems to still point to some wires getting crossed down in eventlet :-/ 21:40:15 <zaitcev> Oh, yeah. I started looking at S3 v4 signatures not working. 21:40:18 <timburke_> rledisez, good thought -- i'll see what i can do with the pre-device hook 21:40:30 <zaitcev> No progress but I'm amazed that nobody noticed. 21:40:41 <zaitcev> All users just silently back down to v2, I guess. 21:41:08 <timburke_> zaitcev, is it something involving unsigned payloads, aybe? i know there was a regression i accidetally introduced a bit ago... lemme find the patches... 21:41:51 <zaitcev> No, or, I don't know. In my case I only ever did GET and they work for some names but not the others. 21:42:06 <zaitcev> Almost like even-length name is okay 21:42:47 <zaitcev> I need to dig deeper though. It certainly reproduces for me, using s3cmd. I need to re-try with Boto too. 21:42:53 <timburke_> huh. any non-ascii in the names, maybe? 21:43:32 <zaitcev> Nope. Just names like s3cmd ls s3://test-1235163301 21:43:49 <timburke_> (regression i was thinking of was caused by https://review.opendev.org/c/openstack/swift/+/767644, fixed by https://review.opendev.org/c/openstack/swift/+/770004) 21:44:20 <zaitcev> I'll test, thanks a lot. 21:44:56 <acoles> speaking of s3, I noticed some pipeline checks were not actually being executed in s3api middleware: https://review.opendev.org/c/openstack/swift/+/771467 21:45:30 <zaitcev> it was on my list to review 21:45:36 <acoles> thanks 21:47:47 <timburke_> all right, looks like we can end a bit early 21:48:06 <timburke_> thak you all for coming, and thank you for working on swift! 21:48:11 <timburke_> #endmeeting