19:05:40 #startmeeting marconi 19:05:41 Meeting started Thu Mar 7 19:05:40 2013 UTC. The chair is kgriffs. Information about MeetBot at http://wiki.debian.org/MeetBot. 19:05:42 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 19:05:44 The meeting name has been set to 'marconi' 19:06:11 #topic Process update 19:07:41 So, we've been hard at work setting up tasks and adding blueprints to organize everything that needs to be done before the Portland summit 19:08:15 #info Smaller work items are tracked in Trello: https://trello.com/b/7NLODgbr 19:08:38 o/ 19:09:05 for larger work items/complicated stuff we will continue adding blueprints in the next couple of weeks. 19:09:14 flaper87: welcome! 19:09:40 * flaper87 thought the meeting was at 21:00 his time..... 19:09:57 so, what's the current topic? 19:10:04 process update 19:11:27 also, I just want to let everyone know we now have a HACKING file and there is enough code in the repo for folks to start getting the lay of the land. 19:11:49 * flaper87 dances 19:11:50 anyway, does anyone have any questions about the current state of Marconi? 19:12:21 o/ 19:12:33 about the message post api 19:12:50 * flaper87 wonders if that's off-topic 19:13:09 no worries, let's move on 19:13:40 #topic API - Post Message 19:13:57 cool 19:14:00 so 19:14:03 #info API working draft - https://etherpad.openstack.org/queuing-api 19:14:57 the current spec says that when a message is posted, marconi returns an empty response with the Location header! But, when multiple messages are posted (bulk) it just doesn't return anything 19:15:08 hopefully I didn't miss-read any of that 19:16:05 so, That doesn't feel right and I think we should give a consistent response either case 19:16:06 well, as written, it will return just the location of the messages resource 19:16:15 agreed 19:16:21 here's my thought 19:17:08 What about *always* returning a list of dicts containing the location of the messages and other information (some metadata related to the message)? OR 19:17:15 just returning a list of ids / locations 19:19:06 thoughts? 19:19:15 so, the list of id's would be returned in the same order as messages posted? 19:19:16 I think it would be good to get the IDs of all the messages posted, one way or another 19:19:32 edsrzf: +1 19:19:36 or does that matter? 19:19:51 kgriffs: mmh, good point, I think that matters 19:19:52 I mean 19:20:06 it doesn't but having that small "coolness" is certainly good 19:20:34 but, clients should already know in what order messages where inserted 19:20:39 since they did the request 19:21:06 I think returning the list of ids is good enough 19:21:14 an alternative to returning the ids in the body would be something like 19:21:17 I can't think, right now, about a case where we'd need more info 19:21:33 If the list is unordered, how can the client associate an ID with a message? Or are you saying you don't think that's necessary? 19:21:39 the idea of just returning the ids is to save bandwith 19:21:40 Location: .../messages/id1,id2,id3 19:22:40 edsrzf: the list wouldn't be un-ordered but I don't think the association is an issue in that case 19:23:10 kgriffs: Then that would mean the GET endpoint would need to support that syntax, too, correct? 19:23:20 kgriffs: and will it be possible to call that url? I like that Location format and I've used it in other projects with good results 19:23:20 bryansd: correct 19:23:44 bryansd is faster than me 19:23:58 I think it would be interesting to use that syntax for bulk queries across multiple queues as well, but that's a discussion for another day 19:24:13 yep 19:25:37 so, ordered list of ids? 19:25:44 #startvote return id's in header or body? Header, Body 19:25:44 Begin voting on: return id's in header or body? Valid vote options are Header, Body. 19:25:45 Vote using '#vote OPTION'. Only your last vote counts. 19:26:09 #vote Body 19:26:36 #vote Header 19:26:49 #vote Header 19:27:10 (in the name of reducing payload) 19:27:33 * flaper87 assumes we all agreed on returning a list of ids 19:27:47 flaper87: agreed 19:28:20 so, we will return a list of IDs, the question is do we do it in the body as a document or in the Location header 19:28:24 bryansd: mmh might be, but I think that headers are not meant to be used like that 19:28:50 they should contain parameters need for the communication but not the actual response 19:28:57 well, if we were trying to return more than the id, I would agree 19:28:57 anyway, I'm ok with it as well 19:29:11 so, endvote? 19:29:29 actually, it is Best Practice™ to return the location of the newly created resource. 19:29:37 #endvote 19:29:38 Voted on "return id's in header or body?" Results are 19:29:39 Body (1): flaper87 19:29:40 Header (2): bryansd, kgriffs 19:29:42 kgriffs: agreed on that 19:30:13 so, that would be a list of , separated ids ? 19:30:20 I think we can list the IDs in the URI and if we ever decide we need to return more data than that, we can also add a body 19:30:29 that will need to be splited in the client side ? 19:30:39 kgriffs: +1 19:30:43 for teh uri stuff 19:30:55 kgriffs: place an #agree on that 19:31:02 :) 19:31:07 i guess so. We could define a URI template that would allow the client to do know what to expect and parse it. 19:31:47 * flaper87 doesn't know how to do the f#@$@$ TM symbol 19:32:35 And with regards to ordering, we ARE guaranteeing messages are posted in the order received, correct? With the stipulation it's possible other clients may have inserted their own messages in the middle. 19:33:00 #agree Response to Post Message will be the full path to the message or messages created 19:33:27 bryansd: we guarantee the order but not that the'll be sequential 19:33:27 #agree Extend messages URI template to allow for multiple message_id's 19:34:10 flaper87: Nice verbiage :-) 19:34:28 * flaper87 deserves a pop-tart 19:34:50 So the list of IDs returned should be in the correct order as well 19:34:55 bryansd: I imagine we can guarantee that the node will send the messages to storage in the order received, but I'm not sure that means a lot since messages can be posted in parallel. 19:35:37 kgriffs: bryansd yes, what really matters there is that we respect the order of the current requests 19:35:51 although, that node should be able to return the list of ids to the client in the same order they were in the body of the POST 19:35:57 the rest is up to the god of the Race Conditions! 19:36:18 OK, so I guess we need to specify that in the transport blueprint 19:36:27 kgriffs: +1 19:36:43 * flaper87 will add that to the base class as well 19:37:08 flaper87: do you want to take care of the blueprint or shall I? 19:37:20 kgriffs: I'll take care of that as well 19:38:02 #action flaper87 to update Transport blueprint and base class with info about returning ids in same order that messages are written to storage 19:38:07 * flaper87 ™ <- cool 19:38:33 * bryansd gives flaper87 a Pop-Tart™ 19:38:55 ok, moving on? 19:38:57 yes 19:39:08 so, same topic, but different tangent 19:39:26 Any objections to allowing per-message TTLs? 19:39:47 And if the TTL is not specified, it defaults to the setting in the queue metadata? 19:40:07 that sounds good to me 19:40:11 so no objections so far 19:40:12 I was thinking about that. Just wondering if that's adding an extra layer of complexity to a v1.0 release. Any compelling reason we need it? 19:40:36 mostly to be consistent with the way claims work 19:40:56 bryansd: thinking about how the current implementation is moving on, that'd be really easy to implement 19:40:57 although, we could just say for both claims and queues, you set the global value and can't change it per-claim or per-message. 19:41:46 so, a few "implementative" words about that (just to be sure it makes sense to have it in v1) 19:42:49 In order to post a message we need to have the queue where it will be posted (kind of model like) which should (i guess it depends on the storage backend as well) contain the per-queue ttl already 19:43:05 that would allow us to manage per-message ttl when it is not specified 19:43:50 so, I guess what I'm trying to say is that at the end, we'll just manage per-message ttl and putting it in the queue meta or directly in the message just changes the level where the parameter is stored 19:43:54 does that make sense? 19:44:46 the element that expires is the message, not the queue 19:45:24 again, that could definitely vary when it comes to the final implementation (which involves the storage backend) 19:45:35 thoughts? 19:46:03 it seems to me that assuming the same TTL for all messages in a given queue might lead choices in the storage driver implementation that make it hard to do different TTLs for each message later on. 19:47:02 re storage model, I think "queues" are really just a metadata collection, indexed by queue name. 19:47:37 kgriffs: wow, you expressed that way simpler than what I did 19:47:45 kgriffs: +1, that's what I meant 19:47:48 :P 19:47:50 sweet 19:48:43 we need to be careful that we don't tightly couple the abstract notion of a "queue" to a physical resource that is expensive to create/manage, since we want to support gazillions of queues. 19:48:56 anyway, that's a discussion for another day. 19:49:08 kgriffs: agreed 19:49:27 Okay, I was only concerned with getting to a minimum viable product. Sounds like it won't be too much of a concern. 19:49:44 #topic API - Create Queue - Metadata Format 19:50:17 so, couple of questions I had on this 19:50:37 shot 19:50:42 first, do we allow arbitrary metadata with some space limit (say, 64K) 19:51:18 second, do we say that only simple key-value's are allowed, or are nested docs OK? 19:52:59 so, first, I honestly don't think that arbitrary metadata is needed in the queue 19:53:43 mmhh, wait 19:53:56 no no, I guess it's not needed 19:54:30 Is there a case where a client could need to store arbitrary metadata to a queue for later use? 19:54:42 I guess I'm the Debbie Downer of features but I vote no. Agree with flaper87 19:55:09 Well, metadata can be stored out-of-band by the app that's using the queue, I suppose. There may be a useful reason for it, but we can add it later if the need arises. 19:55:25 +1 for adding it later based on demand 19:55:43 Do Swift containers allow arbitrary metadata? 19:55:55 * kgriffs can't remember 19:56:03 kgriffs: if there are use cases for that, then cool!!! +1 for adding it later 19:56:13 mmh, don't remember either 19:57:06 mhh, wait again 19:57:18 I think it could be useful 19:57:21 for example 19:57:39 http://docs.rackspace.com/files/api/v1/cf-devguide/content/Update_Container_Metadata-d1e1900.html 19:58:10 thinking about SNS, if a message get's published to a specific queue and it has to be notified to X plugin based on queue metadata (filtering or whatever) 19:59:08 I now think it is useful. It isn't for the *actual* work the queue should do but it could be for other stuff 19:59:33 I guess my point is, TTL is probably just the beginning of other stuff we want to store, so if we are building a mechanism for storing multiple config items, essentially, it is trivial to allow arbitrary metadata 19:59:51 indeed 20:00:29 I agree on allowing arbitrary metadata 20:00:36 I guess, as long as we build a JSON schema that can allow some kind of nested docs or we just use X-headers like Swift 20:00:47 dunno if it is worth it to add a limit on the size of it 20:01:01 and I agree on allowing nested objects as well 20:01:39 well, it could be really high, we just need to sanity-check it so someone isn't upload a few gigs worth of metadta 20:02:02 lol 20:02:23 re nested objects, that may change the way the controller base classes are defined - maybe should take a single dict or something 20:02:34 * flaper87 thinks about someone saving a movie in a last_seen metadata field 20:03:04 kgriffs: why should it change it? 20:03:13 it all goes inside the same metadata field 20:03:15 #agreed The tentative plan is to allow custom metadata, as long as it isn't much more work than just doing TTL. 20:03:24 well, does metadata need to be a kwargs? 20:03:50 kind of, I think of metadata as a dictionary of key-object 20:04:03 I guess you would decompose the top-level keys in the document. 20:04:37 so, messages={"ttl":30} 20:04:41 or something like that 20:04:54 well, looks like we are about out of time. 20:05:19 lets discuss about this in the next meeting 20:06:44 final thing 20:07:41 looking at the etherpad, which document type do you guys prefer as the body of the PUT? 20:07:56 (not type, schema) 20:09:09 * flaper87 looking 20:09:13 I like the nested messages -> ttl 20:09:37 That way if we have more messages-related metadata we throw it in that collection 20:09:49 And we could make another similar section for claims if need be 20:10:25 yeah, I guess that's kind of how metadata will be stored as well 20:10:48 something like "queues have a message field" 20:10:52 messages* 20:11:02 where all messages configs are stored 20:11:06 OK, well, sleep on it and let me know in openstack-marconi 20:11:12 * kgriffs needs to run 20:11:15 kk 20:11:22 thanks everyone 20:11:27 thanks! 20:11:34 * flaper87 thanks everyone 20:11:49 #endmeeting