On 07.05.2012 at 20:13, Tom Christie wrote:
> Hey Piotr,
> Here's a few comments...
> You have 'fields' and 'exclude' options, but it feels like it's missing
> an 'include' option - how would you represent serializing all the
> fields on a model instance (without replicating them), and
> additionally including one other field? I see that you could do that
> by explicitly adding a Field declaration, but 'include' would seem
> like an obvious addition to the other two options.
By default all model fields will be serialized, plus any fields added by
explicit Field declarations. If 'fields' is set, then only the fields listed
in 'fields' will be serialized, again together with the explicitly declared
fields. Too many "fields" :). If 'exclude' is set, then all model fields
except those listed in 'exclude' will be serialized, plus the explicitly
declared fields. I think it works like the ModelForm declaration. Am I
missing some case?
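To make the rules concrete, a rough standalone sketch (the function and
variable names are only illustrative, this is not the proposed API):

def select_fields(model_fields, declared, fields=None, exclude=None):
    # model_fields - names taken from the model definition
    # declared     - names added by explicit Field declarations
    # fields       - optional whitelist applied to the model fields
    # exclude      - optional blacklist applied to the model fields
    selected = list(model_fields)
    if fields is not None:
        selected = [name for name in selected if name in fields]
    if exclude is not None:
        selected = [name for name in selected if name not in exclude]
    # Explicitly declared fields are always serialized on top of the above.
    return selected + [name for name in declared if name not in selected]

model_fields = ['id', 'username', 'email', 'password']
print(select_fields(model_fields, declared=['full_name'],
                    fields=['username', 'email']))
# -> ['username', 'email', 'full_name']
print(select_fields(model_fields, declared=['full_name'],
                    exclude=['password']))
# -> ['id', 'username', 'email', 'full_name']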
> I'd second Russell's comment about aliases. Defining a label on the
> field would seem more tidy.
> Likewise the comment about 'preserve_field_order'. I've still got this
> in for 'django-serializers' at the moment, but I think it's something
> that should become private. (At an implementation level it's still
> needed, in order to make sure you can exactly preserve the field
> ordering for json and yaml dumpdata, which is unsorted (determined by
> Python's dict key ordering).)
I answered Russell about that.
> Being able to nest serializers inside other serializers makes sense,
> but I don't understand why you need to be able to nest fields inside
> fields. Shouldn't serializers be used to represent complex outputs
> and fields be used to represent flat outputs?
At first I thought a Serializer should be tied to an object (one Serializer =
one object). But then I figured out that a Serializer can work with the object
passed in from the upper-level Serializer (so a 'source' field isn't needed).
Maybe nested serializers plus flat fields is the better approach. I must
consider this.
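What I mean, as a rough standalone sketch (these classes are made up for
illustration, not the proposed API):

class OwnerSerializer(object):
    def serialize(self, owner):
        return {'username': owner.username, 'email': owner.email}

class BookmarkSerializer(object):
    nested = {'owner': OwnerSerializer()}

    def serialize(self, bookmark):
        data = {'url': bookmark.url}
        for name, serializer in self.nested.items():
            # The parent resolves the attribute and hands the related
            # object down, so the nested serializer needs no 'source'.
            data[name] = serializer.serialize(getattr(bookmark, name))
        return data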
The "class_name" option for deserialization is making too many
assumptions. The class that's being deserialized may not be present
in the data - for example if you're building an API, the class that's
being deserialized might depend on the URL that the data is being sent
too. eg "http://example.com/api/my-model/12"
I wrote about class_name in my answer to Russell. If the model class is in
the url then we can do something like this:

serializers.deserialize("json", data_from_response,
    deserializer=UserSerializer(class_name=model_from_url(url)))
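model_from_url here is only a hypothetical helper; one way it could look,
assuming a simple slug-to-model mapping:

from django.db.models import get_model

URL_TO_MODEL = {
    # purely illustrative mapping from URL slug to (app_label, model_name)
    'my-model': ('myapp', 'MyModel'),
}

def model_from_url(url):
    # 'my-model' from "http://example.com/api/my-model/12"
    slug = url.rstrip('/').split('/')[-2]
    app_label, model_name = URL_TO_MODEL[slug]
    return get_model(app_label, model_name)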
> In your dump data serializer, how do you distinguish that the 'fields'
> field is the entire object being serialized rather than the 'fields'
> attribute of the object being serialized?
fields = ModelFieldsSerializer(...) will be fed with the object to serialize
and the name 'fields'. I'm only interested in the output from it. It must be
a native Python datatype, and I do something like

serialized_dict['fields'] = output_of_model_fields_serializer

ModelFieldsSerializer knows what to do with the object.
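Roughly like this (a standalone sketch, not the actual DumpDataSerializer
code):

def serialize_model_fields(obj):
    # Stand-in for ModelFieldsSerializer: turns the object's own
    # fields into a plain dict.
    return {'username': obj.username, 'email': obj.email}

def serialize_dumpdata_entry(obj):
    return {
        'model': 'auth.user',
        'pk': obj.pk,
        # The whole object goes in; only the nested output comes back.
        'fields': serialize_model_fields(obj),
    }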
> Also, the existing dumpdata serialization only serializes local fields
> on the model - if you're using multi-table inheritance only the
> child's fields will be serialized, so you'll need some way of handling
> that.
> Your PKFlatField implementation will need to be a bit more complex in
> order to handle eg many to many relationships. Also, you'll want to
> make sure you're accessing the pk's from the model without causing
> another database lookup.
Thanks for pointing that out. I have to think about it.
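For the record, the two cases I will have to cover, sketched with a
hypothetical Entry model that has an 'author' ForeignKey and a 'tags'
ManyToManyField:

def serialize_related(entry):
    return {
        # entry.author_id (the field's attname) already holds the raw pk,
        # so no extra query is needed for the ForeignKey.
        'author': entry.author_id,
        # Many-to-many needs a query anyway, but values_list avoids
        # instantiating the related model instances.
        'tags': list(entry.tags.values_list('pk', flat=True)),
    }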
> Is there a particular reason you've chosen to drop 'depth' from the
> API? Wouldn't it sometimes be useful to specify the depth you want to
> serialize to?
Sometimes, maybe. But in most cases, no, and there are other ways to do it.
In my opinion going (globally) more than one level deep will almost never be
needed, and if you need to go deeper in only one field (or a few, but not
all), 'depth' is unusable.
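What I mean by going deeper in only one field, sketched by hand (the model
and field names are only an example):

def serialize_comment(comment):
    return {
        'body': comment.body,            # flat
        'post': comment.post_id,         # flat (just the pk)
        'user': {                        # deeper, for this one field only
            'username': comment.user.username,
            'email': comment.user.email,
        },
    }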
> There are two approaches you can take to declaring the 'xml' format for
> dumpdata, given that it doesn't map nicely to the json and yaml
> formats. One is to define a custom serializer (as you've done), the
> other is to keep the serializer the same and define a custom renderer
> (or encoder, or whatever you want to call the second stage). Of the
> two, I think that the second is probably a simpler, cleaner approach.
> When you come to writing a dumpdata serializer, you'll find that
> there's quite a few corner cases that you'll need to deal with in
> order to maintain full byte-for-byte backwards compatibility,
> including how natural keys are serialized, how many to many
> relationships are encoded, how None is handled for different types,
> down to making sure you preserve the correct field ordering across
> each of json/yaml/xml. I *think* that getting the details of all of
> those right will end up being awkward to express using your current approach.
> The second approach would be to use a dict-like format that can easily be
> encoded into json or yaml, but that can also include metadata specific
> to particular encodings such as xml (or perhaps, say, html). You'd
> have a generic xml renderer, that handles encoding into fields and
> attributes in a fairly obvious way, and a dumpdata-specific renderer,
> that handles the odd edge cases that the dumpdata xml format requires.
> The dumpdata-specific renderer would use the same intermediate data
> that's used for json and yaml.
I can't agree with that. The differences between the existing xml and json
serializer output formats are too big. There is a field 'fields' in json and
'field' in xml; xml has attributes and json does not. That much is only
presentation, and these two cases could be handled in the second phase (in a
renderer). But there is one big difference - xml has additional fields 'to',
'rel', 'type', and these are not presentation. They are information.
The next (and maybe most important) thing to consider is what the user should
have to know about formats to be able to serialize his data. In your approach
the user has to be familiar with, for example, SimplerXMLGenerator, because
if he wants this xml:
<object>
    <item>...</item>
    <item>...</item>
</object>
and this json:
{
    "items": [..., ...]
}
then he must write at least one renderer to transform 'items' into 'item',
like you did in DumpDataXMLRenderer in django-serializers. I can't accept
that. Don't get me wrong, I have adopted a lot of your ideas from
django-serializers and I think it is a very good project. But you shouldn't
force users to know anything about generating xml or any other format. Maybe
you should create some metalanguage for the user to describe what he wants,
like:
"I want the field 'items' to be transformed to 'item' in xml (but I
don't know how to do it)" ->
class DumpDataSerializer(ModelSerializer):
    """
    A serializer that is intended to produce dumpdata formatted structures.
    """
    renderer_options = {
        'xml': {'transform': {'fields': 'field'}},
    }
It's ugly but I hope you understand my idea.
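To show what I have in mind, a rough sketch of a generic renderer that could
apply such a 'transform' option (the function and option names are invented
here, not part of any existing code):

from xml.sax.saxutils import escape

def render_xml(data, transform=None, root='object'):
    # Renders a dict of values/lists to xml, renaming keys per 'transform',
    # so the user never touches the xml generation itself.
    transform = transform or {}
    parts = ['<%s>' % root]
    for key, value in data.items():
        tag = transform.get(key, key)
        items = value if isinstance(value, list) else [value]
        for item in items:
            parts.append('<%s>%s</%s>' % (tag, escape(str(item)), tag))
    parts.append('</%s>' % root)
    return ''.join(parts)

print(render_xml({'items': ['a', 'b']}, transform={'items': 'item'}))
# -> <object><item>a</item><item>b</item></object>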
> I hope all of that makes sense, let me know if I've not explained
> myself very well anywhere.
> Regards,
> Tom
--
Piotr Grabowski