Re: Persistent Objects Using SQL

Darren Duncan Sat, 29 May 2010 23:34:26 -0700

Stevan Little wrote:

On May 29, 2010, at 11:20 PM, Darren Duncan wrote:
2. Besides the ability to introspect or perform powerful searches onyour objects using SQL/etc, I see another big advantage of usingdatabase storage without serialization as portability. You can haveapplications written in different programming languages sharing thesame database and the same objects, because they don't containPerl-specific data formats.
KiokuDB mostly uses JSON and JSPON as the storage format, which is notPerl specific. The serialization format we store in is dependent on theMoose class definition, so in that way it is not terribly portable.

An advantage of not using serialization like JSON, but rather storing eachobject attribute as a database member attribute, is that the DBMS itself canthen most easily be defined to enforce the consistency of your objects, sosomeone accessing the database by some way other than KiokuDB, or using a buggyversion of KiokuDB, is less likely to be able to corrupt the data. As for howto get the database to do that, one general answer is CHECK constraints, thoughthat is a fallback to where terser/simpler kinds of constraints don't do the job.

A relational database can map to an object structure of any languagefairly easily. Add attributes/columns for mutually heterogeneousdata, like when you would add object attributes, and add tuples/rowsfor mutually homogeneous data, like when you would use arrays or sets.
And then you get the impedance mismatch. You are ignoring inheritance,which is not really possible in a relational model.

I wasn't ignoring inheritance, but rather was just being terse by givingexamples rather than every relevant detail.


As for inheritance, a relational model can handle that just fine.

You also have several options for how to lay it out, depending on what you'regoing for.

One option in the general case is to have a distinct database relvar/table pereach instantiatable class, which has one attribute/column per class attribute,plus an extra attribute/column to hold an ID value for the object. When a classcomposes a role or inherits from a class, the attributes defined in the othersplus those defined directly in the first class would each have a correspondingattribute in the relvar/table attribute/column, so that each attribute of theobject of that class has a place to be stored. And so, when multiple classescompose the same attributes, their corresponding relvars/tables all havecommon-named/typed attributes/columns corresponding to said.

Another option in the general case is to also have a database relvar/table foreach role or non-instantiatable class as well, which is then the only one havingthe attributes/columns that the corresponding declares, and then therelvars/tables mentioned in the previous paragraph then wouldn't have these butinstead would have matching ID values to relate records in them to ones in theothers.

Generally speaking, with the exception perhaps of Moose classes where everysingle object can have different names or kinds etc of attributes, rather thanthose being class-defined, I would think the best design is for the database tohave exactly the same granularity of component data as the Moose objects do.Just where each object can have different attributes, then the database couldprobably be designed like a key-value store, but that's less ideal.

One should think about the database schema like they think about their code. Itis just as reasonable to change the schema as it is to change what classes youhave or what attributes they have. The schema *is* code, and the data it holdsis like objects of classes. No more, and no less.

Remember, objects are graphs not sets of tuples.

And graphs can be represented as sets of tuples, such as where tuples have 2attributes that name connected graph nodes. For that matter, objects only*represent* graphs themselves.

Now, all that I've had to say here isn't meant to diminish that the JSONserialization approach is useful and probably a best fit for many usage scenarios.

But at the same time, relational databases are very powerful and theirstrengths, of ensuring that data is consistent and making it easier to search,should be utilized, where it makes sense to do so. Using a relational database,without exploiting the features that make them uniquely powerful, is likewasting the tools you have.

Perhaps a reasonable analogy is people who use Perl 5 but write their Perl codeas if they were using Perl 4, and were faking references rather than using realreferences, structures, and objects. Sometimes I think of that when I hear ofpeople just dumping objects as a serialized string in a relational database.


-- Darren Duncan

Re: Persistent Objects Using SQL

Reply via email to