Bart,

On 26 May 2014, at 13:03, Bart Vandewoestyne <bart.vandewoest...@telenet.be> wrote:
> On 2014-05-18 16:11, Hugh Williams wrote:
>> Hi Sumit
>>
>> Ah, didn't realise you started with a v6 row store database, thus how
>> did you upgrade to v7 ?
>>
>> As if you simply check pointed and restarted the v6 database with a v7
>> binary then it would be automatically upgraded but all the tables
>> including the RDF_QUAD tables and indexes would remain in the original
>> row store mode.
>>
>> There are two options for converting the RDF_QUAD table to column store
>> which are:
>>
>> 1. To dump the current RDF triples to datasets and reload them in an
>> empty database which will automatically then create the RDF_QUAD tables
>> and indexes in column wise mode. Scripts for dumping and reloading the
>> RDF data can be found at:
>>
>> http://virtuoso.openlinksw.com/dataspace/doc/dav/wiki/Main/VirtRDFDatasetDump
>
> Hello Hugh and list,
>
> I just tested the dump/reload scripts and these are my results:
>
> 1) As I already reported, dumping my Virtuoso database (virtuoso.db file
> of 142 GB) took me about one night (11 or 12 hours or so i guess).
>
> 2) Reloading the turtle data took me almost 3 days (244000672 msec. to
> be precise ;-)
>
> 3) When using the query
>
> SQL> sparql SELECT COUNT(*) { ?s ?p ?o };
>
> to count the number of tuples in the old and new RDF store, I get a
> tuple count of 2164637247 (old) vs 2123396737 (new), a difference of
> 41240510 (2%). In my virtuoso.log I do see messages like
>
> 16:30:51 INFO: PL LOG: Error in graph000053_000063.ttl
> 16:33:00 INFO: PL LOG: Error in graph000053_000043.ttl
>
> I would expect the dump/reload scripts not to cause any tuple loss...
> What could be the reason for this? I find it strange that the dumped
> .ttl files seem to contain errors...

[Hugh] The count should be at least the same as it was in v6, and in fact slightly higher, since v7 has more triples by default in an empty database.
Once dumped, the TTL files can also be loaded with the Virtuoso RDF Bulk Loader, which logs the reason a specific dataset failed to load in the ll_error column of the load_list table.

Regards
Hugh

>
> Kind regards,
> Bart

_______________________________________________
Virtuoso-users mailing list
Virtuoso-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/virtuoso-users
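
For anyone following the thread, the count comparison and the virtuoso.log error lines quoted above can be checked with a short script. This is only a rough sketch, not part of the dump/reload scripts: the helper names `failed_files` and `triple_loss` are made up for illustration, and the regular expression assumes the log lines look exactly like the two quoted in Bart's message.

```python
import re

# Assumed log format, taken from the two lines quoted in the thread, e.g.
# "16:30:51 INFO: PL LOG: Error in graph000053_000063.ttl"
ERROR_LINE = re.compile(r"PL LOG: Error in (\S+\.ttl)")


def failed_files(log_lines):
    """Return dump files named in 'PL LOG: Error in ...' lines, in order, without duplicates."""
    seen = []
    for line in log_lines:
        match = ERROR_LINE.search(line)
        if match and match.group(1) not in seen:
            seen.append(match.group(1))
    return seen


def triple_loss(old_count, new_count):
    """Return (missing, percent_of_old) for two COUNT(*) results."""
    missing = old_count - new_count
    return missing, 100.0 * missing / old_count


# The two log lines and the two counts reported in the thread.
sample_log = [
    "16:30:51 INFO: PL LOG: Error in graph000053_000063.ttl",
    "16:33:00 INFO: PL LOG: Error in graph000053_000043.ttl",
]
print(failed_files(sample_log))

missing, percent = triple_loss(2164637247, 2123396737)
print(missing, percent)
```

The list of failing files is a starting point for looking up the matching rows in load_list once the same files have been loaded through the bulk loader.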