Re: [Virtuoso-users] Problem with very large RDF literals

Patrick van Kleef Mon, 12 Oct 2009 20:02:25 +0000

HI Sebastian,

Looking at the Virtuoso configuration file params you provided Inot you
are running in LiteMode, with only the following set:
[Parameters]
LiteMode=1
ServerPort=1111
DisableTcpSocket=1
PrefixResultNames=0
ServerThreads=5
CheckpointInterval=10
MaxDirtyBuffers=50
SchedulerInterval=5
FreeTextBatchSize=1000
Which means the Virtuoso NumberOfBuffers which is not set andcontrols
the amount of RAM used by the server will be the default of 2000 * 8K
buffers = 16K . So you entire Virtuoso Server is running within16K of
memory and you are attempting to insert a 1MB plus triple, I think it
will have problems as the server will then having to be swappingbetweenmemory and disk like crazy. I would suggest you set theNumberOfBuffersto something like 200000 to give the Virtuoso server a reasonableamount
of RAM to perform such inserts ...
I only have 3gb of main memory so 200000 does not work here:

GPF: disk.c:1294 Cannot allocate memory for Database buffers, try to
decrease NumberOfBuffers INI setting


But I tried 100000 and killed virtuoso after two hours.
Also you introduced the lite mode for me to be able to run Virtuoso on
desktop machines where people are already going crazy if you use 100MB
of memory. So raising the number of buffers cannot really be the end
solution.

Yes, but you got to realize that it requires some tuning in order forthe system to be able to still perform. If you try to stuff datalarger than your buffer size into the system, then it will requirethe database to make a lot of swaps to get the relevant data pagesinto memory, constantly swapping data between memory and db. Thenupdating the freetext index etc.

Since i do not know all the details of your machine, what otherprocesses are running etc, i cannot tell you exactly what settingsyou can try at this point.

I fully appreciate the fact you want to keep the memory footprintdown, but this does not mean that you can expect all lite mode willjust be able to handle arbitrary commands without some performancecosts.

Now i am prepared to help analyze your current statements and see ifwe can come up with a method that will return in a more reasonabletime, without dramatically increase the footprint of virtuoso.

I don't really know how to continue now.
The only solution I can see ATM is to split the big literals into
several statements. After all they are only used for the full textindexanyway (it is the plain text representation of files in the user'shome
directory.)
Any other (hopefully better) ideas?

Can you dump the sample data in some format like .n3 or similar so ican have a look at this. Or just dump the sparql commands you triedto use to a text file and compress it.


Contact me privately so we can arrange pickup.


Patrick

Re: [Virtuoso-users] Problem with very large RDF literals

Reply via email to