Hi Bastian,

Some additional information: below are the relevant INI file parameters we
used on our hosted DBpedia server for importing the data and running the service:

[Parameters]
.
.
.
ServerThreads        = 1000
CheckpointInterval   = 60
AutoCheckpointLogSize = 10000000
NumberOfBuffers      = 300000
MaxDirtyBuffers      = 4000
MaxCheckpointRemap   = 40000
.
.
.

Note that the server we are using has less capacity than yours in terms of
processor speed and memory, so these settings should suffice for your needs
and can be increased if need be.
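
As a rough sizing check, the sketch below estimates how much memory a given
NumberOfBuffers value takes. It assumes 8 KB per buffer (the figure mentioned
in the import steps quoted further down); the variable names are only
illustrative and are not Virtuoso settings.

```shell
#!/bin/sh
# Rough estimate of the memory used by the Virtuoso buffer pool.
# Assumption: each buffer occupies 8 KB, as stated in the quoted import steps.
number_of_buffers=300000   # NumberOfBuffers value from the INI excerpt above
buffer_kib=8               # assumed size of one buffer, in KiB

# Work in KiB to keep the numbers small, then convert to MiB (integer division).
total_kib=$((number_of_buffers * buffer_kib))
total_mib=$((total_kib / 1024))

echo "NumberOfBuffers=$number_of_buffers needs about $total_mib MiB"
```

With the value above this comes to roughly 2.3 GB, so when raising
NumberOfBuffers, leave enough free RAM for the rest of the system.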

Regards,
Hugh


On 4/7/07 19:10, "Hugh Williams" <hwilli...@openlinksw.com> wrote:

> Hi Bastian
> 
> You most probably need to update or add the NumberOfBuffers ini file parameter
> to the [Parameters] section of the Virtuoso configuration file as detailed at:
> 
> http://docs.openlinksw.com/virtuoso/dbadm.html#ini_Parameters
> 
> Note that DBpedia is actually hosted on a Virtuoso server (see
> http://dbpedia.org for more details) and can be queried
> directly via http://dbpedia.openlinksw.com:8890/sparql or
> http://dbpedia.openlinksw.com:8890/isparql .
> 
> Below are the details of how we imported the DBpedia dataset into Virtuoso:
> 
>     * Download the DBpedia datasets from http://dbpedia.org/docs/#down
> 
>     * Unzip and place all the dataset files (*.nt) into a directory called
> "dbpedia_data"
> 
>     * Ensure the "dbpedia_data" directory is added to the "DirsAllowed"
> parameter in the Virtuoso Server configuration file and restart the server.
> 
>     * Ensure the NumberOfBuffers parameter of the Virtuoso Server configuration
> file is set to a value suitable for the import, noting that each buffer is 8K
> bytes of memory. (In our case NumberOfBuffers=300000)
> 
>     * Run the script below from the parent directory of the "dbpedia_data"
> directory to loop through and import each dataset into your Virtuoso Server
> 
> #!/bin/sh
> # Load every N-Triples file under dbpedia_data, logging progress.
> rm -f load_dbpedia.log
> for f in `find dbpedia_data -name '*.nt'`
> do
>   echo "Loading $f `date \"+%H:%M:%S\"`" | tee -a load_dbpedia.log
>   # The isql call is a single command; the backslashes join the wrapped lines.
>   isql 1112 dba dba verbose=on banner=off prompt=off echo=ON errors=stdout \
>     exec="ttlp_mt (file_to_string_output ('$f'), '', 'http://dbpedia.org'); checkpoint;" \
>     >> load_dbpedia.log
> done
>  
> Best Regards,
> Hugh Williams
> OpenLink Software
> 
> On 4/7/07 18:41, "Bastian Quilitz" <quil...@informatik.hu-berlin.de> wrote:
> 
>> Hello,
>> 
>> I am experimenting with Virtuoso Open Source as an RDF store. I tried to
>> import a DBpedia dataset (1.6M triples) into the demo database (demo.ini) on
>> an 8x Opteron (dual-core) machine with a lot of RAM. This took around 3h and
>> I am wondering whether this is a reasonable time. Although there were some
>> other users on the machine (total CPU avg. was around 50%, RAM available
>> >5GB), this seems to be quite slow (~130 triples/sec). I used the ttlp_mt()
>> function to import the triples from a local file.
>> 
>> Am I doing something wrong or are there tuning options I should use?
>> 
>> 
>> Kind regards,
>> Bastian
>> 
>> 
>> _______________________________________________
>> Virtuoso-users mailing list
>> Virtuoso-users@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/virtuoso-users
> 
> 


Best Regards,
Hugh Williams
VP Product Development
OpenLink Software
Tel: +44 (0) 8681 7701
Web: http://www.openlinksw.com
Universal Data Access & Data Integration Technology Providers


