Hi Bastian

You most probably need to update or add the NumberOfBuffers ini file
parameter  to the [Parameters] section of the Virtuoso configuration file as
detailed at:

http://docs.openlinksw.com/virtuoso/dbadm.html#ini_Parameters

Note that Dbpedia is actually hosted on a Virtuoso server see
http://dbpedia.org for more details and can be interacted
with directly via http://dbpedia.openlinksw.com:8890/sparql or
http://dbpedia.openlinksw.com:8890/isparql .

Below are the details of how we imported the Dbpedia dataset into Virtuoso:

    * Download the DBpedia datasets from http://dbpedia.org/docs/#down

    * Unzip and place all the the datasets file (*.nt) into a directory
called "dbpedia_data"

    * Ensure the "dbpedia_data" directory is added to your "DirsAllowed?"
paramter in the Virtuoso Server configuration file and restart the server.

    * The NumberOfBuffers? paramter of the Virtuoso Server configuration
file is set to a suitable value to perform the import,noting that each
buffer is 8K bytes of memory. (In our case NumberOfBuffers=300000)

    * Run the script below from the parent directory of the "dbpedia_data"
directory to loop through and import each dataset into your Virtuoso Server

#!/bin/sh
rm -f load_dbpedia.log
for f in `find dbpedia_data -name '*.nt'`
do
echo "Loading $f `date \"+%H:%M:%S\"`" | tee -a load_dbpedia.log
isql 1112 dba dba verbose=on banner=off prompt=off echo=ON errors=stdout
exec="ttlp_mt (file_to_string_output ('$f'), '', 'http://dbpedia.org');
checkpoint;" >> load_dbpedia.log
done
 
Best Regards,
Hugh Williams
OpenLink Software

On 4/7/07 18:41, "Bastian Quilitz" <quil...@informatik.hu-berlin.de> wrote:

> Hello,
> 
> I am experimenting with virtuoso open source as rdf store. I tried to import
> a dbpedia dataset (1.6M triples) into the demo database (demo.ini) on a
> 8xOpteron(dualCore) with a lot of ram. This took around 3h and I am
> wondering whether this is a reasonable time? Although there were some other
> users using the machine (total cpu avg. was around 50%, RAM available >5GB)
> this seems to be quiet slow (~130triples/sec). I used the ttpl_mp() function
> to import the triples from a local file.
> 
> Am I doing something wrong or are there tuning options I should use?
> 
> 
> Kind regards,
> Bastian
> 
> 
> -------------------------------------------------------------------------
> This SF.net email is sponsored by DB2 Express
> Download DB2 Express C - the FREE version of DB2 express and take
> control of your XML. No limits. Just data. Click to get it now.
> http://sourceforge.net/powerbar/db2/
> _______________________________________________
> Virtuoso-users mailing list
> Virtuoso-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/virtuoso-users





Reply via email to