Corey Lovitt wrote:
Hi,
I've been working with Virtuoso for a couple of weeks now
and have been able to load a test RDF data file into the
quad store (a subset of the UNIPROT dataset containing about
2.6 million triples) and query it with good success. I had
saved the loading commands as well as the SPARQL queries for
the database so they were reusable.

In testing the system, I shut down the virtuoso-t server,
deleted the db files as well as the trx and the pxa files,
and then brought the server back up. I then did a full
reload of the dataset.

I'm now struggling with query performance on what seem
to be very basic queries and am looking for suggestions to
help resolve this. I've loaded the same dataset and reused
the saved SPARQL queries against it. My original install
was, I believe, into a vanilla quad store (no special
indexes in place). I believe the query is being run
exclusively on local data. If this assumption is wrong, it
may explain the performance issue. I don't really understand
how the caching aspect of this system works and may be
tripping on this.

Can folks suggest any way to check this? (My packet
sniffer isn't showing anything obvious.)

I can provide the explain output for an example query
if this would be useful.

Any suggestions would be _really_ appreciated.
Corey

_______________________________________________
Virtuoso-users mailing list
Virtuoso-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/virtuoso-users
Corey,

What indexes do you have in place ?

What RDF data loading guide did you use and follow?


--

Regards,

Kingsley Idehen       Weblog: http://www.openlinksw.com/blog/~kidehen
President & CEO OpenLink Software Web: http://www.openlinksw.com

I'm using the default indexes: 'RDF_QUAD' and 'RDF_QUAD_OGPS'.
The query I'm running uses a FROM clause to name the graph, so I assumed
the default indexes were appropriate.

PREFIX uni: <http://purl.uniprot.org/core/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
SELECT ?Protein ?ProteinName ?FunctionDescription
FROM <http://localgraphstore/data/UNIPROT>
WHERE {
  ?Protein uni:organism <http://purl.uniprot.org/taxonomy/9606> . # get Human records
  ?Protein uni:recommendedName ?recnameid .
  ?recnameid uni:fullName ?ProteinName .
  ?Protein uni:annotation ?annotationid .
  ?annotationid a uni:Function_Annotation ;
                rdfs:comment ?FunctionDescription .
}
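
A rough way to get repeatable timings for a query like this one, outside of isql, is to send it to the server's HTTP SPARQL endpoint several times and compare the first (cold) run with the later (warm) runs. The following is only a sketch under stated assumptions: the endpoint URL `http://localhost:8890/sparql` is assumed (adjust it to your install), the helper names `build_url` and `time_query` are made up for illustration, and it assumes the endpoint can return SPARQL JSON results.

```python
import json
import time
import urllib.parse
import urllib.request

# Assumption: the HTTP SPARQL endpoint of a local Virtuoso instance.
ENDPOINT = "http://localhost:8890/sparql"

# The query from this message, one triple pattern per line.
QUERY = """
PREFIX uni: <http://purl.uniprot.org/core/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
SELECT ?Protein ?ProteinName ?FunctionDescription
FROM <http://localgraphstore/data/UNIPROT>
WHERE {
  ?Protein uni:organism <http://purl.uniprot.org/taxonomy/9606> .
  ?Protein uni:recommendedName ?recnameid .
  ?recnameid uni:fullName ?ProteinName .
  ?Protein uni:annotation ?annotationid .
  ?annotationid a uni:Function_Annotation ;
                rdfs:comment ?FunctionDescription .
}
"""


def build_url(endpoint, query):
    """Build a SPARQL protocol GET URL carrying the query text."""
    return endpoint + "?" + urllib.parse.urlencode({"query": query})


def time_query(endpoint, query, runs=3):
    """Run the query `runs` times; return (seconds, row_count) per run."""
    url = build_url(endpoint, query)
    results = []
    for _ in range(runs):
        request = urllib.request.Request(
            url, headers={"Accept": "application/sparql-results+json"}
        )
        start = time.perf_counter()
        with urllib.request.urlopen(request) as response:
            rows = json.load(response)["results"]["bindings"]
        results.append((time.perf_counter() - start, len(rows)))
    return results
```

Calling `time_query(ENDPOINT, QUERY)` against a running server returns one `(seconds, row_count)` pair per run. A first run that is much slower than the rest would point at cold caches after the reload rather than at the query plan.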


As for a loading guide, I used the information available in the
5.0.{9,10} documentation releases. The data was loaded from the isql prompt
using these commands:

checkpoint;
checkpoint_interval(6000);
DB.DBA.RDF_LOAD_RDFXML_MT (
  file_to_string_output('/opt/bigV/data-to-load/uniprot-test-records.rdf'),
  '',
  'http://localgraphstore/data/UNIPROT',
  0, 1);
checkpoint;
checkpoint_interval(60);

I've just upgraded to 5.0.11, and it looks like some of the docs have been
updated. Is there anything new in the performance/tuning section (15.7)?

Cheers,
Corey



Corey,

Please read: http://docs.openlinksw.com/virtuoso/rdfperformancetuning.html

--


Regards,

Kingsley Idehen       Weblog: http://www.openlinksw.com/blog/~kidehen
President & CEO OpenLink Software Web: http://www.openlinksw.com




