Hi Hugh,
On 08/29/2015 12:29 AM, Hugh Williams wrote:
Hi Núria,
So you increased from 9000 to 9000000 , which is excessive considering
the query takes 144secs to complete ?
well, this is the computer time, my crono said 2'25''
Can you provide the query being executed as well as the query profile
by running “set profile on;” from isql before running the query.
The query is:
SELECT count(DISTINCT ?gene) as ?genes ?disease ?diseaseName ?orphanet
count(DISTINCT ?phenotype) as ?phenotypes FROM <http://rdf.disgenet.org>
WHERE {
?gda sio:SIO_000628 ?disease,?gene .
?disease skos:exactMatch ?orphanet .
?orphanet sio:SIO_000341 ?phenotype .
FILTER regex(?gene, 'ncbigene')
FILTER regex(?disease, 'umls/id')
?disease dcterms:title ?diseaseName .
FILTER regex(?orphanet, 'identifiers.org/orphanet/86893')
?orphanet dcterms:title ?orphanetName .
?phenotype dcterms:title ?phenotypeName
}
GROUP BY ?disease ?diseaseName ?orphanet
After running "set profile on;" :
SQL> set profile on;
SQL> sparql SELECT count(DISTINCT ?gene) as ?genes ?disease ?diseaseName
?orphanet count(DISTINCT ?phenotype) as ?phenotypes FROM
<http://rdf.disgenet.org> WHERE {
?gda sio:SIO_000628 ?disease,?gene .
?disease skos:exactMatch ?orphanet .
?orphanet sio:SIO_000341 ?phenotype .
FILTER regex(?gene, 'ncbigene')
FILTER regex(?disease, 'umls/id')
?disease dcterms:title ?diseaseName .
FILTER regex(?orphanet, 'identifiers.org/orphanet/86893')
?orphanet dcterms:title ?orphanetName .
?phenotype dcterms:title ?phenotypeName
}
GROUP BY ?disease ?diseaseName ?orphanet
Type the rest of statement, end with a semicolon (;)> Type the rest of
statement, end with a semicolon (;)> Type the rest of statement, end
with a semicolon (;)> Type the rest of statement, end with a semicolon
(;)> Type the rest of statement, end with a semicolon (;)> Type the rest
of statement, end with a semicolon (;)> Type the rest of statement, end
with a semicolon (;)> Type the rest of statement, end with a semicolon
(;)> Type the rest of statement, end with a semicolon (;)> Type the rest
of statement, end with a semicolon (;)> Type the rest of statement, end
with a semicolon (;)> Type the rest of statement, end with a semicolon
(;)> Type the rest of statement, end with a semicolon (;)> ;
result
LONG VARCHAR
_______________________________________________________________________________
12 http://linkedlifedata.com/resource/umls/id/C1334968 Nodular
Lymphocyte Predominant Hodgkin Lymphoma
http://identifiers.org/orphanet/86893 11
1 http://linkedlifedata.com/resource/umls/id/C2239290 Hodgkin
lymphoma, nodular lymphocyte predominance
http://identifiers.org/orphanet/86893 11
{
time 4e-06% fanout 1 input 1 rows
Subquery 27
{
time 2.9e-06% fanout 1 input 1 rows
{ fork
time 0.14% fanout 2.03747e+06 input 1 rows
RDF_QUAD 2.8e+05 rows(s_14_7_t0.O, s_14_7_t0.S)
inlined P = #/SIO_000628 G = #/rdf.disgenet.org
time 6.8% fanout 0.5 input 2.03747e+06 rows
END Node
After test:
0: __ro2sq := Call __ro2sq (s_14_7_t0.O)
5: _cvt := Call _cvt (<constant>, __ro2sq)
10: if (_cvt LIKE <c %umls/id%>) then 14 else 15 unkn 15
14: BReturn 1
15: BReturn 0
time 2.3% fanout 2 input 1.01874e+06 rows
Precode:
0: __ro2sq := Call __ro2sq (s_14_7_t0.O)
5: BReturn 0
RDF_QUAD 3.5 rows(s_14_7_t1.O)
inlined P = #/SIO_000628 , S = k_s_14_7_t0.S G = #/rdf.disgenet.org
time 6.7% fanout 0.5 input 2.03747e+06 rows
END Node
After test:
0: __ro2sq := Call __ro2sq (s_14_7_t1.O)
5: _cvt := Call _cvt (<constant>, __ro2sq)
10: if (_cvt LIKE <c %ncbigene%>) then 14 else 15 unkn 15
14: BReturn 1
15: BReturn 0
time 1% fanout 1 input 1.01874e+06 rows
RDF_QUAD 5 rows(s_14_7_t4.S, s_14_7_t4.O)
inlined P = #/title , S = k_s_14_7_t0.O G = #/rdf.disgenet.org
time 4.5% fanout 14.4808 input 1.01874e+06 rows
Precode:
0: __ro2sq := Call __ro2sq (s_14_7_t4.O)
5: BReturn 0
RDF_QUAD 4.6 rows(s_14_7_t2.O)
inlined P = ##exactMatch , S = k_s_14_7_t0.O G = #/rdf.disgenet.org
time 79% fanout 8.81231e-07 input 1.47521e+07 rows
END Node
After test:
0: __ro2sq := Call __ro2sq (s_14_7_t2.O)
5: rdf_regex_impl := Call rdf_regex_impl (__ro2sq, <c
identifiers.org/orphanet/86893>)
10: if ( 0 = rdf_regex_impl) then 15 else 14 unkn 15
14: BReturn 1
15: BReturn 0
time 0.0006% fanout 11 input 13 rows
Precode:
0: __ro2sq := Call __ro2sq (s_14_7_t2.O)
5: BReturn 0
RDF_QUAD 86 rows(s_14_7_t3.O, s_14_7_t3.S)
inlined P = #/SIO_000341 , S = k_s_14_7_t2.O G = #/rdf.disgenet.org
time 0.00054% fanout 1 input 143 rows
RDF_QUAD 5 rows()
inlined P = #/title , S = cast G = #/rdf.disgenet.org
time 0.00021% fanout 2 input 143 rows
RDF_QUAD 5 rows()
inlined P = #/title , S = k_s_14_7_t2.O G = #/rdf.disgenet.org
time 0.0021% fanout 0 input 286 rows
Sort (s_14_7_t0.O, s_14_7_t4.O, s_14_7_t2.O) -> (inc, inc, __ro2sq,
__ro2sq, __ro2sq)
}
time 7.5e-05% fanout 2 input 1 rows
group by read node
(s_14_7_t0.O, s_14_7_t4.O, s_14_7_t2.O, aggregate, aggregate, __ro2sq,
__ro2sq, __ro2sq)
After code:
0: genes := := artm aggregate
4: disease := := artm __ro2sq
8: diseaseName := := artm __ro2sq
12: orphanet := := artm __ro2sq
16: phenotypes := := artm aggregate
20: BReturn 0
time 1.4e-06% fanout 0 input 2 rows
Subquery Select(genes, disease, diseaseName, orphanet, phenotypes)
}
After code:
0: genes := Call __ro2sq (genes)
5: disease := Call __ro2sq (disease)
10: diseaseName := Call __ro2sq (diseaseName)
15: orphanet := Call __ro2sq (orphanet)
20: phenotypes := Call __ro2sq (phenotypes)
25: BReturn 0
time 8e-07% fanout 0 input 2 rows
Select (genes, disease, diseaseName, orphanet, phenotypes)
}
103934 msec 99% cpu, 4.05894e+06 rnd 1.7155e+07 seq 70.6155% same
seg 13.7565% same pg
Compilation: 35 msec 0 reads 0% read 0 messages 0% clw
4 Rows. -- 103972 msec.
What is the current triple count on the server, which can be obtained
with:
sparql select count(*) where {?s ?p ?o};
24914558 triples
Also what is the status of the server after running the query multiple
times, which can be obtained with the command:
status();
SQL> status();
result
LONG VARCHAR
_______________________________________________________________________________
OpenLink Virtuoso Server
Version 07.10.3211-pthreads for Linux as of Nov 4 2014
Started on: 2015-08-19 10:02 GMT+2
Database Status:
File size 0, 1073664 pages, 474141 free.
5450000 buffers, 535311 used, 2 dirty 0 wired down, repl age 0 0 w.
io 0 w/crsr.
Disk Usage: 526645 reads avg 0 msec, 0% r 0% w last 1604 s, 495790
writes flush 26.55 MB,
2677 read ahead, batch = 191. Autocompact 133309 in 100660 out,
24% saved col ac: 25228 in 22% saved.
Gate: 9239 2nd in reads, 0 gate write waits, 0 in while read 0 busy scrap.
Log =
/usr/local/virtuoso-development-04112014/var/lib/virtuoso/db/virtuoso.trx,
5872 bytes
599446 pages have been changed since last backup (in checkpoint state)
Current backup timestamp: 0x0000-0x00-0x00
Last backup date: unknown
Clients: 16 connects, max 4 concurrent
RPC: 688 calls, -12 pending, 2 max until now, 0 queued, 0 burst reads
(0%), 0 second 2M large, 417M max
Checkpoint Remap 0 pages, 0 mapped back. 464 s atomic time.
DB master 1073664 total 474141 free 0 remap 0 mapped back
temp 13312 total 13307 free
{
Precode:
0: <none> := Call status ()
5: BReturn 0
END Node
}
264 msec 99% cpu, 0 rnd 0 seq 0% same
seg 0% same pg
Compilation: 0 msec 0 reads 0% read 0 messages 0% clw
4 Rows. -- 266 msec.
I hope this is the info you requested. Thanks again!
Best,
Núria
Best Regards
Hugh Williams
Professional Services
OpenLink Software, Inc. // http://www.openlinksw.com/
Weblog -- http://www.openlinksw.com/blogs/
LinkedIn -- http://www.linkedin.com/company/openlink-software/
Twitter -- http://twitter.com/OpenLink
Google+ -- http://plus.google.com/100570109519069333827/
Facebook -- http://www.facebook.com/OpenLinkSoftware
Universal Data Access, Integration, and Management Technology Providers
On 28 Aug 2015, at 18:08, Nuria Queralt Rosinach <nquer...@imim.es
<mailto:nquer...@imim.es>> wrote:
Hi Hugh,
On 08/28/2015 06:46 PM, Hugh Williams wrote:
Hi Núria,
What was the 'MaxQueryCostEstimationTime’ parameter set to
previously and what did you increase it to , as the
“ERR_CONTENT_DECODING_FAILED” error seem odd for a timeout parameter ?
I increased from 9x10**3 to 9x10**6
I got directly the virtuoso error message executing the query from
our endpoint but specifying the virtuoso port in the browser, in
order to avoid the apache server. then the message error from
virtuoso was that the estimation execution time was too short. the
error message "ERR_CONTENT_DECODING_FAILED" come from apache i guess,
as it does not know how to serve the error message from virtuoso?
Also, when you ran the query from isql how long does it take to
execute and return results ?
144552 msec
best,
n
Best Regards
Hugh Williams
Professional Services
OpenLink Software, Inc. // http://www.openlinksw.com/
Weblog -- http://www.openlinksw.com/blogs/LinkedIn --
http://www.linkedin.com/company/openlink-software/Twitter --
http://twitter.com/OpenLinkGoogle+ --
http://plus.google.com/100570109519069333827/Facebook --
http://www.facebook.com/OpenLinkSoftware
Universal Data Access, Integration, and Management Technology Providers
On 28 Aug 2015, at 15:51, Nuria Queralt Rosinach <nquer...@imim.es
<mailto:nquer...@imim.es>> wrote:
Hi Hugh,
Problem solved! just increasing a bit the
'MaxQueryCostEstimationTime' parameter in the configuration of the
SPARQL service and the query works perfectly fine :)
Thank you very much for your always fast replies.
Best,
Núria
On 08/28/2015 04:30 PM, Nuria Queralt Rosinach wrote:
Hi Hugh,
On 08/28/2015 02:56 PM, Hugh Williams wrote:
HI Núria,
You reported the same error running a SPARQL-FED query last year,
was this resolved then as it is unclear from the thread ?
Good memory! i managed to solve the problem in that case. This
time it is not a FED.
What is the query currently being run is it a SPARQL-FED query
also or a query against the local instance ?
it is about a query against local data. Issues raised via the
sparql endpoint when using DISTINCT in a SELECT query, or GROUP BY
modifier: the same query gives output without using DISTINCT or
GROUP BY in the SELECT query.
If the query runs from isql which executes queries as the “dba”
(or whatever user you login as) and fails via the HTTP , which I
assume is the /sparql endpoint, which runs as user “SPARQL”,
yes
then most probably the roles assigned to both differ, thus you
should check was is assigned to both in the “System Admin -> User
Accounts” section of the Conductor and ensure the same SPARQL
roles are assigned ie SPARQL_SELECT, SPARQL_UPDATE,
SPARQL_SPONGE, SPARQL_LOAD_SERVICE_DATA , see:
http://docs.openlinksw.com/virtuoso/rdfsparql.html#rdfsupportedprotocolendpointuri
http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtTipsAndTricksDiscoverSPARQFedCapabilities
I checked, they differ: SPARQL user has the 'SPARQL_SELECT' role
assigned, whereas the dba user does not have any role assigned...
Thanks!
Núria
Best Regards
Hugh Williams
Professional Services
OpenLink Software, Inc. // http://www.openlinksw.com/
Weblog -- http://www.openlinksw.com/blogs/
LinkedIn -- http://www.linkedin.com/company/openlink-software/
Twitter -- http://twitter.com/OpenLink
Google+ -- http://plus.google.com/100570109519069333827/
Facebook -- http://www.facebook.com/OpenLinkSoftware
Universal Data Access, Integration, and Management Technology
Providers
On 28 Aug 2015, at 12:33, Nuria Queralt Rosinach
<nquer...@imim.es <mailto:nquer...@imim.es>> wrote:
Dear all,
I have a problem with a query that perfectly works via ISQL, but
it does not via Web. The error i got is:Error code:
ERR_CONTENT_DECODING_FAILED
Virtuoso version implemented:
OpenLink Virtuoso Server
Version 07.10.3211-pthreads for Linux as of Nov 4 2014
Could you shed light on that problem? Many thanks in advance.
Best regards,
Núria
--
********************************************************************
Núria Queralt Rosinach
Research Programme on Biomedical Informatics (GRIB)
Department of Experimental and Health Sciences
Universitat Pompeu Fabra
IMIM (Hospital del Mar Medical Research Institute)
C/Dr. Aiguader 88, 08003
Barcelona, Spain
Tel.: +34 93 316 0536 (1536)
E-mail:nquer...@imim.es
Skype IM: nuriaqr76
http://ibi.imim.es/
********************************************************************
------------------------------------------------------------------------------
_______________________________________________
Virtuoso-users mailing list
Virtuoso-users@lists.sourceforge.net
<mailto:Virtuoso-users@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/virtuoso-users
--
********************************************************************
Núria Queralt Rosinach
Research Programme on Biomedical Informatics (GRIB)
Department of Experimental and Health Sciences
Universitat Pompeu Fabra
IMIM (Hospital del Mar Medical Research Institute)
C/Dr. Aiguader 88, 08003
Barcelona, Spain
Tel.: +34 93 316 0536 (1536)
E-mail:nquer...@imim.es
Skype IM: nuriaqr76
http://ibi.imim.es/
********************************************************************
--
********************************************************************
Núria Queralt Rosinach
Research Programme on Biomedical Informatics (GRIB)
Department of Experimental and Health Sciences
Universitat Pompeu Fabra
IMIM (Hospital del Mar Medical Research Institute)
C/Dr. Aiguader 88, 08003
Barcelona, Spain
Tel.: +34 93 316 0536 (1536)
E-mail:nquer...@imim.es
Skype IM: nuriaqr76
http://ibi.imim.es/
********************************************************************
------------------------------------------------------------------------------
_______________________________________________
Virtuoso-users mailing list
Virtuoso-users@lists.sourceforge.net
<mailto:Virtuoso-users@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/virtuoso-users
--
********************************************************************
Núria Queralt Rosinach
Research Programme on Biomedical Informatics (GRIB)
Department of Experimental and Health Sciences
Universitat Pompeu Fabra
IMIM (Hospital del Mar Medical Research Institute)
C/Dr. Aiguader 88, 08003
Barcelona, Spain
Tel.: +34 93 316 0536 (1536)
E-mail:nquer...@imim.es
Skype IM: nuriaqr76
http://ibi.imim.es/
********************************************************************
------------------------------------------------------------------------------
_______________________________________________
Virtuoso-users mailing list
Virtuoso-users@lists.sourceforge.net
<mailto:Virtuoso-users@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/virtuoso-users
--
********************************************************************
Núria Queralt Rosinach
Research Programme on Biomedical Informatics (GRIB)
Department of Experimental and Health Sciences
Universitat Pompeu Fabra
IMIM (Hospital del Mar Medical Research Institute)
C/Dr. Aiguader 88, 08003
Barcelona, Spain
Tel.: +34 93 316 0536 (1536)
E-mail: nquer...@imim.es
Skype IM: nuriaqr76
http://ibi.imim.es/
********************************************************************
------------------------------------------------------------------------------
_______________________________________________
Virtuoso-users mailing list
Virtuoso-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/virtuoso-users