Kingsley Idehen wrote:
Jem Rayfield wrote:
Hi Kingsley,
I want to push content into Virtuoso via HTTP rather than a crawl
mechanism. I don't think the content crawler is what I am after.
Yes.
When you PUT data into Virtuoso, sponging can occur, it depends on where
you PUT the data :-)
Our content gets created and pushed onto publish/rendering queues. I
need to aysnch the triple extraction as and when content is published
rather than waiting for a publishing mechanism, exposure onto public
facing web servers and a subsequent crawl. Ideally I would like the
extraction process to work on new content documents (rdf) and augment
the rdf to an existing graph. (Will a HTTP post approach always create a
new graph? Maybe I have configured something incorrectly?)
New graph if you create resources in the RDF_Sink folder. Otherwise not.
Of course you can tailor all of this to your specific needs e.g. put the
triples in one graph.
(The crawl mechanism certainly looks like a very interesting
feature but I don't think it fits this use case. Although I could have
missed something?)
After pushing the XHTML2/RDFa content into Virtuoso via HTTP I am able
to use SPARQL on the quad store. The origin document is also available
via the DAV interface.
Yes.
However I am assuming that your DAV store is built using database tables
and thus the sponging process also consumes the XHTML into a Table
(XMLType (text index)).
This Table/XMLType could then be queried using SQL/SPARQL?. I could then
maybe even be expose these a stored proc using virtuoso web service->PL?
Yes, but you should use WebDAV as if first via functions. Also note we
do have a separate WebDAV Cartridge that simply makes an RDF graph of
DAV resources. This particular cartridge is about the DAV information
resources. Thus, you have SPARQL access to these, you just use the
http://<cname>/webdav Graph IRI. Even better, if you install
ODS-Briefcase, all you DAV resources are exposed via a more granular
graph using the SIOC Ontology.
Only if the ODS-Briefcase or basic WebDAV graphs don't meet your needs
should you consider writing Virtuoso PL against the WebDAV tables.
Something to bear in mind:
DAV_PROP_SET ('/DAV/home/me/rdf_sink/', 'virt:rdf_graph',
someIRI, 'me', 'mypassword');
Subsequent uploads into rdf_sink will use the someIRI specified in the
virt:rdf_graph DAV Property as their graph.
I'm sure this can be set using the WebDAV Browser via Conductor and
probably the ODS-Briefcase UI if required.
~Tim
--
Tim Haynes
Product Development Consultant
OpenLink Software
<http://www.openlinksw.com/>