Jem,
<<
I have managed to sponge XHTML/RDFa via DAV (using the steps below)
however I want to be able to query the origin XHTML2 using XPATH or
transform using XSLT. I can only see the origin content within DAV at
the moment and cannot see the XML stored as an XMLType within a Virtuoso
Table. Ideally I would like to be able to query the triples using SPARQL
and (maybe in combination with) query the origin XHTML using XPATH. Have
you came across anything like this (examples)? This would be most useful
as I could then transform required content from the result of a SPARQL
query and maybe even expose the content via a Virtuoso stored procedure
(Virtuosos web service->stored-proc mapping) this would allow some
pretty funky logic and will enable me to constrain and control access to
the content.
>>
Please confirm that this is what you seek:
1. Grab XHTML content from an HTTP accessible source into Virtuoso
(WebDAV Content Management realm)
2. Have an RDF graph(s) generated from the imported resource(s)
3. Have SPARQL access to the RDF in the Quad Store
4. XQuery/XPath access to the XHTML via WebDAV or any other means.
If the above is true, the key to this is via the Virtuoso Content
Crawler which can do the following on a scheduled basis:
1. Grab/Sponge Web content into a location of your choice within WebDAV
2. Indicate to the Crawler that is should use one or more Sponger
Cartridges during the crawl
Result:
1. WebDAV accessible XHTML to which you can apply XSLT, XQuery, XPath
Queries
2. Triples in the Quad Store (with Graph IRIs matching the content
source URLs)
Also see: http://demo.openlinksw.com/tutorial re. examples of XML data
manipulation etc.. You can install a local version of this via the
"tutorial vad package".
--
Regards,
Kingsley Idehen Weblog: http://www.openlinksw.com/blog/~kidehen
President & CEO
OpenLink Software Web: http://www.openlinksw.com