On 10/1/15 6:52 PM, Jason Haag wrote: > Does anyone happen to have a sample RDFa/HTML file that works in > crawler for vituoso? I have tried all of the options (including both > sitemap and semantic sitemaps). I'm curious if my RDFa/HTML import > (URI is here: http://purl.org/xapi/adl/verbs) might not be working > with virtuoso for some reason. It shows it being fetched in the import > que and retrieved sites in the conductor interface, but there is no > data in the database.
Here's an example using our URIBurner instance (a public instance of Virtuoso with the Sponger and various Cartridges enabled): [1] http://linkeddata.uriburner.com/about/html/http/xapi.vocab.pub/datasets/adl/verbs/index.html -- Using /about service [2] http://linkeddata.uriburner.com/c/9DB7EM2T - Using /describe service . > > I am able to import my data when I use a RDF/XML file (.rdf) as the > graph URI, but I would really like to demonstrate that it can process > RDFa/HTML as an option. Thanks in advance. You should be able to use the /about service, as I have to achieve similar results. Once that works you simply enable the crawler to use the sponger. If you have any issues, share a screenshot of your crawler job configuration. Ensure you hatch the following checkboxes: [1] Run Sponger [2] Accept RDF -- this adds transparent content negotiation algorithm into the HTTP request [3] Store Metadata -- instruction to persist the sponger transformation to the designated named graph. Kingsley > > On Thu, Oct 1, 2015 at 3:29 PM, Jason Haag <jhaa...@gmail.com> wrote: >> Hi Kingsley, >> >> Thank you for the reply. I entered a Graph URI for the import into the >> Target URL field. Is this the correct place to enter it? >> >> I also selected the "Do not store crawled documents" radio button >> option as I don't care if the data is stored locally in WEBDAV. >> >> However, when I chose this option I receive the warning message "Host, >> Target collection and Start path must be specified." >> >> So it seems that providing a webdav path is required no matter what? >> >> I also have installed the RDF sponger cartridges and have those >> checked as well as "Accept RDF." >> >> Alternatively, I tried entering my URI into the field where it states: >> "If Graph IRI is unassigned use this Data Source URL:" >> >> When I try using this alternative data source URL field I receive the >> warning message, "Incorrect target URL." >> >> Appreciate any tips or advice you have for using the crawler interface >> with RDFa. Will also take a look at the linked data folder setup you >> suggested. >> >> Kind Regards, >> >> J Haag >> >> >> Date: Wed, 30 Sep 2015 20:25:50 -0400 >> From: Kingsley Idehen <kide...@openlinksw.com> >> Subject: Re: [Virtuoso-users] Automating RDF data imports in VIrtuoso >> To: virtuoso-users@lists.sourceforge.net >> Message-ID: <560c7d8e.7020...@openlinksw.com> >> Content-Type: text/plain; charset="windows-1252" >> >> On 9/30/15 6:10 PM, Haag, Jason wrote: >>> Thanks Daniel. I'm running Linux/Debian + Ubuntu. I am able to import >>> the HTML/RDFa file using the crawler. I'm able to find it in WEBDAV. >>> It just doesn't populate the database. If I import the same data in >>> RDF/XML format it does populate the database. >> Did you set a Named Graph IRI in the import/crawl job? That's how you >> get data into the quad store. Storing the ingest data to WebDAV is >> totally optional. Also note that you can invoke the sponger too, as part >> of this crawling functionality. >> >> You can also make Linked Data Folder Types that are mapped to named >> graph iris as part of the folder config. Once in place you can make said >> folder the target of RDF content that you want to import into the quad >> store. >> >> [1] https://www.pinterest.com/kidehen/virtuoso-universal-server-related/ >> -- I've added 4 screenshots showcasing Linked Data Folder Type setup >> >> >> Kingsley > ------------------------------------------------------------------------------ > _______________________________________________ > Virtuoso-users mailing list > Virtuoso-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/virtuoso-users > -- Regards, Kingsley Idehen Founder & CEO OpenLink Software Company Web: http://www.openlinksw.com Personal Weblog 1: http://kidehen.blogspot.com Personal Weblog 2: http://www.openlinksw.com/blog/~kidehen Twitter Profile: https://twitter.com/kidehen Google+ Profile: https://plus.google.com/+KingsleyIdehen/about LinkedIn Profile: http://www.linkedin.com/in/kidehen Personal WebID: http://kingsley.idehen.net/dataspace/person/kidehen#this
smime.p7s
Description: S/MIME Cryptographic Signature
------------------------------------------------------------------------------
_______________________________________________ Virtuoso-users mailing list Virtuoso-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/virtuoso-users