On 10/1/15 6:52 PM, Jason Haag wrote:
> Does anyone happen to have a sample RDFa/HTML file that works in the
> crawler for Virtuoso? I have tried all of the options (including both
> sitemap and semantic sitemaps). I'm curious if my RDFa/HTML import
> (URI is here: http://purl.org/xapi/adl/verbs) might not be working
> with Virtuoso for some reason. It shows it being fetched in the import
> queue and retrieved sites in the Conductor interface, but there is no
> data in the database.

Here's an example using our URIBurner instance (a public instance of
Virtuoso with the Sponger and various Cartridges enabled):

[1] http://linkeddata.uriburner.com/about/html/http/xapi.vocab.pub/datasets/adl/verbs/index.html
-- Using the /about service
[2] http://linkeddata.uriburner.com/c/9DB7EM2T -- Using the /describe service
>
> I am able to import my data when I use a RDF/XML file (.rdf) as the
> graph URI, but I would really like to demonstrate that it can process
> RDFa/HTML as an option. Thanks in advance.

You should be able to use the /about service, as I have, to achieve
similar results. Once that works, you simply enable the crawler to use
the Sponger. If you have any issues, share a screenshot of your crawler
job configuration.
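
For anyone who wants to try the /about service against their own document,
here is a minimal Python sketch that builds the proxy URL following the
pattern visible in link [1] above (service base, then /about/html/, then the
source scheme, then host and path). The function name about_url and its
parameters are my own illustration, not part of Virtuoso. For a local
instance you would pass your own base URL, for example
http://localhost:8890 if you kept what I believe is the default HTTP port.

```python
from urllib.parse import urlsplit


def about_url(source_url, service_base="http://linkeddata.uriburner.com", fmt="html"):
    """Build a /about proxy URL for source_url, mirroring the shape of link [1].

    Hypothetical helper: only the URL pattern comes from the example link.
    Query strings and fragments are ignored, since the example does not show
    how the service expects them to be encoded.
    """
    parts = urlsplit(source_url)
    if parts.scheme not in ("http", "https"):
        raise ValueError("expected an http or https URL")
    # Pattern: <base>/about/<fmt>/<scheme>/<host><path>
    return f"{service_base.rstrip('/')}/about/{fmt}/{parts.scheme}/{parts.netloc}{parts.path}"


print(about_url("http://xapi.vocab.pub/datasets/adl/verbs/index.html"))
```

Opening the printed URL in a browser is a quick way to see whether the
Sponger can extract anything from the RDFa before involving the crawler.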


Ensure you check the following checkboxes:
[1] Run Sponger
[2] Accept RDF -- this adds transparent content negotiation to the
HTTP request
[3] Store Metadata -- instruction to persist the Sponger transformation
to the designated named graph.
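
Once the job has run with those options, one way to confirm that triples
actually reached the named graph is to count them through the SPARQL
endpoint. The sketch below only builds the request URL. The endpoint
http://localhost:8890/sparql and the graph IRI are assumptions on my part
(use whatever Graph IRI you set in the crawl job), and graph_count_url is a
hypothetical helper name.

```python
from urllib.parse import urlencode


def graph_count_url(graph_iri, endpoint="http://localhost:8890/sparql"):
    """Return a SPARQL GET URL that counts the triples in graph_iri.

    The endpoint default is an assumption about a local install; the
    'query' parameter is the standard SPARQL protocol parameter.
    """
    query = (
        "SELECT (COUNT(*) AS ?triples) "
        f"WHERE {{ GRAPH <{graph_iri}> {{ ?s ?p ?o }} }}"
    )
    return endpoint + "?" + urlencode({"query": query})


# Assumed graph IRI: the document URI from the original question.
print(graph_count_url("http://purl.org/xapi/adl/verbs"))
```

A count of zero after a completed crawl suggests the data was fetched but
never stored in the quad store, which points back at the Graph IRI and
Store Metadata settings.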


Kingsley
>
> On Thu, Oct 1, 2015 at 3:29 PM, Jason Haag <jhaa...@gmail.com> wrote:
>> Hi Kingsley,
>>
>> Thank you for the reply. I entered a Graph URI for the import into the
>> Target URL field. Is this the correct place to enter it?
>>
>> I also selected the "Do not store crawled documents" radio button
>> option as I don't care if the data is stored locally in WEBDAV.
>>
>> However, when I chose this option I receive the warning message "Host,
>> Target collection and Start path must be specified."
>>
>> So it seems that providing a webdav path is required no matter what?
>>
>> I also have installed the RDF sponger cartridges and have those
>> checked as well as "Accept RDF."
>>
>> Alternatively, I tried entering my URI into the field where it states:
>> "If Graph IRI is unassigned use this Data Source URL:"
>>
>> When I try using this alternative data source URL field I receive the
>> warning message, "Incorrect target URL."
>>
>> Appreciate any tips or advice you have for using the crawler interface
>> with RDFa. Will also take a look at the linked data folder setup you
>> suggested.
>>
>> Kind Regards,
>>
>> J Haag
>>
>>
>> Date: Wed, 30 Sep 2015 20:25:50 -0400
>> From: Kingsley Idehen <kide...@openlinksw.com>
>> Subject: Re: [Virtuoso-users] Automating RDF data imports in VIrtuoso
>> To: virtuoso-users@lists.sourceforge.net
>> Message-ID: <560c7d8e.7020...@openlinksw.com>
>> Content-Type: text/plain; charset="windows-1252"
>>
>> On 9/30/15 6:10 PM, Haag, Jason wrote:
>>> Thanks Daniel. I'm running Linux/Debian + Ubuntu. I am able to import
>>> the HTML/RDFa file using the crawler. I'm able to find it in WEBDAV.
>>> It just doesn't populate the database. If I import the same data in
>>> RDF/XML format it does populate the database.
>> Did you set a Named Graph IRI in the import/crawl job? That's how you
>> get data into the quad store. Storing the ingest data to WebDAV is
>> totally optional. Also note that you can invoke the sponger too, as part
>> of this crawling functionality.
>>
>> You can also make Linked Data Folder Types that are mapped to named
>> graph IRIs as part of the folder config. Once in place you can make said
>> folder the target of RDF content that you want to import into the quad
>> store.
>>
>> [1] https://www.pinterest.com/kidehen/virtuoso-universal-server-related/
>> -- I've added 4 screenshots showcasing Linked Data Folder Type setup
>>
>>
>> Kingsley
> ------------------------------------------------------------------------------
> _______________________________________________
> Virtuoso-users mailing list
> Virtuoso-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/virtuoso-users
>


-- 
Regards,

Kingsley Idehen       
Founder & CEO 
OpenLink Software     
Company Web: http://www.openlinksw.com
Personal Weblog 1: http://kidehen.blogspot.com
Personal Weblog 2: http://www.openlinksw.com/blog/~kidehen
Twitter Profile: https://twitter.com/kidehen
Google+ Profile: https://plus.google.com/+KingsleyIdehen/about
LinkedIn Profile: http://www.linkedin.com/in/kidehen
Personal WebID: http://kingsley.idehen.net/dataspace/person/kidehen#this

