Hi Alexander,

If you prefer to leave them uncompressed then fine, change the reference in the 
ld_dir () procedure to "*.nt" in that case. The gz recommendation was more to 
save space on the machine ...

BTW, their is rogue "i" character on the end of the graph name in the ld_dir 
command below I just noticed, which should be:

        "ld_dir ('/opt/lgdelement', *.gz', 'http://linkedgeodata.org#')"

Best Regards
Hugh Williams
Professional Services
OpenLink Software
Web: http://www.openlinksw.com
Support: http://support.openlinksw.com
Forums: http://boards.openlinksw.com/support
Twitter: http://twitter.com/OpenLink

On 16 Apr 2010, at 12:03, Alexander Sidorov wrote:

> Hi Hugh,
> 
> Thank you for feedback. I am decompressing the dataset right now... But I 
> didn't understand why I should compress the dataset again (using gzip) if I 
> can load uncompressed .nt file.
> 
> Regards,
> Alexander
> 
> 2010/4/16 Hugh Williams <hwilli...@openlinksw.com>
> Hi Alexander,
> 
> The Virtuoso Bulk loader scripts do not support the extraction of "bz2" 
> compressed files, only gzip compressed or uncompressed files, so you will 
> need to uncompress the LGD bz2 file(s) first. Also, as you have "*.*" in the 
> ld_dir() command the loader is trying to load all file in the directory 
> including the global.graph file as if it is a dataset, which it is not. 
> 
> Thus I would suggest you do the following:
> 
> 1) Compress the LDG NT files you want to load with gzip
> 2) Remove all entries from the "DB.DBA.LOAD_LIST" table
> 3) Run "ld_dir ('/opt/lgdelement', '*.gz', 'http://linkedgeodata.org#i')"
> 4) Run "rdf_loader_run()"
> 
> Let us know if this now works for you ...
> 
> Best Regards
> Hugh Williams
> Professional Services
> OpenLink Software
> Web: http://www.openlinksw.com
> Support: http://support.openlinksw.com
> Forums: http://boards.openlinksw.com/support
> Twitter: http://twitter.com/OpenLink
> 
> On 16 Apr 2010, at 08:11, Alexander Sidorov wrote:
> 
>> Hello!
>> 
>> I'm trying to load LGD Elements dataset to my EC2 instance but as result get 
>> the following error at logs:
>> 
>> 02:46:41 PL LOG:  File /opt/lgdelements/global.graph error 37000 SP029: 
>> TURTLE RDF loader, line 1: Undefined namespace prefix at 
>> http://linkedgeodata.org#
>> 02:46:41 PL LOG:  File /opt/lgdelements/lgd-elements.nt.bz2 error 37000 
>> SP029: TURTLE RDF loader, line 8: syntax error
>> 
>> Here is my ld_dir configuration:
>> 
>> ld_dir ('/opt/lgdelements', '*.*', 'http://linkedgeodata.org#');
>> 
>> Folder /opt/lgdelements contains file global.graph that contains 
>> http://linkedgeodata.org string.
>> 
>> Regards,
>> Alexander
>> ------------------------------------------------------------------------------
>> Download Intel&#174; Parallel Studio Eval
>> Try the new software tools for yourself. Speed compiling, find bugs
>> proactively, and fine-tune applications for parallel performance.
>> See why Intel Parallel Studio got high marks during beta.
>> http://p.sf.net/sfu/intel-sw-dev_______________________________________________
>> Virtuoso-users mailing list
>> Virtuoso-users@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/virtuoso-users
> 
> 

Reply via email to