On 8 May 2014 12:25, Visser, Marc <marc.viss...@ordina.nl> wrote:
>
> HI All,
> Recently I have set up an image with SOLR. My goal is to index and extract 
> files on a Windows and Linux server. It is possible for me to index and 
> extract data from multiple file types. This is done by the SOLR CELL request 
> handler. See the post.jar cmd below.
>
> j ava -Dauto -Drecursive -jar post.jar Y:\ SimplePostTool version 1.5 Posting 
> files to base url localhost:8983/solr/update.. Entering auto mode. File 
> endings considered are xml,json,csv,pdf,doc,docx,ppt,pp 
> tx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log Entering recursive 
> mode, max depth=999, delay=0s 0 files indexed.
>
> Is it possible to index and extract metadata/content from file types like .sh 
> and .sql? If it is possible I would like to know how of course :)

Don't know about Windows, but on Linux these are just text files. What
metadata are you referring to? Normally, a Linux text file only has content,
unless you are talking about metadata such as obtained from:
   file cmd.sh

Regards,
Gora

Reply via email to