Thanks for the help.. But anyway I solved it using RegexTransformer
In the db_link entity I used RegexTransformer and set the link field as:
and in tika-doc entity I set the Url value as:
${db_link.link}
--
View this message in context:
http://lucene.472066.n3.nabble.com/Tika-url-issue
parts
> by '#' and taking the first path using the method *getFilePath(row)* but no
> luck.
>
> I am still getting the path as stored in db. This gives a *FileNotFound*
> exception while trying to index it and that is obvious because the path is
> incorrect.
>
> What can be done to get only the path and leaving out rest of the path
> having # and all?
>
> Help would be much appreciated :)
>
>
>
>
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Tika-url-issue-tp4139781.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
get only the path and leaving out rest of the path
having # and all?
Help would be much appreciated :)
--
View this message in context:
http://lucene.472066.n3.nabble.com/Tika-url-issue-tp4139781.html
Sent from the Solr - User mailing list archive at Nabble.com.