Solr 4- You mean the Solr 'trunk' source or the Solr 1.4.1 release?
The 1.4.1 release does not have the TikaEntityProcessor, only the /extract code.
The Solr 3.x branch and the trunk have the TikaEP. I use the 3.x
branch and, well, the TikaEP has a few problems but can be hacked
around.
Whatever
thanx Alexey
I downloaded Solr 4 and implemented the TikaEntityProcessor, it worked fine
with Tika 0.6.
didn't work with Tika 0.7 nor Tika 0.8 SNAPSHOT
On Sat, Nov 27, 2010 at 4:05 AM, Alexey Serba wrote:
> > 1- How to combine data from DIH and content extracted from file
> system
> > docu
> 1- How to combine data from DIH and content extracted from file system
> document into one document in the index?
http://wiki.apache.org/solr/TikaEntityProcessor
You can have one sql entity that retrieves metadata from database and
another nested entity that parses binary file into additiona