thanx Alexey
I downloaded Solr 4 and implemented the TikaEntityProcessor, it worked fine
with Tika 0.6.
didn't work with Tika 0.7 nor Tika 0.8 SNAPSHOT
On Sat, Nov 27, 2010 at 4:05 AM, Alexey Serba wrote:
> > 1- How to combine data from DIH and content extracted from file
> system
> > docu
The best Solr solution often involves indexing the same source fields into
several Solr fields for different purposes. In this case I'd go with your
idea of delimiting the author id, along with indexing it separately:
A001
111
John Smith
111|John Smith
The field author_id_name could be stored but
You can also configure your logging framework to output the relevant logs to a
separate file:
log4j.logger.org.apache.solr.core.SolrCore=INFO, A1
This way you'll avoid too much noise from other componets, but you'll get all
update and admin requests as well, so you'll have to filter on core nam
> I can't see a way to do it without functionqueries at the moment, which
> doesn't mean there isn't any.
If you want to use the suggested sort method, you could probably sort first by
score:
sort=score desc, num_copies desc, num_comments desc
To let the score be influenced by exact author match
http://wiki.apache.org/solr/FieldCollapsing#Known_Limitations
> Hi All,
>
> I'm in a situation where I need to perform a facet on a query with field
> collapsing.
>
> Let's say the main query is something like this
>
> title:apple&fq={!tag=sources}source_id:(33 OR
> 44)&facet=on&facet.field={!e
Two things come to mind, neither optimal, but...
First, index both author and ID with a delimiter, something like
Mary Johnson | 222
and deal with breaking that info up for display when you were displaying
the documents. Make sure your analyzer breaks this up appropriately or
your searching will b
I've done my best to search through the archives for this problem, and
found at least one person dealing with a similar issue (with no
responses). I'm sure this has been asked more than once before, but
my search-fu is apparently lacking.
Essentially, I need to be able to retrieve some metadata (