Re: Basic Solr Configurations and best practice

2010-11-28 Thread Darx Oman
thanx Alexey I downloaded Solr 4 and implemented the TikaEntityProcessor, it worked fine with Tika 0.6. didn't work with Tika 0.7 nor Tika 0.8 SNAPSHOT On Sat, Nov 27, 2010 at 4:05 AM, Alexey Serba wrote: > > 1- How to combine data from DIH and content extracted from file > system > > docu

Re: multi-valued with metadata?

2010-11-28 Thread Binkley, Peter
The best Solr solution often involves indexing the same source fields into several Solr fields for different purposes. In this case I'd go with your idea of delimiting the author id, along with indexing it separately: A001 111 John Smith 111|John Smith The field author_id_name could be stored but

Re: Logging queries and hit count

2010-11-28 Thread Jan Høydahl / Cominvent
You can also configure your logging framework to output the relevant logs to a separate file: log4j.logger.org.apache.solr.core.SolrCore=INFO, A1 This way you'll avoid too much noise from other componets, but you'll get all update and admin requests as well, so you'll have to filter on core nam

Re: Is this sort order possible in a single query?

2010-11-28 Thread Jan Høydahl / Cominvent
> I can't see a way to do it without functionqueries at the moment, which > doesn't mean there isn't any. If you want to use the suggested sort method, you could probably sort first by score: sort=score desc, num_copies desc, num_comments desc To let the score be influenced by exact author match

Re: Facet.query and collapsing

2010-11-28 Thread Markus Jelsma
http://wiki.apache.org/solr/FieldCollapsing#Known_Limitations > Hi All, > > I'm in a situation where I need to perform a facet on a query with field > collapsing. > > Let's say the main query is something like this > > title:apple&fq={!tag=sources}source_id:(33 OR > 44)&facet=on&facet.field={!e

Re: multi-valued with metadata?

2010-11-28 Thread Erick Erickson
Two things come to mind, neither optimal, but... First, index both author and ID with a delimiter, something like Mary Johnson | 222 and deal with breaking that info up for display when you were displaying the documents. Make sure your analyzer breaks this up appropriately or your searching will b

multi-valued with metadata?

2010-11-28 Thread Andrew Houghton
I've done my best to search through the archives for this problem, and found at least one person dealing with a similar issue (with no responses).  I'm sure this has been asked more than once before, but my search-fu is apparently lacking. Essentially, I need to be able to retrieve some metadata (