Exact match search problem

2009-02-01 Thread mahendra mahendra
Hi, I have indexed my data as "custom123, customer, custom" for the "UserName" field. I need to search the records for an exact match. When I search with UserName:"customer", I also find the records where UserName is custom123 and custom. As per my understanding, Solr splits the Alpha

Dynamic fields in schema.xml file

2009-02-01 Thread Sagar Khetkade
Hi, I am trying out the dynamic field in schema.xml with its attribute as true. Right now I am indexing 1 articles having five fields, of which two fields are explicitly mentioned as text fields and the others are dynamic fields. But while searching, if the query is fired on the last of the dyn

Re: solr as the data store

2009-02-01 Thread Lance Norskog
Problems: 1) If you get the schema wrong it is painful to live with. You may need to extract all data and reindex with your new schema. To ease this I wrote an XSL script that massaged the default Solr XML output into the Solr XML input format. Extracting is really slow and this process took days.

Dismax q.alt field for field level boosting

2009-02-01 Thread tushar kapoor
Hi, I am trying to test relevancy of results with the q.alt field on a Dismax Request Handler. Term level boosting based on bq information in solrconfig.xml works fine. However field level boosting based on the qf information in solrconfig.xml doesn't seem to work. Query q=&q.alt=for&rows=1000&qt

Re: Range search question

2009-02-01 Thread Lance Norskog
A bit of Solr Kung Fu on this topic: Let us suppose that your data source cannot be changed to use leading zeroes. Also suppose that the field is required in every record. The copyField directive automatically populates other fields with your input data. If you do this: fieldQuestion type=

Re: Performance "dead-zone" due to garbage collection

2009-02-01 Thread Lance Norskog
Sorting creates a large array with "roughly" an entry for every document in the index. If it is not on an 'integer' field it takes even more memory. If you do a sorted request and then don't sort for a while, that will drop the sort structures and trigger a giant GC. We went through some serious c

Re: Optimizing & Improving results based on user feedback

2009-02-01 Thread Lance Norskog
To avoid the "users only see the first page" problem, one solution is: if the result set has more than one page with high scores near each other, scramble them. That is, if the top 20 results range in score from 19.0 to 20.0, they really are all about the same relevance, so just card-shuffle them.
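The shuffling idea above can be sketched in a few lines. This is only a minimal illustration, not Solr code: the function name `shuffle_near_ties`, the `(doc_id, score)` tuple layout, and the `page_size` and `tolerance` parameters are all assumptions made for the example.

```python
import random

def shuffle_near_ties(results, page_size=20, tolerance=1.0, rng=None):
    """Shuffle the first page of results when their scores are nearly equal.

    `results` is assumed to be a list of (doc_id, score) tuples already
    sorted by descending score. All names here are hypothetical.
    """
    rng = rng or random.Random()
    top = list(results[:page_size])
    rest = list(results[page_size:])
    # Only scramble when the whole window sits within `tolerance` of the
    # best score, e.g. the top 20 ranging from 19.0 to 20.0.
    if top and top[0][1] - top[-1][1] <= tolerance:
        rng.shuffle(top)
    return top + rest
```

Passing a seeded `random.Random` makes the order reproducible, which may help if the same user should see a stable page across requests.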

Re: DIH FileListEntityProcessor recursion and fileName clash

2009-02-01 Thread Shalin Shekhar Mangar
On Mon, Feb 2, 2009 at 2:36 AM, Fergus McMenemie wrote: > Hello > > I have been trying to find out why DIH in FileListEntityProcessor > mode did not appear to be recursing into subdirectories. Going through > FileListEntityProcessor.java I eventually tumbled to the fact that my > filename filter

DIH FileListEntityProcessor recursion and fileName clash

2009-02-01 Thread Fergus McMenemie
Hello I have been trying to find out why DIH in FileListEntityProcessor mode did not appear to be recursing into subdirectories. Going through FileListEntityProcessor.java I eventually tumbled to the fact that my filename filter setting from data-config.xml also applied to directory names.
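The effect described above can be reproduced outside of DIH with a small directory walker. This is a sketch in Python, not the FileListEntityProcessor implementation; the function `list_files` and its `filter_dirs` flag are hypothetical names used only to show how a filename pattern that is also applied to directory names stops recursion.

```python
import os
import re

def list_files(base_dir, file_name_pattern, filter_dirs=False):
    """Recursively collect files whose names match `file_name_pattern`.

    With filter_dirs=True the same pattern is also applied to directory
    names, which mimics the behaviour reported in the message: a
    subdirectory whose name does not match is never entered.
    """
    pattern = re.compile(file_name_pattern)
    found = []
    for entry in sorted(os.listdir(base_dir)):
        path = os.path.join(base_dir, entry)
        if os.path.isdir(path):
            if filter_dirs and not pattern.search(entry):
                continue  # directory name fails the file filter, so skip it
            found.extend(list_files(path, file_name_pattern, filter_dirs))
        elif pattern.search(entry):
            found.append(path)
    return found
```

With a pattern such as `\.xml$`, a subdirectory named `sub` is skipped when `filter_dirs=True`, so files beneath it are never listed.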

Re: distributed search doubts

2009-02-01 Thread Shalin Shekhar Mangar
On Mon, Feb 2, 2009 at 1:53 AM, Marc Sturlese wrote: > > Hey there, > I have asked before about these two questions but got no response... I am > really interested in moving to distributed search so... hopefully someone > can > give me a hand. > > 1. In distributed search, Solr does a search per shard

distributed search doubts

2009-02-01 Thread Marc Sturlese
Hey there, I have asked before about these two questions but got no response... I am really interested in moving to distributed search so... hopefully someone can give me a hand. 1. In distributed search, Solr does a search per shard and gets the score of the documents based on the other documents o

Re: Recent document boosting with dismax

2009-02-01 Thread Erik Hatcher
Is your date_added field multiValued and you've assigned multiple to some documents? Erik On Jan 31, 2009, at 4:12 PM, James Brady wrote: Hi, I'm following the recipe here: http://wiki.apache.org/solr/SolrRelevancyFAQ#head-b1b1cdedcb9cd9bfd9c994709b4d7e540359b1fd for boosting recent do