edismax and mm: strange behaviour

2015-01-10 Thread leostro
Hi all I'm studying SOLR for implement it in my website. I've imported the db and I'm making some tests about edismax and mm. I'm searching for documents containing "xbox 360". - If I specifiy mm=100% (I have the same result setting default operator to "AND") SOLR give me 5 documents: [http://loc

ant compile = not able to resolve dependencies

2015-01-10 Thread Anurag Sharma
Hi, I am facing issue in resolving the dependency while doing 'ant compile' on trunk, seeing following error: [ivy:retrieve] :: UNRESOLVED DEPENDENCIES :: [ivy:retrieve] :: [ivy:retrieve] :: org.restlet.jee#or

Re: How does text-rev work?

2015-01-10 Thread Jack Krupansky
No, it's just what I said. Read the original Jira (from Solr 1.4!): https://issues.apache.org/jira/browse/SOLR-1321 You, the user, need to specify and populate a reversed wildcard field if that is want you want - it is not magic and not automatic. This should be documented as an "expert" feature

Re: ignoring bad documents during index

2015-01-10 Thread SolrUser1543
>From reading this (https://issues.apache.org/jira/browse/SOLR-445) I see that there is no solution provided for the issue of aggregating responses from several solr instances is available . Solr is not able to do that ? -- View this message in context: http://lucene.472066.n3.nabble.com/ig

Re: ignoring bad documents during index

2015-01-10 Thread Jack Krupansky
Correct, Solr clearly needs improvement in this area. Feel free to comment on the Jira about what options you would like to see supported. -- Jack Krupansky On Sat, Jan 10, 2015 at 5:49 AM, SolrUser1543 wrote: > From reading this (https://issues.apache.org/jira/browse/SOLR-445) I see > that > t

Re: ignoring bad documents during index

2015-01-10 Thread SolrUser1543
Would it be a good solution to index single document instead of bulk ? In this case I will know about the status of each message . What is recommendation in this case : Bulk vs Single ? -- View this message in context: http://lucene.472066.n3.nabble.com/ignoring-bad-documents-during-index-t

Re: ignoring bad documents during index

2015-01-10 Thread Jack Krupansky
Sending individual documents will give you absolute control - just make sure not to "commit" on each document sent since that would really slow down indexing. You could also send smaller batches, life 5 to 20 documents to balance between fine control and performance. It also depends on your docume

Re: edismax and mm: strange behaviour

2015-01-10 Thread Ahmet Arslan
Hi, What are query fields qf and their field types? Ahmet On Saturday, January 10, 2015 11:10 AM, leostro wrote: Hi all I'm studying SOLR for implement it in my website. I've imported the db and I'm making some tests about edismax and mm. I'm searching for documents containing "xbox 360". -

document highlights positions

2015-01-10 Thread Kempelen Ákos
We would like to get the highlights position informations when searching within one document. The fulltext field is not stored but it has termVectors, termPositions and termOffsets. The original document is stored in an outside database. As I understand the highlighted text can not be reconstruct

Re: ant compile = not able to resolve dependencies

2015-01-10 Thread Anurag Sharma
Any directions to resolve it is also helpful. On Sat, Jan 10, 2015 at 2:48 PM, Anurag Sharma wrote: > Hi, > > I am facing issue in resolving the dependency while doing 'ant compile' on > trunk, seeing following error: > > [ivy:retrieve] :: UNRESOLVED DEPENDENCIES :: > [

Re: ant compile = not able to resolve dependencies

2015-01-10 Thread Erik Hatcher
I think you’ll need to clear your ivy cache. See last comment here: https://issues.apache.org/jira/browse/SOLR-4839 > On Jan 10, 2015, at 9:30 AM, Anurag Sharma wrote: > > Any directions to resolve it is also helpful. > > On Sat, Jan 10, 201

Re: edismax and mm: strange behaviour

2015-01-10 Thread Jack Krupansky
Why are you using the mm parameter at all? In my experience, anyone setting mm to 0 or 100% is misusing the mm feature. mm stands for "minimum should match" and is designed to give expert users fine control over recall when terms are optional ("should" occur but are not "required".) So, please expl

Re: edismax and mm: strange behaviour

2015-01-10 Thread leostro
Hi Ahmet, I don't specify any qf in this query. Reading here (http://wiki.apache.org/solr/ExtendedDisMax#mm_.28Minimum_.27Should.27_Match.29) it seems that mm is referred to the text provided as "q" in querystring, I am wrong? Reading the doc above, my expectation is that if I specify a q value wi

Re: edismax and mm: strange behaviour

2015-01-10 Thread leostro
Hi Jack, I read the documentation here: http://wiki.apache.org/solr/ExtendedDisMax#mm_.28Minimum_.27Should.27_Match.29 My question is quite simple, maybe it's not clear for my poor english. As explained in the response to ahmet my goal is to get ALL and ONLY the documents that contains the two wo

Re: ignoring bad documents during index

2015-01-10 Thread Erick Erickson
There are some significant throughput improvements when you batch up a bunch of docs to Solr (assuming SolrJ). You can go ahead and send, say, 1,000 docs in a batch and if the batch fails, re-process the list to find the bad doc. But as Jack says, Solr could do better here. Best, Erick On Sat, J

Re: edismax and mm: strange behaviour

2015-01-10 Thread Ahmet Arslan
Hi, Basically, (e)dimax is designed to search over multiple fields. It could be used to search over single field. Please see for more about it : https://lucidworks.com/blog/whats-a-dismax/ You mention title field but I don't see title in search URLs you provided. So my question remains, what fi