Re: Multi-words synonyms matching

2012-04-11 Thread elisabeth benoit
<' mapping instead? Something < mairie Have you tried the "=>' mapping instead? Something > like > hotel de ville => mairie > might work for y

Re: using solr to do a 'match'

2012-04-11 Thread jmlucjav
I have done that by getting X top hits, finding the best match among them (combination of Levenshtein distance, contains...tweaked the code till testing showed good results), and then deciding if the candidate was a match or not, again based in custom code plus a user defined leniency value xab -

Re: using solr to do a 'match'

2012-04-11 Thread Mikhail Khludnev
Hi, This use case is similar to matching boolean expression problem. You can find recent thread about it. I have an idea that we can introduce disjunction query with dynamic mm (minShouldMatch parameter http://lucene.apache.org/core/old_versioned_docs/versions/3_5_0/api/all/org/apache/lucene/searc

Re: using solr to do a 'match'

2012-04-11 Thread Li Li
I searched my mail but nothing found. the thread searched by key words "boolean expression" is Indexing Boolean Expressions from joaquin.delgado to tell which terms are matched, for BooleanScorer2, a simple method is to modify DisjunctionSumScorer and add a BitSet to record matched scorers. When co

Re: Large Index and OutOfMemoryError: Map failed

2012-04-11 Thread Michael McCandless
Hi, 65K is already a very large number and should have been sufficient... However: have you increased the merge factor? Doing so increases the open files (maps) required. Have you disabled compound file format? (Hmmm: I think Solr does so by default... which is dangerous). Maybe try enabling

Solr Http Caching

2012-04-11 Thread Kissue Kissue
Hi, Are any of you using Solr Http caching? I am interested to see how people use this functionality. I have an index that basically changes once a day at midnight. Is it okay to enable Solr Http caching for such an index and set the max age to 1 day? Any potential issues? I am using solr 3.5 wit

Re: I've broken delete in SolrCloud and I'm a bit clueless as to how

2012-04-11 Thread Darren Govoni
Hard to say why its not working for you. Start with a fresh Solr and work forward from there or back out your configs and plugins until it works again. On Tue, 2012-04-10 at 17:15 -0400, Benson Margulies wrote: > In my cloud configuration, if I push > > > *:* > > > followed by: > > > > I

Re: EmbeddedSolrServer and StreamingUpdateSolrServer

2012-04-11 Thread pcrao
Hi, Any update on this? Please let me know if you need additional information on this. Thanks, PC Rao. -- View this message in context: http://lucene.472066.n3.nabble.com/EmbeddedSolrServer-and-StreamingUpdateSolrServer-tp3889073p3902171.html Sent from the Solr - User mailing list archive at Na

solr 3.4 with nTiers >= 2: usage of ids param causes NullPointerException (NPE)

2012-04-11 Thread Dmitry Kan
Hello, Hopefully this question is not too complex to handle, but I'm currently stuck with it. We have a system with nTiers, that is: Solr front base ---> Solr front --> shards Inside QueryComponent there is a method createRetrieveDocs(ResponseBuilder rb) which collects doc ids of each shard and

Re: EmbeddedSolrServer and StreamingUpdateSolrServer

2012-04-11 Thread Mikhail Khludnev
Hi, it's hard to help until you tell us why you think that index is corrupted. Logs&steps&stacktraces are useful. Regards On Wed, Apr 11, 2012 at 2:56 PM, pcrao wrote: > Hi, > > Any update on this? > Please let me know if you need additional information on this. > > Thanks, > PC Rao. > > -- >

Re: I've broken delete in SolrCloud and I'm a bit clueless as to how

2012-04-11 Thread Benson Margulies
See https://issues.apache.org/jira/browse/SOLR-3347. I can replace the solrconfig.xml with the vanilla solrconfig.xml and the problem remains. On Wed, Apr 11, 2012 at 6:35 AM, Darren Govoni wrote: > Hard to say why its not working for you. Start with a fresh Solr and > work forward from there or

Re: I've broken delete in SolrCloud and I'm a bit clueless as to how

2012-04-11 Thread Benson Margulies
I didn't have a _version_ field, since nothing in the schema says that it's required! On Wed, Apr 11, 2012 at 6:35 AM, Darren Govoni wrote: > Hard to say why its not working for you. Start with a fresh Solr and > work forward from there or back out your configs and plugins until it > works again.

Re: How to get a list of values of a specified field

2012-04-11 Thread a sd
The type of content is "solr.string", actually is a sequence of any characters,"_",number,etc. On Wed, Apr 11, 2012 at 7:06 PM, Marcelo Carvalho Fernandes < mcf2...@gmail.com> wrote: > What type of content do you have in this field? > > --- > Marcelo Carvalho Fernandes > > On Wednesday, April 11,

Re: How to facet data from a multivalued field?

2012-04-11 Thread Thiago
Thank you very much, Erik. I just changed the fieldtype to String and it worked as I expected. Now I can select the count of the series. Thanks again and thanks the others too. Thiago Erik Hatcher-4 wrote > > Thiago - > > You'll want your series field to be of type "string". If you also need

Re: Multi-words synonyms matching

2012-04-11 Thread Jeevanandam Madanagopal
Elisabeth - As you described, below mapping might suit for your need. mairie => hotel de ville, mairie mairie gets expanded to "hotel de ville" and "mairie" at index time. So "mairie" and "hotel de ville" searchable on document. However, still white space tokenizer splits at query time will be

Re: solr hangs

2012-04-11 Thread Pawel Rog
You wrote that you can see such error "OutOfMemoryError". I had such problems when my caches were to big. It means that there is no more free memory in JVM and probably full gc starts running. How big is your Java heap? Maybe cache sizes in yout solr are to big according to your JVM settings. -- R

Re: Facets involving multiple fields

2012-04-11 Thread Erick Erickson
Have you considered facet.query? You can specify an arbitrary query to facet on which might do what you want. Otherwise, I'm not sure what you mean by "faceted search using two fields". How should these fields be combined into a single facet? What that means practically is not at all obvious from y

Re: Default qt on SolrCloud

2012-04-11 Thread Erick Erickson
What does your "query" request handler look like? By adding qt=standard you're specifying the standard request handler, whereas your ...solr/query?q=*:* format goes at the request handler you named "query" which presumably you've defined in solrconfig.xml... What does &debugQuery=on show? Best Er

Re: term frequency outweighs exact phrase match

2012-04-11 Thread Erick Erickson
Consider boosting on phrase with a SHOULD clause, something like field:"apache solr"^2.. Best Erick On Tue, Apr 10, 2012 at 12:46 PM, wrote: > Hello, > > I use solr 3.5 with edismax. I have the following issue with phrase search. > For example if I have three documents with content like > > 1

Re: Suggester not working for digit starting terms

2012-04-11 Thread Erick Erickson
Hmmm, I can't pursue this right now, anyone want to jump in? Erick On Tue, Apr 10, 2012 at 2:41 PM, jmlucjav wrote: > I have double checked and still get the same behaviour. My field is: >                 positionIncrementGap="100"> >                         >                                 cla

Re: custom query string parsing?

2012-04-11 Thread sam ”
Yah, RequestHandler is much better. Thanks! I don't know why I started with QParserPlugin and SearchComponent. Even with my own RequestHandler that only passes down selected query params, people can still get around it through qt parameter: ?qt=/update&stream.update=*:*&commit=true I think I

Does the lucene can read the index file from solr?

2012-04-11 Thread neosky
both are version 3.5 I have tried that the solr can read the index file by lucene, but I tried to use the lucene to read the index file from a specific field. It returns me the result when I do the *.* search -- View this message in context: http://lucene.472066.n3.nabble.com/Does-the-lucene-can-

Re: custom query string parsing?

2012-04-11 Thread sam ”
Actually, /solr/mycore/myhandler/?qt=/updatestill uses my handler. Only /solr/mycore/select/?qt=/update uses update handler :P On Wed, Apr 11, 2012 at 11:41 AM, sam ” wrote: > Yah, RequestHandler is much better. Thanks! I don't know why I started > with QParserPlugin and SearchComponent

Re: How to get a list of values of a specified field

2012-04-11 Thread Erick Erickson
Consider using the TermsComponent (http://wiki.apache.org/solr/TermsComponent) You could get some number of terms from your field at a time by judicious use of, say, facet.prefix if you wanted. But why do you want to do this? It's kind of an odd requirement, and since you say there are millions of

Re: Boost differences in two environments for same query and config

2012-04-11 Thread Erick Erickson
Well, you're matching a different number of records, so I have to assume your indexes are different on the two machines. Here is one case where doing an optimize might make sense, that'll purge the data associated with any deleted records from the index which should make comparisons better Ad

Re: custom query string parsing?

2012-04-11 Thread Chris Hostetter
: Only /solr/mycore/select/?qt=/update uses update handler :P or just register your handler using the name "/select" then the request dispatcher will use it, and ignore "qt". In trunk, the legacy SolrServlet has been removed, so you'll be able to set handleSelect="false" on the and not worry

Re: Does the lucene can read the index file from solr?

2012-04-11 Thread Erick Erickson
Solr uses Lucene, so any index written with Solr should be usable by Lucene and vice-versa. But searching will be significantly different in the sense that Solr wraps the raw Lucene search so you'll have to make sure your use use of Lucene is compatible with your Solr configurations if you compare

Re: Default qt on SolrCloud

2012-04-11 Thread Benson Margulies
On Wed, Apr 11, 2012 at 11:19 AM, Erick Erickson wrote: > What does your "query" request handler look like? By adding qt=standard > you're specifying the standard request handler, whereas your > ...solr/query?q=*:* format goes at the request handler you named > "query" which presumably you've defi

SOLR 4 autocommit - is it working as I think it should?

2012-04-11 Thread vybe3142
I've gotten past most of my initial hurdles with SOLR, with some useful suggestions from this group. Thank You. On to tweaking. This morning, I've been looking at the autocommit functionality as defined in solrconfig.xml. By default, it appears that it should kick in 15 seconds after a new doc

Re: SOLR 4 autocommit - is it working as I think it should?

2012-04-11 Thread Yonik Seeley
On Wed, Apr 11, 2012 at 12:58 PM, vybe3142 wrote: > This morning, I've been looking at the autocommit functionality as defined > in solrconfig.xml. By default, it appears that it should kick in 15 seconds > after a new document has been added. I do see this event triggered via the > SOLR/tomcat lo

solr 3.5 taking long to index

2012-04-11 Thread Rohit
We recently migrated from solr3.1 to solr3.5, we have one master and one slave configured. The master has two cores, 1) Core1 - 44555972 documents 2) Core2 - 29419244 documents We commit every 5000 documents, but lately the commit is taking very long 15 minutes plus in some cases. What co

Re: SOLR 4 autocommit - is it working as I think it should?

2012-04-11 Thread vybe3142
Thanks, makes perfect sense -- View this message in context: http://lucene.472066.n3.nabble.com/SOLR-4-autocommit-is-it-working-as-I-think-it-should-tp3903135p3903353.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Moving to Maven from Ant solr.build.dir Not Found

2012-04-11 Thread Eli Finkelshteyn
Alright, for those interested, I got this to work using the dependencies I mentioned before by swapping in the web.xml from the latest nightly build and passing in parameters for my custom stuff. I'm now running Solr with all my dependencies neatly stashed away in Maven and custom code just bui

Re: Suggester not working for digit starting terms

2012-04-11 Thread jmlucjav
Just to be sure, reproduced this with example config from 3.5. 1. add to schema.xml

Question about solr.WordDelimiterFilterFactory

2012-04-11 Thread Jian Xu
Hello, I am new to solr/lucene. I am tasked to index a large number of documents. Some of these documents contain decimal points. I am looking for a way to index these documents so that adjacent numeric characters (such as [0-9.,]) are treated as single token. For example, 12.34 => "12.34" 12,

Re: How to get a list of values of a specified field

2012-04-11 Thread a sd
I know,i know, This is a very expensive operation,the requirement is also very odd ,but is also very real. It is actually desired to go through the whole documents within lucene again and again. List the all potential value of a specified, and then divide the all work (to go through) into a series

Re: solr 3.5 taking long to index

2012-04-11 Thread Lance Norskog
It's telling you the problem. Try your solrconfig.xml against the one in 3.5/solr/example/solr/conf. You will what has changed in the suggested tools. On Wed, Apr 11, 2012 at 10:42 AM, Rohit wrote: > We recently migrated from solr3.1 to solr3.5,  we have one master and one > slave configured. T

Solr 3.5 takes very long to commit gradually

2012-04-11 Thread Rohit
We recently migrated from solr3.1 to solr3.5, we have one master and one slave configured. The master has two cores, 1) Core1 - 44555972 documents 2) Core2 - 29419244 documents We commit every 5000 documents, but lately the commit time gradually increase and solr is taking as very long 15 minute

Re: Suggester not working for digit starting terms

2012-04-11 Thread Robert Muir
On Wed, Apr 11, 2012 at 4:37 PM, jmlucjav wrote: > Just to be sure, reproduced this with example config from 3.5. > Regardless of your tokenizer, be aware that with this version of solr its going to split up terms based on 'identifier rules' (including splitting on whitespace). This is because su

Re: solr 3.5 taking long to index

2012-04-11 Thread Bernd Fehling
There were some changes in solrconfig.xml between solr3.1 and solr3.5. Always read CHANGES.txt when switching to a new version. Also helpful is comparing both versions of solrconfig.xml from the examples. Are you sure you need a MaxPermSize of 5g? Use jvisualvm to see what you really need. This i

Re: Multi-words synonyms matching

2012-04-11 Thread elisabeth benoit
oh, that's right. thanks a lot, Elisabeth 2012/4/11 Jeevanandam Madanagopal > Elisabeth - > > As you described, below mapping might suit for your need. > mairie => hotel de ville, mairie > > mairie gets expanded to "hotel de ville" and "mairie" at index time. So > "mairie" and "hotel de ville"