"order" question on solr multi value field

2012-12-17 Thread hellorsanjeev
Hi - We have been using solr since over an year. Some of our fields are multi valued. We are saving and retrieving data from those multi valued fields perfectly fine assuming the order in which we save, we will get the values in the same order. The question is - Is it documented somewhere that Sol

Solr Cloud 4.0 Production Ready?

2012-12-17 Thread Cool Techi
Hi, We have been using solr 3.5 in our production for sometime now and facing the problems faced by a large solr index. We wanted to migrate to Solr Cloud and have started some experimentation. But in the mean time also following the user forum and seem to be noticing a lot of bugs which were r

Solr atomic update of multi-valued field

2012-12-17 Thread Dikchant Sahi
Hi, Does Solr 4.0 allows to update the values of multi-valued field? Say I have list of values for skills field like java, j2ee and i want to change it to solr, lucene. I was trying to play with atomic updates and below is my observation: I have following document in my index: 1 Dikchant softwa

Re: Beginner's view

2012-12-17 Thread Shawn Heisey
On 12/17/2012 11:55 PM, Alexandre Rafalovitch wrote: Again, thinking from the beginner's view, is there any reason Solr throws scary looking exception traces in the console when it cannot find optional html files for admin interfaces (e.g. admin-extra.menu-top.html). From my days of tech suppor

Re: configuring per-field similarity in Solr 4: "the global similarity does not support it"

2012-12-17 Thread Tom Burton-West
Thanks Markus! Adding fixed the problem. >>Keep in mind that coord and queryNorm (=1.0f) are not implemented now, so you will get different scores for TF-IDF! Can you explain more about this, or is it documented somewhere? Do I need to read the source for solr.SchemaSimilarityFactory? Is there

RE: configuring per-field similarity in Solr 4: "the global similarity does not support it"

2012-12-17 Thread Markus Jelsma
Hi Tom, The global similarity must be able to delegate similarity to your per-field setting. Solr has the SchemaSimilarityFactory that can do this. Please replace your global similarity with: Keep in mind that coord and queryNorm (=1.0f) are not implemented now, so you will get different sco

configuring per-field similarity in Solr 4: "the global similarity does not support it"

2012-12-17 Thread Tom Burton-West
Hello, I have Solr 4 configured with several fields using different similarity classes according to: http://wiki.apache.org/solr/SchemaXml#Similarity However, I get this error message: " FieldType 'DFR' is configured with a similarity, but the global similarity does not support it: class org.apac

RE: Spell Check is not working properly

2012-12-17 Thread Dyer, James
The spellcheckers (IndexBasedSpellChecker and DirectSolrSpellChecker) both have tuning parameters that control how similar a potential correction needs to be from the original query term in order to be considered. For IndexBasedSpellChecker, there is "spellcheck.accuracy", which should be a nu

Re: fieldType custom search

2012-12-17 Thread Otis Gospodnetic
Hi Antoine, We didn't use grouping. We didn't try to reorder millions of documents because that would mean they would all need to first read from disk/cache, then reordered, and then returned, which would be slooow. We did get more docs than we needed because in our case our goal was to diversify

Re: Large import making solr unresponsive

2012-12-17 Thread Otis Gospodnetic
Hi Brent, You said "from what I can tell there is no disk, network, or memory pressure " - maybe you can share what and how you checked this? (see my signature for a tool that can help with this) I'm asking because the above is in conflict with "responses from solr still come back with a <10ms qt

Re: How to retrieve the maximum id in one core/collection effectively

2012-12-17 Thread Otis Gospodnetic
Hello, Have a look at http://wiki.apache.org/solr/StatsComponent Otis -- SOLR Performance Monitoring - http://sematext.com/spm/index.html Search Analytics - http://sematext.com/search-analytics/index.html On Mon, Dec 17, 2012 at 9:10 AM, SuoNayi wrote: > Hi dear list, > > How to retrieve th

Spell Check is not working properly

2012-12-17 Thread Dixline
When i try spell check with query parameter q=testtt it is returning the results properly but when i try with q=tett i'm not getting any suggestions. The correct value is test. Why does spell check work properly for certain query where it fails in certain cases? Is there any format for the quer

Re: Solr3.5 PatternTokenizer / Search Analyzer tokenizing always at whitespace?

2012-12-17 Thread Dirk Högemann
Ah - now I got it. My solution to this was to use phrase queries - now I know why: Thanks! 2012/12/17 Jack Krupansky > No, the "query" analyzer tokenizer will simply be applied to each term or > quoted string AFTER the query parser has already parsed it. You may have > escaped or quoted character

Re: if I only need exact search, does frequency/score matter?

2012-12-17 Thread Jie Sun
Hi Otis, do you think I should customize both tf and idf to disable the term frequency? i.e. something like: public float tf(float freq) { return freq > 0 ? 1.0f : 0.0f; } public float idf(int docFreq, int numDocs) { return docFreq > 0 ? 1.0f : 0.0f; } t

Re: if I only need exact search, does frequency/score matter?

2012-12-17 Thread Jie Sun
thanks, this is very helpful -- View this message in context: http://lucene.472066.n3.nabble.com/if-I-only-need-exact-search-does-frequency-score-matter-tp4026893p4027559.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr3.5 PatternTokenizer / Search Analyzer tokenizing always at whitespace?

2012-12-17 Thread Jack Krupansky
No, the "query" analyzer tokenizer will simply be applied to each term or quoted string AFTER the query parser has already parsed it. You may have escaped or quoted characters which will then be seen by the analyzer tokenizer. -- Jack Krupansky -Original Message- From: Dirk Högemann

Re: Searches with phonetics

2012-12-17 Thread Steve Rowe
In addition to reindexing, you should insure that your query analyzer has the same lowercasing behavior as your index analyzer. Otherwise "july" may not match "July", and vice versa. On Dec 17, 2012, at 4:57 AM, Sangeetha wrote: > Hi, > > I have not done anything in my schema.xml for phoneti

Re: how to understand this benchmark test results (compare index size after schema change)

2012-12-17 Thread Jie Sun
thanks Erik ... I did run optimize on both indices to get ride of the deleted data when compare to each other. (and my benchmark tests were just indexing 5000 new documents without duplicates...into a new core... but I did optimize just to make sure). I think one results is consistent that the .f

Re: Highlighting data stored outside of Solr

2012-12-17 Thread P Williams
Your problem seems really similar to "It should be possible to highlight external text" in JIRA. Tricia [https://issues.apache.org/jira/browse/SOLR-1397] On Tue, Dec 11, 2012 at 12:48 PM, Michael Ryan wrote: > Has anyone ever attempted to highli

Re: Solr3.5 PatternTokenizer / Search Analyzer tokenizing always at whitespace?

2012-12-17 Thread Dirk Högemann
Ok- right, changed that... Nevertheless I thought I should always use the same analyzers for the query and the index section to have consistent results. Does this mean that the tokenizer in the query section will always be ignored by the given query parsers? 2012/12/17 Jack Krupansky > The que

Faceting on Dynamic fields

2012-12-17 Thread Mohamed Zahoor
Hi I have many dynamic fields in my schema name_X where X can range from 0 to 10. Not all documents will have all the fields from 0 to 10. I want to do a fecet on these fields. I have seen SOLR-247 and other queries in this list. Is there any other way other than patching SOLR-247 on 4.0? ./Z

Index Update for XPathEntityProcessor

2012-12-17 Thread Lighton Phiri
Hello, First, apologies for cross posting; I initially posted this on Stackoverflow [1] and just realised I might have better luck posting the question here. I am relatively new to Solr and currently working the the XPathEntityProcessor DIH. I have a dataset that will be periodically updated with

Re: Wildcard inside " " ?

2012-12-17 Thread Jack Krupansky
Yeah, it's fully integrated into 4.0. There is a patch here that may or may not work with 3.6: https://issues.apache.org/jira/browse/SOLR-2703 -- Jack Krupansky -Original Message- From: Bruno Mannina Sent: Monday, December 17, 2012 9:31 AM To: solr-user@lucene.apache.org Subject: Re:

Re: Solr3.5 PatternTokenizer / Search Analyzer tokenizing always at whitespace?

2012-12-17 Thread Jack Krupansky
The query parsers normally tokenize on white space and query operators, but you can escape any white space with backslash or put the text in quotes and then it will be tokenized by the analyzer rather than the query parser. Also, you have: Change "search" to "query", but that won't change yo

Re: Solr3.5 PatternTokenizer / Search Analyzer tokenizing always at whitespace?

2012-12-17 Thread Lee Carroll
I use *analyzer type*="*query*" can you use search ? On 17 December 2012 11:01, Dirk Högemann wrote: > {!q.op=AND df=cl2Categories_NACE}08 > Gewinnung von Steinen und Erden, sonstiger Bergbau name="parsed_filter_queries">+cl2Categories_NACE:08 > +cl2Categories_NACE:gewinnung +cl2Categories_NAC

Re: Wildcard inside " " ?

2012-12-17 Thread Bruno Mannina
Both method doesn't work on my Solr 3.6, with the error message: _Unknown query type 'surround'_ Le 17/12/2012 15:18, Jack Krupansky a écrit : Not with the standard query parsers, but you can do it with the "surround" query parser: defType=surround&q=title:(plast* w bicycle) Or q={!surro

Re: Wildcard inside " " ?

2012-12-17 Thread Jack Krupansky
Not with the standard query parsers, but you can do it with the "surround" query parser: defType=surround&q=title:(plast* w bicycle) Or q={!surround}title:(plast* w bicycle) Two notes: 1. Surround performs NO analysis, so you have to manually analyze your terms, like lower case them if neces

Re: Core URL with solrj

2012-12-17 Thread Carlos Alexandro Becker
PS: I got the 404 when I try to reload the core, I forgot to say. THanks On Mon, Dec 17, 2012 at 11:50 AM, Carlos Alexandro Becker < caarl...@gmail.com> wrote: > I'm trying to use solrj with the new solr4, but having some issues... > > I'm used to use Solr3. I create the cores with something li

Re: Wildcard inside " " ?

2012-12-17 Thread Bruno Mannina
Le 17/12/2012 14:13, Ahmet Arslan a écrit : Wildcard seems not to work inside double quote request. I get always 0 result. I.e: title:"plastic bicycle" <== 79 results in my database title:"plast* bicycle" <== 0 result found Is exist a solution for that ? Hi Bruno, You can make use of https:/

Re: Wildcard inside " " ?

2012-12-17 Thread Ahmet Arslan
> Wildcard seems not to work inside double quote request. > I get always 0 result. > > I.e: > title:"plastic bicycle" <== 79 results in my database > title:"plast* bicycle" <== 0 result found > > Is exist a solution for that ? Hi Bruno, You can make use of https://issues.apache.org/jira/browse

Wildcard inside " " ?

2012-12-17 Thread Bruno Mannina
Dear Users, Wildcard seems not to work inside double quote request. I get always 0 result. I.e: title:"plastic bicycle" <== 79 results in my database title:"plast* bicycle" <== 0 result found Is exist a solution for that ? Thanks a lot, Bruno

Re: SolrCloud with Near Realtime Search: buildOnOptimize in IndexBasedSpellChecker

2012-12-17 Thread Artyom
Thank you, Upayavira, I know about the DirectSpellChecker. But I want to know how IndexBasedSpellChecker is handled in SolrCloud. -- View this message in context: http://lucene.472066.n3.nabble.com/SolrCloud-with-Near-Realtime-Search-buildOnOptimize-in-IndexBasedSpellChecker-tp4027499p4027514.h

Re: SolrCloud with Near Realtime Search: buildOnOptimize in IndexBasedSpellChecker

2012-12-17 Thread Tomás Fernández Löbbe
It only rebuilds on explicit optimize operations. A background merge that merges all segments (to 1) won't fire the rebuild AFAIK. And Upayavira is right, you can choose to use a DirectSolrSpellChecker, that way you don't need an external index at all. On Mon, Dec 17, 2012 at 9:46 AM, Upayavira

Re: SolrCloud with Near Realtime Search: buildOnOptimize in IndexBasedSpellChecker

2012-12-17 Thread Upayavira
Note with 4.0 you don't need to build a spellcheck index. Spellchecking can happen from your main index (unless you are providing your own dictionary). Upayavira On Mon, Dec 17, 2012, at 12:36 PM, Artyom wrote: > Thank you, Tomás. > > This wiki > http://wiki.apache.org/solr/UpdateXmlMessages#A.2

Re: SolrCloud with Near Realtime Search: buildOnOptimize in IndexBasedSpellChecker

2012-12-17 Thread Artyom
Thank you, Tomás. This wiki http://wiki.apache.org/solr/UpdateXmlMessages#A.22commit.22_and_.22optimize.22 says "*Segments are normally merged over time anyway (as determined by the merge policy), and optimize just forces these merges to occur immediately.*" Doesn't the merge policy affects buil

Re: SolrCloud with Near Realtime Search: buildOnOptimize in IndexBasedSpellChecker

2012-12-17 Thread Tomás Fernández Löbbe
"optimize" operations only occur when you explicitly request for them. All nodes should get the command, so if you have set the "buildOnOptimize" in all nodes (you probably are, as you are using the same configuration) then all of them should rebuild the spellcheck index. Tomás On Mon, Dec 17, 2

Re: Solr3.5 PatternTokenizer / Search Analyzer tokenizing always at whitespace?

2012-12-17 Thread Dirk Högemann
{!q.op=AND df=cl2Categories_NACE}08 Gewinnung von Steinen und Erden, sonstiger Bergbau+cl2Categories_NACE:08 +cl2Categories_NACE:gewinnung +cl2Categories_NACE:von +cl2Categories_NACE:steinen +cl2Categories_NACE:und +cl2Categories_NACE:erden, +cl2Categories_NACE:sonstiger +cl2Categories_NACE:bergbau

Re: Searches with phonetics

2012-12-17 Thread Erik Hatcher
You need to reindex :) Erik On Dec 17, 2012, at 03:59 , Sangeetha wrote: > > > I have docs which contains the word July. If i search with juli also it > return July. > > I have removed in my > schema.xml. Now it return nothing when i search July. It return July only if > i give juli.

SolrCloud with Near Realtime Search: buildOnOptimize in IndexBasedSpellChecker

2012-12-17 Thread Artyom
When an optimization event occurs in this case? Should I reindex this spellchecker on every shard manually? Or does this even occurs every hard or soft commit? -- View this message in context: http://lucene.472066.n3.nabble.com/SolrCloud-with-Near-Realtime-Search-buildOnOptimize-in-IndexBasedSp

Re: Searches with phonetics

2012-12-17 Thread Sangeetha
I have docs which contains the word July. If i search with juli also it return July. I have removed in my schema.xml. Now it return nothing when i search July. It return July only if i give juli. What should i do? I want to search the exact words which are in docs. I need to do auto-suggest al

Solr3.5 PatternTokenizer / Search Analyzer tokenizing always at whitespace?

2012-12-17 Thread Dirk Högemann
Hi, I am not sure if am missing something, or maybe I do not exactly understand the index/search analyzer definition and their execution. I have a field definition like this: Any field starting with cl2 should be recogni

Re: Solrcloud and Node.js

2012-12-17 Thread Per Steffensen
Luis Cappa Banda skrev: Thanks a lot, Per. Now I understand the whole scenario. One last question: I've been searching trying to find some kind of request handler that retrieves cluster status information, but no luck. I know that there exists a JSON called clusterstate.json, but I don't know the

Re: Searches with phonetics

2012-12-17 Thread Erik Hatcher
This is probably due to stemming. Removing the stemming (porter or snowball) from you analysis chains should do the trick. (And reindex) Erik On Dec 17, 2012, at 2:57, Sangeetha wrote: > Hi, > > I have not done anything in my schema.xml for phonetics search. But it > searches and returns

Searches with phonetics

2012-12-17 Thread Sangeetha
Hi, I have not done anything in my schema.xml for phonetics search. But it searches and returns *july *when i give *juli*. But i dont want this. How to avoid that? Thanks, Sangeetha -- View this message in context: http://lucene.472066.n3.nabble.com/Searches-with-phonetics-tp4027487.html Sen

RE: SolrCloud breaks distributed query strings

2012-12-17 Thread Markus Jelsma
Anyone else noticed a similar issue where Solr mangles distributed query parameters? Any hints on how to track this issue? Where to look? Thanks -Original message- > From:Markus Jelsma > Sent: Wed 12-Dec-2012 15:11 > To: solr-user@lucene.apache.org > Subject: RE: SolrCloud breaks dist

Re: Update / replication of offline indexes

2012-12-17 Thread Dikchant Sahi
Thanks Erick and Upayavira! This answers my question. On Mon, Dec 17, 2012 at 8:05 AM, Erick Erickson wrote: > See the very last line here: > http://wiki.apache.org/solr/MergingSolrIndexes > > Short answer is that merging will lead to duplicate documents, even with > uniqueKeys defined. > > So y

Re: Faceted result , based on words specified in search query.

2012-12-17 Thread Upayavira
You can facet on terms in a field. When a document is selected, you cannot facet on some terms for a field in that document, and not others. In your case, if Sony, Samsung and LG were in your 'manufacturer' field, and that is what you searched across, then surely it would be straight-forward to cr

Deploying war

2012-12-17 Thread Arkadi Colson
Hi From time to time it takes quite some time to start tomcat. The logging is reporting thissnippet below. Any idea? Sorry I'm a beginner with Java. Dec 17, 2012 8:01:38 AM org.apache.coyote.AbstractProtocol init INFO: Initializing ProtocolHandler ["ajp-bio-8009"] Dec 17, 2012 8:01:38 AM org.a