Re: replication --> missing field data file

2010-01-11 Thread Shalin Shekhar Mangar
On Thu, Jan 7, 2010 at 9:34 PM, Giovanni Fernandez-Kincade < gfernandez-kinc...@capitaliq.com> wrote: > Right, but if you want to take periodic backups and ship them to tape or > some DR site, you need to be able to tell when the backup is actually > complete. > > It's seems very strange to me tha

Re: Adaptive search?

2010-01-11 Thread Shalin Shekhar Mangar
On Fri, Jan 8, 2010 at 3:41 AM, Otis Gospodnetic wrote: > > - Original Message > > > From: Shalin Shekhar Mangar > > To: solr-user@lucene.apache.org > > Sent: Wed, December 23, 2009 2:45:21 AM > > Subject: Re: Adaptive search? > > > > On Wed, Dec 23, 2009 at 4:09 AM, Lance Norskog wrote

Re: Understanding the query parser

2010-01-11 Thread rswart
I am running in to the same issue. I have tried to replace my WhitespaceTokenizerFactory with a PatternTokenizerFactory with pattern (\s+|-) but I still seem to get a phrase query. Why is that? Ahmet Arslan wrote: > > >> I am using Solr 1.3. >> I have an index with a field called "name". It

RE: Synonyms from Database

2010-01-11 Thread Peter A. Kirk
You could try to take the code for SynonymFilterFactory as a starting point, and adapt it to obtain the synonym configuration from another source than a text file. But I'm not sure what you mean by checking for synonyms at query time. As I understand it, Solr works like that anyway - depending

Re: Synonyms from Database

2010-01-11 Thread Ravi Gidwani
Thanks all for your replies. I guess what I meant by Query time, and as I understand solr (and I may be wrong here) I can add synonyms.txt in the query analyser as follows: By this my understanding is , even if the document (at index time) has a word "

Re: Adaptive search?

2010-01-11 Thread Ravi Gidwani
Shalin: Can you point me to pages/resources that talk about this approach in details ? OR can you provide more details on the schema and the function(?) used for ranking the documents. Thanks, ~Ravi. On Mon, Jan 11, 2010 at 1:00 AM, Shalin Shekhar Mangar < shalinman...@gmail.com> wrote

RE: Synonyms from Database

2010-01-11 Thread Peter A. Kirk
Hi - I don't think you'll see a "performance hit" using a DB for your synonym configuration as opposed to a text file. The configuration is only done once (at startup) - or when you "reload". You won't be reloading every minute, will you? After reading the configuration, the synonyms are avail

Re: Synonyms from Database

2010-01-11 Thread Erik Hatcher
On Jan 11, 2010, at 4:51 AM, Peter A. Kirk wrote: The reload feature actually sounds quite neat - it will reload "in the background", and "switch in" the newly read configuration when it's ready - so hopefully no down-time waiting for configuration. Correct me if I'm wrong, but I don't thin

Re: Synonyms from Database

2010-01-11 Thread Shalin Shekhar Mangar
On Mon, Jan 11, 2010 at 4:15 PM, Erik Hatcher wrote: > > On Jan 11, 2010, at 4:51 AM, Peter A. Kirk wrote: > >> The reload feature actually sounds quite neat - it will reload "in the >> background", and "switch in" the newly read configuration when it's ready - >> so hopefully no down-time waiting

Re: Understanding the query parser

2010-01-11 Thread Ahmet Arslan
> I am running in to the same issue. I have tried to replace > my > WhitespaceTokenizerFactory with a PatternTokenizerFactory > with pattern > (\s+|-) but I still seem to get a phrase query. Why is > that? It is in the source code of QueryParser's getFieldQuery(String field, String queryText) m

Re: No Analyzer, tokenizer or stemmer works at Solr

2010-01-11 Thread MitchK
Hello Hossman, sorry for my late response. For this specific case, you are right. It makes more sense to do such work "on the fly". However, I am only testing at the moment, what one can do with Solr and what not. Is the UpdateProcessor something that comes froms Lucene itself or from Solr? Th

Multi language support

2010-01-11 Thread Daniel Persson
Hi Solr users. I'm trying to set up a site with Solr search integrated. And I use the SolJava API to feed the index with search documents. At the moment I have only activated search on the English portion of the site. I'm interested in using as many features of solr as possible. Synonyms, Stopword

Re: No Analyzer, tokenizer or stemmer works at Solr

2010-01-11 Thread Erik Hatcher
On Jan 11, 2010, at 7:33 AM, MitchK wrote: Is the UpdateProcessor something that comes froms Lucene itself or from Solr? It's at the Solr level - Erik

Re: Synonyms from Database

2010-01-11 Thread Erik Hatcher
On Jan 11, 2010, at 5:50 AM, Shalin Shekhar Mangar wrote: On Mon, Jan 11, 2010 at 4:15 PM, Erik Hatcher wrote: On Jan 11, 2010, at 4:51 AM, Peter A. Kirk wrote: The reload feature actually sounds quite neat - it will reload "in the background", and "switch in" the newly read configurati

Re: Could not start SOLR issue

2010-01-11 Thread Grant Ingersoll
On Jan 11, 2010, at 1:38 AM, dipti khullar wrote: > Hi > > We are running master/slave Solr 1.3 version on production since about 5 > months. > > Yesterday, we faced following issue on one of the slaves for the first time > because of which we had to restart the slave. > > SEVERE: Could not st

Re: Could not start SOLR issue

2010-01-11 Thread dipti khullar
We were able to resolve the problem by restarting the slave. Also these failed snapshot install incidents occur after the exception was observed, which seems logically correct also. "Could not start SOLR. Check solr/home property" We just want to avoid such instances for future. Is it possible tha

update solr index

2010-01-11 Thread Marc Des Garets
Hi, I am running solr in tomcat and I have about 35 indexes (between 2 and 80 millions documents each). Currently if I try to update few documents from an index (let's say the one which contains 80 millions documents) while tomcat is running and therefore receiving requests, I am getting few very

Re: No Analyzer, tokenizer or stemmer works at Solr

2010-01-11 Thread MitchK
Is there any schemata that explains which class is responsible for which level of processing my data to the index? My example was: I have categorized, whether something is cheap or expensive. Let's say I didn't do that on the fly, but with the help of the UpdateRequestProcessor. Imagine there i

How to display Highlight with VelocityResponseWriter?

2010-01-11 Thread qiuyan . xu
Hi, we need a web gui for solr and we've noticed that VelocityResponseWriter is integrated in solr-proj for that purpose. But i have no idea how i can configure solrconfig.xml so that snippet with highlight can also be displayed in the web gui. I've added name="hl">true into the standard re

Re: Getting solr response data in a JS query

2010-01-11 Thread Gregg Hoshovsky
You might be running into an Ajax restriction. See if an article like this helps. http://www.nathanm.com/ajax-bypassing-xmlhttprequest-cross-domain-restriction/ On 1/9/10 11:37 PM, "Otis Gospodnetic" wrote: Dan, You didn't mention whether you tried &wt=json . Does it work if you use that

Re: Getting solr response data in a JS query

2010-01-11 Thread Matt Mitchell
I remember having a difficult time getting jquery to work as I thought it would. Something to do with the wt. I ended up creating a little client lib. Maybe this will be useful in finding your problem? example: http://github.com/mwmitchell/get_rest/blob/master/solr_example.html lib: http://git

Re: How to display Highlight with VelocityResponseWriter?

2010-01-11 Thread Sascha Szott
Qiuyan, with highlight can also be displayed in the web gui. I've added true into the standard responseHandler and it already works, i.e without velocity. But the same line doesn't take effect in itas. Should i configure anything else? Thanks in advance. First of all, just a few notes on the /it

Re: Multi language support

2010-01-11 Thread Markus Jelsma
Hello, We have implemented language specific search in Solr using language specific fields and field types. For instance, an en_text field type can use an English stemmer, and list of stopwords and synonyms. We, however did not use specific stopwords, instead we used one list shared by both langu

Replication problem

2010-01-11 Thread Jason Rutherglen
Hi, sorry for the somewhat inane question: I setup replication request handler on the master however I'm not seeing any replicatable indexes via http://localhost:8080/solr/main/replication?command=indexversion Queries such as *:* yield results on the master (so I assume the commit worked). The re

Re: Replication problem

2010-01-11 Thread Yonik Seeley
Did you try adding "startup" to the list of events to replicate after? -Yonik http://www.lucidimagination.com On Mon, Jan 11, 2010 at 12:25 PM, Jason Rutherglen wrote: > Hi, sorry for the somewhat inane question: > > I setup replication request handler on the master however I'm not > seeing any

Re: Replication problem

2010-01-11 Thread Jason Rutherglen
Yonik, I added startup to replicateAfter, however no dice... There's no errors the Tomcat log. The output of: http://localhost-master:8080/solr/main/replication?command=indexversion 0 0 0 0 The master replication UI: Local Index Index Version: 1263182366335, Generation: 3 Locati

help implementing a couple of business rules

2010-01-11 Thread Joe Calderon
hello *, im looking for help on writing queries to implement a few business rules. 1. given a set of fields how to return matches that match across them but not just one specific one, ex im using a dismax parser currently but i want to exclude any results that only match against a field called 'd

Re: help implementing a couple of business rules

2010-01-11 Thread Erik Hatcher
On Jan 11, 2010, at 12:56 PM, Joe Calderon wrote: 1. given a set of fields how to return matches that match across them but not just one specific one, ex im using a dismax parser currently but i want to exclude any results that only match against a field called 'description2' One way could be

Re: help implementing a couple of business rules

2010-01-11 Thread Joe Calderon
thx, but im not sure that covers all edge cases, to clarify 1. matching description2 is okay if other fields are matched too, but results matching only to description2 should be omitted 2. its okay to not match against the people field, but matches against the people field should only be phrase ma

Re: Understanding the query parser

2010-01-11 Thread Avlesh Singh
> > It is in the source code of QueryParser's getFieldQuery(String field, > String queryText) method line#660. If numTokens > 1 it returns Phrase > Query. > That's exactly the question. Would be nice to hear from someone as to why is it that way? Cheers Avlesh On Mon, Jan 11, 2010 at 5:10 PM, Ah

Cores + Replication Config

2010-01-11 Thread Giovanni Fernandez-Kincade
If you want to share one config amidst master & slaves, using Solr 1.4 replication, is there a way to specific whether a core is Master or Slave when using the CREATE Core command? Thanks, Gio.

Re: XmlUpdateRequestHandler with HTMLStripCharFilterFactory

2010-01-11 Thread darniz
Thanks we were having the saem issue. We are trying to store article content and we are strong a field like This article is for blah . Wheni see the analysis.jsp page it does strip out the tags and is indexed. but when we fetch the document it returns the field with the tags. >From solr point of

Re: Tokenizer question

2010-01-11 Thread Grant Ingersoll
What do your FieldTypes look like for the fields in question? On Jan 10, 2010, at 10:05 AM, rswart wrote: > > Hi, > > This is probably an easy question. > > I am doing a simple query on postcode and house number. If the housenumber > contains a minus sign like: > > q=PostCode:(1078 pw)+AND+H

Re: Tokenizer question

2010-01-11 Thread Grant Ingersoll
And also, what query parser are you using? On Jan 11, 2010, at 2:46 PM, Grant Ingersoll wrote: > What do your FieldTypes look like for the fields in question? > > On Jan 10, 2010, at 10:05 AM, rswart wrote: > >> >> Hi, >> >> This is probably an easy question. >> >> I am doing a simple query

Re: XmlUpdateRequestHandler with HTMLStripCharFilterFactory

2010-01-11 Thread Erick Erickson
This page: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters shows you many of the SOLR analyzers and filters. Would one of the various *HTMLStrip* stuff work? HTH ERick On Mon, Jan 11, 2010 at 2:44 PM, darniz wrote: > > Tha

Re: XmlUpdateRequestHandler with HTMLStripCharFilterFactory

2010-01-11 Thread darniz
Well thats the whole discussion we are talking about. I had the impression that the html tags are filtered and then the field is stored without tags. But looks like the html tags are removed and terms are indexed purely for indexing, and the actual text is stored in raw format. Lets say for examp

Re: Adaptive search?

2010-01-11 Thread Chris Hostetter
: I was talking about boosting documents using past popularity. So a user : searches for X and gets 10 results. This view is recorded for each of the 10 : documents and added to the index later. If a user clicks on result #2, the : click is recorded for doc #2 and added to index. We boost using cl

Re: Getting solr response data in a JS query

2010-01-11 Thread James McKinney
AJAX Solr does more or less the following: jQuery.getJSON('http://localhost:8983/solr/select/?q=*:*&wt=json&json.wrf=?', {}, function (data) { // do something with data, which is the eval'd JSON response }); -- View this message in context: http://old.nabble.com/Getting-solr-response-data-in-a-

Re: XmlUpdateRequestHandler with HTMLStripCharFilterFactory

2010-01-11 Thread Chris Hostetter
: stored without tags. But looks like the html tags are removed and terms are : indexed purely for indexing, and the actual text is stored in raw format. Correct. Analysis is all about "indexing" it has nothing to do with "stored" content. You can write UpdateProcessors that modify the content

Re: Tokenizer question

2010-01-11 Thread rswart
We are using the standard query parser (so no dismax). Fieldtype is solr.TextField with the following query analyzer:

Re: XmlUpdateRequestHandler with HTMLStripCharFilterFactory

2010-01-11 Thread Erick Erickson
Ah, I read your post too fast and ignored the title. Sorry 'bout that. Erick On Mon, Jan 11, 2010 at 2:55 PM, darniz wrote: > > Well thats the whole discussion we are talking about. > I had the impression that the html tags are filtered and then the field is > stored without tags. But looks lik

Re: Search query log using solr

2010-01-11 Thread Chris Hostetter
: application. I am planning to add a search query log that will capture all : the search queries (and more information like IP,user info,date time,etc). : I understand I can easily do this on the application side capturing all the : search request, logging them in a DB/File before sending them to

Re: Understanding the query parser

2010-01-11 Thread Erik Hatcher
On Jan 11, 2010, at 1:33 PM, Avlesh Singh wrote: It is in the source code of QueryParser's getFieldQuery(String field, String queryText) method line#660. If numTokens > 1 it returns Phrase Query. That's exactly the question. Would be nice to hear from someone as to why is it that way?

Commons Lang

2010-01-11 Thread Jeff Newburn
We have a solr plugin that would be much easier to write if commons-lang was available. Why does solr not have this library? Is there any drawbacks to pulling in the commons lang for StringUtils? -- Jeff Newburn Software Engineer, Zappos.com

Re: XmlUpdateRequestHandler with HTMLStripCharFilterFactory

2010-01-11 Thread darniz
no problem Erick Erickson wrote: > > Ah, I read your post too fast and ignored the title. Sorry 'bout that. > > Erick > > On Mon, Jan 11, 2010 at 2:55 PM, darniz wrote: > >> >> Well thats the whole discussion we are talking about. >> I had the impression that the html tags are filtered and t

Re: Multi language support

2010-01-11 Thread Don Werve
This is the way I've implemented multilingual search as well. 2010/1/11 Markus Jelsma > Hello, > > > We have implemented language specific search in Solr using language > specific fields and field types. For instance, an en_text field type can > use an English stemmer, and list of stopwords and

Encountering a roadblock with my Solr schema design...use dedupe?

2010-01-11 Thread Kelly Taylor
I am in the process of building a Solr search solution for my application and have run into a roadblock with the schema design. Trying to match criteria in one multi-valued field with corresponding criteria in another multi-valued field. Any advice would be greatly appreciated. BACKGROUND: My R

Re: Encountering a roadblock with my Solr schema design...use dedupe?

2010-01-11 Thread Markus Jelsma
Hello Kelly, I am not entirely sure if i understand your problem correctly. But i believe your first approach is the right one. Your question: "Which products are available that contain skus with color Green, size M, and a price of $9.99 or less?" can be easily answered using a schema like yours

EOF IOException Query

2010-01-11 Thread Osborn Chan
Hi all, I got following exception for SOLR, but the index is still searchable. (At least it is searchable for query "*:*".) I am just wondering what is the root cause. Thanks, Osborn INFO: [publicGalleryPostMaster] webapp=/multicore path=/select params={wt=javabin&rows=12&start=0&sort=/gallery

Re: Encountering a roadblock with my Solr schema design...use dedupe?

2010-01-11 Thread Kelly Taylor
Hi Markus, Thanks for your reply. Using the current schema and query like you suggest, how can I identify the unique combination of options and price for a given SKU? I don't want the user to arrive at a product which doesn't completely satisfy their search request. For example, with the "col

Re: Encountering a roadblock with my Solr schema design...use dedupe?

2010-01-11 Thread Markus Jelsma
Hello Kelly, Simple boolean algebra, you tell Solr you want color = green AND size = M so it will only return green t-shirts in size M. If you, however, turn the AND in a OR it will return all t-shirts that are green OR in size M, thus you can then get M sized shirts in the blue color or green sh

Re: Encountering a roadblock with my Solr schema design...use dedupe?

2010-01-11 Thread Kelly Taylor
Hi Markus, Thanks again. I wish this were simple boolean algebra. This is something I have already tried. So either I am missing the boat completely, or have failed to communicate it clearly. I didn't want to confuse the issue further but maybe the following excerpts will help... Excerpt from "

Re: Commons Lang

2010-01-11 Thread Erik Hatcher
There's no point in moving it to Solr core unless something in core depends on it. The VelocityResponseWriter depends on commons-lang, though, and I am aiming to integrate that into core at some point. But, you can put commons-lang in your /lib and your plugin will be able to see it fine.

Re: Tokenizer question

2010-01-11 Thread Chris Hostetter
: q=PostCode:(1078 pw)+AND+HouseNumber:(39-43) : : the resulting parsed query contains a phrase query: : : +(PostCode:1078 PostCode:pw) +PhraseQuery(HouseNumber:"39 43") This stems from some fairly fundemental behavior i nthe QueryParser ... each "chunk" of input that isn't deemed "markup (ie:

Re: Tokenizer question

2010-01-11 Thread Avlesh Singh
> > If the analyzer produces multiple Tokens, but they all have the same > position then the QueryParser produces a BooleanQuery will all SHOULD > clauses. -- This is what allows simple synonyms to work. > You rock Hoss!!! This is exactly the explanation I was looking for .. it is as simple as it

Re: Understanding the query parser

2010-01-11 Thread Avlesh Singh
Thanks Erik for responding. Hoss explained the behavior with nice corollaries here - http://www.lucidimagination.com/search/document/8bc351d408f24cf6/tokenizer_question Cheers Avlesh On Tue, Jan 12, 2010 at 2:21 AM, Erik Hatcher wrote: > > On Jan 11, 2010, at 1:33 PM, Avlesh Singh wrote: > > >>>

Solr 1.4 Field collapsing - What are the steps for applying the SOLR-236 patch?

2010-01-11 Thread Kelly Taylor
Hi, Is there a step-by-step for applying the patch for SOLR-236 to enable field collapsing in Solr 1.4? Thanks, Kelly -- View this message in context: http://old.nabble.com/Solr-1.4-Field-collapsing---What-are-the-steps-for-applying-the-SOLR-236-patch--tp27122621p27122621.html Sent from the So

Re: Solr 1.4 Field collapsing - What are the steps for applying the SOLR-236 patch?

2010-01-11 Thread Joe Calderon
it seems to be in flux right now as the solr developers slowly make improvements and ingest the various pieces into the solr trunk, i think your best bet might be to use the 12/24 patch and fix any errors where it doesnt apply cleanly im using solr trunk r892336 with the 12/24 patch --joe On

Seattle Hadoop / HBase / Lucene / NoSQL meetup Jan 27th!

2010-01-11 Thread Bradford Stephens
Greetings, A friendly reminder that the Seattle Hadoop, NoSQL, etc. meetup is on January 27th at University of Washington in the Allen Computer Science Building, room 303. I believe Razorfish will be giving a talk on how they use Hadoop. Here's the new, shiny meetup.com link with more detail: ht

Re: Tokenizer question

2010-01-11 Thread rswart
Cristal clear. Thanks for your response&time! -- View this message in context: http://old.nabble.com/Tokenizer-question-tp27099119p27123281.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: update solr index

2010-01-11 Thread Shalin Shekhar Mangar
On Mon, Jan 11, 2010 at 7:42 PM, Marc Des Garets wrote: > > I am running solr in tomcat and I have about 35 indexes (between 2 and > 80 millions documents each). Currently if I try to update few documents > from an index (let's say the one which contains 80 millions documents) > while tomcat is ru

What is this error means?

2010-01-11 Thread Ellery Leung
When I am building the index for around 2 ~ 25000 records, sometimes I came across with this error: Uncaught exception "Exception" with message '0' Status: Communication Error I search Google & Yahoo but no answer. I am now committing document to solr on every 10 records fetched fro