Re: Solr Cell (ExtractingRequestHandler) and plain text files

2009-02-09 Thread Erik Hatcher
And yes, the file does have textual content :) And I tried both ext.resource.name and stream.contentType to no avail. Erik On Feb 9, 2009, at 10:17 PM, Erik Hatcher wrote: One other person has reported this to me off-list, and I just encountered it myself. ExtractingRequestHandler d

Solr Cell (ExtractingRequestHandler) and plain text files

2009-02-09 Thread Erik Hatcher
One other person has reported this to me off-list, and I just encountered it myself. ExtractingRequestHandler does not handle plain text files properly (no text is extracted). Here's an example: curl "http://localhost:8982/solr/update/extract?ext.ignore.und.fl=true&wt=ruby&stream.file=/User

Re: UpdateResponse status codes?

2009-02-09 Thread Jacob Singh
0 is actually a communication failure (can't connect at all). 200 is good Solr returns 400s when if bails. I always thought this was strange, because I thought 500 is an application error (what I would expect) and 400 is a general HTTP error. Best, J On Tue, Feb 10, 2009 at 7:22 AM, Koji Sekig

Re: UpdateResponse status codes?

2009-02-09 Thread Koji Sekiguchi
Mark, I'm not solrj user, but I think you don't need to check status code. Solr server always return 0 for status when success. If something goes wrong, Solr server returns HTTP 400/500 response, then you'll get an Exception. Koji Mark Ferguson wrote: Hello, I am wondering if the UpdateResp

Re: Moving from single core to multicore

2009-02-09 Thread Chris Hostetter
: Now all that is left is a more cosmetic change I would like to make: : I tried to place the solr.xml in the example dir to get rid of the : "-Dsolr.solr.home=multicore" for the start and changed the first entry : from "core0" to "solr" and moved the core1 dir from multicore directly : under the

Re: Vertical Partitioning advice

2009-02-09 Thread Mark Kranz
Just an update on my own research: I have discovered the 'ParallelReader' class (subclass of IndexReader) in lucene, which is designed for searching across multiple indexes. This appears to suit our needs - and I do not expect will be too difficult to integrate into Solr. -- View this message i

Re: lazily loading search components?

2009-02-09 Thread Chris Hostetter
: We have a standard solr install that we use across a lot of different uses. : In that install is a custom search component that loads a lot of data in its : inform() method. This means the data is initialized on solr boot. Only about : half of our installs actually ever call this search componen

Re: Fwd: Separate error logs

2009-02-09 Thread Chris Hostetter
: OK, so java.util.logging has no way of sending error messages to a separate : log without writing your own Handler/Filter code. : If we just skip over the absurdity of that, and the rage it makes me feel, FWIW: that's a slight mischaracterization of java.util.logging (JUL): the API & framework

Re: several snapshot ...

2009-02-09 Thread Chris Hostetter
: I would like to get how is a snapshot really. It's obviously a hard link to : the files. : But it just contain the last update ?? the nature of lucene indexes is that files are never modified -- only created, or deleted. this makes rsyncing very efficient when updates have been made to an i

Re: Severe errors in solr configuration

2009-02-09 Thread Chris Hostetter
: Subject: Severe errors in solr configuration It sounds like you solved your problem, but a few things to clarify for people who might find this thread later... : java.security.AccessControlException: access denied (java.io.FilePermission : /var/lib/tomcat6/solr/solr.xml read) at : java.secu

Re: 500 Errors on update

2009-02-09 Thread Chris Hostetter
: org.apache.lucene.store.LockObtainFailedException: Lock obtain timed out: : SingleInstanceLock: write.lock : at org.apache.lucene.store.Lock.obtain(Lock.java:85) : at org.apache.lucene.index.IndexWriter.init(IndexWriter.java:1140) are there any other ERROR messages in your log before th

Re: Exact match search problem

2009-02-09 Thread Chris Hostetter
: I have indexed my data as "custom123, customer, custom" for the : "UserName" field. I need to search the records for exact match, when I : am trying to search with UserName:"customer" I am finding the records : where UserName is custom123 and custom. : : As per my understanding solr splits t

Re: Dismax q.alt field for field level boosting

2009-02-09 Thread Chris Hostetter
: I am trying to test relevancy of results with the q.alt field on a Dismax : Request Handler. Term level boosting based on bq information in : solrconfig.xml works fine. However field level boosting based on the qf : information in solrconfig.xml doesn't seem to work. : : Query : q=&q.alt=for&ro

Re: solr booosting

2009-02-09 Thread Chris Hostetter
: As I understood lucene's boost, if you search for "John Le Carre" it will : give better score to the results that contains just the searched string that : results that have, for example, 50 words and the search is contained in the : words. : : In Solr, my goal is to give more score to the docs

Re: Rsyncd start and stop for multiple instances

2009-02-09 Thread Chris Hostetter
: How can I hack the existing script to support multiple rsync module you might want to just consult some rsyncd resources to answer this question, i believe adding a new "[modname]" block is how you add a module, with the path/comment keys listed underneight, however... 1) i don't believe it'

Re: User tag design for read-only index

2009-02-09 Thread Chris Hostetter
: The behavior I would like is identical to 'tagging' each document with the : list-id/user/order and then using standard faceting to show what lists : documents are in and what users have put the docs into a list. : : But - I would like the main index to be read only. The index needs to be : sh

Performance degradation caused by choice of range fields

2009-02-09 Thread wojtekpia
In my schema I have two copies of my numeric fields: one with the original value (used for display, sort), and one with a rounded version of the original value (used for range queries). When I use my rounded field for numeric range queries (e.g. q=RoundedValue:[100 TO 1000]), I see very consisten

Re: Performance "dead-zone" due to garbage collection

2009-02-09 Thread wojtekpia
I tried sorting using a function query instead of the Lucene sort and found no change in performance. I wonder if Lance's results are related to something specific to his deployment? -- View this message in context: http://www.nabble.com/Performance-%22dead-zone%22-due-to-garbage-collection-tp21

Re: Performance "dead-zone" due to garbage collection

2009-02-09 Thread wojtekpia
I've been able to reduce these GC outages by: 1) Optimizing my schema. This reduced my index size by more than 50% 2) Smaller cache sizes. I started with filterCache, documentCache & queryCache sizes of ~10,000. They're now at ~500 3) Reduce heap allocation. I started at 27 GB, now I'm 'only' all

UpdateResponse status codes?

2009-02-09 Thread Mark Ferguson
Hello, I am wondering if the UpdateResponse status codes are documented somewhere? I haven't been able to find them. I know 0 is success.. Thanks, Mark

Re: Improving the highlighter output for use in html

2009-02-09 Thread Jeffrey Baker
On Mon, Feb 9, 2009 at 2:59 PM, Jeffrey Baker wrote: > The default highlighter output is bogus if you're trying to use the > snippets in a web browser. With the default delimiters, the > temptation is to just stick the snippets in an innerHTML property, but > the problem is that other HTML speci

Improving the highlighter output for use in html

2009-02-09 Thread Jeffrey Baker
The default highlighter output is bogus if you're trying to use the snippets in a web browser. With the default delimiters, the temptation is to just stick the snippets in an innerHTML property, but the problem is that other HTML special characters (< > and &) are not escaped. For example, a hig

Re: search returns matches for non-starting wildcard prefix queries

2009-02-09 Thread Otis Gospodnetic
Rupert, Try using "string" field type instead of "text" and test it out with some unusual/rare last name patterns. For example, try it with last names that consist of more than one word and see if you are happy with those results. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nu

Re: exceeded limit of maxWarmingSearchers

2009-02-09 Thread Jon Drukman
Otis Gospodnetic wrote: I'd say: "Make sure you don't commit more frequently than the time it takes for your searcher to warm up", or else you risk searcher overlap and pile-up. cool. i found a place in our code where we were committing the same thing twice in very rapid succession. fingers

search returns matches for non-starting wildcard prefix queries

2009-02-09 Thread Rupert Fiasco
(I think I have a horrible subject line but I wasnt sure how to properly explain myself). I have a text field that I store last names in (and everything is lowercased prior to insertion, not sure if that matters). The field is described as:

Re: [ANN] Lucid Imagination

2009-02-09 Thread Renaud Delbru
Hi Mark, Mark Miller wrote: Hey Renaud - in the future, its probably best to direct Gaze questions (unless it directly relates to Solr) to supp...@lucidimagination.com . Right, I was not aware of this mailing list. Gaze is a tool thats stores RequestHandl

Re: Moving from single core to multicore

2009-02-09 Thread Michael Lackhoff
On 09.02.2009 17:01 Ryan McKinley wrote: > Check your solrconfig.xml you probably have somethign like this: > > >${solr.data.dir:./solr/data} > (from the example) > > either remove that or make each one point to the correct location Thanks, that's it! Now all that is left is a more co

Re: Moving from single core to multicore

2009-02-09 Thread Ryan McKinley
On Feb 9, 2009, at 10:40 AM, Michael Lackhoff wrote: On 09.02.2009 15:40 Ryan McKinley wrote: But I have some problems setting this up. As long as I try the multicore sample everything works but when I copy my schema.xml into the multicore/core0/conf dir I only get 404 error messages when I e

Re: Multi-valued dynamic fields in schema.xml

2009-02-09 Thread Bruno Aranda
Hi, I asked the same question a few days ago. Using multiValued dynamic fields is fine even if the documentation or examples do not say anything about it, Cheers, Bruno 2009/2/9 Ian Sugar > Hi > > I'd like to use multi-valued dynamic fields. > > Example: > > > >multiValued="true" />

Re: [ANN] Lucid Imagination

2009-02-09 Thread Mark Miller
Hey Renaud - in the future, its probably best to direct Gaze questions (unless it directly relates to Solr) to supp...@lucidimagination.com . Gaze is a tool thats stores RequestHandler statistics avgs (over small intervals) for long time ranges, and then le

Re: Combination of EmbeddedSolrServer and CommonHttpSolrServer

2009-02-09 Thread Ryan McKinley
Keep in mind that the way lucene/solr work is that the results are constant from when you open the searcher. If new documents are added (without re-opening the searcher) they will not be seen. tells solr to re-open the index and see the changes. 1. Does this mean that committing on t

Multi-valued dynamic fields in schema.xml

2009-02-09 Thread Ian Sugar
Hi I'd like to use multi-valued dynamic fields. Example: Seems to work fine so far, but it doesn't seem to be described in the wiki here http://wiki.apache.org/solr/SchemaXml Just wanted to check if it's a known and deliberate feature and if anyone knows of any issues with using it?

Re: Moving from single core to multicore

2009-02-09 Thread Michael Lackhoff
On 09.02.2009 15:40 Ryan McKinley wrote: >> But I have some problems setting this up. As long as I try the >> multicore >> sample everything works but when I copy my schema.xml into the >> multicore/core0/conf dir I only get 404 error messages when I enter >> the >> admin url. > > what is the

RE: Combination of EmbeddedSolrServer and CommonHttpSolrServer

2009-02-09 Thread Jana, Kumar Raja
Hi, I have a few queries regarding this: 1. Does this mean that committing on the indexing (Embedded) server does not reflect the document changes when we fire a search through another (HTTP) server? 2. What happens to the commit fired on the indexing server? Can I remove that and just commit on

Re: Combination of EmbeddedSolrServer and CommonHttpSolrServer

2009-02-09 Thread Ryan McKinley
yes. This works fine. But make sure only one SolrServer is writing to the index at a time. Also note that if you use the EmbeddedSolrServer to index and another one to read, you will need to call on the 'read only' server to refresh the index view (the work "commit" is a bit misleading)

Re: Separate error logs

2009-02-09 Thread Ryan McKinley
Is Solr 1.4 (and its nice SLF4J logging) in a state ready for intensive production usage? While it is not officially recommended, trunk is quite stable. Of course back up and make sure to test well before deploying anything real. ryan

Combination of EmbeddedSolrServer and CommonHttpSolrServer

2009-02-09 Thread Bapat, Mayur
Hi, Has anybody tried the combination of EmbeddedSolrServer only for indexing and CommonHttpSolrServer only for searching? So in my architecture with the EmbeddedSolrServer I want to use the advantage of direct API calls for indexing purpose and for searching I would rely on HTTP requests. I trie

Re: Moving from single core to multicore

2009-02-09 Thread Ryan McKinley
But I have some problems setting this up. As long as I try the multicore sample everything works but when I copy my schema.xml into the multicore/core0/conf dir I only get 404 error messages when I enter the admin url. what is the url you are hitting? Do you see links from the index page?

Moving from single core to multicore

2009-02-09 Thread Michael Lackhoff
Hello, I am not that experienced but managed to get a Solr index going by copying the "example" dir from the distribution (1.3 released version) and changing the fields in schema.xml to my needs. As I said everything is working very well so far. Now I need a second index on the same machine and th

Re: several snapshot ...

2009-02-09 Thread sunnyfr
Hi guys do you have any idea where come form this problem. Don't get what did I miss there ?? thanks, sunnyfr wrote: > > Hi > > I would like to get how is a snapshot really. It's obviously a hard link > to the files. > But it just contain the last update ?? > > My problem is ... Ive cron

a problem about solr synonyms

2009-02-09 Thread 李学健
hi, all how to search 'us' to get 'united states'? through synonyms filter ? --steven.li