AW: Help with Setup

2007-04-27 Thread Burkamp, Christian
Hi, You can use curl with a file if you put the "@" char in front of its name. (Otherwise curl expects the data on the command line). curl http://localhost:8080/solr/update --data-binary @articles.xml -Original Message- From: Sean Bowman [mailto:[EMAIL PROTECTED] Sent: Donner
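
For anyone scripting the same upload, here is a rough Python sketch of what `--data-binary @articles.xml` amounts to: read the file as raw bytes and use them as the POST body. The URL and file name come from the command above; the helper name `build_update_request` and the `Content-Type` header are assumptions made for illustration, not something stated in the thread.

```python
import urllib.request


def build_update_request(path, url="http://localhost:8080/solr/update"):
    """Build a POST request whose body is the raw bytes of the file at `path`."""
    # Binary mode keeps the payload byte-for-byte identical to the file,
    # which is the point of curl's --data-binary @file form.
    with open(path, "rb") as fh:
        body = fh.read()
    # Assumption: the update handler is happy with XML declared as UTF-8.
    headers = {"Content-Type": "text/xml; charset=utf-8"}
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


# Sending it needs a running Solr at that URL, so it is left commented out:
# with urllib.request.urlopen(build_update_request("articles.xml")) as resp:
#     print(resp.status, resp.read().decode("utf-8"))
```

Building the request separately from sending it makes it easy to inspect the body first, which helps when chasing an encoding problem in the file.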

Re: Help with Setup

2007-04-27 Thread Mike
I thought that too, I opened it up via vi and nothing was there. Usually if I have a PC encoding issue (I use EditPlus as a text editor) it will show up in vi. On Apr 26, 2007, at 5:19 PM, Ryan McKinley wrote: paladin:/data/solr mtorgler1$ curl http://localhost:8080/solr/update --data-bin

Re: Help with Setup

2007-04-27 Thread Mike
Wrapping is purely via email, the text xml is single line (on purpose, originally it was a 3 paragraph field that was html encoded, I simplified the text as a 'just in case') On Apr 26, 2007, at 5:09 PM, Cody Caughlan wrote: For the storyText "field" element, is that wrapping only in this em

Re: Question to php to do with multi index

2007-04-27 Thread Michael Kimsal
The curl_multi is probably the most effective way, using straight PHP. Another option would be to spawn several jobs, assuming unix/linux, and wait for them to get done. It doesn't give you very good error handling (well, none at all actually!) but would let you run multiple indexing jobs at once
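
The "run several jobs at once and wait for them" idea can be sketched as follows. This is Python rather than PHP, and it is not the poster's code; `run_jobs` and the job names are hypothetical. It shows one way to keep per-job error handling while still running the indexing jobs in parallel.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def run_jobs(jobs, max_workers=4):
    """Run callables in parallel.

    `jobs` maps a job name to a zero-argument callable.
    Returns (results, errors): two dicts keyed by job name.
    """
    results, errors = {}, {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(fn): name for name, fn in jobs.items()}
        for fut in as_completed(futures):
            name = futures[fut]
            try:
                results[name] = fut.result()
            except Exception as exc:
                # One failed job does not stop the others; record it instead.
                errors[name] = exc
    return results, errors
```

Each callable would wrap one indexing request; anything that raises ends up in `errors` under its job name, so failures are not silently lost.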

Re: case sensitivity

2007-04-27 Thread Yonik Seeley
On 4/26/07, Michael Kimsal <[EMAIL PROTECTED]> wrote: We're (and by 'we' I mean my esteemed colleague!) working on patching a few of these items to be in the solrconf.xml file and should likely have some patches submitted next week. It's being done on 'company time' and I'm not sure about the ex

Re: case sensitivity

2007-04-27 Thread Michael Kimsal
Can you point me to the process for submitting these small patches? I'm looking at the jira site but don't see much of anything there outlining a process for submitting patches. Sorry to be so basic about this, but I'm trying to follow correct procedures on both sides of the aisle, so to speak.

RE: unused fields

2007-04-27 Thread Will Johnson
Moved to: https://issues.apache.org/jira/browse/SOLR-217 patch included. - will -Original Message- From: Will Johnson [mailto:[EMAIL PROTECTED] Sent: Wednesday, April 25, 2007 2:54 PM To: solr-user@lucene.apache.org Subject: unused fields Is there any way to make Solr not throw an Exc

Re: Help with Setup

2007-04-27 Thread Sean Bowman
That's an awesome tip to keep in the ol' toolbox, Christian.

Re: case sensitivity

2007-04-27 Thread Ryan McKinley
Michael Kimsal wrote: Can you point me to the process for submitting these small patches? I'm looking at the jira site but don't see much of anything there outlining a process for submitting patches. Sorry to be so basic about this, but I'm trying to follow correct procedures on both sides of t

Facet Results Strange - Help

2007-04-27 Thread realw5
Hello, I'm running into some strange results for some facets of mine. Below you'll see the XML returned from solr. I did a query using the standard request handler. Notice the duplicated values returned (american standard, delta, etc). There are actually quite a few of them. At first I thought it ma

Re: resin faile to start with solr.

2007-04-27 Thread Bill Au
Have you tried using the schema.xml that is in example/solr/conf? If that works then the problem is definitely in your schema.xml. Bill On 4/26/07, James liu <[EMAIL PROTECTED]> wrote: but it is ok when i use tomcat. 2007/4/26, Ken Krugler <[EMAIL PROTECTED]>: > > >3.0.23 yesterday i try and

Re[8]: Things are not quite stable...

2007-04-27 Thread Jack L
Hello Bertrand, Is there a build script that automagically grabs files from jetty's source tree (local) and builds a solr release? In other words, I can try building with a newer version of jetty if it doesn't take too much work - I don't know much about jetty or solr at the code level. -- Best re

Re: Facet Results Strange - Help

2007-04-27 Thread Jennifer Seaman
Hopefully someone can chime in with some tips, thanks! It's likely you have the facet category added more than once for one or more docs. Like this; american standard american standard Are you adding the facet values on-the-fly? This happened to me and I solved it by removing the duplicat

Re: AW: Help with Setup

2007-04-27 Thread Mike
On Apr 27, 2007, at 4:24 AM, Burkamp, Christian wrote: curl http://localhost:8080/solr/update --data-binary @ I think the issue was with a bad file in /data/solr/conf. After a bunch of testing out of the installation directory, I was able to use the post.sh script to post correctly into t

Re: case sensitivity

2007-04-27 Thread Michael Kimsal
So I just create my own 'issue' first? OK. Thanks. On 4/27/07, Ryan McKinley <[EMAIL PROTECTED]> wrote: Michael Kimsal wrote: > Can you point me to the process for submitting these small patches? I'm > looking at the jira site but don't see much of anything there outlining a > process for su

Re: case sensitivity

2007-04-27 Thread Michael Kimsal
What's the procedure then for something to get included in the next release? Thanks again all! On 4/27/07, Michael Kimsal <[EMAIL PROTECTED]> wrote: So I just create my own 'issue' first? OK. Thanks. On 4/27/07, Ryan McKinley <[EMAIL PROTECTED]> wrote: > > Michael Kimsal wrote: > > Can you

Solr and memcached

2007-04-27 Thread Otis Gospodnetic
Hi, I'm considering adding support for caching results in memcached. Questions: 1. Has anyone already done this? (searched, didn't find anything) 2. Can anyone think of any reasons why this might not be a good idea? (I *just* started considering this) 3. I read http://wiki.apache.org/solr/SolrC

Re: case sensitivity

2007-04-27 Thread Otis Gospodnetic
Once the code/patch in the issue is put/committed to SVN, it means it will be in the next release. You get your patch committed faster if it's clear, well written and explained, if it comes with a unit test if it's a code change, and so on. Otis . . . . . . . . . . . . . . . . . . . . . . . .

Re: Re[8]: Things are not quite stable...

2007-04-27 Thread Bertrand Delacretaz
Hi Jack, On 4/27/07, Jack L <[EMAIL PROTECTED]> wrote: ...Is there a build script that automagically grabs files from jetty's source tree (local) and builds a solr release? In other words, I can try building with a newer version of jetty if it doesn't take too much work - I don't know much about j

Re: Requests per second/minute monitor?

2007-04-27 Thread Otis Gospodnetic
Would creating a new QueryRateFilter servlet filter be a good place to put this? This way it could stay out of the Solr core and could be turned on/off via web.xml. Otis - Original Message From: Chris Hostetter <[EMAIL PROTECTED]> To: "solr-user@lucene.apache.org" Sent: Thursday, Apr

EmbeddedSolr class from Wiki

2007-04-27 Thread Daniel Einspanjer
The example EmbeddedSolr class on the wiki makes use of getUpdateHandler which was added after 1.1 (so it seems to be available only on trunk). I'd really like to move to an embedded Solr sooner rather than later. My questions are: - Would it be easy/possible to work around the lack of getUp

Re: solr and https

2007-04-27 Thread Chris Hostetter
: Has anyone tried using solr over https? not personally, my Solr instances are usually protected behind a firewall (just like my databases) so i'm not typically worried about securing connections to them from my applications : Is there any reason why this might not be feasible? none that i can

Re: Additive scoring using Dismax...

2007-04-27 Thread Chris Hostetter
: AND does not control scoring, only matching. If you want dismax to : be purely additive, pass tie=1.0 to the handler. more specifically: the defaultOperator option in the schema.xml does not affect the dismax parser used on the q param at all (only the stock SolrQueryParser used for things like

Re: EmbeddedSolr class from Wiki

2007-04-27 Thread Ryan McKinley
Daniel Einspanjer wrote: The example EmbeddedSolr class on the wiki makes use of getUpdateHandler which was added after 1.1 (so it seems to be available only on trunk). I'd really like to move to an embedded Solr sooner rather than later. My questions are: - Would it be easy/possible to work

Re: EmbeddedSolr class from Wiki

2007-04-27 Thread Fuad Efendi
Additional questions regarding EmbeddedSolr (for using the Solr API directly without HTTP): - Can I use separate JVMs for same Directory object? One process will create/update/delete, and another search. - Can I use separate JEE contexts inside same JVM? Looks like "singleton" is a must, but sep

Re: Facet Results Strange - Help

2007-04-27 Thread Yonik Seeley
On 4/27/07, realw5 <[EMAIL PROTECTED]> wrote: Hello, I'm running into some strange results for some facets of mine. Below you'll see the XML returned from solr. I did a query using the standard request handler. Notice the duplicated values returned (american standard, delta, etc). There are actual

multiple solr instances using same index files?

2007-04-27 Thread Ryan McKinley
Is it possible / is it an ok idea to have multiple solr instances running on the same machine pointing to the same index files? Essentially, I have two distinct needs - in some cases i need a commit immediately after indexing one document, but most of the time it is fine to wait 10 mins for ch

Re: Solr and memcached

2007-04-27 Thread Yonik Seeley
If you store internal docids, then you need to add the specific reader (or index version?) as part of the key since the ids are transient. -Yonik On 4/27/07, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: Hi, I'm considering adding support for caching results in memcached. Questions: 1. Has any
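
The point about keying on the reader or index version can be sketched like this. It is a minimal illustration only: `cache_key` and the `"solr:"` prefix are made-up names, and how the index version is obtained is left open because the thread does not say.

```python
import hashlib


def cache_key(query_string, index_version):
    """Return a memcached-safe key that changes whenever the index version does."""
    raw = f"{index_version}:{query_string}".encode("utf-8")
    # Hashing keeps the key free of spaces and control characters and well
    # under memcached's 250-byte key limit, whatever the query looks like.
    return "solr:" + hashlib.sha1(raw).hexdigest()
```

Because the version is part of the key, entries written against an older index are simply never looked up again and age out of memcached on their own, so cached internal docids are not reused after the index changes.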

Re: solr and https

2007-04-27 Thread Bertrand Delacretaz
On 4/25/07, brian beard <[EMAIL PROTECTED]> wrote: ...Has anyone tried using solr over https? Is there any reason why this might not be feasible?... As with any servlet-based system, using apache httpd as an SSL front-end with mod_proxy will work, and it has other benefits besides not needin

Re: multiple solr instances using same index files?

2007-04-27 Thread Yonik Seeley
On 4/27/07, Ryan McKinley <[EMAIL PROTECTED]> wrote: Is it possible / is it an ok idea to have multiple solr instances running on the same machine pointing to the same index files? If only one at a time is used to update the index, then yes it is possible. Essentially, I have two distinct nee

Re: Solr and memcached

2007-04-27 Thread Ken Krugler
Hi Otis, I'm considering adding support for caching results in memcached. Questions: 1. Has anyone already done this? (searched, didn't find anything) Not exactly, but we do something similar to this for Nutch searches using ehcache (http://krugle.com/kse/projects/eFNJEmX). But we store th

Re: case sensitivity

2007-04-27 Thread Yonik Seeley
On 4/26/07, Erik Hatcher <[EMAIL PROTECTED]> wrote: I think we should open up as many of the switches as we can to QueryParser, allowing users to tinker with them if they want, setting the defaults to the most common reasonable settings we can agree upon. I think we should also try and handle w

Re: Requests per second/minute monitor?

2007-04-27 Thread Yonik Seeley
On 4/27/07, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: Would creating a new QueryRateFilter servlet filter be a good place to put this? This way it could stay out of the Solr core and could be turned on/off via web.xml. There's already gotta be some nice external tools that parse log files

Re: Requests per second/minute monitor?

2007-04-27 Thread Walter Underwood
This is for monitoring -- what happened in the last 30 seconds. Log file analysis doesn't really do that. I think the XML output in admin/stats.jsp may be enough for us. That gives the cumulative requests on each handler. Those are counted in StandardRequestHandler and DisMaxRequestHandler and are ava
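
Turning a cumulative request counter into a rate takes two samples and a subtraction. A minimal sketch, assuming the counter has already been read from the stats output (the parsing is not shown, and `requests_per_second` is a hypothetical helper):

```python
def requests_per_second(prev_count, prev_time, curr_count, curr_time):
    """Rate between two samples of a cumulative request counter.

    Times are in seconds (for example from time.time()).
    """
    elapsed = curr_time - prev_time
    if elapsed <= 0:
        raise ValueError("samples must be taken at increasing times")
    delta = curr_count - prev_count
    if delta < 0:
        # The counter went backwards, most likely a restart: count from zero.
        delta = curr_count
    return delta / elapsed
```

Polling every 30 seconds and feeding consecutive samples into this gives the "what happened in the last 30 seconds" figure without touching the log files.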

Re: Solr and memcached

2007-04-27 Thread Chris Hostetter
: 2. Can anyone think of any reasons why this might not be a good idea? (I : *just* started considering this) : 3. I read http://wiki.apache.org/solr/SolrCaching , but I think the : whole cache discarding and warming wouldn't be needed if what I store in : memcached is: StringRepresentationOfQuer

Re: Facet Results Strange - Help

2007-04-27 Thread realw5
I have a dynamic field setup for facets. It looks like this: I do this, because we add facets quite often, so having to modify the schema every time would be unfeasible. I'm currently reindexing from scratch, so I cannot try wt=python for a little bit longer. Once it's done indexing I'll give t

Re: EmbeddedSolr class from Wiki

2007-04-27 Thread Chris Hostetter
: - Can I use separate JVMs for same Directory object? One process will : create/update/delete, and another search. : - Can I use separate JEE contexts inside same JVM? : : Looks like "singleton" is a must, but separate "search" should be : possible... in theory it should work, Solr doesn't do an

Re: multiple solr instances using same index files?

2007-04-27 Thread Chris Hostetter
: > Essentially, I have two distinct needs - in some cases i need a commit : > immediately after indexing one document, but most of the time it is fine : > to wait 10 mins for changes if that has better performance. : : Sounds like a configuration issue... set autocommit to 10 minutes, but : explic

Re: Facet Results Strange - Help

2007-04-27 Thread Yonik Seeley
On 4/27/07, realw5 <[EMAIL PROTECTED]> wrote: I have a dynamic field setup for facets. It looks like this: I do this, because we add facets quite often, so having to modify the schema every time would be unfeasible. I'm currently reindexing from scratch, so I cannot try wt=python for a little b

Re: Requests per second/minute monitor?

2007-04-27 Thread Ryan McKinley
Walter Underwood wrote: This is for monitoring -- what happened in the last 30 seconds. Log file analysis doesn't really do that. I think the XML output in admin/stats.jsp may be enough for us. That gives the cumulative requests on each handler. Those are counted in StandardRequestHandler and DisMax

Re: just advice

2007-04-27 Thread Chris Hostetter
: i will use /usr/solrapp/conf/solr1_solrconfig.xml, solr2_solrconfig.xml, : solr3_solrconfig.xml...and so. : : when i test these instance, i just stay in /usr/solrapp.conf/,,,not like : now, : : i have to change : /usr/solrapp/solr1/conf,,,/usr/solrapp/solr2/conf,,,/usr/solrapp/solr3/conf, Hmmm.

Re: wrong path in snappuller

2007-04-27 Thread Chris Hostetter
: The "solr" on the rsync command line is just a label which is defined in : rsyncd.conf on the master. rsyncd.conf is created on the fly by the script : rsyncd-start: ... : This label is then mapped to the path defined in $data_dir. Ah... right, i forgot about that. : > Why does it nee

Re: Facet Results Strange - Help

2007-04-27 Thread realw5
Ok, I just finished indexing about 20k documents. I took a look and so far the problem has not appeared again. What I'm thinking caused it was I was not adding overwritePending & overwriteCommitted in the add process. Therefore over time as data was being cleaned up, it was just appending to the

Re: multiple solr instances using same index files?

2007-04-27 Thread Ryan McKinley
Chris Hostetter wrote: : > Essentially, I have two distinct needs - in some cases i need a commit : > immediately after indexing one document, but most of the time it is fine : > to wait 10 mins for changes if that has better performance. : : Sounds like a configuration issue... set autocommit to

Re: Facet Results Strange - Help

2007-04-27 Thread Yonik Seeley
On 4/27/07, realw5 <[EMAIL PROTECTED]> wrote: Ok, I just finished indexing about 20k documents. I took a look and so far the problem has not appeared again. What I'm thinking caused it was I was not adding overwritePending & overwriteCommitted in the add process. Therefore over time as data was

Re: Facet Results Strange - Help

2007-04-27 Thread Chris Hostetter
: It's likely you have the facet category added more than once for one : or more docs. Like this; : : american standard : american standard : : Are you adding the facet values on-the-fly? This happened to me and I : solved it by removing the duplicate facet fields. that's really odd ... i can't t

Re: case sensitivity

2007-04-27 Thread Yonik Seeley
On 4/26/07, Michael Kimsal <[EMAIL PROTECTED]> wrote: My colleague, after some digging, found in SolrQueryParser (around line 62) setLowercaseExpandedTerms(false); The default for Lucene is true. Was this intentional? Or an oversight? Way back before Solr was opensourced, and Chris

Vote For Jira Issues

2007-04-27 Thread Chris Hostetter
Hi everybody, I just wanted to point out that there has been some discussion going on on the solr-dev list about making a Solr 1.2 release in the near future. There is no ETA on when this will happen, at this point it's mainly a discussion of "what pending Jira patches should we ensure to include

Unicode characters

2007-04-27 Thread HUYLEBROECK Jeremy RD-ILAB-SSF
Hi, We experience some encoding probs with the unicode characters getting out of solr. Let me explain our flow: -fetch a web page -decode entities and unicode characters (such as &#149;) using the Neko library -get a unicode String in Java -Send it to SOLR through XML created by SAX, with the right

Re: Facet Results Strange - Help

2007-04-27 Thread Chris Hostetter
: writer, I notice a space at the end. I can fix this issue by trimming all my : values before sending them to solr :-) The built in Field Faceting works on the indexed values, so Solr can solve this for you if you use something like this for your facet field type...

Re: case sensitivity

2007-04-27 Thread Michael Pelz Sherman
In our experience, setting a LowercaseFilter in the query did not work; we had to call setLowercaseExpandedTerms(true) to get wildcard queries to be case-insensitive. Here's our analyzer definition from our solr schema:

Re: case sensitivity

2007-04-27 Thread Yonik Seeley
On 4/27/07, Michael Pelz Sherman <[EMAIL PROTECTED]> wrote: In our experience, setting a LowercaseFilter in the query did not work; we had to call setLowercaseExpandedTerms(true) to get wildcard queries to be case-insensitive. Correct, because in that case the QueryParser does not invoke anal

Re: Unicode characters

2007-04-27 Thread Yonik Seeley
On 4/27/07, HUYLEBROECK Jeremy RD-ILAB-SSF -In the query output from SOLR (XML message), the character is not encoded as an entity (not &#149;) but the character itself is used (character 149=95 hexadecimal). That's fine, as they are equivalent representations, and that character is directly represe

Re: Unicode characters

2007-04-27 Thread Chris Hostetter
: -fetch a web page : -decode entities and unicode characters (such as &#149;) using Neko : library : -get a unicode String in Java : -Send it to SOLR through XML created by SAX, with the right encoding : (UTF-8) specified everywhere (writer, header etc...) : -it apparently arrives clean on the SO

Re: AW: Leading wildcards

2007-04-27 Thread Chris Hostetter
: when we do a search on a nonexisting field, we get a SolrException: : undefined field : (this was for query "nonfield:test") : : but when we use wildcards in our query, we don't get the undefined field : exception, : so the query "nonfield:*test" works fine ... just zero results... : : is this n

Re: AW: Leading wildcards

2007-04-27 Thread Paul Fryer
PLEASE REMOVE ME FROM THIS MAILING LIST!!! Whoever manages this list, can you please remove me i have tried sending emails to the unsubscribe email, but i just keep getting more emails. This is really an issue for me... so your help would be great! Thanks, Paul From: Chris Hostetter <

Re: just advice

2007-04-27 Thread James liu
HMmm mkdir+ln-s => solve my problem thks hoss. 2007/4/28, Chris Hostetter <[EMAIL PROTECTED]>: : i will use /usr/solrapp/conf/solr1_solrconfig.xml, solr2_solrconfig.xml, : solr3_solrconfig.xml...and so. : : when i test these instance, i just stay in /usr/solrapp.conf/,,,not like : now, :

Re: Solr and memcached

2007-04-27 Thread James liu
i used to think about caching data with memcached. why did i think that? for example, i have 45 solr instances, and i have to merge their results into one array and sort by score or datetime. if i use rows=10, it means i will get max 45*10 results, but it only shows 10 results per page. what to do with the 440 resul
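
Merging the per-instance result lists and keeping only one page can be sketched as below. This is an illustration, not the poster's code: it assumes each instance returns its hits already sorted by score, highest first, as `(score, doc_id)` pairs, and `merge_top` is a hypothetical name.

```python
import heapq
from itertools import islice


def merge_top(shard_results, rows=10):
    """Merge per-instance hit lists and return the best `rows` hits overall.

    `shard_results` is a list of lists of (score, doc_id) tuples, each list
    already sorted by score in descending order.
    """
    merged = heapq.merge(*shard_results, key=lambda hit: hit[0], reverse=True)
    # heapq.merge is lazy, so only the first `rows` hits are ever pulled.
    return list(islice(merged, rows))
```

With 45 instances and rows=10 this looks at up to 450 hits but only materializes the 10 that end up on the page; the rest are never copied into a merged array.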

showing range facet example = by Range ( 1 to 1000 )

2007-04-27 Thread Jery Cook
im stuck: Have a facet, and field in a document called estimatedRepairs, it is declared in the schema.xml as I execute a query with the below parameters q=state%3Avirgina; &facet.query=estimatedRepairs:[*+TO+1000.0] &facet.query=estimatedRepairs:[1000.0+TO+*] &fac
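
One way to avoid hand-encoding a parameter list like the one above is to generate it. The sketch below builds one `facet.query` per range from a list of boundaries. The helper name `range_facet_params` is made up, and since the original parameter list is cut off, adding `facet=true` here is an assumption about what the rest of the request looks like.

```python
from urllib.parse import urlencode


def range_facet_params(q, field, boundaries):
    """Build a query string with one facet.query per range.

    For boundaries [b0, b1] the ranges are [* TO b0], [b0 TO b1], [b1 TO *].
    """
    edges = ["*"] + [str(b) for b in boundaries] + ["*"]
    params = [("q", q), ("facet", "true")]
    for low, high in zip(edges, edges[1:]):
        params.append(("facet.query", f"{field}:[{low} TO {high}]"))
    # urlencode takes care of escaping ':', '[', ']' and spaces.
    return urlencode(params)
```

Note that square-bracket ranges are inclusive at both ends, so a document whose value sits exactly on a boundary is counted in both neighbouring buckets.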

Re: multiple solr instances using same index files?

2007-04-27 Thread James liu
I think you should watch out for performance. when one instance does index, commit, optimize, it will cost a lot of CPU and RAM. i think it will disturb your searches (when they use the other solr instance) unless you know when they will search or the search frequency is low. 2007/4/28, Ryan McKinley <[EMAIL PROTECTED]>:

Re: resin faile to start with solr.

2007-04-27 Thread James liu
yes, i tried and failed. afternoon i will redownload solr and test . 2007/4/28, Bill Au <[EMAIL PROTECTED]>: Have you tried using the schema.xml that is in example/solr/conf? If that works then the problem is definitely in your schema.xml. Bill On 4/26/07, James liu <[EMAIL PROTECTED]> wrot

Re: Question to php to do with multi index

2007-04-27 Thread James liu
i think curl_multi is slow. thks, i will try. 2007/4/27, Michael Kimsal <[EMAIL PROTECTED]>: The curl_multi is probably the most effective way, using straight PHP. Another option would be to spawn several jobs, assuming unix/linux, and wait for them to get done. It doesn't give you very good

Re: resin faile to start with solr.

2007-04-27 Thread James liu
now i test the newest solr (nothing modified) i failed to start solr with resin 3.0 2007/4/28, James liu <[EMAIL PROTECTED]>: yes, i tried and failed. afternoon i will redownload solr and test . 2007/4/28, Bill Au <[EMAIL PROTECTED]>: > > Have you tried using the schema.xml that is in exampl

Re: Requests per second/minute monitor?

2007-04-27 Thread Otis Gospodnetic
I think the real-time-ness of this is the key. What's the current QPS? How many in-flight queries do we have? What is the average or mean response time? What's the response time for the 90th percentile? etc. Anyhow, not my current itch, just trying to point out what Wunder is after. Otis
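
Three of the numbers asked for here (QPS, mean response time, 90th percentile) can be computed from a rolling list of recent requests. A minimal sketch, assuming something upstream records a `(finish_time_seconds, response_time_ms)` pair per request; `window_stats` is a hypothetical helper, and in-flight queries are not covered.

```python
def window_stats(samples, now, window=30.0):
    """Summarize requests that finished within the last `window` seconds.

    `samples` is an iterable of (finish_time_seconds, response_time_ms).
    """
    recent = sorted(rt for t, rt in samples if now - window < t <= now)
    if not recent:
        return {"qps": 0.0, "mean_ms": None, "p90_ms": None}
    n = len(recent)
    # Nearest-rank 90th percentile: rank = ceil(0.9 * n), in integer math.
    rank = (9 * n + 9) // 10
    return {
        "qps": n / window,
        "mean_ms": sum(recent) / n,
        "p90_ms": recent[rank - 1],
    }
```

Nearest-rank is used for the percentile because it always returns a response time that was actually observed, which is easier to reason about on small windows than an interpolated value.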