SOLR 1.4: how to configure the improved chinese analyzer?

2009-12-09 Thread Fer-Bj
Hello, is there any existing FAQ or HowTo on how to setup the improved (and new?) chinese analyzer on Solr 1.4? I'd appreciate any help you may provide on this. Thanks, -- View this message in context: http://old.nabble.com/SOLR-1.4%3A-how-to-configure-the-improved-chinese-analyzer--tp267067

Selection of returned fields - dynamic fields?

2009-12-09 Thread Ian Smith
Hi Guys, We need to eliminate one of our stored fields from the Solr response to reduce traffic as it is very bulky and not used externally. I have been experimenting both with "fl=FIELDNAME" and addField(FIELDNAME) from SolrJ and have found it is possible to achieve this effect for fixed fields

Re: indexing XML with solr example webapp - out of java heap space

2009-12-09 Thread Noble Paul നോബിള്‍ नोब्ळ्
the post.jar does not stream. use "curl" if you are using *nix. --Noble On Wed, Dec 9, 2009 at 12:28 AM, Feroze Daud wrote: > Hi! > > > > I downloaded SOLR and am trying to index an XML file. This XML file is > huge (500M). > > > > When I try to index it using the "post.jar" tool in example\examp

DIH solrconfig

2009-12-09 Thread Lee Smith
Hi All There seems to be massive difference between the solrconfig in the DIH example to the one in the normal example ? Would I be correct in saying if I was to add the dataimport request handler in the solrconfig.xml thats all I will need ? ie: class="org.apache.solr.handler.dataimp

Re: Solr Cell and Spellchecking.

2009-12-09 Thread Grant Ingersoll
What's your schema and your config look like for the various relevant pieces? On Dec 8, 2009, at 8:04 PM, Michael Boyle wrote: > Following Eric Hatcher's post about using SolrCell and acts_as_solr { > http://www.lucidimagination.com/blog/2009/02/17/acts_as_solr_cell/ }, I have > been able to in

Re: DIH solrconfig

2009-12-09 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Wed, Dec 9, 2009 at 3:34 PM, Lee Smith wrote: > Hi All > > There seems to be massive difference between the solrconfig in the DIH > example to the one in the normal example ? > > Would I be correct in saying if I was to add the dataimport request handler > in the solrconfig.xml thats all I will

Hi. What Configuration we require?

2009-12-09 Thread kalidoss
Hi To run a Solr-1.3.0 with Data/index directory size of 11GB, 80 lakhs documents and 11 lakhs read request and 30 thousand writes. Every month 200mb of index directory size getting increase. Please suggest me. What type of configuration(CPU, Ram, hard disk) server require to make the s

Re: Re: Solr Cell and Spellchecking.

2009-12-09 Thread boyleme
I just resolved the issue (fresh coffee == good) ! In my schema, I had added: but missed the copyField definition. Adding these: and a restart and everything is working properly. Thanks for the reply and for LucidImagination -- the only reason I have been able to get Solr integrated int

RE: Facet query with special characters

2009-12-09 Thread Peter 4U
Hi, Thanks for your help and answers. I believe I have isolated the issue, and yes, it was 'schema/write'-related. Basically, the issue was this: All indexing is performed via solrj objects (to an EmbeddedSolrServer instance), and this was ported over from 'raw' Lucene java indexing cod

Re: SOLR 1.4: how to configure the improved chinese analyzer?

2009-12-09 Thread Robert Muir
hello, in order to use smart chinese analyzer with Solr 1.4 (it is not yet included), you need to go get the lucene-smartcn.jar file from lucene-2.9.1.zip and put this jar file in your solr lib directory then you can define a field type similar to the the greek example in schema.xml: except

Logging

2009-12-09 Thread Lee Smith
Im trying to import data with DIH (mysql) All my SQL's are good having been tested manually. When I run full import ie: http://localhost:8983/solr/dataimport?command=full-import I get my XML result but nothing is being imported and it Rolles back. In loggin I set DIH logging to fine and set

Facet across second level of hierarchy - Dismax Request Handler - Best practice?

2009-12-09 Thread Daniel Knapp
Hello, i want a second level of hierarchy in my facets (as seen here: http://www.lucidimagination.com/search/?q=) My RequestHandler is the following: dismax *:* 10 courseid^1.0 module^1.0 vorname^1.0 nachname^1.0 email^1.0 postauthor^1.0 post

how to use boost factor

2009-12-09 Thread Doddamani, Prakash
Hi, While searching (querying) the solr, how can we achieved following scenario. Search priority should be in the following order: 1. Genre 2. nowplaying 3. Stationname 4. Keywords Say I am searching for "rock" it should search in "genre" field first and then nowplayin

atypical MLT use-case

2009-12-09 Thread Mike Anderson
This is somewhat of an odd use-case for MLT. Basically I'm using it for near-duplicate detection (I'm not using the built in dup detection for a variety of reasons). While this might sound like an okay idea, the problem lies in the order of which things happen. Ideally, duplicate detection would pr

Re: how to use boost factor

2009-12-09 Thread Erick Erickson
I don't quite understand what you mean by priority. Are you clear about the difference between boosting and sorting? If you're sure you want to boost, have you seen: http://wiki.apache.org/solr/DisMaxRequestHandler#bq_.28Boost_Query.29

Concurrent access to EmbeddedSolrServer

2009-12-09 Thread Jon Poulton
Hi there, I'm about to start implementing some code which will access a Solr instance via a ThreadPool concurrently. I've been looking at the solrj API docs ( particularly http://lucene.apache.org/solr/api/index.html?org/apache/solr/client/solrj/embedded/EmbeddedSolrServer.html ) and I just wa

Re: atypical MLT use-case

2009-12-09 Thread Andre Parodi
the solr 1.4 book says you can do this. usages of mlt: "As a request handler with an external input document: What if you want similarity results based on something that isn't in the index? A final option that Solr supports is returning MLT results based on text data sent to the MLT handler (

SolrQuerySyntax : Types of Range Queries in Solr 1.4

2009-12-09 Thread Israel Ekpo
Hi Guys, In Lucene 2.9 and Solr 1.4, it is possible to perform inclusive and exclusive range searches with square and curly brackets respectively. However, when I looked at the SolrQuerySyntax, only the the include range search is illustrated. It seems like the examples only talk about the inclu

copyField question

2009-12-09 Thread P Franks
All, Can one use the copyField option and copy a TextField field into a longField field? I have some data that i want to extract (filter) out all but the long and/or integer values. Example data:xxx yyy aaa 504 yyy 444234 eee hh I have the copyField in place and the destination fi

Re: SolrQuerySyntax : Types of Range Queries in Solr 1.4

2009-12-09 Thread Yonik Seeley
Solr standard query syntax is an extension of Lucene query syntax, and we reference that on the page: http://lucene.apache.org/java/2_4_0/queryparsersyntax.html -Yonik http://www.lucidimagination.com On Wed, Dec 9, 2009 at 1:08 PM, Israel Ekpo wrote: > Hi Guys, > > In Lucene 2.9 and Solr 1.4, it

Re: Exception encountered during replication on slave....Any clues?

2009-12-09 Thread Noble Paul നോബിള്‍ नोब्ळ्
try the url http://localhost:8080/postingsmaster/replication?command=indexversion using ur browser On Tue, Dec 8, 2009 at 9:56 PM, William Pierce wrote: > Hi, Noble: > > When I hit the masterUrl from the slave box at > > http://localhost:8080/postingsmaster/replication > > I get the following xml

Re: SolrQuerySyntax : Types of Range Queries in Solr 1.4

2009-12-09 Thread Israel Ekpo
On Wed, Dec 9, 2009 at 1:13 PM, Yonik Seeley wrote: > Solr standard query syntax is an extension of Lucene query syntax, and > we reference that on the page: > http://lucene.apache.org/java/2_4_0/queryparsersyntax.html > > -Yonik > http://www.lucidimagination.com > > On Wed, Dec 9, 2009 at 1:08 PM

Re: bool default - if missing when "updating" uses current or default value?

2009-12-09 Thread Chris Hostetter
: But if I update an existing document, and I don't pass in a value for the : booleen field, will Solr keep the existing booleen value unchanged, or will : it update the booleen value again using the default? - ie. true. Updating a document is very simplistic -- it removes the old one (by id) an

Re: Selection of returned fields - dynamic fields?

2009-12-09 Thread Chris Hostetter
: Unfortunately this does not seem to work for dynamic fields - you can definiltely ask for a field that exists because of a dynamicField by name, but you can't use wildcard style patterns in the "fl" param. : "fl=PREFIX*" does not return anything, and neither does "fl=*POSTFIX". : What seems t

Re: Can we build complex filter queries in SOLR

2009-12-09 Thread darniz
Can you please let me know how to describe that condition. For example lets say i want to give the following condition ((category:audio or category:video) AND (brand:sony OR brand:samsung OR brand:sanyo)) How would you represent this condition in fq paramenter of dismax condition goes here is

content stream/MLT

2009-12-09 Thread Mike Anderson
I'm trying to understand how content stream works with respect to MLT. I did a regular MLT query using a document ID and specifying two fields to do MLT on and got back a set of results. I then copied the xml for the document with the aforementioned ID and pasted it to a text file. Then I made the

Re: content stream/MLT

2009-12-09 Thread Erik Hatcher
The MoreLikeThis content stream support is implemented such that the content stream is simply text, analyzed as if it was in the mlt.fl. It doesn't handle Solr XML as you'd expect - simply treats it as a string and analyzes it to get the terms out. Erik On Dec 9, 2009, at 10:21 PM

Re: Stopping & Starting

2009-12-09 Thread Lance Norskog
This would be a handy addition to solr-contrib. The further evolution we had is that sometimes java freezes the the 'stop' command does not work. It is better to use the 'stop' command than kill the process, so we added a sleep command that gave it maybe 30 seconds to shut down and then hit it wit

does fq parameter effects boosting

2009-12-09 Thread darniz
Hello can somone please answer this. someone told me that using fq parameter in the dismax handler might cuase some relevancy and weighting issues. I haven't read this anywhere. Please let me know if this is the case. Thanks darniz -- View this message in context: http://old.nabble.com/does-fq

Re: does fq parameter effects boosting

2009-12-09 Thread Yonik Seeley
fq's are filters and have no effect on the relevancy scores generated for documents. They only affect which documents are matched. -Yonik http://www.lucidimagination.com On Wed, Dec 9, 2009 at 5:00 PM, darniz wrote: > > Hello > can somone please answer this. > someone told me that using fq par

RE: SolrPlugin Guidance

2009-12-09 Thread Vauthrin, Laurent
Thanks for the response. I went ahead and gave it a shot. In my case, the directory name may not be unique so if I get multiple ids back then I create a BooleanQuery (Occur.SHOULD) to substitute the directory name query. This seems to work at the moment so hopefully that's the right approach.

Re: does fq parameter effects boosting

2009-12-09 Thread darniz
Thanks Yonik The question i was asking was that since filter queries are cached, if i change the relevancy model the cached queries wont be returned. correct me if i am wrong. Yonik Seeley-2 wrote: > > fq's are filters and have no effect on the relevancy scores generated > for documents. The

full-text indexing XML files

2009-12-09 Thread Feroze Daud
Hi! I am trying to full text index an XML file. For various reasons, I cannot use Tika or other technology to parse the XML file. The requirement is to full-text index the XML file, including Tags and everything. So, I created a input index spec like this: 1001 NASA Advanced Researc

Re: does fq parameter effects boosting

2009-12-09 Thread Yonik Seeley
On Wed, Dec 9, 2009 at 6:37 PM, darniz wrote: > The question i was asking was that since filter queries are cached, if i > change the relevancy model the cached queries wont be returned. Not sure I understand the question... is there something that you think that Solr won't handle properly? Or is

Re: Hi. What Configuration we require?

2009-12-09 Thread kalidoss
Hi. To run a Solr-1.3.0 with Data/index directory size of 11GB, 80 lakhs documents and 11 lakhs read request and 30 thousand writes. Every month 200mb of index directory size getting increase. Please suggest me. What type of configuration(CPU, Ram, hard disk) server require to make the s

UI for solr core admin?

2009-12-09 Thread Jason Rutherglen
I assume there isn't one? Anything in the works?

Re: UI for solr core admin?

2009-12-09 Thread Shalin Shekhar Mangar
On Thu, Dec 10, 2009 at 11:52 AM, Jason Rutherglen < jason.rutherg...@gmail.com> wrote: > I assume there isn't one? Anything in the works? > Nope. -- Regards, Shalin Shekhar Mangar.

Re: UI for solr core admin?

2009-12-09 Thread Mattmann, Chris A (388J)
Hi Jason, Patches welcome, though! :) Cheers, Chris On 12/9/09 10:31 PM, "Shalin Shekhar Mangar" wrote: > On Thu, Dec 10, 2009 at 11:52 AM, Jason Rutherglen < > jason.rutherg...@gmail.com> wrote: > >> I assume there isn't one? Anything in the works? >> > > Nope. > > -- > Regards, > Shal

Re: Hi. What Configuration we require?

2009-12-09 Thread Shalin Shekhar Mangar
On Wed, Dec 9, 2009 at 5:36 PM, kalidoss < kalidoss.muthuramalin...@sifycorp.com> wrote: > Hi > To run a Solr-1.3.0 with Data/index directory size of 11GB, 80 lakhs > documents and 11 lakhs read request and 30 thousand writes. Every month > 200mb of index directory size getting increase. > > 11

Re: full-text indexing XML files

2009-12-09 Thread Shalin Shekhar Mangar
On Thu, Dec 10, 2009 at 5:13 AM, Feroze Daud wrote: > Hi! > > > > I am trying to full text index an XML file. For various reasons, I > cannot use Tika or other technology to parse the XML file. The > requirement is to full-text index the XML file, including Tags and > everything. > > > > So, I cr

Re: UI for solr core admin?

2009-12-09 Thread Erik Hatcher
Just a note about the hidden gem that I haven't taken as far as I'd like... With the VelocityResponseWriter, it's as easy as creating a Velocity template (and wiring in VwR in solrconfig, which I'll set up by default in 1.5). It will even default to the template named after the handler n

Re: Hi. What Configuration we require?

2009-12-09 Thread kalidoss
Thanks Shalin Shekhar. 11 lakh read requests and 30 thousand write requests within how much time? Per day average of 11 lakh read requests and 30 thousand write requests. The system configuration is 4GB RAM and 4 core x 2 CPUs. are you suggesting us to increase the configuration? -Kalidoss.

Re: UI for solr core admin?

2009-12-09 Thread Mattmann, Chris A (388J)
Nice, Erik! Cheers, Chris On 12/9/09 10:39 PM, "Erik Hatcher" wrote: > Just a note about the hidden gem that I haven't taken as far as I'd > like... > > With the VelocityResponseWriter, it's as easy as creating a Velocity > template (and wiring in VwR in solrconfig, which I'll set up by > de

Re: Hi. What Configuration we require?

2009-12-09 Thread Shalin Shekhar Mangar
On Thu, Dec 10, 2009 at 12:18 PM, kalidoss < kalidoss.muthuramalin...@sifycorp.com> wrote: > Thanks Shalin Shekhar. > > > 11 lakh read requests and 30 thousand write requests within how much time? > Per day average of 11 lakh read requests and 30 thousand write requests. > > The system configura

Re: Can we build complex filter queries in SOLR

2009-12-09 Thread Shalin Shekhar Mangar
On Thu, Dec 10, 2009 at 2:50 AM, darniz wrote: > > Can you please let me know how to describe that condition. > > > For example lets say i want to give the following condition > > ((category:audio or category:video) AND (brand:sony OR brand:samsung OR > brand:sanyo)) > How would you represent thi

Re: UI for solr core admin?

2009-12-09 Thread Erik Hatcher
After I sent that, though, I realized that the core admin is special in that it isn't within the context of a single core. I'll have to research this and see, but I suspect there may be an issue with using VwR for this particular handler, as it wouldn't have a solr-home/conf/ velocity direc

Re: copyField question

2009-12-09 Thread Shalin Shekhar Mangar
On Wed, Dec 9, 2009 at 11:43 PM, P Franks wrote: > All, > > Can one use the copyField option and copy a TextField field into a > longField field? I have some data that i want to extract (filter) out all > but the long and/or integer values. > > No, that won't work. It'd be best to use a TokenFi

Re: Concurrent access to EmbeddedSolrServer

2009-12-09 Thread Shalin Shekhar Mangar
On Wed, Dec 9, 2009 at 11:17 PM, Jon Poulton wrote: > Hi there, > I'm about to start implementing some code which will access a Solr instance > via a ThreadPool concurrently. I've been looking at the solrj API docs ( > particularly > http://lucene.apache.org/solr/api/index.html?org/apache/solr/cl