Re: Short DismaxRequestHandler Question

2010-05-07 Thread MitchK
Okay, let me be more specific: I got a custom StopWordFilter and a WordMarkingFilter. The WordMarkingFilter is an easy implementation to determine which type a word is. The StopWordFilter (my implementation) removes specific types of words *and* all markers from all words. This leads to a delet

Embedded Solr search query

2010-05-07 Thread Eric Grobler
Hello Solr community, When a user search on our web page, we need to run 3 related but different queries. For SEO reasons, we cannot use Ajax so at the moment we run 3 queries sequentially inside a PHP script. Allthough Solr is superfast, the extra network overhead can make the 3 queries 400ms sl

RE: Embedded Solr search query

2010-05-07 Thread caman
Why not write a custom request handler which can parse, split, execute and combine results to your queries? From: Eric Grobler [via Lucene] [mailto:ml-node+783150-1027691461-124...@n3.nabble.com] Sent: Friday, May 07, 2010 1:01 AM To: caman Subject: Embedded Solr search query Hello

Re: Embedded Solr search query

2010-05-07 Thread Svein Parnas
Or send the queries in parallell from the PHP script (use CURL). Svein 2010/5/7 caman : > > Why not write a custom request handler which can parse, split, execute and > combine results to your queries? > > > > > > > > From: Eric Grobler [via Lucene] > [mailto:ml-node+783150-1027691461-124...@n3.

Re: Embedded Solr search query

2010-05-07 Thread Eric Grobler
Hi Camen, I was hoping someone has done it already :-) I am also new to Solr/lucene, can you perhaps point me to a request handler example page? Thanks and Regards Eric On Fri, May 7, 2010 at 9:05 AM, caman wrote: > > Why not write a custom request handler which can parse, split, execute and >

Re: Embedded Solr search query

2010-05-07 Thread Eric Grobler
Hi Svein, Yes we thought of sending parallell queries, but you still have the extra network overhead. Regards Eric On Fri, May 7, 2010 at 9:11 AM, Svein Parnas wrote: > Or send the queries in parallell from the PHP script (use CURL). > > Svein > > > 2010/5/7 caman : > > > > Why not write a cust

Re: Short DismaxRequestHandler Question

2010-05-07 Thread MitchK
Btw: This thread helps a lot to understand the difference between qf and pf :-) http://lucene.472066.n3.nabble.com/Dismax-query-phrases-td489994.html#a489995 -- View this message in context: http://lucene.472066.n3.nabble.com/Short-DismaxRequestHandler-Question-tp775913p783379.html Sent from the

Long Lucene queries

2010-05-07 Thread Pooja Verlani
Hi all, In my web-app, i have to fire a query thats too long due to the various boosts I have to give. The size changes according to the query and many a times I get a blank page as I probably cross lucene's character limit. Is it possible to post it otherwise, to solr. Shall I be using POST inste

Re: Long Lucene queries

2010-05-07 Thread Erik Hatcher
On May 7, 2010, at 6:56 AM, Pooja Verlani wrote: In my web-app, i have to fire a query thats too long due to the various boosts I have to give. The size changes according to the query and many a times I get a blank page as I probably cross lucene's character limit. Is it possible to post it

schema.xml question

2010-05-07 Thread Antonello Mangone
Hello everyone, my question is Is it possible in schema.xml set a group of fields to use as a default field to query in "OR" or in "AND" ??? example:

RE: schema.xml question

2010-05-07 Thread Markus Jelsma
You could write your own requestHandler in solrconfig.xml, it'll allow you to predefine parameters for your configured search components.   -Original message- From: Antonello Mangone Sent: Fri 07-05-2010 15:17 To: solr-user@lucene.apache.org; Subject: schema.xml question Hello everyone,

Re: schema.xml question

2010-05-07 Thread Antonello Mangone
For the moment I don't know how to do it, but I'll follow your suggestion :) Thank you very much ... ps. I'm just a novel 2010/5/7 Markus Jelsma > You could write your own requestHandler in solrconfig.xml, it'll allow you > to predefine parameters for your configured search components. > > -

RE: How to load Core Properties after Core creation?

2010-05-07 Thread Ankit Bhatnagar
What properties are you adding ? Do you have the persistence = true Ankit -Original Message- From: Ying Huang [mailto:yhu...@capitaliq.com] Sent: Thursday, May 06, 2010 6:33 PM To: solr-user@lucene.apache.org Subject: How to load Core Properties after Core creation? Hi All, Does anyo

Help indexing PDF files

2010-05-07 Thread Leonardo Azize Martins
Hi, I am new in Solr. I would like to index some PDF files. How can I do using example schema from 1.4.0 version? Regards, Leo

RE: Re: schema.xml question

2010-05-07 Thread Markus Jelsma
A requestHandler works as an URL that can have predefined parameters. By default you will be querying the /select/ requestHandler. It, for instance, predefines the default number of rows to return (10) and returns all fields of a document (*).   explicit   But you can also define more

RE: Re: schema.xml question

2010-05-07 Thread Markus Jelsma
I forgot, there is actually a proper wiki page on this subject: http://wiki.apache.org/solr/SolrRequestHandler     -Original message- From: Antonello Mangone Sent: Fri 07-05-2010 15:26 To: solr-user@lucene.apache.org; Subject: Re: schema.xml question For the moment I don't know how t

RE: Help indexing PDF files

2010-05-07 Thread Markus Jelsma
Hi,     The wiki page [1] on this subject will get you started.   [1]: http://wiki.apache.org/solr/ExtractingRequestHandler     Cheers   -Original message- From: Leonardo Azize Martins Sent: Fri 07-05-2010 15:37 To: solr-user@lucene.apache.org; Subject: Help indexing PDF files Hi

Re: Help indexing PDF files

2010-05-07 Thread Leonardo Azize Martins
I am using this page, but in my downloaded version there is no site directory. Thanks 2010/5/7 Markus Jelsma > Hi, > > > > > > The wiki page [1] on this subject will get you started. > > > > [1]: http://wiki.apache.org/solr/ExtractingRequestHandler > > > > > > Cheers > > -Original message--

RE: Re: Help indexing PDF files

2010-05-07 Thread Markus Jelsma
You don't need it, you can use any PDF file.   -Original message- From: Leonardo Azize Martins Sent: Fri 07-05-2010 15:45 To: solr-user@lucene.apache.org; Subject: Re: Help indexing PDF files I am using this page, but in my downloaded version there is no site directory. Thanks 2010/5/7

Re: increase(change) relevancy

2010-05-07 Thread MitchK
Hi Ramzesua, take a look at the example of the function query that influences relvancy by the popular-field of the example-directory. http://wiki.apache.org/solr/FunctionQuery#Using_FunctionQuery Kind regards - Mitch -- View this message in context: http://lucene.472066.n3.nabble.com/increase

Re: Example of using "stream.file" to post a binary file to solr

2010-05-07 Thread Chris Hostetter
: Sorry. That is what I meant. But, I put it wrongly. I have not been : able to find examples of using solrj, for this. did you look at the link i included? : > To POST a raw stream using SolrJ you need to use the : > ContentStreamUpdateRequest... : > : > http://wiki.apache.org/solr/Extracting

Re: Example of using "stream.file" to post a binary file to solr

2010-05-07 Thread Sandhya Agarwal
Yes, I did. But, I don't find a solrj example there. The example in the doc uses curl. - Sent from iPhone On 07-May-2010, at 8:12 PM, "Chris Hostetter" wrote: > : Sorry. That is what I meant. But, I put it wrongly. I have not been > : able to find examples of using solrj, for this. > > did

RE: How to load Core Properties after Core creation?

2010-05-07 Thread Ying Huang
Thanks for your reply, Ahkit. I'm adding properties like "masterEnabled/slaveEnabled", "pollInterval", "autoCommitTime" and etc., so that I can easily configure these properties respectively for each Core and use them in solrconfig.xml. I'm also using persistent = true, and that's exactly the

Re: Example of using "stream.file" to post a binary file to solr

2010-05-07 Thread Praveen Agrawal
Sandhya, Chris's link (with anchor name) directly goes to solrj example On Fri, May 7, 2010 at 8:15 PM, Sandhya Agarwal wrote: > Yes, I did. But, I don't find a solrj example there. The example in > the doc uses curl. > > - Sent from iPhone > > On 07-May-2010, at 8:12 PM, "Chris Hostetter" > wr

RE: Help indexing PDF files

2010-05-07 Thread caman
Take a look at Tika library From: Leonardo Azize Martins [via Lucene] [mailto:ml-node+783677-325080270-124...@n3.nabble.com] Sent: Friday, May 07, 2010 6:37 AM To: caman Subject: Help indexing PDF files Hi, I am new in Solr. I would like to index some PDF files. How can I do using ex

RE: Embedded Solr search query

2010-05-07 Thread caman
I would just look at SOLR source code and see how standard search handler and dismaxSearchHandler are implemented. Look under package 'org.apache.solr. handler' From: Eric Grobler [vi

Re: Help indexing PDF files

2010-05-07 Thread Leonardo Azize Martins
I had Solr in machine A. In machine B I run the command below: curl "http://10.33.19.201:8983/solr/update/extract?&extractOnly=true"; --data-binary @VPSX_V1_R10.pdf and I get the response: java.lang.IllegalStateException: Form too large What I and doing wrong? Is it the right or best way to send

CommonsHttpSolrServer vs EmbeddedSolrServer

2010-05-07 Thread Blargy
Can someone please explain to me the use cases when one would use one over the other. All I got from the wiki was: (In reference to Embedded) "If you need to use solr in an embedded application, this is the recommended approach. It allows you to work with the same interface whether or not you hav

Re: Sanity check on numeric types and which of them to use

2010-05-07 Thread wojtekpia
> 3) The only reason to use a "sint" field is for backward compatibility > and/or to use sortMissingFirst/SortMissingLast, correct? > I'm using sint so I can facet and sort facets numerically. -- View this message in context: http://lucene.472066.n3.nabble.com/Sanity-check-on-numeric-types-

Re: Help indexing PDF files

2010-05-07 Thread Leonardo Azize Martins
Hi, Sorry, I am newbie. Using these two commands it works. curl " http://10.33.19.201:8983/solr/update/extract?stream.file=C:\\temp\\VPSX_V1_R10.pdf&stream.contentType=application/pdf&literal.id=M4968\\C$\\temp\\VPSX_V1_R10.pdf&commit=true " curl ' http://10.33.19.201:8983/solr/update/ext

Re: Can I use per field analyzers and dynamic fields?

2010-05-07 Thread Chris Hostetter
: : The "source" of my problems is the fact that I do not know in advance the : field names. Users are allowed to decide they own field names, they can, : at runtime, add new fields and different Lucene documents might have : different field names. I would suggest you abstract away the field name

Re: Commit takes 1 to 2 minutes, CPU usage affects other apps

2010-05-07 Thread Chris Hostetter
: The measurement was done outside our Solr client which sends the update : and then the commit to the handler. I also see the update-URL call in : the Tomcat Manager taking up that amount of time. so it's the full request time, and would be inclusive of any postCommit event handlers -- that's i

Re: Re: schema.xml question

2010-05-07 Thread Antonello Mangone
Thank you very much for your suggestions, I'll study immediatly ... 2010/5/7 Markus Jelsma > I forgot, there is actually a proper wiki page on this subject: > > http://wiki.apache.org/solr/SolrRequestHandler > > > > > > -Original message- > From: Antonello Mangone > Sent: Fri 07-05-201

Re: SEVERE: java.util.concurrent.RejectedExecutionException

2010-05-07 Thread Chris Hostetter
: I am working with creating my own custom dataimport handler evaluator class : and I keep running across this error when I am trying to delta-import. It : told me to post this exception to the mailing list so thats what I am doing : ;) : : [java] SEVERE: java.util.concurrent.RejectedExecutionExc

Re: schema.xml question

2010-05-07 Thread Chris Hostetter
: : ... group_name -Hoss

Re: schema.xml question

2010-05-07 Thread Antonello Mangone
It's seems like a copyField but is a group that I want ... and in your version is not a group, I want the possibility to search in a group of field using "AND" or "OR" 2010/5/7 Chris Hostetter > > : > : > > > > ... > > > > group_name > > > > > -Hoss > >

Re: Short DismaxRequestHandler Question

2010-05-07 Thread Chris Hostetter
: The StopWordFilter (my implementation) removes specific types of words *and* : all markers from all words. : : This leads to a deletion of some parts of sentences. Ah, yes i think you're running into the same confusion people have with dismax and stopwords -- there was a blog about this recen

Re: Highlighting Performance On Large Documents

2010-05-07 Thread Lance Norskog
Do you have these options turned on when you index the text field: termVectors/termPositions/termOffsets ? Highlighting needs the information created by these anlysis options. If they are not turned on, Solr has load the document text and run the analyzer again with these options on, uses that dat

Re: caching repeated OR'd terms

2010-05-07 Thread Lance Norskog
I would suggest benchmarking this before doing any more complex design. A field with only 10k unique integer or string values will search very very quickly. On Thu, May 6, 2010 at 7:54 AM, Nagelberg, Kallin wrote: > Hey everyone, > > I'm having some difficulty figuring out the best way to optimiz

Re: Custom DIH variables

2010-05-07 Thread Lance Norskog
Using a core via the Embedded front and the HTTP front end seems dangerous. SOLR-1499 does an HTTP call for the same info. https://issues.apache.org/jira/browse/SOLR-1499 On Thu, May 6, 2010 at 8:18 PM, Blargy wrote: > > So I came up with the following class. > > public class LatestTimestampEva

Re: Custom DIH variables

2010-05-07 Thread Blargy
Thanks for the tip Lance. Just for reference, why is it dangerous to use the HTTP method? I realized that the embedded method is probably not the way to go (obviously since I was getting that "SEVERE: java.util.concurrent.RejectedExecutionException") -- View this message in context: http://luc