unable to figure out nutch type highlighting in solr....

2007-10-02 Thread Ravish Bhagdev
I have tried very hard to follow documentation and forums that try to answer questions about how to return snippets with highlights for relevant searched term using Solr (as nutch does with such ease). I will be really grateful if someone can guide me with basics, i have made sure that the field t

Re: Searching combined English-Japanese index

2007-10-02 Thread Maximilian Hütter
Yonik Seeley schrieb: > On 10/1/07, Maximilian Hütter <[EMAIL PROTECTED]> wrote: >> Yonik Seeley schrieb: >>> On 10/1/07, Maximilian Hütter <[EMAIL PROTECTED]> wrote: When I search using an English term, I get results but the Japanese is not encoded correctly in the response. (although it

Re: Index multiple languages with multiple analyzers with the same field

2007-10-02 Thread Daniel Alheiros
Same Here. But I can't see how to fit into this UNLESS you are going to create an analyzer to handle a language parameter and based on it would be able to apply a set of filters (and sometimes you want a different - but compatible - set of filters in indexing/query time). It would work, but doing

Re: Index multiple languages with multiple analyzers with the same field

2007-10-02 Thread Ryan McKinley
But I can't see how to fit into this UNLESS you are going to create an analyzer to handle a language parameter and based on it would be able to apply a set of filters (and sometimes you want a different - but compatible - set of filters in indexing/query time). I don't think this is what you a

Re: Problem with html code inside xml

2007-10-02 Thread [EMAIL PROTECTED]
Thanks I use this solution: put in the xml to be indexed and it works, nothing to change in the xsl. In the schema I use this fieldType positionIncrementGap="100"> generateWordParts="1" generateNumberParts="1" catenateWords="1" catenateNumbers="1" ca

Re: Letter-number transitions - can this be turned off

2007-10-02 Thread F Knudson
Thanks for your helpful suggestions. I have considered other analyzers but WDF has great strengths. I will experiment with maintaining transitions and then consider modifying the code. F. Knudson Mike Klaas wrote: > > On 30-Sep-07, at 12:47 PM, F Knudson wrote: > >> >> Is there a flag to di

RE: Searching combined English-Japanese index

2007-10-02 Thread Lance Norskog
Python does not do Unicode strings natively, you have to do them explicitly. It is possible that your python receiver is not doing the right thing with the incoming strings. Also, Jetty has problems with UTF-8; the Wiki has more on this. Lance -Original Message- From: Maximilian Hütter

Re: Searching combined English-Japanese index

2007-10-02 Thread Yonik Seeley
On 10/2/07, Maximilian Hütter <[EMAIL PROTECTED]> wrote: > Are you sure, they are wrong in the index? It's not an issue with Jetty output encoding since the python writer takes the string and converts it to ascii before that. Since Solr does no charset encoding itself on output, that must mean th

schema for response

2007-10-02 Thread Yu-Hui Jin
Hi, there, Given that there's some questions on the updated XML schema for the response in Solr 1.2. Can someone points me to the XML schema? Is it documented somewhere? I'm particularly interested in the different status code we would have in the response for either update or select. -- Rega

Re: schema for response

2007-10-02 Thread Ryan McKinley
Yu-Hui Jin wrote: Hi, there, Given that there's some questions on the updated XML schema for the response in Solr 1.2. Can someone points me to the XML schema? Is it documented somewhere? I'm particularly interested in the different status code we would have in the response for either update o

Re: dataset parameters suitable for lucene application

2007-10-02 Thread Chris Harris
Hi There, Would you mind if I pasted your data onto the wiki page at http://wiki.apache.org/solr/SolrPerformanceData I think it would be helpful to get some more numbers on that page, so people can help decide if Solr is the right application for them. Thanks, Chris Harris, new Solr user On 9/

Solr live at Netflix

2007-10-02 Thread Walter Underwood
Here at Netflix, we switched over our site search to Solr two weeks ago. We've seen zero problems with the server. We average 1.2 million queries/day on a 250K item index. We're running four Solr servers with simple round-robin HTTP load-sharing. This is all on 1.1. I've been too busy tuning to up

Re: Re: Problem with html code inside xml

2007-10-02 Thread ycrux
Hi ! I'm facing a similar problem. Some HTML docs are correctly indexed and others are simply rejected even I encoded all problematic HTML tags as Thorsten suggested. In the following example, "my_doc.xml" is a valid "XML" file, compliant with my Solr's schema fields : $ java -jar post.jar ./

Re: Problem with html code inside xml

2007-10-02 Thread Chris Hostetter
: I created a field type: : : ... : Everything works (the div tags, p tags are removed) but some : nnn or tags are style in the text after indexing. i cut/paste that fieldtype into the example schema.xml, and experimented with the analysis tool (http://localhost:8983/solr/admin/ana

Re: Re: Problem with html code inside xml

2007-10-02 Thread Chris Hostetter
: SimplePostTool: FATAL: Connection error (is Solr running at http://localhost:8983/solr/update ?): java.io.IOException: Server returned HTTP response code: 500 for URL: http://localhost:8983/solr/update : : Is there any way to let "Solr" to be more verbose than that ? Solr outputs all errors

Re: Solr live at Netflix

2007-10-02 Thread Chris Hostetter
: Here at Netflix, we switched over our site search to Solr two weeks ago. That's great Walter ... could I persuade you to add a few notes about this to... http://wiki.apache.org/solr/PublicServers http://wiki.apache.org/solr/SolrPerformanceData -Hoss

Re: Solr live at Netflix

2007-10-02 Thread Walter Underwood
I think Chris Harris is doing that. I'll check it and touch it up afterwards. Avoid race conditions. --wunder On 10/2/07 4:26 PM, "Chris Hostetter" <[EMAIL PROTECTED]> wrote: > > : Here at Netflix, we switched over our site search to Solr two weeks ago. > > That's great Walter ... could I pers

Re: Solr live at Netflix

2007-10-02 Thread Tom Hill
Nice! And there seem to be some improvements. For example, "Gamers" and "Gamera" no longer stem to the same word :-) Tom On 10/2/07, Walter Underwood <[EMAIL PROTECTED]> wrote: > > Here at Netflix, we switched over our site search to Solr two weeks ago. > We've seen zero problems with the server

question about bi-gram analysis on query

2007-10-02 Thread Keene, David
Hey guys, I'm trying to index a field in Chinese using the CJKTokenizer, and I'm finding that my searches on the index are not working at all. The index is created properly (looking with Luke), and when I search against it with Luke the data comes back as I would expect. Also, when I use the

Re: Solr live at Netflix

2007-10-02 Thread Norberto Meijome
On Tue, 02 Oct 2007 15:26:33 -0700 Walter Underwood <[EMAIL PROTECTED]> wrote: > Here at Netflix, we switched over our site search to Solr two weeks ago. > We've seen zero problems with the server. We average 1.2 million > queries/day on a 250K item index. We're running four Solr servers > with si

Re: schema for response

2007-10-02 Thread Yu-Hui Jin
Got it. Thanks, Ryan. -Hui On 10/2/07, Ryan McKinley <[EMAIL PROTECTED]> wrote: > > Yu-Hui Jin wrote: > > Hi, there, > > > > Given that there's some questions on the updated XML schema for the > response > > in Solr 1.2. Can someone points me to the XML schema? Is it documented > > somewhere? >

Re: searching remote indexes

2007-10-02 Thread Venkatraman S
Well, we do not have a Solr server and all the calls to index and search documents is done via Embedded Solr. What is the approach then? On 9/28/07, Mike Klaas <[EMAIL PROTECTED]> wrote: > > Solr's main interface is http, so you can connect to that remotely. > Query each machine and combine the re

Re: searching remote indexes

2007-10-02 Thread Ryan McKinley
Using embedded solr, there is no (built in) way to access remote indexes. If you want to access remote indexes you need to run a server. Solr 1.3 (trunk) includes a java client you may want to look at: http://wiki.apache.org/solr/Solrj If you poke around, this also includes simple ways to run