Re: help with using ngram analyser needed

2008-02-22 Thread Christian Wittern
Otis Gospodnetic wrote: Great, this works and should give me a start for further experiments. Thanks a lot! Christian

Re: Newbie question about search

2008-02-22 Thread Reece
Sounds like the docs aren't committed maybe? Go to /solr/admin/stats.jsp and look for: docsPending : X Where X is the number of docs that aren't committed yet. -Reece On Fri, Feb 22, 2008 at 3:07 PM, x8nnn <[EMAIL PROTECTED]> wrote: > > I tried to verify the readerdir. Which is fine. > Ins

Re: Newbie question about search

2008-02-22 Thread x8nnn
I tried to verify the readerdir. Which is fine. Inside index dir I can even see a file created _7.fdt which has all the content of text. Now I am surprised why I am not getting it in search? Santos x8nnn wrote: > > Recently I installed Solr. > > I made changes to schema.xml, added following ent

Re: Indexing content, storing html

2008-02-22 Thread Paul deGrandis
Thanks, this is perfect for what I'm trying to do. Paul On 2/22/08, Reece <[EMAIL PROTECTED]> wrote: > Well I don't remember the specific name of it, I just wrote that > because it sounded close :) > > There is a list of them here though: > http://wiki.apache.org/solr/AnalyzersTokenizersTokenF

Re: Indexing content, storing html

2008-02-22 Thread Reece
Well I don't remember the specific name of it, I just wrote that because it sounded close :) There is a list of them here though: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters -Reece On Fri, Feb 22, 2008 at 2:10 PM, Paul deGrandis <[EMAIL PROTECTED]> wrote: > Thanks! > > Does So

Re: Indexing content, storing html

2008-02-22 Thread Paul deGrandis
Thanks! Does Solr include an HTMLTokenFilterFactory? Paul On 2/22/08, Reece <[EMAIL PROTECTED]> wrote: > I did this as well, but found problems when searching (tags in between > words caused searching nightmares). I recommend stripping out all the > tags using the HTMLTokenFilterFactory or yo

Re: Indexing content, storing html

2008-02-22 Thread Reece
I did this as well, but found problems when searching (tags in between words caused searching nightmares). I recommend stripping out all the tags using the HTMLTokenFilterFactory or your own regex when indexing, and storing the actual HTML in an actual database. If you really want to store the HT

Newbie question about search

2008-02-22 Thread x8nnn
Recently I installed Solr. I made changes to schema.xml, added following entries Now I post a document like this: 0A0A1BC3:01183F59ADDC:CBFA:008AEED0 Interoperability Demonstration Project Report 110 page of text... Once I post it I see following entry in my catal

Re: solrj or any other solr java client

2008-02-22 Thread Otis Gospodnetic
Grab a nightly build, it should be in there. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Paul Treszczotko <[EMAIL PROTECTED]> > To: "solr-user@lucene.apache.org" > Sent: Friday, February 22, 2008 1:32:37 PM > Subject: solrj or any other s

solrj or any other solr java client

2008-02-22 Thread Paul Treszczotko
Hi all, Where can I find the latest and the greatest copy of SOLRJ or any other http java client for solr? pt ???u0? Paul Treszczotko Architect, Client Systems INPUT 11720 Plaza America Drive, Suite 1200 Reston, Virginia 20190 Direct: 703-707-3524; Fax 703-707-6201 Th

Re: help with using ngram analyser needed

2008-02-22 Thread Otis Gospodnetic
Hi, Append &debugQuery=true to your request URLs to see what's going on. Here is something I've used in the past. I suggest you throw out everything but n-grams while you're debugging. ... ... Otis -- Sematext -- http://sematext.c

Re: YAML update request handler

2008-02-22 Thread Noble Paul നോബിള്‍ नोब्ळ्
Without breaking the existing stuff we can add another interface BinaryQueryResponse extends QueryResponseWriter{ public void write(OutputStream out, SolrQueryRequest request, SolrQueryResponse response) throws IOException; } and in the SolrDispatchFilter do something like this QueryResponseWrite

Re: Filter query cache issues

2008-02-22 Thread Yonik Seeley
On Fri, Feb 22, 2008 at 12:22 PM, Matt M. <[EMAIL PROTECTED]> wrote: > I'm working with an index that contains 4,447,390 documents. The response > time for querying using facets is pretty darn slow. I'm fairly new to more > advanced Solr usage and today have started looking into the solrconfig.x

Filter query cache issues

2008-02-22 Thread Matt M.
Hi, I'm working with an index that contains 4,447,390 documents. The response time for querying using facets is pretty darn slow. I'm fairly new to more advanced Solr usage and today have started looking into the solrconfig.xml. In the solr admin app, I noticed that the filterCache evictions were

Indexing content, storing html

2008-02-22 Thread Paul deGrandis
Hi all, I'm working on a solr app that pulls HTML from an embedded JavaScript WYSIWYG editor, and I need to index on the content, but store and reproduce the HTML. The problem I have is when I try to add and commit, the HTML gets interpreted as XML. Is the way to do this properly to create an HT

Re: Error messages in log, but everything seems fine

2008-02-22 Thread Yonik Seeley
On Fri, Feb 22, 2008 at 7:38 AM, amamare <[EMAIL PROTECTED]> wrote: > Hi, > Solr apparently writes loads of error messages with every update, commit, > search etc. Everything seems to be fine, searching and indexing is correct > and fast, but we are concerned it might affect other parts of the

RE: multiple "things" in a document

2008-02-22 Thread Will Johnson
Usually you do something like: (assuming this is in a rdbms) SELECT sku.id as skuid, sku.name as skuname, item.name as itemname, location.name as locationname FROM sku, item, location WHERE sku.item = item.id AND sku.location = location.id The you can search on any part of the 'flat' record and

Re: YAML update request handler

2008-02-22 Thread Grant Ingersoll
The DispatchFilter could probably be modified to have the option of using the ServletOutputStream instead of the Writer. It would take some doing to maintain the proper compatibility, but it can be done, I think. Maybe we could have a /binary path or something along those lines and SolrJ

multiple "things" in a document

2008-02-22 Thread Geoffrey Young
hi all :) I'm just getting up to speed with solr (and lucene, for that matter) for a new project. after reading through the available docs I'm not finding an answer to my most basic (newbie, certainly) question. please feel free to just point me to the proper doc :) this isn't my actual us

Re: YAML update request handler

2008-02-22 Thread Noble Paul നോബിള്‍ नोब्ळ्
The API forbids use of any non-text format. The QueryResponseWriter's write() method can take only a Writer. So we cannot write any binary stream into that. --Noble On Fri, Feb 22, 2008 at 12:30 AM, Walter Underwood <[EMAIL PROTECTED]> wrote: > Python marshal format is worth a try. It is binary

Error messages in log, but everything seems fine

2008-02-22 Thread amamare
Hi, Solr apparently writes loads of error messages with every update, commit, search etc. Everything seems to be fine, searching and indexing is correct and fast, but we are concerned it might affect other parts of the system if they are in fact symptoms of errors internal to Solr. It seems that o

Re: solr to work for my web application

2008-02-22 Thread Thorsten Scherler
On Fri, 2008-02-22 at 04:11 -0800, newBea wrote: > Hi Thorsten, > > Many thanks for ur replies so far...finally i set up correct environment for > Solr. Its working:clap: :) Congrats, glad you got it running. > > Solr Rocks! Indeed. :) salu2 > > Thorsten Scherler wrote: > > > > On Thu, 20

Re: solr to work for my web application

2008-02-22 Thread newBea
Hi Thorsten, Many thanks for ur replies so far...finally i set up correct environment for Solr. Its working:clap: Solr Rocks! Thorsten Scherler wrote: > > On Thu, 2008-02-14 at 23:16 -0800, newBea wrote: >> Hi Thorsten... >> >> SOrry for giving u much trouble but I need some answer regarding

Re: YAML update request handler

2008-02-22 Thread Grant Ingersoll
See https://issues.apache.org/jira/browse/SOLR-476 On Feb 22, 2008, at 5:17 AM, Noble Paul നോബിള്‍ नोब्ळ् wrote: The SolrJ client is designed with the ResponseParser as an abstract class (which is good). But I have no means to plugin my custom ResponseParser class. Add a setter method . setR

Re: YAML update request handler

2008-02-22 Thread Noble Paul നോബിള്‍ नोब्ळ्
The SolrJ client is designed with the ResponseParser as an abstract class (which is good). But I have no means to plugin my custom ResponseParser class. Add a setter method . setResponseParser(ResponseParser parser) and have a lazy initialization of Responseparser . if(_processor == null) _process

help with using ngram analyser needed

2008-02-22 Thread Christian Wittern
Hi Solr users, This is my first posting to this list, after experimenting with Solr for a few days. Please bear with me. I am trying to set up a text field for searching CJK text. At the moment, I am trying using the ngram tokenizer factory, defined in the schema.xml as follows:

Re: custom handler results don't seem to match manually entered query string

2008-02-22 Thread evol__
Hoss thanks, hm it might be a problem with not (specifically..) using analyzers. But I always thought such code: Term term = new Term("text", str); TermQuery tq = new TermQuery(term); query.add(tq, Occur.SHOULD); would get query terms through analyzers - since they are specified under ...