Re: Indexing of non-english text with Solr, any known limitations?

2006-04-12 Thread Chris Hostetter
: > Yes, with a little bit of work (making a Solr Filter Factory or : > Tokenizer factory) you can use any Lucene filter, tokenizer, or : > analyzer. : : ok. If my project actually happens I'll do my best to contribute such : changes if they make sense to Solr. Just to clarify: there's no coding

Re: Deleting documents

2006-04-12 Thread Chris Hostetter
: OK, I think I fixed this bug. Haven't added a test case yet... : I'll get to that sometime unless someone beats me to it. : It does sort of require a separate schema to test though (which our : test harness doesn't really handle yet, I think) actually the test harness *can* handle different sc

Re: Deleting documents

2006-04-12 Thread Yonik Seeley
OK, I think I fixed this bug. Haven't added a test case yet... I'll get to that sometime unless someone beats me to it. It does sort of require a separate schema to test though (which our test harness doesn't really handle yet, I think) -Yonik On 4/12/06, Yonik Seeley <[EMAIL PROTECTED]> wrote:

Re: Deleting documents

2006-04-12 Thread Yonik Seeley
On 4/12/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: > > : Could it be the problem is that my unique key field > : is of type slong (as defined in the tutorial)? > > I just tried modifying the example schema to use type slong for the "id" > field, and i can reproduce what you are describing -- nu

Re: Deleting documents

2006-04-12 Thread Chris Hostetter
: Could it be the problem is that my unique key field : is of type slong (as defined in the tutorial)? I just tried modifying the example schema to use type slong for the "id" field, and i can reproduce what you are describing -- numDocs doesn't go down, ... because the delete isn't working. I'm

RE: Interest in Extending SOLR

2006-04-12 Thread Bryzek.Michael
Yonik - > So the number of filters is equal to the number of sites? > How many sites are there? Today: When new customers join, we generally don't do anything special. Currently we have roughly 400 customers, most of which have one site each. Note that a few customers have as many as 50 sites.

Re: Interest in Extending SOLR

2006-04-12 Thread Yonik Seeley
Welcome Michael, On 4/12/06, Bryzek.Michael <[EMAIL PROTECTED]> wrote: > * Integrated support for partitioning - database tables can be > partitioned for scalability reasons. The most common scenario for > us is to partition off data for our largest customers. For > example, imagine

Re: Deleting documents

2006-04-12 Thread Yonik Seeley
On 4/12/06, Marcus Stratmann <[EMAIL PROTECTED]> wrote: > > Yes, I believe the Wiki has an example like this (a uniqueKey field > > not named "id") > Right, I should have looked there, too. > > > > But after a I found the number of documents unchanged > > > in the stats. If numDocs didn't change,

Re: Indexing of non-english text with Solr, any known limitations?

2006-04-12 Thread Bertrand Delacretaz
Hi Yonik, Thanks very much for your replies! On 12 Apr 06 at 16:45, Yonik Seeley wrote: On 4/12/06, Bertrand Delacretaz <[EMAIL PROTECTED]> wrote: ...The project that I'm looking at is currently single-language (French), which I assume can be handled by static configuration of the appropr

Re: Indexing of non-english text with Solr, any known limitations?

2006-04-12 Thread Yonik Seeley
On 4/12/06, Bertrand Delacretaz <[EMAIL PROTECTED]> wrote: > Hi Solr users, > > I'm investigating indexers for a project, played a bit with both Solr > and Nutch recently, and the Solr "RESTful indexing component" concept > fits our needs quite well. > > Before I dig too deep, are there any known l

Interest in Extending SOLR

2006-04-12 Thread Bryzek.Michael
All - My apologies in advance for a rather long email message, especially for a first-time poster to this list. I'm looking at using SOLR to replace our custom HTTP/XML infrastructure for Lucene that we built to tightly integrate with our web apps running in an Oracle, non-Java environment. Eval

Re: Deleting documents

2006-04-12 Thread Marcus Stratmann
> Yes, I believe the Wiki has an example like this (a uniqueKey field > not named "id") Right, I should have looked there, too. > > But after a I found the number of documents unchanged > > in the stats. > What stat? maxDoc may be unchanged since it doesn't reflect deleted > documents that haven

Indexing of non-english text with Solr, any known limitations?

2006-04-12 Thread Bertrand Delacretaz
Hi Solr users, I'm investigating indexers for a project, played a bit with both Solr and Nutch recently, and the Solr "RESTful indexing component" concept fits our needs quite well. Before I dig too deep, are there any known limitations w.r.t. indexing of non-English text? I know Lucene