Spellchecker in Solr

2006-12-06 Thread Otis Gospodnetic
Hi, A month ago, the topic of a spell checker in Solr came up (c.f. http://www.mail-archive.com/solr-user@lucene.apache.org/msg01254.html ). Has anyone made any progress with that? If not, I'll have to do this to scratch my own itch. Because I'm in a hurry with this, I think I will go with the

Re: Indexing XML files

2006-12-06 Thread Graham O'Regan
couldn't you use a cdata section? Chris Hostetter wrote: Since XML is the transport for sending data to Solr, you need to make sure all field values are XML escaped. If you wanted to index a plain text "title" and that tile contained an ampersand character Sense & Sensability ...y

Re: Indexing XML files

2006-12-06 Thread Yonik Seeley
On 12/6/06, Graham O'Regan <[EMAIL PROTECTED]> wrote: couldn't you use a cdata section? That's just another form of escaping. Mirko actually want's the XML field value to be part of the XML of Solr's response, not encapsulated by it. -Yonik

Multi-Valued Faceting

2006-12-06 Thread J.J. Larrea
Many thanks to Hoss, Yonik, et al. for their excellent efforts to bring faceted browsing to the masses! In most cases it works great! But for some fields I have a need which is unfortunately not filled by the current faceting code. These fields (Author Name, for example) have too many discre

Re: Multi-Valued Faceting

2006-12-06 Thread Yonik Seeley
On 12/6/06, J.J. Larrea <[EMAIL PROTECTED]> wrote: My thought was that the simplest approach would be to subclass FieldCacheImpl to introduce a getMultiStringIndex method derived from getStringIndex, defining and returning a MultiStringIndex class which stores order as int[][] rather than int[];

Changing the schema.xml

2006-12-06 Thread Andrew Nagy
What is necessary for the effects of changing the schema.xml to take effect for all of my records? I restarted tomcat, but it does not seem that my changes have taken effect. I wanted to change a full-text field from type:string to type:text to allow for better searching, but do no see any di

Re: Changing the schema.xml

2006-12-06 Thread David Halsted
I think you have to reindex; new additions to your index will follow the new schema but Solr doesn't reach in and reindex your old information automatically. Dave On 12/6/06, Andrew Nagy <[EMAIL PROTECTED]> wrote: What is necessary for the effects of changing the schema.xml to take effect for a

Re: Multi-Valued Faceting

2006-12-06 Thread Chris Hostetter
: > which stores order as int[][] rather than int[]; a variant of : > SimpleFacets.getFieldCacheCounts would simply need an inner loop to : > tally each of the Document's Term indexes for that field. : : I think something like that is the right approach, the only problem : being the size in memory

Re: Performance issue.

2006-12-06 Thread Gmail Account
I reindexed and optimized and it helped. However now each query averages about 1 second(down from 3-4 seconds). The bottleneck now is the getFacetTermEnumCounts function. If I take that call out it is a non measurable query time and the filtercache is being used. With the getFacetTermEnumCounts

Re: Performance issue.

2006-12-06 Thread Yonik Seeley
It is using the cache, but the number of items is larger than the size of the cache. If you want to continue to use the filter method then you need to increase the size of the filter cache to something larger than the number of unique values than what you are filtering on. I don't know if you wi

Re: Performance issue.

2006-12-06 Thread Gmail Account
It is currently a string type. Here is everything that has to do with manu in my schema... Should it have been multi-valued? Do you see anything wrong with this? multiValued="true"/> . Thanks... - Original Message - From: "Yonik Seeley" <[EMAIL PROTECTED]> To: Sent: Wedn

Re: Performance issue.

2006-12-06 Thread Yonik Seeley
Your snippet shows it as "text" not "string" Try faceting on manu_exact and you may get better results. -Yonil On 12/6/06, Gmail Account <[EMAIL PROTECTED]> wrote: It is currently a string type. Here is everything that has to do with manu in my schema... Should it have been multi-valued? Do you