Re: Solr 1.4: StringIndexOutOfBoundsException in SpellCheckComponent with HTMLStripCharFilterFactory

2009-12-07 Thread Robin Wojciki
Logged a ticket for Solr: https://issues.apache.org/jira/browse/SOLR-1630 Thanks, Robin On Mon, Dec 7, 2009 at 9:36 PM, Robin Wojciki wrote: > Koji, > > In the sample I sent, the exception comes only if the > HTMLStripCharFilter is there. > > However, your test case seems to c

Re: Solr 1.4: StringIndexOutOfBoundsException in SpellCheckComponent with HTMLStripCharFilterFactory

2009-12-07 Thread Robin Wojciki
wrote: > Robin Wojciki wrote: >> >> Koji, I was able to create a minimal replication. >> >> Attached zip has solr.xml, solrconf.xml and Main.java. I was able to >> replicate the issue by replacing the conf files in >> apache-solr-1.4.0/example/solr/conf and

Re: HTML Stripping slower in Solr 1.4?

2009-12-04 Thread Robin Wojciki
Thanks Koji for logging the ticket. I noticed its priority is set to minor. Is there any work around? I feel like I am being half as productive as every iteration is taking twice as much time. Thanks Robin On Tue, Dec 1, 2009 at 11:47 AM, Koji Sekiguchi wrote: > Robin, > > Thank you for reportin

Solr 1.4: StringIndexOutOfBoundsException in SpellCheckComponent with HTMLStripCharFilterFactory

2009-12-04 Thread Robin Wojciki
I am running a search in Solr 1.4 and I am getting the StringIndexOutOfBoundsException pasted below. The spell check field uses HTMLStripCharFilterFactory. However, the search works fine if I do not use the HTMLStripCharFilterFactory. If I set a breakpoint at SpellCheckComponent.java: 248, the val

HTML Stripping slower in Solr 1.4?

2009-11-30 Thread Robin Wojciki
Hello, Our schema in Sol 1.3 looked like: It takes 30s to index 1500 docs. When we run the same in Sol 1.4 it take 70s. I noticed that HTMLStripStandardTokenizerFactory was deprecated. So changed the schema to: It still takes 70s. Instead, if I use the schema: It takes 30s in both 1.3