Outofmemory error for large files

2009-02-15 Thread Gargate, Siddharth
I am trying to index around 150 MB text file with 1024 MB max heap. But I get Outofmemory error in the SolrJ code. Exception in thread "main" java.lang.OutOfMemoryError: Java heap space at java.util.Arrays.copyOf(Arrays.java:2882) at java.lang.AbstractStringBuilder.expandCapacity

Multilanguage

2009-02-15 Thread revathy arun
Hi, I have a scenario where ,i need to convert pdf content to text and then index the same at run time .I do not know as to what language the pdf would be ,in this case which is the best soln i have with respect the content field type in the schema where the text content would be indexed to? Th

Release of solr 1.4 & autosuggest

2009-02-15 Thread Pooja Verlani
Hi All, I am interested in TermComponent addition in solr 1.4 ( http://wiki.apache.org/solr/TermsComponent). When should we expect solr 1.4 to be available for use? Also, can this Termcomponent be made available as a plugin for solr 1.3? Kindly reply if you have any idea. Regards, Pooja

debug distributed performance

2009-02-15 Thread Ian Connor
Is there any debug settings to see where the time is taken during a distributed search? I suspect some of the time is spent in network overhead between the shards consolidating the results but I don't have a good way to pin this down. Sometimes, the results come back very quickly - so I know it is

Word Locations & Search Components

2009-02-15 Thread Johnny X
Hi there, I was told before that I'd need to create a custom search component to do what I want to do, but I'm thinking it might actually be a custom analyzer. Basically, I'm indexing e-mail in XML in Solr and searching the 'content' field which is parsed as 'text'. I want to ignore certain el

suggestion queries

2009-02-15 Thread Yves Hougardy
Hi, What's the best way to set up a suggestion box with solr ? I mean, if i type one letter, it would resquest for all the "categories" beginning with that letter, and so on as the user adds letters. thanks -- Yves Hougardyhttp://www.clever-age.com Clever Age - conseil en architecture

Re: facet count on partial results

2009-02-15 Thread Yonik Seeley
On Sat, Feb 14, 2009 at 6:45 AM, karl wettin wrote: > Also, as my threadshold is based on the distance in score between the > first result it sounds like using a result start position greater than > 0 is something I have to look out for. Or? Hmmm - this isn't that easy in general as it requires k

Re: spellcheck.onlyMorePopular

2009-02-15 Thread Shalin Shekhar Mangar
On Sun, Feb 15, 2009 at 10:00 PM, Mark Miller wrote: > But if we make it past that line (onlyMorePopular=true), later there is: > > // don't suggest a word for itself, that would be silly > if (sugWord.string.equals(word)) { > continue; > } > > So you end up only getting all of

Re: spellcheck.onlyMorePopular

2009-02-15 Thread Mark Miller
Shalin Shekhar Mangar wrote: On Sun, Feb 15, 2009 at 8:56 AM, Mark Miller wrote: I think thats the problem with it. People do think of it this way, and it ends up being very confusing. If you dont use onlyMorePopular, and you ask for suggestions for a word that happens to be in the index,

Re: spellcheck.onlyMorePopular

2009-02-15 Thread Shalin Shekhar Mangar
On Sun, Feb 15, 2009 at 8:56 AM, Mark Miller wrote: > I think thats the problem with it. People do think of it this way, and it > ends up being very confusing. > > If you dont use onlyMorePopular, and you ask for suggestions for a word > that happens to be in the index, you get the word back. > >