TermVector term frequencies for tag cloud

2009-10-02 Thread aodhol
Hello, I'm trying to create a tag cloud from a term vector, but the array returned (using JSON wt) is quite complex and takes an inordinate amount of time to process. Is there a better way to retrieve terms and their document TF? The TermVectorComponent allows for retrieval of tf and df though I'm

Re: n-Gram, only works with queries of 2 letters

2009-09-30 Thread aodhol
Has this issue been fixed yet? can anyone shed some light on what's going on here please. NGramming is critical to my app. I will have to look to something other than Solr if it's not possible to do :(

NGramTokenFilter behaviour

2009-09-30 Thread aodhol
If I index the following text: "I live in Dublin Ireland where Guinness is brewed" Then search for: duvlin Should Solr return a match? In the admin interface under the analysis section, Solr highlights some NGram matches? When I enter the following query string into my browser address bar, I ge

Re: Highlighting in stemmed or n-grammed fields possible?

2009-09-28 Thread aodhol
But it would seem that Lucene has always supported highlighting on NGram fields? as show by the example here: https://issues.apache.org/jira/browse/LUCENE-1489 When I try to use highlighting with NGramming, none of the text is highlighted, and instead I get a long string in the highlighting field

Re: Highlighting in stemmed or n-grammed fields possible?

2009-09-28 Thread aodhol
Hi Koji et.al, You say https://issues.apache.org/jira/browse/SOLR-1268 is an open issue for the ngram highlighting problem, but it seems to refer to something unrelated. Can you/anyone confirm that it is not possible to use highlighting with an ngram tokenizer/filter.. Thanks, Aodh.