On Jun 23, 2007, at 11:24 PM, Jack L wrote:
I have some documents, each of which has a number of tags. I'd like to
have a query that returns "similar" documents which share the largest
number of tags with a given document. For example, if I have a
doc that has 4 tags, I'd like to return docs that also
have th
So then you write a tokenizer that creates a token stream consisting of both
uni-grams (e.g. C1, C2) and bi-grams (e.g. C1C2, C2C3), and you get both. I
already pointed to the n-gram tokenizers I wrote a while back and put under
Lucene's contrib/analyzers/...
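
To make the idea concrete, here is a minimal sketch of a token list that contains both the uni-grams and the bi-grams of an input sequence. It is plain Python, not the Lucene contrib tokenizers mentioned above, and the function name unigrams_and_bigrams is a hypothetical one chosen for illustration. A real Lucene token stream would also carry positions and offsets, which this sketch leaves out.

```python
def unigrams_and_bigrams(chars):
    """Return every single element (uni-gram) followed by every
    adjacent pair joined together (bi-gram).

    Hypothetical helper for illustration; not a Lucene API.
    """
    chars = list(chars)
    # Pair each element with its right-hand neighbour: (C1, C2), (C2, C3), ...
    bigrams = [a + b for a, b in zip(chars, chars[1:])]
    return chars + bigrams


tokens = unigrams_and_bigrams(["C1", "C2", "C3"])
print(tokens)  # ['C1', 'C2', 'C3', 'C1C2', 'C2C3']
```

Indexing both kinds of token lets a single-character query match through the uni-grams, while a two-character query can match the more specific bi-gram.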
Otis
--
Lucene Consulting -- http:
: I have some documents, each of which has a number of tags. I'd like to
: have a query that returns "similar" documents which share the largest
: number of tags with a given document. For example, if I have a
: doc that has 4 tags, I'd like to return docs that also
: have these 4 tags. And if this doesn't make
Which files in the source should I change?
And once I have changed them,
how do I compile? Just ant dist?
--
regards
jl
Hi James
I think you would be better off outputting a PHP array and running
eval() over it; the PHP serialize format is quite complicated.
On that note, you might be interested in:
http://issues.apache.org/jira/browse/SOLR-196
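
As a rough illustration of "outputting a PHP array", the sketch below renders a nested structure as PHP array() source text that a PHP client could pass to eval(). It is written in Python only to show the shape of the output; it is not Solr code and not the patch in SOLR-196. The helper name to_php and the sample field names (numFound, docs, id) are assumptions made for the example, and floats are assumed to be finite.

```python
def to_php(value):
    """Render a Python value as PHP source text (hypothetical helper)."""
    if value is None:
        return "null"
    # bool must be checked before int, because bool is a subclass of int.
    if isinstance(value, bool):
        return "true" if value else "false"
    if isinstance(value, (int, float)):
        return repr(value)
    if isinstance(value, str):
        # In a single-quoted PHP string only \ and ' need escaping.
        escaped = value.replace("\\", "\\\\").replace("'", "\\'")
        return "'" + escaped + "'"
    if isinstance(value, dict):
        items = ", ".join(
            to_php(str(k)) + " => " + to_php(v) for k, v in value.items()
        )
        return "array(" + items + ")"
    if isinstance(value, (list, tuple)):
        return "array(" + ", ".join(to_php(v) for v in value) + ")"
    raise TypeError("unsupported type: " + type(value).__name__)


response = {"numFound": 2, "docs": [{"id": "a1"}, {"id": "b2"}]}
print(to_php(response))
# array('numFound' => 2, 'docs' => array(array('id' => 'a1'), array('id' => 'b2')))
```

On the PHP side the text would be evaluated with something like eval('$result = ' . $body . ';'). Since eval() executes whatever it is given, this only makes sense when the response comes from a server you trust.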
-Nick
On 6/25/07, James liu <[EMAIL PROTECTED]> wrote:
which files
Aha, that looks good. How can I apply it to my Solr? I don't know what to do
with it.
2007/6/25, Nick Jenkin <[EMAIL PROTECTED]>:
Hi James
I think you would be better off outputting a PHP array and running
eval() over it; the PHP serialize format is quite complicated.
On that note, you might be int
I mean, how do I add it to my Solr (1.2, production)?
2007/6/25, James liu <[EMAIL PROTECTED]>:
Aha, that looks good. How can I apply it to my Solr? I don't know what to
do with it.
2007/6/25, Nick Jenkin <[EMAIL PROTECTED]>:
>
> Hi James
> I think you would be better off outputting a PHP array, and runn